Insurance Co.
"800% ROI achieved based on hardware savings, elimination of
consulting costs, and avoidance of SLA compliance penalties."
- DBA Manager

Insurance Company Problems:
- Finger pointing existed between DBAs and Developers as to who was responsible for performance.
- Many "Critical Situation" fire drills took many hours to resolve.
- Undocumented code problems with outside vendors made resolution impossible.
- DBA team felt the were "flying blind."
Confio Ignite Solution:
- Ignite deployed across 70 servers, up to 500 CPUs, to monitor Oracle across enterprise infrastructure.
- Ignite was also made available to developers creating Oracle based applications.
- Deployment of Ignite was configured for the majority of servers providing universal visibility.
Confio Ignite Benefits:
- Identification of additional index reduced I/O wait times by 60%.
- Reduced hardware demands by 25% and eliminated need for middle-of-night intervention by DBA.
- Eliminated $700K server capacity upgrade for data warehousing application.
Customer Description
Customer is a U.S.
based, Fortune 100 financial services organization with diversified
operations in property, casualty and life insurance markets and
$150B in assets. Customer operates over 50 servers ranging from 2
to 24 CPU each running Oracle RDBMS. Databases support a wide range
of applications including financial and accounting operations,
billing, eCommerce, customer relations, call center, and project
management. Applications are both internally developed as well as
licensed from major commercial vendors and customized to meet
specific customer requirements. Both high volume transaction
processing and data warehouse applications are supported.
Customer maintains a dedicated DBA staff responsible for operating Oracle RDBMS systems and providing high level of service to application owners in functional and business areas. DBA team is small and skilled. Size and volume of applications has grown rapidly in recent years but DBA staff has not grown proportionally. Application
development has also accelerated with more customer-facing applications being introduced and demands for shorter development times and tighter service level guarantees for both internal and externally visible applications. Result is a larger number of developers with varying levels of Oracle experience creating applications dependent on the Oracle environment.
Customer Challenge
Faced with frequent and varied performance problems and response time variations the DBA team was forced to add hardware and Oracle CPU licenses as a standard procedure to maintain acceptable performance levels. As an example, server size for systems ranging in cost from less than $25,000 to more than $700,000 was routinely
doubled in attempts to overcome perceived "horsepower" shortages in the Oracle infrastructure. Consultants were retained to provide application expertise costing $5K-10K per month for each project on a continuing basis.
Most critically, the DBA team felt they were "flying blind." They perceived that despite their knowledge and skills in Oracle administration and tuning, they did not have visibility into the environment they were operating thus could not effectively deliver service to meet the growing demands of their internal customers.
The Solution
After an extensive evaluation, customer chose Confio Ignite™ for Oracle as a primary performance monitoring and optimization tool. Ignite was introduced for use by all members of the DBA team ranging in Oracle skill-level from mid-level to expert. It was also made accessible to developers creating Oracle based applications. Ignite
monitoring was configured for the majority of Oracle servers in the organization providing universal visibility across the Oracle installation.
Results
Customer has been able to demonstrate a performance improvement of over 50% through use of Ignite. As a result, DBA team has been able to significantly reduce acquisition of new hardware and software licenses and has been able to scale back consulting expenditures tied to new project deployment. IT organization was also
able to demonstrate compliance with internal Service Level Agreements (SLA) with business unit application owners avoiding cost penalties. Overall, customer has been able to demonstrate a ROI of 800% through use of Ignite for both ongoing optimization and resolution of acute performance problems.
Examples of Problem Resolution
- Identification of additional index reduced I/O wait times by 60%. Ignite identified multiple instances of a single query causing system wide I/O delays over an extended period. Application owner validated that the query was running as specified, so the DBA inserted a new index tied to this SQL and eliminated over 60% of multi-block I/O on the system.
- Reduced hardware demands by 25% and eliminated need for middle-of-night intervention by DBA. Monitoring of overnight processes identified spikes in resource usage and wait times with both CPU and I/O hitting maximum occurring regularly. DBA was able to identify changes in index structure and recommend query modifications to development. Result is an elimination of 13 hours of wait time for the nightly processing. This eliminated the need for a planned addition of 2 CPUs and Oracle licenses to a 6 CPU server. Importantly, the ability to monitor overnight processes and to identify specific behavior from captured data allowed the responsible DBA to avoid remaining at work for overnight system monitoring tasks.
- Eliminated $700K server capacity upgrade for data warehousing application. A 24 CPU server was consistently operating at greater than 90% CPU utilization and a doubling of hardware and software capacity was recommended as the standard attempt at resolution. Ignite identified parallel queries and direct path reads as the source of the bottleneck. With the database configured to run up to 16 parallel threads for each query, it was being consumed with inter-process communications. Armed with this illumination of the true problem source, the expert DBA was able to reconfigure the parallel threads and reduce the CPU and I/O overhead. The result was elimination of the need for a 100% capacity increase plus software, project management, and implementation resources required to put the new system into production.