Cpo - Analyze Data Faster With Db2 Blu On Power v7.9
Executive Summary
Big Data and Analytics has become an essential part of every business in the market today.
Organizations want to glean meaningful information from the massive amount of
data coming from a variety of sources, as well as from data stored in data warehouses. Line of
Business leaders are looking for ways to analyze this data, called 'Big Data', to make key business
decisions. However, having the right data solution in place is a big challenge. Many data solution
providers offer a partial solution that does not meet the client's requirements. IBM DB2 BLU
Acceleration running on newly released POWER8 servers provides a better and faster solution for
Big Data Analytics. In this white paper we look at these two technologies in detail and show
how DB2 BLU running on a POWER8 server delivers superior performance at lower cost.
We compared the price/performance of DB2 BLU on Power and a competitor database running on
a pre-integrated database machine, using various Cognos Business Intelligence (BI) test cases. The
test results show that the POWER8 server delivers up to 70% more throughput at 5.7x lower
cost than the competitor's solution.
Analyze Data Faster with DB2 BLU Acceleration and POWER8 Systems
There are three major forces driving the need for Analytics: the growth of mobile devices,
the emergence of social media, and the shift of power from companies to consumers. Each of these
poses a special set of challenges to organizations in the area of big data and warrants a closer
look at why it is important.
http://www.emc.com/about/news/press/2012/20121211-01.htm
Another interesting side effect of the factors above is that consumers have started to rely on
reviews from other consumers. Reviews on sites like Yelp and Tripadvisor can make or break a
restaurant or a hotel. They have become so important that most big hotels have dedicated staff
whose job is to respond to these reviews.
In addition, data from smart meters, banking transactions, weblogs, GPS and more can be
tracked, stored, and analyzed in many different ways. In short, for every business, analysis of
Big Data, whether structured or unstructured, can make a substantial impact to:
IBM DB2 BLU Acceleration running on a POWER8 server is an implementation of the Big
Data platform. The combination of IBM hardware and software can run the most complex
analytical queries much faster.
Figure 1: IBM Big Data Platform Architecture: Analyze all data from any source
Actionable Compression- BLU Acceleration compression preserves the order of the data, so
compressed data in BLU Acceleration tables can be used without decompression. A broad
range of operations, such as predicates and joins, can be completed on compressed data. The most
frequent values are encoded with fewer bits to optimize the compression.
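The idea of evaluating a predicate directly on order-preserving compressed data can be illustrated with a minimal sketch. It assumes a simple sorted-dictionary encoding and does not model BLU Acceleration's actual storage format or its frequency-based bit widths. The function names and the sample `prices` column are hypothetical.

```python
from bisect import bisect_left, bisect_right

def build_dictionary(values):
    """Sorted distinct values; a value's code is its position in this list."""
    return sorted(set(values))

def encode(values, dictionary):
    """Replace each value by its code. Because the dictionary is sorted,
    code order matches value order (the encoding is order-preserving)."""
    return [bisect_left(dictionary, v) for v in values]

def range_predicate_on_codes(codes, dictionary, low, high):
    """Return row positions where low <= value <= high.
    The bounds are translated to code space once; rows are then
    compared as codes and never decoded."""
    lo_code = bisect_left(dictionary, low)
    hi_code = bisect_right(dictionary, high) - 1
    return [i for i, c in enumerate(codes) if lo_code <= c <= hi_code]

prices = [40, 15, 99, 15, 63, 40, 15]           # hypothetical column
dictionary = build_dictionary(prices)            # [15, 40, 63, 99]
codes = encode(prices, dictionary)               # [1, 0, 3, 0, 2, 1, 0]
matches = range_predicate_on_codes(codes, dictionary, 20, 70)
print(matches)                                   # rows holding 40, 63, 40
```

Because code order follows value order, a range predicate needs only two dictionary lookups, after which every row is compared in its compressed form.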
Parallel Vector Processing- Recent processors such as POWER8, POWER7+/7, and Intel Ivy
Bridge support SIMD (Single Instruction Multiple Data), which
applies a single instruction to many data elements simultaneously, thus
parallelizing operations for faster processing at the chip level. BLU Acceleration is
designed to take advantage of this feature.
Data Skipping- BLU Acceleration can automatically detect large sections of data that do
not qualify for a query and skip them. Data skipping uses a secondary
object called a synopsis table, which is tiny (about 0.1% of the size of the user table) and
is created and maintained automatically. BLU Acceleration keeps the minimum and
maximum data values for "chunks" of data (about 1,000 records) in this table, so it
can get the min and max for 1,000 records with just one read.
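The data skipping described above can be sketched as follows. This is an illustrative model, not DB2's implementation: the synopsis is a plain list of per-chunk minimum and maximum values, and the function names and the sample `order_ids` column are hypothetical.

```python
CHUNK_SIZE = 1000  # the text cites chunks of about 1,000 records

def build_synopsis(column, chunk_size=CHUNK_SIZE):
    """One (start_row, min, max) entry per chunk of the column."""
    synopsis = []
    for start in range(0, len(column), chunk_size):
        chunk = column[start:start + chunk_size]
        synopsis.append((start, min(chunk), max(chunk)))
    return synopsis

def scan_with_skipping(column, synopsis, low, high, chunk_size=CHUNK_SIZE):
    """Return matching row positions and the number of chunks actually read."""
    hits, chunks_read = [], 0
    for start, chunk_min, chunk_max in synopsis:
        if chunk_max < low or chunk_min > high:
            continue  # no row in this chunk can qualify, so skip it
        chunks_read += 1
        end = min(start + chunk_size, len(column))
        for i in range(start, end):
            if low <= column[i] <= high:
                hits.append(i)
    return hits, chunks_read

order_ids = list(range(10_000))            # hypothetical, well-clustered column
synopsis = build_synopsis(order_ids)       # 10 entries for 10,000 rows
hits, chunks_read = scan_with_skipping(order_ids, synopsis, 2500, 2600)
print(len(hits), "rows matched;", chunks_read, "of", len(synopsis), "chunks read")
```

Skipping is most effective when values are clustered, as in this sorted column. On randomly ordered data most chunks span the full value range and few can be skipped.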
POWER8 Systems
                                            POWER8
Clock rates           1.9-3.4 GHz           3.0-4.15 GHz
SMT Options           1, 2*                 1, 2, 4, 8
Max Threads/Socket    30                    96
Max L1 Cache          32 KB*                64 KB
Max L2 Cache          256 KB                512 KB
Max L3 Cache          37.5 MB               96 MB
Max L4 Cache          0                     128 MB
                      68-85**               230-410
In addition, there are key innovations in the POWER8 processor that boost performance:
Coherent Accelerator Processor Interface (CAPI): a new open interface that allows PCIe3
devices to communicate with the processor with higher bandwidth and lower latency.
Simple reports are very fast-running reports, such as dashboard and ad hoc reports. They take seconds or less to execute.
Intermediate reports are more advanced reports that require predicate evaluation over the large sales_fact
table, joins, and aggregation of a relatively small result set. They take minutes to complete.
Complex reports are the most resource intensive and require multiple joins and
aggregations on the sales_fact table. For the 1TB database size, these reports scan all 9 billion
records in the sales_fact table. Complex reports can take hours to finish.
BI tests were conducted against a database with 5 schemas and 80 tables designed in a
snowflake star schema. The database contains retail sample data of 1TB (raw) in size. Each report
contains multiple SQL queries. All these reports are generated in three different testing scenarios:
1. Operational Analytics Test (Multiple Concurrent user test)
2. Deep Analytics Test (Multiple Concurrent user test)
3. Fixed Execution Test (Multiple Concurrent user test)
                    IBM POWER8    Competitor System
System
Processor
RAM                 256 GB        256 GB
Storage
Operating System
Database                          Competitor database
Performance Results
Operational Analytics
In the Operational Analytics test, all three types of reports (simple, intermediate, and complex) were
generated simultaneously by 80 concurrent users, and we measured the total throughput of both
systems. The performance results are below.
The POWER8 system generated 70% more intermediate reports than the competitor, with
similar complex report throughput.
Figure: Operational Analytics throughput in reports/hour
Simple reports/hour: POWER8 591,355, Competitor 472,229
Intermediate reports/hour: POWER8 227, Competitor 137
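The relative throughput behind the Operational Analytics results can be checked with a short calculation. This sketch assumes the chart values pair up as simple reports 591,355 (POWER8) versus 472,229 (competitor) and intermediate reports 227 versus 137. The helper name `percent_more` is hypothetical.

```python
def percent_more(power8, competitor):
    """Percentage by which the first throughput exceeds the second."""
    return (power8 / competitor - 1.0) * 100.0

simple_gain = percent_more(591_355, 472_229)   # simple reports/hour
intermediate_gain = percent_more(227, 137)     # intermediate reports/hour

print(f"Simple reports:       {simple_gain:.1f}% more")
print(f"Intermediate reports: {intermediate_gain:.1f}% more")
```

Under this pairing, the intermediate result works out to roughly 66% more reports per hour, which the text rounds to 70%, and the simple result to roughly 25% more.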
Deep Analytics
The second test was the Deep Analytics test, with a heavier workload. In this scenario, 24 concurrent users
generated intermediate and complex reports simultaneously, and we measured the total throughput
of both systems. The results of this test are below.
Figure: Deep Analytics throughput in reports/hour
Intermediate reports/hour: POWER8 237, Competitor 149
Complex reports/hour: POWER8 3.27, Competitor 2.27
Fixed Execution
Lastly, we ran the Fixed Execution test. In this test, a predefined workload (161,166
reports) was executed on both servers. We measured the total time taken by each system
to complete all the reports. POWER8 was 13% faster than the pre-integrated database competitor system.
Figure: POWER8 completed the Fixed Execution test 13% faster
Total time to complete all reports: POWER8 124, Competitor 141
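"13% faster" can be read in two ways, and a short calculation shows which one the reported totals match. This sketch assumes 124 is the POWER8 elapsed time and 141 the competitor's, in the same unit (the unit is not stated in the text). The function names are hypothetical.

```python
def time_reduction_pct(fast, slow):
    """How much less time the faster system needed, relative to the slower one."""
    return (slow - fast) / slow * 100.0

def speedup_pct(fast, slow):
    """How much more work per unit time the faster system completed."""
    return (slow / fast - 1.0) * 100.0

power8_time, competitor_time = 124, 141

reduction = time_reduction_pct(power8_time, competitor_time)
speedup = speedup_pct(power8_time, competitor_time)

print(f"Elapsed time reduction: {reduction:.1f}%")
print(f"Speedup (rate gain):    {speedup:.1f}%")
```

With these inputs, the rate-based reading gives about 13.7% and the time-reduction reading about 12.1%, so the 13% figure is consistent with the rate-based convention.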
Pricing
DB2 BLU on POWER8 outperformed the competitor in all three tests. But performance and
price go hand in hand, so we also compared the total cost of both solutions. The table below shows
the Total Cost of Acquisition (TCA). It includes the cost of hardware and software with 3 years of
maintenance and support. All prices are US list prices.
System Hardware Including Operating System
                    POWER S824      Competitor (Rack)
                    $98,388         $488,921
                    $8,688          $610,880
                    $429,820        $597,600
                    $536,896        $1,697,401

                    $79,380         $1,261,600
                                    $571,040
                    $79,380         $1,832,640

(3 Year TCA)        $616,276        $3,530,041
As the table above shows, the POWER8 S824 system provides 5.7x lower TCA than the pre-integrated database competitor V3 (Rack).
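The 5.7x figure can be reproduced from the listed prices. This sketch assumes the line items group by system in the way that makes the subtotals and totals add up. The item labels are not given in the text, so the list names here are hypothetical.

```python
# Line items in US dollars; the grouping per system is an assumption
# inferred from the sums, since the row labels are not stated.
power_s824_items = [98_388, 8_688, 429_820, 79_380]
competitor_items = [488_921, 610_880, 597_600, 1_261_600, 571_040]

power_tca = sum(power_s824_items)
competitor_tca = sum(competitor_items)
ratio = competitor_tca / power_tca

print(f"POWER S824 3-year TCA: ${power_tca:,}")
print(f"Competitor 3-year TCA: ${competitor_tca:,}")
print(f"Competitor / POWER8:   {ratio:.1f}x")
```

The sums match the reported totals of $616,276 and $3,530,041, and their ratio rounds to 5.7.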
can operate on compressed data, but the competitor database has the additional overhead of
decompression before it performs any operation, which worsens performance. Lastly, BLU
Acceleration is easy to deploy and administer: just load and go. There is no need for indexes or
complex performance tuning.
Conclusion
The POWER8 server is designed specifically for today's complex workloads such as Big Data Analytics.
IBM's DB2 BLU Acceleration is optimized for the POWER8 server, and together they provide
breakthrough performance for analytics workloads at 5.7x lower cost. DB2 BLU has
innovations such as dynamic in-memory processing, columnar organized data, actionable compression, and
exploitation of SIMD that are simply unmatched by any competition.
The combination of BLU Acceleration and POWER8 provides the best solution, in both price and
performance, for clients who are looking to gain new insights into their data.
DISCLAIMER NOTICE
This case study's results are based on measurements and projections using standard IBM workloads in a
controlled environment. This information is presented along with general recommendations to assist
the reader to have a better understanding of IBM (*) products. The actual throughput or performance
that any user will experience will vary depending upon considerations such as the amount of
multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the
workload processed. Therefore, no assurance can be given that an individual user will achieve
throughput or performance or power savings improvements equivalent to the ratios stated here. All
performance and power data contained in this publication was obtained in the specific operating
environment and under the conditions described within the document and is presented as an
illustration. Performance or power characteristics obtained in other operating environments may vary
and customers should conduct their own testing.
Information is provided "AS IS" without warranty of any kind.
The use of this information or the implementation of any of these techniques is a customer
responsibility and depends on the customer's ability to evaluate and integrate them into the customer's
operational environment. While each item may have been reviewed by IBM for accuracy in a specific
situation, there is no guarantee that the same or similar results will be obtained elsewhere. Customers
attempting to adapt these techniques to their own environments do so at their own risk.