DB2 with BLU acceleration: so much more than just a column store

doi:10.14778/2536222.2536233

Journal ArticleDOI

DB2 with BLU acceleration: so much more than just a column store

- Vol. 6, Iss: 11, pp 1080-1091

TLDR

Full integration with DB2 ensures that DB2 with BLU Acceleration benefits from the full functionality and robust utilities of a mature product, while still enjoying order-of-magnitude performance gains from revolutionary technology without even having to change the SQL.

Abstract:

DB2 with BLU Acceleration deeply integrates innovative new techniques for defining and processing column-organized tables that speed read-mostly Business Intelligence queries by 10 to 50 times and improve compression by 3 to 10 times, compared to traditional row-organized tables, without the complexity of defining indexes or materialized views on those tables. But DB2 BLU is much more than just a column store. Exploiting frequency-based dictionary compression and main-memory query processing technology from the Blink project at IBM Research - Almaden, DB2 BLU performs most SQL operations - predicate application (even range predicates and IN-lists), joins, and grouping - on the compressed values, which can be packed bit-aligned so densely that multiple values fit in a register and can be processed simultaneously via SIMD (single-instruction, multipledata) instructions. Designed and built from the ground up to exploit modern multi-core processors, DB2 BLU's hardware-conscious algorithms are carefully engineered to maximize parallelism by using novel data structures that need little latching, and to minimize data-cache and instruction-cache misses. Though DB2 BLU is optimized for in-memory processing, database size is not limited by the size of main memory. Fine-grained synopses, late materialization, and a new probabilistic buffer pool protocol for scans minimize disk I/Os, while aggressive prefetching reduces I/O stalls. Full integration with DB2 ensures that DB2 with BLU Acceleration benefits from the full functionality and robust utilities of a mature product, while still enjoying order-of-magnitude performance gains from revolutionary technology without even having to change the SQL, and can mix column-organized and row-organized tables in the same tablespace and even within the same query.

DB2 with BLU acceleration: so much more than just a column store

Citations

In-Memory Big Data Management and Processing: A Survey

Impala: A Modern, Open-Source SQL Engine for Hadoop.

Morsel-driven parallelism: a NUMA-aware query evaluation framework for the many-core age

Rethinking SIMD Vectorization for In-Memory Databases

The Design and Implementation of Modern Column-Oriented Database Systems

References

Access path selection in a relational database management system

ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging

C-store: a column-oriented DBMS

C-store: a column-oriented DBMS

Implementation techniques for main memory database systems

Related Papers (5)

HyPer: A hybrid OLTP&OLAP main memory database system based on virtual memory snapshots

The SAP HANA Database - An Architecture Overview

C-store: a column-oriented DBMS

Efficiently compiling efficient query plans for modern hardware

MonetDB/X100: Hyper-Pipelining Query Execution