OLTP through the looking glass, and what we found there

doi:10.1145/1376616.1376713

Proceedings ArticleDOI

OLTP through the looking glass, and what we found there

- pp 981-992

TLDR

Overall, overheads and optimizations that explain a total difference of about a factor of 20x in raw performance are identified and it is shown that there is no single "high pole in the tent" in modern (memory resident) database systems, but that substantial time is spent in logging, latching, locking, B-tree, and buffer management operations.

Abstract:

Online Transaction Processing (OLTP) databases include a suite of features - disk-resident B-trees and heap files, locking-based concurrency control, support for multi-threading - that were optimized for computer technology of the late 1970's Advances in modern processors, memories, and networks mean that today's computers are vastly different from those of 30 years ago, such that many OLTP databases will now fit in main memory, and most OLTP transactions can be processed in milliseconds or less Yet database architecture has changed littleBased on this observation, we look at some interesting variants of conventional database systems that one might build that exploit recent hardware trends, and speculate on their performance through a detailed instruction-level breakdown of the major components involved in a transaction processing database system (Shore) running a subset of TPC-C Rather than simply profiling Shore, we progressively modified it so that after every feature removal or optimization, we had a (faster) working system that fully ran our workload Overall, we identify overheads and optimizations that explain a total difference of about a factor of 20x in raw performance We also show that there is no single "high pole in the tent" in modern (memory resident) database systems, but that substantial time is spent in logging, latching, locking, B-tree, and buffer management operations

OLTP through the looking glass, and what we found there

Citations

HyPer: A hybrid OLTP&OLAP main memory database system based on virtual memory snapshots

Hekaton: SQL server's memory-optimized OLTP engine

SQL databases v. NoSQL databases

In-Memory Big Data Management and Processing: A Survey

Skew-aware automatic database partitioning in shared-nothing, parallel OLTP systems

References

MapReduce: simplified data processing on large clusters

Chord: A scalable peer-to-peer lookup service for internet applications

The Design and Analysis of Computer Algorithms

Bigtable: A Distributed Storage System for Structured Data (Awarded Best Paper!).

Dynamo: amazon's highly available key-value store

Related Papers (5)

H-store: a high-performance, distributed main memory transaction processing system

HyPer: A hybrid OLTP&OLAP main memory database system based on virtual memory snapshots

Benchmarking cloud serving systems with YCSB

ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging

Schism: a workload-driven approach to database replication and partitioning