Journal ArticleDOI

OLTP-Bench: an extensible testbed for benchmarking relational databases

TL;DR: OLTP-Bench, an extensible "batteries-included" DBMS benchmarking testbed, is presented; its key contributions are ease of use and extensibility, tight control of transaction mixtures, request rates, and access distributions over time, and support for all major DBMSs and DBaaS platforms.
Abstract: Benchmarking is an essential aspect of any database management system (DBMS) effort. Despite several recent advancements, such as pre-configured cloud database images and database-as-a-service (DBaaS) offerings, deploying a comprehensive testing platform with a diverse set of datasets and workloads is still far from trivial. In many cases, researchers and developers are limited to a small number of workloads when evaluating the performance characteristics of their work. This is due to the lack of a universal benchmarking infrastructure and to the difficulty of gaining access to real data and workloads. The result is a great deal of unnecessary engineering effort and performance evaluation results that are difficult to compare. To remedy these problems, we present OLTP-Bench, an extensible "batteries-included" DBMS benchmarking testbed. The key contributions of OLTP-Bench are its ease of use and extensibility, its support for tight control of transaction mixtures, request rates, and access distributions over time, and its ability to support all major DBMSs and DBaaS platforms. Moreover, it is bundled with fifteen workloads that differ in complexity and system demands: four synthetic workloads, eight workloads from popular benchmarks, and three workloads derived from real-world applications. Through a comprehensive set of experiments conducted on popular DBMS and DBaaS offerings, we demonstrate the features provided by OLTP-Bench and the effectiveness of our testbed in characterizing the performance of database services.
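
The tight workload control described above (per-phase transaction weights, request rates, and durations) can be illustrated with a minimal sketch. The `Phase` class and `run_phase` function below are illustrative names of our own, not OLTP-Bench's actual Java API.

```python
import random
import time

# Minimal sketch of rate-controlled, weighted transaction mixing as
# described in the abstract. All names here are illustrative, not
# OLTP-Bench's real interfaces.

class Phase:
    def __init__(self, duration_s, rate_per_s, weights):
        self.duration_s = duration_s      # how long this phase runs
        self.rate_per_s = rate_per_s      # target request rate
        self.weights = weights            # {txn_name: relative weight}

def run_phase(phase, execute):
    """Issue transactions at a fixed rate with a weighted mixture."""
    names = list(phase.weights)
    weights = [phase.weights[n] for n in names]
    interval = 1.0 / phase.rate_per_s
    deadline = time.monotonic() + phase.duration_s
    while time.monotonic() < deadline:
        txn = random.choices(names, weights=weights, k=1)[0]
        execute(txn)                      # submit one transaction
        time.sleep(interval)              # crude open-loop pacing

# Example: shift from a read-heavy to a write-heavy mixture over time.
phases = [
    Phase(duration_s=2, rate_per_s=50, weights={"read": 90, "write": 10}),
    Phase(duration_s=2, rate_per_s=50, weights={"read": 10, "write": 90}),
]
for p in phases:
    run_phase(p, execute=lambda txn: None)  # stub executor
```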


Citations
Proceedings ArticleDOI
09 May 2017
TL;DR: BLOCKBENCH is an evaluation framework for analyzing private blockchains; it can be used to assess blockchains' viability as another distributed data processing platform while helping developers identify bottlenecks and improve their platforms.
Abstract: Blockchain technologies are taking the world by storm. Public blockchains, such as Bitcoin and Ethereum, enable secure peer-to-peer applications like crypto-currency or smart contracts. Their security and performance are well studied. This paper concerns recent private blockchain systems designed with stronger security (trust) assumptions and performance requirements. These systems target, and aim to disrupt, applications which have so far been implemented on top of database systems, for example banking, finance, and trading applications. Multiple platforms for private blockchains are being actively developed and fine-tuned. However, there is a clear lack of a systematic framework with which different systems can be analyzed and compared against each other. Such a framework can be used to assess blockchains' viability as another distributed data processing platform, while helping developers to identify bottlenecks and accordingly improve their platforms. In this paper, we first describe BLOCKBENCH, the first evaluation framework for analyzing private blockchains. It serves as a fair means of comparison for different platforms and enables deeper understanding of different system design choices. Any private blockchain can be integrated into BLOCKBENCH via simple APIs and benchmarked against workloads based on real and synthetic smart contracts. BLOCKBENCH measures overall and component-wise performance in terms of throughput, latency, scalability, and fault-tolerance. Next, we use BLOCKBENCH to conduct a comprehensive evaluation of three major private blockchains: Ethereum, Parity, and Hyperledger Fabric. The results demonstrate that these systems are still far from displacing current database systems in traditional data processing workloads. Furthermore, there are gaps in performance among the three systems, attributable to design choices at different layers of the blockchain's software stack. We have released BLOCKBENCH for public use.
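
The integration model in the abstract (any backend plugs in through a small API and is measured for throughput and latency) might look roughly like the following; the `BlockchainDriver` interface and all names are hypothetical, not the released BLOCKBENCH API.

```python
import abc
import time

# Rough sketch of the integration model described above: a backend
# plugs in through a small driver interface and is measured for
# throughput and latency. Names are hypothetical.

class BlockchainDriver(abc.ABC):
    @abc.abstractmethod
    def submit(self, txn: bytes) -> None:
        """Submit one transaction and block until it commits."""

def benchmark(driver, txns):
    latencies = []
    start = time.monotonic()
    for txn in txns:
        t0 = time.monotonic()
        driver.submit(txn)
        latencies.append(time.monotonic() - t0)
    elapsed = time.monotonic() - start
    latencies.sort()
    return {
        "throughput_tps": len(txns) / elapsed,
        "p50_latency_s": latencies[len(latencies) // 2],
        "p99_latency_s": latencies[int(len(latencies) * 0.99)],
    }

class NullDriver(BlockchainDriver):     # stand-in backend for testing
    def submit(self, txn):
        time.sleep(0.001)

print(benchmark(NullDriver(), [b"tx"] * 100))
```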

731 citations

Proceedings ArticleDOI
09 May 2017
TL;DR: An automated approach is presented that leverages past experience and collects new information to tune DBMS configurations; it recommends configurations that are as good as or better than those generated by existing tools or a human expert.
Abstract: Database management system (DBMS) configuration tuning is an essential aspect of any data-intensive application effort. But this is historically a difficult task because DBMSs have hundreds of configuration "knobs" that control everything in the system, such as the amount of memory to use for caches and how often data is written to storage. The problem with these knobs is that they are not standardized (i.e., two DBMSs use a different name for the same knob), not independent (i.e., changing one knob can impact others), and not universal (i.e., what works for one application may be sub-optimal for another). Worse, information about the effects of the knobs typically comes only from (expensive) experience. To overcome these challenges, we present an automated approach that leverages past experience and collects new information to tune DBMS configurations: we use a combination of supervised and unsupervised machine learning methods to (1) select the most impactful knobs, (2) map unseen database workloads to previous workloads from which we can transfer experience, and (3) recommend knob settings. We implemented our techniques in a new tool called OtterTune and tested it on two DBMSs. Our evaluation shows that OtterTune recommends configurations that are as good as or better than ones generated by existing tools or a human expert.
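
The three-stage pipeline described above can be sketched with off-the-shelf components; the paper pairs Lasso-based knob ranking with Gaussian-process models, but the data shapes, names, and synthetic inputs below are our own illustration, not OtterTune's code.

```python
import numpy as np
from sklearn.linear_model import Lasso
from sklearn.gaussian_process import GaussianProcessRegressor

# Sketch of the three stages above, with synthetic data standing in
# for observed tuning sessions.
rng = np.random.default_rng(0)
knobs = rng.uniform(size=(200, 10))          # 200 trials x 10 knobs
perf = 3 * knobs[:, 0] - 2 * knobs[:, 3] + rng.normal(0, 0.1, 200)

# (1) Select the most impactful knobs via Lasso's sparse coefficients.
lasso = Lasso(alpha=0.01).fit(knobs, perf)
impactful = np.argsort(-np.abs(lasso.coef_))[:2]
print("most impactful knobs:", impactful)

# (2) Map an unseen workload to the closest previously seen workload
# by Euclidean distance over its runtime metric vector.
seen_metrics = rng.uniform(size=(5, 4))      # 5 past workloads x 4 metrics
new_metrics = rng.uniform(size=4)
nearest = np.argmin(np.linalg.norm(seen_metrics - new_metrics, axis=1))
print("most similar past workload:", nearest)

# (3) Fit a GP over the impactful knobs and recommend the best
# predicted configuration from a random candidate set.
gp = GaussianProcessRegressor().fit(knobs[:, impactful], perf)
candidates = rng.uniform(size=(1000, len(impactful)))
best = candidates[np.argmax(gp.predict(candidates))]
print("recommended settings for impactful knobs:", best)
```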

418 citations


Cites methods from "OLTP-Bench: an extensible testbed f..."

  • ...For these experiments, we use workloads from the OLTP-Bench testbed that differ in complexity and system demands [3, 23]:...

Journal ArticleDOI
01 Nov 2014
TL;DR: A formal framework, invariant confluence, is developed that determines whether an application requires coordination for correct execution by operating on application-level invariants over database states; analysis of common invariants and operations shows that many are invariant confluent and therefore achievable without coordination.
Abstract: Minimizing coordination, or blocking communication between concurrently executing operations, is key to maximizing scalability, availability, and high performance in database systems. However, uninhibited coordination-free execution can compromise application correctness, or consistency. When is coordination necessary for correctness? The classic use of serializable transactions is sufficient to maintain correctness but is not necessary for all applications, sacrificing potential scalability. In this paper, we develop a formal framework, invariant confluence, that determines whether an application requires coordination for correct execution. By operating on application-level invariants over database states (e.g., integrity constraints), invariant confluence analysis provides a necessary and sufficient condition for safe, coordination-free execution. When programmers specify their application invariants, this analysis allows databases to coordinate only when anomalies that might violate invariants are possible. We analyze the invariant confluence of common invariants and operations from real-world database systems (i.e., integrity constraints) and applications and show that many are invariant confluent and therefore achievable without coordination. We apply these results to a proof-of-concept coordination-avoiding database prototype and demonstrate sizable performance gains compared to serializable execution, notably a 25-fold improvement over prior TPC-C New-Order performance on a 200-server cluster.
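
The core test, whether divergent states that each satisfy an invariant still satisfy it after merging, can be illustrated with a toy set-union merge; the two invariants below are simplified stand-ins for the paper's formal analysis.

```python
# Toy illustration of invariant confluence: two replicas diverge, each
# preserving the invariant locally, and we ask whether the merged
# state still satisfies it. The set-union merge and the two invariants
# are simplifications, not the paper's formal model.

def merged_state_ok(base, ops_a, ops_b, invariant):
    state_a = base | ops_a          # replica A applies its writes
    state_b = base | ops_b          # replica B applies its writes
    assert invariant(state_a) and invariant(state_b)
    return invariant(state_a | state_b)   # set-union merge

# Invariant 1: every order row references an existing customer (a
# foreign-key constraint). Inserting orders for existing customers on
# both replicas cannot break it: it is invariant confluent.
base = {("customer", 1)}
ok = merged_state_ok(
    base,
    {("order", 1, "cust", 1)},
    {("order", 2, "cust", 1)},
    lambda s: all(("customer", r[3]) in s for r in s if r[0] == "order"),
)
print("foreign key holds after merge:", ok)      # True

# Invariant 2: order IDs are unique. Both replicas can assign the same
# fresh ID (7 here), so the merged state violates uniqueness:
# coordination is needed.
ids = lambda s: [r[1] for r in s if r[0] == "order"]
ok = merged_state_ok(
    set(),
    {("order", 7, "cust", 1)},
    {("order", 7, "cust", 2)},
    lambda s: len(ids(s)) == len(set(ids(s))),
)
print("uniqueness holds after merge:", ok)       # False
```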

194 citations


Cites background or methods or result from "OLTP-Bench: an extensible testbed f..."

  • ...The TPC-C benchmark is the gold standard for database concurrency control [23] both in research and in industry [55], and in recent years has been used as a yardstick for distributed database concurrency control performance [52, 54, 57]....

  • ...In this section, we apply these combinations to the workloads of the OLTP-Bench suite [23], with a focus on the TPC-C benchmark....

  • ...For greater variety, we also studied the workloads of the recently assembled OLTP-Bench suite [23], performing a similar analysis to that of Section 6....

  • ...As an extended case study, we examine the TPC-C benchmark [55], the preferred standard for evaluating new concurrency control algorithms [23, 35, 46, 52, 54]....

  • ...We found (and confirmed with an author of [23]) that for nine of fourteen remaining (non-TPC-C) OLTP-Bench applications, the workload transactions did not involve integrity constraints (e....

Proceedings ArticleDOI
04 Apr 2017
TL;DR: Thermostat is presented, an application-transparent, huge-page-aware mechanism for placing pages in a dual-technology hybrid memory system that achieves both the cost advantages of two-tiered memory and the performance advantages of transparent huge pages; it is implemented in the Linux kernel and evaluated on representative cloud computing workloads running under KVM virtualization.
Abstract: The advent of new memory technologies that are denser and cheaper than commodity DRAM has renewed interest in two-tiered main memory schemes. Infrequently accessed application data can be stored in such memories to achieve significant memory cost savings. Past research on two-tiered main memory has assumed a 4KB page size. However, 2MB huge pages are performance critical in cloud applications with large memory footprints, especially in virtualized cloud environments, where nested paging drastically increases the cost of 4KB page management. We present Thermostat, an application-transparent huge-page-aware mechanism to place pages in a dual-technology hybrid memory system while achieving both the cost advantages of two-tiered memory and performance advantages of transparent huge pages. We present an online page classification mechanism that accurately classifies both 4KB and 2MB pages as hot or cold while incurring no observable performance overhead across several representative cloud applications. We implement Thermostat in Linux kernel version 4.5 and evaluate its effectiveness on representative cloud computing workloads running under KVM virtualization. We emulate slow memory with performance characteristics approximating near-future high-density memory technology and show that Thermostat migrates up to 50% of application footprint to slow memory while limiting performance degradation to 3%, thereby reducing memory cost up to 30%.
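
The online hot/cold classification described above can be approximated at user level by sampling a fraction of pages and thresholding estimated access counts; the real mechanism works inside the kernel via page-table manipulation, so everything below (names, threshold, trace) is an illustrative simplification.

```python
import random

# Simplified user-level sketch of hot/cold classification: sample a
# fraction of pages, count accesses to the sampled set over a window,
# and mark pages above a threshold as hot. Thermostat's actual
# mechanism is kernel-level; this is only the idea in miniature.

PAGE_COUNT = 1000
SAMPLE_FRACTION = 0.05        # observe only 5% of pages
HOT_THRESHOLD = 10            # accesses per window to count as "hot"

# Synthetic trace: a small working set receives most of the accesses.
accesses = [random.randrange(50) if random.random() < 0.9
            else random.randrange(PAGE_COUNT) for _ in range(20000)]

sampled = set(random.sample(range(PAGE_COUNT),
                            int(PAGE_COUNT * SAMPLE_FRACTION)))
counts = {}
for page in accesses:
    if page in sampled:       # only sampled pages are monitored
        counts[page] = counts.get(page, 0) + 1

hot = {p for p, c in counts.items() if c >= HOT_THRESHOLD}
cold = sampled - hot          # candidates for migration to slow memory
print(f"sampled={len(sampled)} hot={len(hot)} cold={len(cold)}")
```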

146 citations


Cites methods from "OLTP-Bench: an extensible testbed f..."

  • ...We use the open-source TPCC implementation from OLTP-Bench [20] (available at https://github.com/oltpbenchmark/oltpbench)....

Proceedings ArticleDOI
27 May 2018
TL;DR: A robust forecasting framework called QueryBot 5000 is presented that allows a DBMS to predict the expected arrival rate of queries based on historical data, along with a clustering-based technique for reducing the total number of forecasting models to maintain.
Abstract: The first step towards an autonomous database management system (DBMS) is the ability to model the target application's workload. This is necessary to allow the system to anticipate future workload needs and select the proper optimizations in a timely manner. Previous forecasting techniques model the resource utilization of the queries. Such metrics, however, change whenever the physical design of the database and the hardware resources change, thereby rendering previous forecasting models useless. We present a robust forecasting framework called QueryBot 5000 that allows a DBMS to predict the expected arrival rate of queries in the future based on historical data. To better support highly dynamic environments, our approach uses the logical composition of queries in the workload rather than the amount of physical resources used for query execution. It provides multiple horizons (short- vs. long-term) with different aggregation intervals. We also present a clustering-based technique for reducing the total number of forecasting models to maintain. To evaluate our approach, we compare our forecasting models against other state-of-the-art models on three real-world database traces. We implemented our models in an external controller for PostgreSQL and MySQL and demonstrate their effectiveness in selecting indexes.
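
The clustering-plus-forecasting structure described above can be sketched as follows; the model choice (plain linear regression over hourly arrival-rate buckets) and all data are illustrative assumptions, since the paper evaluates several forecasting models and horizons.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.linear_model import LinearRegression

# Sketch of the structure above: per-template arrival-rate histories
# are clustered so one model per cluster suffices, then each cluster's
# aggregate rate is extrapolated. Shapes and models are illustrative.
rng = np.random.default_rng(1)
hours = np.arange(168)                      # one week of hourly buckets
# 30 query templates, each with a daily cycle plus noise.
rates = np.stack([
    50 + 30 * np.sin(2 * np.pi * (hours + rng.integers(24)) / 24)
    + rng.normal(0, 5, hours.size)
    for _ in range(30)
])

# Reduce 30 per-template models to a handful of per-cluster models.
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(rates)

for c in range(3):
    total = rates[labels == c].sum(axis=0)  # cluster's aggregate rate
    model = LinearRegression().fit(hours.reshape(-1, 1), total)
    next_hour = model.predict([[168]])[0]   # one-step-ahead forecast
    print(f"cluster {c}: {np.sum(labels == c)} templates, "
          f"forecast rate at hour 168 = {next_hour:.1f}/h")
```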

139 citations

References
Proceedings ArticleDOI
Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, Russell Sears
10 Jun 2010
TL;DR: This work presents the "Yahoo! Cloud Serving Benchmark" (YCSB) framework, with the goal of facilitating performance comparisons of the new generation of cloud data serving systems, and defines a core set of benchmarks and reports results for four widely used systems.
Abstract: While the use of MapReduce systems (such as Hadoop) for large scale data analysis has been widely recognized and studied, we have recently seen an explosion in the number of systems developed for cloud data serving. These newer systems address "cloud OLTP" applications, though they typically do not support ACID transactions. Examples of systems proposed for cloud serving use include BigTable, PNUTS, Cassandra, HBase, Azure, CouchDB, SimpleDB, Voldemort, and many others. Further, they are being applied to a diverse range of applications that differ considerably from traditional (e.g., TPC-C like) serving workloads. The number of emerging cloud serving systems and the wide range of proposed applications, coupled with a lack of apples-to-apples performance comparisons, makes it difficult to understand the tradeoffs between systems and the workloads for which they are suited. We present the "Yahoo! Cloud Serving Benchmark" (YCSB) framework, with the goal of facilitating performance comparisons of the new generation of cloud data serving systems. We define a core set of benchmarks and report results for four widely used systems: Cassandra, HBase, Yahoo!'s PNUTS, and a simple sharded MySQL implementation. We also hope to foster the development of additional cloud benchmark suites that represent other classes of applications by making our benchmark tool available via open source. In this regard, a key feature of the YCSB framework/tool is that it is extensible--it supports easy definition of new workloads, in addition to making it easy to benchmark new systems.
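
A YCSB-style workload boils down to an operation mix plus a key-access distribution. The sketch below mimics the documented 95%-read/5%-update mix of YCSB's workload B against a stand-in store; all names are illustrative rather than YCSB's Java interfaces.

```python
import random

# Sketch of a YCSB-style workload definition: an operation mix plus a
# key-access distribution, applied against any store exposing
# read/update. The in-memory dict is a stand-in for a real store.

N_KEYS = 1000

# Zipf-like weights: key i is requested proportionally to 1/(i+1),
# concentrating traffic on a small set of hot keys.
zipf_weights = [1.0 / (i + 1) for i in range(N_KEYS)]

def workload_b_op():
    """YCSB workload B is documented as 95% reads / 5% updates."""
    op = "read" if random.random() < 0.95 else "update"
    key = random.choices(range(N_KEYS), weights=zipf_weights, k=1)[0]
    return op, f"user{key}"

store = {}
reads = updates = 0
for _ in range(10_000):
    op, key = workload_b_op()
    if op == "read":
        store.get(key)
        reads += 1
    else:
        store[key] = "x" * 100      # value size is arbitrary here
        updates += 1
print(f"reads={reads} updates={updates}")
```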

3,276 citations


"OLTP-Bench: an extensible testbed f..." refers background in this paper

  • ...Existing benchmarking frameworks, however, only support a small number of workloads [16, 24] or a single DBMS [3, 5]....

  • ...Although a number of benchmarks have been proposed in the past [10, 16], to the best of our knowledge such an extensive testbed is not available today....

Proceedings Article
16 May 2010
TL;DR: An in-depth comparison of three measures of influence, using a large amount of data collected from Twitter, is presented; the findings suggest that topological measures such as indegree alone reveal very little about the influence of a user.
Abstract: Directed links in social media could represent anything from intimate friendships to common interests, or even a passion for breaking news or celebrity gossip. Such directed links determine the flow of information and hence indicate a user's influence on others — a concept that is crucial in sociology and viral marketing. In this paper, using a large amount of data collected from Twitter, we present an in-depth comparison of three measures of influence: indegree, retweets, and mentions. Based on these measures, we investigate the dynamics of user influence across topics and time. We make several interesting observations. First, popular users who have high indegree are not necessarily influential in terms of spawning retweets or mentions. Second, most influential users can hold significant influence over a variety of topics. Third, influence is not gained spontaneously or accidentally, but through concerted effort such as limiting tweets to a single topic. We believe that these findings provide new insights for viral marketing and suggest that topological measures such as indegree alone reveal very little about the influence of a user.
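
The comparison methodology can be sketched as a rank-correlation computation between per-user influence measures; the data below is synthetic, standing in for the paper's Twitter crawl, and the coupling coefficient is an arbitrary assumption.

```python
import numpy as np
from scipy.stats import spearmanr

# Rank users by each influence measure and compute rank correlation
# between the rankings. Synthetic heavy-tailed data stands in for the
# study's real Twitter measurements.
rng = np.random.default_rng(2)
n_users = 1000
indegree = rng.pareto(1.5, n_users)             # follower counts
# Retweets/mentions only loosely coupled to indegree in this toy setup.
retweets = 0.3 * indegree + rng.pareto(1.5, n_users)
mentions = 0.3 * indegree + rng.pareto(1.5, n_users)

for name, series in [("retweets", retweets), ("mentions", mentions)]:
    rho, _ = spearmanr(indegree, series)
    print(f"spearman(indegree, {name}) = {rho:.2f}")
```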

3,041 citations

Journal ArticleDOI
06 May 2011
TL;DR: This paper examines a number of SQL and so-called "NoSQL" data stores designed to scale simple OLTP-style application loads over many servers, and contrasts the new systems on their data model, consistency mechanisms, storage mechanisms, durability guarantees, availability, query support, and other dimensions.
Abstract: In this paper, we examine a number of SQL and so-called "NoSQL" data stores designed to scale simple OLTP-style application loads over many servers. Originally motivated by Web 2.0 applications, these systems are designed to scale to thousands or millions of users doing updates as well as reads, in contrast to traditional DBMSs and data warehouses. We contrast the new systems on their data model, consistency mechanisms, storage mechanisms, durability guarantees, availability, query support, and other dimensions. These systems typically sacrifice some of these dimensions, e.g. database-wide transaction consistency, in order to achieve others, e.g. higher availability and scalability.

1,412 citations


"OLTP-Bench: an extensible testbed f..." refers background in this paper

  • ...We suspect, in fact, that the recent success of distributed key-value storage systems [13] is at least partially due to the difficulty of understanding and predicting the performance of relational DBMSs for these execution environments....

Book ChapterDOI
20 Aug 2002
TL;DR: This work provides a framework to assess the ability of an XML database to cope with a broad range of query types typically encountered in real-world scenarios, offering a set of queries where each query is intended to challenge a particular aspect of the query processor.
Abstract: While standardization efforts for XML query languages have been progressing, researchers and users increasingly focus on the database technology that has to deliver on the new challenges that the abundance of XML documents poses to data management: validation, performance evaluation and optimization of XML query processors are the upcoming issues. Following a long tradition in database research, we provide a framework to assess the abilities of an XML database to cope with a broad range of different query types typically encountered in real-world scenarios. The benchmark can help both implementors and users to compare XML databases in a standardized application scenario. To this end, we offer a set of queries where each query is intended to challenge a particular aspect of the query processor. The overall workload we propose consists of a scalable document database and a concise, yet comprehensive set of queries which covers the major aspects of XML query processing ranging from textual features to data analysis queries and ad hoc queries. We complement our research with results we obtained from running the benchmark on several XML database platforms. These results are intended to give a first baseline and illustrate the state of the art.
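
To make the query-workload idea concrete, here is a toy XML document queried with a value predicate plus structural navigation, the kind of aspect-targeted query the framework describes; the document shape and query are simplified stand-ins, not actual benchmark queries.

```python
import xml.etree.ElementTree as ET

# Toy aspect-targeted query: a value predicate plus structural
# navigation over a small auction-style document. Both the document
# and the query are illustrative stand-ins.
doc = ET.fromstring(
    "<site><open_auctions>"
    "<open_auction id='a1'><current>35.0</current></open_auction>"
    "<open_auction id='a2'><current>120.5</current></open_auction>"
    "</open_auctions></site>"
)

# Select the IDs of auctions whose current bid exceeds 100.
hot = [a.get("id")
       for a in doc.iter("open_auction")
       if float(a.findtext("current")) > 100]
print(hot)    # ['a2']
```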

822 citations

Journal ArticleDOI
01 Sep 2010
TL;DR: A study of the performance variance of the most widely used cloud infrastructure (Amazon EC2) is presented, using established microbenchmarks to measure variance in CPU, I/O, and network performance, and a multi-node MapReduce application to quantify the impact on real data-intensive applications.
Abstract: One of the main reasons why cloud computing has gained so much popularity is its ease of use and its ability to scale computing resources on demand. As a result, users can now rent computing nodes on large commercial clusters through several vendors, such as Amazon and Rackspace. However, despite the attention paid by cloud providers, performance unpredictability is a major issue in cloud computing for (1) database researchers performing wall-clock experiments, and (2) database applications providing service-level agreements. In this paper, we carry out a study of the performance variance of the most widely used cloud infrastructure (Amazon EC2) from different perspectives. We use established microbenchmarks to measure performance variance in CPU, I/O, and network, and we use a multi-node MapReduce application to quantify the impact on real data-intensive applications. We collected data for an entire month and compared it with results obtained on a local cluster. Our results show that EC2 performance varies a lot and often falls into two bands with a large performance gap in between, which is somewhat surprising. We observe in our experiments that these two bands correspond to the different virtual system types provided by Amazon. Moreover, we analyze results considering different availability zones, points in time, and locations. This analysis indicates that, among other factors, the choice of availability zone also influences performance variability. A major conclusion of our work is that the variance on EC2 is currently so high that wall-clock experiments may only be performed with considerable care. To this end, we provide some hints to users.
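
The variance analysis can be reproduced in miniature: given repeated timing samples, compute the coefficient of variation and check for the two-band pattern; the timings below are synthetic stand-ins for EC2 measurements, and the detection heuristic is a deliberately crude assumption.

```python
import numpy as np

# Sketch of the variance analysis described above: compute the
# coefficient of variation of repeated microbenchmark timings and
# check for a two-band (bimodal) pattern with a crude median split.
rng = np.random.default_rng(3)
# Two bands: runs land on one of two underlying "virtual system types".
timings = np.concatenate([
    rng.normal(1.0, 0.05, 500),   # fast band (seconds)
    rng.normal(1.6, 0.05, 500),   # slow band
])

cv = timings.std() / timings.mean()
print(f"coefficient of variation: {cv:.2%}")

# Crude bimodality check: split at the median and compare the band
# means against the pooled spread.
ordered = np.sort(timings)
lo, hi = ordered[:500], ordered[500:]
gap = (hi.mean() - lo.mean()) / timings.std()
print(f"band separation (gap/std): {gap:.1f}")   # >> 1 suggests two bands
```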

690 citations