Showing papers on "Concurrency control published in 2015"

PDF

Open Access

Journal Article•DOI•

In-Memory Big Data Management and Processing: A Survey

[...]

Hao Zhang¹, Gang Chen², Beng Chin Ooi¹, Kian-Lee Tan¹, Meihui Zhang³ - Show less +1 more•Institutions (3)

National University of Singapore¹, Zhejiang University², Singapore University of Technology and Design³

01 Jul 2015-IEEE Transactions on Knowledge and Data Engineering

TL;DR: This survey aims to provide a thorough review of a wide range of in-memory data management and processing proposals and systems, including both data storage systems and data processing frameworks.

...read moreread less

Abstract: Growing main memory capacity has fueled the development of in-memory big data management and processing. By eliminating disk I/O bottleneck, it is now possible to support interactive data analytics. However, in-memory systems are much more sensitive to other sources of overhead that do not matter in traditional I/O-bounded disk-based systems. Some issues such as fault-tolerance and consistency are also more challenging to handle in in-memory environment. We are witnessing a revolution in the design of database systems that exploits main memory as its data storage layer. Many of these researches have focused along several dimensions: modern CPU and memory hierarchy utilization, time/space efficiency, parallelism, and concurrency control. In this survey, we aim to provide a thorough review of a wide range of in-memory data management and processing proposals and systems, including both data storage systems and data processing frameworks. We also give a comprehensive presentation of important technology in memory management, and some key factors that need to be considered in order to achieve efficient in-memory data management and processing.

...read moreread less

391 citations

Proceedings Article•DOI•

Fast in-memory transaction processing using RDMA and HTM

[...]

Xingda Wei¹, Jiaxin Shi¹, Yanzhe Chen¹, Rong Chen¹, Haibo Chen¹ - Show less +1 more•Institutions (1)

Shanghai Jiao Tong University¹

04 Oct 2015

TL;DR: This work presents DrTM, a fast in-memory transaction processing system that exploits advanced hardware features (i.e., RDMA and HTM) to improve latency and throughput by over one order of magnitude compared to state-of-the-art distributed transaction systems.

...read moreread less

Abstract: We present DrTM, a fast in-memory transaction processing system that exploits advanced hardware features (i.e., RDMA and HTM) to improve latency and throughput by over one order of magnitude compared to state-of-the-art distributed transaction systems. The high performance of DrTM are enabled by mostly offloading concurrency control within a local machine into HTM and leveraging the strong consistency between RDMA and HTM to ensure serializability among concurrent transactions across machines. We further build an efficient hash table for DrTM by leveraging HTM and RDMA to simplify the design and notably improve the performance. We describe how DrTM supports common database features like read-only transactions and logging for durability. Evaluation using typical OLTP workloads including TPC-C and SmallBank show that DrTM scales well on a 6-node cluster and achieves over 5.52 and 138 million transactions per second for TPC-C and SmallBank Respectively. This number outperforms a state-of-the-art distributed transaction system (namely Calvin) by at least 17.9X for TPC-C.

...read moreread less

239 citations

Proceedings Article•DOI•

Fast Serializable Multi-Version Concurrency Control for Main-Memory Database Systems

[...]

Thomas Neumann¹, Tobias Mühlbauer¹, Alfons Kemper¹•Institutions (1)

Technische Universität München¹

27 May 2015

TL;DR: This work presents a novel MVCC implementation for main-memory database systems that has very little overhead compared to serial execution with single-version concurrency control and verifies that the (extensional) writes of recently committed transactions do not intersect with the (intensional) read predicate space of a committing transaction.

...read moreread less

Abstract: Multi-Version Concurrency Control (MVCC) is a widely employed concurrency control mechanism, as it allows for execution modes where readers never block writers. However, most systems implement only snapshot isolation (SI) instead of full serializability. Adding serializability guarantees to existing SI implementations tends to be prohibitively expensive. We present a novel MVCC implementation for main-memory database systems that has very little overhead compared to serial execution with single-version concurrency control, even when maintaining serializability guarantees. Updating data in-place and storing versions as before-image deltas in undo buffers not only allows us to retain the high scan performance of single-version systems but also forms the basis of our cheap and fine-grained serializability validation mechanism. The novel idea is based on an adaptation of precision locking and verifies that the (extensional) writes of recently committed transactions do not intersect with the (intensional) read predicate space of a committing transaction. We experimentally show that our MVCC model allows very fast processing of transactions with point accesses as well as read-heavy transactions and that there is little need to prefer SI over full serializability any longer.

...read moreread less

221 citations

Dissertation•

An evaluation of concurrency control with one thousand cores

[...]

Xiangyao Yu

01 Jan 2015

TL;DR: It is concluded that rather than pursuing incremental solutions, many-core chips may require a completely redesigned DBMS architecture that is built from ground up and is tightly coupled with the hardware.

...read moreread less

Abstract: Computer architectures are moving towards an era dominated by many-core machines with dozens or even hundreds of cores on a single chip. This unprecedented level of on-chip parallelism introduces a new dimension to scalability that current database management systems (DBMSs) were not designed for. In particular, as the number of cores increases, the problem of concurrency control becomes extremely challenging. With hundreds of threads running in parallel, the complexity of coordinating competing accesses to data will likely diminish the gains from increased core counts. To better understand just how unprepared current DBMSs are for future CPU architectures, we performed an evaluation of concurrency control for on-line transaction processing (OLTP) workloads on many-core chips. We implemented seven concurrency control algorithms on a main-memory DBMS and using computer simulations scaled our system to 1024 cores. Our analysis shows that all algorithms fail to scale to this magnitude but for different reasons. In each case, we identify fundamental bottlenecks that are independent of the particular database implementation and argue that even state-of-the-art DBMSs suffer from these limitations. We conclude that rather than pursuing incremental solutions, many-core chips may require a completely redesigned DBMS architecture that is built from ground up and is tightly coupled with the hardware.

...read moreread less

156 citations

Journal Article•DOI•

Rethinking serializable multiversion concurrency control

[...]

Jose M. Faleiro¹, Daniel J. Abadi¹•Institutions (1)

Yale University¹

01 Jul 2015

TL;DR: Bohm performs well in both high contention and low contention settings, and is able to dramatically outperform state-of-the-art multi-versioned systems despite maintaining the full set of serializability guarantees.

...read moreread less

Abstract: Multi-versioned database systems have the potential to significantly increase the amount of concurrency in transaction processing because they can avoid read-write conflicts. Unfortunately, the increase in concurrency usually comes at the cost of transaction serializability. If a database user requests full serializability, modern multi-versioned systems significantly constrain read-write concurrency among conflicting transactions and employ expensive synchronization patterns in their design. In main-memory multi-core settings, these additional constraints are so burdensome that multi-versioned systems are often significantly outperformed by single-version systems.We propose Bohm, a new concurrency control protocol for main-memory multi-versioned database systems. Bohm guarantees serializable execution while ensuring that reads never block writes. In addition, Bohm does not require reads to perform any bookkeeping whatsoever, thereby avoiding the overhead of tracking reads via contended writes to shared memory. This leads to excellent scalability and performance in multi-core settings. Bohm has all the above characteristics without performing validation based concurrency control. Instead, it is pessimistic, and is therefore not prone to excessive aborts in the presence of contention. An experimental evaluation shows that Bohm performs well in both high contention and low contention settings, and is able to dramatically outperform state-of-the-art multi-versioned systems despite maintaining the full set of serializability guarantees.

...read moreread less

129 citations

Proceedings Article•DOI•

High-performance ACID via modular concurrency control

[...]

Chao Xie¹, Chunzhi Su¹, Cody Littley¹, Lorenzo Alvisi¹, Manos Kapritsos², Yang Wang³ - Show less +2 more•Institutions (3)

University of Texas at Austin¹, Microsoft², Ohio State University³

04 Oct 2015

TL;DR: This paper describes the design, implementation, and evaluation of Callas, a distributed database system that offers to unmodified, transactional ACID applications the opportunity to achieve a level of performance that can currently only be reached by rewriting all or part of the application in a BASE/NoSQL Style.

...read moreread less

Abstract: This paper describes the design, implementation, and evaluation of Callas, a distributed database system that offers to unmodified, transactional ACID applications the opportunity to achieve a level of performance that can currently only be reached by rewriting all or part of the application in a BASE/NoSQL Style. The key to combining performance and ease of programming is to decouple the ACID abstraction---which Callas offers identically for all transactions---from the mechanism used to support it. MCC, the new Modular approach to Concurrency Control at the core of Callas, makes it possible to partition transactions in groups with the guarantee that, as long as the concurrency control mechanism within each group upholds a given isolation property, that property will also hold among transactions in different groups. Because of their limited and specialized scope, these group-specific mechanisms can be customized for concurrency with unprecedented aggressiveness. In our MySQL Cluster-based prototype, Callas yields an 8.2x throughput gain for TPC-C with no programming effort.

...read moreread less

82 citations

Proceedings Article•DOI•

Squall: Fine-Grained Live Reconfiguration for Partitioned Main Memory Databases

[...]

Aaron J. Elmore¹, Vaibhav Arora², Rebecca Taft³, Andrew Pavlo⁴, Divyakant Agrawal², Amr El Abbadi² - Show less +2 more•Institutions (4)

University of Chicago¹, University of California, Santa Barbara², Massachusetts Institute of Technology³, Carnegie Mellon University⁴

27 May 2015

TL;DR: The Squall technique for supporting live reconfiguration in partitioned, main memory DBMSs supports fine-grained repartitioning of databases in the presence of distributed transactions, high throughput client workloads, and replicated data.

...read moreread less

Abstract: For data-intensive applications with many concurrent users, modern distributed main memory database management systems (DBMS) provide the necessary scale-out support beyond what is possible with single-node systems. These DBMSs are optimized for the short-lived transactions that are common in on-line transaction processing (OLTP) workloads. One way that they achieve this is to partition the database into disjoint subsets and use a single-threaded transaction manager per partition that executes transactions one-at-a-time in serial order. This minimizes the overhead of concurrency control mechanisms, but requires careful partitioning to limit distributed transactions that span multiple partitions. Previous methods used off-line analysis to determine how to partition data, but the dynamic nature of these applications means that they are prone to hotspots. In these situations, the DBMS needs to reconfigure how data is partitioned in real-time to maintain performance objectives. Bringing the system off-line to reorganize the database is unacceptable for on-line applications. To overcome this problem, we introduce the Squall technique for supporting live reconfiguration in partitioned, main memory DBMSs. Squall supports fine-grained repartitioning of databases in the presence of distributed transactions, high throughput client workloads, and replicated data. An evaluation of our approach on a distributed DBMS shows that Squall can reconfigure a database with no downtime and minimal overhead on transaction latency.

...read moreread less

69 citations

Proceedings Article•DOI•

Feral Concurrency Control: An Empirical Investigation of Modern Application Integrity

[...]

Peter Bailis¹, Alan Fekete², Michael J. Franklin¹, Ali Ghodsi¹, Joseph M. Hellerstein¹, Ion Stoica¹ - Show less +2 more•Institutions (2)

University of California, Berkeley¹, University of Sydney²

27 May 2015

TL;DR: This work quantitatively analyzes the use of feral mechanisms for maintaining database integrity in a range of open source applications written using the Ruby on Rails ORM and finds that feral invariants are the most popular means of ensuring integrity.

...read moreread less

Abstract: The rise of data-intensive "Web 2.0" Internet services has led to a range of popular new programming frameworks that collectively embody the latest incarnation of the vision of Object-Relational Mapping (ORM) systems, albeit at unprecedented scale. In this work, we empirically investigate modern ORM-backed applications' use and disuse of database concurrency control mechanisms. Specifically, we focus our study on the common use of feral, or application-level, mechanisms for maintaining database integrity, which, across a range of ORM systems, often take the form of declarative correctness criteria, or invariants. We quantitatively analyze the use of these mechanisms in a range of open source applications written using the Ruby on Rails ORM and find that feral invariants are the most popular means of ensuring integrity (and, by usage, are over 37 times more popular than transactions). We evaluate which of these feral invariants actually ensure integrity (by usage, up to 86.9%) and which---due to concurrency errors and lack of database support---may lead to data corruption (the remainder), which we experimentally quantify. In light of these findings, we present recommendations for database system designers for better supporting these modern ORM programming patterns, thus eliminating their adverse effects on application integrity.

...read moreread less

66 citations

Proceedings Article•DOI•

ConcBugAssist: constraint solving for diagnosis and repair of concurrency bugs

[...]

Sepideh Khoshnood¹, Markus Kusano¹, Chao Wang¹•Institutions (1)

Virginia Tech¹

13 Jul 2015

TL;DR: A logical constraint based symbolic analysis method to aid in the diagnosis of concurrency bugs and to recommend repairs and it is demonstrated its effectiveness in di- agnosing bugs in multithreaded C programs.

...read moreread less

Abstract: Programmers often have to spend a significant amount of time in- specting the software code and execution traces to identify the cause of a bug. For a multithreaded program, debugging is even more challenging due to the subtle interactions between threads and the often astronomical number of interleavings. In this work, we pro- pose a logical constraint based symbolic analysis method to aid in the diagnosis of concurrency bugs and to recommend repairs. Both diagnosis and repair are formulated as constraint solving prob- lems. Our method, by leveraging the power of satisfiability (SAT) solvers and a bounded model checker, performs a semantic analy- sis of the sequential computation as well as thread interactions. The constraint based analysis is designed for handling critical software with small to medium code size, but complex concurrency control, such as device drivers, implementations of synchronization proto- cols, and concurrent data structures. We have implemented our new method in a software tool and demonstrated its effectiveness in di- agnosing bugs in multithreaded C programs.

...read moreread less

65 citations

Proceedings Article•DOI•

On the Design and Scalability of Distributed Shared-Data Databases

[...]

Simon Loesing¹, Markus Pilman¹, Thomas Etter¹, Donald Kossmann²•Institutions (2)

ETH Zurich¹, Microsoft²

27 May 2015

TL;DR: This paper analyzes an alternative architecture design for distributed relational databases that overcomes the limitations of partitioned databases and introduces techniques for scalable transaction processing in shared-data environments.

...read moreread less

Abstract: Database scale-out is commonly implemented by partitioning data across several database instances. This approach, however, has several restrictions. In particular, partitioned databases are inflexible in large-scale deployments and assume a partition-friendly workload in order to scale. In this paper, we analyze an alternative architecture design for distributed relational databases that overcomes the limitations of partitioned databases. The architecture is based on two fundamental principles: We decouple query processing and transaction management from data storage, and we share data across query processing nodes. The combination of these design choices provides scalability, elasticity, and operational flexibility without making any assumptions on the workload. As a drawback, sharing data among multiple database nodes causes synchronization overhead. To address this limitation, we introduce techniques for scalable transaction processing in shared-data environments. Specifically, we describe mechanisms for efficient data access, concurrency control, and data buffering. In combination with new hardware trends, the techniques enable performance characteristics that top state-of-the-art partitioned databases.

...read moreread less

64 citations

Journal Article•DOI•

Towards scalable real-time analytics: an architecture for scale-out of OLxP workloads

[...]

Anil Kumar Goel, Jeffrey Pound, Nathan Auch, Peter Bumbulis, Scott MacLean, Franz Färber, Francis Gropengiesser, Christian Mathis, Thomas Bodner, Wolfgang Lehner¹ - Show less +6 more•Institutions (1)

Dresden University of Technology¹

01 Aug 2015

TL;DR: An overview of the work on the SAP HANA Scale-out Extension, a novel distributed database architecture designed to support large scale analytics over real-time data, and advocate the decoupling of core database components such as query processing, concurrency control, and persistence.

...read moreread less

Abstract: We present an overview of our work on the SAP HANA Scale-out Extension, a novel distributed database architecture designed to support large scale analytics over real-time data. This platform permits high performance OLAP with massive scale-out capabilities, while concurrently allowing OLTP workloads. This dual capability enables analytics over real-time changing data and allows fine grained user-specified service level agreements (SLAs) on data freshness. We advocate the decoupling of core database components such as query processing, concurrency control, and persistence, a design choice made possible by advances in high-throughput low-latency networks and storage devices. We provide full ACID guarantees and build on a logical timestamp mechanism to provide MVCC-based snapshot isolation, while not requiring synchronous updates of replicas. Instead, we use asynchronous update propagation guaranteeing consistency with timestamp validation.We provide a view into the design and development of a large scale data management platform for real-time analytics, driven by the needs of modern enterprise customers.

...read moreread less

Patent•

High performance transactions in database management systems

[...]

David B. Lomet¹, Justin J. Levandoski¹, Sudipta Sengupta¹, Ryan Stutsman¹•Institutions (1)

Microsoft¹

07 Oct 2015

TL;DR: In this article, a multi-version concurrency control (MVCC) module accesses a latch-free hash table that includes respective hash table entries that include buckets of respective bucket items.

...read moreread less

Abstract: A transaction engine includes a multi-version concurrency control (MVCC) module that accesses a latch-free hash table that includes respective hash table entries that include respective buckets of respective bucket items. The bucket items represent respective records, the respective bucket items each including a value indicating a temporal most recent read time of the item and a version list of descriptions that describe respective versions of the respective records, the MVCC module performing timestamp order concurrency control, using the latch-free hash table. Recovery log buffers may be used as cache storage for the transaction engine.

...read moreread less

Journal Article•DOI•

A large-scale study on the usage of Java's concurrent programming constructs

[...]

Gustavo Pinto¹, Weslley Torres¹, Benito Fernandes¹, Fernando Castor¹, Roberto Souto Maior de Barros¹ - Show less +1 more•Institutions (1)

Federal University of Pernambuco¹

01 Aug 2015-Journal of Systems and Software

TL;DR: An analysis of 2227 Java projects, comprising more than 650 million lines of code, finds that more than 75% of the latest versions of the projects either explicitly create threads or employ some concurrency control mechanism.

...read moreread less

Proceedings Article•DOI•

ShardFS vs. IndexFS: replication vs. caching strategies for distributed metadata management in cloud storage systems

[...]

Lin Xiao¹, Kai Ren¹, Qing Zheng¹, Garth A. Gibson¹•Institutions (1)

Carnegie Mellon University¹

27 Aug 2015

TL;DR: This paper explores explicit replication of directory lookup state in all servers as an alternative to caching this information in all clients and suggests that server replication can scale more linearly than client caching and provide lower 70 percentile response times as well.

...read moreread less

Abstract: The rapid growth of cloud storage systems calls for fast and scalable namespace processing. While few commercial file systems offer anything better than federating individually non-scalable namespace servers, a recent academic file system, IndexFS, demonstrates scalable namespace processing based on client caching of directory entries and permissions (directory lookup state) with no per-client state in servers. In this paper we explore explicit replication of directory lookup state in all servers as an alternative to caching this information in all clients. Both eliminate most repeated RPCs to different servers in order to resolve hierarchical permission tests. Our realization for server replicated directory lookup state, ShardFS, employs a novel file system specific hybrid optimistic and pessimistic concurrency control favoring single object transactions over distributed transactions. Our experimentation suggests that if directory lookup state mutation is a fixed fraction of operations (strong scaling for metadata), server replication does not scale as well as client caching, but if directory lookup state mutation is proportional to the number of jobs, not the number of processes per job, (weak scaling for metadata), then server replication can scale more linearly than client caching and provide lower 70 percentile response times as well.

...read moreread less

Journal Article•DOI•

Object-sensitive cost analysis for concurrent objects

[...]

Elvira Albert¹, Puri Arenas¹, Jesús Correas¹, Samir Genaim¹, Miguel Gómez-Zamalloa¹, Germán Puebla², Guillermo Román-Díez² - Show less +3 more•Institutions (2)

Complutense University of Madrid¹, Technical University of Madrid²

01 May 2015-Software Testing, Verification & Reliability

TL;DR: This article presents a novel cost analysis framework for concurrent objects by means of a novel form of object‐sensitive recurrence equations that use cost centres in order to keep the resource usage assigned to the different components separate.

...read moreread less

Abstract: This article presents a novel cost analysis framework for concurrent objects. Concurrent objects form a well-established model for distributed concurrent systems. In this model, objects are the concurrency units that communicate among them via asynchronous method calls. Cost analysis aims at automatically approximating the resource consumption of executing a program in terms of its input parameters. While cost analysis for sequential programming languages has received considerable attention, concurrency and distribution have been notably less studied. The main challenges of cost analysis in a concurrent setting are as follows. First, inferring precise size abstractions for data in the program in the presence of shared memory. This information is essential for bounding the number of iterations of loops. Second, distribution suggests that analysis must infer the cost of the diverse distributed components separately. We handle this by means of a novel form of object-sensitive recurrence equations that use cost centres in order to keep the resource usage assigned to the different components separate. We have implemented our analysis and evaluated it on several small applications that are classical examples of concurrent and distributed programming. Copyright © 2015John Wiley & Sons, Ltd.

...read moreread less

Proceedings Article•DOI•

Hybrid Monitors for Concurrent Noninterference

[...]

Aslan Askarov¹, Stephen Chong², Heiko Mantel³•Institutions (3)

Aarhus University¹, Harvard University², Technische Universität Darmstadt³

13 Jul 2015

TL;DR: It is proved that the hybrid monitoring approach enforces a knowledge-based progress-sensitive non-interference security condition, and rely-guarantee-style reasoning is guaranteed by all monitors cooperatively.

...read moreread less

Abstract: Controlling confidential information in concurrent systems is difficult, due to covert channels resulting from interaction between threads. This problem is exacerbated if threads share resources at fine granularity. In this work, we propose a novel monitoring framework to enforce strong information security in concurrent programs. Our monitors are hybrid, combining dynamic and static program analysis to enforce security in a sound and rather precise fashion. In our framework, each thread is guarded by its own local monitor, and there is a single global monitor. We instantiate our monitoring framework to support rely-guarantee style reasoning about the use of shared resources, at the granularity of individual memory locations, and then specialize local monitors further to enforce flow-sensitive progress-sensitive information-flow control. Our local monitors exploit rely-guarantee-style reasoning about shared memory to achieve high precision. Soundness of rely-guarantee-style reasoning is guaranteed by all monitors cooperatively. The global monitor is invoked only when threads synchronize, and so does not needlessly restrict concurrency. We prove that our hybrid monitoring approach enforces a knowledge-based progress-sensitive non-interference security condition.

...read moreread less

Journal Article•DOI•

Weaver: A High-Performance, Transactional Graph Database Based on Refinable Timestamps

[...]

Ayush Dubey¹, Greg D. Hill², Robert Escriva¹, Emin Gün Sirer¹•Institutions (2)

Cornell University¹, Stanford University²

28 Sep 2015-arXiv: Distributed, Parallel, and Cluster Computing

TL;DR: Weaver as discussed by the authors is a distributed graph database that enables efficient, transactional graph analyses as well as strictly serializable ACID transactions on dynamic graphs, and it combines strict serializability with horizontal scalability.

...read moreread less

Abstract: Graph databases have become an increasingly common infrastructure component. Yet existing systems either operate on offline snapshots, provide weak consistency guarantees, or use expensive concurrency control techniques that limit performance. In this paper, we introduce a new distributed graph database, called Weaver, which enables efficient, transactional graph analyses as well as strictly serializable ACID transactions on dynamic graphs. The key insight that allows Weaver to combine strict serializability with horizontal scalability and high performance is a novel request ordering mechanism called refinable timestamps. This technique couples coarse-grained vector timestamps with a fine-grained timeline oracle to pay the overhead of strong consistency only when needed. Experiments show that Weaver enables a Bitcoin blockchain explorer that is 8x faster than Blockchain.info, and achieves 12x higher throughput than the Titan graph database on social network workloads and 4x lower latency than GraphLab on offline graph traversal workloads.

...read moreread less

Proceedings Article•DOI•

Capsule-oriented programming

[...]

Hridesh Rajan¹•Institutions (1)

Iowa State University¹

16 May 2015

TL;DR: In a recently sponsored NSF project the authors are developing a model that they call capsule-oriented programming and its realization in the Panini project, which favors modularity over explicit concurrency, encourages concurrency correctness by construction, and exploits modular structure of programs to expose implicit concurrency.

...read moreread less

Abstract: "Explicit concurrency should be abolished from all higher-level programming languages (i.e. everything except - perhaps- plain machine code.)." Dijkstra [1] (paraphrased). A promising class of concurrency abstractions replaces explicit concurrency mechanisms with a single linguistic mechanism that combines state and control and uses asynchronous messages for communications, e.g. active objects or actors, but that doesn't remove the hurdle of understanding non-local control transfer. What if the programming model enabled programmers to simply do what they do best, that is, to describe a system in terms of its modular structure and write sequential code to implement the operations of those modules and handles details of concurrency? In a recently sponsored NSF project we are developing such a model that we call capsule-oriented programming and its realization in the Panini project. This model favors modularity over explicit concurrency, encourages concurrency correctness by construction, and exploits modular structure of programs to expose implicit concurrency.

...read moreread less

Journal Article•DOI•

Multi-version range concurrency control in Deuteronomy

[...]

Justin J. Levandoski¹, David B. Lomet¹, Sudipta Sengupta¹, Ryan Stutsman¹, Rui Wang¹ - Show less +1 more•Institutions (1)

Microsoft¹

01 Sep 2015

TL;DR: With this range enhancement, Deuteronomy can reach scan speeds of nearly 250 million records/s (more than 27 GB/s) on modern hardware, while providing serializable isolation complete with phantom prevention.

...read moreread less

Abstract: The Deuteronomy transactional key value store executes millions of serializable transactions/second by exploiting multi-version timestamp order concurrency control. However, it has not supported range operations, only individual record operations (e.g., create, read, update, delete). In this paper, we enhance our multi-version timestamp order technique to handle range concurrency and prevent phantoms. Importantly, we maintain high performance while respecting the clean separation of duties required by Deuteronomy, where a transaction component performs purely logical concurrency control (including range support), while a data component performs data storage and management duties. Like the rest of the Deuteronomy stack, our range technique manages concurrency information in a latch-free manner. With our range enhancement, Deuteronomy can reach scan speeds of nearly 250 million records/s (more than 27 GB/s) on modern hardware, while providing serializable isolation complete with phantom prevention.

...read moreread less

Proceedings Article•DOI•

Madeus: Database Live Migration Middleware under Heavy Workloads for Cloud Environment

[...]

Takeshi Mishima¹, Yasuhiro Fujiwara¹•Institutions (1)

Nippon Telegraph and Telephone¹

27 May 2015

TL;DR: Madeus provides efficient database live migration by implementing the lazy snapshot isolation rule (LSIR) under snapshot isolation that enables concurrently propagating syncsets, which are the datasets needed to synchronize slave with master databases.

...read moreread less

Abstract: Database-as-a-service has been gaining popularity in cloud computing because multitenant databases can reduce costs by sharing off-the-shelf resources. However, due to heavy workloads, resource sharing often causes a hot spot; one node is overloaded even while others are not. Unfortunately, a hot spot can lead to violation of service level agreements and destroy customer satisfaction. To efficiently address the hot spot problem, we propose a middleware approach called Madeus that conducts database live migration. To make efficient database live migration possible, we also introduce the lazy snapshot isolation rule (LSIR) that enables concurrently propagating syncsets, which are the datasets needed to synchronize slave with master databases. Madeus provides efficient database live migration by implementing the LSIR under snapshot isolation. Unlike current approaches, Madeus is pure middleware that is transparent to the database management system and based on commodity hardware and software. To demonstrate the superiority of our approach over current approaches, we experimentally evaluated Madeus by using PostgreSQL with the TPC-W benchmark. The results indicate that Madeus achieves more efficient live migration than three other types of middleware approaches, especially under heavy workloads; therefore, it can effectively resolve hot spots.

...read moreread less

Patent•

Efficient Methods and Systems for Consistent Read in Record-Based Multi-Version Concurrency Control

[...]

Guogen Zhang

16 Mar 2015

TL;DR: In this paper, a method for consistent read in a record-based multi-version concurrent control (MVCC) database (DB) management system for snapshot consistent read is presented.

...read moreread less

Abstract: System and method embodiments are provided for consistent read in a record-based multi-version concurrency control (MVCC) in database (DB) management systems. In an embodiment, a method in a record-based multi-version concurrent control (MVCC) database (DB) management system for a snapshot consistent read includes copying a system commit transaction identifier (TxID) and a current log record sequence number (LSN) from a transaction log at a start of a reader without backfilling of a commit LSN of a transaction to records that are changed and without copying an entire transaction table by the reader; and determining whether a record is visible according to a record TxID, the commit TxID and a current LSN, wherein a transaction table is consulted only when the record TxID is equal to or larger than a commit TxID at a transaction start.

...read moreread less

Proceedings Article•DOI•

Automatic scalable atomicity via semantic locking

[...]

Guy Golan-Gueta¹, Ganesan Ramalingam², Mooly Sagiv³, Eran Yahav⁴•Institutions (4)

Yahoo!¹, Microsoft², Tel Aviv University³, Technion – Israel Institute of Technology⁴

24 Jan 2015

TL;DR: This paper presents a synthesis algorithm that automatically enforces atomicity of given code fragments (in a client program) by inserting pessimistic synchronization that guarantees atomicity and deadlock-freedom (without using any rollback mechanism).

...read moreread less

Abstract: In this paper, we consider concurrent programs in which the shared state consists of instances of linearizable ADTs (abstract data types). We present an automated approach to concurrency control that addresses a common need: the need to atomically execute a code fragment, which may contain multiple ADT operations on multiple ADT instances. We present a synthesis algorithm that automatically enforces atomicity of given code fragments (in a client program) by inserting pessimistic synchronization that guarantees atomicity and deadlock-freedom (without using any rollback mechanism). Our algorithm takes a commutativity specification as an extra input. This specification indicates for every pair of ADT operations the conditions under which the operations commute. Our algorithm enables greater parallelism by permitting commuting operations to execute concurrently. We have implemented the synthesis algorithm in a Java compiler, and applied it to several Java programs. Our results show that our approach produces efficient and scalable synchronization.

...read moreread less

Journal Article•DOI•

VLL: a lock manager redesign for main memory database systems

[...]

Kun Ren¹, Alexander Thomson², Daniel J. Abadi¹•Institutions (2)

Yale University¹, Google²

01 Oct 2015

TL;DR: This paper introduces very lightweight locking (VLL), an alternative approach to pessimistic concurrency control for main memory database systems, which avoids almost all overhead associated with traditional lock manager operations and proposes a protocol called selective contention analysis (SCA), which enables systems implementing VLL to achieve high transactional throughput under high-contention workloads.

...read moreread less

Abstract: Lock managers are increasingly becoming a bottleneck in database systems that use pessimistic concurrency control. In this paper, we introduce very lightweight locking (VLL), an alternative approach to pessimistic concurrency control for main memory database systems, which avoids almost all overhead associated with traditional lock manager operations. We also propose a protocol called selective contention analysis (SCA), which enables systems implementing VLL to achieve high transactional throughput under high-contention workloads. We implement these protocols both in a traditional single-machine multi-core database server setting and in a distributed database where data are partitioned across many commodity machines in a shared-nothing cluster. Furthermore, we show how VLL and SCA can be extended to enable range locking. Our experiments show that VLL dramatically reduces locking overhead and thereby increases transactional throughput in both settings.

...read moreread less

Proceedings Article•DOI•

The Serial Safety Net: Efficient Concurrency Control on Modern Hardware

[...]

Tianzheng Wang¹, Ryan Johnson¹, Alan Fekete², Ippokratis Pandis³•Institutions (3)

University of Toronto¹, University of Sydney², Cloudera³

31 May 2015

TL;DR: The serial safety net (SSN) is proposed, a serializability-enforcing certifier for modern hardware with substantial core count and large main memory that can be applied with minimal overhead on top of various CC schemes that offer higher performance but admit anomalies.

...read moreread less

Abstract: Concurrency control (CC) algorithms must trade off strictness for performance, with serializable schemes generally paying high cost---both in runtime overhead such as contention on lock tables, and in wasted efforts by aborting transactions---to prevent anomalies We propose the serial safety net (SSN), a serializability-enforcing certifier for modern hardware with substantial core count and large main memory SSN can be applied with minimal overhead on top of various CC schemes that offer higher performance but admit anomalies, eg, snapshot isolation and read committed We demonstrate the efficiency, accuracy and robustness of SSN using a memory-optimized OLTP engine with different CC schemes We find that SSN is a promising approach to serializability with low abort rates and robust performance for various workloads

...read moreread less

Proceedings Article•DOI•

Low-overhead software transactional memory with progress guarantees and strong semantics

[...]

Minjia Zhang¹, Jipeng Huang¹, Man Cao¹, Michael D. Bond¹•Institutions (1)

Ohio State University¹

24 Jan 2015

TL;DR: An adaptive version of LarkTM is designed that uses alternative concurrency control for high-contention objects, and not only provides low single-thread overhead, but their multithreaded performance compares favorably with existing high-performance STMs.

...read moreread less

Abstract: Software transactional memory offers an appealing alternative to locks by improving programmability, reliability, and scalability However, existing STMs are impractical because they add high instrumentation costs and often provide weak progress guarantees and/or semantics This paper introduces a novel STM called LarkTM that provides three significant features (1) Its instrumentation adds low overhead except when accesses actually conflict, enabling low single-thread overhead and scaling well on low-contention workloads (2) It uses eager concurrency control mechanisms, yet naturally supports flexible conflict resolution, enabling strong progress guarantees (3) It naturally provides strong atomicity semantics at low cost LarkTM's design works well for low-contention workloads, but adds significant overhead under higher contention, so we design an adaptive version of LarkTM that uses alternative concurrency control for high-contention objects An implementation and evaluation in a Java virtual machine show that the basic and adaptive versions of LarkTM not only provide low single-thread overhead, but their multithreaded performance compares favorably with existing high-performance STMs

...read moreread less

Patent•

Techniques for analytics-driven hybrid concurrency control in clouds

[...]

Eugen Feller¹, Julien Forgeat¹, Muhammad Mudassar Nazar¹•Institutions (1)

Ericsson¹

23 Nov 2015

TL;DR: In this paper, a hybrid resource allocation module that can concurrently utilize an optimistic allocation scheme alongside a pessimistic allocation scheme is presented. But the authors focus on the use of machine learning techniques to train a cluster model that is integrated by a hybrid resources allocation module to classify applications in either a pessimistic cluster or an optimistic cluster.

...read moreread less

Abstract: Exemplary techniques for analytics-driven hybrid concurrency control in clouds are disclosed that include a hybrid resource allocation module that can concurrently utilize an optimistic allocation scheme alongside a pessimistic allocation scheme. Machine learning techniques utilizing previous activity history of applications can be used to train a cluster model that is integrated by a hybrid resource allocation module to classify applications in either a pessimistic cluster or an optimistic cluster that identifies under which scheme requests from the applications will be processed.

...read moreread less

Journal Article•DOI•

Controlled Owicki-Gries Concurrency: Reasoning about the Preemptible eChronos Embedded Operating System

[...]

June Andronick¹, Corey Lewis¹, Carroll Morgan¹•Institutions (1)

NICTA¹

13 Nov 2015-arXiv: Logic in Computer Science

TL;DR: A controlled concurrency framework, derived from the Owicki-Gries method, is introduced for describing a hardware interface in detail sufficient to suppor t the modelling and verification of small, embedded operating systems (OS’s) whose run-time responsiveness is paramount.

...read moreread less

Abstract: We introduce a controlled concurrency framework, derived from the Owicki-Gries method, for describing a hardware interface in detail sufficient to support the modelling and verification of small, embedded operating systems (OS's) whose run-time responsiveness is paramount. Such real-time systems run with interrupts mostly enabled, including during scheduling. That differs from many other successfully modelled and verified OS's that typically reduce the complexity of concurrency by running on uniprocessor platforms and by switching interrupts off as much as possible. Our framework builds on the traditional Owicki-Gries method, for its fine-grained concurrency is needed for high-performance system code. We adapt it to support explicit concurrency control, by providing a simple, faithful representation of the hardware interface that allows software to control the degree of interleaving between user code, OS code, interrupt handlers and a scheduler that controls context switching. We then apply this framework to model the interleaving behavior of the eChronos OS, a preemptible real-time OS for embedded micro-controllers. We discuss the accuracy and usability of our approach when instantiated to model the eChronos OS. Both our framework and the eChronos model are formalised in the Isabelle/HOL theorem prover, taking advantage of the high level of automation in modern reasoning tools.

...read moreread less

Proceedings Article•DOI•

Experiments with Hardware-based Transactional Memory in Parallel Simulation

[...]

Joshua Hay¹, Philip A. Wilsey²•Institutions (2)

Intel¹, University of Cincinnati²

10 Jun 2015

TL;DR: Evaluation of both forms of transactional memory found in the Intel Haswell processor, Hardware Lock Elision (HLE) and Restricted Transactional Memory (RTM), are evaluated and show that RTM generally outperforms conventional locking mechanisms and that HLE provides consistently better performance than conventional locking mechanism.

...read moreread less

Abstract: Transactional memory is a concurrency control mechanism that dynamically determines when threads may safely execute critical sections of code. It provides the performance of fine-grained locking mechanisms with the simplicity of coarse-grained locking mechanisms. With hardware based transactions, the protection of shared data accesses and updates can be evaluated at runtime so that only true collisions to shared data force serialization. This paper explores the use of transactional memory as an alternative to conventional synchronization mechanisms for managing the pending event set in a Time Warp synchronized parallel simulator. In particular, we explore the application of Intel's hardware-based transactional memory (TSX) to manage shared access to the pending event set by the simulation threads. Comparison between conventional locking mechanisms and transactional memory access is performed to evaluate each within the warped Time Warp synchronized parallel simulation kernel. In this testing, evaluation of both forms of transactional memory found in the Intel Haswell processor, Hardware Lock Elision (HLE) and Restricted Transactional Memory (RTM), are evaluated. The results show that RTM generally outperforms conventional locking mechanisms and that HLE provides consistently better performance than conventional locking mechanisms, in some cases as much as 27%.

...read moreread less

Book Chapter•DOI•

PR-STM: Priority Rule Based Software Transactions for the GPU

[...]

Qi Shen¹, Craig Sharp¹, William Blewitt¹, Gary Ushaw¹, Graham Morgan¹ - Show less +1 more•Institutions (1)

Newcastle University¹

24 Aug 2015

TL;DR: This paper is the first to provide results comparing GPU based STMs with a CPU based STM, and demonstrates competitive performance results in comparison with existing STMs for both the GPU and CPU.

...read moreread less

Abstract: In this paper we describe an implementation of a software transactional memory library for the GPU written in CUDA. We describe the implementation of our transaction mechanism which features both tentative and regular locking along with a contention management policy based on a simple, yet effective, static priority rule called Priority Rule Software Transactional Memory (PR-STM). We demonstrate competitive performance results in comparison with existing STMs for both the GPU and CPU. While GPU comparisons have been studied, to the best of our knowledge we are the first to provide results comparing GPU based STMs with a CPU based STM.

...read moreread less

Proceedings Article•DOI•

Composing concurrency control

[...]

Ofri Ziv¹, Alex Aiken², Guy Golan-Gueta¹, Ganesan Ramalingam³, Mooly Sagiv¹ - Show less +1 more•Institutions (3)

Tel Aviv University¹, Stanford University², Microsoft³

03 Jun 2015

TL;DR: This work formalizes the usually desired requirements of concurrency control as well as stronger versions of these properties that enable composition and shows how to compose protocols satisfying these properties so that the resulting combined protocol also satisfies these properties.

...read moreread less

Abstract: Concurrency control poses significant challenges when composing computations over multiple data-structures (objects) with different concurrency-control implementations. We formalize the usually desired requirements (serializability, abort-safety, deadlock-safety, and opacity) as well as stronger versions of these properties that enable composition. We show how to compose protocols satisfying these properties so that the resulting combined protocol also satisfies these properties. Our approach generalizes well-known protocols (such as two-phase-locking and two-phase-commit) and leads to new protocols. We apply this theory to show how we can safely compose optimistic and pessimistic concurrency control. For example, we show how we can execute a transaction that accesses two objects, one controlled by an STM and another by locking.

...read moreread less