Journal ArticleDOI
F1: a distributed SQL database that scales
Jeff Shute,Radek Vingralek,Bart Samwel,Ben Handy,Chad Whipkey,Eric Rollins,Mircea Oancea,Kyle Littlefield,David Menestrina,Stephan Ellner,John Cieslewicz,Ian Rae,Traian Stancescu,Himani Apte +13 more
- Vol. 6, Iss: 11, pp 1068-1079
TLDR
F1 is a distributed relational database system built at Google to support the AdWords business that combines high availability, the scalability of NoSQL systems like Bigtable and the consistency and usability of traditional SQL databases.Abstract:
F1 is a distributed relational database system built at Google to support the AdWords business. F1 is a hybrid database that combines high availability, the scalability of NoSQL systems like Bigtable, and the consistency and usability of traditional SQL databases. F1 is built on Spanner, which provides synchronous cross-datacenter replication and strong consistency. Synchronous replication implies higher commit latency, but we mitigate that latency by using a hierarchical schema model with structured data types and through smart application design. F1 also includes a fully functional distributed SQL query engine and automatic change tracking and publishing.read more
Citations
More filters
Journal ArticleDOI
Coordination avoidance in database systems
TL;DR: A formal framework is developed that determines whether an application requires coordination for correct execution by operating on application-level invariants over database states and shows that many are invariant confluent and therefore achievable without coordination.
Proceedings ArticleDOI
Scalable atomic visibility with RAMP transactions
TL;DR: This work presents algorithms for Read Atomic Multipartition (RAMP) transactions that enforce atomic visibility while offering excellent scalability, guaranteed commit despite partial failures, and minimized communication between servers (via partition independence).
Journal ArticleDOI
Size Bounds for Factorised Representations of Query Results
Dan Olteanu,Jakub Závodný +1 more
TL;DR: Two succinct representation systems for relational data based on relational algebra expressions with unions, Cartesian products, and singleton relations are studied: f-representations, which employ algebraic factorisation using distributivity of product over union, and d-representation, which are f- Representations where further succinctness is brought by explicit sharing of repeated subexpressions.
Proceedings ArticleDOI
CockroachDB: The Resilient Geo-Distributed SQL Database
Rebecca Taft,Irfan Sharif,Andrei Matei,Nathan VanBenschoten,Jordan Lewis,Tobias Grieger,Kai Niemi,Andy Woods,Anne Birzin,Raphael Poss,Paul Bardea,Amruta Ranade,Ben Darnell,Bram Gruneir,Justin Jaffray,Lucy Zhang,Peter Mattis +16 more
TL;DR: The design of CockroachDB and its novel transaction model that supports consistent geo-distributed transactions on commodity hardware is presented and its distributed SQL layer automatically scales with the size of the database cluster while providing the standard SQL interface that users expect.
Journal ArticleDOI
What's Really New with NewSQL?
Andrew Pavlo,Matthew Aslett +1 more
TL;DR: The history of databases is discussed to understand how NewSQL systems came about and a detailed explanation of what the term NewSQL means and the different categories of systems that fall under this definition is provided.
References
More filters
Journal ArticleDOI
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
Journal ArticleDOI
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.
Proceedings Article
Bigtable: A Distributed Storage System for Structured Data (Awarded Best Paper!).
Fay W. Chang,Jeffrey Dean,Sanjay Ghemawat,Wilson C. Hsieh,Deborah A. Wallach,Michael Burrows,Tushar Deepak Chandra,Andrew Fikes,Robert Gruber +8 more
TL;DR: Bigtable as mentioned in this paper is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers, including web indexing, Google Earth and Google Finance.
Proceedings ArticleDOI
Dynamo: amazon's highly available key-value store
Giuseppe deCandia,Deniz Hastorun,Madan Mohan Rao Jampani,Gunavardhan Kakulapati,Avinash Lakshman,Alex Pilchin,Swaminathan Sivasubramanian,Peter Sven Vosshall,Werner Vogels +8 more
TL;DR: D Dynamo is presented, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience and makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.
Journal ArticleDOI
Bigtable: A Distributed Storage System for Structured Data
Fay W. Chang,Jeffrey Dean,Sanjay Ghemawat,Wilson C. Hsieh,Deborah A. Wallach,Michael Burrows,Tushar Deepak Chandra,Andrew Fikes,Robert E. Gruber +8 more
TL;DR: The simple data model provided by Bigtable is described, which gives clients dynamic control over data layout and format, and the design and implementation of Bigtable are described.
Related Papers (5)
Spanner: Google's globally-distributed database
James C. Corbett,Jeffrey Dean,Michael James Boyer Epstein,Andrew Fikes,Christopher Frost,J. J. Furman,Sanjay Ghemawat,Andrey Gubarev,Christopher Heiser,Peter Hochschild,Wilson C. Hsieh,Sebastian Kanthak,Eugene Kogan,Hongyi Li,Alexander Lloyd,Sergey Melnik,David Mwaura,David Nagle,Sean Quinlan,Rajesh Rao,Lindsay Rolig,Yasushi Saito,Michal Piotr Szymaniak,Chris Jorgen Taylor,Ruth Wang,Dale Woodford +25 more