Spanner: Google’s Globally Distributed Database
James C. Corbett, Jeffrey Dean, Michael James Boyer Epstein, Andrew Fikes, Christopher Frost, J. J. Furman, Sanjay Ghemawat, Andrey Gubarev, Christopher Heiser, Peter Hochschild, Wilson C. Hsieh, Sebastian Kanthak, Eugene Kogan, Hongyi Li, Alexander Lloyd, Sergey Melnik, David Mwaura, David Nagle, Sean Quinlan, Rajesh Rao, Lindsay Rolig, Yasushi Saito, Michal Piotr Szymaniak, Chris Jorgen Taylor, Ruth Wang, Dale Woodford +25 more
TLDR
Spanner is Google's scalable, multiversion, globally distributed, and synchronously replicated database, and the first system to distribute data at global scale and support externally-consistent distributed transactions.
Abstract:
Spanner is Google’s scalable, multiversion, globally distributed, and synchronously replicated database. It is the first system to distribute data at global scale and support externally-consistent distributed transactions. This article describes how Spanner is structured, its feature set, the rationale underlying various design decisions, and a novel time API that exposes clock uncertainty. This API and its implementation are critical to supporting external consistency and a variety of powerful features: nonblocking reads in the past, lock-free snapshot transactions, and atomic schema changes, across all of Spanner.
Citations
Proceedings Article
Speeding up Consensus by Chasing Fast Decisions
TL;DR: CAESAR is a multi-leader generalized consensus protocol for geographically replicated sites that does not reject a fast decision for a client request when a quorum of nodes replies with different dependency sets for that request.
Patent
Background format optimization for enhanced SQL-like queries in Hadoop
TL;DR: This patent describes a format conversion engine for Apache Hadoop that converts data from its original format to a database-like format at certain time points for use by a low-latency (LL) query engine.
Proceedings Article
Dynamic Scalable State Machine Replication
TL;DR: Dynamic S-SMR (DS-SMR) repartitions the state dynamically, based on the workload, which significantly improves scalability.
Proceedings Article
On Sharding Permissioned Blockchains
TL;DR: This paper introduces a model that leverages transaction parallelism by partitioning the nodes into clusters (partitions) and processing independent transactions on different partitions simultaneously; the model covers both intra-shard and cross-shard transactions.
Proceedings Article
Deferred lightweight indexing for log-structured key-value stores
TL;DR: DELI, a DEferred Lightweight Indexing scheme for log-structured key-value stores, optimizes the performance of index garbage collection by tightly coupling its execution with a native routine process called compaction.
References
Journal Article
The Google file system
TL;DR: This paper presents file system interface extensions designed to support distributed applications, discusses many aspects of the design, and reports measurements from both micro-benchmarks and real world use.
Journal Article
Linearizability: a correctness condition for concurrent objects
TL;DR: This paper defines linearizability, compares it to other correctness conditions, presents and demonstrates a method for proving the correctness of implementations, and shows how to reason about concurrent objects, given that they are linearizable.
Journal Article
Bigtable: A Distributed Storage System for Structured Data
Fay W. Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Michael Burrows, Tushar Deepak Chandra, Andrew Fikes, Robert E. Gruber +8 more
TL;DR: This paper describes the simple data model provided by Bigtable, which gives clients dynamic control over data layout and format, and the design and implementation of Bigtable.
Journal Article
The part-time parliament
TL;DR: The Paxos parliament's protocol provides a new way of implementing the state machine approach to the design of distributed systems.
Journal Article
MapReduce: a flexible data processing tool
Jeffrey Dean, Sanjay Ghemawat +1 more
TL;DR: MapReduce advantages over parallel databases include storage-system independence and fine-grain fault tolerance for large jobs.