Proceedings ArticleDOI
Dynamo: amazon's highly available key-value store
Giuseppe deCandia,Deniz Hastorun,Madan Mohan Rao Jampani,Gunavardhan Kakulapati,Avinash Lakshman,Alex Pilchin,Swaminathan Sivasubramanian,Peter Sven Vosshall,Werner Vogels +8 more
- Vol. 41, Iss: 6, pp 205-220
TLDR
D Dynamo is presented, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience and makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.Abstract:
Reliability at massive scale is one of the biggest challenges we face at Amazon.com, one of the largest e-commerce operations in the world; even the slightest outage has significant financial consequences and impacts customer trust. The Amazon.com platform, which provides services for many web sites worldwide, is implemented on top of an infrastructure of tens of thousands of servers and network components located in many datacenters around the world. At this scale, small and large components fail continuously and the way persistent state is managed in the face of these failures drives the reliability and scalability of the software systems.This paper presents the design and implementation of Dynamo, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience. To achieve this level of availability, Dynamo sacrifices consistency under certain failure scenarios. It makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.read more
Citations
More filters
Department of Computer Science and Engineering
TL;DR: In this article, the authors present a survey of postgraduate students: Vladimír Arnot, Daniel Čapek, Rudolf Čejka, Dao Minh, Tomá Dulík, Martin Hrubý, Radek Kočí, Petr Kotásek, Marek Křejpský and Bohuslav KŘena, Vladislav Kubíček.
Proceedings ArticleDOI
High performance database logging using storage class memory
TL;DR: The detailed design of an SCM-based approach for DBMSs logging is presented, which achieves high performance by simplified system design and better concurrency support and solutions to tackle several major issues arising during system recovery, including hole detection, partial write detection, and any-point failure recovery are discussed.
Proceedings ArticleDOI
DARE: High-Performance State Machine Replication on RDMA Networks
Marius Poke,Torsten Hoefler +1 more
TL;DR: A new set of protocols based on Remote Direct Memory Access (RDMA) primitives, using a strongly consistent key-value store, are proposed that enable operators to fully utilize the new capabilities of the quickly growing number of RDMA-capable datacenter networks.
Proceedings ArticleDOI
Adapting microsoft SQL server for cloud computing
Philip A. Bernstein,Istvan Cseri,Nishant V. Dani,Nigel R. Ellis,Ajay Kalhan,Gopal Kakivaya,David B. Lomet,Ramesh Manne,Lev Novik,Tomas Talius +9 more
TL;DR: Cloud SQL Server is a relational database system designed to scale-out to cloud computing workloads and currently serves as the storage engine for Microsoft's Exchange Hosted Archive and SQL Azure.
Proceedings ArticleDOI
CockroachDB: The Resilient Geo-Distributed SQL Database
Rebecca Taft,Irfan Sharif,Andrei Matei,Nathan VanBenschoten,Jordan Lewis,Tobias Grieger,Kai Niemi,Andy Woods,Anne Birzin,Raphael Poss,Paul Bardea,Amruta Ranade,Ben Darnell,Bram Gruneir,Justin Jaffray,Lucy Zhang,Peter Mattis +16 more
TL;DR: The design of CockroachDB and its novel transaction model that supports consistent geo-distributed transactions on commodity hardware is presented and its distributed SQL layer automatically scales with the size of the database cluster while providing the standard SQL interface that users expect.
References
More filters
Proceedings ArticleDOI
Chord: A scalable peer-to-peer lookup service for internet applications
TL;DR: Results from theoretical analysis, simulations, and experiments show that Chord is scalable, with communication cost and the state maintained by each node scaling logarithmically with the number of Chord nodes.
Book ChapterDOI
Time, clocks, and the ordering of events in a distributed system
TL;DR: In this paper, the concept of one event happening before another in a distributed system is examined, and a distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events.
Book ChapterDOI
Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems
Antony Rowstron,Peter Druschel +1 more
TL;DR: Pastry as mentioned in this paper is a scalable, distributed object location and routing substrate for wide-area peer-to-peer ap- plications, which performs application-level routing and object location in a po- tentially very large overlay network of nodes connected via the Internet.
Journal ArticleDOI
Time, clocks, and the ordering of events in a distributed system
TL;DR: In this article, the concept of one event happening before another in a distributed system is examined, and a distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events.
Journal ArticleDOI
The Google file system
TL;DR: This paper presents file system interface extensions designed to support distributed applications, discusses many aspects of the design, and reports measurements from both micro-benchmarks and real world use.