Dynamo: amazon's highly available key-value store

doi:10.1145/1294261.1294281

Proceedings ArticleDOI

Dynamo: amazon's highly available key-value store

- Vol. 41, Iss: 6, pp 205-220

TLDR

D Dynamo is presented, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience and makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.

Abstract:

Reliability at massive scale is one of the biggest challenges we face at Amazon.com, one of the largest e-commerce operations in the world; even the slightest outage has significant financial consequences and impacts customer trust. The Amazon.com platform, which provides services for many web sites worldwide, is implemented on top of an infrastructure of tens of thousands of servers and network components located in many datacenters around the world. At this scale, small and large components fail continuously and the way persistent state is managed in the face of these failures drives the reliability and scalability of the software systems.This paper presents the design and implementation of Dynamo, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience. To achieve this level of availability, Dynamo sacrifices consistency under certain failure scenarios. It makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.

Citations

PDF

Open Access

More filters

Department of Computer Science and Engineering

Mehmet Gonullu

TL;DR: In this article, the authors present a survey of postgraduate students: Vladimír Arnot, Daniel Čapek, Rudolf Čejka, Dao Minh, Tomá Dulík, Martin Hrubý, Radek Kočí, Petr Kotásek, Marek Křejpský and Bohuslav KŘena, Vladislav Kubíček.

...read moreread less

Proceedings ArticleDOI

High performance database logging using storage class memory

Ru Fang, +4 more

TL;DR: The detailed design of an SCM-based approach for DBMSs logging is presented, which achieves high performance by simplified system design and better concurrency support and solutions to tackle several major issues arising during system recovery, including hole detection, partial write detection, and any-point failure recovery are discussed.

...read moreread less

Proceedings ArticleDOI

DARE: High-Performance State Machine Replication on RDMA Networks

Marius Poke, +1 more

TL;DR: A new set of protocols based on Remote Direct Memory Access (RDMA) primitives, using a strongly consistent key-value store, are proposed that enable operators to fully utilize the new capabilities of the quickly growing number of RDMA-capable datacenter networks.

...read moreread less

Proceedings ArticleDOI

Adapting microsoft SQL server for cloud computing

Philip A. Bernstein, +9 more

TL;DR: Cloud SQL Server is a relational database system designed to scale-out to cloud computing workloads and currently serves as the storage engine for Microsoft's Exchange Hosted Archive and SQL Azure.

...read moreread less

Proceedings ArticleDOI

CockroachDB: The Resilient Geo-Distributed SQL Database

Rebecca Taft, +16 more

TL;DR: The design of CockroachDB and its novel transaction model that supports consistent geo-distributed transactions on commodity hardware is presented and its distributed SQL layer automatically scales with the size of the database cluster while providing the standard SQL interface that users expect.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Chord: A scalable peer-to-peer lookup service for internet applications

Ion Stoica, +4 more

TL;DR: Results from theoretical analysis, simulations, and experiments show that Chord is scalable, with communication cost and the state maintained by each node scaling logarithmically with the number of Chord nodes.

...read moreread less

Book ChapterDOI

Time, clocks, and the ordering of events in a distributed system

Leslie Lamport

- 04 Oct 2019 -

Concurrency and Computation: Practice an...

TL;DR: In this paper, the concept of one event happening before another in a distributed system is examined, and a distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events.

...read moreread less

Book ChapterDOI

Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems

Antony Rowstron, +1 more

- 12 Nov 2001 -

Lecture Notes in Computer Science

TL;DR: Pastry as mentioned in this paper is a scalable, distributed object location and routing substrate for wide-area peer-to-peer ap- plications, which performs application-level routing and object location in a po- tentially very large overlay network of nodes connected via the Internet.

...read moreread less

Journal ArticleDOI

Time, clocks, and the ordering of events in a distributed system

Leslie Lamport

- 01 Jul 1978 -

Communications of The ACM

TL;DR: In this article, the concept of one event happening before another in a distributed system is examined, and a distributed algorithm is given for synchronizing a system of logical clocks which can be used to totally order the events.

...read moreread less

Journal ArticleDOI

The Google file system

Sanjay Ghemawat, +2 more

TL;DR: This paper presents file system interface extensions designed to support distributed applications, discusses many aspects of the design, and reports measurements from both micro-benchmarks and real world use.

...read moreread less

Dynamo: amazon's highly available key-value store

Citations

Department of Computer Science and Engineering

High performance database logging using storage class memory

DARE: High-Performance State Machine Replication on RDMA Networks

Adapting microsoft SQL server for cloud computing

CockroachDB: The Resilient Geo-Distributed SQL Database

References

Chord: A scalable peer-to-peer lookup service for internet applications

Time, clocks, and the ordering of events in a distributed system

Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems

Time, clocks, and the ordering of events in a distributed system

The Google file system

Related Papers (5)

Cassandra: a decentralized structured storage system

Bigtable: A Distributed Storage System for Structured Data

The Google file system

MapReduce: simplified data processing on large clusters

Benchmarking cloud serving systems with YCSB

Trending Questions (1)