Windows Azure Storage: a highly available cloud storage service with strong consistency

doi:10.1145/2043556.2043571

Proceedings ArticleDOI

Brad Calder, +26 more

- pp 143-157

TLDR

The WAS architecture, global namespace, and data model are described, along with its resource provisioning, load balancing, and replication systems.

Abstract:

Windows Azure Storage (WAS) is a cloud storage system that provides customers the ability to store seemingly limitless amounts of data for any duration of time. WAS customers have access to their data from anywhere at any time and only pay for what they use and store. In WAS, data is stored durably using both local and geographic replication to facilitate disaster recovery. Currently, WAS storage comes in the form of Blobs (files), Tables (structured storage), and Queues (message delivery). In this paper, we describe the WAS architecture, global namespace, and data model, as well as its resource provisioning, load balancing, and replication systems.

Citations

Proceedings Article

Erasure coding in Windows Azure Storage

Cheng Huang, +7 more

TL;DR: This paper introduces a new set of erasure codes called Local Reconstruction Codes (LRC) and describes how LRC is used in WAS to provide low-overhead durable storage with consistently low read latencies.

Proceedings ArticleDOI

Naiad: a timely dataflow system

Derek G. Murray, +5 more

TL;DR: It is shown that many powerful high-level programming models can be built on Naiad's low-level primitives, enabling such diverse tasks as streaming data analysis, iterative machine learning, and interactive graph mining.

Journal ArticleDOI

Big Data computing and clouds

Marcos Dias De Assuncao, +4 more

- 01 May 2015 -

Journal of Parallel and Distributed Comp...

TL;DR: This paper discusses approaches and environments for carrying out analytics on Clouds for Big Data applications, and identifies possible gaps in technology and provides recommendations for the research community on future directions on Cloud-supported Big Data computing and analytics solutions.

Journal ArticleDOI

XORing elephants: novel erasure codes for big data

Maheswaran Sathiamoorthy, +6 more

TL;DR: This article presents a family of erasure codes that are efficiently repairable and offer higher reliability than Reed-Solomon codes. Reed-Solomon codes are the standard design choice, and their high repair cost is often considered an unavoidable price to pay for high storage efficiency and high reliability.

Proceedings ArticleDOI

Paragon: QoS-aware scheduling for heterogeneous datacenters

Christina Delimitrou, +1 more

TL;DR: Paragon is an online, scalable datacenter (DC) scheduler that is heterogeneity- and interference-aware. It is derived from robust analytical methods and uses collaborative filtering techniques to quickly and accurately classify an unknown incoming workload by identifying similarities to previously scheduled applications.

References

Journal ArticleDOI

The Google file system

Sanjay Ghemawat, +2 more

TL;DR: This paper presents file system interface extensions designed to support distributed applications, discusses many aspects of the design, and reports measurements from both micro-benchmarks and real world use.

Proceedings Article

Bigtable: A Distributed Storage System for Structured Data (Awarded Best Paper!).

Fay W. Chang, +8 more

TL;DR: Bigtable is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers. It is used by many Google projects, including web indexing, Google Earth, and Google Finance.

Proceedings ArticleDOI

Dynamo: amazon's highly available key-value store

Giuseppe DeCandia, +8 more

TL;DR: Dynamo is presented, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience. It makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.

Journal ArticleDOI

Bigtable: A Distributed Storage System for Structured Data

Fay W. Chang, +8 more

- 01 Jun 2008 -

ACM Transactions on Computer Systems

TL;DR: The simple data model provided by Bigtable is described, which gives clients dynamic control over data layout and format, and the design and implementation of Bigtable are described.

Journal ArticleDOI

The part-time parliament

Leslie Lamport

- 01 May 1998 -

ACM Transactions on Computer Systems

TL;DR: The Paxos parliament's protocol provides a new way of implementing the state machine approach to the design of distributed systems.

Related Papers (5)

The Google file system

Dynamo: amazon's highly available key-value store

The Hadoop Distributed File System

Polynomial Codes Over Certain Finite Fields

Cassandra: a decentralized structured storage system