
European Conference on Computer Systems 

About: The European Conference on Computer Systems is an academic conference. It publishes mainly in the areas of Cloud computing and Computer science. Over its lifetime, the conference has published 917 papers, which have received 57,222 citations.


Papers
Proceedings Article
Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, Dennis Fetterly
21 Mar 2007
TL;DR: The Dryad execution engine handles all the difficult problems of creating a large distributed, concurrent application: scheduling the use of computers and their CPUs, recovering from communication or computer failures, and transporting data between vertices.
Abstract: Dryad is a general-purpose distributed execution engine for coarse-grain data-parallel applications. A Dryad application combines computational "vertices" with communication "channels" to form a dataflow graph. Dryad runs the application by executing the vertices of this graph on a set of available computers, communicating as appropriate through files, TCP pipes, and shared-memory FIFOs. The vertices provided by the application developer are quite simple and are usually written as sequential programs with no thread creation or locking. Concurrency arises from Dryad scheduling vertices to run simultaneously on multiple computers, or on multiple CPU cores within a computer. The application can discover the size and placement of data at run time, and modify the graph as the computation progresses to make efficient use of the available resources. Dryad is designed to scale from powerful multi-core single computers, through small clusters of computers, to data centers with thousands of computers. The Dryad execution engine handles all the difficult problems of creating a large distributed, concurrent application: scheduling the use of computers and their CPUs, recovering from communication or computer failures, and transporting data between vertices.
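
As a rough single-process illustration of the vertex-and-channel dataflow model the abstract describes, the sketch below wires a few sequential functions into a small graph and runs them in order. The Vertex/Channel/run names, the in-memory channels, and the topological-order runner are assumptions made for this example, not Dryad's actual API.

```python
# Toy sketch of a Dryad-style dataflow graph (illustrative only, not Dryad's API).
from collections import deque

class Vertex:
    def __init__(self, name, fn):
        self.name = name      # label for this vertex
        self.fn = fn          # sequential user code: list -> list
        self.inputs = []      # channels feeding this vertex
        self.outputs = []     # channels this vertex writes to

class Channel:
    """Stands in for a file, TCP pipe, or shared-memory FIFO."""
    def __init__(self, src, dst):
        self.items = deque()
        src.outputs.append(self)
        dst.inputs.append(self)

def run(vertices, source_data):
    """Execute vertices in the given (topological) order; a real engine
    would instead schedule independent vertices across machines and cores."""
    results = {}
    for v in vertices:
        if v.inputs:
            incoming = [x for ch in v.inputs for x in ch.items]
        else:
            incoming = source_data[v.name]
        out = v.fn(incoming)
        results[v.name] = out
        for ch in v.outputs:
            ch.items.extend(out)
    return results

# Two independent "map" vertices feeding one aggregation vertex.
m1 = Vertex("map1", lambda xs: [x * x for x in xs])
m2 = Vertex("map2", lambda xs: [x + 1 for x in xs])
agg = Vertex("sum", lambda xs: [sum(xs)])
Channel(m1, agg)
Channel(m2, agg)
print(run([m1, m2, agg], {"map1": [1, 2, 3], "map2": [10, 20]}))
```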

2,867 citations

Proceedings Article
23 Apr 2018
TL;DR: This paper describes Fabric, its architecture, the rationale behind various design decisions, its most prominent implementation aspects, as well as its distributed application programming model, and shows that Fabric achieves end-to-end throughput of more than 3500 transactions per second in certain popular deployment configurations.
Abstract: Fabric is a modular and extensible open-source system for deploying and operating permissioned blockchains and one of the Hyperledger projects hosted by the Linux Foundation (www.hyperledger.org). Fabric is the first truly extensible blockchain system for running distributed applications. It supports modular consensus protocols, which allows the system to be tailored to particular use cases and trust models. Fabric is also the first blockchain system that runs distributed applications written in standard, general-purpose programming languages, without systemic dependency on a native cryptocurrency. This stands in sharp contrast to existing blockchain platforms that require "smart-contracts" to be written in domain-specific languages or rely on a cryptocurrency. Fabric realizes the permissioned model using a portable notion of membership, which may be integrated with industry-standard identity management. To support such flexibility, Fabric introduces an entirely novel blockchain design and revamps the way blockchains cope with non-determinism, resource exhaustion, and performance attacks. This paper describes Fabric, its architecture, the rationale behind various design decisions, its most prominent implementation aspects, as well as its distributed application programming model. We further evaluate Fabric by implementing and benchmarking a Bitcoin-inspired digital currency. We show that Fabric achieves end-to-end throughput of more than 3500 transactions per second in certain popular deployment configurations, with sub-second latency, scaling well to over 100 peers.
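
To illustrate the abstract's point that contracts can be ordinary code over application state rather than cryptocurrency scripts, here is a toy "transfer" contract over a key-value world state. The WorldState class and transfer function are invented for this example and are not the Hyperledger Fabric chaincode API; in Fabric itself such logic runs on endorsing peers, and the resulting read/write sets are then ordered and validated.

```python
# Toy smart-contract-style logic over a key-value "world state"
# (illustrative only; not the Hyperledger Fabric chaincode API).

class WorldState:
    """Simple key-value store standing in for the ledger's current state."""
    def __init__(self):
        self.kv = {}

    def get(self, key):
        return self.kv.get(key, 0)

    def put(self, key, value):
        self.kv[key] = value

def transfer(state, src, dst, amount):
    """Deterministic application logic in a general-purpose language:
    move `amount` between two accounts, with no native cryptocurrency."""
    if amount <= 0 or state.get(src) < amount:
        raise ValueError("invalid transfer")
    state.put(src, state.get(src) - amount)
    state.put(dst, state.get(dst) + amount)

ledger = WorldState()
ledger.put("alice", 100)
transfer(ledger, "alice", "bob", 30)
print(ledger.kv)  # {'alice': 70, 'bob': 30}
```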

2,813 citations

Proceedings Article
10 Apr 2011
TL;DR: The design and implementation of CloneCloud is presented: a system that automatically transforms mobile applications to benefit from the cloud. Its flexible application partitioner and execution runtime enable unmodified mobile applications running in an application-level virtual machine to seamlessly off-load part of their execution from mobile devices onto device clones operating in a computational cloud.
Abstract: Mobile applications are becoming increasingly ubiquitous and provide ever richer functionality on mobile devices. At the same time, such devices often enjoy strong connectivity with more powerful machines ranging from laptops and desktops to commercial clouds. This paper presents the design and implementation of CloneCloud, a system that automatically transforms mobile applications to benefit from the cloud. The system is a flexible application partitioner and execution runtime that enables unmodified mobile applications running in an application-level virtual machine to seamlessly off-load part of their execution from mobile devices onto device clones operating in a computational cloud. CloneCloud uses a combination of static analysis and dynamic profiling to partition applications automatically at a fine granularity while optimizing execution time and energy use for a target computation and communication environment. At runtime, the application partitioning is effected by migrating a thread from the mobile device at a chosen point to the clone in the cloud, executing there for the remainder of the partition, and re-integrating the migrated thread back to the mobile device. Our evaluation shows that CloneCloud can adapt application partitioning to different environments, and can help some applications achieve as much as a 20x execution speed-up and a 20-fold decrease of energy spent on the mobile device.
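
As a back-of-the-envelope sketch of the trade-off such a partitioner weighs, the function below offloads a method only when remote execution plus state transfer beats local execution. The function name, profile numbers, and network parameters are invented for illustration; CloneCloud's actual partitioner solves an optimization over static-analysis and dynamic-profiling data and also accounts for energy.

```python
# Simplified offload decision (illustrative only; not CloneCloud's optimizer).

def should_offload(local_ms, remote_ms, transfer_kb,
                   bandwidth_kbps=1000.0, rtt_ms=50.0):
    """Offload if running on the cloud clone plus shipping the needed state
    is estimated to be faster than running locally on the device."""
    transfer_ms = rtt_ms + (transfer_kb / bandwidth_kbps) * 1000.0
    return remote_ms + transfer_ms < local_ms

# A heavy image-processing call: slow locally, cheap to ship.
print(should_offload(local_ms=4000, remote_ms=300, transfer_kb=200))  # True
# A tiny UI callback: not worth the round trip.
print(should_offload(local_ms=5, remote_ms=1, transfer_kb=10))        # False
```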

2,054 citations

Proceedings Article
13 Apr 2010
TL;DR: This work proposes a simple algorithm called delay scheduling, which achieves nearly optimal data locality in a variety of workloads and can increase throughput by up to 2x while preserving fairness.
Abstract: As organizations start to use data-intensive cluster computing systems like Hadoop and Dryad for more applications, there is a growing need to share clusters between users. However, there is a conflict between fairness in scheduling and data locality (placing tasks on nodes that contain their input data). We illustrate this problem through our experience designing a fair scheduler for a 600-node Hadoop cluster at Facebook. To address the conflict between locality and fairness, we propose a simple algorithm called delay scheduling: when the job that should be scheduled next according to fairness cannot launch a local task, it waits for a small amount of time, letting other jobs launch tasks instead. We find that delay scheduling achieves nearly optimal data locality in a variety of workloads and can increase throughput by up to 2x while preserving fairness. In addition, the simplicity of delay scheduling makes it applicable under a wide variety of scheduling policies beyond fair sharing.
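
A minimal sketch of the delay-scheduling rule described above: when the job that is next in fair-share order cannot launch a data-local task on the free node, it is skipped a bounded number of times before being allowed to launch non-locally. The dictionaries and the fixed skip threshold are simplifications for this example, not the Hadoop fair scheduler's actual implementation.

```python
# Delay scheduling, simplified (illustrative sketch only).

def assign_task(jobs, free_node, max_skips=3):
    """`jobs` is ordered by fair-share priority; each job records the nodes
    holding its input data and how many times it has been skipped."""
    for job in jobs:
        if not job["pending_tasks"]:
            continue
        if free_node in job["local_nodes"]:
            job["skips"] = 0
            return job["name"], "local"
        if job["skips"] >= max_skips:
            job["skips"] = 0
            return job["name"], "remote"  # give up on locality for now
        job["skips"] += 1                 # wait; let a later job use this slot
    return None

jobs = [
    {"name": "A", "pending_tasks": 5, "local_nodes": {"n3"}, "skips": 0},
    {"name": "B", "pending_tasks": 2, "local_nodes": {"n1"}, "skips": 0},
]
# Job A keeps waiting while B launches locally on n1, until A's patience
# runs out (remote launch) or a node with A's data (n3) frees up.
for node in ["n1", "n1", "n1", "n1", "n3"]:
    print(node, assign_task(jobs, node))
```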

1,514 citations

Proceedings Article
17 Apr 2015
TL;DR: A summary of the Borg system architecture and features, important design decisions, a quantitative analysis of some of its policy decisions, and a qualitative examination of lessons learned from a decade of operational experience with it are presented.
Abstract: Google's Borg system is a cluster manager that runs hundreds of thousands of jobs, from many thousands of different applications, across a number of clusters each with up to tens of thousands of machines. It achieves high utilization by combining admission control, efficient task-packing, over-commitment, and machine sharing with process-level performance isolation. It supports high-availability applications with runtime features that minimize fault-recovery time, and scheduling policies that reduce the probability of correlated failures. Borg simplifies life for its users by offering a declarative job specification language, name service integration, real-time job monitoring, and tools to analyze and simulate system behavior. We present a summary of the Borg system architecture and features, important design decisions, a quantitative analysis of some of its policy decisions, and a qualitative examination of lessons learned from a decade of operational experience with it.
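
To make the declarative-specification and task-packing ideas concrete, the sketch below pairs a toy job description with a greedy best-fit placement pass. The spec fields, machine list, and scoring rule are assumptions made for this example; they are not Borg's job configuration language or its real admission-control and scoring machinery.

```python
# Toy declarative job spec plus best-fit packing (illustrative only; not Borg).

job = {
    "name": "frontend",
    "replicas": 3,
    "resources": {"cpu": 2.0, "ram_gb": 4.0},  # per-task request
    "priority": "production",
}

machines = [
    {"name": "m1", "cpu": 4.0,  "ram_gb": 8.0},
    {"name": "m2", "cpu": 16.0, "ram_gb": 64.0},
    {"name": "m3", "cpu": 8.0,  "ram_gb": 16.0},
]

def place(job, machines):
    """Greedy best-fit: put each replica on the feasible machine with the
    least leftover CPU, keeping large holes free for large tasks."""
    placements = []
    for _ in range(job["replicas"]):
        need = job["resources"]
        feasible = [m for m in machines
                    if m["cpu"] >= need["cpu"] and m["ram_gb"] >= need["ram_gb"]]
        if not feasible:
            raise RuntimeError("no capacity: admission control would queue or reject")
        best = min(feasible, key=lambda m: m["cpu"] - need["cpu"])
        best["cpu"] -= need["cpu"]
        best["ram_gb"] -= need["ram_gb"]
        placements.append(best["name"])
    return placements

print(place(job, machines))  # ['m1', 'm1', 'm3']
```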

1,185 citations

Performance Metrics
No. of papers from the Conference in previous years
Year    Papers
2023    46
2022    47
2021    87
2020    79
2019    73
2018    71