Showing papers on "Scalability published in 2012"

PDF

Open Access

Journal Article•DOI•

[...]

Van L. Jacobson, Diana K. Smetters, James D. Thornton, Michael F. Plass, Nicholas H. Briggs, R. Braynard - Show less +2 more

01 Jan 2012-Communications of The ACM

TL;DR: Content-Centric Networking (CCN) is presented which uses content chunks as a primitive---decoupling location from identity, security and access, and retrieving chunks of content by name, and simultaneously achieves scalability, security, and performance.

...read moreread less

Abstract: Current network use is dominated by content distribution and retrieval yet current networking protocols are designed for conversations between hosts. Accessing content and services requires mapping from the what that users care about to the network's where. We present Content-Centric Networking (CCN) which uses content chunks as a primitive---decoupling location from identity, security and access, and retrieving chunks of content by name. Using new approaches to routing named content, derived from IP, CCN simultaneously achieves scalability, security, and performance. We describe our implementation of the architecture's basic features and demonstrate its performance and resilience with secure file downloads and VoIP calls.

...read moreread less

3,122 citations

Proceedings Article•DOI•

Spanner: Google's globally-distributed database

[...]

James C. Corbett¹, Jeffrey Dean¹, Michael James Boyer Epstein¹, Andrew Fikes¹, Christopher Frost¹, J. J. Furman¹, Sanjay Ghemawat¹, Andrey Gubarev¹, Christopher Heiser¹, Peter Hochschild¹, Wilson C. Hsieh¹, Sebastian Kanthak¹, Eugene Kogan¹, Hongyi Li¹, Alexander Lloyd¹, Sergey Melnik¹, David Mwaura¹, David Nagle¹, Sean Quinlan¹, Rajesh Rao¹, Lindsay Rolig¹, Yasushi Saito¹, Michal Piotr Szymaniak¹, Chris Jorgen Taylor¹, Ruth Wang¹, Dale Woodford¹ - Show less +22 more•Institutions (1)

Google¹

08 Oct 2012

TL;DR: This article describes how Spanner is structured, its feature set, the rationale underlying various design decisions, and a novel time API that exposes clock uncertainty, critical to supporting external consistency and a variety of powerful features.

...read moreread less

Abstract: Spanner is Google's scalable, multi-version, globally-distributed, and synchronously-replicated database. It is the first system to distribute data at global scale and support externally-consistent distributed transactions. This paper describes how Spanner is structured, its feature set, the rationale underlying various design decisions, and a novel time API that exposes clock uncertainty. This API and its implementation are critical to supporting external consistency and a variety of powerful features: nonblocking reads in the past, lock-free read-only transactions, and atomic schema changes, across all of Spanner.

...read moreread less

1,366 citations

Proceedings Article•DOI•

ThinkAir: Dynamic resource allocation and parallel execution in the cloud for mobile code offloading

[...]

Sokol Kosta, Andrius Aucinas¹, Pan Hui², Richard Mortier³, Xinwen Zhang⁴ - Show less +1 more•Institutions (4)

University of Cambridge¹, Deutsche Telekom², University of Nottingham³, Huawei⁴

25 Mar 2012

TL;DR: This paper proposes ThinkAir, a framework that makes it simple for developers to migrate their smartphone applications to the cloud and enhances the power of mobile cloud computing by parallelizing method execution using multiple virtual machine (VM) images.

...read moreread less

Abstract: Smartphones have exploded in popularity in recent years, becoming ever more sophisticated and capable. As a result, developers worldwide are building increasingly complex applications that require ever increasing amounts of computational power and energy. In this paper we propose ThinkAir, a framework that makes it simple for developers to migrate their smartphone applications to the cloud. ThinkAir exploits the concept of smartphone virtualization in the cloud and provides method-level computation offloading. Advancing on previous work, it focuses on the elasticity and scalability of the cloud and enhances the power of mobile cloud computing by parallelizing method execution using multiple virtual machine (VM) images. We implement ThinkAir and evaluate it with a range of benchmarks starting from simple micro-benchmarks to more complex applications. First, we show that the execution time and energy consumption decrease two orders of magnitude for a N-queens puzzle application and one order of magnitude for a face detection and a virus scan application. We then show that a parallelizable application can invoke multiple VMs to execute in the cloud in a seamless and on-demand manner such as to achieve greater reduction on execution time and energy consumption. We finally use a memory-hungry image combiner tool to demonstrate that applications can dynamically request VMs with more computational power in order to meet their computational requirements.

...read moreread less

1,215 citations

A scalable peer-to-peer lookup protocol for Internet applications

[...]

Gade Krishna

01 Jan 2012

TL;DR: This paper proposes EQUATOR (Equivalent Servant Locator), an unstructured overlay implementing the above mentioned operating principles, based on an overlay construction algorithm that well approximates an ideal scale-free construction model.

...read moreread less

Abstract: while peer-to-peer networks are mainly used to locate unique resources across the Internet, new interesting deployment scenarios are emerging. Particularly, some applications (e.g., VoIP) are proposing the creation of overlays for the localization of services based on equivalent servants (e.g., voice relays). This paper explores the possible overlay architectures that can be adopted to provide such services, showing how an unstructured solution based on a scale-free overlay topology is an effective option to deploy in this context. Consequently, we propose EQUATOR (Equivalent Servant Locator), an unstructured overlay implementing the above mentioned operating principles, based on an overlay construction algorithm that well approximates an ideal scale-free construction model. We present both analytical and simulation results which support our overlay topology selection and validate the proposed architecture.

...read moreread less

1,030 citations

Proceedings Article•DOI•

The controller placement problem

[...]

Brandon Heller¹, Rob Sherwood², Nick McKeown¹•Institutions (2)

Stanford University¹, Switch²

13 Aug 2012

TL;DR: This paper examines fundamental limits to control plane propagation latency on an upcoming Internet2 production deployment, then expands the scope to over 100 publicly available WAN topologies and finds that one controller location is often sufficient to meet existing reaction-time requirements.

...read moreread less

Abstract: Network architectures such as Software-Defined Networks (SDNs) move the control logic off packet processing devices and onto external controllers. These network architectures with decoupled control planes open many unanswered questions regarding reliability, scalability, and performance when compared to more traditional purely distributed systems. This paper opens the investigation by focusing on two specific questions: given a topology, how many controllers are needed, and where should they go? To answer these questions, we examine fundamental limits to control plane propagation latency on an upcoming Internet2 production deployment, then expand our scope to over 100 publicly available WAN topologies. As expected, the answers depend on the topology. More surprisingly, one controller location is often sufficient to meet existing reaction-time requirements (though certainly not fault tolerance requirements).

...read moreread less

893 citations

Journal Article•DOI•

BEDOPS: high-performance genomic feature operations.

[...]

Shane Neph¹, Scott Kuehn¹, Alex Reynolds¹, Eric Haugen¹, Robert E. Thurman¹, Audra K. Johnson¹, Eric Rynes¹, Matthew T. Maurano¹, Jeff Vierstra¹, Sean Thomas¹, Richard Sandstrom¹, Richard Humbert¹, John A. Stamatoyannopoulos¹ - Show less +9 more•Institutions (1)

University of Washington¹

15 Jul 2012-Bioinformatics

TL;DR: BEDOPS is introduced, a software suite for common genomic analysis tasks which offers improved flexibility, scalability and execution time characteristics over previously published packages, and includes a utility to compress large inputs into a lossless format that can provide greater space savings and faster data extractions than alternatives.

...read moreread less

Abstract: Summary: The large and growing number of genome-wide datasets highlights the need for high-performance feature analysis and data comparison methods, in addition to efficient data storage and retrieval techniques. We introduce BEDOPS, a software suite for common genomic analysis tasks which offers improved flexibility, scalability and execution time characteristics over previously published packages. The suite includes a utility to compress large inputs into a lossless format that can provide greater space savings and faster data extractions than alternatives. Availability: http://code.google.com/p/bedops/ includes binaries, source and documentation. Contact: ude.notgnihsaw.u@njs and ude.notgnihsaw.u@matsj Supplementary information: Supplementary data are available at Bioinformatics online.

...read moreread less

769 citations

Proceedings Article•DOI•

Kandoo: a framework for efficient and scalable offloading of control applications

[...]

Soheil Hassas Yeganeh¹, Yashar Ganjali¹•Institutions (1)

University of Toronto¹

13 Aug 2012

TL;DR: Kandoo is proposed, a framework for preserving scalability without changing switches that enables network operators to replicate local controllers on demand and relieve the load on the top layer, which is the only potential bottleneck in terms of scalability.

...read moreread less

Abstract: Limiting the overhead of frequent events on the control plane is essential for realizing a scalable Software-Defined Network. One way of limiting this overhead is to process frequent events in the data plane. This requires modifying switches and comes at the cost of visibility in the control plane. Taking an alternative route, we propose Kandoo, a framework for preserving scalability without changing switches. Kandoo has two layers of controllers: (i) the bottom layer is a group of controllers with no interconnection, and no knowledge of the network-wide state, and (ii) the top layer is a logically centralized controller that maintains the network-wide state. Controllers at the bottom layer run only local control applications (i.e., applications that can function using the state of a single switch) near datapaths. These controllers handle most of the frequent events and effectively shield the top layer. Kandoo's design enables network operators to replicate local controllers on demand and relieve the load on the top layer, which is the only potential bottleneck in terms of scalability. Our evaluations show that a network controlled by Kandoo has an order of magnitude lower control channel consumption compared to normal OpenFlow networks.

...read moreread less

697 citations

Proceedings Article•DOI•

Calvin: fast distributed transactions for partitioned database systems

[...]

Alexander Thomson¹, Thaddeus Diamond¹, Shu-Chun Weng¹, Kun Ren¹, Philip Shao¹, Daniel J. Abadi¹ - Show less +2 more•Institutions (1)

Yale University¹

20 May 2012

TL;DR: Calvin is a practical transaction scheduling and data replication layer that uses a deterministic ordering guarantee to significantly reduce the normally prohibitive contention costs associated with distributed transactions.

...read moreread less

Abstract: Many distributed storage systems achieve high data access throughput via partitioning and replication, each system with its own advantages and tradeoffs. In order to achieve high scalability, however, today's systems generally reduce transactional support, disallowing single transactions from spanning multiple partitions. Calvin is a practical transaction scheduling and data replication layer that uses a deterministic ordering guarantee to significantly reduce the normally prohibitive contention costs associated with distributed transactions. Unlike previous deterministic database system prototypes, Calvin supports disk-based storage, scales near-linearly on a cluster of commodity machines, and has no single point of failure. By replicating transaction inputs rather than effects, Calvin is also able to support multiple consistency levels---including Paxos-based strong consistency across geographically distant replicas---at no cost to transactional throughput.

...read moreread less

534 citations

Proceedings Article•

A NICE way to test openflow applications

[...]

Marco Canini¹, Daniele Venzano¹, Peter Peresini¹, Dejan Kostic¹, Jennifer Rexford² - Show less +1 more•Institutions (2)

École Polytechnique Fédérale de Lausanne¹, Princeton University²

25 Apr 2012

TL;DR: This paper proposes a novel way to augment model checking with symbolic execution of event handlers (to identify representative packets that exercise code paths on the controller) and presents a simplified OpenFlow switch model (to reduce the state space), and effective strategies for generating event interleavings likely to uncover bugs.

...read moreread less

Abstract: The emergence of OpenFlow-capable switches enables exciting new network functionality, at the risk of programming errors that make communication less reliable. The centralized programming model, where a single controller program manages the network, seems to reduce the likelihood of bugs. However, the system is inherently distributed and asynchronous, with events happening at different switches and end hosts, and inevitable delays affecting communication with the controller. In this paper, we present efficient, systematic techniques for testing unmodified controller programs. Our NICE tool applies model checking to explore the state space of the entire system--the controller, the switches, and the hosts. Scalability is the main challenge, given the diversity of data packets, the large system state, and the many possible event orderings. To address this, we propose a novel way to augment model checking with symbolic execution of event handlers (to identify representative packets that exercise code paths on the controller). We also present a simplified OpenFlow switch model (to reduce the state space), and effective strategies for generating event interleavings likely to uncover bugs. Our prototype tests Python applications on the popular NOX platform. In testing three real applications--a MAC-learning switch, in-network server load balancing, and energy-efficient traffic engineering--we uncover eleven bugs.

...read moreread less

531 citations

Journal Article•DOI•

HASBE: A Hierarchical Attribute-Based Solution for Flexible and Scalable Access Control in Cloud Computing

[...]

Zhiguo Wan¹, Jun'e Liu¹, Robert H. Deng•Institutions (1)

Tsinghua University¹

01 Apr 2012-IEEE Transactions on Information Forensics and Security

TL;DR: The security of HASBE is formally proved based on security of the ciphertext-policy attribute-based encryption (CP-ABE) scheme by Bethencourt and its performance and computational complexity are formally analyzed.

...read moreread less

Abstract: Cloud computing has emerged as one of the most influential paradigms in the IT industry in recent years. Since this new computing technology requires users to entrust their valuable data to cloud providers, there have been increasing security and privacy concerns on outsourced data. Several schemes employing attribute-based encryption (ABE) have been proposed for access control of outsourced data in cloud computing; however, most of them suffer from inflexibility in implementing complex access control policies. In order to realize scalable, flexible, and fine-grained access control of outsourced data in cloud computing, in this paper, we propose hierarchical attribute-set-based encryption (HASBE) by extending ciphertext-policy attribute-set-based encryption (ASBE) with a hierarchical structure of users. The proposed scheme not only achieves scalability due to its hierarchical structure, but also inherits flexibility and fine-grained access control in supporting compound attributes of ASBE. In addition, HASBE employs multiple value assignments for access expiration time to deal with user revocation more efficiently than existing schemes. We formally prove the security of HASBE based on security of the ciphertext-policy attribute-based encryption (CP-ABE) scheme by Bethencourt and analyze its performance and computational complexity. We implement our scheme and show that it is both efficient and flexible in dealing with access control for outsourced data in cloud computing with comprehensive experiments.

...read moreread less

497 citations

Journal Article•DOI•

Cooperative Provable Data Possession for Integrity Verification in Multicloud Storage

[...]

Yan Zhu¹, Hongxin Hu², Gail-Joon Ahn², Mengyang Yu¹•Institutions (2)

Peking University¹, Arizona State University²

01 Dec 2012-IEEE Transactions on Parallel and Distributed Systems

TL;DR: This paper addresses the construction of an efficient PDP scheme for distributed cloud storage to support the scalability of service and data migration, in which it considers the existence of multiple cloud service providers to cooperatively store and maintain the clients' data.

...read moreread less

Abstract: Provable data possession (PDP) is a technique for ensuring the integrity of data in storage outsourcing. In this paper, we address the construction of an efficient PDP scheme for distributed cloud storage to support the scalability of service and data migration, in which we consider the existence of multiple cloud service providers to cooperatively store and maintain the clients' data. We present a cooperative PDP (CPDP) scheme based on homomorphic verifiable response and hash index hierarchy. We prove the security of our scheme based on multiprover zero-knowledge proof system, which can satisfy completeness, knowledge soundness, and zero-knowledge properties. In addition, we articulate performance optimization mechanisms for our scheme, and in particular present an efficient method for selecting optimal parameter values to minimize the computation costs of clients and storage service providers. Our experiments show that our solution introduces lower computation and communication overheads in comparison with noncooperative approaches.

...read moreread less

Patent•

Modular backup and retrieval system used in conjunction with a storage area network

[...]

John Crescenti, Srinivas Kavuri, David Alan Oshinsky, Anand Prahlad

13 Sep 2012

TL;DR: In this article, a modular computer storage system and method is provided for managing and directing data archiving functions, which is scalable and comprehends various storage media as well as diverse operating systems on a plurality of client devices.

...read moreread less

Abstract: A modular computer storage system and method is provided for managing and directing data archiving functions, which is scalable and comprehends various storage media as well as diverse operating systems on a plurality of client devices. A client component is associated with one or more client devices for generating archival request. A file processor directs one or more storage devices, through one or more media components, which control the actual physical level backup on various storage devices. Each media component creates a library indexing system for locating stored data. A management component coordinates the archival functions between the various client components and the file processor, including setting scheduling policies, aging policies, index pruning policies, drive cleaning policies, configuration information, and keeping track of running and waiting jobs.

...read moreread less

Proceedings Article•DOI•

Streaming graph partitioning for large distributed graphs

[...]

Isabelle Stanton¹, Gabriel Kliot²•Institutions (2)

University of California, Berkeley¹, Microsoft²

12 Aug 2012

TL;DR: This work proposes natural, simple heuristics for graph partitioning and compares their performance to hashing and METIS, a fast, offline heuristic, and shows on a large collection of graph datasets that they are a significant improvement.

...read moreread less

Abstract: Extracting knowledge by performing computations on graphs is becoming increasingly challenging as graphs grow in size. A standard approach distributes the graph over a cluster of nodes, but performing computations on a distributed graph is expensive if large amount of data have to be moved. Without partitioning the graph, communication quickly becomes a limiting factor in scaling the system up. Existing graph partitioning heuristics incur high computation and communication cost on large graphs, sometimes as high as the future computation itself. Observing that the graph has to be loaded into the cluster, we ask if the partitioning can be done at the same time with a lightweight streaming algorithm.We propose natural, simple heuristics and compare their performance to hashing and METIS, a fast, offline heuristic. We show on a large collection of graph datasets that our heuristics are a significant improvement, with the best obtaining an average gain of 76%. The heuristics are scalable in the size of the graphs and the number of partitions. Using our streaming partitioning methods, we are able to speed up PageRank computations on Spark, a distributed computation system, by 18% to 39% for large social networks.

...read moreread less

Journal Article•DOI•

A framework for partitioning and execution of data stream applications in mobile cloud computing

[...]

Lei Yang, Jiannong Cao, Yin Yuan, Tao Li, Andy Han, Alvin T. S. Chan - Show less +2 more

24 Jun 2012

TL;DR: This work studies the computation partitioning, which aims at optimizing the partition of a data stream application between mobile and cloud such that the application has maximum speed/throughput in processing the streaming data.

...read moreread less

Abstract: The contribution of cloud computing and mobile computing technologies lead to the newly emerging mobile cloud computing paradigm. Three major approaches have been proposed for mobile cloud applications: 1) extending the access to cloud services to mobile devices; 2) enabling mobile devices to work collaboratively as cloud resource providers; 3) augmenting the execution of mobile applications on portable devices using cloud resources. In this paper, we focus on the third approach in supporting mobile data stream applications. More specifically, we study how to optimize the computation partitioning of a data stream application between mobile and cloud to achieve maximum speed/throughput in processing the streaming data.To the best of our knowledge, it is the first work to study the partitioning problem for mobile data stream applications, where the optimization is placed on achieving high throughput of processing the streaming data rather than minimizing the makespan of executions as in other applications. We first propose a framework to provide runtime support for the dynamic computation partitioning and execution of the application. Different from existing works, the framework not only allows the dynamic partitioning for a single user but also supports the sharing of computation instances among multiple users in the cloud to achieve efficient utilization of the underlying cloud resources. Meanwhile, the framework has better scalability because it is designed on the elastic cloud fabrics. Based on the framework, we design a genetic algorithm for optimal computation partition. Both numerical evaluation and real world experiment have been performed, and the results show that the partitioned application can achieve at least two times better performance in terms of throughput than the application without partitioning.

...read moreread less

Proceedings Article•DOI•

Toward Software-Defined Cellular Networks

[...]

Li Erran Li¹, Z. Morley Mao², Jennifer Rexford³•Institutions (3)

Alcatel-Lucent¹, University of Michigan², Princeton University³

25 Oct 2012

TL;DR: It is argued that software defined networking (SDN) can simplify the design and management of cellular data networks, while enabling new services, but supporting many subscribers, frequent mobility, fine-grained measurement and control, and real-time adaptation introduces new scalability challenges that future SDN architectures should address.

...read moreread less

Abstract: Existing cellular networks suffer from inflexible and expensive equipment, complex control-plane protocols, and vendor-specific configuration interfaces. In this position paper, we argue that software defined networking (SDN) can simplify the design and management of cellular data networks, while enabling new services. However, supporting many subscribers, frequent mobility, fine-grained measurement and control, and real-time adaptation introduces new scalability challenges that future SDN architectures should address. As a first step, we propose extensions to controller platforms, switches, and base stations to enable controller applications to (i) express high-level policies based on subscriber attributes, rather than addresses and locations, (ii) apply real-time, fine-grained control through local agents on the switches, (iii)perform deep packet inspection and header compression on packets, and (iv)remotely manage shares of base-station resources.

...read moreread less

Proceedings Article•DOI•

Logically centralized?: state distribution trade-offs in software defined networks

[...]

Dan Levin¹, Andreas Wundsam², Brandon Heller³, Nikhil Handigol³, Anja Feldmann¹ - Show less +1 more•Institutions (3)

Technical University of Berlin¹, University of California, Berkeley², Stanford University³

13 Aug 2012

TL;DR: The state exchange points in a distributed SDN control plane are characterized and two key state distribution trade-offs are identified and simulated in the context of an existing SDN load balancer application.

...read moreread less

Abstract: Software Defined Networks (SDN) give network designers freedom to refactor the network control plane. One core benefit of SDN is that it enables the network control logic to be designed and operated on a global network view, as though it were a centralized application, rather than a distributed system - logically centralized. Regardless of this abstraction, control plane state and logic must inevitably be physically distributed to achieve responsiveness, reliability, and scalability goals. Consequently, we ask: "How does distributed SDN state impact the performance of a logically centralized control application?"Motivated by this question, we characterize the state exchange points in a distributed SDN control plane and identify two key state distribution trade-offs. We simulate these exchange points in the context of an existing SDN load balancer application. We evaluate the impact of inconsistent global network view on load balancer performance and compare different state management approaches. Our results suggest that SDN control state inconsistency significantly degrades performance of logically centralized control applications agnostic to the underlying state distribution.

...read moreread less

Proceedings Article•DOI•

On the role of burst buffers in leadership-class storage systems

[...]

Ning Liu¹, Jason Cope², Philip Carns², Christopher D. Carothers¹, Robert Ross², Gary Grider³, Adam Crume⁴, Carlos Maltzahn⁴ - Show less +4 more•Institutions (4)

Rensselaer Polytechnic Institute¹, Argonne National Laboratory², Los Alamos National Laboratory³, University of California, Santa Cruz⁴

16 Apr 2012

TL;DR: It is shown that burst buffers can accelerate the application perceived throughput to the external storage system and can reduce the amount of external storage bandwidth required to meet a desired application perceived bottleneck goal.

...read moreread less

Abstract: The largest-scale high-performance (HPC) systems are stretching parallel file systems to their limits in terms of aggregate bandwidth and numbers of clients. To further sustain the scalability of these file systems, researchers and HPC storage architects are exploring various storage system designs. One proposed storage system design integrates a tier of solid-state burst buffers into the storage system to absorb application I/O requests. In this paper, we simulate and explore this storage system design for use by large-scale HPC systems. First, we examine application I/O patterns on an existing large-scale HPC system to identify common burst patterns. Next, we describe enhancements to the CODES storage system simulator to enable our burst buffer simulations. These enhancements include the integration of a burst buffer model into the I/O forwarding layer of the simulator, the development of an I/O kernel description language and interpreter, the development of a suite of I/O kernels that are derived from observed I/O patterns, and fidelity improvements to the CODES models. We evaluate the I/O performance for a set of multiapplication I/O workloads and burst buffer configurations. We show that burst buffers can accelerate the application perceived throughput to the external storage system and can reduce the amount of external storage bandwidth required to meet a desired application perceived throughput goal.

...read moreread less

Journal Article•DOI•

StreamCloud: An Elastic and Scalable Data Streaming System

[...]

Vincenzo Gulisano, Ricardo Jiménez-Peris, Marta Patiño-Martínez, Claudio Soriente, Patrick Valduriez¹ - Show less +1 more•Institutions (1)

French Institute for Research in Computer Science and Automation¹

01 Dec 2012-IEEE Transactions on Parallel and Distributed Systems

TL;DR: StreamCloud is presented, a scalable and elastic stream processing engine for processing large data stream volumes that uses a novel parallelization technique that splits queries into subqueries that are allocated to independent sets of nodes in a way that minimizes the distribution overhead.

...read moreread less

Abstract: Many applications in several domains such as telecommunications, network security, large-scale sensor networks, require online processing of continuous data flows. They produce very high loads that requires aggregating the processing capacity of many nodes. Current Stream Processing Engines do not scale with the input load due to single-node bottlenecks. Additionally, they are based on static configurations that lead to either under or overprovisioning. In this paper, we present StreamCloud, a scalable and elastic stream processing engine for processing large data stream volumes. StreamCloud uses a novel parallelization technique that splits queries into subqueries that are allocated to independent sets of nodes in a way that minimizes the distribution overhead. Its elastic protocols exhibit low intrusiveness, enabling effective adjustment of resources to the incoming load. Elasticity is combined with dynamic load balancing to minimize the computational resources used. The paper presents the system design, implementation, and a thorough evaluation of the scalability and elasticity of the fully implemented system.

...read moreread less

Journal Article•DOI•

iCanCloud: A Flexible and Scalable Cloud Infrastructure Simulator

[...]

Alberto Núñez¹, Jose Luis Vazquez-Poletti¹, Agustín C. Caminero², Gabriel G. Castañé³, Jesus Carretero³, Ignacio M. Llorente¹ - Show less +2 more•Institutions (3)

Complutense University of Madrid¹, National University of Distance Education², Charles III University of Madrid³

01 Mar 2012

TL;DR: The iCanCloud simulator is introduced and validates, a novel simulator of cloud infrastructures with remarkable features such as flexibility, scalability, performance and usability, targeted to conduct large experiments.

...read moreread less

Abstract: Simulation techniques have become a powerful tool for deciding the best starting conditions on pay-as-you-go scenarios. This is the case of public cloud infrastructures, where a given number and type of virtual machines (in short VMs) are instantiated during a specified time, being this reflected in the final budget. With this in mind, this paper introduces and validates iCanCloud, a novel simulator of cloud infrastructures with remarkable features such as flexibility, scalability, performance and usability. Furthermore, the iCanCloud simulator has been built on the following design principles: (1) it's targeted to conduct large experiments, as opposed to others simulators from literature; (2) it provides a flexible and fully customizable global hypervisor for integrating any cloud brokering policy; (3) it reproduces the instance types provided by a given cloud infrastructure; and finally, (4) it contains a user-friendly GUI for configuring and launching simulations, that goes from a single VM to large cloud computing systems composed of thousands of machines.

...read moreread less

Journal Article•DOI•

Staged memory scheduling: achieving high performance and scalability in heterogeneous systems

[...]

Rachata Ausavarungnirun¹, Kevin K. Chang¹, Lavanya Subramanian¹, Gabriel H. Loh², Onur Mutlu¹ - Show less +1 more•Institutions (2)

Carnegie Mellon University¹, Advanced Micro Devices²

09 Jun 2012

TL;DR: The Staged Memory Scheduler (SMS) is proposed, which improves CPU performance without degrading GPU frame rate beyond a generally acceptable level, while being significantly less complex to implement than previous application-aware schedulers.

...read moreread less

Abstract: When multiple processor (CPU) cores and a GPU integrated together on the same chip share the off-chip main memory, requests from the GPU can heavily interfere with requests from the CPU cores, leading to low system performance and starvation of CPU cores. Unfortunately, state-of-the-art application-aware memory scheduling algorithms are ineffective at solving this problem at low complexity due to the large amount of GPU traffic. A large and costly request buffer is needed to provide these algorithms with enough visibility across the global request stream, requiring relatively complex hardware implementations. This paper proposes a fundamentally new approach that decouples the memory controller's three primary tasks into three significantly simpler structures that together improve system performance and fairness, especially in integrated CPU-GPU systems. Our three-stage memory controller first groups requests based on row-buffer locality. This grouping allows the second stage to focus only on inter-application request scheduling. These two stages enforce high-level policies regarding performance and fairness, and therefore the last stage consists of simple per-bank FIFO queues (no further command reordering within each bank) and straightforward logic that deals only with low-level DRAM commands and timing. We evaluate the design trade-offs involved in our Staged Memory Scheduler (SMS) and compare it against three state-of-the-art memory controller designs. Our evaluations show that SMS improves CPU performance without degrading GPU frame rate beyond a generally acceptable level, while being significantly less complex to implement than previous application-aware schedulers. Furthermore, SMS can be configured by the system software to prioritize the CPU or the GPU at varying levels to address different performance needs.

...read moreread less

Journal Article•DOI•

Solving big data challenges for enterprise application performance management

[...]

Tilmann Rabl¹, Sergio Gómez-Villamor², Mohammad Sadoghi¹, Victor Muntés-Mulero, Hans-Arno Jacobsen¹, Serge Mankovskii - Show less +2 more•Institutions (2)

University of Toronto¹, Polytechnic University of Catalonia²

01 Aug 2012

TL;DR: In this article, the authors present their experience and a comprehensive performance evaluation of six modern (Open-source) data stores in the context of application performance monitoring as part of CA Technologies initiative.

...read moreread less

Abstract: As the complexity of enterprise systems increases, the need for monitoring and analyzing such systems also grows. A number of companies have built sophisticated monitoring tools that go far beyond simple resource utilization reports. For example, based on instrumentation and specialized APIs, it is now possible to monitor single method invocations and trace individual transactions across geographically distributed systems. This high-level of detail enables more precise forms of analysis and prediction but comes at the price of high data rates (i.e., big data). To maximize the benefit of data monitoring, the data has to be stored for an extended period of time for ulterior analysis. This new wave of big data analytics imposes new challenges especially for the application performance monitoring systems. The monitoring data has to be stored in a system that can sustain the high data rates and at the same time enable an up-to-date view of the underlying infrastructure. With the advent of modern key-value stores, a variety of data storage systems have emerged that are built with a focus on scalability and high data rates as predominant in this monitoring use case.In this work, we present our experience and a comprehensive performance evaluation of six modern (open-source) data stores in the context of application performance monitoring as part of CA Technologies initiative. We evaluated these systems with data and workloads that can be found in application performance monitoring, as well as, on-line advertisement, power monitoring, and many other use cases. We present our insights not only as performance results but also as lessons learned and our experience relating to the setup and configuration complexity of these data stores in an industry setting.

...read moreread less

Proceedings Article•DOI•

Practical verified computation with streaming interactive proofs

[...]

Graham Cormode¹, Michael Mitzenmacher², Justin Thaler²•Institutions (2)

AT&T Labs¹, Harvard University²

08 Jan 2012

TL;DR: In this paper, the authors present a verifiable computation prover that runs in O(S(n) log S(n)), where S is the size of an arithmetic circuit computing the function of interest; this compares favorably to the poly(S n) runtime for the prover promised in [19].

...read moreread less

Abstract: When delegating computation to a service provider, as in the cloud computing paradigm, we seek some reassurance that the output is correct and complete. Yet recomputing the output as a check is inefficient and expensive, and it may not even be feasible to store all the data locally. We are therefore interested in what can be validated by a streaming (sublinear space) user, who cannot store the full input, or perform the full computation herself. Our aim in this work is to advance a recent line of work on "proof systems" in which the service provider proves the correctness of its output to a user. The goal is to minimize the time and space costs of both parties in generating and checking the proof. Only very recently have there been attempts to implement such proof systems, and thus far these have been quite limited in functionality.Here, our approach is two-fold. First, we describe a carefully chosen instantiation of one of the most efficient general-purpose constructions for arbitrary computations (streaming or otherwise), due to Goldwasser, Kalai, and Rothblum [19]. This requires several new insights and enhancements to move the methodology from a theoretical result to a practical possibility. Our main contribution is in achieving a prover that runs in time O(S(n) log S(n)), where S(n) is the size of an arithmetic circuit computing the function of interest; this compares favorably to the poly(S(n)) runtime for the prover promised in [19]. Our experimental results demonstrate that a practical general-purpose protocol for verifiable computation may be significantly closer to reality than previously realized.Second, we describe a set of techniques that achieve genuine scalability for protocols fine-tuned for specific important problems in streaming and database processing. Focusing in particular on non-interactive protocols for problems ranging from matrix-vector multiplication to bipartite perfect matching, we build on prior work [8, 5] to achieve a prover that runs in nearly linear-time, while obtaining optimal tradeoffs between communication cost and the user's working memory. Existing techniques required (substantially) superlinear time for the prover. Finally, we develop improved interactive protocols for specific problems based on a linearization technique originally due to Shen [33]. We argue that even if general-purpose methods improve, fine-tuned protocols will remain valuable in real-world settings for key problems, and hence special attention to specific problems is warranted.

...read moreread less

Proceedings Article•DOI•

Cray cascade: a scalable HPC system based on a Dragonfly network

[...]

Greg Faanes¹, Bataineh Abdulla M¹, Duncan Roweth¹, Tom Court¹, Edwin L. Froese¹, Bob Alverson¹, Timothy J. Johnson¹, Joe Kopnick¹, Mike Higgins¹, James Reinhard¹ - Show less +6 more•Institutions (1)

Cray¹

10 Nov 2012

TL;DR: This paper presents the architecture of the Cray Cascade system, a distributed memory system based on the Dragonfly network topology, and describes a set of advanced features supporting both mainstream high performance computing applications and emerging global address space programing models.

...read moreread less

Abstract: Higher global bandwidth requirement for many applications and lower network cost have motivated the use of the Dragonfly network topology for high performance computing systems. In this paper we present the architecture of the Cray Cascade system, a distributed memory system based on the Dragonfly [1] network topology. We describe the structure of the system, its Dragonfly network and the routing algorithms. We describe a set of advanced features supporting both mainstream high performance computing applications and emerging global address space programing models. We present a combination of performance results from prototype systems and simulation data for large systems. We demonstrate the value of the Dragonfly topology and the benefits obtained through extensive use of adaptive routing.

...read moreread less

Cloud Load Balancing Techniques : A Step Towards Green Computing

[...]

Nidhi Jain Kansal, Inderveer Chana

01 Jan 2012

TL;DR: The existing load balancing techniques in cloud computing are discussed and further compares them based on various parameters like performance, scalability, associated overhead etc that are considered in different techniques.

...read moreread less

Abstract: Cloud computing is emerging as a new paradigm of large-scale distributed computing. It is a framework for enabling convenient, on-demand network access to a shared pool of computing resources. Load balancing is one of the main challenges in cloud computing which is required to distribute the dynamic workload across multiple nodes to ensure that no single node is overwhelmed. It helps in optimal utilization of resources and hence in enhancing the performance of the system. The goal of load balancing is to minimize the resource consumption which will further reduce energy consumption and carbon emission rate that is the dire need of cloud computing. This determines the need of new metrics, energy consumption and carbon emission for energy-efficient load balancing in cloud computing. This paper discusses the existing load balancing techniques in cloud computing and further compares them based on various parameters like performance, scalability, associated overhead etc. that are considered in different techniques. It further discusses these techniques from energy consumption and carbon emission perspective.

...read moreread less

Journal Article•DOI•

Scalable Distributed Communication Architectures to Support Advanced Metering Infrastructure in Smart Grid

[...]

Jiazhen Zhou¹, Rose Qingyang Hu², Yi Qian¹•Institutions (2)

University of Nebraska–Lincoln¹, Utah State University²

01 Sep 2012-IEEE Transactions on Parallel and Distributed Systems

TL;DR: A new performance metric, accumulated bandwidthdistance product (ABDP), is introduced, to represent the total communication resource usages, and demonstrates that the total cost for the centralized architecture scales linearly as O(λN), with N being the number of smart meters, and λ being the average traffic rate on a smart meter.

...read moreread less

Abstract: In this paper, we investigate the scalability of three communication architectures for advanced metering infrastructure (AMI) in smart grid. AMI in smart grid is a typical cyber-physical system (CPS) example, in which large amount of data from hundreds of thousands of smart meters are collected and processed through an AMI communication infrastructure. Scalability is one of the most important issues for the AMI deployment in smart grid. In this study, we introduce a new performance metric, accumulated bandwidthdistance product (ABDP), to represent the total communication resource usages. For each distributed communication architecture, we formulate an optimization problem and obtain the solutions for minimizing the total cost of the system that considers both the ABDP and the deployment cost of the meter data management system (MDMS). The simulation results indicate the significant benefits of the distributed communication architectures over the traditional centralized one. More importantly, we analyze the scalability of the total cost of the communication system (including MDMS) with regard to the traffic load on the smart meters for both the centralized and the distributed communication architectures. Through the closed form expressions obtained in our analysis, we demonstrate that the total cost for the centralized architecture scales linearly as O(λN), with N being the number of smart meters, and λ being the average traffic rate on a smart meter. In contrast, the total cost for the fully distributed communication architecture is O(λ2/3 N2/3), which is significantly lower.

...read moreread less

Proceedings Article•DOI•

HyperDex: a distributed, searchable key-value store

[...]

Robert Escriva¹, Bernard Wong², Emin Gün Sirer¹•Institutions (2)

Cornell University¹, University of Waterloo²

13 Aug 2012

TL;DR: The key insight behind HyperDex is the concept of hyperspace hashing in which objects with multiple attributes are mapped into a multidimensional hyperspace, which leads to efficient implementations not only for retrieval by primary key, but also for partially-specified secondary attribute searches and range queries.

...read moreread less

Abstract: Distributed key-value stores are now a standard component of high-performance web services and cloud computing applications. While key-value stores offer significant performance and scalability advantages compared to traditional databases, they achieve these properties through a restricted API that limits object retrieval---an object can only be retrieved by the (primary and only) key under which it was inserted. This paper presents HyperDex, a novel distributed key-value store that provides a unique search primitive that enables queries on secondary attributes. The key insight behind HyperDex is the concept of hyperspace hashing in which objects with multiple attributes are mapped into a multidimensional hyperspace. This mapping leads to efficient implementations not only for retrieval by primary key, but also for partially-specified secondary attribute searches and range queries. A novel chaining protocol enables the system to achieve strong consistency, maintain availability and guarantee fault tolerance. An evaluation of the full system shows that HyperDex is 12-13x faster than Cassandra and MongoDB for finding partially specified objects. Additionally, HyperDex achieves 2-4x higher throughput for get/put operations.

...read moreread less

Proceedings Article•DOI•

Dis-function: Learning distance functions interactively

[...]

Eli T. Brown¹, Jingjing Liu¹, Carla E. Brodley¹, Remco Chang¹•Institutions (1)

Tufts University¹

14 Oct 2012

TL;DR: It is illustrated empirically that with only a few iterations of interaction and optimization, a user can achieve a scatterplot view and its corresponding distance function that reflect the user's knowledge of the data.

...read moreread less

Abstract: The world's corpora of data grow in size and complexity every day, making it increasingly difficult for experts to make sense out of their data. Although machine learning offers algorithms for finding patterns in data automatically, they often require algorithm-specific parameters, such as an appropriate distance function, which are outside the purview of a domain expert. We present a system that allows an expert to interact directly with a visual representation of the data to define an appropriate distance function, thus avoiding direct manipulation of obtuse model parameters. Adopting an iterative approach, our system first assumes a uniformly weighted Euclidean distance function and projects the data into a two-dimensional scatterplot view. The user can then move incorrectly-positioned data points to locations that reflect his or her understanding of the similarity of those data points relative to the other data points. Based on this input, the system performs an optimization to learn a new distance function and then re-projects the data to redraw the scatter-plot. We illustrate empirically that with only a few iterations of interaction and optimization, a user can achieve a scatterplot view and its corresponding distance function that reflect the user's knowledge of the data. In addition, we evaluate our system to assess scalability in data size and data dimension, and show that our system is computationally efficient and can provide an interactive or near-interactive user experience.

...read moreread less

Journal Article•DOI•

Algorithms and data structures for massively parallel generic adaptive finite element codes

[...]

Wolfgang Bangerth¹, Carsten Burstedde², Timo Heister³, Martin Kronbichler⁴•Institutions (4)

Texas A&M University¹, University of Texas at Austin², University of Göttingen³, Uppsala University⁴

05 Jan 2012-ACM Transactions on Mathematical Software

TL;DR: This work develops scalable algorithms and data structures for generic finite element methods that consider the parallel distribution of mesh data, global enumeration of degrees of freedom, constraints, and postprocessing, and removes the bottlenecks that typically limit large-scale adaptive finite element analyses.

...read moreread less

Abstract: Today's largest supercomputers have 100,000s of processor cores and offer the potential to solve partial differential equations discretized by billions of unknowns. However, the complexity of scaling to such large machines and problem sizes has so far prevented the emergence of generic software libraries that support such computations, although these would lower the threshold of entry and enable many more applications to benefit from large-scale computing.We are concerned with providing this functionality for mesh-adaptive finite element computations. We assume the existence of an “oracle” that implements the generation and modification of an adaptive mesh distributed across many processors, and that responds to queries about its structure. Based on querying the oracle, we develop scalable algorithms and data structures for generic finite element methods. Specifically, we consider the parallel distribution of mesh data, global enumeration of degrees of freedom, constraints, and postprocessing. Our algorithms remove the bottlenecks that typically limit large-scale adaptive finite element analyses.We demonstrate scalability of complete finite element workflows on up to 16,384 processors. An implementation of the proposed algorithms, based on the open source software p4est as mesh oracle, is provided under an open source license through the widely used deal.II finite element software library.

...read moreread less

Journal Article•DOI•

Toward scalable internet traffic measurement and analysis with Hadoop

[...]

Yeon Hee Lee¹, Youngseok Lee¹•Institutions (1)

Chungnam National University¹

09 Jan 2012

TL;DR: This paper presents a Hadoop-based traffic monitoring system that performs IP, TCP, HTTP, and NetFlow analysis of multi-terabytes of Internet traffic in a scalable manner and explains the performance issues related with traffic analysis MapReduce jobs.

...read moreread less

Abstract: Internet traffic measurement and analysis has long been used to characterize network usage and user behaviors, but faces the problem of scalability under the explosive growth of Internet traffic and high-speed access. Scalable Internet traffic measurement and analysis is difficult because a large data set requires matching computing and storage resources. Hadoop, an open-source computing platform of MapReduce and a distributed file system, has become a popular infrastructure for massive data analytics because it facilitates scalable data processing and storage services on a distributed computing system consisting of commodity hardware. In this paper, we present a Hadoop-based traffic monitoring system that performs IP, TCP, HTTP, and NetFlow analysis of multi-terabytes of Internet traffic in a scalable manner. From experiments with a 200-node testbed, we achieved 14 Gbps throughput for 5 TB files with IP and HTTP-layer analysis MapReduce jobs. We also explain the performance issues related with traffic analysis MapReduce jobs.

...read moreread less

Proceedings Article•DOI•

Efficient transaction processing in SAP HANA database: the end of a column store myth

[...]

Vishal Sikka, Franz Färber, Wolfgang Lehner, Sang Kyun Cha, Thomas Peh, Christof Bornhövd - Show less +2 more

20 May 2012

TL;DR: The paper aims at illustrating how the SAP HANA database is able to efficiently work in analytical as well as transactional workload environments.

...read moreread less

Abstract: The SAP HANA database is the core of SAP's new data management platform. The overall goal of the SAP HANA database is to provide a generic but powerful system for different query scenarios, both transactional and analytical, on the same data representation within a highly scalable execution environment. Within this paper, we highlight the main features that differentiate the SAP HANA database from classical relational database engines. Therefore, we outline the general architecture and design criteria of the SAP HANA in a first step. In a second step, we challenge the common belief that column store data structures are only superior in analytical workloads and not well suited for transactional workloads. We outline the concept of record life cycle management to use different storage formats for the different stages of a record. We not only discuss the general concept but also dive into some of the details of how to efficiently propagate records through their life cycle and moving database entries from write-optimized to read-optimized storage formats. In summary, the paper aims at illustrating how the SAP HANA database is able to efficiently work in analytical as well as transactional workload environments.

...read moreread less

Collapse