Showing papers on "Scalability published in 2011"

PDF

Open Access

Journal Article•DOI•

A distributed, architecture-centric approach to computing accurate recommendations from very large and sparse datasets

[...]

Hossein Saiedian¹, Serhiy Morozov¹•Institutions (1)

01 Jan 2011-Annual Review of Ecology, Evolution, and Systematics

TL;DR: This work introduces a novel architecture model that supports scalable, distributed suggestions from multiple independent nodes, and proposes a novel algorithm that generates a more optimal recommender input, which is the reason for a considerable accuracy improvement.

...read moreread less

Abstract: The use of recommender systems is an emerging trend today, when user behavior information is abundant. There are many large datasets available for analysis because many businesses are interested in future user opinions. Sophisticated algorithms that predict such opinions can simplify decision-making, improve customer satisfaction, and increase sales. However, modern datasets contain millions of records, which represent only a small fraction of all possible data. Furthermore, much of the information in such sparse datasets may be considered irrelevant for making individual recommendations. As a result, there is a demand for a way to make personalized suggestions from large amounts of noisy data. Current recommender systems are usually all-in-one applications that provide one type of recommendation. Their inflexible architectures prevent detailed examination of recommendation accuracy and its causes. We introduce a novel architecture model that supports scalable, distributed suggestions from multiple independent nodes. Our model consists of two components, the input matrix generation algorithm and multiple platform-independent combination algorithms. A dedicated input generation component provides the necessary data for combination algorithms, reduces their size, and eliminates redundant data processing. Likewise, simple combination algorithms can produce recommendations from the same input, so we can more easily distinguish between the benefits of a particular combination algorithm and the quality of the data it receives. Such flexible architecture is more conducive for a comprehensive examination of our system. We believe that a user's future opinion may be inferred from a small amount of data, provided that this data is most relevant. We propose a novel algorithm that generates a more optimal recommender input. Unlike existing approaches, our method sorts the relevant data twice. Doing this is slower, but the quality of the resulting input is considerably better. Furthermore, the modular nature of our approach may improve its performance, especially in the cloud computing context. We implement and validate our proposed model via mathematical modeling, by appealing to statistical theories, and through extensive experiments, data analysis, and empirical studies. Our empirical study examines the effectiveness of accuracy improvement techniques for collaborative filtering recommender systems. We evaluate our proposed architecture model on the Netflix dataset, a popular (over 130,000 solutions), large (over 100,000,000 records), and extremely sparse (1.1%) collection of movie ratings. The results show that combination algorithm tuning has little effect on recommendation accuracy. However, all algorithms produce better results when supplied with a more relevant input. Our input generation algorithm is the reason for a considerable accuracy improvement.

...read moreread less

1,957 citations

Journal Article•DOI•

Scalable SQL and NoSQL data stores

[...]

Rick Cattell

06 May 2011

TL;DR: This paper examines a number of SQL and socalled "NoSQL" data stores designed to scale simple OLTP-style application loads over many servers, and contrasts the new systems on their data model, consistency mechanisms, storage mechanisms, durability guarantees, availability, query support, and other dimensions.

...read moreread less

Abstract: In this paper, we examine a number of SQL and socalled "NoSQL" data stores designed to scale simple OLTP-style application loads over many servers. Originally motivated by Web 2.0 applications, these systems are designed to scale to thousands or millions of users doing updates as well as reads, in contrast to traditional DBMSs and data warehouses. We contrast the new systems on their data model, consistency mechanisms, storage mechanisms, durability guarantees, availability, query support, and other dimensions. These systems typically sacrifice some of these dimensions, e.g. database-wide transaction consistency, in order to achieve others, e.g. higher availability and scalability.

...read moreread less

1,412 citations

Book•

Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data

[...]

Paul Zikopoulos, Chris Eaton

19 Oct 2011

TL;DR: This book reveals how IBM is leveraging open source Big Data technology, infused with IBM technologies, to deliver a robust, secure, highly available, enterprise-class Big Data platform.

...read moreread less

Abstract: Big Data represents a new era in data exploration and utilization, and IBM is uniquely positioned to help clients navigate this transformation. This book reveals how IBM is leveraging open source Big Data technology, infused with IBM technologies, to deliver a robust, secure, highly available, enterprise-class Big Data platform. The three defining characteristics of Big Data--volume, variety, and velocity--are discussed. You'll get a primer on Hadoop and how IBM is hardening it for the enterprise, and learn when to leverage IBM InfoSphere BigInsights (Big Data at rest) and IBM InfoSphere Streams (Big Data in motion) technologies. Industry use cases are also included in this practical guide. Learn how IBM hardens Hadoop for enterprise-class scalability and reliability Gain insight into IBM's unique in-motion and at-rest Big Data analytics platform Learn tips and tricks for Big Data use cases and solutions Get a quick Hadoop primer

...read moreread less

1,290 citations

Journal Article•DOI•

A survey on clustering algorithms for wireless sensor networks

[...]

Olutayo Boyinbode, Hanh Le, Makoto Takizawa

25 May 2011-International Journal of Space-Based and Situated Computing

TL;DR: This paper synthesises existing clustering algorithms news's and highlights the challenges in clustering.

...read moreread less

Abstract: A wireless sensor network (WSN) consisting of a large number of tiny sensors can be an effective tool for gathering data in diverse kinds of environments. The data collected by each sensor is communicated to the base station, which forwards the data to the end user. Clustering is introduced to WSNs because it has proven to be an effective approach to provide better data aggregation and scalability for large WSNs. Clustering also conserves the limited energy resources of the sensors. This paper synthesises existing clustering algorithms in WSNs and highlights the challenges in clustering.

...read moreread less

1,097 citations

Benchmarking modern multiprocessors

[...]

Kai Li¹, Christian Bienia¹•Institutions (1)

Princeton University¹

01 Jan 2011

TL;DR: A methodology to design effective benchmark suites is developed and its effectiveness is demonstrated by developing and deploying a benchmark suite for evaluating multiprocessors called PARSEC, which has been adopted by many architecture groups in both research and industry.

...read moreread less

Abstract: Benchmarking has become one of the most important methods for quantitative performance evaluation of processor and computer system designs. Benchmarking of modern multiprocessors such as chip multiprocessors is challenging because of their application domain, scalability and parallelism requirements. In my thesis, I have developed a methodology to design effective benchmark suites and demonstrated its effectiveness by developing and deploying a benchmark suite for evaluating multiprocessors. More specifically, this thesis includes several contributions. First, the thesis shows that a new benchmark suite for multiprocessors is needed because the behavior of modern parallel programs is significantly different from those represented by SPLASH-2, the most popular parallel benchmark suite developed over ten years ago. Second, the thesis quantitatively describes the requirements and characteristics of a set of multithreaded programs and their underlying technology trends. Third, the thesis presents a systematic approach to scale and select benchmark inputs with the goal of optimizing benchmarking accuracy subject to constrained execution or simulation time. Finally, the thesis describes a parallel benchmark suite called PARSEC for evaluating modern shared-memory multiprocessors. Since its initial release, PARSEC has been adopted by many architecture groups in both research and industry.

...read moreread less

1,043 citations

Journal Article•DOI•

VL2: a scalable and flexible data center network

[...]

Albert Greenberg¹, James R. Hamilton², Navendu Jain¹, Srikanth Kandula¹, Changhoon Kim¹, Parantap Lahiri¹, David A. Maltz¹, Parveen Patel¹, Sudipta Sengupta¹ - Show less +5 more•Institutions (2)

Microsoft¹, Amazon.com²

01 Mar 2011-Communications of The ACM

TL;DR: VL2 is a practical network architecture that scales to support huge data centers with uniform high capacity between servers, performance isolation between services, and Ethernet layer-2 semantics and can be deployed today, and a working prototype is built.

...read moreread less

Abstract: To be agile and cost effective, data centers must allow dynamic resource allocation across large server pools. In particular, the data center network should provide a simple flat abstraction: it should be able to take any set of servers anywhere in the data center and give them the illusion that they are plugged into a physically separate, noninterfering Ethernet switch with as many ports as the service needs. To meet this goal, we present VL2, a practical network architecture that scales to support huge data centers with uniform high capacity between servers, performance isolation between services, and Ethernet layer-2 semantics. VL2 uses (1) flat addressing to allow service instances to be placed anywhere in the network, (2) Valiant Load Balancing to spread traffic uniformly across network paths, and (3) end system--based address resolution to scale to large server pools without introducing complexity to the network control plane. VL2's design is driven by detailed measurements of traffic and fault data from a large operational cloud service provider. VL2's implementation leverages proven network technologies, already available at low cost in high-speed hardware implementations, to build a scalable and reliable network architecture. As a result, VL2 networks can be deployed today, and we have built a working prototype. We evaluate the merits of the VL2 design using measurement, analysis, and experiments. Our VL2 prototype shuffles 2.7 TB of data among 75 servers in 395 s---sustaining a rate that is 94% of the maximum possible.

...read moreread less

981 citations

Book Chapter•DOI•

SpaceEx: scalable verification of hybrid systems

[...]

Goran Frehse¹, Colas Le Guernic², Alexandre Donzé¹, Scott Cotton¹, Rajarshi Ray¹, Olivier Lebeltel¹, Rodolfo Ripado¹, Antoine Girard³, Thao Dang¹, Oded Maler¹ - Show less +6 more•Institutions (3)

Centre national de la recherche scientifique¹, New York University², University of Grenoble³

14 Jul 2011

TL;DR: A scalable reachability algorithm for hybrid systems with piecewise affine, non-deterministic dynamics that combines polyhedra and support function representations of continuous sets to compute an over-approximation of the reachable states is presented.

...read moreread less

Abstract: We present a scalable reachability algorithm for hybrid systems with piecewise affine, non-deterministic dynamics. It combines polyhedra and support function representations of continuous sets to compute an over-approximation of the reachable states. The algorithm improves over previous work by using variable time steps to guarantee a given local error bound. In addition, we propose an improved approximation model, which drastically improves the accuracy of the algorithm. The algorithm is implemented as part of SpaceEx, a new verification platform for hybrid systems, available at spaceex.imag.fr. Experimental results of full fixed-point computations with hybrid systems with more than 100 variables illustrate the scalability of the approach.

...read moreread less

901 citations

Proceedings Article•DOI•

Sniper: exploring the level of abstraction for scalable and accurate parallel multi-core simulation

[...]

Trevor E. Carlson¹, Wim Heirmant¹, Lieven Eeckhout¹•Institutions (1)

Ghent University¹

12 Nov 2011

TL;DR: Interval simulation provides a balance between detailed cycle-accurate simulation and one-IPC simulation, allowing long-running simulations to be modeled much faster than with detailed cycle, while still providing the detail necessary to observe core-uncore interactions across the entire system.

...read moreread less

Abstract: Two major trends in high-performance computing, namely, larger numbers of cores and the growing size of on-chip cache memory, are creating significant challenges for evaluating the design space of future processor architectures. Fast and scalable simulations are therefore needed to allow for sufficient exploration of large multi-core systems within a limited simulation time budget. By bringing together accurate high-abstraction analytical models with fast parallel simulation, architects can trade off accuracy with simulation speed to allow for longer application runs, covering a larger portion of the hardware design space. Interval simulation provides this balance between detailed cycle-accurate simulation and one-IPC simulation, allowing long-running simulations to be modeled much faster than with detailed cycle-accurate simulation, while still providing the detail necessary to observe core-uncore interactions across the entire system. Validations against real hardware show average absolute errors within 25% for a variety of multi-threaded workloads; more than twice as accurate on average as one-IPC simulation. Further, we demonstrate scalable simulation speed of up to 2.0 MIPS when simulating a 16-core system on an 8-core SMP machine.

...read moreread less

818 citations

Proceedings Article•

Megastore: Providing Scalable, Highly Available Storage for Interactive Services

[...]

Jason Baker¹, Christopher N. Bond¹, James C. Corbett¹, J. J. Furman¹, Andrey Khorlin¹, James Larson¹, Jean-Michel Leon¹, Yawei Li¹, Alexander Lloyd¹, Vadim Yushprakh¹ - Show less +6 more•Institutions (1)

Google¹

01 Jan 2011

TL;DR: Megastore provides fully serializable ACID semantics within ne-grained partitions of data, which allows us to synchronously replicate each write across a wide area network with reasonable latency and support seamless failover between datacenters.

...read moreread less

Abstract: Megastore is a storage system developed to meet the requirements of today’s interactive online services. Megastore blends the scalability of a NoSQL datastore with the convenience of a traditional RDBMS in a novel way, and provides both strong consistency guarantees and high availability. We provide fully serializable ACID semantics within ne-grained partitions of data. This partitioning allows us to synchronously replicate each write across a wide area network with reasonable latency and support seamless failover between datacenters. This paper describes Megastore’s semantics and replication algorithm. It also describes our experience supporting a wide range of Google production services built with Megastore.

...read moreread less

802 citations

Journal Article•DOI•

Simulating LTE Cellular Systems: An Open-Source Framework

[...]

Giuseppe Piro, Luigi Alfredo Grieco, Gennaro Boggia, Francesco Capozzi, Pietro Camarda - Show less +1 more

01 Feb 2011-IEEE Transactions on Vehicular Technology

TL;DR: The open-source framework LTE-Sim is presented to provide a complete performance verification of LTE networks and has been conceived to simulate uplink and downlink scheduling strategies in multicell/multiuser environments, taking into account user mobility, radio resource optimization, frequency reuse techniques, the adaptive modulation and coding module, and other aspects that are very relevant to the industrial and scientific communities.

...read moreread less

Abstract: Long-term evolution (LTE) represents an emerging and promising technology for providing broadband ubiquitous Internet access. For this reason, several research groups are trying to optimize its performance. Unfortunately, at present, to the best of our knowledge, no open-source simulation platforms, which the scientific community can use to evaluate the performance of the entire LTE system, are freely available. The lack of a common reference simulator does not help the work of researchers and poses limitations on the comparison of results claimed by different research groups. To bridge this gap, herein, the open-source framework LTE-Sim is presented to provide a complete performance verification of LTE networks. LTE-Sim has been conceived to simulate uplink and downlink scheduling strategies in multicell/multiuser environments, taking into account user mobility, radio resource optimization, frequency reuse techniques, the adaptive modulation and coding module, and other aspects that are very relevant to the industrial and scientific communities. The effectiveness of the proposed simulator has been tested and verified considering 1) the software scalability test, which analyzes both memory and simulation time requirements; and 2) the performance evaluation of a realistic LTE network providing a comparison among well-known scheduling strategies.

...read moreread less

685 citations

Proceedings Article•DOI•

Don't settle for eventual: scalable causal consistency for wide-area storage with COPS

[...]

Wyatt Lloyd¹, Michael J. Freedman¹, Michael Kaminsky², David G. Andersen³•Institutions (3)

Princeton University¹, Intel², Carnegie Mellon University³

23 Oct 2011

TL;DR: This paper identifies and defines a consistency model---causal consistency with convergent conflict handling, or causal+---that is the strongest achieved under these constraints and presents the design and implementation of COPS, a key-value store that delivers this consistency model across the wide-area.

...read moreread less

Abstract: Geo-replicated, distributed data stores that support complex online applications, such as social networks, must provide an "always-on" experience where operations always complete with low latency. Today's systems often sacrifice strong consistency to achieve these goals, exposing inconsistencies to their clients and necessitating complex application logic. In this paper, we identify and define a consistency model---causal consistency with convergent conflict handling, or causal+---that is the strongest achieved under these constraints. We present the design and implementation of COPS, a key-value store that delivers this consistency model across the wide-area. A key contribution of COPS is its scalability, which can enforce causal dependencies between keys stored across an entire cluster, rather than a single server like previous systems. The central approach in COPS is tracking and explicitly checking whether causal dependencies between keys are satisfied in the local cluster before exposing writes. Further, in COPS-GT, we introduce get transactions in order to obtain a consistent view of multiple keys without locking or blocking. Our evaluation shows that COPS completes operations in less than a millisecond, provides throughput similar to previous systems when using one server per cluster, and scales well as we increase the number of servers in each cluster. It also shows that COPS-GT provides similar latency, throughput, and scaling to COPS for common workloads.

...read moreread less

Journal Article•DOI•

Scalable SPARQL querying of large RDF graphs

[...]

Jiewen Huang¹, Daniel J. Abadi¹, Kun Ren²•Institutions (2)

Yale University¹, Northwestern Polytechnical University²

01 Aug 2011

TL;DR: This paper introduces a scalable RDF data management system that is up to three orders of magnitude more efficient than popular multi-node RDFData management systems.

...read moreread less

Abstract: The generation of RDF data has accelerated to the point where many data sets need to be partitioned across multiple machines in order to achieve reasonable performance when querying the data. Although tremendous progress has been made in the Semantic Web community for achieving high performance data management on a single node, current solutions that allow the data to be partitioned across multiple machines are highly inefficient. In this paper, we introduce a scalable RDF data management system that is up to three orders of magnitude more efficient than popular multi-node RDF data management systems. In so doing, we introduce techniques for (1) leveraging state-of-the-art single node RDF-store technology (2) partitioning the data across nodes in a manner that helps accelerate query processing through locality optimizations and (3) decomposing SPARQL queries into high performance fragments that take advantage of how data is partitioned in a cluster.

...read moreread less

Journal Article•DOI•

Dynamically scaling applications in the cloud

[...]

Luis M. Vaquero¹, Luis Rodero-Merino², Rajkumar Buyya³•Institutions (3)

Hewlett-Packard¹, French Institute for Research in Computer Science and Automation², University of Melbourne³

22 Jan 2011

TL;DR: Most notable initiatives towards whole application scalability in cloud environments are presented and relevant efforts at the edge of state of the art technology are presented, providing an encompassing overview of the trends they each follow.

...read moreread less

Abstract: Scalability is said to be one of the major advantages brought by the cloud paradigm and, more specifically, the one that makes it different to an "advanced outsourcing" solution. However, there are some important pending issues before making the dreamed automated scaling for applications come true. In this paper, the most notable initiatives towards whole application scalability in cloud environments are presented. We present relevant efforts at the edge of state of the art technology, providing an encompassing overview of the trends they each follow. We also highlight pending challenges that will likely be addressed in new research efforts and present an ideal scalable cloud system.

...read moreread less

Journal Article•DOI•

The MOPED framework: Object recognition and pose estimation for manipulation

[...]

Alvaro Collet¹, Manuel Martinez¹, Siddhartha S. Srinivasa²•Institutions (2)

Carnegie Mellon University¹, Intel²

01 Sep 2011-The International Journal of Robotics Research

TL;DR: MOPED, a framework for Multiple Object Pose Estimation and Detection that seamlessly integrates single-image and multi-image object recognition and pose estimation in one optimized, robust, and scalable framework is presented.

...read moreread less

Abstract: We present MOPED, a framework for Multiple Object Pose Estimation and Detection that seamlessly integrates single-image and multi-image object recognition and pose estimation in one optimized, robust, and scalable framework. We address two main challenges in computer vision for robotics: robust performance in complex scenes, and low latency for real-time operation. We achieve robust performance with Iterative Clustering Estimation (ICE), a novel algorithm that iteratively combines feature clustering with robust pose estimation. Feature clustering quickly partitions the scene and produces object hypotheses. The hypotheses are used to further refine the feature clusters, and the two steps iterate until convergence. ICE is easy to parallelize, and easily integrates single- and multi-camera object recognition and pose estimation. We also introduce a novel object hypothesis scoring function based on M-estimator theory, and a novel pose clustering algorithm that robustly handles recognition outliers. We achieve scalability and low latency with an improved feature matching algorithm for large databases, a GPU/CPU hybrid architecture that exploits parallelism at all levels, and an optimized resource scheduler. We provide extensive experimental results demonstrating state-of-the-art performance in terms of recognition, scalability, and latency in real-world robotic applications.

...read moreread less

Journal Article•DOI•

The Combinatorial BLAS: design, implementation, and applications

[...]

Aydin Buluc¹, John R. Gilbert²•Institutions (2)

Lawrence Berkeley National Laboratory¹, University of California, Berkeley²

01 Nov 2011

TL;DR: The parallel Combinatorial BLAS is described, which consists of a small but powerful set of linear algebra primitives specifically targeting graph and data mining applications, and an extensible library interface and some guiding principles for future development are provided.

...read moreread less

Abstract: This paper presents a scalable high-performance software library to be used for graph analysis and data mining. Large combinatorial graphs appear in many applications of high-performance computing, including computational biology, informatics, analytics, web search, dynamical systems, and sparse matrix methods. Graph computations are difficult to parallelize using traditional approaches due to their irregular nature and low operational intensity. Many graph computations, however, contain sufficient coarse-grained parallelism for thousands of processors, which can be uncovered by using the right primitives. We describe the parallel Combinatorial BLAS, which consists of a small but powerful set of linear algebra primitives specifically targeting graph and data mining applications. We provide an extensible library interface and some guiding principles for future development. The library is evaluated using two important graph algorithms, in terms of both performance and ease-of-use. The scalability and raw performance of the example applications, using the Combinatorial BLAS, are unprecedented on distributed memory clusters.

...read moreread less

Proceedings Article•DOI•

Hybrid memory cube (HMC)

[...]

J. Thomas Pawlowski¹•Institutions (1)

Micron Technology¹

01 Aug 2011

Journal Article•DOI•

A Survey of Large Scale Data Management Approaches in Cloud Environments

[...]

Sherif Sakr¹, Anna Liu¹, Daniel Macedo Batista², Mohammad Ahmed Alomari³•Institutions (3)

University of New South Wales¹, State University of Campinas², Information Technology University³

05 Apr 2011-IEEE Communications Surveys and Tutorials

TL;DR: This paper gives a comprehensive survey of numerous approaches and mechanisms of deploying data-intensive applications in the cloud and analyzes the various design decisions of each approach and its suitability to support certain classes of applications and end-users.

...read moreread less

Abstract: In the last two decades, the continuous increase of computational power has produced an overwhelming flow of data. Moreover, the recent advances in Web technology has made it easy for any user to provide and consume content of any form. This has called for a paradigm shift in the computing architecture and large scale data processing mechanisms. Cloud computing is associated with a new paradigm for the provision of computing infrastructure. This paradigm shifts the location of this infrastructure to the network to reduce the costs associated with the management of hardware and software resources. This paper gives a comprehensive survey of numerous approaches and mechanisms of deploying data-intensive applications in the cloud which are gaining a lot of momentum in both research and industrial communities. We analyze the various design decisions of each approach and its suitability to support certain classes of applications and end-users. A discussion of some open issues and future challenges pertaining to scalability, consistency, economical processing of large scale data on the cloud is provided. We highlight the characteristics of the best candidate classes of applications that can be deployed in the cloud.

...read moreread less

Journal Article•DOI•

Dynamic QoS Management and Optimization in Service-Based Systems

[...]

Radu Calinescu¹, Lars Grunske², Marta Kwiatkowska³, Raffaela Mirandola, Giordano Tamburrelli - Show less +1 more•Institutions (3)

Aston University¹, Swinburne University of Technology², University of Oxford³

01 May 2011-IEEE Transactions on Software Engineering

TL;DR: This work introduces a novel, tool-supported framework for the development of adaptive service-based systems called QoSMOS (QoS Management and Optimization of Service- based systems), which translates high-level QoS requirements specified by their administrators into probabilistic temporal logic formulae, which are then formally and automatically analyzed to identify and enforce optimal system configurations.

...read moreread less

Abstract: Service-based systems that are dynamically composed at runtime to provide complex, adaptive functionality are currently one of the main development paradigms in software engineering. However, the Quality of Service (QoS) delivered by these systems remains an important concern, and needs to be managed in an equally adaptive and predictable way. To address this need, we introduce a novel, tool-supported framework for the development of adaptive service-based systems called QoSMOS (QoS Management and Optimization of Service-based systems). QoSMOS can be used to develop service-based systems that achieve their QoS requirements through dynamically adapting to changes in the system state, environment, and workload. QoSMOS service-based systems translate high-level QoS requirements specified by their administrators into probabilistic temporal logic formulae, which are then formally and automatically analyzed to identify and enforce optimal system configurations. The QoSMOS self-adaptation mechanism can handle reliability and performance-related QoS requirements, and can be integrated into newly developed solutions or legacy systems. The effectiveness and scalability of the approach are validated using simulations and a set of experiments based on an implementation of an adaptive service-based system for remote medical assistance.

...read moreread less

Journal Article•DOI•

Join-Idle-Queue: A novel load balancing algorithm for dynamically scalable web services

[...]

Yi Lu¹, Qiaomin Xie¹, Gabriel Kliot², Alan S. Geller², James R. Larus², Albert Greenberg² - Show less +2 more•Institutions (2)

University of Illinois at Urbana–Champaign¹, Microsoft²

01 Nov 2011-Performance Evaluation

TL;DR: This work proposes a novel class of algorithms called Join-Idle-Queue (JIQ) for distributed load balancing in large systems, which effectively results in a reduced system load and produces 30-fold reduction in queueing overhead compared to Power-of-Two at medium to high load.

...read moreread less

Relational Cloud: A Database-as-a-Service for the Cloud

[...]

Carlo Curino¹, Evan P. C. Jones¹, Raluca Ada Popa¹, Nirmesh Malviya¹, Eugene Wu¹, Samuel Madden¹, Hari Balakrishnan¹, Nickolai Zeldovich¹ - Show less +4 more•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 2011

TL;DR: Relational Cloud as discussed by the authors is a transactional database-as-a-service (DBaaS) system that uses a graph-based data partitioning algorithm to achieve near-linear elastic scalability.

...read moreread less

Abstract: This paper introduces a new transactional “database-as-a-service” (DBaaS) called Relational Cloud. A DBaaS promises to move much of the operational burden of provisioning, configuration, scaling, performance tuning, backup, privacy, and access control from the database users to the service operator, offering lower overall costs to users. Early DBaaS efforts include Amazon RDS and Microsoft SQL Azure, which are promising in terms of establishing the market need for such a service, but which do not address three important challenges: efficient multi-tenancy, elastic scalability, and database privacy. We argue that these three challenges must be overcome before outsourcing database software and management becomes attractive to many users, and cost-effective for service providers. The key technical features of Relational Cloud include: (1) a workload-aware approach to multi-tenancy that identifies the workloads that can be co-located on a database server, achieving higher consolidation and better performance than existing approaches; (2) the use of a graph-based data partitioning algorithm to achieve near-linear elastic scale-out even for complex transactional workloads; and (3) an adjustable security scheme that enables SQL queries to run over encrypted data, including ordering operations, aggregates, and joins. An underlying theme in the design of the components of Relational Cloud is the notion of workload awareness: by monitoring query patterns and data accesses, the system obtains information useful for various optimization and security functions, reducing the configuration effort for users and operators.

...read moreread less

Book Chapter•DOI•

Conflict-free replicated data types

[...]

Marc Shapiro¹, Nuno Preguiça², Carlos Baquero³, Marek Zawirski⁴•Institutions (4)

French Institute for Research in Computer Science and Automation¹, Universidade Nova de Lisboa², University of Minho³, Pierre-and-Marie-Curie University⁴

10 Oct 2011

TL;DR: The Conflict-free Replicated Data Type (CRDT) as discussed by the authors is a data type that is guaranteed to converge in a self-stabilising manner, despite any number of failures.

...read moreread less

Abstract: Replicating data under Eventual Consistency (EC) allows any replica to accept updates without remote synchronisation. This ensures performance and scalability in large-scale distributed systems (e.g., clouds). However, published EC approaches are ad-hoc and error-prone. Under a formal Strong Eventual Consistency (SEC) model, we study sufficient conditions for convergence. A data type that satisfies these conditions is called a Conflict-free Replicated Data Type (CRDT). Replicas of any CRDT are guaranteed to converge in a self-stabilising manner, despite any number of failures. This paper formalises two popular approaches (state- and operation-based) and their relevant sufficient conditions. We study a number of useful CRDTs, such as sets with clean semantics, supporting both add and remove operations, and consider in depth the more complex Graph data type. CRDT types can be composed to develop large-scale distributed applications, and have interesting theoretical properties.

...read moreread less

Journal Article•DOI•

SenseLess: A database-driven white spaces network

[...]

Rohan Murty¹, Ranveer Chandra², Thomas Moscibroda², Paramvir Bahl²•Institutions (2)

Massachusetts Institute of Technology¹, Microsoft²

03 May 2011

TL;DR: This is the first paper that identifies and examines the systems and networking challenges that arise from operating a white space network, which is solely dependent on a channel occupancy database, and presents SenseLess, a database driven white spaces network.

...read moreread less

Abstract: The most recent FCC ruling proposes relying on a database of incumbents as the primary means of determining white space availability at any white spaces device (WSD). While the ruling provides broad guidelines for the database, the specifics of its design, features, implementation, and use are yet to be determined. Furthermore, architecting a network where all WSDs rely on the database raises several systems and networking challenges that have remained unexplored. Also, the ruling treats the database only as a storehouse for incumbents. We believe that the mandated use of the database has an additional opportunity: a means to dynamically manage the RF spectrum. Motivated by this opportunity, in this paper we present SenseLess, a database driven white spaces network. As suggested by its very name, in SenseLess, WSDs obviate the need to sense the spectrum by relying entirely on a database service to determine white spaces availability. The service, using a combination of an up-to-date database of incumbents, sophisticated signal propagation modeling, and an efficient content dissemination mechanism ensures efficient, scalable, and safe white space network operation. We build, deploy, and evaluate SenseLess and compare our results to ground truth spectrum measurements. We present the unique system design considerations that arise due to operating over the white spaces. We also evaluate its efficiency and scalability. To the best of our knowledge, this is the first paper that identifies and examines the systems and networking challenges that arise from operating a white space network, which is solely dependent on a channel occupancy database.

...read moreread less

Patent•

Distributed deduplicated storage system

[...]

Manoj Kumar Vijayan Retnamma, Rajiv Kottomtharayil, Deepak Raghunath Attarde

13 Dec 2011

TL;DR: In this article, a distributed, deduplicated storage system according to certain embodiments is arranged in a parallel configuration including multiple dedusplication nodes, where the dedus can be networked together and communicate with one another according using a light-weight, customized communication scheme (e.g., a scheme based on FTP or HTTP).

...read moreread less

Abstract: A distributed, deduplicated storage system according to certain embodiments is arranged in a parallel configuration including multiple deduplication nodes. Deduplicated data is distributed across the deduplication nodes. The deduplication nodes can be networked together and communicate with one another according using a light-weight, customized communication scheme (e.g., a scheme based on FTP or HTTP). In some cases, deduplication management information including deduplication signatures and/or other metadata is stored separately from the deduplicated data in deduplication management nodes, improving performance and scalability.

...read moreread less

Proceedings Article•DOI•

NetworkCloudSim: Modelling Parallel Applications in Cloud Simulations

[...]

Saurabh Garg¹, Rajkumar Buyya¹•Institutions (1)

University of Melbourne¹

05 Dec 2011

TL;DR: A popular Cloud simulator (CloudSim) is extended with a scalable network and generalized application model, which allows more accurate evaluation of scheduling and resource provisioning policies to optimize the performance of a Cloud infrastructure.

...read moreread less

Abstract: As interest in adopting Cloud computing for various applications is rapidly growing, it is important to understand how these applications and systems will perform when deployed on Clouds Due to the scale and complexity of shared resources, it is often hard to analyze the performance of new scheduling and provisioning algorithms on actual Cloud test beds Therefore, simulation tools are becoming more and more important in the evaluation of the Cloud computing model Simulation tools allow researchers to rapidly evaluate the efficiency, performance and reliability of their new algorithms on a large heterogeneous Cloud infrastructure However, current solutions lack either advanced application models such as message passing applications and workflows or scalable network model of data center To fill this gap, we have extended a popular Cloud simulator (CloudSim) with a scalable network and generalized application model, which allows more accurate evaluation of scheduling and resource provisioning policies to optimize the performance of a Cloud infrastructure

...read moreread less

Journal Article•DOI•

Scalable Localization with Mobility Prediction for Underwater Sensor Networks

[...]

Zhong Zhou¹, Zheng Peng¹, Jun-Hong Cui¹, Zhijie Shi¹, Amvrossios C. Bagtzoglou¹ - Show less +1 more•Institutions (1)

University of Connecticut¹

01 Mar 2011-IEEE Transactions on Mobile Computing

TL;DR: By utilizing the predictable mobility patterns of underwater objects, a scheme, called Scalable Localization scheme with Mobility Prediction (SLMP), for underwater sensor networks is proposed, and results show that SLMP can greatly reduce localization communication cost while maintaining relatively high localization coverage and localization accuracy.

...read moreread less

Abstract: Due to harsh aqueous environments, non-negligible node mobility and large network scale, localization for large-scale mobile underwater sensor networks is very challenging. In this paper, by utilizing the predictable mobility patterns of underwater objects, we propose a scheme, called Scalable Localization scheme with Mobility Prediction (SLMP), for underwater sensor networks. In SLMP, localization is performed in a hierarchical way, and the whole localization process is divided into two parts: anchor node localization and ordinary node localization. During the localization process, every node predicts its future mobility pattern according to its past known location information, and it can estimate its future location based on the predicted mobility pattern. Anchor nodes with known locations in the network will control the localization process in order to balance the trade-off between localization accuracy, localization coverage, and communication cost. We conduct extensive simulations, and our results show that SLMP can greatly reduce localization communication cost while maintaining relatively high localization coverage and localization accuracy.

...read moreread less

Journal Article•DOI•

CloVR: A virtual machine for automated and portable sequence analysis from the desktop using cloud computing

[...]

Samuel V. Angiuoli¹, Samuel V. Angiuoli², Malcolm Matalka¹, Aaron Gussman¹, Kevin Galens¹, Mahesh Vangala¹, David R. Riley¹, Cesar Arze¹, James R. White¹, Owen White¹, W. Florian Fricke¹ - Show less +7 more•Institutions (2)

University of Maryland, Baltimore¹, University of Maryland, College Park²

30 Aug 2011-BMC Bioinformatics

TL;DR: The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing.

...read moreread less

Abstract: Next-generation sequencing technologies have decentralized sequence acquisition, increasing the demand for new bioinformatics tools that are easy to use, portable across multiple platforms, and scalable for high-throughput applications Cloud computing platforms provide on-demand access to computing infrastructure over the Internet and can be used in combination with custom built virtual machines to distribute pre-packaged with pre-configured software We describe the Cloud Virtual Resource, CloVR, a new desktop application for push-button automated sequence analysis that can utilize cloud computing resources CloVR is implemented as a single portable virtual machine (VM) that provides several automated analysis pipelines for microbial genomics, including 16S, whole genome and metagenome sequence analysis The CloVR VM runs on a personal computer, utilizes local computer resources and requires minimal installation, addressing key challenges in deploying bioinformatics workflows In addition CloVR supports use of remote cloud computing resources to improve performance for large-scale sequence processing In a case study, we demonstrate the use of CloVR to automatically process next-generation sequencing data on multiple cloud computing platforms The CloVR VM and associated architecture lowers the barrier of entry for utilizing complex analysis protocols on both local single- and multi-core computers and cloud systems for high throughput data processing

...read moreread less

Journal Article•DOI•

The structural simulation toolkit

[...]

Arun Rodrigues¹, Karl Scott Hemmert¹, Brian W. Barrett¹, C. Kersey¹, Ron A. Oldfield¹, M. Weston¹, R. Risen¹, Jeanine Cook², Paul Rosenfeld³, Elliott Cooper-Balis³, Bruce Jacob³ - Show less +7 more•Institutions (3)

Sandia National Laboratories¹, New Mexico State University², University of Maryland, College Park³

29 Mar 2011

TL;DR: The Structural Simulation Toolkit (SST) as discussed by the authors is an open, modular, parallel, multi-criteria, multiscale simulation framework for HPC systems that includes a number of processor, memory, and network models.

...read moreread less

Abstract: As supercomputers grow, understanding their behavior and performance has become increasingly challenging. New hurdles in scalability, programmability, power consumption, reliability, cost, and cooling are emerging, along with new technologies such as 3D integration, GP-GPUs, silicon-photonics, and other "game changers". Currently, they HPC community lacks a unified toolset to evaluate these technologies and design for these challenges.To address this problem, a number of institutions have joined together to create the Structural Simulation Toolkit (SST), an open, modular, parallel, multi-criteria, multi-scale simulation framework. The SST includes a number of processor, memory, and network models. The SST has been used in a variety of network, memory, and application studies and aims to become the standard simulation framework for designing and procuring HPC systems.

...read moreread less

Proceedings Article•DOI•

Modeling and performance evaluation of an OpenFlow architecture

[...]

Michael Jarschel¹, Simon Oechsner¹, Daniel Schlosser¹, Rastin Pries¹, Sebastian Goll¹, Phuoc Tran-Gia¹ - Show less +2 more•Institutions (1)

University of Würzburg¹

06 Sep 2011

TL;DR: A basic model for the forwarding speed and blocking probability of an OpenFlow switch combined with an Open Flow controller is derived and can be used to estimate the packet sojourn time and the probability of lost packets in such a system.

...read moreread less

Abstract: The OpenFlow concept of flow-based forwarding and separation of the control plane from the data plane provides a new flexibility in network innovation. While initially used solely in the research domain, OpenFlow is now finding its way into commercial applications. However, this creates new challenges, as questions of OpenFlow scalability and performance have not yet been answered. This paper is a first step towards that goal. Based on measurements of switching times of current OpenFlow hardware, we derive a basic model for the forwarding speed and blocking probability of an OpenFlow switch combined with an OpenFlow controller and validate it using a simulation. This model can be used to estimate the packet sojourn time and the probability of lost packets in such a system and can give hints to developers and researchers on questions how an OpenFlow architecture will perform given certain parameters.

...read moreread less

Proceedings Article•DOI•

NetLord: a scalable multi-tenant network architecture for virtualized datacenters

[...]

Jayaram Mudigonda¹, Praveen Yalagandula¹, Jeffrey C. Mogul¹, Bryan Stiekes¹, Yanick Pouffary¹ - Show less +1 more•Institutions (1)

Hewlett-Packard¹

15 Aug 2011

TL;DR: NetLord provides tenants with simple and flexible network abstractions, by fully and efficiently virtualizing the address space at both L2 and L3, and achieving order-of-magnitude goodput improvements over previous approaches.

...read moreread less

Abstract: Providers of "Infrastructure-as-a-Service" need datacenter networks that support multi-tenancy, scale, and ease of operation, at low cost. Most existing network architectures cannot meet all of these needs simultaneously.In this paper we present NetLord, a novel multi-tenant network architecture. NetLord provides tenants with simple and flexible network abstractions, by fully and efficiently virtualizing the address space at both L2 and L3. NetLord can exploit inexpensive commodity equipment to scale the network to several thousands of tenants and millions of virtual machines. NetLord requires only a small amount of offline, one-time configuration. We implemented NetLord on a testbed, and demonstrated its scalability, while achieving order-of-magnitude goodput improvements over previous approaches.

...read moreread less

Journal Article•DOI•

SNVer: a statistical tool for variant calling in analysis of pooled or individual next-generation sequencing data

[...]

Zhi-Jian Wei¹, Wei-wei Wang², Pingzhao Hu², Gholson J. Lyon², Hakon Hakonarson² - Show less +1 more•Institutions (2)

New Jersey Institute of Technology¹, Children's Hospital of Philadelphia²

01 Oct 2011-Nucleic Acids Research

TL;DR: A statistical tool SNVer is developed for calling common and rare variants in analysis of pooled or individual next-generation sequencing (NGS) data and employs a binomial–binomial model to test the significance of observed allele frequency against sequencing error.

...read moreread less

Abstract: We develop a statistical tool SNVer for calling common and rare variants in analysis of pooled or individual next-generation sequencing (NGS) data. We formulate variant calling as a hypothesis testing problem and employ a binomial-binomial model to test the significance of observed allele frequency against sequencing error. SNVer reports one single overall P-value for evaluating the significance of a candidate locus being a variant based on which multiplicity control can be obtained. This is particularly desirable because tens of thousands loci are simultaneously examined in typical NGS experiments. Each user can choose the false-positive error rate threshold he or she considers appropriate, instead of just the dichotomous decisions of whether to 'accept or reject the candidates' provided by most existing methods. We use both simulated data and real data to demonstrate the superior performance of our program in comparison with existing methods. SNVer runs very fast and can complete testing 300 K loci within an hour. This excellent scalability makes it feasible for analysis of whole-exome sequencing data, or even whole-genome sequencing data using high performance computing cluster. SNVer is freely available at http://snver.sourceforge.net/.

...read moreread less

Collapse