Showing papers on "Scalability published in 2013"

PDF

Open Access

Journal Article•DOI•

A survey of mobile cloud computing: architecture, applications, and approaches

[...]

Hoang T. Dinh¹, Chonho Lee¹, Dusit Niyato¹, Ping Wang¹•Institutions (1)

25 Dec 2013-Wireless Communications and Mobile Computing

TL;DR: A survey of MCC is given, which helps general readers have an overview of the MCC including the definition, architecture, and applications and the issues, existing solutions, and approaches are presented.

...read moreread less

Abstract: Together with an explosive growth of the mobile applications and emerging of cloud computing concept, mobile cloud computing (MCC) has been introduced to be a potential technology for mobile services. MCC integrates the cloud computing into the mobile environment and overcomes obstacles related to the performance (e.g., battery life, storage, and bandwidth), environment (e.g., heterogeneity, scalability, and availability), and security (e.g., reliability and privacy) discussed in mobile computing. This paper gives a survey of MCC, which helps general readers have an overview of the MCC including the definition, architecture, and applications. The issues, existing solutions, and approaches are presented. In addition, the future research directions of MCC are discussed. Copyright © 2011 John Wiley & Sons, Ltd.

...read moreread less

2,259 citations

Journal Article•DOI•

Are we ready for SDN? Implementation challenges for software-defined networks

[...]

Sakir Sezer¹, Sandra Scott-Hayward¹, Pushpinder Kaur Chouhan¹, B. Fraser², D. Lake², J. Finnegan, Nicolaas J. Viljoen, M. Miller, N. Rao - Show less +5 more•Institutions (2)

Queen's University Belfast¹, Cisco Systems, Inc.²

12 Jul 2013-IEEE Communications Magazine

TL;DR: The question of how to achieve a successful carrier grade network with software-defined networking is raised and specific focus is placed on the challenges of network performance, scalability, security, and interoperability with the proposal of potential solution directions.

...read moreread less

Abstract: Cloud services are exploding, and organizations are converging their data centers in order to take advantage of the predictability, continuity, and quality of service delivered by virtualization technologies. In parallel, energy-efficient and high-security networking is of increasing importance. Network operators, and service and product providers require a new network solution to efficiently tackle the increasing demands of this changing network landscape. Software-defined networking has emerged as an efficient network technology capable of supporting the dynamic nature of future network functions and intelligent applications while lowering operating costs through simplified hardware, software, and management. In this article, the question of how to achieve a successful carrier grade network with software-defined networking is raised. Specific focus is placed on the challenges of network performance, scalability, security, and interoperability with the proposal of potential solution directions.

...read moreread less

943 citations

Journal Article•DOI•

MillWheel: fault-tolerant stream processing at internet scale

[...]

Tyler Akidau¹, Alex Balikov¹, Kaya Bekiroglu¹, Slava Chernyak¹, Josh Haberman¹, Reuven Lax¹, Sam McVeety¹, Daniel Mills¹, Paul Nordstrom¹, Sam Whittle¹ - Show less +6 more•Institutions (1)

Google¹

01 Aug 2013

TL;DR: In practice, this paper finds that MillWheel's unique combination of scalability, fault tolerance, and a versatile programming model lends itself to a wide variety of problems at Google.

...read moreread less

Abstract: MillWheel is a framework for building low-latency data-processing applications that is widely used at Google. Users specify a directed computation graph and application code for individual nodes, and the system manages persistent state and the continuous flow of records, all within the envelope of the framework's fault-tolerance guarantees.This paper describes MillWheel's programming model as well as its implementation. The case study of a continuous anomaly detector in use at Google serves to motivate how many of MillWheel's features are used. MillWheel's programming model provides a notion of logical time, making it simple to write time-based aggregations. MillWheel was designed from the outset with fault tolerance and scalability in mind. In practice, we find that MillWheel's unique combination of scalability, fault tolerance, and a versatile programming model lends itself to a wide variety of problems at Google.

...read moreread less

582 citations

Journal Article•DOI•

Hadoop GIS: a high performance spatial data warehousing system over mapreduce

[...]

Ablimit Aji¹, Fusheng Wang¹, Hoang Vo¹, Rubao Lee², Qiaoling Liu¹, Xiaodong Zhang², Joel H. Saltz¹ - Show less +3 more•Institutions (2)

Emory University¹, Ohio State University²

01 Aug 2013

TL;DR: Hadoop-GIS - a scalable and high performance spatial data warehousing system for running large scale spatial queries on Hadoop and integrated into Hive to support declarative spatial queries with an integrated architecture is presented.

...read moreread less

Abstract: Support of high performance queries on large volumes of spatial data becomes increasingly important in many application domains, including geospatial problems in numerous fields, location based services, and emerging scientific applications that are increasingly data- and compute-intensive. The emergence of massive scale spatial data is due to the proliferation of cost effective and ubiquitous positioning technologies, development of high resolution imaging technologies, and contribution from a large number of community users. There are two major challenges for managing and querying massive spatial data to support spatial queries: the explosion of spatial data, and the high computational complexity of spatial queries. In this paper, we present Hadoop-GIS - a scalable and high performance spatial data warehousing system for running large scale spatial queries on Hadoop. Hadoop-GIS supports multiple types of spatial queries on MapReduce through spatial partitioning, customizable spatial query engine RESQUE, implicit parallel spatial query execution on MapReduce, and effective methods for amending query results through handling boundary objects. Hadoop-GIS utilizes global partition indexing and customizable on demand local spatial indexing to achieve efficient query processing. Hadoop-GIS is integrated into Hive to support declarative spatial queries with an integrated architecture. Our experiments have demonstrated the high efficiency of Hadoop-GIS on query response and high scalability to run on commodity clusters. Our comparative experiments have showed that performance of Hadoop-GIS is on par with parallel SDBMS and outperforms SDBMS for compute-intensive queries. Hadoop-GIS is available as a set of library for processing spatial queries, and as an integrated software package in Hive.

...read moreread less

571 citations

Proceedings Article•DOI•

Towards an elastic distributed SDN controller

[...]

Advait Dixit¹, Fang Hao², Sarit Mukherjee², T. V. Lakshman², Ramana Rao Kompella¹ - Show less +1 more•Institutions (2)

Purdue University¹, Bell Labs²

16 Aug 2013

TL;DR: ElastiCon is proposed, an elastic distributed controller architecture in which the controller pool is dynamically grown or shrunk according to traffic conditions and the load is dynamically shifted across controllers, which conforms with the Openflow standard.

...read moreread less

Abstract: Distributed controllers have been proposed for Software Defined Networking to address the issues of scalability and reliability that a centralized controller suffers from. One key limitation of the distributed controllers is that the mapping between a switch and a controller is statically configured, which may result in uneven load distribution among the controllers. To address this problem, we propose ElastiCon, an elastic distributed controller architecture in which the controller pool is dynamically grown or shrunk according to traffic conditions and the load is dynamically shifted across controllers. We propose a novel switch migration protocol for enabling such load shifting, which conforms with the Openflow standard. We also build a prototype to demonstrate the efficacy of our design.

...read moreread less

567 citations

Proceedings Article•DOI•

GPS: a graph processing system

[...]

Semih Salihoglu¹, Jennifer Widom¹•Institutions (1)

Stanford University¹

29 Jul 2013

TL;DR: This paper describes the implementation of GPS and its novel features, and presents experimental results on the performance effects of both static and dynamic graph partitioning schemes, and describes the compilation of a high-level domain-specific programming language to GPS, enabling easy expression of complex algorithms.

...read moreread less

Abstract: GPS (for Graph Processing System) is a complete open-source system we developed for scalable, fault-tolerant, and easy-to-program execution of algorithms on extremely large graphs. This paper serves the dual role of describing the GPS system, and presenting techniques and experimental results for graph partitioning in distributed graph-processing systems like GPS. GPS is similar to Google's proprietary Pregel system, with three new features: (1) an extended API to make global computations more easily expressed and more efficient; (2) a dynamic repartitioning scheme that reassigns vertices to different workers during the computation, based on messaging patterns; and (3) an optimization that distributes adjacency lists of high-degree vertices across all compute nodes to improve performance. In addition to presenting the implementation of GPS and its novel features, we also present experimental results on the performance effects of both static and dynamic graph partitioning schemes, and we describe the compilation of a high-level domain-specific programming language to GPS, enabling easy expression of complex algorithms.

...read moreread less

541 citations

Proceedings Article•DOI•

Speedy transactions in multicore in-memory databases

[...]

Stephen Tu¹, Wenting Zheng¹, Eddie Kohler², Barbara Liskov¹, Samuel Madden¹ - Show less +1 more•Institutions (2)

Massachusetts Institute of Technology¹, Harvard University²

03 Nov 2013

TL;DR: A commit protocol based on optimistic concurrency control that provides serializability while avoiding all shared-memory writes for records that were only read, which achieves excellent performance and scalability on modern multicore machines.

...read moreread less

Abstract: Silo is a new in-memory database that achieves excellent performance and scalability on modern multicore machines. Silo was designed from the ground up to use system memory and caches efficiently. For instance, it avoids all centralized contention points, including that of centralized transaction ID assignment. Silo's key contribution is a commit protocol based on optimistic concurrency control that provides serializability while avoiding all shared-memory writes for records that were only read. Though this might seem to complicate the enforcement of a serial order, correct logging and recovery is provided by linking periodically-updated epochs with the commit protocol. Silo provides the same guarantees as any serializable database without unnecessary scalability bottlenecks or much additional latency. Silo achieves almost 700,000 transactions per second on a standard TPC-C workload mix on a 32-core machine, as well as near-linear scalability. Considered per core, this is several times higher than previously reported results.

...read moreread less

509 citations

Journal Article•DOI•

Spanner: Google’s Globally Distributed Database

[...]

James C. Corbett¹, Jeffrey Dean¹, Michael James Boyer Epstein¹, Andrew Fikes¹, Christopher Frost¹, J. J. Furman¹, Sanjay Ghemawat¹, Andrey Gubarev¹, Christopher Heiser¹, Peter Hochschild¹, Wilson C. Hsieh¹, Sebastian Kanthak¹, Eugene Kogan¹, Hongyi Li¹, Alexander Lloyd¹, Sergey Melnik¹, David Mwaura¹, David Nagle¹, Sean Quinlan¹, Rajesh Rao¹, Lindsay Rolig¹, Yasushi Saito¹, Michal Piotr Szymaniak¹, Chris Jorgen Taylor¹, Ruth Wang¹, Dale Woodford¹ - Show less +22 more•Institutions (1)

Google¹

01 Aug 2013-ACM Transactions on Computer Systems

TL;DR: Spanner as mentioned in this paper is Google's scalable, multiversion, globally distributed, and synchronously replicated database, which is the first system to distribute data at global scale and support externally-consistent distributed transactions.

...read moreread less

Abstract: Spanner is Google’s scalable, multiversion, globally distributed, and synchronously replicated database. It is the first system to distribute data at global scale and support externally-consistent distributed transactions. This article describes how Spanner is structured, its feature set, the rationale underlying various design decisions, and a novel time API that exposes clock uncertainty. This API and its implementation are critical to supporting external consistency and a variety of powerful features: nonblocking reads in the past, lock-free snapshot transactions, and atomic schema changes, across all of Spanner.

...read moreread less

493 citations

Journal Article•DOI•

On scalability of software-defined networking

[...]

Soheil Hassas Yeganeh¹, Amin Tootoonchian¹, Yashar Ganjali¹•Institutions (1)

University of Toronto¹

14 Feb 2013-IEEE Communications Magazine

TL;DR: This article deconstruct scalability concerns in software-defined networking and argues that they are not unique to SDN, and enumerate overlooked yet important opportunities and challenges in scalability beyond the commonly used performance metrics.

...read moreread less

Abstract: In this article, we deconstruct scalability concerns in software-defined networking and argue that they are not unique to SDN. We explore the often voiced concerns in different settings, discuss scalability trade-offs in the SDN design space, and present some recent research on SDN scalability. Moreover, we enumerate overlooked yet important opportunities and challenges in scalability beyond the commonly used performance metrics.

...read moreread less

491 citations

Proceedings Article•

Elasticity in Cloud Computing: What It Is, and What It Is Not.

[...]

Nikolas Herbst¹, Samuel Kounev¹, Ralf Reussner¹•Institutions (1)

Karlsruhe Institute of Technology¹

01 Jan 2013

TL;DR: A precise definition of elasticity is proposed and its core properties and requirements explicitly distinguishing from related terms such as scalability and efficiency are analyzed.

...read moreread less

Abstract: Originating from the field of physics and economics, the term elasticity is nowadays heavily used in the context of cloud computing. In this context, elasticity is commonly understood as the ability of a system to automatically provision and deprovision computing resources on demand as workloads change. However, elasticity still lacks a precise definition as well as representative metrics coupled with a benchmarking methodology to enable comparability of systems. Existing definitions of elasticity are largely inconsistent and unspecific, which leads to confusion in the use of the term and its differentiation from related terms such as scalability and efficiency; the proposed measurement methodologies do not provide means to quantify elasticity without mixing it with efficiency or scalability aspects. In this short paper, we propose a precise definition of elasticity and analyze its core properties and requirements explicitly distinguishing from related terms such as scalability and efficiency. Furthermore, we present a set of appropriate elasticity metrics and sketch a new elasticity tailored benchmarking methodology addressing the special requirements on workload design and calibration.

...read moreread less

441 citations

Proceedings Article•DOI•

SoftCell: scalable and flexible cellular core network architecture

[...]

Xin Jin¹, Li Erran Li², Laurent Vanbever¹, Jennifer Rexford¹•Institutions (2)

Princeton University¹, Bell Labs²

09 Dec 2013

TL;DR: The presented SoftCell is a scalable architecture that supports fine-grained policies for mobile devices in cellular core networks, using commodity switches and servers, and enables operators to realize high-level service policies that direct traffic through sequences of middleboxes based on subscriber attributes and applications.

...read moreread less

Abstract: Cellular core networks suffer from inflexible and expensive equipment, as well as from complex control-plane protocols. To address these challenges, we present SoftCell, a scalable architecture that supports fine-grained policies for mobile devices in cellular core networks, using commodity switches and servers. SoftCell enables operators to realize high-level service policies that direct traffic through sequences of middleboxes based on subscriber attributes and applications. To minimize the size of the forwarding tables, SoftCell aggregates traffic along multiple dimensions---the service policy, the base station, and the mobile device---at different switches in the network. Since most traffic originates from mobile devices, SoftCell performs fine-grained packet classification at the access switches, next to the base stations, where software switches can easily handle the state and bandwidth requirements. SoftCell guarantees that packets belonging to the same connection traverse the same sequence of middleboxes in both directions, even in the presence of mobility. We demonstrate that SoftCell improves the scalability and flexibility of cellular core networks by analyzing real LTE workloads, performing micro-benchmarks on our prototype controller as well as large-scale simulations.

...read moreread less

Journal Article•DOI•

A Survey on Sensor-Cloud: Architecture, Applications, and Approaches

[...]

Atif Alamri¹, Wasai Shadab Ansari¹, Mohammad Mehedi Hassan¹, M. Shamim Hossain¹, Abdulhameed Alelaiwi¹, M. Anwar Hossain¹ - Show less +2 more•Institutions (1)

King Saud University¹

28 Feb 2013-International Journal of Distributed Sensor Networks

TL;DR: This paper presents a comprehensive study of representative works on Sensor-Cloud infrastructure, which will provide general readers an overview of the Sensor- Cloud platform including its definition, architecture, and applications.

...read moreread less

Abstract: Nowadays, wireless sensor network (WSN) applications have been used in several important areas, such as healthcare, military, critical infrastructure monitoring, environment monitoring, and manufacturing. However, due to the limitations of WSNs in terms of memory, energy, computation, communication, and scalability, efficient management of the large number of WSNs data in these areas is an important issue to deal with. There is a need for a powerful and scalable high-performance computing and massive storage infrastructure for real-time processing and storing of the WSN data as well as analysis (online and offline) of the processed information under context using inherently complex models to extract events of interest. In this scenario, cloud computing is becoming a promising technology to provide a flexible stack of massive computing, storage, and software services in a scalable and virtualized manner at low cost. Therefore, in recent years, Sensor-Cloud infrastructure is becoming popular that can provide an open, flexible, and reconfigurable platform for several monitoring and controlling applications. In this paper, we present a comprehensive study of representative works on Sensor-Cloud infrastructure, which will provide general readers an overview of the Sensor-Cloud platform including its definition, architecture, and applications. The research challenges, existing solutions, and approaches as well as future research directions are also discussed in this paper.

...read moreread less

Proceedings Article•DOI•

Dynamic Controller Provisioning in Software Defined Networks

[...]

Faizul Bari¹, Arup Raton Roy¹, Shihabur Rahman Chowdhury¹, Qi Zhang¹, Mohamed Faten Zhani¹, Reaz Ahmed¹, Raouf Boutaba¹ - Show less +3 more•Institutions (1)

University of Waterloo¹

01 Oct 2013

TL;DR: This paper proposes a framework for deploying multiple controllers within an WAN that dynamically adjusts the number of active controllers and delegates each controller with a subset of Openflow switches according to network dynamics while ensuring minimal flow setup time and communication overhead.

...read moreread less

Abstract: Software Defined Networking (SDN) has emerged as a new paradigm that offers the programmability required to dynamically configure and control a network. A traditional SDN implementation relies on a logically centralized controller that runs the control plane. However, in a large-scale WAN deployment, this rudimentary centralized approach has several limitations related to performance and scalability. To address these issues, recent proposals have advocated deploying multiple controllers that work cooperatively to control a network. Nonetheless, this approach drags in an interesting problem, which we call the Dynamic Controller Provisioning Problem (DCPP). DCPP dynamically adapts the number of controllers and their locations with changing network conditions, in order to minimize flow setup time and communication overhead. In this paper, we propose a framework for deploying multiple controllers within an WAN. Our framework dynamically adjusts the number of active controllers and delegates each controller with a subset of Openflow switches according to network dynamics while ensuring minimal flow setup time and communication overhead. To this end, we formulate the optimal controller provisioning problem as an Integer Linear Program (ILP) and propose two heuristics to solve it. Simulation results show that our solution minimizes flow setup time while incurring very low communication overhead.

...read moreread less

Proceedings Article•DOI•

When private set intersection meets big data: an efficient and scalable protocol

[...]

Changyu Dong¹, Liqun Chen², Zikai Wen¹•Institutions (2)

University of Strathclyde¹, Hewlett-Packard²

04 Nov 2013

TL;DR: A new Private Set Intersection (PSI) protocol that is extremely efficient and highly scalable compared with existing protocols, based on a novel approach that is oblivious Bloom intersection, which has linear complexity and relies mostly on efficient symmetric key operations.

...read moreread less

Abstract: Large scale data processing brings new challenges to the design of privacy-preserving protocols: how to meet the increasing requirements of speed and throughput of modern applications, and how to scale up smoothly when data being protected is big. Efficiency and scalability become critical criteria for privacy preserving protocols in the age of Big Data. In this paper, we present a new Private Set Intersection (PSI) protocol that is extremely efficient and highly scalable compared with existing protocols. The protocol is based on a novel approach that we call oblivious Bloom intersection. It has linear complexity and relies mostly on efficient symmetric key operations. It has high scalability due to the fact that most operations can be parallelized easily. The protocol has two versions: a basic protocol and an enhanced protocol, the security of the two variants is analyzed and proved in the semi-honest model and the malicious model respectively. A prototype of the basic protocol has been built. We report the result of performance evaluation and compare it against the two previously fastest PSI protocols. Our protocol is orders of magnitude faster than these two protocols. To compute the intersection of two million-element sets, our protocol needs only 41 seconds (80-bit security) and 339 seconds (256-bit security) on moderate hardware in parallel mode.

...read moreread less

Journal Article•DOI•

Scalable Network Virtualization in Software-Defined Networks

[...]

D. Drutskoy, Eric Keller¹, Jennifer Rexford²•Institutions (2)

University of Colorado Boulder¹, Princeton University²

01 Mar 2013-IEEE Internet Computing

TL;DR: The FlowN architecture gives each tenant the illusion of its own address space, topology, and controller, and leverages database technology to efficiently store and manipulate mappings between virtual networks and physical switches.

...read moreread less

Abstract: Network virtualization gives each "tenant" in a data center its own network topology and control over its traffic flow. Software-defined networking offers a standard interface between controller applications and switch-forwarding tables, and is thus a natural platform for network virtualization. Yet, supporting numerous tenants with different topologies and controller applications raises scalability challenges. The FlowN architecture gives each tenant the illusion of its own address space, topology, and controller, and leverages database technology to efficiently store and manipulate mappings between virtual networks and physical switches.

...read moreread less

Proceedings Article•DOI•

Mizan: a system for dynamic load balancing in large-scale graph processing

[...]

Zuhair Khayyat¹, Karim Awara¹, Amani AlOnazi¹, Hani Jamjoom², Dan Williams², Panos Kalnis¹ - Show less +2 more•Institutions (2)

King Abdullah University of Science and Technology¹, IBM²

15 Apr 2013

TL;DR: Mizan is introduced, a Pregel system that achieves efficient load balancing to better adapt to changes in computing needs and does not assume any a priori knowledge of the structure of the graph or behavior of the algorithm.

...read moreread less

Abstract: Pregel [23] was recently introduced as a scalable graph mining system that can provide significant performance improvements over traditional MapReduce implementations. Existing implementations focus primarily on graph partitioning as a preprocessing step to balance computation across compute nodes. In this paper, we examine the runtime characteristics of a Pregel system. We show that graph partitioning alone is insufficient for minimizing end-to-end computation. Especially where data is very large or the runtime behavior of the algorithm is unknown, an adaptive approach is needed. To this end, we introduce Mizan, a Pregel system that achieves efficient load balancing to better adapt to changes in computing needs. Unlike known implementations of Pregel, Mizan does not assume any a priori knowledge of the structure of the graph or behavior of the algorithm. Instead, it monitors the runtime characteristics of the system. Mizan then performs efficient fine-grained vertex migration to balance computation and communication. We have fully implemented Mizan; using extensive evaluation we show that---especially for highly-dynamic workloads---Mizan provides up to 84% improvement over techniques leveraging static graph pre-partitioning.

...read moreread less

Journal Article•DOI•

Network of Information (NetInf) - An information-centric networking architecture

[...]

Christian Dannewitz¹, Dirk Kutscher, Börje Ohlman², Stephen Farrell³, Bengt Ahlgren⁴, Holger Karl¹ - Show less +2 more•Institutions (4)

University of Paderborn¹, Ericsson², Trinity College, Dublin³, Swedish Institute of Computer Science⁴

01 Apr 2013-Computer Communications

TL;DR: The results suggest that a scalable NRS for 10^1^5 and more objects with resolution latencies (well) below 100ms is possible, implying that a global Network of Information that removes the need for today's application-specific overlay solutions is feasible.

...read moreread less

Proceedings Article•

Stronger semantics for low-latency geo-replicated storage

[...]

Wyatt Lloyd¹, Michael J. Freedman¹, Michael Kaminsky², David G. Andersen³•Institutions (3)

Princeton University¹, Intel², Carnegie Mellon University³

02 Apr 2013

TL;DR: The evaluation shows that the Eiger system achieves low latency, has throughput competitive with eventually-consistent and non-transactional Cassandra, and scales out to large clusters almost linearly (averaging 96% increases up to 128 server clusters).

...read moreread less

Abstract: We present the first scalable, geo-replicated storage system that guarantees low latency, offers a rich data model, and provides "stronger" semantics. Namely, all client requests are satisfied in the local datacenter in which they arise; the system efficiently supports useful data model abstractions such as column families and counter columns; and clients can access data in a causally-consistent fashion with read-only and write-only transactional support, even for keys spread across many servers. The primary contributions of this work are enabling scalable causal consistency for the complex columnfamily data model, as well as novel, non-blocking algorithms for both read-only and write-only transactions. Our evaluation shows that our system, Eiger, achieves low latency (single-ms), has throughput competitive with eventually-consistent and non-transactional Cassandra (less than 7% overhead for one of Facebook's real-world workloads), and scales out to large clusters almost linearly (averaging 96% increases up to 128 server clusters).

...read moreread less

Proceedings Article•DOI•

StEERING: A software-defined networking for inline service chaining

[...]

Ying Zhang¹, Neda Beheshti¹, Ludovic Beliveau¹, Geoffrey Lefebvre¹, Ravi Manghirmalani¹, Ramesh Mishra¹, Ritun Patneyt, Meral Shirazipour¹, Ramesh Subrahmaniam¹, Catherine Truchan¹, Mallik Tatipamula¹ - Show less +7 more•Institutions (1)

Ericsson¹

01 Oct 2013

TL;DR: StEERING, short for SDN inlinE sERvices and forwardlNG, is a scalable framework for dynamically routing traffic through any sequence of middleboxes, built upon the recent software-defined networking architecture and OpenFlow protocol.

...read moreread less

Abstract: Network operators are faced with the challenge of deploying and managing middleboxes (also called inline services) such as firewalls within their broadband access, datacenter or enterprise networks. Due to the lack of available protocols to route traffic through middleboxes, operators still rely on error-prone and complex low-level configurations to coerce traffic through the desired set of middleboxes. Built upon the recent software-defined networking (SDN) architecture and OpenFlow protocol, this paper proposes StEERING, short for SDN inlinE sERvices and forwardlNG. It is a scalable framework for dynamically routing traffic through any sequence of middleboxes. With simple centralized configuration, StEERING can explicitly steer different types of flows through the desired set of middleboxes, scaling at the level of per-subscriber and per-application policies. With its capability to support flexible routing, we further propose an algorithm to select the best locations for placing services, such that the performance is optimized. Overall, StEERING allows network operators to monetize their middlebox deployment in new ways by allowing subscribers flexibly to select available network services.

...read moreread less

Proceedings Article•DOI•

Spanner: Google's globally-distributed database

[...]

Cooper Brian F¹•Institutions (1)

Google¹

30 Jun 2013

TL;DR: The design and implementation of Spanner is discussed, as well as some of the lessons it has learned along the way, and some open challenges in building scalable distributed storage systems are discussed.

...read moreread less

Abstract: Spanner is Google's scalable, multi-version, globally-distributed, and synchronously-replicated database. It provides strong transactional semantics, consistent replication, and high performance reads and writes for a variety of Google's applications. I'll discuss the design and implementation of Spanner, as well as some of the lessons we have learned along the way. I'll also discuss some open challenges that we still see in building scalable distributed storage systems.

...read moreread less

Proceedings Article•

EyeQ: practical network performance isolation at the edge

[...]

Vimalkumar Jeyakumar¹, Mohammad Alizadeh¹, David Mazières¹, Balaji Prabhakar¹, Changhoon Kim, Albert Greenberg - Show less +2 more•Institutions (1)

Stanford University¹

02 Apr 2013

TL;DR: EyeQ is presented, a simple and practical system that provides tenants with bandwidth guarantees as if their endpoints were connected to a dedicated switch, and leverages the high bisection bandwidth in a datacenter fabric and enforces admission control on traffic, regardless of the tenant transport protocol.

...read moreread less

Abstract: The datacenter network is shared among untrusted tenants in a public cloud, and hundreds of services in a private cloud. Today we lack fine-grained control over network bandwidth partitioning across tenants. In this paper we present EyeQ, a simple and practical system that provides tenants with bandwidth guarantees as if their endpoints were connected to a dedicated switch. To realize this goal, EyeQ leverages the high bisection bandwidth in a datacenter fabric and enforces admission control on traffic, regardless of the tenant transport protocol. We show that this pushes bandwidth contention to the network's edge, enabling EyeQ to support end-to-end minimum bandwidth guarantees to tenant end-points in a simple and scalable manner at the servers. EyeQ requires no changes to applications and is deployable with support from the network available today. We evaluate EyeQ with an efficient software implementation at 10Gb/s speeds using unmodified applications and adversarial traffic patterns. Our evaluation demonstrates EyeQ's promise of predictable network performance isolation. For instance, even with an adversarial tenant with bursty UDP traffic, EyeQ is able to maintain the 99.9th percentile latency for a collocated memcached application close to that of a dedicated deployment.

...read moreread less

Posted Content•

DISCO: Distributed Multi-domain SDN Controllers

[...]

Kevin Phemius¹, Mathieu Bouet¹, Jeremie Leguay¹•Institutions (1)

Thales Communications¹

28 Aug 2013-arXiv: Networking and Internet Architecture

TL;DR: This paper proposes DISCO, an extensible DIstributed SDN COntrol plane able to cope with the distributed and heterogeneous nature of modern overlay networks and is implemented on top of the Floodlight OpenFlow controller and the AMQP protocol.

...read moreread less

Abstract: Modern multi-domain networks now span over datacenter networks, enterprise networks, customer sites and mobile entities. Such networks are critical and, thus, must be resilient, scalable and easily extensible. The emergence of Software-Defined Networking (SDN) protocols, which enables to decouple the data plane from the control plane and dynamically program the network, opens up new ways to architect such networks. In this paper, we propose DISCO, an open and extensible DIstributed SDN COntrol plane able to cope with the distributed and heterogeneous nature of modern overlay networks and wide area networks. DISCO controllers manage their own network domain and communicate with each others to provide end-to-end network services. This communication is based on a unique lightweight and highly manageable control channel used by agents to self-adaptively share aggregated network-wide information. We implemented DISCO on top of the Floodlight OpenFlow controller and the AMQP protocol. We demonstrated how DISCO's control plane dynamically adapts to heterogeneous network topologies while being resilient enough to survive to disruptions and attacks and providing classic functionalities such as end-point migration and network-wide traffic engineering. The experimentation results we present are organized around three use cases: inter-domain topology disruption, end-to-end priority service request and virtual machine migration.

...read moreread less

Journal Article•DOI•

Information Weighted Consensus Filters and Their Application in Distributed Camera Networks

[...]

Ahmed Tashrif Kamal¹, Jay A. Farrell¹, Amit K. Roy-Chowdhury¹•Institutions (1)

University of California, Riverside¹

07 Aug 2013-IEEE Transactions on Automatic Control

TL;DR: Information-weighted consensus algorithms for distributed maximum a posteriori parameter estimation, and their extension to the information- Weighted consensus filter (ICF) for state estimation are proposed.

...read moreread less

Abstract: Due to their high fault-tolerance and scalability to large networks, consensus-based distributed algorithms have recently gained immense popularity in the sensor networks community. Large-scale camera networks are a special case. In a consensus-based state estimation framework, multiple neighboring nodes iteratively communicate with each other, exchanging their own local information about each target's state with the goal of converging to a single state estimate over the entire network. However, the state estimation problem becomes challenging when some nodes have limited observability of the state. In addition, the consensus estimate is suboptimal when the cross-covariances between the individual state estimates across different nodes are not incorporated in the distributed estimation framework. The cross-covariance is usually neglected because the computational and bandwidth requirements for its computation become unscalable for a large network. These limitations can be overcome by noting that, as the state estimates at different nodes converge, the information at each node becomes correlated. This fact can be utilized to compute the optimal estimate by proper weighting of the prior state and measurement information. Motivated by this idea, we propose information-weighted consensus algorithms for distributed maximum a posteriori parameter estimation, and their extension to the information-weighted consensus filter (ICF) for state estimation. We compare the performance of the ICF with existing consensus algorithms analytically, as well as experimentally by considering the scenario of a distributed camera network under various operating conditions.

...read moreread less

Journal Article•DOI•

Distributed Energy Trading: The Multiple-Microgrid Case

[...]

David Gregoratti, Javier Matamoros

29 Jul 2013-arXiv: Optimization and Control

TL;DR: In this article, a distributed convex optimization framework is developed for energy trading between islanded microgrids, where the problem consists of several island-grids that exchange energy flows by means of an arbitrary topology, and a subgradient-based cost minimization algorithm is proposed that converges to the optimal solution in a practical number of iterations.

...read moreread less

Abstract: In this paper, a distributed convex optimization framework is developed for energy trading between islanded microgrids. More specifically, the problem consists of several islanded microgrids that exchange energy flows by means of an arbitrary topology. Due to scalability issues and in order to safeguard local information on cost functions, a subgradient-based cost minimization algorithm is proposed that converges to the optimal solution in a practical number of iterations and with a limited communication overhead. Furthermore, this approach allows for a very intuitive economics interpretation that explains the algorithm iterations in terms of "supply--demand model" and "market clearing". Numerical results are given in terms of convergence rate of the algorithm and attained costs for different network topologies.

...read moreread less

Journal Article•DOI•

On the optimal allocation of virtual resources in cloud computing networks

[...]

Chrysa Papagianni¹, Aris Leivadeas¹, Symeon Papavassiliou¹, Vasilis Maglaris¹, Cristina Cervello-Pastor, Alvaro Monje - Show less +2 more•Institutions (1)

National and Kapodistrian University of Athens¹

01 Jun 2013-IEEE Transactions on Computers

TL;DR: This paper first formulate the optimal networked cloud mapping problem as a mixed integer programming (MIP) problem, indicating objectives related to cost efficiency of the resource mapping procedure, and proposes a method for the efficient mapping of resource requests onto a shared substrate interconnecting various islands of computing resources.

...read moreread less

Abstract: Cloud computing builds upon advances on virtualization and distributed computing to support cost-efficient usage of computing resources, emphasizing on resource scalability and on demand services. Moving away from traditional data-center oriented models, distributed clouds extend over a loosely coupled federated substrate, offering enhanced communication and computational services to target end-users with quality of service (QoS) requirements, as dictated by the future Internet vision. Toward facilitating the efficient realization of such networked computing environments, computing and networking resources need to be jointly treated and optimized. This requires delivery of user-driven sets of virtual resources, dynamically allocated to actual substrate resources within networked clouds, creating the need to revisit resource mapping algorithms and tailor them to a composite virtual resource mapping problem. In this paper, toward providing a unified resource allocation framework for networked clouds, we first formulate the optimal networked cloud mapping problem as a mixed integer programming (MIP) problem, indicating objectives related to cost efficiency of the resource mapping procedure, while abiding by user requests for QoS-aware virtual resources. We subsequently propose a method for the efficient mapping of resource requests onto a shared substrate interconnecting various islands of computing resources, and adopt a heuristic methodology to address the problem. The efficiency of the proposed approach is illustrated in a simulation/emulation environment, that allows for a flexible, structured, and comparative performance evaluation. We conclude by outlining a proof-of-concept realization of our proposed schema, mounted over the European future Internet test-bed FEDERICA, a resource virtualization platform augmented with network and computing facilities.

...read moreread less

Journal Article•DOI•

PaRSEC: Exploiting Heterogeneity to Enhance Scalability

[...]

George Bosilca¹, Aurelien Bouteiller¹, Anthony Danalis¹, Mathieu Faverge, Thomas Herault¹, Jack Dongarra¹ - Show less +2 more•Institutions (1)

University of Tennessee¹

01 Nov 2013

TL;DR: In this article, the authors present an approach based on task parallelism that reveals the application's parallelism by expressing its algorithm as a task flow, which allows the algorithm to be decoupled from the data distribution and the underlying hardware.

...read moreread less

Abstract: New high-performance computing system designs with steeply escalating processor and core counts, burgeoning heterogeneity and accelerators, and increasingly unpredictable memory access times call for one or more dramatically new programming paradigms. These new approaches must react and adapt quickly to unexpected contentions and delays, and they must provide the execution environment with sufficient intelligence and flexibility to rearrange the execution to improve resource utilization. The authors present an approach based on task parallelism that reveals the application's parallelism by expressing its algorithm as a task flow. This strategy allows the algorithm to be decoupled from the data distribution and the underlying hardware, since the algorithm is entirely expressed as flows of data. This kind of layering provides a clear separation of concerns among architecture, algorithm, and data distribution. Developers benefit from this separation because they can focus solely on the algorithmic level without the constraints involved with programming for current and future hardware trends.

...read moreread less

Journal Article•DOI•

LIMERIC: A Linear Adaptive Message Rate Algorithm for DSRC Congestion Control

[...]

Gaurav Bansal¹, John Kenney¹, Charles E. Rohrs¹•Institutions (1)

Toyota¹

26 Jul 2013-IEEE Transactions on Vehicular Technology

TL;DR: This paper defines a new adaptive congestion control algorithm that can be applied to the message rate of devices in this vehicular environment and employs standard NS-2 simulations to demonstrate the performance of LIMERIC in several high-density scenarios.

...read moreread less

Abstract: Wireless vehicle-to-vehicle (V2V) and vehicle-toinfrastructure (V2I) communication holds great promise for significantly reducing the human and financial costs of vehicle collisions A common characteristic of this communication is the broadcast of a device's core state information at regular intervals (eg, vehicle speed and location or traffic signal state and timing) Unless controlled, the aggregate of these broadcasts will congest the channel under dense traffic scenarios, reducing the effectiveness of collision avoidance applications that use transmitted information Active congestion control using distributed techniques is a topic of great interest for establishing the scalability of this technology This paper defines a new adaptive congestion control algorithm that can be applied to the message rate of devices in this vehicular environment While other published approaches rely on binary control, the LInear MEssage Rate Integrated Control (LIMERIC) algorithm takes advantage of full-precision control inputs that are available on the wireless channel The result is provable convergence to fair and efficient channel utilization in the deterministic environment, under simple criteria for setting adaptive parameters This “perfect” convergence avoids the limit cycle behavior that is inherent to binary control We also discuss several practical aspects associated with implementing LIMERIC, including guidelines for the choice of system parameters to obtain desired utilization outcomes, a gain saturation technique that maintains robust convergence under all conditions, convergence with asynchronous updates, and using channel load to determine the aggregate message rate that is observable at a receiver This paper also extends the convergence analysis for two important cases, ie, measurement noise in the input signal and delay in the update process This paper illustrates key analytical results using MATLAB numerical results and employs standard NS-2 simulations to demonstrate the performance of LIMERIC in several high-density scenarios

...read moreread less

Journal Article•DOI•

A tenant-based resource allocation model for scaling Software-as-a-Service applications over cloud computing infrastructures

[...]

Javier Espadas¹, Arturo Molina¹, Guillermo Jimenez¹, Martin Molina¹, Raúl Antonio Trejo Ramírez¹, David Concha¹ - Show less +2 more•Institutions (1)

Monterrey Institute of Technology and Higher Education¹

01 Jan 2013-Future Generation Computer Systems

TL;DR: This work attempts to establish formal measurements for under and over provisioning of virtualized resources in cloud infrastructures, specifically for SaaS platform deployments and proposes a resource allocation model to deploy SAAS applications over cloud computing platforms by taking into account their multi-tenancy, thus creating a cost-effective scalable environment.

...read moreread less

Journal Article•DOI•

Temporal locality in today's content caching: why it matters and how to model it

[...]

Stefano Traverso¹, Mohamed H. Ahmed, Michele Garetto², Paolo Giaccone¹, Emilio Leonardi¹, Saverio Niccolini - Show less +2 more•Institutions (2)

Polytechnic University of Turin¹, University of Turin²

04 Nov 2013

TL;DR: This paper proposes a new parsimonious traffic model, named the Shot Noise Model (SNM), that enables users to natively capture the dynamics of content popularity, whilst still being sufficiently simple to be employed effectively for both analytical and scalable simulative studies of caching systems.

...read moreread less

Abstract: The dimensioning of caching systems represents a difficult task in the design of infrastructures for content distribution in the current Internet. This paper addresses the problem of defining a realistic arrival process for the content requests generated by users, due its critical importance for both analytical and simulative evaluations of the performance of caching systems. First, with the aid of \youtube traces collected inside operational residential networks, we identify the characteristics of real traffic that need to be considered or can be safely neglected in order to accurately predict the performance of a cache. Second, we propose a new parsimonious traffic model, named the Shot Noise Model (SNM), that enables users to natively capture the dynamics of content popularity, whilst still being sufficiently simple to be employed effectively for both analytical and scalable simulative studies of caching systems. Finally, our results show that the SNM presents a much better solution to account for the temporal locality observed in real traffic compared to existing approaches.

...read moreread less

Journal Article•DOI•

NoSQL databases: a step to database scalability in web environment

[...]

Jaroslav Pokorny¹•Institutions (1)

Charles University in Prague¹

10 Nov 2013-International Journal of Web Information Systems

TL;DR: The paper shows vary different data models and query possibilities in a common terminology enabling comparison and categorization of NoSQL databases, particularly their horizontal scalability and concurrency model.

...read moreread less

Abstract: Purpose – The paper aims to focus on so‐called NoSQL databases in the context of cloud computing.Design/methodology/approach – Architectures and basic features of these databases are studied, particularly their horizontal scalability and concurrency model, that is mostly weaker than ACID transactions in relational SQL‐like database systems.Findings – Some characteristics like a data model and querying capabilities of NoSQL databases are discussed in more detail.Originality/value – The paper shows vary different data models and query possibilities in a common terminology enabling comparison and categorization of NoSQL databases.

...read moreread less

Collapse