Showing papers by "Timothy Wood published in 2014"

PDF

Open Access

Proceedings Article•DOI•

NetVM: high performance and flexible networking using virtualization on commodity platforms

[...]

Jinho Hwang¹, Kadangode K. Ramakrishnan², Timothy Wood¹•Institutions (2)

George Washington University¹, Rutgers University²

02 Apr 2014

TL;DR: NetVM as mentioned in this paper is built on top of the KVM platform and Intel DPDK library, allowing customizable data plane processing capabilities such as firewalls, proxies, and routers to be embedded within virtual machines, complementing the control plane capabilities of Software Defined Networks.

...read moreread less

Abstract: NetVM brings virtualization to the Network by enabling high bandwidth network functions to operate at near line speed, while taking advantage of the flexibility and customization of low cost commodity servers. NetVM allows customizable data plane processing capabilities such as firewalls, proxies, and routers to be embedded within virtual machines, complementing the control plane capabilities of Software Defined Networking. NetVM makes it easy to dynamically scale, deploy, and reprogram network functions. This provides far greater flexibility than existing purpose-built, sometimes proprietary hardware, while still allowing complex policies and full packet inspection to determine subsequent processing. It does so with dramatically higher throughput than existing software router platforms.NetVM is built on top of the KVM platform and Intel DPDK library. We detail many of the challenges we have solved such as adding support for high-speed inter-VM communication through shared huge pages and enhancing the CPU scheduler to prevent overheads caused by inter-core communication and context switching. NetVM allows true zero-copy delivery of data to VMs both for packet processing and messaging among VMs within a trust boundary. Our evaluation shows how NetVM can compose complex network functionality from multiple pipelined VMs and still obtain throughputs up to 10 Gbps, an improvement of more than 250% compared to existing techniques that use SR-IOV for virtualized networking.

...read moreread less

113 citations

Journal Article•DOI•

Cost-Aware Cloud Bursting for Enterprise Applications

[...]

Tian Guo¹, Upendra Sharma², Prashant Shenoy¹, Timothy Wood³, Sambit Sahu² - Show less +1 more•Institutions (3)

University of Massachusetts Amherst¹, IBM², George Washington University³

01 May 2014-ACM Transactions on Internet Technology

TL;DR: Seagull is a system designed to facilitate cloud bursting by determining which applications should be transitioned into the cloud and automating the movement process at the proper time, and optimizes the bursting of applications using an optimization algorithm as well as a more efficient but approximate greedy heuristic.

...read moreread less

Abstract: The high cost of provisioning resources to meet peak application demands has led to the widespread adoption of pay-as-you-go cloud computing services to handle workload fluctuations. Some enterprises with existing IT infrastructure employ a hybrid cloud model where the enterprise uses its own private resources for the majority of its computing, but then “bursts” into the cloud when local resources are insufficient. However, current commercial tools rely heavily on the system administrator’s knowledge to answer key questions such as when a cloud burst is needed and which applications must be moved to the cloud. In this article, we describe Seagull, a system designed to facilitate cloud bursting by determining which applications should be transitioned into the cloud and automating the movement process at the proper time. Seagull optimizes the bursting of applications using an optimization algorithm as well as a more efficient but approximate greedy heuristic. Seagull also optimizes the overhead of deploying applications into the cloud using an intelligent precopying mechanism that proactively replicates virtualized applications, lowering the bursting time from hours to minutes. Our evaluation shows over 100p improvement compared to naive solutions but produces more expensive solutions compared to ILP. However, the scalability of our greedy algorithm is dramatically better as the number of VMs increase. Our evaluation illustrates scenarios where our prototype can reduce cloud costs by more than 45p when bursting to the cloud, and that the incremental cost added by precopying applications is offset by a burst time reduction of nearly 95p.

...read moreread less

73 citations

Proceedings Article•DOI•

MIMP: deadline and interference aware scheduling of hadoop virtual machines

[...]

Wei Zhang¹, Sundaresan Rajasekaran¹, Timothy Wood¹, Mingfa Zhu²•Institutions (2)

George Washington University¹, Beihang University²

26 May 2014

TL;DR: This work proposes two schedulers: one in the virtualization layer designed to minimize interference on high priority interactive services, and one inThe Hadoop framework that helps batch processing jobs meet their own performance deadlines.

...read moreread less

Abstract: Virtualization promised to dramatically increase server utilization levels, yet many data centers are still only lightly loaded. In some ways, big data applications are an ideal fit for using this residual capacity to perform meaningful work, but the high level of interference between interactive and batch processing workloads currently prevents this from being a practical solution in virtualized environments. Further, the variable nature of spare capacity may make it difficult to meet big data application deadlines.In this work we propose two schedulers: one in the virtualization layer designed to minimize interference on high priority interactive services, and one in the Hadoop framework that helps batch processing jobs meet their own performance deadlines. Our approach uses performance models to match Hadoop tasks to the servers that will benefit them the most, and deadline-aware scheduling to effectively order incoming jobs. The combination of these schedulers allows data center administrators to safely mix resource intensive Hadoop jobs with latency sensitive web applications, and still achieve predictable performance for both. We have implemented our system using Xen and Hadoop, and our evaluation shows that our schedulers allow a mixed cluster to reduce web response times by more than ten fold, while meeting more Hadoop deadlines and lowering total task execution times by 6.5%.

...read moreread less

64 citations

Proceedings Article•

Matrix: Achieving Predictable Virtual Machine Performance in the Clouds

[...]

Ron C. Chiang¹, Jinho Hwang², H. Howie Huang¹, Timothy Wood¹•Institutions (2)

George Washington University¹, IBM²

01 Jan 2014

TL;DR: This work proposes Matrix, a novel performance and resource management system that ensures the desired performance of an application achieved on a VM, and utilizes machine learning methods clustering models with probability estimates to predict the performance of new workloads in a virtualized environment, choose a suitable VM type, and dynamically adjust the resource configuration of a virtual machine on the fly.

...read moreread less

Abstract: The success of cloud computing builds largely upon on-demand supply of virtual machines (VMs) that provide the abstraction of a physical machine on shared resources. Unfortunately, despite recent advances in virtualization technology, there still exists an unpredictable performance gap between the real and desired performance. The main contributing factors include contention to the shared physical resources among co-located VMs, limited control of VM allocation, as well as lack of knowledge on the performance of a specific VM out of tens of VM types offered by public cloud providers. In this work, we propose Matrix, a novel performance and resource management system that ensures the desired performance of an application achieved on a VM. To this end, Matrix utilizes machine learning methods clustering models with probability estimates to predict the performance of new workloads in a virtualized environment, choose a suitable VM type, and dynamically adjust the resource configuration of a virtual machine on the fly. The evaluations on a private cloud, and two public clouds (Rackspace and Amazon EC2) show that for an extensive set of cloud applications, Matrix is able to estimate application performance with average 90% accuracy. In addition, Matrix can deliver the target performance within 3% variance, and do so with the best cost-efficiency in most cases.

...read moreread less

55 citations

Proceedings Article•DOI•

Mortar: filling the gaps in data center memory

[...]

Jinho Hwang¹, Ahsen J. Uppal², Timothy Wood², H. Howie Huang²•Institutions (2)

IBM¹, George Washington University²

01 Mar 2014

TL;DR: By expanding and contracting the data store size based on the free memory available, Mortar improves average response time of a web application by up to 35% compared to a fixed size memcached deployment, and improves overall video streaming performance by 45% through prefetching.

...read moreread less

Abstract: Data center servers are typically overprovisioned, leaving spare memory and CPU capacity idle to handle unpredictable workload bursts by the virtual machines running on them. While this allows for fast hotspot mitigation, it is also wasteful. Unfortunately, making use of spare capacity without impacting active applications is particularly difficult for memory since it typically must be allocated in coarse chunks over long timescales. In this work we propose re- purposing the poorly utilized memory in a data center to store a volatile data store that is managed by the hypervisor. We present two uses for our Mortar framework: as a cache for prefetching disk blocks, and as an application-level distributed cache that follows the memcached protocol. Both prototypes use the framework to ask the hypervisor to store useful, but recoverable data within its free memory pool. This allows the hypervisor to control eviction policies and prioritize access to the cache. We demonstrate the benefits of our prototypes using realistic web applications and disk benchmarks, as well as memory traces gathered from live servers in our university's IT department. By expanding and contracting the data store size based on the free memory available, Mortar improves average response time of a web application by up to 35% compared to a fixed size memcached deployment, and improves overall video streaming performance by 45% through prefetching.

...read moreread less

27 citations

Load Balancing of Heterogeneous Workloads in Memcached Clusters

[...]

Wei Zhang¹, Jinho Hwang², Timothy Wood¹, Kadangode K. Ramakrishnan³, H. Howie Huang¹ - Show less +1 more•Institutions (3)

George Washington University¹, IBM², Rutgers University³

01 Jan 2014

TL;DR: An automated load balancer that can perform line-rate request redirection in a far more dynamic manner is proposed and how stream analytic techniques can be used to efficiently detect key hotspots are described.

...read moreread less

Abstract: Web services, large and small, use in-memory caches like memcached to lower database loads and quickly respond to user requests. These cache clusters are typically provisioned to support peak load, both in terms of request processing capabilities and cache storage size. This kind of worst-case provisioning can be very expensive (e.g., Facebook reportedly uses more than 10,000 servers for its cache cluster) and does not take advantage of the dynamic resource allocation and virtual machine provisioning capabilities found in modern public and private clouds. Further, there can be great diversity in both the workloads running on a cache cluster and the types of nodes that compose the cluster, making manual management difficult. This paper identifies the challenges in designing large-scale self-managing caches. Rather than requiring all cache clients to know the key to server mapping, we propose an automated load balancer that can perform line-rate request redirection in a far more dynamic manner. We describe how stream analytic techniques can be used to efficiently detect key hotspots. A controller then guides the load balancer’s key mapping and replication level to prevent overload, and automatically starts additional servers when needed.

...read moreread less

23 citations

Proceedings Article•

SmartSwitch: blurring the line between network infrastructure & cloud applications

[...]

Wei Zhang¹, Timothy Wood², Kadangode K. Ramakrishnan³, Jinho Hwang⁴•Institutions (4)

Beihang University¹, George Washington University², Rutgers University³, IBM⁴

17 Jun 2014

TL;DR: This work is developing an application aware networking platform that can perform not only basic packet switching, but also typical functions left to compute platforms such as load balancing based on application-level state, localized data caching, and even arbitrary computation.

...read moreread less

Abstract: A revolution is beginning in communication networks with the adoption of network function virtualization, which allows network services to be run on common off-the-shelf hardware--even in virtual machines--to increase flexibility and lower cost. An exciting prospect for cloud users is that these software-based network services can be merged with compute and storage resources to flexibly integrate all of the cloud's resources. We are developing an application aware networking platform that can perform not only basic packet switching, but also typical functions left to compute platforms such as load balancing based on application-level state, localized data caching, and even arbitrary computation. Our prototype "memcached-aware smart switch" reduces request latency by half and increases throughput by eight fold compared to Twitter's TwemProxy. We also describe how a Hadoop-aware switch could automatically cache data blocks near worker nodes, or perform some computation directly on the data stream. This approach enables a new breed of application designs that blur the line between the cloud's network and its servers.

...read moreread less

15 citations

Proceedings Article•DOI•

Topology Discovery and Service Classification for Distributed-Aware Clouds

[...]

Jinho Hwang¹, Guyue Liu², Sai Zeng¹, Frederick Y. Wu¹, Timothy Wood² - Show less +1 more•Institutions (2)

IBM¹, George Washington University²

11 Mar 2014

TL;DR: A light-weight topology detection system that maps distributed applications and a service classification algorithm that can determine not only overall application types, but individual VM roles as well are developed.

...read moreread less

Abstract: Cloud data centers are difficult to manage because providers have no knowledge of what applications are being run by customers or how they interact. As a consequence, current clouds provide minimal automated management functionality, passing the problem on to users who have access to even fewer tools since they lack insight into the underlying infrastructure. Ideally, the cloud platform, not the customer, should be managing data center resources in order to both use them efficiently and provide strong application-level performance and reliability guarantees. To do this, we believe that clouds must become "distibuted-aware" so that they can deduce the overall structure and dependencies within a client's distributed applications and use that knowledge to better guide management services. Towards this end we are developing a light-weight topology detection system that maps distributed applications and a service classification algorithm that can determine not only overall application types, but individual VM roles as well.

...read moreread less

15 citations

Proceedings Article•DOI•

UniCache: Hypervisor Managed Data Storage in RAM and Flash

[...]

Jinho Hwang¹, Wei Zhang², Ron C. Chiang³, Timothy Wood³, H. Howie Huang³ - Show less +1 more•Institutions (3)

IBM¹, Beihang University², George Washington University³

27 Jun 2014

TL;DR: UniCache is a system that provides a hypervisor managed volatile data store that can cache data either in hypervisor controlled main memory (hot data) or on Flash based storage (cold data).

...read moreread less

Abstract: Application and OS-level caches are crucial for hiding I/O latency and improving application performance. However, caches are designed to greedily consume memory, which can cause memory-hogging problems in a virtualized data centers since the hypervisor cannot tell for what a virtual machine uses its memory. A group of virtual machines may contain a wide range of caches: database query pools, memcached key-value stores, disk caches, etc., each of which would like as much memory as possible. The relative importance of these caches can vary significantly, yet system administrators currently have no easy way to dynamically manage the resources assigned to a range of virtual machine data caches in a unified way. To improve this situation, we have developed UniCache, a system that provides a hypervisor managed volatile data store that can cache data either in hypervisor controlled main memory (hot data) or on Flash based storage (cold data). We propose a two-level cache management system that uses a combination of recency information, object size, and a prediction of the cost to recover an object to guide its eviction algorithm. We have built a prototype of UniCache using Xen, and have evaluated its effectiveness in a shared environment where multiple virtual machines compete for storage resources.

...read moreread less

2 citations