Journal ArticleDOI
VL2: a scalable and flexible data center network
Albert Greenberg,James R. Hamilton,Navendu Jain,Srikanth Kandula,Changhoon Kim,Parantap Lahiri,David A. Maltz,Parveen Patel,Sudipta Sengupta +8 more
TLDR
VL2 is a practical network architecture that scales to support huge data centers with uniform high capacity between servers, performance isolation between services, and Ethernet layer-2 semantics and can be deployed today, and a working prototype is built.Abstract:
To be agile and cost effective, data centers must allow dynamic resource allocation across large server pools. In particular, the data center network should provide a simple flat abstraction: it should be able to take any set of servers anywhere in the data center and give them the illusion that they are plugged into a physically separate, noninterfering Ethernet switch with as many ports as the service needs. To meet this goal, we present VL2, a practical network architecture that scales to support huge data centers with uniform high capacity between servers, performance isolation between services, and Ethernet layer-2 semantics. VL2 uses (1) flat addressing to allow service instances to be placed anywhere in the network, (2) Valiant Load Balancing to spread traffic uniformly across network paths, and (3) end system--based address resolution to scale to large server pools without introducing complexity to the network control plane. VL2's design is driven by detailed measurements of traffic and fault data from a large operational cloud service provider. VL2's implementation leverages proven network technologies, already available at low cost in high-speed hardware implementations, to build a scalable and reliable network architecture. As a result, VL2 networks can be deployed today, and we have built a working prototype. We evaluate the merits of the VL2 design using measurement, analysis, and experiments. Our VL2 prototype shuffles 2.7 TB of data among 75 servers in 395 s---sustaining a rate that is 94% of the maximum possible.read more
Citations
More filters
Journal ArticleDOI
RingCube - An incrementally scale-out optical interconnect for cloud computing data center
TL;DR: A scalable all-optical interconnect named RingCube is proposed to solve the scalability issues of the centralized optical switching architecture by embedding hypercube into ring topology and utilizing the multi-wavelength communication strategy.
Dissertation
User-Centric Traffic Engineering in Software Defined Networks
TL;DR: This paper presents a meta-modelling architecture suitable for dynamic control plane management in software-defined networks (SDN) using OpenFlow, and some examples show how this architecture can be modified for mobile devices.
Book ChapterDOI
Communication Aspects of Resource Management in Hybrid Clouds
TL;DR: The authors depict the network as a fundamental component to provide quality of service, discussing its influence in the hybrid cloud management and resource allocation and present the uncertainty in the network channels as a problem to be tackled to avoid application delays and unexpected costs from the leasing of public cloud resources.
Posted Content
Latency and Throughput Optimization in Modern Networks: A Comprehensive Survey.
TL;DR: This paper surveys major attempts on reducing latency and increasing the throughput on different networks and surroundings such as wired networks, wireless networks, application layer transport control, Remote Direct Memory Access, and machine learning based transport control.
Proceedings ArticleDOI
MegTaiChi
Zhongzhe Hu,Jun-Yi Xiao,Zheye Deng,Mingyi Li,Kewei Zhang,Xiaoyang Zhang,Ke Meng,Ninghui Sun,Guangming Tan +8 more
TL;DR: In this paper , the authors proposed a dynamic tensor-based memory management optimization module for the DNN training, which first achieves an efficient coordination of tensor partition and tensor rematerialization.
References
More filters
Journal ArticleDOI
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
Journal ArticleDOI
A scalable, commodity data center network architecture
TL;DR: This paper shows how to leverage largely commodity Ethernet switches to support the full aggregate bandwidth of clusters consisting of tens of thousands of elements and argues that appropriately architected and interconnected commodity switches may deliver more performance at less cost than available from today's higher-end solutions.
Book
Principles and Practices of Interconnection Networks
William J. Dally,Brian Towles +1 more
TL;DR: This book offers a detailed and comprehensive presentation of the basic principles of interconnection network design, clearly illustrating them with numerous examples, chapter exercises, and case studies, allowing a designer to see all the steps of the process from abstract design to concrete implementation.
Journal ArticleDOI
The part-time parliament
TL;DR: The Paxon parliament's protocol provides a new way of implementing the state machine approach to the design of distributed systems.
Proceedings Article
The Art of Computer Systems Performance Analysis.
TL;DR: The authors' goal is always to offer you an assortment of cost-free ebooks too as aid resolve your troubles.