Proceedings ArticleDOI

Presto: Edge-based Load Balancing for Fast Datacenter Networks

TLDR
Presto, a soft-edge load balancing scheme, is designed and implemented; its performance closely tracks that of a single, non-blocking switch over many workloads, and it adapts to failures and topology asymmetry.
Abstract
Datacenter networks deal with a variety of workloads, ranging from latency-sensitive small flows to bandwidth-hungry large flows. Load balancing schemes based on flow hashing, e.g., ECMP, cause congestion when hash collisions occur and can perform poorly in asymmetric topologies. Recent proposals to load balance the network require centralized traffic engineering, multipath-aware transport, or expensive specialized hardware. We propose a mechanism that avoids these limitations by (i) pushing load-balancing functionality into the soft network edge (e.g., virtual switches) such that no changes are required in the transport layer, customer VMs, or networking hardware, and (ii) load balancing on fine-grained, near-uniform units of data (flowcells) that fit within end-host segment offload optimizations used to support fast networking speeds. We design and implement such a soft-edge load balancing scheme, called Presto, and evaluate it on a 10 Gbps physical testbed. We demonstrate the computational impact of packet reordering on receivers and propose a mechanism to handle reordering in the TCP receive offload functionality. Presto's performance closely tracks that of a single, non-blocking switch over many workloads and is adaptive to failures and topology asymmetry.
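The flowcell mechanism in the abstract can be illustrated with a small sketch: a flow's byte stream is chopped into near-uniform cells that fit within segment-offload limits, and successive cells are spread across the available paths. This is a simplification for illustration, not the authors' implementation; the 64 KB cell size matches the maximum TSO segment size the paper builds on, and the function name is ours.

```python
# Illustrative sketch of flowcell-based load balancing (not Presto's code).
# A flow is split into near-uniform 64 KB flowcells, the unit that fits
# within end-host segment offload (TSO) optimizations, and the cells are
# spread round-robin across the available network paths.
FLOWCELL_BYTES = 64 * 1024

def assign_flowcells(flow_bytes, paths):
    """Return (offset, length, path) tuples spreading a flow over paths."""
    assignments = []
    offset, i = 0, 0
    while offset < flow_bytes:
        length = min(FLOWCELL_BYTES, flow_bytes - offset)
        assignments.append((offset, length, paths[i % len(paths)]))
        offset += length
        i += 1
    return assignments
```

Because every cell except possibly the last has the same size, load spreads near-uniformly regardless of flow size, unlike flow hashing (ECMP), where one hash collision pins two large flows to the same path.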



Citations
More filters
Proceedings ArticleDOI

HULA: Scalable Load Balancing Using Programmable Data Planes

TL;DR: HULA is presented, a data-plane load-balancing algorithm that outperforms a scalable extension to CONGA in average flow completion time; it is designed for emerging programmable switches and programmed in P4 to demonstrate that HULA can run on such programmable chipsets without requiring custom hardware.
Proceedings ArticleDOI

Re-architecting datacenter networks and stacks for low latency and high performance

TL;DR: NDP, a novel data-center transport architecture that achieves near-optimal completion times for short transfers and high flow throughput in a wide range of scenarios, including incast, is presented.
Proceedings ArticleDOI

Homa: a receiver-driven low-latency transport protocol using network priorities

TL;DR: Homa uses in-network priority queues to ensure low latency for short messages; priority allocation is managed dynamically by each receiver and integrated with a receiver-driven flow control mechanism.
Proceedings ArticleDOI

DRILL: Micro Load Balancing for Low-latency Data Center Networks

TL;DR: DRILL is presented, a datacenter fabric for Clos networks which performs micro load balancing to distribute load as evenly as possible on microsecond timescales and addresses the resulting key challenges of packet reordering and topological asymmetry.
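The per-packet port selection that micro load balancing relies on can be sketched in the style of the classic power-of-two-choices rule: sample a couple of random output ports, also consider the best port from the previous decision, and send the packet to whichever has the shortest queue. This is a simplified illustration of the general technique, with names and parameters of our choosing, not DRILL's actual switch logic.

```python
import random

def choose_port(queue_len, d=2, memory=None):
    """Power-of-d-choices port selection over per-port queue lengths.

    Samples d distinct random ports, optionally adds the remembered best
    port from the previous decision, and returns the index of the port
    with the shortest queue among the candidates.
    """
    candidates = random.sample(range(len(queue_len)), d)
    if memory is not None:
        candidates.append(memory)
    return min(candidates, key=lambda p: queue_len[p])
```

Sampling only a few queues per packet keeps the decision cheap enough for microsecond timescales while still steering packets away from momentarily congested ports.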
Proceedings ArticleDOI

Resilient Datacenter Load Balancing in the Wild

TL;DR: Hermes is a datacenter load balancer that is resilient to the aforementioned uncertainties: under asymmetries, Hermes achieves up to 10% and 20% better flow completion time than CONGA and CLOVE; under switch failures, it outperforms all other schemes by over 32%.
References
More filters
Posted Content

A Quantitative Measure Of Fairness And Discrimination For Resource Allocation In Shared Computer Systems

TL;DR: A quantitative measure called the Index of Fairness, applicable to any resource sharing or allocation problem; it is independent of the amount of the resource, and its boundedness aids intuitive understanding of the fairness index.
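The measure summarized above is Jain's fairness index: for an allocation x₁…xₙ it is (Σxᵢ)² / (n·Σxᵢ²), which ranges from 1/n (one user receives everything) to 1 (a perfectly equal allocation). A minimal sketch:

```python
def jain_index(allocations):
    """Jain's fairness index: (sum x)^2 / (n * sum x^2).

    Returns a value in [1/n, 1]: 1 means all n users receive an equal
    share; 1/n means a single user receives the entire resource.
    """
    n = len(allocations)
    total = sum(allocations)
    sum_sq = sum(x * x for x in allocations)
    return (total * total) / (n * sum_sq)
```

For example, four equal shares yield an index of 1.0, while one user taking everything among four yields 0.25.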
Journal ArticleDOI

A scalable, commodity data center network architecture

TL;DR: This paper shows how to leverage largely commodity Ethernet switches to support the full aggregate bandwidth of clusters consisting of tens of thousands of elements and argues that appropriately architected and interconnected commodity switches may deliver more performance at less cost than available from today's higher-end solutions.
Proceedings ArticleDOI

VL2: a scalable and flexible data center network

TL;DR: VL2 is a practical network architecture that scales to support huge data centers with uniform high capacity between servers, performance isolation between services, and Ethernet layer-2 semantics, and is built on a working prototype.
Proceedings ArticleDOI

Network traffic characteristics of data centers in the wild

TL;DR: An empirical study of the network traffic in 10 data centers belonging to three different categories, including university, enterprise campus, and cloud data centers, which includes not only data centers employed by large online service providers offering Internet-facing applications but also data centers used to host data-intensive (MapReduce style) applications.
Journal ArticleDOI

CUBIC: a new TCP-friendly high-speed TCP variant

TL;DR: The CUBIC protocol modifies the linear window growth function of existing TCP standards to be a cubic function in order to improve the scalability of TCP over fast and long distance networks.