Scalable load balancing in networked systems: A survey of recent advances.

Open Access, Posted Content

Mark van der Boor, +3 more

14 Jun 2018

arXiv: Probability

TLDR

Stochastic coupling techniques and stochastic-process limits are shown to play an instrumental role in establishing asymptotic optimality, and the methodology is shown to carry over to infinite-server settings, finite buffers, multiple dispatchers, servers arranged on graph topologies, and token-based load balancing, including the popular Join-the-Idle-Queue (JIQ) scheme.

Abstract:

The basic load balancing scenario involves a single dispatcher where tasks arrive that must immediately be forwarded to one of $N$ single-server queues. We discuss recent advances on scalable load balancing schemes which provide favorable delay performance when $N$ grows large, and yet only require minimal implementation overhead. Join-the-Shortest-Queue (JSQ) yields vanishing delays as $N$ grows large, as in a centralized queueing arrangement, but involves a prohibitive communication burden. In contrast, power-of-$d$ or JSQ($d$) schemes that assign an incoming task to a server with the shortest queue among $d$ servers selected uniformly at random require little communication, but lead to constant delays. In order to examine this fundamental trade-off between delay performance and implementation overhead, we consider JSQ($d(N)$) schemes where the diversity parameter $d(N)$ depends on $N$ and investigate what growth rate of $d(N)$ is required to asymptotically match the optimal JSQ performance on fluid and diffusion scale. Stochastic coupling techniques and stochastic-process limits play an instrumental role in establishing the asymptotic optimality. We demonstrate how this methodology carries over to infinite-server settings, finite buffers, multiple dispatchers, servers arranged on graph topologies, and token-based load balancing including the popular Join-the-Idle-Queue (JIQ) scheme. In this way we provide a broad overview of the many recent advances in the field. This survey extends the short review presented at ICM 2018 (arXiv:1712.08555).
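
To make the JSQ($d$) assignment rule described in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. The function names `jsq_d` and `dispatch` are hypothetical, and two details are assumptions the abstract does not fix: the $d$ servers are sampled without replacement, and ties are broken in favor of the first sampled server. Setting $d = N$ recovers ordinary JSQ, while $d = 1$ gives purely random assignment.

```python
import random


def jsq_d(queue_lengths, d, rng=random):
    """Return the index of the server chosen by a JSQ(d) rule.

    queue_lengths: list of current queue lengths, one entry per server.
    d: number of servers to probe (1 <= d <= N).
    rng: object with a sample() method (the random module or a random.Random).
    """
    n = len(queue_lengths)
    if not 1 <= d <= n:
        raise ValueError("d must satisfy 1 <= d <= N")
    # Assumption: probe d distinct servers chosen uniformly at random.
    candidates = rng.sample(range(n), d)
    # Join the shortest queue among the probed servers.
    # Assumption: ties go to the first probed server (min keeps the first minimum).
    return min(candidates, key=lambda i: queue_lengths[i])


def dispatch(queue_lengths, d, rng=random):
    """Assign one arriving task in place and return the chosen server index."""
    target = jsq_d(queue_lengths, d, rng)
    queue_lengths[target] += 1
    return target
```

With $d = N$ every server is probed, so each arrival costs $N$ queue-length lookups; with fixed $d$ the cost per arrival stays constant as $N$ grows. This is the communication-versus-delay trade-off the abstract studies through the growth rate of $d(N)$.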
