scispace - formally typeset
Open AccessPosted Content

Scalable load balancing in networked systems: A survey of recent advances.

Reads0
Chats0
TLDR
It is demonstrated how Stochastic coupling techniques and stochastic-process limits play an instrumental role in establishing the asymptotic optimality and carries over to infinite-server settings, finite buffers, multiple dispatchers, servers arranged on graph topologies, and token-based load balancing including the popular Join-the-Idle-Queue (JIQ) scheme.
Abstract
The basic load balancing scenario involves a single dispatcher where tasks arrive that must immediately be forwarded to one of $N$ single-server queues. We discuss recent advances on scalable load balancing schemes which provide favorable delay performance when $N$ grows large, and yet only require minimal implementation overhead. Join-the-Shortest-Queue (JSQ) yields vanishing delays as $N$ grows large, as in a centralized queueing arrangement, but involves a prohibitive communication burden. In contrast, power-of-$d$ or JSQ($d$) schemes that assign an incoming task to a server with the shortest queue among $d$ servers selected uniformly at random require little communication, but lead to constant delays. In order to examine this fundamental trade-off between delay performance and implementation overhead, we consider JSQ($d(N)$) schemes where the diversity parameter $d(N)$ depends on $N$ and investigate what growth rate of $d(N)$ is required to asymptotically match the optimal JSQ performance on fluid and diffusion scale. Stochastic coupling techniques and stochastic-process limits play an instrumental role in establishing the asymptotic optimality. We demonstrate how this methodology carries over to infinite-server settings, finite buffers, multiple dispatchers, servers arranged on graph topologies, and token-based load balancing including the popular Join-the-Idle-Queue (JIQ) scheme. In this way we provide a broad overview of the many recent advances in the field. This survey extends the short review presented at ICM 2018 (arXiv:1712.08555).

read more

Citations
More filters
Journal ArticleDOI

Join-the-Shortest Queue Diffusion Limit in Halfin-Whitt Regime: Tail Asymptotics and Scaling of Extrema

TL;DR: In this article, a detailed analysis of the steady state of the above diffusion process is presented, where the authors establish precise tail-asymptotics of the stationary distribution and scaling of extrema of the process on large time-interval.
Journal ArticleDOI

Power-of-d-Choices with Memory: Fluid Limit and Optimality

TL;DR: This paper considers the power-of-d-choice algorithm with the addition of a local memory that keeps track of the latest observations collected over time on the sampled servers, and shows that this algorithm is asymptotically optimal in the sense that the load balancer can always assign each job to an idle server in the large-system limit.

A law of large numbers for m/m/c/delayoff-setup queues with nonstationary arrivals

TL;DR: A weak law of large numbers or fluid limit theorem for the queue length and server processes as the number of arrivals and number of servers tends to infinity is proved.
Journal ArticleDOI

Transform Methods for Heavy-Traffic Analysis

TL;DR: This paper uses the use of transform techniques for heavy-traffic analysis and the moment-generating function method to obtain the stationary distribution of scaled queue lengths in heavy traffic in queuing systems that satisfy the complete resource pooling condition.
Dissertation

Resource management in computer clusters : algorithm design and performance analysis

Céline Comte
TL;DR: A scheduling algorithm that extends the principle of round-robin to a cluster where each incoming job is assigned to a pool of computers by which it can subsequently be processed in parallel and a load-balancing algorithm based on tokens for clusters where jobs have assignment constraints are proposed.
References
More filters
Journal ArticleDOI

The power of two choices in randomized load balancing

TL;DR: This work uses a limiting, deterministic model representing the behavior as n/spl rarr//spl infin/ to approximate the behavior of finite systems and provides simulations that demonstrate that the method accurately predicts system behavior, even for relatively small systems.
Journal ArticleDOI

Cuckoo hashing

TL;DR: In this paper, a simple dictionary with worst case constant lookup time was presented, equaling the theoretical performance of the classic dynamic perfect hashing scheme of Dietzfelbinger et al.
MonographDOI

Random Graphs and Complex Networks

TL;DR: This chapter explains why many real-world networks are small worlds and have large fluctuations in their degrees, and why Probability theory offers a highly effective way to deal with the complexity of networks, and leads us to consider random graphs.
Journal ArticleDOI

Balanced Allocations

TL;DR: It is shown that with high probability, the fullest box contains only ln ln n/ln 2 + O(1) balls---exponentially less than before and a similar gap exists in the infinite process, where at each step one ball, chosen uniformly at random, is deleted, and one ball is added in the manner above.
Journal ArticleDOI

Heavy-Traffic Limits for Queues with Many Exponential Servers

Shlomo Halfin, +1 more
- 01 Jun 1981 - 
TL;DR: In this article, two different kinds of heavy-traffic limit theorems have been proved for s-server queues: the first involves a sequence of queueing systems having a fixed number of servers with an associated sequence of traffic intensities that converges to the critical value of one from below.
Related Papers (5)