Restless bandits: activity allocation in a changing world

doi:10.2307/3214163

Journal ArticleDOI

Restless bandits: activity allocation in a changing world

Peter Whittle

- 01 Jan 1988 -

Journal of Applied Probability

- Vol. 25, pp 287-298

Chats0

TLDR

In this article, the Lagrange multiplier associated with this constraint defines an index which reduces to the Gittins index when projects not being operated are static, and arguments are advanced to support the conjecture that, for m and n large in constant ratio, the policy of operating the m projects of largest current index is nearly optimal.

Abstract:

We consider a population of n projects which in general continue to evolve whether in operation or not (although by different rules). It is desired to choose the projects in operation at each instant of time so as to maximise the expected rate of reward, under a constraint upon the expected number of projects in operation. The Lagrange multiplier associated with this constraint defines an index which reduces to the Gittins index when projects not being operated are static. If one is constrained to operate m projects exactly then arguments are advanced to support the conjecture that, for m and n large in constant ratio, the policy of operating the m projects of largest current index is nearly optimal. The index is evaluated for some particular projects.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Survey A survey of computational complexity results in systems and control

Vincent D. Blondel, +1 more

- 01 Sep 2000 -

Automatica

TL;DR: This paper considers problems related to stability or stabilizability of linear systems with parametric uncertainty, robust control, time-varying linear systems, nonlinear and hybrid systems, and stochastic optimal control.

...read moreread less

Journal ArticleDOI

The description-experience gap in risky choice

Ralph Hertwig, +1 more

- 01 Dec 2009 -

Trends in Cognitive Sciences

TL;DR: Converging findings show that when people make decisions based on experience, rare events tend to have less impact than they deserve according to their objective probabilities.

...read moreread less

Journal ArticleDOI

The Complexity of Optimal Queuing Network Control

Christos H. Papadimitriou, +1 more

- 01 May 1999 -

Mathematics of Operations Research

TL;DR: It is shown that several versions of the problem of optimally controlling a simple network of queues with simple arrival and service distributions and multiple customer classes is complete for exponential time.

...read moreread less

Journal ArticleDOI

A modern Bayesian look at the multi-armed bandit

Steven L. Scott

- 01 Nov 2010 -

Applied Stochastic Models in Business an...

TL;DR: A heuristic for managing multi-armed bandits called randomized probability matching is described, which randomly allocates observations to arms according the Bayesian posterior probability that each arm is optimal.

...read moreread less

Journal ArticleDOI

The Psychology and Neuroscience of Curiosity.

Celeste Kidd, +1 more

- 04 Nov 2015 -

Neuron

TL;DR: It is proposed that, rather than worry about defining curiosity, it is more helpful to consider the motivations for information-seeking behavior and to study it in its ethological context.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Posted Content

Bandit processes and dynamic allocation indices

J.C. Gittens

- 10 Dec 2010 -

Research Papers in Economics

Journal ArticleDOI

Bandit Processes and Dynamic Allocation Indices

J. C. Gittins

- 01 Jan 1979 -

Journal of the royal statistical society...

Journal ArticleDOI

Multi-Armed Bandits and the Gittins Index

Peter Whittle

- 01 Jan 1980 -

Journal of the royal statistical society...

Book

Systems in stochastic equilibrium

Peter Whittle

TL;DR: Preface Basic Material Abundance and Transfer Models Network Models Bonding Models: Polymerisation and Random Graphs Spatial Models, Random Fields Appendices.

...read moreread less

Journal ArticleDOI

Arm-Acquiring Bandits

Peter Whittle

- 01 Apr 1981 -

Annals of Probability

TL;DR: In this article, the problem of allocating effort between projects at different stages of development when new projects are also continually appearing is considered, and an expression for the expected reward yielded by the Gittins index policy is derived.

...read moreread less