The Linear Programming Approach to Approximate Dynamic Programming
TLDR
In this article, an efficient method based on linear programming is proposed for approximating solutions to large-scale stochastic control problems; error bounds guide the choice of basis functions and state-relevance weights, and experiments on queueing network control support the methodology.

Abstract
The curse of dimensionality gives rise to prohibitive computational requirements that render infeasible the exact solution of large-scale stochastic control problems. We study an efficient method based on linear programming for approximating solutions to such problems. The approach "fits" a linear combination of pre-selected basis functions to the dynamic programming cost-to-go function. We develop error bounds that offer performance guarantees and also guide the selection of both basis functions and "state-relevance weights" that influence the quality of the approximation. Experimental results in the domain of queueing network control provide empirical support for the methodology.
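To make the abstract's recipe concrete, here is a minimal sketch of the approximate linear program (ALP) for a small discounted-cost MDP. All model data (the transition kernels P, stage costs g, basis matrix Phi, and state-relevance weights c) are illustrative placeholders, not data from the paper.

```python
# Minimal sketch of the approximate linear program (ALP) for a small
# discounted-cost MDP.  All arrays below are illustrative placeholders.
import numpy as np
from scipy.optimize import linprog

n_states, n_actions, n_basis = 50, 3, 5
alpha = 0.95                                   # discount factor
rng = np.random.default_rng(0)

# Random illustrative MDP: P[a] is the transition matrix under action a.
P = rng.random((n_actions, n_states, n_states))
P /= P.sum(axis=2, keepdims=True)
g = rng.random((n_actions, n_states))          # per-stage costs g_a(x)

Phi = rng.random((n_states, n_basis))          # pre-selected basis functions
c = np.full(n_states, 1.0 / n_states)          # state-relevance weights

# ALP: maximize c' Phi r  subject to  (Phi r)(x) <= g_a(x) + alpha*(P_a Phi r)(x)
# for every state-action pair, i.e. Phi r is dominated by the Bellman operator.
A_ub = np.vstack([Phi - alpha * P[a] @ Phi for a in range(n_actions)])
b_ub = np.concatenate([g[a] for a in range(n_actions)])

res = linprog(-(c @ Phi), A_ub=A_ub, b_ub=b_ub,
              bounds=[(None, None)] * n_basis)  # r is unrestricted in sign
r = res.x
J_tilde = Phi @ r                               # fitted cost-to-go approximation
print("ALP optimal weights:", r)
```

With Phi equal to the identity this reduces to the classical exact LP formulation of dynamic programming; the gain from the ALP is that the number of variables drops from the size of the state space to the number of basis functions, though one constraint per state-action pair remains.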
Citations
Journal Article
Reinforcement learning and adaptive dynamic programming for feedback control
Frank L. Lewis, Draguna Vrabie, et al.
TL;DR: This work describes mathematical formulations for reinforcement learning and a practical implementation method known as adaptive dynamic programming, which together give insight into the design of controllers for man-made engineered systems that both learn and exhibit optimal behavior.
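As one concrete instance of such an adaptive-dynamic-programming loop, the sketch below runs policy iteration on a discrete-time LQR problem (a Kleinman-style iteration). The system matrices are illustrative, and a model-based Lyapunov solve stands in for the data-driven policy-evaluation step an adaptive controller would use.

```python
# Sketch of policy iteration for discrete-time LQR, the prototypical
# adaptive-dynamic-programming loop.  Matrices A, B, Q, R are illustrative.
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

A = np.array([[0.9, 0.2], [0.0, 0.8]])   # open-loop stable, so K = 0 is admissible
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])

K = np.zeros((1, 2))                      # initial stabilizing policy u = -K x
for _ in range(50):
    Ac = A - B @ K
    # Policy evaluation: cost matrix P solves Ac' P Ac - P + Q + K' R K = 0.
    P = solve_discrete_lyapunov(Ac.T, Q + K.T @ R @ K)
    # Policy improvement: greedy gain with respect to the evaluated cost.
    K_new = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    if np.allclose(K_new, K, atol=1e-10):
        break
    K = K_new
print("Converged LQR gain:", K)
```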
Book
Algorithms for Reinforcement Learning
TL;DR: This book focuses on reinforcement learning algorithms that build on the powerful theory of dynamic programming; it gives a fairly comprehensive catalog of learning problems and describes the core ideas, followed by a discussion of their theoretical properties and limitations.
Proceedings Article
Relative entropy policy search
TL;DR: The Relative Entropy Policy Search (REPS) method is suggested, which differs significantly from previous policy gradient approaches, yields an exact update step, and works well on typical reinforcement learning benchmark problems.
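The heart of REPS is a KL-bounded reweighting of sampled trajectories: samples are weighted by exp(R/eta), with the temperature eta obtained by minimizing a convex dual. A minimal sketch of the episodic version, in which the returns, the KL bound epsilon, and the use of scipy's scalar minimizer are all illustrative assumptions:

```python
# Minimal sketch of the episodic REPS reweighting step.  Sampled returns
# and the KL bound epsilon are illustrative placeholders.
import numpy as np
from scipy.optimize import minimize_scalar

rng = np.random.default_rng(1)
returns = rng.normal(size=200)      # returns R_i of sampled trajectories
epsilon = 0.5                       # KL bound between successive policies

def dual(eta):
    # REPS dual: g(eta) = eta*epsilon + eta*log mean_i exp(R_i / eta)
    z = returns / eta
    z -= z.max()                    # stabilize the log-sum-exp
    return eta * epsilon + eta * (np.log(np.mean(np.exp(z))) + returns.max() / eta)

res = minimize_scalar(dual, bounds=(1e-6, 1e6), method="bounded")
eta = res.x
w = np.exp((returns - returns.max()) / eta)
w /= w.sum()                        # weights for the subsequent policy-fitting step
print("temperature:", eta, "effective sample size:", 1.0 / np.sum(w**2))
```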
Dissertation
On the Sample Complexity of Reinforcement Learning
TL;DR: Novel algorithms with more restricted guarantees are suggested whose sample complexities are again independent of the size of the state space and depend linearly on the complexity of the policy class, with only a polynomial dependence on the horizon time.
Journal ArticleDOI
Robust Dynamic Programming
TL;DR: It is proved that when the set of transition measures has a certain "rectangularity" property, all of the main results for finite- and infinite-horizon DP extend to natural robust counterparts.
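Rectangularity lets the adversary choose a worst-case transition measure independently for each state-action pair, so the robust Bellman backup decomposes. A minimal sketch of robust value iteration, assuming the uncertainty set is a finite collection of candidate transition kernels (illustrative data):

```python
# Sketch of robust value iteration under an (s,a)-rectangular uncertainty set,
# here a finite set of candidate transition kernels.  All data is illustrative.
import numpy as np

rng = np.random.default_rng(2)
nS, nA, nK = 10, 2, 4               # states, actions, candidate kernels
alpha = 0.9

# P[k, a] is the k-th candidate transition matrix for action a.
P = rng.random((nK, nA, nS, nS))
P /= P.sum(axis=3, keepdims=True)
g = rng.random((nA, nS))            # per-stage costs

V = np.zeros(nS)
for _ in range(500):
    # Rectangularity: the inner max over kernels is taken per (s, a).
    worst = (P @ V).max(axis=0)     # shape (nA, nS): worst-case expected cost-to-go
    V_new = (g + alpha * worst).min(axis=0)
    if np.max(np.abs(V_new - V)) < 1e-9:
        break
    V = V_new
print("Robust value function:", V)
```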
References
Journal Article
Linear Programming and Markov Decision Chains
TL;DR: It is shown that for a finite Markov decision process an average optimal policy can be found by solving only one linear programming problem.
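For a unichain cost-minimizing MDP, that single linear program can be written with variables (lambda, h): maximize lambda subject to lambda + h(x) <= g_a(x) + sum_y P_a(x,y) h(y) for every state-action pair. A minimal sketch with randomly generated illustrative data:

```python
# Sketch of the single linear program for an average-cost MDP (unichain case):
# maximize lambda  s.t.  lambda + h(x) <= g_a(x) + sum_y P_a(x,y) h(y).
# Model data is randomly generated for illustration.
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(3)
nS, nA = 8, 3
P = rng.random((nA, nS, nS))
P /= P.sum(axis=2, keepdims=True)
g = rng.random((nA, nS))

# Decision vector z = (lambda, h[0], ..., h[nS-1]); linprog minimizes,
# so we minimize -lambda.
c = np.zeros(nS + 1)
c[0] = -1.0
# Constraint rows: lambda + h(x) - (P_a h)(x) <= g_a(x).
I = np.eye(nS)
A_ub = np.vstack([np.hstack([np.ones((nS, 1)), I - P[a]]) for a in range(nA)])
b_ub = np.concatenate([g[a] for a in range(nA)])
bounds = [(None, None)] * (nS + 1)
bounds[1] = (0.0, 0.0)              # pin h(0) = 0; h is only unique up to a shift
res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds)
lam, h = res.x[0], res.x[1:]
print("optimal average cost:", lam)
```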
Journal Article
A convex analytic approach to Markov decision processes
TL;DR: In this article, the authors developed a new framework for the study of Markov decision processes in which the control problem is viewed as an optimization problem on the set of canonically induced measures on the trajectory space of the joint state and control process.
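In the discounted case, this measure-space view yields a linear program over discounted state-action occupation measures mu(x, a). A minimal sketch, with illustrative random data:

```python
# Sketch of the occupation-measure LP for a discounted MDP: variables mu(x,a),
# with balance constraints tying mu to the initial distribution nu.
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(4)
nS, nA, alpha = 6, 2, 0.9
P = rng.random((nA, nS, nS))
P /= P.sum(axis=2, keepdims=True)
g = rng.random((nA, nS))
nu = np.full(nS, 1.0 / nS)          # initial state distribution

# Flatten mu as mu[a * nS + x].  One equality constraint per state y:
#   sum_a mu(y,a) - alpha * sum_{x,a} P_a(x,y) mu(x,a) = (1 - alpha) nu(y)
A_eq = np.zeros((nS, nA * nS))
for a in range(nA):
    A_eq[:, a * nS:(a + 1) * nS] = np.eye(nS) - alpha * P[a].T
b_eq = (1 - alpha) * nu
cost = np.concatenate([g[a] for a in range(nA)])

res = linprog(cost, A_eq=A_eq, b_eq=b_eq, bounds=[(0, None)] * (nA * nS))
mu = res.x.reshape(nA, nS)
# Read off a policy from the support of mu (arbitrary at unvisited states).
policy = mu.argmax(axis=0)
print("objective:", res.fun, "= (1 - alpha) * expected discounted cost")
```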
Journal Article
Performance of Multiclass Markovian Queueing Networks Via Piecewise Linear Lyapunov Functions
TL;DR: In this article, a general methodology based on Lyapunov functions was proposed for the performance analysis of infinite-state Markov chains and applied specifically to Markovian multiclass queueing networks.
Proceedings Article
High-Performance Job-Shop Scheduling With A Time-Delay TD(λ) Network
Wei Zhang, Thomas G. Dietterich, et al.
TL;DR: Experimental tests show that this TDNN-TD(λ) network can match the performance of the previous hand-engineered system, and both neural network approaches significantly outperform the best previous (non-learning) solution to this problem.
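The learning rule in such a system is TD(λ) with eligibility traces. A minimal sketch of the linear-feature version evaluating a fixed policy on a toy random walk (all of it illustrative, not the paper's job-shop network):

```python
# Sketch of TD(lambda) with eligibility traces and linear features, evaluating
# a fixed policy on a toy 5-state random walk.  Everything here is illustrative.
import numpy as np

rng = np.random.default_rng(5)
nS, gamma, lam, alpha = 5, 1.0, 0.8, 0.05
phi = np.eye(nS)                    # one-hot features (tabular case)
theta = np.zeros(nS)                # value-function weights

for _ in range(2000):
    s = nS // 2                     # start each episode in the middle
    e = np.zeros(nS)                # eligibility trace
    while True:
        s2 = s + rng.choice([-1, 1])
        # Reward 1 for exiting on the right, 0 otherwise; exits are terminal.
        done = s2 < 0 or s2 >= nS
        r = 1.0 if s2 >= nS else 0.0
        v_next = 0.0 if done else theta @ phi[s2]
        delta = r + gamma * v_next - theta @ phi[s]
        e = gamma * lam * e + phi[s]        # accumulate the trace
        theta += alpha * delta * e          # TD(lambda) update
        if done:
            break
        s = s2
print("estimated values:", theta)   # true values are (i + 1) / 6 for i = 0..4
```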
Journal Article
On Linear Programming in a Markov Decision Problem
TL;DR: In this article, a Markov decision problem with an infinite planning horizon and no discounting is treated, and the model is analyzed by application, perhaps repeated, of a simple linear program.