Open Access · Journal Article · DOI

The Linear Programming Approach to Approximate Dynamic Programming

TL;DR
In this article, an efficient method based on linear programming is proposed for approximating solutions to large-scale stochastic control problems, with error bounds that guide the selection of basis functions and state-relevance weights, and with empirical support from queueing network control.
Abstract
The curse of dimensionality gives rise to prohibitive computational requirements that render infeasible the exact solution of large-scale stochastic control problems. We study an efficient method based on linear programming for approximating solutions to such problems. The approach "fits" a linear combination of pre-selected basis functions to the dynamic programming cost-to-go function. We develop error bounds that offer performance guarantees and also guide the selection of both basis functions and "state-relevance weights" that influence quality of the approximation. Experimental results in the domain of queueing network control provide empirical support for the methodology.
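As a concrete illustration of the method the abstract describes — a minimal sketch, assuming a hypothetical tiny MDP with random transition matrices and stage costs rather than anything from the paper — the approximate LP chooses basis weights r so that the fitted function Φr satisfies the Bellman inequalities:

```python
import numpy as np
from scipy.optimize import linprog

# Hypothetical 3-state, 2-action discounted-cost MDP (illustration only).
n_states, n_actions, alpha = 3, 2, 0.9
rng = np.random.default_rng(0)
P = rng.dirichlet(np.ones(n_states), size=(n_actions, n_states))  # P[a, x, :] is row-stochastic
g = rng.random((n_actions, n_states))                             # stage costs g[a, x]

# Pre-selected basis functions (columns of Phi) and state-relevance weights c.
Phi = np.column_stack([np.ones(n_states), np.arange(n_states, dtype=float)])
c = np.full(n_states, 1.0 / n_states)

# Approximate LP:  max c^T Phi r  s.t.  Phi r <= g_a + alpha * P_a Phi r  for every action a.
# linprog minimizes, so the objective is negated; the weights r are free variables.
A_ub = np.vstack([Phi - alpha * P[a] @ Phi for a in range(n_actions)])
b_ub = np.concatenate([g[a] for a in range(n_actions)])
res = linprog(-(Phi.T @ c), A_ub=A_ub, b_ub=b_ub,
              bounds=[(None, None)] * Phi.shape[1])
print("basis weights r:", res.x)
print("approximate cost-to-go Phi r:", Phi @ res.x)
```

Each row of A_ub encodes one Bellman inequality (Φr)(x) ≤ g(x,a) + α·Σ_y P_a(x,y)(Φr)(y); the state-relevance weights c determine which states the fit emphasizes.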



Citations
Proceedings Article

Towards exploiting duality in approximate linear programming for MDPs

TL;DR: This paper proposes an LP formulation, called a composite ALP, that approximates both the primal and the dual optimization variables (the value function and the occupation measure), which is equivalent to approximating both the objective functions and the feasible regions of the two LPs.
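For orientation (this is the standard exact-LP/dual pair for a discounted MDP, not the paper's composite ALP itself): with state-relevance weights c, stage costs g, and discount factor α,

```latex
\begin{aligned}
\text{primal:}\quad & \max_{J}\ c^{\top} J
  \quad \text{s.t.}\quad J(x) \le g(x,a) + \alpha \sum_{y} P(y \mid x,a)\, J(y) \quad \forall x,a,\\
\text{dual:}\quad & \min_{\mu \ge 0}\ \sum_{x,a} \mu(x,a)\, g(x,a)
  \quad \text{s.t.}\quad \sum_{a} \mu(y,a) = c(y) + \alpha \sum_{x,a} P(y \mid x,a)\, \mu(x,a) \quad \forall y.
\end{aligned}
```

A composite ALP, as the summary describes it, restricts both the value function J and the occupation measure μ to low-dimensional parameterizations at once.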
Journal Article · DOI

Computing monotone policies for Markov decision processes: a nearly-isotonic penalty approach

TL;DR: A two-stage alternating convex optimization scheme is proposed that can accelerate the search for an optimal policy by exploiting the monotone property, and it is shown that the alternating direction method of multipliers (ADMM) can be significantly accelerated using the regularization step.
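For reference, the nearly-isotonic regularizer the title alludes to — a sketch of the generic penalty; the paper's exact objective may differ — charges only for violations of monotonicity in an ordered parameter vector θ:

```latex
\min_{\theta}\ f(\theta) \;+\; \lambda \sum_{i=1}^{n-1} \max\{\theta_i - \theta_{i+1},\, 0\}
```

As λ grows, the hinge terms force θ_1 ≤ θ_2 ≤ … ≤ θ_n, recovering a fully monotone (isotonic) solution.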
Posted Content

On the Synthesis of Bellman Inequalities for Data-Driven Optimal Control.

TL;DR: In this article, the authors show how a relatively small but sufficiently rich dataset can be exploited to generate new constraints offline, without observing the corresponding transitions, and how to reconstruct the associated unknown stage costs.
Journal Article · DOI

Adaptive polyhedral meshing for approximate dynamic programming in control

TL;DR: In this paper, a new criterion is proposed for adaptive meshing of polyhedral partitions that interpolate a value function in approximate dynamic programming (ADP) for optimal control problems. The criterion adds new points to a simplicial mesh based on a user-defined initial-condition probability density function, which determines ‘influential’ regions of the state space; on uncertainty (variance) propagation; and on temporal-difference error.
Posted Content

A Markov Decision Process Approach to Active Meta Learning.

TL;DR: This work proposes actively selecting the samples on which to train by discerning covariates within and between meta-training sets, and casts the problem of selecting a sample from a number of meta-training sets as either a multi-armed bandit or a Markov decision process (MDP), depending on how one encapsulates correlation across tasks.
References
Book

Reinforcement Learning: An Introduction

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, ranging from the history of the field's intellectual foundations to the most recent developments and applications.
Book

Neural networks for pattern recognition

TL;DR: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition, and is designed as a text, with over 100 exercises, to benefit anyone involved in the fields of neural computation and pattern recognition.
Book

Dynamic Programming and Optimal Control

TL;DR: The leading and most up-to-date textbook on the far-ranging algorithmic methodology of Dynamic Programming, which can be used for optimal control, Markovian decision problems, planning and sequential decision making under uncertainty, and discrete/combinatorial optimization.
Journal Article · DOI

Learning to Predict by the Methods of Temporal Differences

Richard S. Sutton · 01 Aug 1988
TL;DR: This article introduces a class of incremental learning procedures specialized for prediction – that is, for using past experience with an incompletely known system to predict its future behavior – proves their convergence and optimality for special cases, and relates them to supervised-learning methods.
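A minimal tabular TD(0) sketch of the prediction procedure this summary refers to; env and policy are hypothetical stand-ins (an environment with reset()/step() and a fixed policy), not anything defined in the article:

```python
import numpy as np

def td0(env, policy, n_states, episodes=500, step_size=0.1, gamma=0.9):
    """Tabular TD(0): estimate the value of a fixed policy from experience."""
    V = np.zeros(n_states)
    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            s_next, reward, done = env.step(policy(s))
            target = reward + (0.0 if done else gamma * V[s_next])
            V[s] += step_size * (target - V[s])   # temporal-difference update
            s = s_next
    return V
```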
Book

Neuro-dynamic programming

TL;DR: This is the first textbook that fully explains the neuro-dynamic programming/reinforcement learning methodology, which is a recent breakthrough in the practical application of neural networks and dynamic programming to complex problems of planning, optimal decision making, and intelligent control.