Dynamic Programming and Optimal Control

Open AccessBook

Dynamic Programming and Optimal Control

TLDR

The leading and most up-to-date textbook on the far-ranging algorithmic methododogy of Dynamic Programming, which can be used for optimal control, Markovian decision problems, planning and sequential decision making under uncertainty, and discrete/combinatorial optimization.

Abstract:

The leading and most up-to-date textbook on the far-ranging algorithmic methododogy of Dynamic Programming, which can be used for optimal control, Markovian decision problems, planning and sequential decision making under uncertainty, and discrete/combinatorial optimization. The treatment focuses on basic unifying themes, and conceptual foundations. It illustrates the versatility, power, and generality of the method with many examples and applications from engineering, operations research, and other fields. It also addresses extensively the practical application of the methodology, possibly through the use of approximations, and provides an extensive treatment of the far-reaching methodology of Neuro-Dynamic Programming/Reinforcement Learning.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep learning in neural networks

Jürgen Schmidhuber

- 01 Jan 2015 -

Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, review deep supervised learning, unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.

...read moreread less

Journal ArticleDOI

Machine learning

Thomas G. Dietterich

- 01 Dec 1996 -

ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Journal ArticleDOI

Reinforcement learning: a survey

Leslie Pack Kaelbling, +2 more

- 01 Jan 1996 -

Journal of Artificial Intelligence Resea...

TL;DR: Central issues of reinforcement learning are discussed, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state.

...read moreread less

BookDOI

Sequential Monte Carlo methods in practice

Arnaud Doucet, +3 more

TL;DR: This book presents the first comprehensive treatment of Monte Carlo techniques, including convergence results and applications to tracking, guidance, automated target recognition, aircraft navigation, robot navigation, econometrics, financial modeling, neural networks, optimal control, optimal filtering, communications, reinforcement learning, signal enhancement, model averaging and selection.

...read moreread less

Posted Content

Reinforcement Learning: A Survey

Leslie Pack Kaelbling, +2 more

- 01 May 1996 -

arXiv: Artificial Intelligence

TL;DR: A survey of reinforcement learning from a computer science perspective can be found in this article, where the authors discuss the central issues of RL, including trading off exploration and exploitation, establishing the foundations of RL via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Machine learning

Thomas G. Dietterich

- 01 Dec 1996 -

ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Journal ArticleDOI

Learning from delayed rewards

Ben Kröse

- 01 Oct 1995 -

Robotics and Autonomous Systems

TL;DR: The invention relates to a circuit for use in a receiver which can receive two-tone/stereo signals which is intended to make a choice between mono or stereo reproduction of signal A or of signal B and vice versa.

...read moreread less

Journal ArticleDOI

Simulation and the Monte Carlo Method.

Thomas M. F. O'Donovan, +1 more

- 01 Mar 1983 -

Biometrics

Book ChapterDOI

Residual algorithms: reinforcement learning with function approximation

Leemon C. Baird

TL;DR: Both direct and residual gradient algorithms are shown to be special cases of residual algorithms, and it is shown that residual algorithms can combine the advantages of each approach.

...read moreread less

Journal ArticleDOI

Convergence of Stochastic Iterative Dynamic Programming Algorithms

Tommi S. Jaakkola, +2 more

TL;DR: A rigorous proof of convergence of DP-based learning algorithms is provided by relating them to the powerful techniques of stochastic approximation theory via a new convergence theorem, which establishes a general class of convergent algorithms to which both TD() and Q-learning belong.

...read moreread less

Collapse

Journal of The Royal Statistical Society...

Dynamic Programming and Optimal Control

Citations

Deep learning in neural networks

Machine learning

Reinforcement learning: a survey

Sequential Monte Carlo methods in practice

Reinforcement Learning: A Survey

References

Machine learning

Learning from delayed rewards

Simulation and the Monte Carlo Method.

Residual algorithms: reinforcement learning with function approximation

Convergence of Stochastic Iterative Dynamic Programming Algorithms

Related Papers (5)

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Reinforcement Learning: An Introduction

Dynamic Programming

Convex Optimization

Markov Decision Processes