Reinforcement Learning: An Introduction

Open AccessBook

Reinforcement Learning: An Introduction

Chats0

TLDR

This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

Abstract:

Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability. The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.

Citations

PDF

Open Access

More filters

Book

Deep Learning

Ian Goodfellow, +2 more

TL;DR: Deep learning as mentioned in this paper is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts, and it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames.

...read moreread less

Journal ArticleDOI

Deep learning in neural networks

Jürgen Schmidhuber

- 01 Jan 2015 -

Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, review deep supervised learning, unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.

...read moreread less

Pattern Recognition and Machine Learning

Christopher M. Bishop

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book ChapterDOI

Strategy Learning with Multilayer Connectionist Representations

Charles W. Anderson

TL;DR: A two-layer connectionist system is presented that develops its search from a weak to a task-specific strategy and fine-tunes its performance, applied to a simulated, real-time, balance-control task.

...read moreread less

Journal ArticleDOI

A Survey of Some Results in Stochastic Adaptive Control

P. R. Kumar

- 01 May 1985 -

Siam Journal on Control and Optimization

TL;DR: In this article, a survey of adaptive control of Markov chains and non-Bayesian adaptive control is presented, where the problems of converting an incompletely observed system into a completely observed one are discussed.

...read moreread less

Proceedings ArticleDOI

Comparisons of channel assignment strategies in cellular mobile telephone systems

Ming Zhang, +1 more

TL;DR: The locally optimized dynamic assignment (LODA) strategy and the borrowing with directional channel locking (BDCL) strategy are proposed and computer simulations show that the average call-blocking probability of the BDCL strategy is always the lowest.

...read moreread less

Journal ArticleDOI

Learning control systems--Review and outlook

King-Sun Fu

- 01 Apr 1970 -

IEEE Transactions on Automatic Control

TL;DR: The basic concept of learning control is introduced, and the following five learning schemes are briefly reviewed: 1) trainable controllers using pattern classifiers, 2) reinforcement learning control systems, 3) Bayesian estimation, 4) stochastic approximation, and 5) Stochastic automata models.

...read moreread less

Journal Article

Learning by statistical cooperation of self-interested neuron-like computing elements.

Andrew G. Barto

- 01 Jan 1985 -

Human neurobiology

TL;DR: It is argued that some of the longstanding problems concerning adaptation and learning by networks might be solvable by this form of cooperativity, and computer simulation experiments are described that show how networks of self-interested components that are sufficiently robust can solve rather difficult learning problems.

...read moreread less