Reinforcement Learning: An Introduction

Open Access Book

TLDR

This book gives a clear and simple account of the key ideas and algorithms of reinforcement learning, ranging from the history of the field's intellectual foundations to the most recent developments and applications.

Abstract:

Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability. The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.
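To make the abstract's description concrete (an agent interacting with an environment to maximize the total reward it receives), the following is a minimal sketch of tabular Q-learning, one temporal-difference method. The corridor environment, the function name `q_learning_corridor`, and all parameter values are illustrative assumptions and are not taken from the book.

```python
import random


def q_learning_corridor(n_states=5, episodes=500, alpha=0.5, gamma=0.9, seed=0):
    """Tabular Q-learning on a hypothetical corridor environment.

    States are 0..n_states-1; actions are 0 (left) and 1 (right).
    Entering the rightmost state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    # q[state][action] holds the current estimate of discounted return.
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Explore uniformly at random; Q-learning is off-policy,
            # so it can still learn the values of the greedy policy.
            action = rng.randrange(2)
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # The TD target bootstraps from the best estimate at the next state
            # (the goal state's estimates stay at zero).
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q = q_learning_corridor()
# Greedy action in each non-terminal state (1 means "right").
greedy = [max((0, 1), key=lambda a: q[s][a]) for s in range(len(q) - 1)]
print(greedy)
```

In this deterministic toy setting the learned values for moving right approach powers of the discount factor, so the greedy policy heads toward the rewarding state.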

Citations

Book

Deep Learning

Ian Goodfellow, +2 more

TL;DR: Deep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts; it is used in applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and video games.

Journal Article

Deep learning in neural networks

Jürgen Schmidhuber

- 01 Jan 2015 -

Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, reviewing deep supervised learning, unsupervised learning, reinforcement learning and evolutionary computation, and indirect search for short programs encoding deep and large networks.

Pattern Recognition and Machine Learning

Christopher M. Bishop

TL;DR: Probability distributions and linear models for regression and classification are presented, along with a discussion of combining models in the context of machine learning.

References

Journal Article

A comparison and evaluation of three machine learning procedures as applied to the game of checkers

Arnold K. Griffith

- 01 Jun 1974 -

Artificial Intelligence

TL;DR: Two new machine learning procedures used to arrive at “knowledgeable” static evaluators for checker board positions are presented and are found to perform about equally well, despite the relative simplicity of the second.

Large-scale dynamic optimization using teams of reinforcement learning agents

Robert Crites, +1 more

TL;DR: This dissertation uses a team of RL agents, each of which is responsible for controlling one elevator car, to demonstrate the power of RL on a very large scale stochastic dynamic optimization problem of practical utility.

Journal Article

STELLA: A scheme for a learning machine

J.H. Andreae

- 01 Jun 1963 -

IFAC Proceedings Volumes

TL;DR: A scheme is described for a learning machine, being constructed in the form of a mechanical tortoise and named after its laboratory of origin, in which the machine explores the possibilities of its future actions with a view to modifying its performance.

Book Chapter

Learning a cost-sensitive internal representation for reinforcement learning

Ming Tan

TL;DR: The approach learns a task-dependent internal representation and a decision policy simultaneously in a finite, deterministic environment, maximizing the long-term discounted reward per action while reducing the average sensing cost per state.

Journal Article

A learning machine with monologue

J.H. Andreae, +1 more

- 01 Jan 1969 -

International Journal of Human-computer ...

TL;DR: An introductory description of the STeLLA machine is given with the help of a particular problem which is then used to illustrate the generation of control policies by a dual machine.
