Open Access Book
Reinforcement Learning: An Introduction
TLDR
This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, ranging from the history of the field's intellectual foundations to the most recent developments and applications.
Abstract
Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability. The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.
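The temporal-difference learning mentioned among the book's Part II solution methods can be illustrated with a minimal sketch of the tabular TD(0) value update. The five-state random-walk task below is a hypothetical example chosen for brevity; all names and parameters are illustrative, not taken from the book's text.

```python
import random

def td0_random_walk(episodes=1000, alpha=0.1, gamma=1.0, seed=0):
    """Estimate state values with tabular TD(0) on a toy random walk.

    States 0..4; 0 and 4 are terminal, reward 1 only on reaching state 4.
    """
    rng = random.Random(seed)
    V = [0.0] * 5  # value estimate per state; terminals stay 0
    for _ in range(episodes):
        s = 2  # every episode starts in the middle state
        while s not in (0, 4):
            s_next = s + rng.choice((-1, 1))
            r = 1.0 if s_next == 4 else 0.0
            # TD(0) update: V(s) <- V(s) + alpha * [r + gamma * V(s') - V(s)]
            V[s] += alpha * (r + gamma * V[s_next] - V[s])
            s = s_next
    return V

values = td0_random_walk()
```

Under these assumed dynamics the true values of states 1, 2, 3 are 0.25, 0.5, and 0.75 (the probability of terminating on the right), so the learned estimates should be monotonically increasing and centered near 0.5.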
Citations
Book
Deep Learning
TL;DR: Deep learning, as presented in this book, is a form of machine learning that enables computers to learn from experience and to understand the world in terms of a hierarchy of concepts; it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and video games.
Journal ArticleDOI
Human-level control through deep reinforcement learning
Volodymyr Mnih,Koray Kavukcuoglu,David Silver,Andrei Rusu,Joel Veness,Marc G. Bellemare,Alex Graves,Martin Riedmiller,Andreas K. Fidjeland,Georg Ostrovski,Stig Petersen,Charles Beattie,Amir Sadik,Ioannis Antonoglou,Helen King,Dharshan Kumaran,Daan Wierstra,Shane Legg,Demis Hassabis +18 more
TL;DR: This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
Journal ArticleDOI
Deep learning in neural networks
TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium; it reviews deep supervised learning, unsupervised learning, reinforcement learning, and evolutionary computation, as well as indirect search for short programs encoding deep and large networks.
Journal ArticleDOI
Mastering the game of Go with deep neural networks and tree search
David Silver,Aja Huang,Chris J. Maddison,Arthur Guez,Laurent Sifre,George van den Driessche,Julian Schrittwieser,Ioannis Antonoglou,Veda Panneershelvam,Marc Lanctot,Sander Dieleman,Dominik Grewe,John Nham,Nal Kalchbrenner,Ilya Sutskever,Timothy P. Lillicrap,Madeleine Leach,Koray Kavukcuoglu,Thore Graepel,Demis Hassabis +19 more
TL;DR: Using this search algorithm, the program AlphaGo achieved a 99.8% winning rate against other Go programs and defeated the human European Go champion by 5 games to 0, the first time a computer program has defeated a human professional player in the full-sized game of Go.
Book
Pattern Recognition and Machine Learning
TL;DR: Probability distributions and linear models for regression and classification are presented, along with a discussion of combining models, in the context of machine learning and pattern recognition.
References
Journal ArticleDOI
Shaping robot behavior using principles from instrumental conditioning
TL;DR: A computational model of the shaping process from instrumental conditioning, together with its implementation on a mobile robot, is described; it allows an RWI B21 robot to learn several distinct tasks derived from the same innate behavior.
Proceedings Article
Temporal Difference Learning in Continuous Time and Space
TL;DR: A continuous-time, continuous-state version of the temporal difference algorithm is derived in order to facilitate the application of reinforcement learning to real-world control tasks and neurobiological modeling.
Journal ArticleDOI
Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: response topography, neuronal firing, and interstimulus intervals.
John W. Moore,John E. Desmond,Neil E. Berthier,Diana E.J. Blazis,Richard S. Sutton,Andrew G. Barto +5 more
TL;DR: The model successfully simulates the aforementioned features of NM response topography and is capable of simulating appropriate ISI functions, i.e. with maximum conditioning strength with ISIs of 250 ms, for forward-delay and trace conditioning paradigms.
Journal ArticleDOI
A dynamic channel assignment policy through Q-learning
Junhong Nie,Simon Haykin +1 more
TL;DR: A novel approach to solving the dynamic channel assignment (DCA) problem by using a form of real-time reinforcement learning known as Q-learning in conjunction with a neural network representation, capable of achieving performance similar to that of the MAXAVAIL heuristic but with significantly reduced computational complexity.
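The tabular Q-learning rule that the DCA work builds on can be sketched in a few lines. The two-state MDP below is made up purely for demonstration (it is not the channel-assignment setting), and all names and parameters are illustrative.

```python
import random

def q_learning(steps=5000, alpha=0.1, gamma=0.9, eps=0.1, seed=1):
    """Tabular Q-learning with epsilon-greedy exploration on a toy MDP.

    Two states and two actions; taking action 1 in state 0 yields reward 1,
    and the chosen action deterministically becomes the next state.
    """
    rng = random.Random(seed)
    Q = [[0.0, 0.0], [0.0, 0.0]]  # Q[state][action]
    s = 0
    for _ in range(steps):
        # epsilon-greedy action selection
        if rng.random() < eps:
            a = rng.randrange(2)
        else:
            a = max((0, 1), key=lambda x: Q[s][x])
        r = 1.0 if (s == 0 and a == 1) else 0.0
        s_next = a  # toy deterministic dynamics
        # Q-learning: Q(s,a) <- Q(s,a) + alpha * [r + gamma * max_a' Q(s',a') - Q(s,a)]
        Q[s][a] += alpha * (r + gamma * max(Q[s_next]) - Q[s][a])
        s = s_next
    return Q

Q = q_learning()
```

In this toy environment the optimal policy alternates between the two states to collect the single reward, so the learned table should prefer action 1 in state 0 and action 0 in state 1.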
Proceedings ArticleDOI
Minimum-time control of the Acrobot
TL;DR: A direct search algorithm for finding swingup trajectories for the Acrobot is described, which uses a lookahead search that maximizes theAcrobot's total energy in an N-step window.