Reinforcement Learning: An Introduction

Open AccessBook

Reinforcement Learning: An Introduction

Chats0

TLDR

This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

Abstract:

Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability. The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.

Citations

PDF

Open Access

More filters

Book

Deep Learning

Ian Goodfellow, +2 more

TL;DR: Deep learning as mentioned in this paper is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts, and it is used in many applications such as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames.

...read moreread less

Journal ArticleDOI

Deep learning in neural networks

Jürgen Schmidhuber

- 01 Jan 2015 -

Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, review deep supervised learning, unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.

...read moreread less

Pattern Recognition and Machine Learning

Christopher M. Bishop

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Synaptic tagging and long-term potentiation

Uwe Frey, +1 more

- 06 Feb 1997 -

Nature

TL;DR: It is shown that weak tetanic stimulation, which ordinarily leads only to early LTP, or repeated tetanization in the presence of protein-Synthesis inhibitors, each results in protein-synthesis-dependent late LTP; this indicates that the persistence of LTP depends not only on local events during its induction, but also on the prior activity of the neuron.

...read moreread less

Predictability, surprise, attention, and conditioning

Leon J. Kamin

TL;DR: The role of attention in Pavlovian conditioning, and use of auditory and visual stimuli to condition rats is discussed in this article, where the authors discuss the use of both visual and auditory stimuli.

...read moreread less

Book

Learning Automata: An Introduction

Kumpati S. Narendra, +1 more

TL;DR: From the combination of knowledge and actions, someone can improve their skill and ability and this learning automata an introduction tells you that any book will give certain knowledge to take all benefits.

...read moreread less

Journal ArticleDOI

Neurons in the orbitofrontal cortex encode economic value

Camillo Padoa-Schioppa, +1 more

- 11 May 2006 -

Nature

TL;DR: Neurons in the orbitofrontal cortex (OFC) encode the value of offered and chosen goods during economic choice, suggesting that economic choice is essentially choice between goods rather than choice between actions.

...read moreread less

Journal ArticleDOI

Monte Carlo Strategies in Scientific Computing

Tim Hesterberg

- 01 Nov 2002 -

Technometrics

TL;DR: The strength of this book is in bringing together advanced Monte Carlo methods developed in many disciplines, including the Ising model, molecular structure simulation, bioinformatics, target tracking, hypothesis testing for astronomical observations, Bayesian inference of multilevel models, missing-data problems.

...read moreread less