Open Access · Proceedings Article

Prioritized Experience Replay

TL;DR
Prioritized experience replay is a framework for replaying important transitions more frequently, so that the agent learns more efficiently; applied to DQN, an algorithm that reached human-level performance across many Atari games, it yields a new state of the art.
Abstract
Experience replay lets online reinforcement learning agents remember and reuse experiences from the past. In prior work, experience transitions were uniformly sampled from a replay memory. However, this approach simply replays transitions at the same frequency that they were originally experienced, regardless of their significance. In this paper we develop a framework for prioritizing experience, so as to replay important transitions more frequently, and therefore learn more efficiently. We use prioritized experience replay in Deep Q-Networks (DQN), a reinforcement learning algorithm that achieved human-level performance across many Atari games. DQN with prioritized experience replay achieves a new state-of-the-art, outperforming DQN with uniform replay on 41 out of 49 games.
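
The mechanism the abstract sketches, sampling transitions in proportion to a priority derived from their TD error and correcting the resulting bias with importance-sampling weights, is compact enough to show directly. Below is a minimal, illustrative Python sketch of the proportional variant; the class name and the flat O(N) sampling are assumptions for brevity (the paper uses a sum-tree for O(log N) sampling), and the alpha/beta defaults are only indicative.

```python
import numpy as np

class PrioritizedReplayBuffer:
    """Minimal sketch of proportional prioritization.

    Stores p_i^alpha with p_i = |td_error| + eps and samples index i with
    probability proportional to the stored value, i.e.
    P(i) = p_i^alpha / sum_k p_k^alpha. Importance-sampling weights
    w_i = (N * P(i))^(-beta), normalized by max_i w_i, correct the bias
    introduced by non-uniform sampling.
    """

    def __init__(self, capacity, alpha=0.6, beta=0.4, eps=1e-6):
        self.capacity, self.alpha, self.beta, self.eps = capacity, alpha, beta, eps
        self.data = []
        self.priorities = np.zeros(capacity)
        self.pos = 0

    def add(self, transition):
        # New transitions get the current max priority so that each one
        # is replayed at least once before its TD error is known.
        max_p = self.priorities.max() if self.data else 1.0
        if len(self.data) < self.capacity:
            self.data.append(transition)
        else:
            self.data[self.pos] = transition
        self.priorities[self.pos] = max_p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size):
        p = self.priorities[: len(self.data)]
        probs = p / p.sum()
        idx = np.random.choice(len(self.data), batch_size, p=probs)
        weights = (len(self.data) * probs[idx]) ** (-self.beta)
        weights /= weights.max()  # normalize so weights only scale the loss down
        return idx, [self.data[i] for i in idx], weights

    def update_priorities(self, idx, td_errors):
        self.priorities[idx] = (np.abs(td_errors) + self.eps) ** self.alpha
```

In a DQN training loop one would sample a batch, scale each sample's TD loss by its weight, and feed the new TD errors back via update_priorities; the paper also anneals beta toward 1 over the course of training.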


Citations
Proceedings Article

Asynchronous methods for deep reinforcement learning

TL;DR: A conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent to optimize deep neural network controllers; an asynchronous actor-critic variant succeeds on a wide variety of continuous motor control problems as well as on a new task of navigating random 3D mazes from visual input.
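
As a rough illustration of the asynchronous idea only, not of the full actor-critic algorithm, here is a toy Python sketch in which several worker threads apply lock-free gradient updates to shared parameters; the quadratic objective, worker and step counts, and all names are invented for the example.

```python
import threading
import numpy as np

# Toy, hypothetical sketch: each worker computes gradients on its own
# data stream and applies them, without synchronization, to a shared
# parameter vector. Python's GIL serializes these toy updates, so the
# point is the shared-parameter pattern, not speed.

shared_theta = np.zeros(4)  # parameters shared by all workers
true_target = np.ones(4)    # optimum of the toy loss ||theta - target||^2
lr = 0.01

def worker(theta, steps):
    rng = np.random.default_rng()
    for _ in range(steps):
        noisy = true_target + rng.normal(scale=0.1, size=4)  # worker-local sample
        grad = 2.0 * (theta - noisy)  # gradient of the toy quadratic loss
        theta -= lr * grad            # in-place update of the shared array

threads = [threading.Thread(target=worker, args=(shared_theta, 2000)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(shared_theta)  # ends up near [1, 1, 1, 1] despite unsynchronized updates
```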
Journal Article

Building machines that learn and think like people.

TL;DR: A review of recent progress in cognitive science suggests that truly human-like learning and thinking machines will have to reach beyond current engineering trends in both what they learn and how they learn it.
Posted Content

Dueling Network Architectures for Deep Reinforcement Learning

TL;DR: This paper presents a new neural network architecture for model-free reinforcement learning that leads to better policy evaluation in the presence of many similar-valued actions and enables the RL agent to outperform the state-of-the-art on the Atari 2600 domain.
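
The dueling decomposition this summary refers to is itself only a few lines: the network splits into a state-value stream V(s) and an advantage stream A(s, a), recombined as Q(s, a) = V(s) + A(s, a) - mean_a' A(s, a') so the two streams are identifiable. A minimal PyTorch sketch, with invented class name and layer sizes:

```python
import torch
import torch.nn as nn

class DuelingQNet(nn.Module):
    """Illustrative dueling head: Q(s,a) = V(s) + A(s,a) - mean_a' A(s,a').

    Subtracting the mean advantage makes the V and A streams identifiable;
    the class name and layer sizes here are invented for the sketch.
    """

    def __init__(self, obs_dim, n_actions, hidden=128):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        self.value = nn.Linear(hidden, 1)              # state-value stream V(s)
        self.advantage = nn.Linear(hidden, n_actions)  # advantage stream A(s,a)

    def forward(self, obs):
        h = self.trunk(obs)
        v, a = self.value(h), self.advantage(h)
        return v + a - a.mean(dim=-1, keepdim=True)    # combined Q-values
```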
Posted Content

Addressing Function Approximation Error in Actor-Critic Methods

TL;DR: This paper builds on Double Q-learning by taking the minimum value between a pair of critics to limit overestimation, and draws a connection between target networks and overestimation bias.
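
The clipped double-Q target described above can be stated in a few lines. Below is an illustrative PyTorch sketch; the function name and signature are assumptions, but the min-over-two-critics bootstrap matches the idea the summary names.

```python
import torch

def clipped_double_q_target(reward, done, next_q1, next_q2, gamma=0.99):
    """Illustrative TD target using the minimum of two target critics.

    Taking min(Q1', Q2') at the target action yields a pessimistic value
    estimate, limiting the overestimation bias that a single max-based
    bootstrap target tends to accumulate.
    """
    min_q = torch.min(next_q1, next_q2)            # pessimistic of the pair
    return reward + gamma * (1.0 - done) * min_q   # standard bootstrap target
```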
References
Journal Article

Dopaminergic neurons promote hippocampal reactivation and spatial memory persistence.

TL;DR: Findings reveal that midbrain dopaminergic neurons promote hippocampal network dynamics associated with memory persistence, improving later recall of neural representations of space and stabilizing memory performance.
Book Chapter

To recognize shapes, first learn to generate images.

TL;DR: This chapter describes several of the proposed algorithms and shows how they can be combined to produce hybrid methods that work efficiently in networks with many layers and millions of adaptive connections.
Proceedings Article

Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning

TL;DR: The central idea is to use slow planning-based agents to provide training data for a deep-learning architecture capable of real-time play; new agents built on this idea are proposed and shown to outperform DQN.
Journal Article

Rewarded Outcomes Enhance Reactivation of Experience in the Hippocampus

TL;DR: It is shown that rat hippocampal CA3 principal cells are significantly more active during sharp-wave ripples (SWRs) following receipt of reward, and that this enhanced reactivation could be a mechanism to bind rewarding outcomes to the experiences that precede them.
Journal Article

Hippocampal place cells construct reward related sequences through unexplored space

TL;DR: It is reported that viewing the delivery of food to an unvisited portion of an environment leads to off-line pre-activation of place-cell sequences corresponding to that space, suggesting that goal-biased preplay may support preparation for future experiences in novel environments.