Dueling network architectures for deep reinforcement learning

Open AccessProceedings Article

Dueling network architectures for deep reinforcement learning

- pp 1995-2003

TLDR

In this paper, a dueling network is proposed to represent two separate estimators for the state value function and the state-dependent advantage function, which leads to better policy evaluation in the presence of many similar-valued actions.

Abstract:

In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders. In this paper, we present a new neural network architecture for model-free reinforcement learning. Our dueling network represents two separate estimators: one for the state value function and one for the state-dependent action advantage function. The main benefit of this factoring is to generalize learning across actions without imposing any change to the underlying reinforcement learning algorithm. Our results show that this architecture leads to better policy evaluation in the presence of many similar-valued actions. Moreover, the dueling architecture enables our RL agent to outperform the state-of-the-art on the Atari 2600 domain.

Citations

PDF

Open Access

More filters

Proceedings Article

Asynchronous methods for deep reinforcement learning

Volodymyr Mnih, +7 more

TL;DR: A conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers and shows that asynchronous actor-critic succeeds on a wide variety of continuous motor control problems as well as on a new task of navigating random 3D mazes using a visual input.

...read moreread less

Journal ArticleDOI

Deep Reinforcement Learning: A Brief Survey

Kai Arulkumaran, +3 more

- 09 Nov 2017 -

IEEE Signal Processing Magazine

TL;DR: Deep reinforcement learning (DRL) is poised to revolutionize the field of artificial intelligence (AI) and represents a step toward building autonomous systems with a higher-level understanding of the visual world as discussed by the authors.

...read moreread less

Journal ArticleDOI

The mythos of model interpretability

Zachary C. Lipton

- 26 Sep 2018 -

Communications of The ACM

TL;DR: In machine learning, the concept of interpretability is both important and slippery, so it is important to understand how these concepts can be modified.

...read moreread less

Journal ArticleDOI

The Mythos of Model Interpretability: In machine learning, the concept of interpretability is both important and slippery.

Zachary C. Lipton

- 01 Jun 2018 -

ACM Queue

TL;DR: In this article, the authors ask whether or not a supervised machine learning model will work in deployment, and what else can it tell you about the world, besides its predictive capabilities.

...read moreread less

Journal ArticleDOI

Applications of Deep Reinforcement Learning in Communications and Networking: A Survey

Nguyen Cong Luong, +6 more

- 14 May 2019 -

IEEE Communications Surveys and Tutorial...

TL;DR: This paper presents a comprehensive literature review on applications of deep reinforcement learning (DRL) in communications and networking, and presents applications of DRL for traffic routing, resource sharing, and data collection.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Deep learning

Yann LeCun, +4 more

- 28 May 2015 -

Nature

TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.

...read moreread less

Book

Introduction to Reinforcement Learning

Richard S. Sutton, +1 more

TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.

...read moreread less

Proceedings Article

Policy Gradient Methods for Reinforcement Learning with Function Approximation

Richard S. Sutton, +3 more

TL;DR: This paper proves for the first time that a version of policy iteration with arbitrary differentiable function approximation is convergent to a locally optimal policy.

...read moreread less

arXiv: Learning

Dueling network architectures for deep reinforcement learning

Citations

Asynchronous methods for deep reinforcement learning

Deep Reinforcement Learning: A Brief Survey

The mythos of model interpretability

The Mythos of Model Interpretability: In machine learning, the concept of interpretability is both important and slippery.

Applications of Deep Reinforcement Learning in Communications and Networking: A Survey

References

Deep learning

Human-level control through deep reinforcement learning

Mastering the game of Go with deep neural networks and tree search

Introduction to Reinforcement Learning

Policy Gradient Methods for Reinforcement Learning with Function Approximation

Related Papers (5)

Human-level control through deep reinforcement learning

Reinforcement Learning: An Introduction

Asynchronous methods for deep reinforcement learning

Playing Atari with Deep Reinforcement Learning

Mastering the game of Go with deep neural networks and tree search