Marc Lanctot

Journal Article

Mastering the game of Go with deep neural networks and tree search

- 28 Jan 2016 -

TL;DR: Using this search algorithm, the program AlphaGo achieved a 99.8% winning rate against other Go programs and defeated the human European Go champion by 5 games to 0, the first time that a computer program has defeated a human professional player in the full-sized game of Go.

Journal Article

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.

David Silver, +12 more

- 07 Dec 2018 -

Science

TL;DR: This paper generalizes the AlphaGo Zero approach into a single AlphaZero algorithm that can achieve superhuman performance in many challenging games; it convincingly defeated a world champion program in each of chess, shogi (Japanese chess), and Go.

Posted Content

Dueling Network Architectures for Deep Reinforcement Learning

Ziyu Wang, +5 more

- 20 Nov 2015 -

arXiv: Learning

TL;DR: This paper presents a new neural network architecture for model-free reinforcement learning that leads to better policy evaluation in the presence of many similar-valued actions and enables the RL agent to outperform the state-of-the-art on the Atari 2600 domain.

Proceedings Article

Dueling network architectures for deep reinforcement learning

Ziyu Wang, +5 more

TL;DR: This paper proposes a dueling network that represents two separate estimators, one for the state value function and one for the state-dependent action advantage function, which leads to better policy evaluation in the presence of many similar-valued actions.
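
As a rough illustration of how two such estimators can be recombined into action values, here is a minimal sketch in plain Python. It assumes the mean-subtracted aggregation Q(s, a) = V(s) + (A(s, a) - mean A(s, ·)), which the summary above does not spell out; the function name `dueling_q_values` and the example numbers are hypothetical, and no neural network is involved.

```python
from typing import List


def dueling_q_values(state_value: float, advantages: List[float]) -> List[float]:
    """Combine a state value V(s) and per-action advantages A(s, a) into Q(s, a).

    Assumption: advantages are centred by subtracting their mean, so the
    average of the returned Q-values equals the state value.
    """
    if not advantages:
        raise ValueError("at least one action advantage is required")
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + (a - mean_advantage) for a in advantages]


# Hypothetical outputs of the two estimator streams for one state, three actions.
q_values = dueling_q_values(2.0, [0.5, -0.5, 0.0])
print(q_values)
```

Centring the advantages is one way to keep the value and advantage terms identifiable: without it, a constant could be shifted between the two without changing the resulting Q-values.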

Posted Content

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

David Silver, +12 more

- 05 Dec 2017 -

arXiv: Artificial Intelligence

TL;DR: This paper generalises the AlphaGo Zero approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains; it convincingly defeated a world-champion program in each case.
