Laurent Sifre

Journal ArticleDOI

Mastering the game of Go with deep neural networks and tree search

- 28 Jan 2016 -

TL;DR: Using this search algorithm, the program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0.5, the first time that a computer program has defeated a human professional player in the full-sized game of Go.

...read moreread less

Journal ArticleDOI

Mastering the game of Go without human knowledge

David Silver, +16 more

- 19 Oct 2017 -

Nature

TL;DR: An algorithm based solely on reinforcement learning is introduced, without human data, guidance or domain knowledge beyond game rules, that achieves superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.

...read moreread less

Journal ArticleDOI

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.

David Silver, +12 more

- 07 Dec 2018 -

Science

TL;DR: This paper generalizes the AlphaZero approach into a single AlphaZero algorithm that can achieve superhuman performance in many challenging games, and convincingly defeated a world champion program in the games of chess and shogi (Japanese chess), as well as Go.

...read moreread less

Journal ArticleDOI

Grandmaster level in StarCraft II using multi-agent reinforcement learning.

Oriol Vinyals, +41 more

- 30 Oct 2019 -

Nature

TL;DR: The agent, AlphaStar, is evaluated, which uses a multi-agent reinforcement learning algorithm and has reached Grandmaster level, ranking among the top 0.2% of human players for the real-time strategy game StarCraft II.

...read moreread less

Journal ArticleDOI

Improved protein structure prediction using potentials from deep learning

Andrew W. Senior, +19 more

- 15 Jan 2020 -

Nature

TL;DR: It is shown that a neural network can be trained to make accurate predictions of the distances between pairs of residues, which convey more information about the structure than contact predictions, and the resulting potential can be optimized by a simple gradient descent algorithm to generate structures without complex sampling procedures.

...read moreread less

Papers

Mastering the game of Go with deep neural networks and tree search

Mastering the game of Go without human knowledge

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.

Grandmaster level in StarCraft II using multi-agent reinforcement learning.

Improved protein structure prediction using potentials from deep learning