Giraffe: Using Deep Reinforcement Learning to Play Chess

Open Access, Posted Content

- 04 Sep 2015 -

TL;DR

Giraffe is the most successful attempt thus far at using end-to-end machine learning to play chess, with minimal hand-crafted knowledge given by the programmer.

Abstract:

This report presents Giraffe, a chess engine that uses self-play to discover all its domain-specific knowledge, with minimal hand-crafted knowledge given by the programmer. Unlike previous attempts, which used machine learning only to tune the parameters of hand-crafted evaluation functions, Giraffe's learning system also performs automatic feature extraction and pattern recognition. The trained evaluation function performs comparably to the evaluation functions of state-of-the-art chess engines, all of which contain thousands of lines of carefully hand-crafted pattern recognizers, tuned over many years by both computer chess experts and human chess masters. Giraffe is the most successful attempt thus far at using end-to-end machine learning to play chess.

Citations

Journal Article

Mastering the game of Go without human knowledge

David Silver, +16 more

- 19 Oct 2017 -

Nature

TL;DR: An algorithm based solely on reinforcement learning is introduced, without human data, guidance or domain knowledge beyond game rules, that achieves superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.

Journal Article

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play.

David Silver, +12 more

- 07 Dec 2018 -

Science

TL;DR: This paper generalizes the AlphaGo Zero approach into a single AlphaZero algorithm that achieves superhuman performance in many challenging games and convincingly defeated a world-champion program in chess, shogi (Japanese chess), and Go.

Book

Neural Networks and Deep Learning

Charu C. Aggarwal

Posted Content

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

David Silver, +12 more

- 05 Dec 2017 -

arXiv: Artificial Intelligence

TL;DR: This paper generalises the AlphaGo Zero approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains, and that convincingly defeated a world-champion program in each case.

Proceedings Article

Deep Models Under the GAN: Information Leakage from Collaborative Deep Learning

Briland Hitaj, +2 more

TL;DR: The authors show that any privacy-preserving collaborative deep learning model is susceptible to a powerful attack. By exploiting the real-time nature of the learning process, an adversary can train a Generative Adversarial Network (GAN) that generates prototypical samples of the targeted training set, which was meant to be private. The generated samples are intended to come from the same distribution as the training data.

References

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The authors train a large, deep convolutional neural network that achieved state-of-the-art performance on ImageNet classification. The network consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

Posted Content

Playing Atari with Deep Reinforcement Learning

Volodymyr Mnih, +6 more

- 19 Dec 2013 -

arXiv: Learning

TL;DR: This work presents the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. Applied to seven Atari 2600 games, it outperforms all previous approaches on six of them and surpasses a human expert on three.

Proceedings Article

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization.

John C. Duchi, +2 more

TL;DR: Adaptive subgradient methods dynamically incorporate knowledge of the geometry of the data observed in earlier iterations to perform more informative gradient-based learning, which makes it possible to find needles in haystacks in the form of very predictive but rarely seen features.

Journal Article

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization

John C. Duchi, +2 more

- 01 Feb 2011 -

Journal of Machine Learning Research

TL;DR: This work describes and analyzes an apparatus for adaptively modifying the proximal function, which significantly simplifies setting a learning rate and results in regret guarantees that are provably as good as those of the best proximal function that can be chosen in hindsight.

Posted Content

ADADELTA: An Adaptive Learning Rate Method

Matthew D. Zeiler

- 22 Dec 2012 -

arXiv: Learning

TL;DR: This work presents ADADELTA, a novel per-dimension learning rate method for gradient descent that adapts dynamically over time using only first-order information and has minimal computational overhead beyond vanilla stochastic gradient descent.

Related Papers (5)

Playing Atari with Deep Reinforcement Learning

Volodymyr Mnih, +6 more

- 19 Dec 2013 -

arXiv: Learning

Asynchronous methods for deep reinforcement learning

Volodymyr Mnih, +7 more

Mastering the game of Go with deep neural networks and tree search

Human-level control through deep reinforcement learning

Mastering the game of Go without human knowledge
