Dustin Morrill
Researcher at University of Alberta
Publications - 21
Citations - 1293
Dustin Morrill is an academic researcher from the University of Alberta. The author has contributed to research in topics including reinforcement learning and regret minimization. The author has an h-index of 9 and has co-authored 19 publications receiving 933 citations.
Papers
Journal ArticleDOI
DeepStack: Expert-level artificial intelligence in heads-up no-limit poker
Matej Moravčík, Martin Schmid, Neil Burch, Viliam Lisý, Dustin Morrill, Nolan Bard, Trevor Davis, Kevin Waugh, Michael Johanson, Michael Bowling +12 more
TL;DR: DeepStack is introduced, an algorithm for imperfect-information settings that combines recursive reasoning to handle information asymmetry, decomposition to focus computation on the relevant decision, and a form of intuition that is automatically learned from self-play using deep learning.
Journal ArticleDOI
DeepStack: Expert-Level Artificial Intelligence in No-Limit Poker
Matej Moravcik,Martin Schmid,Neil Burch,Viliam Lisý,Dustin Morrill,Nolan Bard,Trevor Davis,Kevin Waugh,Michael Johanson,Michael Bowling +9 more
TL;DR: DeepStack, as discussed by the authors, combines recursive reasoning to handle information asymmetry, decomposition to focus computation on the relevant decision, and a form of intuition that is automatically learned from self-play using deep learning.
Posted Content
OpenSpiel: A Framework for Reinforcement Learning in Games.
Marc Lanctot, Edward Lockhart, Jean-Baptiste Lespiau, Vinicius Zambaldi, Satyaki Upadhyay, Julien Perolat, Sriram Srinivasan, Finbarr Timbers, Karl Tuyls, Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Paul Muller, Timo Ewalds, Ryan Faulkner, János Kramár, Bart De Vylder, Brennan Saeta, James Bradbury, David Ding, Sebastian Borgeaud, Matthew Lai, Julian Schrittwieser, Thomas Anthony, Edward Hughes, Ivo Danihelka, Jonah Ryan-Davis +26 more
TL;DR: This document serves both as an overview of the code base and an introduction to the terminology, core concepts, and algorithms across the fields of reinforcement learning, computational game theory, and search.
Proceedings Article
Solving games with functional regret estimation
TL;DR: This paper proposes an online learning method for minimizing regret in large extensive-form games: it learns a function approximator online to estimate the regret of choosing each action, and uses these estimates in place of the true regrets to define a sequence of policies.
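The regret values that such an approximator estimates are typically turned into a policy via regret matching, where each action is played in proportion to its positive regret. A minimal sketch of that rule in plain Python (the function name and the tiny example are our illustration, not code from the paper):

```python
def regret_matching_policy(regrets):
    """Map per-action regrets (true or estimated) to a policy:
    play each action in proportion to its positive regret."""
    positives = [max(r, 0.0) for r in regrets]
    total = sum(positives)
    if total == 0.0:
        # No action has positive regret: fall back to uniform play.
        n = len(regrets)
        return [1.0 / n] * n
    return [p / total for p in positives]

# Example: estimated regrets for three actions; the negative-regret
# action receives zero probability.
policy = regret_matching_policy([2.0, -1.0, 1.0])
```

When the true regrets are replaced by a learned estimator, as in the paper, the same rule applies unchanged; only the source of the `regrets` input differs.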
Proceedings ArticleDOI
Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent
Edward Lockhart,Marc Lanctot,Julien Perolat,Jean-Baptiste Lespiau,Dustin Morrill,Finbarr Timbers,Karl Tuyls +6 more
TL;DR: This paper presents exploitability descent, a new algorithm that computes approximate equilibria in two-player zero-sum extensive-form games with imperfect information by direct policy optimization against worst-case opponents; it is proved that, under this optimization, the exploitability of a player's strategy converges asymptotically to zero.
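To illustrate the quantity being driven to zero, here is a minimal sketch of exploitability in a toy matrix game (rock-paper-scissors), measured as the value a best-responding opponent achieves against a fixed strategy. This is our own simplified, one-sided illustration, not the paper's algorithm or its extensive-form setting:

```python
# Rock-paper-scissors payoffs for player 1 (rows) against player 2 (columns).
PAYOFF = [
    [0, -1, 1],   # rock vs rock, paper, scissors
    [1, 0, -1],   # paper
    [-1, 1, 0],   # scissors
]

def exploitability(strategy):
    """Value a best-responding opponent gains against `strategy` in this
    zero-sum game; it is 0 exactly when `strategy` is an equilibrium."""
    # Opponent's expected payoff for each pure response (zero-sum, so the
    # opponent's payoff is the negation of player 1's).
    response_values = [
        sum(strategy[a] * -PAYOFF[a][b] for a in range(3)) for b in range(3)
    ]
    return max(response_values)

# The uniform strategy is the equilibrium: exploitability 0.
# A pure "rock" strategy is fully exploited by paper: exploitability 1.
```

Exploitability descent, roughly, updates the policy in the direction that decreases this best-response value; the paper proves that in the two-player zero-sum extensive-form setting this drives exploitability to zero asymptotically.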