Journal Article

A Problem Case for UCT

Abstract
This paper examines a simple 5 × 5 Hex position that not only completely defeats flat Monte Carlo search, but also initially defeats plain upper confidence bounds for trees (UCT) search until an excessive number of iterations are performed. The inclusion of domain knowledge during playouts significantly improves UCT performance, but a slight negative effect is shown for the rapid action value estimate (RAVE) heuristic under some circumstances. This example was drawn from an actual game during standard play, and highlights the dangers of relying on flat Monte Carlo and unenhanced UCT search even for rough estimates. A brief comparison is made with RAVE failure in Go.
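As a rough illustration of the two baselines the abstract contrasts, the sketch below shows flat Monte Carlo move evaluation next to the UCB1-style in-tree selection used by UCT. It is a minimal Python sketch under assumed interfaces: the `legal_moves`, `play`, and `random_playout` helpers, the node fields, and the parameter values are hypothetical stand-ins, not the paper's implementation.

```python
import math
import random

# Minimal sketch: flat Monte Carlo scores root moves purely by the mean
# outcome of uniform random playouts, while UCT adds an exploration bonus
# and grows a tree. The state/node interfaces here are assumed.

def flat_monte_carlo(state, iterations=10000):
    """Pick the root move with the best mean random-playout result."""
    moves = state.legal_moves()
    score = {m: 0.0 for m in moves}
    visits = {m: 0 for m in moves}
    for _ in range(iterations):
        m = random.choice(moves)
        score[m] += state.play(m).random_playout()  # +1 win, -1 loss
        visits[m] += 1
    return max(moves, key=lambda m: score[m] / max(visits[m], 1))

def uct_child(node, c=1.4):
    """UCB1-style selection: mean value plus exploration term.

    Assumes every child has been visited at least once; unvisited
    children are handled by the expansion step, not shown here.
    """
    return max(
        node.children,
        key=lambda ch: ch.value / ch.visits
        + c * math.sqrt(math.log(node.visits) / ch.visits),
    )
```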


Citations
Proceedings Article

Strategic Features for General Games

TL;DR: An ongoing research project that requires the automated self-play learning and evaluation of a large number of board games in digital form is described, with the aim of determining relevant features for biasing MCTS playouts in arbitrary games played on arbitrary geometries.
Journal Article

Monte Carlo Tree Search: a review of recent modifications and applications

TL;DR: Monte Carlo Tree Search (MCTS), as discussed by the authors, is a powerful approach to designing game-playing bots or solving sequential decision problems; it relies on intelligent tree search that balances exploration and exploitation.
Book Chapter

Positional Games and QBF: The Corrective Encoding

TL;DR: A novel encoding of positional games into Quantified Boolean Formulas (QBFs) is presented, such that a game instance admits a winning strategy for the first player if and only if the corresponding formula is true.
Journal Article

Turn-Based War Chess Model and Its Search Algorithm per Turn

TL;DR: A theoretical framework combining combinatorial optimization with game-tree search is proposed; both of the proposed search algorithms are proved to be optimal, and the difference between their efficiencies is analyzed.
References
Book Chapter

Bandit Based Monte-Carlo Planning

TL;DR: A bandit-based Monte-Carlo planning algorithm is proposed for large state-space Markovian decision problems (MDPs), a setting in which Monte-Carlo planning is one of the few viable approaches to finding near-optimal solutions.
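The selection rule at the core of this bandit-based planner is UCB1 applied at every node of the search tree, which yields UCT. In its standard statement (taken from the general MCTS literature rather than quoted from this particular chapter), the planner descends to the child j maximizing

\[
\bar{X}_{j} + 2 C_{p}\sqrt{\frac{2\ln n}{n_{j}}}
\]

where \bar{X}_{j} is the mean reward of child j, n_{j} its visit count, n the visit count of the parent, and C_{p} a tunable exploration constant.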
Journal Article

A Survey of Monte Carlo Tree Search Methods

TL;DR: A survey of the Monte Carlo tree search literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research; it outlines the core algorithm's derivation, imparts some structure on the many variations and enhancements that have been proposed, and summarizes results from the key game and non-game domains.
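The core algorithm that the survey structures its taxonomy around is the four-phase MCTS loop. A minimal Python skeleton of that loop is sketched below; the node interface and the `select`, `expand`, `simulate`, and `backpropagate` callables are assumptions for illustration, not anything specific to the survey.

```python
# Generic four-phase MCTS loop (selection, expansion, simulation,
# backpropagation); the node interface and policy callables are assumed.

def mcts(root, iterations, select, expand, simulate, backpropagate):
    for _ in range(iterations):
        leaf = select(root)            # tree policy, e.g. UCT descent
        child = expand(leaf)           # add a new child if moves remain
        reward = simulate(child)       # default-policy playout to the end
        backpropagate(child, reward)   # update statistics back to the root
    return max(root.children, key=lambda ch: ch.visits)
```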
Book Chapter

Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search

TL;DR: A new framework is presented that combines tree search with Monte-Carlo evaluation without separating a min-max phase from a Monte-Carlo phase; it provides fine-grained control of tree growth at the level of individual simulations and allows efficient selectivity.
Journal Article

Progressive Strategies for Monte-Carlo Tree Search

TL;DR: Two progressive strategies for MCTS are introduced, called progressive bias and progressive unpruning, which enable the use of relatively time-expensive heuristic knowledge without speed reduction.
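One common formulation of the progressive bias idea is to add a heuristic term to the selection value that fades as a node accumulates visits. The sketch below follows that formulation; `ch.heuristic` is a hypothetical per-move knowledge score and the weights are placeholder values, not the paper's tuned parameters.

```python
import math

def progressive_bias_value(ch, parent_visits, c=1.4, w=1.0):
    """UCT value plus a heuristic bias that decays as visits accumulate."""
    exploit = ch.value / ch.visits
    explore = c * math.sqrt(math.log(parent_visits) / ch.visits)
    bias = w * ch.heuristic / (ch.visits + 1)  # knowledge term fades out
    return exploit + explore + bias
```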
Posted Content

Bandit Algorithms for Tree Search

TL;DR: A bandit algorithm for smooth trees (BAST) is proposed, which takes into account the actual smoothness of the rewards to perform efficient "cuts" of sub-optimal branches with high confidence.