The Games Computers (and People) Play

doi:10.1016/S0065-2458(00)80019-4

Book Chapter

Jonathan Schaeffer

- Vol. 52, pp 1179

TLDR

Discusses the past, present, and future of the development of game-playing programs, and notes some surprising changes of direction that will make games more of an experimental testbed for mainstream AI research, with less emphasis on building world-championship-caliber programs.

Abstract:

The development of high-performance game-playing programs has been one of the major successes of artificial intelligence research. The results have been outstanding but, with one notable exception (Deep Blue), they have not been widely disseminated. This talk will discuss the past, present, and future of the development of game-playing programs. Case studies for backgammon, bridge, checkers, chess, go, hex, Othello, poker, and Scrabble will be used. The research emphasis of the past has been on high performance (synonymous with brute-force search) for two-player perfect-information games. The research emphasis of the present encompasses multi-player games with imperfect or nondeterministic information. And what of the future? There are some surprising changes of direction occurring that will result in games being more of an experimental testbed for mainstream AI research, with less emphasis on building world-championship-caliber programs.

One of the most profound contributions to mankind’s knowledge has been made by the artificial intelligence (AI) research community: the realization that intelligence is not uniquely human. Using computers, it is possible to achieve human-like behavior in nonhumans. In other words, the illusion of human intelligence can be created in a computer. This idea has been vividly illustrated throughout the history of computer games research.

Unlike most of the early work in AI, game researchers were interested in developing high-performance, real-time solutions to challenging problems. This led to an ends-justify-the-means attitude: the result (a strong chess program) was all that mattered, not the means by which it was achieved. In contrast, much of the mainstream AI work used simplified domains, while eschewing real-time performance objectives. This research typically used human intelligence as a model: one only had to emulate the human example to achieve intelligent behavior. The battle (and philosophical) lines were drawn.

The difference in philosophy can be easily illustrated. The human brain and the computer are different machines, each with its own set of strengths and weaknesses. Humans are good at, for example, learning, reasoning by analogy, and
