A video game description language for model-based or interactive learning
Tom Schaul
- pp 1-8
TLDR
It is shown how to learn competent behaviors whether or not a model of the game dynamics is available, whether the agent receives full state information or only subjective observations, whether learning is interactive or in batch mode, and for a number of different learning algorithms, including reinforcement learning and evolutionary search.
Abstract:
We propose a powerful new tool for conducting research on computational intelligence and games. 'PyVGDL' is a simple, high-level description language for 2D video games, and the accompanying software library permits parsing and instantly playing those games. The streamlined design of the language is based on defining locations and dynamics for simple building blocks, and the interaction effects when such objects collide, all of which are provided in a rich ontology. It can be used to quickly design games, without needing to deal with control structures, and the concise language is also accessible to generative approaches. We show how the dynamics of many classical games can be generated from a few lines of PyVGDL. The main objective of these generated games is to serve as diverse benchmark problems for learning and planning algorithms; so we provide a collection of interfaces for different types of learning agents, with visual or abstract observations, from a global or first-person viewpoint. To demonstrate the library's usefulness in a broad range of learning scenarios, we show how to learn competent behaviors when a model of the game dynamics is available or when it is not, when full state information is given to the agent or just subjective observations, when learning is interactive or in batch-mode, and for a number of different learning algorithms, including reinforcement learning and evolutionary search.
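The abstract's core idea (building blocks with locations and dynamics, plus collision-triggered interaction effects drawn from an ontology) can be made concrete with a sketch of what a game definition in the VGDL style looks like. The specific sprite classes, effect names, and level-mapping characters below are illustrative assumptions, not taken verbatim from the paper:

```
BasicGame
    SpriteSet
        avatar > MovingAvatar              # player-controlled sprite
        goal   > Immovable color=GREEN     # static objective
        wall   > Immovable
    LevelMapping
        A > avatar                         # characters used in ASCII level layouts
        G > goal
        w > wall
    InteractionSet
        avatar wall  > stepBack            # collision effect: undo the move
        goal   avatar > killSprite scoreChange=1
    TerminationSet
        SpriteCounter stype=goal limit=0 win=True
```

The four sections mirror the abstract's design: sprite definitions give each building block its dynamics, the interaction set declares what happens on each pair-wise collision, and termination conditions define the game's objective, all without any explicit control flow.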
Citations
Posted Content
StarCraft II: A New Challenge for Reinforcement Learning
Oriol Vinyals,Timo Ewalds,Sergey Bartunov,Petko Georgiev,Alexander Vezhnevets,Michelle Yeo,Alireza Makhzani,Heinrich Küttler,John P. Agapiou,Julian Schrittwieser,John Quan,Stephen Gaffney,Stig Petersen,Karen Simonyan,Tom Schaul,Hado van Hasselt,David Silver,Timothy P. Lillicrap,Kevin Calderone,Paul Keet,Anthony Brunasso,David Lawrence,Anders Ekermo,Jacob Repp,Rodney Tsing +24 more
TL;DR: This paper introduces SC2LE (StarCraft II Learning Environment), a reinforcement learning environment based on the StarCraft II game. It offers a new and challenging setting for exploring deep reinforcement learning algorithms and architectures, and gives initial baseline results for neural networks trained to predict game outcomes and player actions.
Book
Procedural Content Generation in Games
TL;DR: This book presents the most up-to-date coverage of procedural content generation (PCG) for games, specifically the procedural generation of levels, landscapes, items, rules, quests, or other types of content.
Proceedings Article
The Malmo platform for artificial intelligence experimentation
TL;DR: Project Malmo provides a sophisticated abstraction layer on top of Minecraft that supports a wide range of experimentation scenarios, from navigation and survival to collaboration and problem-solving tasks, and is designed to support openness and collaboration in AI research.
BookDOI
Artificial Intelligence and Games
TL;DR: This is the first textbook dedicated to explaining how artificial intelligence techniques can be used in and for games, and how to use AI to play games, to generate content for games and to model players.
Journal ArticleDOI
The 2014 General Video Game Playing Competition
Diego Perez-Liebana,Spyridon Samothrakis,Julian Togelius,Tom Schaul,Simon M. Lucas,Adrien Couëtoux,Jerry Lee,Chong-U Lim,Tommy Thompson +8 more
TL;DR: All controllers submitted to the first General Video Game Playing Competition are described, with an in-depth description of four of them by their authors, including the winner and the runner-up entries of the contest.
References
Journal ArticleDOI
Long short-term memory
TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete time steps by enforcing constant error flow through constant error carousels within special units.
Book
Reinforcement Learning: An Introduction
TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Journal ArticleDOI
A Survey of Monte Carlo Tree Search Methods
Cameron Browne,Edward J. Powley,Daniel Whitehouse,Simon M. Lucas,Peter I. Cowling,Philipp Rohlfshagen,S. Tavener,Diego Perez,Spyridon Samothrakis,Simon Colton +9 more
TL;DR: A survey of the literature on Monte Carlo tree search, intended to provide a snapshot of the state of the art after the first five years of MCTS research; it outlines the core algorithm's derivation, imparts some structure on the many variations and enhancements that have been proposed, and summarizes results from the key game and non-game domains.
Journal ArticleDOI
The arcade learning environment: an evaluation platform for general agents
TL;DR: The Arcade Learning Environment (ALE) is a platform for evaluating the development of general, domain-independent AI technology; it provides an interface to hundreds of Atari 2600 game environments, each one different, interesting, and designed to be a challenge for human players.
Journal ArticleDOI
Least-squares policy iteration
TL;DR: The new algorithm, least-squares policy iteration (LSPI), learns the state-action value function, which allows for action selection without a model and for incremental policy improvement within a policy-iteration framework.