Rational and convergent learning in stochastic games

Open Access Proceedings Article

Michael Bowling, +1 more

- pp 1021-1026

TLDR

This paper introduces two properties as desirable for a learning agent when in the presence of other learning agents, namely rationality and convergence, and contributes a new learning algorithm, WoLF policy hillclimbing, that is proven to be rational.

Abstract:

This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as desirable for a learning agent when in the presence of other learning agents, namely rationality and convergence. We examine existing reinforcement learning algorithms according to these two properties and notice that they fail to simultaneously meet both criteria. We then contribute a new learning algorithm, WoLF policy hillclimbing, that is based on a simple principle: “learn quickly while losing, slowly while winning.” The algorithm is proven to be rational and we present empirical results for a number of stochastic games showing the algorithm converges.
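
The abstract states the WoLF principle but not the update rule itself. The sketch below shows one common way the idea is written for a single-state (repeated matrix) game. It is an illustration under stated assumptions, not the authors' code. The class name `WoLFPHC`, its parameter names, and the default step sizes are hypothetical. The "winning" test used here, comparing the current policy's expected value against that of the running average policy, follows the usual description of WoLF policy hill-climbing and is not spelled out in the abstract.

```python
import random


class WoLFPHC:
    """Single-state WoLF policy hill-climbing sketch (hypothetical names)."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9,
                 delta_win=0.01, delta_lose=0.04):
        # Assumes n_actions >= 2 and delta_lose > delta_win, so the agent
        # learns quickly while losing and slowly while winning.
        self.n = n_actions
        self.alpha, self.gamma = alpha, gamma
        self.delta_win, self.delta_lose = delta_win, delta_lose
        self.q = [0.0] * n_actions                  # action-value estimates
        self.pi = [1.0 / n_actions] * n_actions     # current mixed policy
        self.avg_pi = [1.0 / n_actions] * n_actions # running average policy
        self.count = 0

    def act(self, rng=random):
        # Sample an action from the current mixed policy.
        return rng.choices(range(self.n), weights=self.pi)[0]

    def step_size(self):
        # "Winning" means the current policy scores better than the average one.
        value = sum(p * q for p, q in zip(self.pi, self.q))
        avg_value = sum(p * q for p, q in zip(self.avg_pi, self.q))
        return self.delta_win if value > avg_value else self.delta_lose

    def update(self, action, reward):
        # Q-learning update; with one state, the next state is the same state.
        target = reward + self.gamma * max(self.q)
        self.q[action] += self.alpha * (target - self.q[action])

        # Incrementally update the average policy.
        self.count += 1
        for a in range(self.n):
            self.avg_pi[a] += (self.pi[a] - self.avg_pi[a]) / self.count

        # Hill-climb: move probability mass toward the greedy action,
        # never taking more from an action than it currently holds.
        delta = self.step_size()
        best = max(range(self.n), key=self.q.__getitem__)
        for a in range(self.n):
            if a != best:
                shift = min(self.pi[a], delta / (self.n - 1))
                self.pi[a] -= shift
                self.pi[best] += shift
```

A caller would alternate `act()` and `update(action, reward)` once per round of the game. Capping each shift at the probability an action currently holds keeps `pi` a valid distribution without a separate projection step, which is a simplification chosen for this sketch.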

Citations

Journal Article

Machine learning

Thomas G. Dietterich

- 01 Dec 1996 -

ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

Book

Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations

Yoav Shoham, +1 more

TL;DR: This exciting and pioneering new overview of multiagent systems, which are online systems composed of multiple interacting intelligent agents, i.e., online trading, offers a newly seen computer science perspective on multi agent systems, while integrating ideas from operations research, game theory, economics, logic, and even philosophy and linguistics.

Journal Article

A Comprehensive Survey of Multiagent Reinforcement Learning

Lucian Busoniu, +2 more

TL;DR: The benefits and challenges of MARL are described along with some of the problem domains where the MARL techniques have been applied, and an outlook for the field is provided.

Journal Article

Cooperative Multi-Agent Learning: The State of the Art

Liviu Panait, +1 more

- 01 Nov 2005 -

Autonomous Agents and Multi-Agent System...

TL;DR: This survey attempts to draw from multi-agent learning work in a spectrum of areas, including RL, evolutionary computation, game theory, complex systems, agent modeling, and robotics, and finds that this broad view leads to a division of the work into two categories.

Journal Article

Multiagent Learning Using a Variable Learning Rate

Michael Bowling, +1 more

- 01 Apr 2002 -

Artificial Intelligence

TL;DR: This article introduces the WoLF principle, “Win or Learn Fast”, for varying the learning rate, and examines this technique theoretically, proving convergence in self-play on a restricted class of iterated matrix games.

References

Journal Article

Machine learning

Thomas G. Dietterich

- 01 Dec 1996 -

ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

Book

Decisions with Multiple Objectives: Preferences and Value Trade-Offs

Ralph L. Keeney, +2 more

TL;DR: In this article, a confused decision maker, who wishes to make a reasonable and responsible choice among alternatives, can systematically probe his true feelings in order to make those critically important, vexing trade-offs between incommensurable objectives.

Journal Article

Counterspeculation, auctions, and competitive sealed tenders

William Vickrey

- 01 Mar 1961 -

Journal of Finance

Journal Article

Equilibrium points in n-person games

John F. Nash

- 01 Jan 1950 -

Proceedings of the National Academy of S...

TL;DR: A concept of an n-person game in which each player has a finite set of pure strategies and in which a definite set of payments to the n players corresponds to each n-tuple of pure strategies, one strategy being taken for each player.

Book

Introduction to Reinforcement Learning

Richard S. Sutton, +1 more

TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.

Related Papers (5)

Markov games as a framework for multi-agent reinforcement learning

Reinforcement Learning: An Introduction

Reinforcement learning: a survey

The theory of learning in games

A Comprehensive Survey of Multiagent Reinforcement Learning