Nash q-learning for general-sum stochastic games

doi:10.5555/945365.964288

Open AccessJournal ArticleDOI

Nash q-learning for general-sum stochastic games

Junling Hu, +1 more

- 01 Dec 2003 -

Journal of Machine Learning Research

- Vol. 4, pp 1039-1069

TLDR

This work extends Q-learning to a noncooperative multiagent context, using the framework of general-sum stochastic games, and implements an online version of Nash Q- learning that balances exploration with exploitation, yielding improved performance.

Abstract:

We extend Q-learning to a noncooperative multiagent context, using the framework of general-sum stochastic games. A learning agent maintains Q-functions over joint actions, and performs updates based on assuming Nash equilibrium behavior over the current Q-values. This learning protocol provably converges given certain restrictions on the stage games (defined by Q-values) that arise during learning. Experiments with a pair of two-player grid games suggest that such restrictions on the game structure are not necessarily required. Stage games encountered during learning in both grid environments violate the conditions. However, learning consistently converges in the first grid game, which has a unique equilibrium Q-function, but sometimes fails to converge in the second, which has three different equilibrium Q-functions. In a comparison of offline learning performance in both games, we find agents are more likely to reach a joint optimal path with Nash Q-learning than with a single-agent Q-learning method. When at least one agent adopts Nash Q-learning, the performance of both agents is better than using single-agent Q-learning. We have also implemented an online version of Nash Q-learning that balances exploration with exploitation, yielding improved performance.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Machine learning

Thomas G. Dietterich

- 01 Dec 1996 -

ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Journal ArticleDOI

A Comprehensive Survey of Multiagent Reinforcement Learning

Lucian Busoniu, +2 more

TL;DR: The benefits and challenges of MARL are described along with some of the problem domains where the MARL techniques have been applied, and an outlook for the field is provided.

...read moreread less

Journal ArticleDOI

Cooperative Multi-Agent Learning: The State of the Art

Liviu Panait, +1 more

- 01 Nov 2005 -

Autonomous Agents and Multi-Agent System...

TL;DR: This survey attempts to draw from multi-agent learning work in a spectrum of areas, including RL, evolutionary computation, game theory, complex systems, agent modeling, and robotics, and finds that this broad view leads to a division of the work into two categories.

...read moreread less

Journal ArticleDOI

Applications of Deep Reinforcement Learning in Communications and Networking: A Survey

Nguyen Cong Luong, +6 more

- 14 May 2019 -

IEEE Communications Surveys and Tutorial...

TL;DR: This paper presents a comprehensive literature review on applications of deep reinforcement learning (DRL) in communications and networking, and presents applications of DRL for traffic routing, resource sharing, and data collection.

...read moreread less

Proceedings ArticleDOI

DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

Namhoon Lee, +5 more

TL;DR: The proposed Deep Stochastic IOC RNN Encoder-decoder framework, DESIRE, for the task of future predictions of multiple interacting agents in dynamic scenes significantly improves the prediction accuracy compared to other baseline methods.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

Journal ArticleDOI

Machine learning

Thomas G. Dietterich

- 01 Dec 1996 -

ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

Book

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Martin L. Puterman

TL;DR: Puterman as discussed by the authors provides a uniquely up-to-date, unified, and rigorous treatment of the theoretical, computational, and applied research on Markov decision process models, focusing primarily on infinite horizon discrete time models and models with discrete time spaces while also examining models with arbitrary state spaces, finite horizon models, and continuous time discrete state models.

...read moreread less

Journal ArticleDOI

Technical Note : \cal Q -Learning

Chris Watkins, +1 more

- 01 May 1992 -

Machine Learning

TL;DR: This paper presents and proves in detail a convergence theorem forQ-learning based on that outlined in Watkins (1989), showing that Q-learning converges to the optimum action-values with probability 1 so long as all actions are repeatedly sampled in all states and the action- values are represented discretely.

...read moreread less

Book

A Course in Game Theory

Martin J. Osborne, +3 more

TL;DR: A Course in Game Theory as discussed by the authors presents the main ideas of game theory at a level suitable for graduate students and advanced undergraduates, emphasizing the theory's foundations and interpretations of its basic concepts.

...read moreread less

Collapse

Nash q-learning for general-sum stochastic games

Citations

Machine learning

A Comprehensive Survey of Multiagent Reinforcement Learning

Cooperative Multi-Agent Learning: The State of the Art

Applications of Deep Reinforcement Learning in Communications and Networking: A Survey

DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents

References

Reinforcement Learning: An Introduction

Machine learning

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Technical Note : \cal Q -Learning

A Course in Game Theory

Related Papers (5)

Markov games as a framework for multi-agent reinforcement learning

Reinforcement Learning: An Introduction

A Comprehensive Survey of Multiagent Reinforcement Learning

Reinforcement learning: a survey

Stochastic Games