Learning and Cooperation in Sequential Games

doi:10.1177/105971230601400304

Journal ArticleDOI

Learning and Cooperation in Sequential Games

Annapurna Valluri

- 01 Sep 2006 -

Adaptive Behavior

- Vol. 14, Iss: 3, pp 195-209

Chats0

TLDR

This work model agents with a reinforce ment learning algorithm and analyze cooperative behavior in a sequential prisoner's dilemma game and attributes the reciprocal-like behavior to the structural flow of information, which reduces the risks of exploitation faced by the second-mover.

Abstract:

The predictions of classical game theory for one-shot and finitely repeated play of many 2x2 simulta neous games do not correspond to human behavior observed in laboratory experiments. The promis ing results of learning models in tracking human behavior coupled with the growing electronic market and the number of e-commerce applications has resulted in an increased interest in studying the behavior of adaptive artificial agents in different economic games. We model agents with a reinforce ment learning algorithm and analyze cooperative behavior in a sequential prisoner's dilemma game. Our results demonstrate the ability of artificial agents to learn cooperative behavior even in sequential games where defection is the subgame perfect Nash equilibrium. We attribute the reciprocal-like behavior to the structural flow of information, which reduces the risks of exploitation faced by the second-mover. Additionally, we analyze the impact of the second-mover's temptation payoff and pay off risks on the rate of cooperative behavior.

Citations

PDF

Open Access

More filters

Posted Content

Perfect versus imperfect observability---An experimental test of Bagwell's result

Steffen Huck, +1 more

- 17 Apr 1998 -

Research Papers in Economics

TL;DR: An experimental test of Bagwell's claim that the first mover advantage vanishes completely if this action is only imperfectly observed by second movers finds some support for the noisy Stackelberg equilibrium emphasised by van Damme and Hurkens (1997).

...read moreread less

Journal ArticleDOI

A dynamic, embodied paradigm to investigate the role of serotonin in decision-making

Derrik E. Asher, +4 more

- 21 Nov 2013 -

Frontiers in Integrative Neuroscience

TL;DR: A cyclic multidisciplinary approach is proposed that may aid in addressing the complexity of exploring 5-HT and decision-making by iteratively updating the authors' assumptions and models of the serotonergic system through exhaustive experimentation.

...read moreread less

Journal ArticleDOI

A game theoretic framework for incentive-based models of intrinsic motivation in artificial systems

Kathryn E. Merrick, +1 more

- 30 Oct 2013 -

Frontiers in Psychology

TL;DR: This paper uses agent-based simulations to demonstrate that players with different optimally motivating incentive act differently as a result of their altered perception of the game, and discusses the implications both for modeling human behavior and for designing artificial agents or robots.

...read moreread less

Journal ArticleDOI

Is There a Violation of Savage's Sure-Thing Principle in the Prisoner's Dilemma Game?

Shu Li, +3 more

- 01 Jun 2010 -

Adaptive Behavior

TL;DR: It was found that the sure-thing principle was violated in the domain of gains as expected by Shafir and Tversky but obeyed in thedomain of losses.

...read moreread less

Journal ArticleDOI

Social contracts and human-computer interaction with simulated adapting agents

Alexis B. Craig, +4 more

- 01 Oct 2013 -

Adaptive Behavior

TL;DR: Results indicated that the adapting agent caused subjects to spend more time and effort in each game, exhibiting a more complicated path to their destination, which suggests that adapting agents exhibit behavior similar to human opponents, evoking more natural social responses in subjects.

...read moreread less

References

PDF

Open Access

More filters

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

Book

The Evolution of Cooperation

Robert Axelrod, +1 more

TL;DR: In this paper, a model based on the concept of an evolutionarily stable strategy in the context of the Prisoner's Dilemma game was developed for cooperation in organisms, and the results of a computer tournament showed how cooperation based on reciprocity can get started in an asocial world, can thrive while interacting with a wide range of other strategies, and can resist invasion once fully established.

...read moreread less

Journal ArticleDOI

Moral Hazard and Observability

Bengt Holmstrom

- 01 Jan 1979 -

The Bell Journal of Economics

TL;DR: In this article, the role of imperfect information in a principal-agent relationship subject to moral hazard is considered, and a necessary and sufficient condition for imperfect information to improve on contracts based on the payoff alone is derived.

...read moreread less

Book

Introduction to Reinforcement Learning

Richard S. Sutton, +1 more

TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.

...read moreread less

Journal ArticleDOI

Reinforcement learning: a survey

Leslie Pack Kaelbling, +2 more

- 01 Jan 1996 -

Journal of Artificial Intelligence Resea...

TL;DR: Central issues of reinforcement learning are discussed, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state.

...read moreread less

Collapse

Learning and Cooperation in Sequential Games

Citations

Perfect versus imperfect observability---An experimental test of Bagwell's result

A dynamic, embodied paradigm to investigate the role of serotonin in decision-making

A game theoretic framework for incentive-based models of intrinsic motivation in artificial systems

Is There a Violation of Savage's Sure-Thing Principle in the Prisoner's Dilemma Game?

Social contracts and human-computer interaction with simulated adapting agents

References

Reinforcement Learning: An Introduction

The Evolution of Cooperation

Moral Hazard and Observability

Introduction to Reinforcement Learning

Reinforcement learning: a survey

Related Papers (5)

Game theory and economics

Utility Based Q-learning to Maintain Cooperation in Prisoner's Dilemma Games

Bargaining Set Solution Concepts in Dynamic Cooperative Games

The good, the bad and the cautious: safety level cooperative games

New Theory of Cooperative Games