scispace - formally typeset
Search or ask a question
Topic

Stochastic game

About: Stochastic game is a research topic. Over the lifetime, 9493 publications have been published within this topic receiving 202664 citations.


Papers
More filters
Posted Content
TL;DR: In this paper, the authors study decision problems under uncertainty where a decisionmaker observes an imperfect signal about the true state of the world and analyzes the information preferences and information demand of such decision-makers, based on properties of their payoff functions.
Abstract: February 2001 This paper studies decision problems under uncertainty where a decision-maker observes an imperfect signal about the true state of the world. We analyze the information preferences and information demand of such decision-makers, based on properties of their payoff functions. We restrict attention to "monotone decision problems," whereby the posterior beliefs induced by the signal can be ordered so that higher actions are chosen in response to higher signal realizations. Monotone decision problems are frequently encountered in economic modeling. We provide necessary and sufficient conditions for all decision makers with different classes of payoff functions to prefer one information structure to another. We also provide conditions under which two decision-makers in a given class can be ranked in terms of their marginal value for information and hence information demand. Applications and examples are given. Working Papers Index

108 citations

Journal ArticleDOI
TL;DR: In this paper, the authors present experimental results on humans playing a route choice game in a computer laboratory, which allow one to study decision behavior in repeated games beyond the Prisoner's Dilemma.
Abstract: In many social dilemmas, individuals tend to generate a situation with low payoffs instead of a system optimum ("tragedy of the commons"). Is the routing of traffic a similar problem? In order to address this question, we present experimental results on humans playing a route choice game in a computer laboratory, which allow one to study decision behavior in repeated games beyond the Prisoner's Dilemma. We will focus on whether individuals manage to find a cooperative and fair solution compatible with the system-optimal road usage. We find that individuals tend towards a user equilibrium with equal travel times in the beginning. However, after many iterations, they often establish a coherent oscillatory behavior, as taking turns performs better than applying pure or mixed strategies. The resulting behavior is fair and compatible with system-optimal road usage. In spite of the complex dynamics leading to coordinated oscillations, we have identified mathematical relationships quantifying the observed transition process. Our main experimental discoveries for 2- and 4-person games can be explained with a novel reinforcement learning model for an arbitrary number of persons, which is based on past experience and trial-and-error behavior. Gains in the average payoff seem to be an important driving force for the innovation of time-dependent response patterns, i.e. the evolution of more complex strategies. Our findings are relevant for decision support systems and routing in traffic or data networks.

108 citations

Proceedings ArticleDOI
11 Aug 2009
TL;DR: A family of games on which the discrete strategy improvement algorithm for solving parity games due to Voege and Jurdzinski requires exponentially many strategy iterations is outlined, answering in the negative the long-standing question whether this algorithm runs in polynomial time.
Abstract: This paper presents a new lower bound for the discrete strategy improvement algorithm for solving parity games due to Voege and Jurdzinski. First, we informally show which structures are difficult to solve for the algorithm. Second, we outline a family of games on which the algorithm requires exponentially many strategy iterations, answering in the negative the long-standing question whether this algorithm runs in polynomial time. Additionally we note that the same family of games can be used to prove a similar result w.r.t. the strategy improvement variant by Schewe as well as the strategy iteration for solving discounted payoff games due to Puri.

108 citations

Journal ArticleDOI
31 Aug 2001-Chaos
TL;DR: The combination of Parrondo's games can be therefore considered as a discrete-time Brownian ratchet, and the parameter space in which the paradoxical effect occurs is found and the winning rate analysis is carried out.
Abstract: Parrondo’s games present an apparently paradoxical situation where individually losing games can be combined to win. In this article we analyze the case of two coin tossing games. Game B is played with two biased coins and has state-dependent rules based on the player’s current capital. Game B can exhibit detailed balance or even negative drift (i.e., loss), depending on the chosen parameters. Game A is played with a single biased coin that produces a loss or negative drift in capital. However, a winning expectation is achieved by randomly mixing A and B. One possible interpretation pictures game A as a source of “noise” that is rectified by game B to produce overall positive drift—as in a Brownian ratchet. Game B has a state-dependent rule that favors a losing coin, but when this state dependence is broken up by the noise introduced by game A, a winning coin is favored. In this article we find the parameter space in which the paradoxical effect occurs and carry out a winning rate analysis. The significance of Parrondo’s games is that they are physically motivated and were originally derived by considering a Brownian ratchet—the combination of the games can be therefore considered as a discrete-time Brownian ratchet. We postulate the use of games of this type as a toy model for a number of physical and biological processes and raise a number of open questions for future research.

107 citations

Journal ArticleDOI
TL;DR: In this paper, a new decomposition of a symmetric extensive two-person game into a top and an abridgement is introduced, and sufficient conditions for a regular limit ESS are obtained on the basis of the result on regularity and essentiality.

107 citations


Network Information
Related Topics (5)
Markov chain
51.9K papers, 1.3M citations
81% related
Incentive
41.5K papers, 1M citations
81% related
Heuristics
32.1K papers, 956.5K citations
80% related
Linear programming
32.1K papers, 920.3K citations
79% related
Empirical research
51.3K papers, 1.9M citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023364
2022738
2021462
2020512
2019460
2018483