Home
/
Topics
/
Stochastic game

Topic

Stochastic game

About: Stochastic game is a research topic. Over the lifetime, 9493 publications have been published within this topic receiving 202664 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1993
1992
1991
1990
1989
1988
1987
1986
1985
1984
1983
1982
1981
1980
1979
1978
1977
1976
1975
1974
1973
1972
1971
1970
1969

1 / 2

Papers

PDF

Open Access

More filters

Posted Content•

Settling the complexity of computing approximate two-player Nash equilibria

[...]

Aviad Rubinstein¹•Institutions (1)

University of California, Berkeley¹

14 Jun 2016-arXiv: Computational Complexity

TL;DR: In this paper, it was shown that computing an approximate Nash equilibrium in a game with n players requires quasi-polynomial time, in the sense that the payoff tensors need to be queried every time a Nash equilibrium is reached.

...read moreread less

Abstract: We prove that there exists a constant $\epsilon>0$ such that, assuming the Exponential Time Hypothesis for PPAD, computing an $\epsilon$-approximate Nash equilibrium in a two-player (nXn) game requires quasi-polynomial time, $n^{\log^{1-o(1)} n}$. This matches (up to the o(1) term) the algorithm of Lipton, Markakis, and Mehta [LMM03]. Our proof relies on a variety of techniques from the study of probabilistically checkable proofs (PCP); this is the first time that such ideas are used for a reduction between problems inside PPAD. En route, we also prove new hardness results for computing Nash equilibria in games with many players. In particular, we show that computing an $\epsilon$-approximate Nash equilibrium in a game with n players requires $2^{\Omega(n)}$ oracle queries to the payoff tensors. This resolves an open problem posed by Hart and Nisan [HN13], Babichenko [Bab14], and Chen et al. [CCT15]. In fact, our results for n-player games are stronger: they hold with respect to the $(\epsilon,\delta)$-WeakNash relaxation recently introduced by Babichenko et al. [BPR16].

...read moreread less

50 citations

Journal Article•DOI•

Weakly monotonic solutions for cooperative games

[...]

André Casajus¹, Frank Huettner¹•Institutions (1)

HHL Leipzig Graduate School of Management¹

01 Nov 2014-Journal of Economic Theory

TL;DR: This work investigates the class of values that satisfy efficiency, symmetry, and weak monotonicity and it turns out that this class coincides with theclass of egalitarian Shapley values.

...read moreread less

50 citations

Book Chapter•DOI•

Improved second-order bounds for prediction with expert advice

[...]

Nicolò Cesa-Bianchi¹, Yishay Mansour², Gilles Stoltz³•Institutions (3)

University of Milan¹, Tel Aviv University², École Normale Supérieure³

27 Jun 2005

TL;DR: This work derives a simple and new forecasting strategy with regret at most order of Q*, the largest absolute value of any payoff, and devise a refined analysis of the weighted majority forecaster, which yields bounds of the same flavour.

...read moreread less

Abstract: This work studies external regret in sequential prediction games with arbitrary payoffs (nonnegative or non-positive). External regret measures the difference between the payoff obtained by the forecasting strategy and the payoff of the best action. We focus on two important parameters: M, the largest absolute value of any payoff, and Q*, the sum of squared payoffs of the best action. Given these parameters we derive first a simple and new forecasting strategy with regret at most order of $\sqrt{Q^{*}({\rm ln}N)}+M {\rm ln} N$, where N is the number of actions. We extend the results to the case where the parameters are unknown and derive similar bounds. We then devise a refined analysis of the weighted majority forecaster, which yields bounds of the same flavour. The proof techniques we develop are finally applied to the adversarial multi-armed bandit setting, and we prove bounds on the performance of an online algorithm in the case where there is no lower bound on the probability of each action.

...read moreread less

50 citations

Journal Article•DOI•

Multiple Prisoner's Dilemma Games with(out) an Outside Option: an Experimental Study

[...]

Esther Hauk¹•Institutions (1)

Pompeu Fabra University¹

01 May 2003-Theory and Decision

TL;DR: This paper showed that an attractive outside option enhances cooperation in the prisoner's dilemma game if the payoff for mutual defection is negative; while this tendency makes them stick to mutual defraction if its payoff is positive, subjects use probabilistic start and end effect behavior.

...read moreread less

Abstract: Experiments in which subjects play simultaneously several finite two-person prisoner's dilemma supergames with and without an outside option reveal that: (i) an attractive outside option enhances cooperation in the prisoner's dilemma game, (ii) if the payoff for mutual defection is negative, subjects' tendency to avoid losses leads them to cooperate; while this tendency makes them stick to mutual defection if its payoff is positive, (iii) subjects use probabilistic start and endeffect behavior.

...read moreread less

50 citations

Posted Content•

How Individuals Learn to Take Turns: Emergence of Alternating Cooperation in a Congestion Game and the Prisoner's Dilemma

[...]

Dirk Helbing¹, Martin Schoenhof², Hans-Ulrich Stark², Janusz A. Hołyst³•Institutions (3)

ETH Zurich¹, Dresden University of Technology², Warsaw University of Technology³

15 Apr 2005-Social Science Research Network

TL;DR: In this paper, the authors present experimental results on humans playing a route choice game in a computer laboratory, which allow one to study decision behavior in repeated games beyond the Prisoner's Dilemma.

...read moreread less

Abstract: In many social dilemmas, individuals tend to generate a situation with low payoffs instead of a system optimum (tragedy of the commons) Is the routing of traffic a similar problem? In order to address this question, we present experimental results on humans playing a route choice game in a computer laboratory, which allow one to study decision behavior in repeated games beyond the Prisoner's Dilemma We will focus on whether individuals manage to find a cooperative and fair solution compatible with the system-optimal road usage We find that individuals tend towards a user equilibrium with equal travel times in the beginning However, after many iterations, they often establish a coherent oscillatory behavior, as taking turns performs better than applying pure or mixed strategies The resulting behavior is fair and compatible with system-optimal road usage In spite of the complex dynamics leading to coordinated oscillations, we have identified mathematical relationships quantifying the observed transition process Our main experimental discoveries for 2- and 4-person games can be explained with a novel reinforcement learning model for an arbitrary number of persons, which is based on past experience and trial-and-error behavior Gains in the average payoff seem to be an important driving force for the innovation of time-dependent response patterns, ie the evolution of more complex strategies Our findings are relevant for decision support systems and routing in traffic or data networks

...read moreread less

50 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
…
186
187
188
189
190
191
192
…
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

10,612

Papers

226,366

Citations

No. of papers in the topic in previous years
Year	Papers
2023	364
2022	738
2021	462
2020	512
2019	460
2018	483

Stochastic game

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics