Stochastic game

About: Stochastic game is a research topic. Over its lifetime, 9,493 publications have been published within this topic, receiving 202,664 citations.

[Figure: papers published on a yearly basis, 1969 to 2023]

Papers

Posted Content

Positive value of information in games

Bruno Bassan¹, Olivier Gossner², Marco Scarsini³, Shmuel Zamir⁴

Sapienza University of Rome¹, Université catholique de Louvain², University of Turin³, Hebrew University of Jerusalem⁴

01 Jul 2003-Research Papers in Economics

TL;DR: The authors exhibit a general class of interactive decision situations in which all agents benefit from more information: when the extended game (G,S) has a unique Pareto payoff profile u, every Nash payoff profile of (G,T) is dominated by u for any information structure T coarser than S. This class includes as a special case the classical comparison of statistical experiments à la Blackwell.

Abstract: We exhibit a general class of interactive decision situations in which all the agents benefit from more information. This class includes as a special case the classical comparison of statistical experiments à la Blackwell. More specifically, we consider pairs consisting of a game with incomplete information G and an information structure S such that the extended game (G,S) has a unique Pareto payoff profile u. We prove that u is a Nash payoff profile of (G,S), and that for any information structure T that is coarser than S, all Nash payoff profiles of (G,T) are dominated by u. We then prove that our condition is also necessary in the following sense: given any convex compact polyhedron of payoff profiles whose Pareto frontier is not a singleton, there exist an extended game (G,S) with that polyhedron as the convex hull of feasible payoffs, an information structure T coarser than S, and a player i who strictly prefers a Nash equilibrium in (G,T) to any Nash equilibrium in (G,S).

71 citations

Posted Content

Learning in games with continuous action sets and unknown payoff functions

Panayotis Mertikopoulos¹, Zhengyuan Zhou²

University of Grenoble¹, Stanford University²

25 Aug 2016

TL;DR: This paper focuses on learning via "dual averaging", a widely used class of no-regret learning schemes where players take small steps along their individual payoff gradients and then "mirror" the output back to their action sets, and introduces the notion of variational stability.

Abstract: This paper examines the convergence of no-regret learning in games with continuous action sets. For concreteness, we focus on learning via "dual averaging", a widely used class of no-regret learning schemes where players take small steps along their individual payoff gradients and then "mirror" the output back to their action sets. In terms of feedback, we assume that players can only estimate their payoff gradients up to a zero-mean error with bounded variance. To study the convergence of the induced sequence of play, we introduce the notion of variational stability, and we show that stable equilibria are locally attracting with high probability whereas globally stable equilibria are globally attracting with probability 1. We also discuss some applications to mixed-strategy learning in finite games, and we provide explicit estimates of the method's convergence speed.
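The dual averaging scheme described in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy two-player Cournot-style payoff u_i(x) = x_i(3 - x_0 - x_1), the action interval [0, 3], the Gaussian noise model, the step size 1/n, and the function names. The "mirror" step is taken to be Euclidean projection, which is one common choice among several.

```python
import random

def project(y, lo, hi):
    """Euclidean 'mirror' step: map the score y back onto the interval [lo, hi]."""
    return max(lo, min(hi, y))

def noisy_gradient(i, x, rng, noise_std):
    """Player i's payoff gradient in a toy duopoly, plus zero-mean noise.

    Assumed payoff (illustrative only): u_i(x) = x_i * (3 - x_0 - x_1),
    so du_i/dx_i = 3 - 2*x_i - x_j.
    """
    j = 1 - i
    return 3.0 - 2.0 * x[i] - x[j] + rng.gauss(0.0, noise_std)

def dual_averaging(steps=5000, noise_std=0.1, lo=0.0, hi=3.0, seed=0):
    rng = random.Random(seed)
    y = [0.0, 0.0]                       # aggregated gradient "scores"
    x = [project(v, lo, hi) for v in y]  # actions actually played
    for n in range(1, steps + 1):
        gamma = 1.0 / n                  # vanishing step size
        # Each player only sees a noisy estimate of its own gradient.
        grads = [noisy_gradient(i, x, rng, noise_std) for i in range(2)]
        # Small step along the individual gradient in the score space ...
        y = [y[i] + gamma * grads[i] for i in range(2)]
        # ... then mirror the scores back to the action set.
        x = [project(v, lo, hi) for v in y]
    return x

final_actions = dual_averaging()
print(final_actions)
```

In this toy game the unique Nash equilibrium is x_0 = x_1 = 1 (where both gradients vanish), so the printed actions should end up close to 1. The sketch says nothing about the paper's convergence rates or its variational stability condition.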

71 citations

Journal Article

Learning Optimal Discriminant Functions through a Cooperative Game of Automata

M. A. L. Thathachar¹, P. S. Sastry¹

Indian Institute of Science¹

01 Jan 1987

TL;DR: The problem of learning optimal discriminant functions is posed as a game with common payoff played by a team of mutually cooperating learning automata, and it is proved that the team can obtain the optimal classifier to an arbitrary approximation.

Abstract: The problem of learning correct decision rules to minimize the probability of misclassification is a long-standing problem of supervised learning in pattern recognition. The problem of learning such optimal discriminant functions is considered for the class of problems where the statistical properties of the pattern classes are completely unknown. The problem is posed as a game with common payoff played by a team of mutually cooperating learning automata. This essentially results in a probabilistic search through the space of classifiers. The approach is inherently capable of learning discriminant functions that are nonlinear in their parameters also. A learning algorithm is presented for the team and convergence is established. It is proved that the team can obtain the optimal classifier to an arbitrary approximation. Simulation results with a few examples are presented where the team learns the optimal classifier.
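A common-payoff game played by a team of learning automata can be sketched as follows. This is a hedged illustration, not the algorithm from the paper: the abstract does not state the update rule, so the sketch assumes the classical linear reward-inaction update, two automata, and a hypothetical matrix `success_prob` giving the probability that a joint action earns the shared reward. In the classification setting, each action would stand for a candidate parameter value and the reward for a correctly classified sample.

```python
import random

def sample(probs, rng):
    """Draw an action index according to the probability vector probs."""
    r = rng.random()
    cumulative = 0.0
    for action, p in enumerate(probs):
        cumulative += p
        if r < cumulative:
            return action
    return len(probs) - 1  # guard against floating-point round-off

def lri_update(probs, chosen, reward, rate):
    """Linear reward-inaction (assumed rule): shift probability toward the
    chosen action in proportion to the reward; leave probs unchanged if reward is 0."""
    return [
        p + rate * reward * ((1.0 if a == chosen else 0.0) - p)
        for a, p in enumerate(probs)
    ]

def play_team(success_prob, rounds=20000, rate=0.01, seed=0):
    rng = random.Random(seed)
    n_rows, n_cols = len(success_prob), len(success_prob[0])
    p_row = [1.0 / n_rows] * n_rows  # first automaton's action probabilities
    p_col = [1.0 / n_cols] * n_cols  # second automaton's action probabilities
    for _ in range(rounds):
        a = sample(p_row, rng)
        b = sample(p_col, rng)
        # Both automata receive the same (common) stochastic payoff.
        reward = 1.0 if rng.random() < success_prob[a][b] else 0.0
        p_row = lri_update(p_row, a, reward, rate)
        p_col = lri_update(p_col, b, reward, rate)
    return p_row, p_col

# Hypothetical reward probabilities; joint action (0, 0) is the best one.
success_prob = [[0.9, 0.2], [0.3, 0.5]]
p_row, p_col = play_team(success_prob)
print(p_row, p_col)
```

Each automaton updates only its own probability vector from the shared reward, which is what makes the procedure a decentralized probabilistic search over joint actions. With reward-inaction updates the team is only expected to settle on a locally optimal joint action, so the outcome depends on the reward matrix, the learning rate, and the random seed.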

71 citations

Journal Article

[...]

Eilon Solan¹,², Nicolas Vieille³

Northwestern University¹, Tel Aviv University², École Polytechnique³

01 Feb 2002-Games and Economic Behavior

TL;DR: It is proved that any n-player stochastic game admits an autonomous correlated equilibrium payoff and that, when the game is positive and recursive, a stationary correlated equilibrium payoff exists.

71 citations

Journal Article

Optimal cooperation-trap strategies for the iterated rock-paper-scissors game.

Zedong Bi¹, Hai-Jun Zhou¹

Chinese Academy of Sciences¹

29 Oct 2014-PLOS ONE

TL;DR: It is shown that maximal degree of cooperation is achievable in such a competitive system with cyclic dominance of actions, which may stimulate further theoretical and empirical studies on how to resolve conflicts and enhance cooperation in human societies.

Abstract: In an iterated non-cooperative game, if all the players act to maximize their individual accumulated payoff, the system as a whole usually converges to a Nash equilibrium that poorly benefits any player. Here we show that such an undesirable destiny is avoidable in an iterated Rock-Paper-Scissors (RPS) game involving two rational players, X and Y. Player X has the option of proactively adopting a cooperation-trap strategy, which enforces complete cooperation from the rational player Y and leads to a highly beneficial and maximally fair situation to both players. That maximal degree of cooperation is achievable in such a competitive system with cyclic dominance of actions may stimulate further theoretical and empirical studies on how to resolve conflicts and enhance cooperation in human societies.

70 citations

Performance

Metrics

Papers: 10,612
Citations: 226,366

No. of papers in the topic in previous years
Year	Papers
2023	364
2022	738
2021	462
2020	512
2019	460
2018	483
