scispace - formally typeset
Search or ask a question
Topic

Stochastic game

About: Stochastic game is a research topic. Over the lifetime, 9493 publications have been published within this topic receiving 202664 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: In this paper, the authors study games with strategic complementarities, arbitrary numbers of players and actions, and slightly noisy payoff signals, and prove limit uniqueness: as the signal noise vanishes, the game has a unique strategy profile that survives iterative dominance.

279 citations

Journal ArticleDOI
TL;DR: In this paper, the authors study the transition towards effective payoffs in the prisoner's dilemma game on scale-free networks by introducing a normalization parameter guiding the system from accumulated payoffs to payoffs normalized with the connectivity of each agent.
Abstract: We study the transition towards effective payoffs in the prisoner’s dilemma game on scale-free networks by introducing a normalization parameter guiding the system from accumulated payoffs to payoffs normalized with the connectivity of each agent. We show that during this transition the heterogeneity-based ability of scale-free networks to facilitate cooperative behavior deteriorates continuously, eventually collapsing with the results obtained on regular graphs. The strategy donations and adaptation probabilities of agents with different connectivities are studied. Results reveal that strategies generally spread from agents with larger towards agents with smaller degree. However, this strategy adoption flow reverses sharply in the fully normalized payoff limit. Surprisingly, cooperators occupy the hubs even if the averaged cooperation level due to partly normalized payoffs is moderate.

279 citations

Journal ArticleDOI
TL;DR: By applying an optimum value of the increment, this simple mechanism spontaneously creates relevant inhomogeneities in the teaching activities that support the maintenance of cooperation for both the prisoner's dilemma and the snowdrift game.
Abstract: Evolutionary games are studied where the teaching activity of players can evolve in time. Initially all players following either the cooperative or defecting strategy are distributed on a square lattice. The rate of strategy adoption is determined by the payoff difference and a teaching activity characterizing the donor's capability to enforce its strategy on the opponent. Each successful strategy adoption process is accompanied with an increase in the donor's teaching activity. By applying an optimum value of the increment this simple mechanism spontaneously creates relevant inhomogeneities in the teaching activities that support the maintenance of cooperation for both the prisoner's dilemma and the snowdrift game.

276 citations

Posted Content
TL;DR: In this paper, the authors characterize incentive-compatible and individually rational mechanisms, and look at second price auctions (which, under some conditions, maximize the seller's revenue), and show that exclusion (i.e., the announcement of a reservation price such that a measure can never get the object) is not necessarilly optimal for the seller.
Abstract: In our framework, when a buyer does not obtain the auctioned object, he is no longer indifferent about the identity of the winner (i.e., eyternal effects are present). Buyer i's preferences are characterized by an N-dimensional vector t^i = (t1^i, t2^i,..,tN^i). The coordinate ti^i can be interpreted as the usual "private value" of player i, while each other coordinate tj^i represents i's total payoff should j get the object. In this framework, we characterize incentive-compatible and individually-rational mechanisms, and look at second price auctions (which, under some conditions, maximize the seller's revenue). Any incentive combatible mechanism induces a conditional probability assignement vector field which is conservative. A useful geometric property of conservative vector fields is used for the derivation of a differential equation which determines equilibrium bids. Finally, we show that exclusion (i.e., the announcement of a reservation price such that a measure can never get the object) is not necessarilly optimal for the seller. This contrasts with Armstrong's (Econometrica, 1995) insight about the optimality of exclusion in another multidimensional setting.

274 citations

Proceedings Article
30 Jun 2000
TL;DR: This paper provides a gradient-based distributed policy-search method for cooperative games and compares the notion of local optimum to that of Nash equilibrium, and demonstrates the effectiveness of this method experimentally in a small, partially observable simulated soccer domain.
Abstract: Cooperative games are those in which both agents share the same payoff structure Value-based reinforcement-learning algorithms, such as variants of Q-learning, have been applied to learning cooperative games, but they only apply when the game state is completely observable to both agents Policy search methods are a reasonable alternative to value-based methods for partially observable environments In this paper, we provide a gradient-based distributed policy-search method for cooperative games and compare the notion of local optimum to that of Nash equilibrium We demonstrate the effectiveness of this method experimentally in a small, partially observable simulated soccer domain

273 citations


Network Information
Related Topics (5)
Markov chain
51.9K papers, 1.3M citations
81% related
Incentive
41.5K papers, 1M citations
81% related
Heuristics
32.1K papers, 956.5K citations
80% related
Linear programming
32.1K papers, 920.3K citations
79% related
Empirical research
51.3K papers, 1.9M citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023364
2022738
2021462
2020512
2019460
2018483