
Showing papers on "Stochastic game published in 2008"


Book ChapterDOI
29 Mar 2008
TL;DR: Turn-based stochastic games on infinite graphs induced by game probabilistic lossy channel systems (GPLCS) are decidable, which generalizes the decidability result for PLCS-induced Markov decision processes in [10].
Abstract: We consider turn-based stochastic games on infinite graphs induced by game probabilistic lossy channel systems (GPLCS), the game version of probabilistic lossy channel systems (PLCS). We study games with Büchi (repeated reachability) objectives and almost-sure winning conditions. These games are pure memoryless determined and, under the assumption that the target set is regular, a symbolic representation of the set of winning states for each player can be effectively constructed. Thus, turn-based stochastic games on GPLCS are decidable. This generalizes the decidability result for PLCS-induced Markov decision processes in [10].

570 citations


Journal ArticleDOI
TL;DR: This paper considers the problem of spectrum sharing among a primary user and multiple secondary users as an oligopoly market competition and uses a noncooperative game to obtain the spectrum allocation for secondary users.
Abstract: "Cognitive radio" is an emerging technique to improve the utilization of radio frequency spectrum in wireless networks. In this paper, we consider the problem of spectrum sharing among a primary user and multiple secondary users. We formulate this problem as an oligopoly market competition and use a noncooperative game to obtain the spectrum allocation for secondary users. Nash equilibrium is considered as the solution of this game. We first present the formulation of a static game for the case where all secondary users have the current information of the adopted strategies and the payoff of each other. However, this assumption may not be realistic in some cognitive radio systems. Therefore, we consider the case of bounded rationality in which the secondary users gradually and iteratively adjust their strategies based on the observations on their previous strategies. The speed of adjustment of the strategies is controlled by the learning rate. The stability condition of the dynamic behavior for this spectrum sharing scheme is investigated. The numerical results reveal the dynamics of distributed dynamic adaptation of spectrum sharing strategies.
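The bounded-rationality dynamic described above can be sketched for a stylized two-user case: each secondary user repeatedly nudges its spectrum demand in the direction of its marginal payoff, scaled by a learning rate. The linear pricing model and all parameter values (P, c, lr) below are illustrative assumptions, not taken from the paper.

```python
def marginal_payoff(b_i, b_j, P=10.0, c=1.0):
    # d/db_i of the illustrative profit b_i * (P - b_i - b_j) - c * b_i
    return P - 2.0 * b_i - b_j - c

def adjust(b, lr=0.05, steps=500):
    """Iterative strategy adjustment with learning rate lr (bounded rationality)."""
    b1, b2 = b
    for _ in range(steps):
        b1 += lr * b1 * marginal_payoff(b1, b2)
        b2 += lr * b2 * marginal_payoff(b2, b1)
    return b1, b2
```

With a small learning rate the iteration settles at the Cournot-style equilibrium b1 = b2 = (P − c)/3; larger learning rates can violate the stability condition the paper investigates and cause oscillation.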

346 citations


Journal ArticleDOI
TL;DR: This paper examined the risky choices of contestants in the popular TV game show "Deal or No Deal" and related classroom experiments and found that the choices can be explained in large part by previous outcomes experienced during the game.
Abstract: We examine the risky choices of contestants in the popular TV game show “Deal or No Deal” and related classroom experiments. Contrary to the traditional view of expected utility theory, the choices can be explained in large part by previous outcomes experienced during the game. Risk aversion decreases after earlier expectations have been shattered by unfavorable outcomes or surpassed by favorable outcomes. Our results point to reference-dependent choice theories such as prospect theory, and suggest that path-dependence is relevant, even when the choice problems are simple and well-defined, and when large real monetary amounts are at stake.

331 citations


Posted Content
TL;DR: In this paper, the authors studied a general setting for the multi-armed bandit problem in which the strategies form a metric space, and the payoff function satisfies a Lipschitz condition with respect to the metric.
Abstract: In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of trials so as to maximize the total payoff of the chosen strategies. While the performance of bandit algorithms with a small finite strategy set is quite well understood, bandit problems with large strategy sets are still a topic of very active investigation, motivated by practical applications such as online auctions and web advertisement. The goal of such research is to identify broad and natural classes of strategy sets and payoff functions which enable the design of efficient solutions. In this work we study a very general setting for the multi-armed bandit problem in which the strategies form a metric space, and the payoff function satisfies a Lipschitz condition with respect to the metric. We refer to this problem as the "Lipschitz MAB problem". We present a complete solution for the multi-armed bandit problem in this setting. That is, for every metric space (L,X) we define an isometry invariant which bounds from below the performance of Lipschitz MAB algorithms for X, and we present an algorithm which comes arbitrarily close to meeting this bound. Furthermore, our technique gives even better results for benign payoff functions.
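The paper's contribution is an adaptive (zooming) algorithm; the baseline it improves upon is fixed uniform discretization — pick K evenly spaced arms and run a standard UCB1 index over them. The sketch below illustrates that baseline on [0,1]; the mean-payoff function mu and all parameters are illustrative assumptions, and reward noise is omitted so the run is deterministic.

```python
import math

def mu(x):
    # An illustrative Lipschitz mean-payoff function, peaked at x = 0.7
    return max(0.0, 1.0 - 4.0 * abs(x - 0.7))

def ucb_on_grid(K=10, T=5000):
    """UCB1 over a fixed uniform discretization of the strategy space [0,1]."""
    arms = [(k + 0.5) / K for k in range(K)]
    counts = [0] * K
    means = [0.0] * K
    for t in range(1, T + 1):
        if t <= K:
            k = t - 1                     # pull each arm once to initialize
        else:
            k = max(range(K),
                    key=lambda i: means[i] + math.sqrt(2 * math.log(t) / counts[i]))
        r = mu(arms[k])                   # noiseless reward keeps the sketch deterministic
        counts[k] += 1
        means[k] += (r - means[k]) / counts[k]
    return arms, counts
```

The pull counts concentrate on the arms nearest the peak of mu; the paper's adaptive algorithm instead refines the discretization only near promising regions, achieving the instance-dependent bound.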

329 citations


Journal ArticleDOI
TL;DR: This work investigates the fractions of links, provides analytical results of the cooperation level, and finds that the simulation results are in close agreement with analytical ones, which may be helpful in understanding the cooperative behavior induced by the aspiration level in society.
Abstract: Based on learning theory, we adopt a stochastic learning updating rule to investigate the evolution of cooperation in the Prisoner's Dilemma game on Newman-Watts small-world networks with different payoff aspiration levels. Interestingly, simulation results show that the mechanism of intermediate aspiration promoting cooperation resembles a resonancelike behavior, and there exists a ping-pong vibration of cooperation for large payoff aspiration. To explain the nontrivial dependence of the cooperation level on the aspiration level, we investigate the fractions of links, provide analytical results of the cooperation level, and find that the simulation results are in close agreement with analytical ones. Our work may be helpful in understanding the cooperative behavior induced by the aspiration level in society.
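A minimal sketch of aspiration-driven updating in the Prisoner's Dilemma: an agent that earns less than its aspiration level may stochastically switch strategy. This is a simplified well-mixed version, not the paper's Newman–Watts small-world setup; the payoff values and switch probability are illustrative assumptions.

```python
import random

# Prisoner's Dilemma payoffs with T > R > P > S (illustrative values)
R, S, T, P = 3.0, 0.0, 5.0, 1.0

def payoff(me, other):
    # True = cooperate, False = defect
    if me and other:     return R
    if me and not other: return S
    if not me and other: return T
    return P

def step(strats, aspiration, switch_prob=0.5):
    """One round: each agent plays a random opponent; agents whose payoff
    falls below the aspiration level switch strategy with probability switch_prob."""
    n = len(strats)
    new = strats[:]
    for i in range(n):
        j = random.randrange(n - 1)
        j = j if j < i else j + 1         # pick an opponent other than i
        if payoff(strats[i], strats[j]) < aspiration and random.random() < switch_prob:
            new[i] = not strats[i]
    return new
```

Two limiting cases are immediate: an aspiration below the worst payoff freezes the population, while an aspiration above the best payoff (with switch_prob = 1) flips every strategy each round — the "ping-pong vibration" regime the abstract describes.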

305 citations


Journal ArticleDOI
TL;DR: In this paper, the authors studied evolutionary games where the teaching activity of players can evolve in time, and proposed a simple mechanism that spontaneously creates relevant inhomogeneities in the teaching activities that support the maintenance of cooperation for both the prisoner's dilemma and the snowdrift game.
Abstract: Evolutionary games are studied where the teaching activity of players can evolve in time. Initially all players following either the cooperative or defecting strategy are distributed on a square lattice. The rate of strategy adoption is determined by the payoff difference and a teaching activity characterizing the donor's capability to enforce its strategy on the opponent. Each successful strategy adoption process is accompanied by an increase in the donor's teaching activity. By applying an optimum value of the increment, this simple mechanism spontaneously creates relevant inhomogeneities in the teaching activities that support the maintenance of cooperation for both the prisoner's dilemma and the snowdrift game.

284 citations


Journal ArticleDOI
TL;DR: In this paper, the authors study the transition towards effective payoffs in the prisoner's dilemma game on scale-free networks by introducing a normalization parameter guiding the system from accumulated payoffs to payoffs normalized with the connectivity of each agent.
Abstract: We study the transition towards effective payoffs in the prisoner’s dilemma game on scale-free networks by introducing a normalization parameter guiding the system from accumulated payoffs to payoffs normalized with the connectivity of each agent. We show that during this transition the heterogeneity-based ability of scale-free networks to facilitate cooperative behavior deteriorates continuously, eventually collapsing with the results obtained on regular graphs. The strategy donations and adaptation probabilities of agents with different connectivities are studied. Results reveal that strategies generally spread from agents with larger towards agents with smaller degree. However, this strategy adoption flow reverses sharply in the fully normalized payoff limit. Surprisingly, cooperators occupy the hubs even if the averaged cooperation level due to partly normalized payoffs is moderate.

279 citations


Journal ArticleDOI
TL;DR: By applying an optimum value of the increment, this simple mechanism spontaneously creates relevant inhomogeneities in the teaching activities that support the maintenance of cooperation for both the prisoner's dilemma and the snowdrift game.
Abstract: Evolutionary games are studied where the teaching activity of players can evolve in time. Initially all players following either the cooperative or defecting strategy are distributed on a square lattice. The rate of strategy adoption is determined by the payoff difference and a teaching activity characterizing the donor's capability to enforce its strategy on the opponent. Each successful strategy adoption process is accompanied by an increase in the donor's teaching activity. By applying an optimum value of the increment, this simple mechanism spontaneously creates relevant inhomogeneities in the teaching activities that support the maintenance of cooperation for both the prisoner's dilemma and the snowdrift game.

276 citations


Journal ArticleDOI
TL;DR: Peng's BSDE method is extended from the framework of stochastic control theory to that of stochastic differential games, yielding a straightforward proof of a dynamic programming principle for both the upper and the lower value functions of the game.
Abstract: In this paper we study zero-sum two-player stochastic differential games with the help of the theory of backward stochastic differential equations (BSDEs). More precisely, we generalize the results of the pioneering work of Fleming and Souganidis [Indiana Univ. Math. J., 38 (1989), pp. 293-314] by considering cost functionals defined by controlled BSDEs and by allowing the admissible control processes to depend on events occurring before the beginning of the game. This extension of the class of admissible control processes has the consequence that the cost functionals become random variables. However, by making use of a Girsanov transformation argument, which is new in this context, we prove that the upper and the lower value functions of the game remain deterministic. Apart from the fact that this extension of the class of admissible control processes is quite natural and reflects the behavior of the players who always use the maximum of available information, its combination with BSDE methods, in particular that of the notion of stochastic “backward semigroups" introduced by Peng [BSDE and stochastic optimizations, in Topics in Stochastic Analysis, Science Press, Beijing, 1997], allows us then to prove a dynamic programming principle for both the upper and the lower value functions of the game in a straightforward way. The upper and the lower value functions are then shown to be the unique viscosity solutions of the upper and the lower Hamilton-Jacobi-Bellman-Isaacs equations, respectively. For this Peng's BSDE method is extended from the framework of stochastic control theory into that of stochastic differential games.

268 citations


Journal ArticleDOI
TL;DR: This article found that salient labels yield frequent coordination in symmetric games, but when the payoff is asymmetric, labels lose much of their effectiveness and miscoordination abounds, which raises questions about the extent to which the effectiveness of focal points based on label salience persists beyond the special case of symmetric game.
Abstract: Since Schelling, it has often been assumed that players make use of salient decision labels to achieve coordination. Consistent with previous work, we find that given equal payoffs, salient labels yield frequent coordination. However, given even minutely asymmetric payoffs, labels lose much of their effectiveness and miscoordination abounds. This raises questions about the extent to which the effectiveness of focal points based on label salience persists beyond the special case of symmetric games. The patterns of miscoordination we observe vary with the magnitude of payoff differences in intricate ways that suggest nonequilibrium accounts based on "level-k" thinking and "team reasoning." (JEL C12, C92)

259 citations


Journal ArticleDOI
TL;DR: For general bounded domains Ω and resolutive functions F, it is shown that the game ends when the game position reaches some y∈∂Ω with player I's payoff F(y), and that for sufficiently regular Ω the functions uε converge uniformly to the unique p-harmonic extension of F.
Abstract: Fix a bounded domain Ω ⊂ R^d, a continuous function F:∂Ω→R, and constants ε>0 and 1

Journal ArticleDOI
TL;DR: The concepts of asymptotic Nash-equilibrium in probability and almost surely, respectively, are introduced and the relationship between these concepts is illuminated, providing necessary tools for analyzing the optimality of the decentralized control laws.
Abstract: The interaction of interest-coupled decision-makers and the uncertainty of individual behavior are prominent characteristics of multiagent systems (MAS). How to break through the framework of conventional control theory, which aims at single decision-maker and single decision objective, and to extend the methodology and tools in the stochastic adaptive control theory to analyze MAS are of great significance. In this paper, a preliminary exploration is made in this direction, and the decentralized control problem is considered for large population stochastic MAS with coupled cost functions. Different from the deterministic discounted costs in the existing differential game models, a time-averaged stochastic cost function is adopted for each agent. The decentralized control law is constructed based on the state aggregation method and tracking-like quadratic optimal control. By using probability limit theory, the stability and optimality of the closed-loop system are analyzed. The main contributions of this paper include the following points. 1) The concepts of asymptotic Nash-equilibrium in probability and almost surely, respectively, are introduced and the relationship between these concepts is illuminated, which provide necessary tools for analyzing the optimality of the decentralized control laws. 2) The closed-loop system is shown to be almost surely uniformly stable, and bounded independently of the number of agents N . 3) The population state average (PSA) is shown to converge to the infinite population mean (IPM) trajectory in the sense of both L2-norm and time average almost surely, as N increases to infinity. 4) The decentralized control law is designed and shown to be almost surely asymptotically optimal; the cost of each agent based on local measurements converges to that based on global measurements almost surely, as N increases to infinity.

Journal ArticleDOI
TL;DR: It is shown that the sequential game yields the maximum payoff to the firm but requires the firm to move before the hacker; even in a simultaneous game, the firm's payoff exceeds that of the decision theory approach, except when the firm's estimate of hacker effort is sufficiently close to the actual effort.
Abstract: Firms have been increasing their information technology (IT) security budgets significantly to deal with increased security threats. An examination of current practices reveals that managers view security investment as any other and use traditional decision-theoretic risk management techniques to determine security investments. We argue in this paper that this method is incomplete because of the problem's strategic nature-hackers alter their hacking strategies in response to a firm's investment strategies. We propose game theory for determining IT security investment levels and compare game theory and decision theory approaches on several dimensions such as the investment levels, vulnerability, and payoff from investments. We show that the sequential game results in the maximum payoff to the firm, but requires that the firm move first before the hacker. Even if a simultaneous game is played, the firm enjoys a higher payoff than that in the decision theory approach, except when the firm's estimate of the hacker effort in the decision theory approach is sufficiently close to the actual hacker effort. We also show that if the firm learns from prior observations of hacker effort and uses these to estimate future hacker effort in the decision theory approach, then the gap between the results of decision theory and game theory approaches diminishes over time. The rate of convergence and the extent of loss the firm suffers before convergence depend on the learning model employed by the firm to estimate hacker effort.

Journal ArticleDOI
TL;DR: Both the star architecture and payoff inequality are preserved in an extension of the model where agents can make transfers and bargain over the formation of links, under the condition that the surplus of connections increases in the size of agents' neighborhoods.

Journal ArticleDOI
TL;DR: It is shown that analytical results can be obtained for any intensity of selection, if fitness is defined as an exponential function of payoff, and this approach also works for group selection.

Proceedings ArticleDOI
08 Jul 2008
TL;DR: It is shown that for asymmetric congestion games with linear and polynomial delay functions, the convergence time of α-Nash dynamics to an approximate optimal solution is polynomial in the number of players, with an approximation ratio that is arbitrarily close to the price of anarchy of the game.
Abstract: We study the speed of convergence of decentralized dynamics to approximately optimal solutions in potential games. We consider α-Nash dynamics in which a player makes a move if the improvement in his payoff is more than an α factor of his own payoff. Despite the known polynomial convergence of α-Nash dynamics to approximate Nash equilibria in symmetric congestion games [7], it has been shown that the convergence time to approximate Nash equilibria in asymmetric congestion games is exponential [25]. In contrast to this negative result, and as the main result of this paper, we show that for asymmetric congestion games with linear and polynomial delay functions, the convergence time of α-Nash dynamics to an approximate optimal solution is polynomial in the number of players, with approximation ratio that is arbitrarily close to the price of anarchy of the game. In particular, we show this polynomial convergence under the minimal liveness assumption that each player gets at least one chance to move in every T steps. We also prove that the same polynomial convergence result does not hold for (exact) best-response dynamics, showing the α-Nash dynamics is required. We extend these results for congestion games to other potential games including weighted congestion games with linear delay functions, cut games (also called party affiliation games) and market sharing games.
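The α-move rule — a player deviates only when the improvement exceeds an α fraction of its current cost — can be sketched in a toy singleton congestion game with linear delays. This illustrates only the move rule, not the paper's convergence analysis; the round-robin schedule and all parameters are assumptions.

```python
def alpha_nash(initial_choice, n_resources, alpha=0.1, max_rounds=1000):
    """Round-robin alpha-Nash dynamics in a singleton congestion game
    where each resource's delay equals its load (illustrative setup)."""
    choice = initial_choice[:]
    for _ in range(max_rounds):
        moved = False
        for i in range(len(choice)):
            load = [choice.count(r) for r in range(n_resources)]
            cur = load[choice[i]]                 # my delay = load on my resource
            best = min(range(n_resources),
                       key=lambda r: load[r] + (0 if r == choice[i] else 1))
            new = load[best] + (0 if best == choice[i] else 1)
            if new < (1 - alpha) * cur:           # move only on a significant improvement
                choice[i] = best
                moved = True
        if not moved:
            return choice                         # no eligible alpha-moves remain
    return choice
```

Starting from all players congesting one resource, the dynamics spread the load until no player can cut its delay by more than the α threshold.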

Journal ArticleDOI
TL;DR: The average tree solution, a new single-valued solution concept, is introduced; it is characterized by component efficiency and component fairness and can be generated by a specific distribution of the Harsanyi dividends.

Journal ArticleDOI
TL;DR: Maintenance of cooperation was studied for a two-strategy evolutionary prisoner's dilemma game where the players are located on a one-dimensional chain and their payoff comes from games with the nearest- and next-nearest-neighbor interactions.
Abstract: Maintenance of cooperation was studied for a two-strategy evolutionary prisoner's dilemma game where the players are located on a one-dimensional chain and their payoff comes from games with the nearest- and next-nearest-neighbor interactions. The applied host geometry makes it possible to study the impacts of two conflicting topological features. The evolutionary rule involves some noise affecting the strategy adoptions between the interacting players. Using Monte Carlo simulations and the extended versions of dynamical mean-field theory we determined the phase diagram as a function of noise level and a payoff parameter. The peculiar feature of the diagram is changed significantly when the connectivity structure is extended by extra links as suggested by Newman and Watts.

Posted Content
TL;DR: In this paper, the authors consider situations in which individuals want to choose an action close to others' actions as well as close to a payoff relevant state of nature with the ideal proximity to the common state varying across the agents.
Abstract: In this paper, we consider situations in which individuals want to choose an action close to others' actions as well as close to a payoff relevant state of nature with the ideal proximity to the common state varying across the agents. Before this coordination game with heterogeneous preferences is played, a cheap talk communication stage is offered to players who decide to whom they reveal the private information they hold about the state. The strategic information transmission taking place in the communication stage is characterized by a strategic communication network. We provide a direct link between players' preferences and the strategic communication network emerging at equilibrium, depending on the strength of the coordination motive and the prior information structure. Equilibrium strategic communication networks are characterized in a very tractable way and compared in terms of efficiency. In general, a maximal strategic communication network may not exist and communication networks cannot be ordered in the sense of Pareto. However, expected social welfare always increases when the communication network expands. Strategic information transmission can be improved when group or public communication is allowed, and/or when information is certifiable.

Journal ArticleDOI
TL;DR: Marriage networks are the most frequent and stable network structures in the experiments; payoff efficiency is around 90 percent of the ex ante payoff-dominant strategies, and the distribution of network structures differs significantly from that which would result from random play.

Journal ArticleDOI
19 Apr 2008-Top
TL;DR: The main choice criterion is to look at quite diversified fields, to appreciate how wide a terrain has been explored and colonized using this and related tools.
Abstract: A few applications of the Shapley value are described. The main choice criterion is to look at quite diversified fields, to appreciate how wide a terrain has been explored and colonized using this and related tools.

Journal ArticleDOI
TL;DR: An efficient algorithm is provided that computes 0.3393-approximate Nash equilibria, the best approximation to date, based on the formulation of an appropriate function of pairs of mixed strategies reflecting the maximum deviation of the players' payoffs from the best payoff each player could achieve given the strategy chosen by the other.
Abstract: In this paper we propose a new methodology for determining approximate Nash equilibria of noncooperative bimatrix games, and based on that, we provide an efficient algorithm that computes 0.3393-approximate equilibria, the best approximation to date. The methodology is based on the formulation of an appropriate function of pairs of mixed strategies reflecting the maximum deviation of the players' payoffs from the best payoff each player could achieve given the strategy chosen by the other. We then seek to minimize such a function using descent procedures. Because it is unlikely to be able to find global minima in polynomial time, given the recently proven intractability of the problem, we concentrate on the computation of stationary points and prove that they can be approximated arbitrarily closely in polynomial time and that they have the above-mentioned approximation property. Our result provides the best ε to date for polynomially computable ε-approximate Nash equilibria of bimatrix games. Furthermore,...
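The function being minimized can be written down directly for a bimatrix game (A, B): it is the larger of the two players' incentives to deviate from a mixed-strategy pair (x, y), and (x, y) is an ε-approximate equilibrium exactly when this value is at most ε. A plain-Python sketch (list-of-lists matrices, illustrative):

```python
def regret(A, B, x, y):
    """Maximum incentive to deviate for either player; (x, y) is an
    eps-approximate Nash equilibrium iff regret(A, B, x, y) <= eps."""
    def dot(M, v):                        # M @ v for a list-of-lists matrix
        return [sum(m * w for m, w in zip(row, v)) for row in M]
    Ay = dot(A, y)                        # row player's payoff per pure row
    Bx = dot(list(zip(*B)), x)            # column player's payoff per pure column
    ux = sum(xi * a for xi, a in zip(x, Ay))   # row player's expected payoff
    uy = sum(yj * b for yj, b in zip(y, Bx))   # column player's expected payoff
    return max(max(Ay) - ux, max(Bx) - uy)
```

For matching pennies, the uniform pair has zero regret (it is the exact equilibrium), while a pure-strategy pair has regret 2, the deviating player's full gain.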

Journal ArticleDOI
TL;DR: The authors extend EWA to games in which only the set of possible foregone payoffs from unchosen strategies is known, estimate parameters separately for each player to study heterogeneity, and find that players cluster into two separate subgroups.
Abstract: We extend experience-weighted attraction (EWA) learning to games in which only the set of possible foregone payoffs from unchosen strategies is known, and estimate parameters separately for each player to study heterogeneity. We assume players estimate unknown foregone payoffs from a strategy by substituting the last payoff actually received from that strategy, by clairvoyantly guessing the actual foregone payoff, or by averaging the set of possible foregone payoffs conditional on the actual outcomes. All three assumptions improve the predictive accuracy of EWA. Individual parameter estimates suggest that players cluster into two separate subgroups (which differ from traditional reinforcement and belief learning).

Journal ArticleDOI
TL;DR: In this paper, a general framework for a large class of multi-period principal-agent problems is proposed, where a principal has a primary stake in the performance of a system but delegates its control to an agent.
Abstract: This paper proposes a general framework for a large class of multiperiod principal-agent problems. In this framework, a principal has a primary stake in the performance of a system but delegates its control to an agent. The underlying system is a Markov decision process, where the state of the system can only be observed by the agent but the agent's action is observed by both parties. This paper develops a dynamic programming algorithm to derive optimal long-term contracts for the principal. The principal indirectly controls the underlying system by offering the agent a menu of continuation utility vectors along public information paths; the agent's best response, expressed in his choice of continuation utilities, induces truthful state revelation and results in actions that maximize the principal's expected payoff. This problem is meaningful to the operations research community because it can be framed as the problem of optimally designing the reward structure of a Markov decision process with hidden states and has many applications of interest as discussed in this paper.

Journal ArticleDOI
TL;DR: Although the follower in a Stackelberg game is allowed to observe the leader’s strategy before choosing its own strategy, there is often an advantage for the leader over the case where both players must choose their moves simultaneously.
Abstract: Many multiagent settings are appropriately modeled as Stackelberg games [Fudenberg and Tirole 1991; Paruchuri et al. 2007], where a leader commits to a strategy first, and then a follower selfishly optimizes its own reward, considering the strategy chosen by the leader. Stackelberg games are commonly used to model attacker-defender scenarios in security domains [Brown et al. 2006] as well as in patrolling [Paruchuri et al. 2007; Paruchuri et al. 2008]. For example, security personnel patrolling an infrastructure commit to a patrolling strategy first, before their adversaries act taking this committed strategy into account. Indeed, Stackelberg games are being used at the Los Angeles International Airport to schedule security checkpoints and canine patrols [Murr 2007; Paruchuri et al. 2008; Pita et al. 2008a]. They could potentially be used in network routing, pricing in transportation systems and many other situations [Korilis et al. 1997; Cardinal et al. 2005]. Although the follower in a Stackelberg game is allowed to observe the leader’s strategy before choosing its own strategy, there is often an advantage for the leader over the case where both players must choose their moves simultaneously. To see the advantage of being the leader in a Stackelberg game, consider the game with the payoff as shown in Table I. The leader is the row player and the follower is the column player. The only pure-strategy Nash equilibrium for this game is when the leader plays a and the follower plays c which gives the leader a payoff of 2. However, if the leader commits to a mixed strategy of playing a and b with equal (0.5) probability, then the follower will play d, leading to an expected payoff for the leader of 3.5.
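Table I is not reproduced in this listing. A payoff assignment commonly used for this example in the commitment literature — an assumption here, chosen to reproduce the numbers quoted above (pure-equilibrium payoff 2, commitment payoff 3.5) — is leader payoffs (a,c)=2, (a,d)=4, (b,c)=1, (b,d)=3 and follower payoffs (a,c)=1, (a,d)=0, (b,c)=0, (b,d)=2. The sketch computes the leader's value of committing:

```python
# Assumed payoff values reproducing the numbers quoted in the text
# (Table I itself is not shown in this listing).
L = {('a', 'c'): 2, ('a', 'd'): 4, ('b', 'c'): 1, ('b', 'd'): 3}  # leader
F = {('a', 'c'): 1, ('a', 'd'): 0, ('b', 'c'): 0, ('b', 'd'): 2}  # follower

def leader_value(p_a):
    """Leader's expected payoff from committing to play 'a' with probability p_a,
    given that the follower best-responds to the observed commitment."""
    follower_payoff = {col: p_a * F[('a', col)] + (1 - p_a) * F[('b', col)]
                       for col in ('c', 'd')}
    br = max(follower_payoff, key=follower_payoff.get)   # follower's best response
    return p_a * L[('a', br)] + (1 - p_a) * L[('b', br)]
```

Under these assumed payoffs, leader_value(1.0) recovers the pure-equilibrium payoff 2 (the follower answers with c), while leader_value(0.5) gives 3.5 (the follower switches to d), matching the advantage of commitment described in the text.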

Journal ArticleDOI
TL;DR: In this paper, the authors consider a supply chain that consists of n retailers, each facing a newsvendor problem, and m warehouses, and show that the set of payoff vectors resulting from strong Nash equilibria corresponds to the core of the associated cooperative game.
Abstract: This study considers a supply chain that consists of n retailers, each facing a newsvendor problem, and m warehouses. The retailers are supplied with a single product via some warehouses. In these warehouses, the ordered amounts of goods of these retailers become available after some lead time. At the time that the goods arrive at the warehouses, demand realizations are known by the retailers. The retailers can increase their expected joint profits if they can coordinate their orders and make allocations after demand realization. For this setting, we consider an associated cooperative game between the retailers. We show that this associated cooperative game has a nonempty core. Finally, we introduce a noncooperative game, where the retailers decide on their order quantities individually, and show that the set of payoff vectors resulting from strong Nash equilibria corresponds to the core of the associated cooperative game.

Journal ArticleDOI
TL;DR: This work indicates that individuals with more neighbors tend to preserve their initial strategies, which strongly affects the strategy updating of individuals with fewer neighbors, while the fact that individuals with few neighbors must become cooperators to avoid receiving the lowest payoff plays a significant role in maintaining and spreading the cooperation strategy.
Abstract: We present a global payoff-based strategy updating model for studying cooperative behavior of a networked population. We adopt the Prisoner's Dilemma game and the snowdrift game as paradigms for characterizing the interactions among individuals. We investigate the model on regular, small-world, and scale-free networks, and find multistable cooperation states depending on the initial cooperator density. In particular for the snowdrift game on small-world and scale-free networks, there exist a discontinuous phase transition and hysteresis loops of cooperator density. We explain the observed properties by theoretical predictions and simulation results of the average number of neighbors of cooperators and defectors, respectively. Our work indicates that individuals with more neighbors have a trend to preserve their initial strategies, which has strong impacts on the strategy updating of individuals with fewer neighbors; while the fact that individuals with few neighbors have to become cooperators to avoid gaining the lowest payoff plays significant roles in maintaining and spreading of cooperation strategy.

Proceedings ArticleDOI
12 May 2008
TL;DR: Two extensions to the social learning model are studied that significantly enhance its applicability: the effects of heterogeneous populations, where different agents may be using different learning algorithms, and norm emergence when agent interactions are physically constrained.
Abstract: Effective norms, emerging from sustained individual interactions over time, can complement societal rules and significantly enhance performance of individual agents and agent societies. Researchers have used a model that supports the emergence of social norms via learning from interaction experiences, where each interaction is viewed as a stage game. In this social learning model, which is distinct from an agent learning from repeated interactions against the same player, an agent learns a policy to play the game from repeated interactions with multiple learning agents. The key research question is to characterize when and how the entire population of homogeneous learners converges to a consistent norm when multiple action combinations yield the same optimal payoff. In this paper we study two extensions to the social learning model that significantly enhance its applicability. We first explore the effects of heterogeneous populations, where different agents may be using different learning algorithms. We also investigate norm emergence when agent interactions are physically constrained. We consider agents located on a grid, where an agent is more likely to interact with other agents situated closer to it than those that are situated afar. The key new results include the surprising acceleration in learning with limited interaction ranges. We also study the effects of pure-strategy players, i.e., nonlearners, in the environment.

Journal ArticleDOI
TL;DR: A conservative social dynamics model is developed within a discrete kinetic framework for active particles, which was proposed by Bertotti and Delitala [Math. Mod. Meth. Appl. Sci. 14 (2004) 1061–1084].
Abstract: A conservative social dynamics model is developed within a discrete kinetic framework for active particles, which has been proposed in [M.L. Bertotti, L. Delitala, From discrete kinetic and stochastic game theory to modelling complex systems in applied sciences, Math. Mod. Meth. Appl. Sci. 14 (2004) 1061–1084]. The model concerns a society in which individuals, distinguished by a scalar variable (the activity) which expresses their social state, undergo competitive and/or cooperative interactions. The evolution of the discrete probability distribution over the social state is described by a system of nonlinear ordinary differential equations. The asymptotic trend of their solutions is investigated both analytically and computationally. Existence, stability and attractivity of certain equilibria are proved.

Journal ArticleDOI
TL;DR: This work proposes a game-theoretic model for the insider problem, which is built on a stochastic game, a game played in a non-deterministic state machine that can describe most computing systems.