scispace - formally typeset
Search or ask a question
Topic

Stochastic game

About: Stochastic game is a research topic. Over the lifetime, 9493 publications have been published within this topic receiving 202664 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: Simulation results justify the convergence of the proposed algorithms and present new insights toward more efficient energy management in the smart grids.
Abstract: In this paper, the problem of the smart grid energy management under stochastic dynamics is investigated. In the considered model, at the demand side, it is assumed that customers can act as prosumers who own renewable energy sources and can both produce and consume energy. Due to the coupling between the prosumers’ decisions and the stochastic nature of renewable energy, the interaction among prosumers is formulated as a stochastic game, in which each prosumer seeks to maximize its payoff, in terms of revenues, by controlling its energy consumption and demand. In particular, the subjective behavior of prosumers is explicitly reflected into their payoff functions using the prospect theory, a powerful framework that allows modeling real-life human choices, rather than objective, user-agnostic decisions, as normative models do. For this prospect-based stochastic game, it is shown that there always exists a stationary Nash equilibrium where the prosumers’ trading policies in the equilibrium are independent of the time and their histories of the play. Moreover, to obtain one of such equilibrium policies, a novel distributed algorithm with no information sharing among prosumers is proposed and shown to converge to an $\epsilon$ -Nash equilibrium in which each prosumer is able to achieve its optimal payoff in an equilibrium up to a small additive error $\epsilon$ . On the other hand, at the supply side, the interaction between the utility company and the prosumers is formulated as an online optimization problem in which the utility company's goal is to learn its optimal energy allocation rules. For this case, it is shown that such an optimization problem admits a no-regret algorithm meaning that regardless of the actual outcome of the game among the prosumers, the utility company can follow a strategy that mitigates its allocation costs as if it knew the entire demand market a priori . Simulation results justify the convergence of the proposed algorithms and present new insights toward more efficient energy management in the smart grids.

62 citations

Journal ArticleDOI
TL;DR: A new method for evaluating the degree of reliance on past choices is proposed, called equal payoff series extraction (EPSE), which shows that the Equal Payoff Series can be used as a criterion for optimizing parameters resulting in better parameter generalizability.

62 citations

Journal ArticleDOI
TL;DR: In this article, a stochastic behavioral description of decision guidance is developed to increase the adaptibility to changing conditions and to reduce the deviation from the time-dependent user equilibrium, thereby enhancing the average and individual payoffs.
Abstract: The coordinated and efficient distribution of limited resources by individual decisions is a fundamental, unsolved problem. When individuals compete for road capacities, time, space, money, goods, etc, they normally make decisions based on aggregate rather than complete information, such as TV news or stock market indices. In related experiments, we have observed a volatile decision dynamics and far-from-optimal payoff distributions. We have also identified methods of information presentation that can considerably improve the overall performance of the system. In order to determine optimal strategies of decision guidance by means of user-specific recommendations, a stochastic behavioural description is developed. These strategies manage to increase the adaptibility to changing conditions and to reduce the deviation from the time-dependent user equilibrium, thereby enhancing the average and individual payoffs. Hence, our guidance strategies can increase the performance of all users by reducing overreaction and stabilizing the decision dynamics. These results are highly significant for predicting decision behaviour, for reaching optimal behavioural distributions by decision support systems and for information service providers. One of the promising fields of application is traffic optimization.

62 citations

Proceedings ArticleDOI
09 Jan 2012
TL;DR: In this article, a simple payoff-based learning rule that is completely decentralized and leads to an efficient configuration of actions in virtually any distributed system is presented for the problem of optimizing energy production in wind farms.
Abstract: This paper surveys recent results in game theory and cooperative control and highlights their implications for the problem of optimizing energy production in wind farms. One such result is a simple payoff-based learning rule that is completely decentralized and leads to an efficient configuration of actions in virtually any distributed system. We demonstrate that this learning rule can be used to provably maximize energy production in wind farms without explicitly modeling the aerodynamic interaction amongst the turbines.

62 citations

Proceedings ArticleDOI
01 Jul 2007
TL;DR: This paper proposes to model network selection mechanism in 4G mobile networks using a game theoretic approach; specifically defining a game between the access networks themselves, competing in a non-cooperative manner to maximize their payoff.
Abstract: Network selection mechanism in 4G mobile networks is a newly introduced mechanism that handles the selection of the best network to satisfy a service request. Network selection needs to take part prior to admission of the service in a particular access network participating in the 4G system. This paper proposes to model this decision using a game theoretic approach; specifically defining a game between the access networks themselves, competing in a non-cooperative manner to maximize their payoff. The outcome of the game is a decision of which subset of the set of service requests made to the 4G converged system should be admitted by each access network.

62 citations


Network Information
Related Topics (5)
Markov chain
51.9K papers, 1.3M citations
81% related
Incentive
41.5K papers, 1M citations
81% related
Heuristics
32.1K papers, 956.5K citations
80% related
Linear programming
32.1K papers, 920.3K citations
79% related
Empirical research
51.3K papers, 1.9M citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
2023364
2022738
2021462
2020512
2019460
2018483