scispace - formally typeset
Journal ArticleDOI

An N-player sequential stochastic game with identical payoffs

Kumpati S. Narendra, +1 more
- Vol. 13, Iss: 6, pp 1154-1158
Reads0
Chats0
TLDR
It is shown that the expected change in each player's payoff is nonnegative at every instant, so that the group improves its performance monotonically, which appears to have important implications in decentralized decision-making in large complex systems.
Abstract
A sequential stochastic game among an arbitrary number of players in which all players' payoffs are identical is analyzed. The players are unaware that they are in a game and hence they have no knowledge of other players' strategies or the payoff structure. At each instant the players use a simple learning algorithm to update their mixed strategy choices based entirely on the response of a random environment. It is shown that the expected change in each player's payoff is nonnegative at every instant, so that the group improves its performance monotonically. This result appears to have important implications in decentralized decision-making in large complex systems.

read more

Citations
More filters
Book

Reinforcement Learning: An Introduction

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Journal ArticleDOI

Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

TL;DR: This article presents a general class of associative reinforcement learning algorithms for connectionist networks containing stochastic units that are shown to make weight adjustments in a direction that lies along the gradient of expected reinforcement in both immediate-reinforcement tasks and certain limited forms of delayed-reInforcement tasks, and they do this without explicitly computing gradient estimates.
Journal ArticleDOI

Adaptive load balancing: a study in multi-agent learning

TL;DR: In this paper, the authors study the process of multi-agent reinforcement learning in the context of load balancing in a distributed system, without use of either central coordination or explicit communication.
Patent

Processing device with intuitive learning capability

TL;DR: In this paper, a method and apparatus for providing learning capability to processing device, such as a computer game, educational toy, telephone, or television remote control, is provided to achieve one or more objective.
Journal ArticleDOI

Learning automata approach to hierarchical multiobjective analysis

TL;DR: A novel approach to hierarchical multiobjective analysis using the theory of learning automata is introduced and it is shown that if suitable learning algorithms are chosen at all the levels, the overall performance of the system will improve at each stage.
Related Papers (5)