Neuronlike adaptive elements that can solve difficult learning control problems

doi:10.1109/TSMC.1983.6313077

Journal Article

Vol. 13, Iss. 5, pp. 834-846

TL;DR: This article shows how a system of two neuronlike adaptive elements can solve a difficult learning control problem: balancing a pole that is hinged to a movable cart by applying forces to the cart's base.

Abstract:

It is shown how a system consisting of two neuronlike adaptive elements can solve a difficult learning control problem. The task is to balance a pole that is hinged to a movable cart by applying forces to the cart's base. It is argued that the learning problems faced by adaptive elements that are components of adaptive networks are at least as difficult as this version of the pole-balancing problem. The learning system consists of a single associative search element (ASE) and a single adaptive critic element (ACE). In the course of learning to balance the pole, the ASE constructs associations between input and output by searching under the influence of reinforcement feedback, and the ACE constructs a more informative evaluation function than reinforcement feedback alone can provide. The differences between this approach and other attempts to solve problems using neuronlike elements are discussed, as is the relation of this work to classical and instrumental conditioning in animal learning studies and its possible implications for research in the neurosciences.
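The division of labor between the two elements can be illustrated with a minimal sketch. The abstract gives no equations, so everything below is an assumption: the update rules follow the form in which this ASE/ACE scheme is commonly presented (trace-based weight updates driven by an internal reinforcement signal), the state is assumed to be encoded as one active "box" out of `n_boxes`, and the class name `AseAceSketch`, its method names, and all parameter defaults are hypothetical.

```python
import random


class AseAceSketch:
    """Illustrative ASE (actor) plus ACE (critic) over one-hot state 'boxes'."""

    def __init__(self, n_boxes, alpha=1000.0, beta=0.5, delta=0.9,
                 gamma=0.95, lam=0.8, noise_sd=0.01, seed=0):
        # All parameter names and default values are assumptions for this sketch.
        self.w = [0.0] * n_boxes     # ASE weights: sign selects the push direction
        self.v = [0.0] * n_boxes     # ACE weights: predicted reinforcement per box
        self.e = [0.0] * n_boxes     # ASE eligibility traces
        self.xbar = [0.0] * n_boxes  # ACE input traces
        self.alpha, self.beta = alpha, beta
        self.delta, self.gamma, self.lam = delta, gamma, lam
        self.noise_sd = noise_sd
        self.rng = random.Random(seed)

    def reset_traces(self):
        """Clear both trace vectors at the start of each balancing attempt."""
        self.e = [0.0] * len(self.e)
        self.xbar = [0.0] * len(self.xbar)

    def act(self, box):
        """ASE output: +1 (push right) or -1 (push left) via a noisy threshold."""
        noise = self.rng.gauss(0.0, self.noise_sd)
        return 1 if self.w[box] + noise >= 0.0 else -1

    def learn(self, box, action, reward, next_box, failed):
        """Update both elements after one step; return the internal reinforcement."""
        # Decay all traces, then mark the box that was just visited.
        for i in range(len(self.w)):
            self.e[i] *= self.delta
            self.xbar[i] *= self.lam
        self.e[box] += (1.0 - self.delta) * action
        self.xbar[box] += 1.0 - self.lam

        # ACE: the prediction after failure is taken to be zero (assumption).
        p_next = 0.0 if failed else self.v[next_box]
        r_hat = reward + self.gamma * p_next - self.v[box]

        # Both elements learn from r_hat rather than from the raw reward.
        for i in range(len(self.w)):
            self.w[i] += self.alpha * r_hat * self.e[i]
            self.v[i] += self.beta * r_hat * self.xbar[i]
        return r_hat


# One failing step: pushing right from box 0 drops the pole (reward -1).
agent = AseAceSketch(n_boxes=2)
r_hat = agent.learn(box=0, action=1, reward=-1.0, next_box=1, failed=True)
```

After this single failure, the weight for box 0 is pushed strongly negative, so the sketch would prefer pushing left in that box, and the critic's prediction for box 0 drops below zero. On later trials that lowered prediction lets the critic signal trouble before the pole actually falls, which is the sense in which it provides a more informative evaluation than the raw failure signal.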

Citations

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

Journal Article

Deep learning in neural networks

Jürgen Schmidhuber

- 01 Jan 2015 -

Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, and reviews deep supervised learning, unsupervised learning, reinforcement learning and evolutionary computation, and indirect search for short programs encoding deep and large networks.

Journal Article

A Neural Substrate of Prediction and Reward

Wolfram Schultz, +2 more

- 14 Mar 1997 -

Science

TL;DR: Findings in this work indicate that dopaminergic neurons in the primate whose fluctuating output apparently signals changes or errors in the predictions of future salient and rewarding events can be understood through quantitative theories of adaptive optimizing control.

Journal Article

Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

Ronald J. Williams

- 01 May 1992 -

Machine Learning

TL;DR: This article presents a general class of associative reinforcement learning algorithms for connectionist networks containing stochastic units. These algorithms are shown to make weight adjustments in a direction that lies along the gradient of expected reinforcement, both in immediate-reinforcement tasks and in certain limited forms of delayed-reinforcement tasks, and they do this without explicitly computing gradient estimates.

Journal Article

Reinforcement learning: a survey

Leslie Pack Kaelbling, +2 more

- 01 Jan 1996 -

Journal of Artificial Intelligence Resea...

TL;DR: Central issues of reinforcement learning are discussed, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state.

References

Journal Article

Learning Automata - A Survey

Kumpati S. Narendra, +1 more

TL;DR: Attention has been focused on the norms of behavior of learning automata, issues in the design of updating schemes, convergence of the action probabilities, and interaction of several automata.

Neocognitron--A New Algorithm for Pattern Recognition Tolerant of Deformations and Shifts in Position

Kunihiko Fukushima, +1 more

TL;DR: The neocognitron recognizes stimulus patterns correctly without being affected by shifts in position or even by considerable distortions in shape of the stimulus patterns.

Book

Functionally accurate, cooperative distributed systems

Victor Lesser, +1 more

TL;DR: This work proposes a new approach to structuring distributed processing systems, called functionally accurate, cooperative (FA/C). It is especially suited to applications in which the data necessary to achieve a solution cannot be partitioned in such a way that a node can complete a task without seeing the intermediate state of task processing at other nodes.

Journal Article

Punish/Reward: Learning with a Critic in Adaptive Threshold Systems

Bernard Widrow, +2 more

TL;DR: An adaptive threshold element is able to "learn" a strategy of play for the game blackjack (twenty-one) with a performance close to that of the Thorp optimal strategy although the adaptive system has no prior knowledge of the game and of the objective of play.

Journal Article

Simulation of self-organizing systems by digital computer

B. Farley, +1 more

TL;DR: A general discussion of ideas and definitions relating to self-organizing systems and their synthesis is given, together with remarks concerning their simulation by digital computer.

Related Papers (5)

Reinforcement Learning: An Introduction

Dynamic Programming

Reinforcement learning: a survey

Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

Learning internal representations by error propagation