
Showing papers on "Reinforcement learning" published in 1971


Journal Article

3 citations


Book Chapter
K. S. Fu
01 Jan 1971
TL;DR: Design techniques proposed for learning control systems include: (1) trainable controllers using pattern classifiers, (2) reinforcement learning algorithms, (3) Bayesian estimation, (4) stochastic approximation, and (5) stochastic automata models.
Abstract: In designing an optimal control system, if the a priori information required is unknown or incompletely known, one possible approach is to design a controller which is capable of estimating the unknown information during its operation and determining the optimal control action on the basis of the estimated information. If the estimated information gradually approaches the true information as time proceeds, then the controller designed will approach the optimal controller; and, consequently, the performance of the control system is gradually improved. Because of the gradual improvement of performance due to the improvement of the estimated unknown information, this class of control systems has been called learning control systems. Design techniques proposed for learning control systems include: (1) trainable controllers using pattern classifiers, (2) reinforcement learning algorithms, (3) Bayesian estimation, (4) stochastic approximation, and (5) stochastic automata models. A survey of these techniques can be found in [1]. A general formulation using stochastic approximation has been treated extensively in [2, 3]. Practical applications include spacecraft control systems, the control of valve actuators, power systems, and production processes. In addition, several nonlinear learning algorithms have recently been proposed.

2 citations


Journal Article
A.L. Girard
TL;DR: Methods of Estimation Theory are used to show that reinforcement learning is implemented by sequential parameter estimation which alters both a priori and spontaneously learned templates feature by feature.