Top 3 papers published on the topic of reinforcement learning in 1971

Journal Article

Optimal learning systems

Vladimir Semenovich Pugachev

01 Jan 1971 - Kybernetika

3 citations

Book Chapter

A Critical Review of Learning Control Research

K. S. Fu¹

Purdue University¹

01 Jan 1971

TL;DR: Design techniques proposed for learning control systems include: (1) trainable controllers using pattern classifiers, (2) reinforcement learning algorithms, (3) Bayesian estimation, (4) stochastic approximation, and (5) stochastic automata models.

Abstract: In designing an optimal control system, if the a priori information required is unknown or incompletely known, one possible approach is to design a controller which is capable of estimating the unknown information during its operation and determining the optimal control action on the basis of the estimated information. If the estimated information gradually approaches the true information as time proceeds, then the controller designed will approach the optimal controller; and, consequently, the performance of the control system is gradually improved. Because of the gradual improvement of performance due to the improvement of the estimated unknown information, this class of control systems has been called learning control systems. Design techniques proposed for learning control systems include: (1) trainable controllers using pattern classifiers, (2) reinforcement learning algorithms, (3) Bayesian estimation, (4) stochastic approximation, and (5) stochastic automata models. A survey of these techniques can be found in [1]. A general formulation using stochastic approximation has been treated extensively in [2, 3]. Practical applications include spacecraft control systems, the control of valve actuators, power systems, and production processes. In addition, several nonlinear learning algorithms have recently been proposed.

2 citations

Journal Article

Parameter estimation and learning/classification threshold optimization applied to maxentropic adaptive pattern recognition

A.L. Girard¹

Raytheon¹

01 Oct 1971 - Pattern Recognition

TL;DR: Methods of estimation theory are used to show that reinforcement learning is implemented by sequential parameter estimation, which alters both a priori and spontaneously learned templates feature by feature.
