Search or ask a question

Showing papers on "Reinforcement learning published in 1986"

PDF

Open Access

Journal Article•DOI•

Learning Control Systems-Review and Outlook

[...]

King-Sun Fu¹•Institutions (1)

Purdue University¹

01 Mar 1986-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The basic concept of learning control is introduced, and the following five learning schemes are briefly reviewed: 1) trainable controllers using pattern classifiers, 2) reinforcement learning control systems, 3) Bayesian estimation, 4) stochastic approximation, and 5) Stochastic automata models.

...read moreread less

Abstract: The basic concept of learning control is introduced. The following five learning schemes are briefly reviewed: 1) trainable controllers using pattern classifiers, 2) reinforcement learning control systems, 3) Bayesian estimation, 4) stochastic approximation, and 5) stochastic automata models. Potential applications and problems for further research in learning control are outlined.

...read moreread less

121 citations

Book•

Learning and Problem Solving with Multilayer Connectionist Systems

[...]

Charles W. Anderson

31 Dec 1986

TL;DR: A novel algorithm is examined that combines ASPECTS of REINFORCEMENT LEARNING and a DATA-DIRECTED SEARCH for USEFUL WEIGHTS, and is shown to out perform reinFORMCEMENT-LEARNING ALGORITHMS.

...read moreread less

Abstract: THE DIFFICULTIES OF LEARNING IN MULTILAYERED NETWORKS OF COMPUTATIONAL UNITS HAS LIMITED THE USE OF CONNECTIONIST SYSTEMS IN COMPLEX DOMAINS. THIS DISSERTATION ELUCIDATES THE ISSUES OF LEARNING IN A NETWORK''S HIDDEN UNITS, AND REVIEWS METHODS FOR ADDRESSING THESE ISSUES THAT HAVE BEEN DEVELOPED THROUGH THE YEARS. ISSUES OF LEARNING IN HIDDEN UNITS ARE SHOWN TO BE ANALOGOUS TO LEARNING ISSUES FOR MULTILAYER SYSTEMS EMPLOYING SYMBOLIC REPRSENTATIONS. COMPARISONS OF A NUMBER OF ALGORITHMS FOR LEARNING IN HIDDEN UNITS ARE MADE BY APPLYING THEM IN A CONSISTENT MANNER TO SEVERAL TASKS. RECENTLY DEVELOPED ALGORITHMS, INCLUDING RUMELHART, ET AL''S, ERROR BACK-PROPOGATIONS ALGORITHM AND BARTO, ET AL''S, REINFORCEMENT-LEARNING ALGORITHMS, LEARN THE SOLUTIONS TO THE TASKS MUCH MORE SUCCESSFULLY THAN METHODS OF THE PAST. A NOVEL ALGORITHM IS EXAMINED THAT COMBINES ASPECTS OF REINFORCEMENT LEARNING AND A DATA-DIRECTED SEARCH FOR USEFUL WEIGHTS, AND IS SHOWN TO OUT PERFORM REINFORMCEMENT-LEARNING ALGORITHMS. A CONNECTIONIST FRAMEWORK FOR THE LEARNING OF STRATEGIES IS DESCRIBED WHICH COMBINES THE ERROR BACK-PROPOGATION ALGORITHM FOR LEARNING IN HIDDEN UNITS WITH SUTTON''S AHC ALGORITHM TO LEARN EVALUATION FUNCTIONS AND WITH A REINFORCEMENT-LEARNING ALGORITHM TO LEARN SEARCH HEURISTICS. THE GENERAL- ITY OF THIS HYBRID SYSTEM IS DEMONSTRATED THROUGH SUCCESSFUL APPLICATIONS

...read moreread less

115 citations

Learning and problem-solving with multilayer connectionist systems (adaptive, strategy learning, neural networks, reinforcement learning)

[...]

Charles W. Anderson

01 Jan 1986

20 citations

Book Chapter•DOI•

Cooperativity in Networks of Pattern Recognizing Stochastic Learning Automata

[...]

Andrew G. Barto¹, P. Anandan¹, Charles W. Anderson¹•Institutions (1)

University of Massachusetts Amherst¹

01 Jan 1986

TL;DR: A class of learning tasks is described that combines aspects of learning automaton tasks and supervised learning pattern-classification tasks and an algorithm is presented, called the associative reward-penalty, or A R−P, algorithm, for which a form of optimal performance has been proved.

...read moreread less

Abstract: A class of learning tasks is described that combines aspects of learning automaton tasks and supervised learning pattern-classification tasks. We call these associative reinforcement learning tasks. An algorithm is presented, called the associative reward-penalty, or A R−P , algorithm, for which a form of optimal performance has been proved. This algorithm simultaneously generalizes a class of stochastic learning automata and a class of supervised learning pattern-classification methods. Simulation results are presented that illustrate the associative reinforcement learning task and the performance of the the A R−P algorithm. Additional simulation results are presented showing how cooperative activity in networks of interconnected A R−P automata can olve difficult nonlinear associative learning problems.

...read moreread less

15 citations