Obstacle Avoidance through Reinforcement Learning

Open AccessProceedings Article

Obstacle Avoidance through Reinforcement Learning

Tony J. Prescott, +1 more

- Vol. 4, pp 523-530

Chats0

TLDR

A method is described for generating plan-like, reflexive, obstacle avoidance behaviour in a mobile robot that adapts its responses to sensory stimuli so as to minimise the negative reinforcement arising from collisions.

Abstract:

A method is described for generating plan-like, reflexive, obstacle avoidance behaviour in a mobile robot. The experiments reported here use a simulated vehicle with a primitive range sensor. Avoidance behaviour is encoded as a set of continuous functions of the perceptual input space. These functions are stored using CMACs and trained by a variant of Barto and Sutton's adaptive critic algorithm. As the vehicle explores its surroundings it adapts its responses to sensory stimuli so as to minimise the negative reinforcement arising from collisions. Strategies for local navigation are therefore acquired in an explicitly goal-driven fashion. The resulting trajectories form elegant collision-free paths through the environment.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Rapid, safe, and incremental learning of navigation strategies

J.del.R. Millan

TL;DR: A reinforcement connectionist learning architecture that allows an autonomous robot to acquire efficient navigation strategies in a few trials and has high tolerance to noisy sensory data and good generalization abilities is proposed.

...read moreread less

Journal ArticleDOI

Trajectory Planning and Obstacle Avoidance for Hyper-Redundant Serial Robots

Midhun S. Menon, +2 more

- 01 Aug 2017 -

Journal of Mechanisms and Robotics

TL;DR: In this paper, the authors presented an optimization algorithm for the motion planning of a hyper-redundant robot where the motion of one end (head) is an arbitrary desired path.

...read moreread less

The sensorimotor foundations of phonology: a computational model of early childhood articulatory and phonetic development

Kevin L. Markey

TL;DR: HABLAR as discussed by the authors is a computational model of the sensorimotor foundations of early childhood phonological development, which is intended to explain key characteristics of normal phonology development including the phonetic characteristics of babble, systematic and context sensitive patterns of sound substitutions and deletions, and overgeneralization of pronunciation patterns.

...read moreread less

Journal ArticleDOI

Learning Signaling Behaviors and Specialization in Cooperative Agents

Antonio Murciano, +1 more

- 01 Jun 1996 -

Adaptive Behavior

TL;DR: A learning mechanism that allows a multiagent system to cooperate to achieve a gathering task efficiently in unknown and changing environments is presented and simulation results show that the multi agent system always achieves near-optimal performances.

...read moreread less

Proceedings ArticleDOI

Path Planning of Humanoid Arm Based on Deep Deterministic Policy Gradient

Shuhuan Wen, +4 more

TL;DR: A new obstacle avoidance algorithm, based on an existing deep reinforcement learning framework called deep deterministic policy gradient (DDPG), is proposed to use DDPG to plan the trajectory of a robot arm to realize obstacle avoidance.

...read moreread less

References

PDF

Open Access

More filters

Book

The Sciences of the Artificial

Herbert A. Simon

TL;DR: A new edition of Simon's classic work on artificial intelligence as mentioned in this paper adds a chapter that sorts out the current themes and tools for analyzing complexity and complex systems, taking into account important advances in cognitive psychology and the science of design while confirming and extending Simon's basic thesis that a physical symbol system has the necessary and sufficient means for intelligent action.

...read moreread less

Journal ArticleDOI

The Sciences of the Artificial

Alex C. Michalos, +1 more

- 01 Jan 1970 -

Technology and Culture

Learning from delayed rewards

Chris Watkins

Journal ArticleDOI

Neuronlike adaptive elements that can solve difficult learning control problems

Andrew G. Barto, +2 more

TL;DR: In this article, a system consisting of two neuron-like adaptive elements can solve a difficult learning control problem, where the task is to balance a pole that is hinged to a movable cart by applying forces to the cart base.

...read moreread less

Journal ArticleDOI

A Theory of Cerebellar Function

James S. Albus

- 01 Feb 1971 -

Bellman Prize in Mathematical Bioscience...

TL;DR: It is demonstrated that, in order for the learning process to be stable, pattern storage must be accomplished principally by weakening synaptic weights rather than by strengthening them.

...read moreread less