Closing the Sensory-Motor Loop on Dopamine Signalled Reinforcement Learning

doi:10.1007/978-3-540-69134-1_28

Book ChapterDOI

Closing the Sensory-Motor Loop on Dopamine Signalled Reinforcement Learning

- pp 280-290

TLDR

It is shown that effective reinforcement learning is indeed possible, but only when stimuli are gated so as to occur as near-synchronous patterns of neural activity and when neuroanatomical constraints are imposed which predispose agents to exploratative behaviours.

Abstract:

It has been shown recently that dopamine signalled modulation of spike timing-dependent synaptic plasticity (DA-STDP) can enable reinforcement learning of delayed stimulus-reward associations when both stimulus and reward are delivered at precisely timed intervals Here, we test whether a similar model can support learning in an embodied context, in which timing of both sensory input and delivery of reward depend on the agent's behaviour We show that effective reinforcement learning is indeed possible, but only when stimuli are gated so as to occur as near-synchronous patterns of neural activity and when neuroanatomical constraints are imposed which predispose agents to exploratative behaviours Extinction of learned responses in this model is subsequently shown to result from agent-environment interactions and not directly from any specific neural mechanism

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Closed Loop Interactions between Spiking Neural Network and Robotic Simulators Based on MUSIC and ROS.

Philipp Weidel, +3 more

- 03 Aug 2016 -

Frontiers in Neuroinformatics

TL;DR: In this article, a middleware solution that bridges the Robotic Operating System (ROS) to the Multi-Simulator Coordinator (MUSIC) enables any robotic and neural simulators that implement corresponding interfaces to be efficiently coupled, allowing real-time performance for a wide range of configurations.

...read moreread less

Evolving Action Selection and Selective Attention Without Actions, Attention, or Selection.

Rolf Pfeifer, +3 more

TL;DR: A minimal animat architecture, consisting only of a set of autonomous, direct, and continuously active sensorimotor links, is shown to support a full range of ‘action selection’ phenomena.

...read moreread less

Dissertation

Evolutionary robotics in high altitude wind energy applications

Allister David John Furey

TL;DR: A multibody kite simulation that is used in an evolutionary process in which the kite is subject to deformation is introduced and the difficulty of the task must be increased during the evolutionary process to deal with this extreme variability in small increments.

...read moreread less

Journal ArticleDOI

Experimental Study of Reinforcement Learning in Mobile Robots Through Spiking Architecture of Thalamo-Cortico-Thalamic Circuitry of Mammalian Brain

Vahid Azimirad, +1 more

- 01 Sep 2020 -

Robotica

TL;DR: Experimental studies prove that through the proposed method, thalamo-cortical structure could be trained successfully to learn to perform various robotic tasks.

...read moreread less

Posted Content

Reinforcement Learning in a Neurally Controlled Robot Using Dopamine Modulated STDP.

Richard Evans

- 21 Feb 2015 -

arXiv: Neural and Evolutionary Computing

TL;DR: This work provides insights into the reasons behind some observed biological phenomena, such as the bursting behaviour observed in dopaminergic neurons, as well as demonstrating how spiking neural network controlled robots are able to solve a range of reinforcement learning tasks.

...read moreread less

References

PDF

Open Access

More filters

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

Book

Introduction to Reinforcement Learning

Richard S. Sutton, +1 more

TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.

...read moreread less

Journal ArticleDOI

Synaptic Modifications in Cultured Hippocampal Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsynaptic Cell Type

Guo-Qiang Bi, +1 more

- 15 Dec 1998 -

The Journal of Neuroscience

TL;DR: The results underscore the importance of precise spike timing, synaptic strength, and postsynaptic cell type in the activity-induced modification of central synapses and suggest that Hebb’s rule may need to incorporate a quantitative consideration of spike timing that reflects the narrow and asymmetric window for the induction of synaptic modification.

...read moreread less

Journal ArticleDOI

Simple model of spiking neurons

Eugene M. Izhikevich

- 01 Nov 2003 -

IEEE Transactions on Neural Networks

TL;DR: A model is presented that reproduces spiking and bursting behavior of known types of cortical neurons and combines the biologically plausibility of Hodgkin-Huxley-type dynamics and the computational efficiency of integrate-and-fire neurons.

...read moreread less