scispace - formally typeset
Book ChapterDOI

Closing the Sensory-Motor Loop on Dopamine Signalled Reinforcement Learning

Paul Chorley, +1 more
- pp 280-290
TLDR
It is shown that effective reinforcement learning is indeed possible, but only when stimuli are gated so as to occur as near-synchronous patterns of neural activity and when neuroanatomical constraints are imposed which predispose agents to exploratative behaviours.
Abstract
It has been shown recently that dopamine signalled modulation of spike timing-dependent synaptic plasticity (DA-STDP) can enable reinforcement learning of delayed stimulus-reward associations when both stimulus and reward are delivered at precisely timed intervals Here, we test whether a similar model can support learning in an embodied context, in which timing of both sensory input and delivery of reward depend on the agent's behaviour We show that effective reinforcement learning is indeed possible, but only when stimuli are gated so as to occur as near-synchronous patterns of neural activity and when neuroanatomical constraints are imposed which predispose agents to exploratative behaviours Extinction of learned responses in this model is subsequently shown to result from agent-environment interactions and not directly from any specific neural mechanism

read more

Citations
More filters
Journal ArticleDOI

Closed Loop Interactions between Spiking Neural Network and Robotic Simulators Based on MUSIC and ROS.

TL;DR: In this article, a middleware solution that bridges the Robotic Operating System (ROS) to the Multi-Simulator Coordinator (MUSIC) enables any robotic and neural simulators that implement corresponding interfaces to be efficiently coupled, allowing real-time performance for a wide range of configurations.

Evolving Action Selection and Selective Attention Without Actions, Attention, or Selection.

TL;DR: A minimal animat architecture, consisting only of a set of autonomous, direct, and continuously active sensorimotor links, is shown to support a full range of ‘action selection’ phenomena.
Dissertation

Evolutionary robotics in high altitude wind energy applications

TL;DR: A multibody kite simulation that is used in an evolutionary process in which the kite is subject to deformation is introduced and the difficulty of the task must be increased during the evolutionary process to deal with this extreme variability in small increments.
Journal ArticleDOI

Experimental Study of Reinforcement Learning in Mobile Robots Through Spiking Architecture of Thalamo-Cortico-Thalamic Circuitry of Mammalian Brain

Vahid Azimirad, +1 more
- 01 Sep 2020 - 
TL;DR: Experimental studies prove that through the proposed method, thalamo-cortical structure could be trained successfully to learn to perform various robotic tasks.
Posted Content

Reinforcement Learning in a Neurally Controlled Robot Using Dopamine Modulated STDP.

TL;DR: This work provides insights into the reasons behind some observed biological phenomena, such as the bursting behaviour observed in dopaminergic neurons, as well as demonstrating how spiking neural network controlled robots are able to solve a range of reinforcement learning tasks.
References
More filters
Book

Reinforcement Learning: An Introduction

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Book

Introduction to Reinforcement Learning

TL;DR: In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning.
Journal ArticleDOI

Synaptic Modifications in Cultured Hippocampal Neurons: Dependence on Spike Timing, Synaptic Strength, and Postsynaptic Cell Type

TL;DR: The results underscore the importance of precise spike timing, synaptic strength, and postsynaptic cell type in the activity-induced modification of central synapses and suggest that Hebb’s rule may need to incorporate a quantitative consideration of spike timing that reflects the narrow and asymmetric window for the induction of synaptic modification.
Journal ArticleDOI

Simple model of spiking neurons

TL;DR: A model is presented that reproduces spiking and bursting behavior of known types of cortical neurons and combines the biologically plausibility of Hodgkin-Huxley-type dynamics and the computational efficiency of integrate-and-fire neurons.
Journal ArticleDOI

Predictive Reward Signal of Dopamine Neurons

TL;DR: Dopamine systems may have two functions, the phasic transmission of reward information and the tonic enabling of postsynaptic neurons.
Related Papers (5)