scispace - formally typeset
Open AccessJournal ArticleDOI

The Dopamine Prediction Error: Contributions to Associative Models of Reward Learning.

TLDR
This research hopes to support the expansion of how dopaminergic prediction errors are thought to contribute to the learning process beyond the traditional concept of transferring quantitative value.
Abstract
Phasic activity of midbrain dopamine neurons is currently thought to encapsulate the prediction-error signal described in Sutton and Barto’s (1981) model-free reinforcement learning algorithm. This phasic signal is thought to contain information about the quantitative value of reward, which transfers to the reward-predictive cue after learning. This is argued to endow the reward-predictive cue with the value inherent in the reward, motivating behavior towards cues signaling the presence of reward. Yet theoretical and empirical research has implicated prediction-error signaling in learning that extends far beyond a transfer of quantitative value to a reward-predictive cue. Here, we review the research which demonstrates the complexity of how dopaminergic prediction errors facilitate learning. After briefly discussing the literature demonstrating that phasic dopaminergic signals can act in the manner described by Sutton and Barto (1981), we consider how these signals may also influence attentional processing across multiple attentional systems in distinct brain circuits. Then, we discuss how prediction errors encode and promote the development of context-specific associations between cues and rewards. Finally, we consider recent evidence that shows dopaminergic activity contains information about causal relationships between cues and rewards that reflect information garnered from rich associative models of the world that can be adapted in the absence of direct experience. In discussing this research we hope to support the expansion of how dopaminergic prediction errors are thought to contribute to the learning process beyond the traditional concept of transferring quantitative value.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

A Novel Neural Prediction Error Found in Anterior Cingulate Cortex Ensembles

TL;DR: A novel mode of prediction error signaling by ACC neurons that is associated with the generation of an FN, the feedback-related negativity, is described.
Journal ArticleDOI

Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions

TL;DR: It is shown that, although both VTA and SNc DA neuron activation reinforces instrumental responding, only VTADA neuron activation during consumption of expected sucrose reward restores error-driven learning and promotes formation of a new cue→sucrose association.
Journal ArticleDOI

Illusory Motion Reproduced by Deep Neural Networks Trained for Prediction.

TL;DR: Surprisingly, the computer model accurately predicted the magnitude and direction of motion of a rotating propeller in unlearned videos and represented the rotational motion for illusion images that were not moving physically, much like human visual perception.
Journal ArticleDOI

Holistic Reinforcement Learning: The Role of Structure and Attention.

TL;DR: This work proposes an integration of Bayesian cognitive models in which structured knowledge learned via approximate Bayesian inference acts as a source of selective attention, which biases reinforcement learning towards relevant dimensions of the environment.
Journal ArticleDOI

A roadmap to integrate astrocytes into Systems Neuroscience.

TL;DR: The analysis suggests that astrocytes may carry out canonical computations in a time scale of subseconds to seconds in sensory processing, neuromodulation, brain state, memory formation, fear, and complex homeostatic reflexes.
References
More filters
Book

Reinforcement Learning: An Introduction

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Journal ArticleDOI

An integrative theory of prefrontal cortex function

TL;DR: It is proposed that cognitive control stems from the active maintenance of patterns of activity in the prefrontal cortex that represent goals and the means to achieve them, which provide bias signals to other brain structures whose net effect is to guide the flow of activity along neural pathways that establish the proper mappings between inputs, internal states, and outputs needed to perform a given task.
Journal ArticleDOI

A Neural Substrate of Prediction and Reward

TL;DR: Findings in this work indicate that dopaminergic neurons in the primate whose fluctuating output apparently signals changes or errors in the predictions of future salient and rewarding events can be understood through quantitative theories of adaptive optimizing control.
Journal ArticleDOI

A model for Pavlovian learning: Variations in the effectiveness of conditioned but not of unconditioned stimuli.

TL;DR: A new model is proposed that deals with the explanation of cases in which learning does not occur in spite of the fact that the conditioned stimulus is a signal for the reinforcer by specifying that certain procedures cause a conditioned stimulus to lose effectiveness.
Journal ArticleDOI

A Theory of Attention: Variations in the Associability of Stimuli with Reinforcement

TL;DR: Overshadowing and blocking are better explained by the choice of an appropriate rule for changing a, such that a decreases to stimuli that signal no change from the probability of reinforcement predicted by other stimuli.
Related Papers (5)