Adaptive properties of differential learning rates for positive and negative outcomes

doi:10.1007/S00422-013-0571-5

Journal Article•DOI•

Adaptive properties of differential learning rates for positive and negative outcomes

Romain D. Cazé¹, Matthijs A. A. van der Meer²•Institutions (2)

Imperial College London¹, University of Waterloo²

01 Dec 2013-Biological Cybernetics (Springer Berlin Heidelberg)-Vol. 107, Iss: 6, pp 711-719

TL;DR: It is shown analytically how the optimal learning rate asymmetry depends on the reward distribution and how a biologically plausible algorithm that adapts the balance of positive and negative learning rates from experience is implemented.

read less

Abstract: The concept of the reward prediction error--the difference between reward obtained and reward predicted--continues to be a focal point for much theoretical and experimental work in psychology, cognitive science, and neuroscience Models that rely on reward prediction errors typically assume a single learning rate for positive and negative prediction errors However, behavioral data indicate that better-than-expected and worse-than-expected outcomes often do not have symmetric impacts on learning and decision-making Furthermore, distinct circuits within cortico-striatal loops appear to support learning from positive and negative prediction errors, respectively Such differential learning rates would be expected to lead to biased reward predictions and therefore suboptimal choice performance Contrary to this intuition, we show that on static "bandit" choice tasks, differential learning rates can be adaptive This occurs because asymmetric learning enables a better separation of learned reward probabilities We show analytically how the optimal learning rate asymmetry depends on the reward distribution and implement a biologically plausible algorithm that adapts the balance of positive and negative learning rates from experience These results suggest specific adaptive advantages for separate, differential learning rates in simple reinforcement learning settings and provide a novel, normative perspective on the interpretation of associated neural data

...read moreread less

Adaptive properties of differential learning rates for positive and negative outcomes

Citations

References

"Adaptive properties of differential..." refers background in this paper

"Adaptive properties of differential..." refers background in this paper

"Adaptive properties of differential..." refers background or methods in this paper

"Adaptive properties of differential..." refers background in this paper

Related Papers (5)