PVLV: the primary value and learned value Pavlovian learning algorithm.
read more
Citations
Reinforcement Learning: An Introduction
Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia
Hold your horses: a dynamic computational role for the subthalamic nucleus in decision making
2006 Special Issue Hold your horses: A dynamic computational role for the subthalamic nucleus in decision making
Reinforcement learning: The Good, The Bad and The Ugly
References
Long short-term memory
Reinforcement Learning: An Introduction
Learning to Predict by the Methods of Temporal Differences
Short-Term Synaptic Plasticity
Related Papers (5)
By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism
Frequently Asked Questions (11)
Q2. What contributions have the authors mentioned in the paper "Pvlv: the primary value and learned value pavlovian learning algorithm" ?
The authors present their primary value learned value ( PVLV ) model for understanding the rewardpredictive firing properties of dopamine ( DA ) neurons as an alternative to the temporal-differences ( TD ) algorithm. The authors show that the PVLV model can account for critical aspects of the DA firing data, making a number of clear predictions about lesion effects, several of which are consistent with existing data. Overall, the model provides a biologically plausible framework for understanding the neural basis of reward learning.
Q3. What is the role of the striosome/patch neurons in the ventral?
The striosome/patch neurons in the ventral striatum should be specifically responsible for producing the dip in dopamine firing at the time of expected rewards, when rewards are not delivered.
Q4. What is the role of the VS striosome/patch neurons in learning?
At a behavioral level, the VS striosome/patch neurons should be specifically important for extinction and reversal learning, which should depend on the dopamine dip for expected but not delivered rewards.
Q5. What was the main motivation for the development of the Rescorla learning rule?
The blocking effect (Kamin, 1968) was one of the main motivations leading to the development of the Rescorla–Wagner learning rule.
Q6. What is the role of the CNA in learning working memory tasks?
At a behavioral level, CNA should be critical for learning working memory tasks, in which working memory update actions (driven by more central regions of the dorsal striatum that project to prefrontal cortex) must be activated at the onset of task-relevant stimuli to encode these stimuli into working memory (O’Reilly & Frank, 2006).
Q7. What is the role of the basal ganglia in dopamine release?
their more general basal ganglia model suggests that the disinhibitory pathway from the striatum to the SNc via the ventral pallidum is responsible for disinhibiting dopamine release for actions that were initiated by “go” signals in the striatum and not for directly activating dopamine bursting (O’Reilly & Frank, 2006).
Q8. What is the standard account of these findings?
The standard account of these findings is that these additional brain systems are necessary for maintaining the stimulus trace through to the point of reward so that a reward association can be established.
Q9. What is the strongest support for the TD framework?
Perhaps the strongest support for this framework as a model of reward learning in the brain comes from the good fit between the functional properties of the PV and LV components of PVLV and those of brain areas that are known to support reward learning.
Q10. What is the effect of the ventral striatum pathway on the reward system?
this ventral striatum pathway achieves an effect similar to the LV system in PVLV (i.e., CS-onset firing), whereas the striosomal pathway achieves an effect similar to the PV system (i.e., reward burst canceling).
Q11. What is the pattern of results that is seen in the DA burst?
The actual pattern of results that is seen depends on the preexisting salience of the two stimuli, with the more salient stimulus overshadowing the lesser.