The role of state uncertainty in the dynamics of dopamine.

John G Mikhael, HyungGoo R Kim, Naoshige Uchida, Samuel J Gershman.

Abstract

Reinforcement learning models of the basal ganglia map the phasic dopamine signal to reward prediction errors (RPEs). Conventional models assert that, when a stimulus predicts a reward with fixed delay, dopamine activity during the delay should converge to baseline through learning. However, recent studies have found that dopamine ramps up before reward in certain conditions even after learning, thus challenging the conventional models. In this work, we show that sensory feedback causes an unbiased learner to produce RPE ramps. Our model predicts that when feedback gradually decreases during a trial, dopamine activity should resemble a "bump," whose ramp-up phase should, furthermore, be greater than that of conditions where the feedback stays high. We trained mice on a virtual navigation task with varying brightness, and both predictions were empirically observed. In sum, our theoretical and experimental results reconcile the seemingly conflicting data on dopamine behaviors under the RPE hypothesis.
Copyright © 2022 Elsevier Inc. All rights reserved.
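
As a point of reference for the conventional account mentioned in the abstract, the following is a minimal sketch of tabular TD(0) learning on a fixed-delay cue-reward task. It is not the authors' model: it assumes the learner always knows its exact state (no state uncertainty and no sensory feedback), and the function name `simulate_td`, the number of delay steps, the learning rate, and the discount factor are illustrative assumptions. Under these assumptions the RPE during the delay shrinks toward zero with training, while the RPE at the unpredicted cue stays positive.

```python
def simulate_td(n_steps=10, n_trials=2000, alpha=0.1, gamma=0.95):
    """Tabular TD(0) on a cue -> fixed delay -> reward chain (illustrative).

    States 0..n_steps-1 tile the delay after cue onset; index n_steps is a
    terminal state whose value stays at 0. A reward of 1 arrives on the
    transition out of the last delay state.
    """
    V = [0.0] * (n_steps + 1)

    def reward(t):
        return 1.0 if t == n_steps - 1 else 0.0

    for _ in range(n_trials):
        for t in range(n_steps):
            # Reward prediction error for the transition t -> t + 1.
            delta = reward(t) + gamma * V[t + 1] - V[t]
            V[t] += alpha * delta

    # RPEs after learning (no further updates applied here).
    # Assumption: the cue arrives unpredictably from a baseline state
    # whose value is 0, so the cue RPE is gamma * V[0] - 0.
    cue_rpe = gamma * V[0]
    delay_rpes = [reward(t) + gamma * V[t + 1] - V[t] for t in range(n_steps)]
    return V, cue_rpe, delay_rpes


if __name__ == "__main__":
    V, cue_rpe, delay_rpes = simulate_td()
    print("RPE at cue:", round(cue_rpe, 4))
    print("largest |RPE| during delay:", max(abs(d) for d in delay_rpes))
```

In this sketch the learned values rise toward the reward as `gamma ** (n_steps - 1 - t)`, yet the delay-period RPEs vanish because each value is fully predicted by the preceding state. The ramps and bumps reported in the abstract therefore require an additional ingredient, which the authors identify as sensory feedback acting on an uncertain state estimate.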

Keywords:  bumps; dopamine; ramps; reinforcement learning; reward prediction error; sensory feedback; state uncertainty; state value

Year: 2022    PMID: 35114098    PMCID: PMC8930519    DOI: 10.1016/j.cub.2022.01.025

Source DB: PubMed    Journal: Curr Biol    ISSN: 0960-9822    Impact factor: 10.834


