Literature DB >> 35798979

A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning.

Ryunosuke Amo1, Sara Matias1, Akihiro Yamanaka2, Kenji F Tanaka3, Naoshige Uchida1, Mitsuko Watabe-Uchida4.   

Abstract

A large body of evidence has indicated that the phasic responses of midbrain dopamine neurons show a remarkable similarity to a type of teaching signal (temporal difference (TD) error) used in machine learning. However, previous studies failed to observe a key prediction of this algorithm: that when an agent associates a cue and a reward that are separated in time, the timing of dopamine signals should gradually move backward in time from the time of the reward to the time of the cue over multiple trials. Here we demonstrate that such a gradual shift occurs both at the level of dopaminergic cellular activity and dopamine release in the ventral striatum in mice. Our results establish a long-sought link between dopaminergic activity and the TD learning algorithm, providing fundamental insights into how the brain associates cues and rewards that are separated in time.
© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35798979     DOI: 10.1038/s41593-022-01109-2

Source DB:  PubMed          Journal:  Nat Neurosci        ISSN: 1097-6256            Impact factor:   28.771


  53 in total

1.  How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues.

Authors:  J Brown; D Bullock; S Grossberg
Journal:  J Neurosci       Date:  1999-12-01       Impact factor: 6.167

Review 2.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning.

Authors:  P R Montague; P Dayan; T J Sejnowski
Journal:  J Neurosci       Date:  1996-03-01       Impact factor: 6.167

Review 3.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

4.  A systems-neuroscience model of phasic dopamine.

Authors:  Jessica A Mollick; Thomas E Hazy; Kai A Krueger; Ananta Nair; Prescott Mackie; Seth A Herd; Randall C O'Reilly
Journal:  Psychol Rev       Date:  2020-06-11       Impact factor: 8.934

5.  Dopamine neurons report an error in the temporal prediction of reward during learning.

Authors:  J R Hollerman; W Schultz
Journal:  Nat Neurosci       Date:  1998-08       Impact factor: 24.884

6.  Dopamine cells respond to predicted events during classical conditioning: evidence for eligibility traces in the reward-learning network.

Authors:  Wei-Xing Pan; Robert Schmidt; Jeffery R Wickens; Brian I Hyland
Journal:  J Neurosci       Date:  2005-06-29       Impact factor: 6.167

7.  Dopamine encoding of Pavlovian incentive stimuli diminishes with extended training.

Authors:  Jeremy J Clark; Anne L Collins; Christina Akers Sanford; Paul E M Phillips
Journal:  J Neurosci       Date:  2013-02-20       Impact factor: 6.167

8.  Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens.

Authors:  Jeremy J Day; Mitchell F Roitman; R Mark Wightman; Regina M Carelli
Journal:  Nat Neurosci       Date:  2007-07-01       Impact factor: 24.884

9.  A selective role for dopamine in stimulus-reward learning.

Authors:  Shelly B Flagel; Jeremy J Clark; Terry E Robinson; Leah Mayo; Alayna Czuj; Ingo Willuhn; Christina A Akers; Sarah M Clinton; Paul E M Phillips; Huda Akil
Journal:  Nature       Date:  2010-12-08       Impact factor: 49.962

10.  Opposite initialization to novel cues in dopamine signaling in ventral and posterior striatum in mice.

Authors:  William Menegas; Benedicte M Babayan; Naoshige Uchida; Mitsuko Watabe-Uchida
Journal:  Elife       Date:  2017-01-05       Impact factor: 8.140

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.