Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning.

Literature DB >> 35798979

A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning.

Ryunosuke Amo¹, Sara Matias¹, Akihiro Yamanaka², Kenji F Tanaka³, Naoshige Uchida¹, Mitsuko Watabe-Uchida⁴.

Abstract

A large body of evidence has indicated that the phasic responses of midbrain dopamine neurons show a remarkable similarity to a type of teaching signal (temporal difference (TD) error) used in machine learning. However, previous studies failed to observe a key prediction of this algorithm: that when an agent associates a cue and a reward that are separated in time, the timing of dopamine signals should gradually move backward in time from the time of the reward to the time of the cue over multiple trials. Here we demonstrate that such a gradual shift occurs both at the level of dopaminergic cellular activity and dopamine release in the ventral striatum in mice. Our results establish a long-sought link between dopaminergic activity and the TD learning algorithm, providing fundamental insights into how the brain associates cues and rewards that are separated in time.

Entities: Chemical

Mesh：

Substances：
Dopamine

Year: 2022 PMID： 35798979 DOI： 10.1038/s41593-022-01109-2

Source DB: PubMed Journal: Nat Neurosci ISSN： 1097-6256 Impact factor: 28.771

Keyword Cloud
References

53 in total

1. How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues.

Authors: J Brown; D Bullock; S Grossberg
Journal: J Neurosci Date: 1999-12-01 Impact factor: 6.167

Review 2. A framework for mesencephalic dopamine systems based on predictive Hebbian learning.

Authors: P R Montague; P Dayan; T J Sejnowski
Journal: J Neurosci Date: 1996-03-01 Impact factor: 6.167

Review 3. A neural substrate of prediction and reward.

Authors: W Schultz; P Dayan; P R Montague
Journal: Science Date: 1997-03-14 Impact factor: 47.728

4. A systems-neuroscience model of phasic dopamine.

Authors: Jessica A Mollick; Thomas E Hazy; Kai A Krueger; Ananta Nair; Prescott Mackie; Seth A Herd; Randall C O'Reilly
Journal: Psychol Rev Date: 2020-06-11 Impact factor: 8.934

5. Dopamine neurons report an error in the temporal prediction of reward during learning.

Authors: J R Hollerman; W Schultz
Journal: Nat Neurosci Date: 1998-08 Impact factor: 24.884

6. Dopamine cells respond to predicted events during classical conditioning: evidence for eligibility traces in the reward-learning network.

Authors: Wei-Xing Pan; Robert Schmidt; Jeffery R Wickens; Brian I Hyland
Journal: J Neurosci Date: 2005-06-29 Impact factor: 6.167

A gradual temporal shift of dopamine responses mirrors the progression of temporal difference error in machine learning.

1. How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues.

Review 2. A framework for mesencephalic dopamine systems based on predictive Hebbian learning.

Review 3. A neural substrate of prediction and reward.

4. A systems-neuroscience model of phasic dopamine.

5. Dopamine neurons report an error in the temporal prediction of reward during learning.

6. Dopamine cells respond to predicted events during classical conditioning: evidence for eligibility traces in the reward-learning network.

7. Dopamine encoding of Pavlovian incentive stimuli diminishes with extended training.

8. Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens.

9. A selective role for dopamine in stimulus-reward learning.

10. Opposite initialization to novel cues in dopamine signaling in ventral and posterior striatum in mice.