Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Alternative time representation in dopamine models.

Literature DB >> 19847635

Alternative time representation in dopamine models.

François Rivest¹, John F Kalaska, Yoshua Bengio.

Abstract

Dopaminergic neuron activity has been modeled during learning and appetitive behavior, most commonly using the temporal-difference (TD) algorithm. However, a proper representation of elapsed time and of the exact task is usually required for the model to work. Most models use timing elements such as delay-line representations of time that are not biologically realistic for intervals in the range of seconds. The interval-timing literature provides several alternatives. One of them is that timing could emerge from general network dynamics, instead of coming from a dedicated circuit. Here, we present a general rate-based learning model based on long short-term memory (LSTM) networks that learns a time representation when needed. Using a naïve network learning its environment in conjunction with TD, we reproduce dopamine activity in appetitive trace conditioning with a constant CS-US interval, including probe trials with unexpected delays. The proposed model learns a representation of the environment dynamics in an adaptive biologically plausible framework, without recourse to delay lines or other special-purpose circuits. Instead, the model predicts that the task-dependent representation of time is learned by experience, is encoded in ramp-like changes in single-neuron activity distributed across small neural networks, and reflects a temporal integration mechanism resulting from the inherent dynamics of recurrent loops within the network. The model also reproduces the known finding that trace conditioning is more difficult than delay conditioning and that the learned representation of the task can be highly dependent on the types of trials experienced during training. Finally, it suggests that the phasic dopaminergic signal could facilitate learning in the cortex.

Entities: Chemical

Mesh：

Substances：
Dopamine

Year: 2009 PMID： 19847635 DOI： 10.1007/s10827-009-0191-1

Source DB: PubMed Journal: J Comput Neurosci ISSN： 0929-5313 Impact factor: 1.621

59 in total

1. Retrospective and prospective coding for predicted reward in the sensory thalamus.

Authors: Y Komura; R Tamura; T Uwano; H Nishijo; K Kaga; T Ono
Journal: Nature Date: 2001-08-02 Impact factor: 49.962

2. Neuronal activity in monkey ventral striatum related to the expectation of reward.

Authors: W Schultz; P Apicella; E Scarnati; T Ljungberg
Journal: J Neurosci Date: 1992-12 Impact factor: 6.167

Review 3. The connections of the dopaminergic system with the striatum in rats and primates: an analysis with respect to the functional and compartmental organization of the striatum.

Authors: D Joel; I Weiner
Journal: Neuroscience Date: 2000 Impact factor: 3.590

Review 4. What makes us tick? Functional and neural mechanisms of interval timing.

Authors: Catalin V Buhusi; Warren H Meck
Journal: Nat Rev Neurosci Date: 2005-10 Impact factor: 34.870

Review 5. Who's on first? What's on second? The time course of learning in corticostriatal systems.

Authors: Mark Laubach
Journal: Trends Neurosci Date: 2005-10 Impact factor: 13.837

6. Timing in the absence of clocks: encoding time in neural network states.

Authors: Uma R Karmarkar; Dean V Buonomano
Journal: Neuron Date: 2007-02-01 Impact factor: 17.173

7. Dorsal, ventral, and complete excitotoxic lesions of the hippocampus in rats failed to impair appetitive trace conditioning.

Authors: Geneviève Thibaudeau; Olivier Potvin; Kevin Allen; François Y Doré; Sonia Goulet
Journal: Behav Brain Res Date: 2007-07-12 Impact factor: 3.332

8. A spiking neural network model of an actor-critic learning agent.

Authors: Wiebke Potjans; Abigail Morrison; Markus Diesmann
Journal: Neural Comput Date: 2009-02 Impact factor: 2.026

9. Stimulus representation and the timing of reward-prediction errors in models of the dopamine system.

Authors: Elliot A Ludvig; Richard S Sutton; E James Kehoe
Journal: Neural Comput Date: 2008-12 Impact factor: 2.026

Review 10. A neural substrate of prediction and reward.

Authors: W Schultz; P Dayan; P R Montague
Journal: Science Date: 1997-03-14 Impact factor: 47.728

9 in total

Review 1. Towards a unified model of pavlovian conditioning: short review of trace conditioning models.

Authors: V I Kryukov
Journal: Cogn Neurodyn Date: 2012-02-22 Impact factor: 5.082

2. Reinforcement learning with Marr.

Authors: Yael Niv; Angela Langdon
Journal: Curr Opin Behav Sci Date: 2016-10

3. Time-scale-invariant information-theoretic contingencies in discrimination learning.

Authors: Abigail Kalmbach; Eileen Chun; Kathleen Taylor; Charles R Gallistel; Peter D Balsam
Journal: J Exp Psychol Anim Learn Cogn Date: 2019-04-25 Impact factor: 2.478

Review 4. The role of working memory and declarative memory in trace conditioning.

Authors: David A Connor; Thomas J Gould
Journal: Neurobiol Learn Mem Date: 2016-07-12 Impact factor: 2.877

5. Functional coding variation in the presynaptic dopamine transporter associated with neuropsychiatric disorders drives enhanced motivation and context-dependent impulsivity in mice.

Authors: Gwynne L Davis; Adele Stewart; Gregg D Stanwood; Raajaram Gowrishankar; Maureen K Hahn; Randy D Blakely
Journal: Behav Brain Res Date: 2017-09-28 Impact factor: 3.332

6. Time representation in reinforcement learning models of the basal ganglia.

Authors: Samuel J Gershman; Ahmed A Moustafa; Elliot A Ludvig
Journal: Front Comput Neurosci Date: 2014-01-09 Impact factor: 2.380

7. Tamping Ramping: Algorithmic, Implementational, and Computational Explanations of Phasic Dopamine Signals in the Accumbens.

Authors: Kevin Lloyd; Peter Dayan
Journal: PLoS Comput Biol Date: 2015-12-23 Impact factor: 4.475

8. Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity.

Authors: Pierre Berthet; Mikael Lindahl; Philip J Tully; Jeanette Hellgren-Kotaleski; Anders Lansner
Journal: Front Neural Circuits Date: 2016-07-21 Impact factor: 3.492

9. Timing and expectation of reward: a neuro-computational model of the afferents to the ventral tegmental area.

Authors: Julien Vitay; Fred H Hamker
Journal: Front Neurorobot Date: 2014-01-31 Impact factor: 2.650

9 in total