Literature DB >> 19847635

Alternative time representation in dopamine models.

François Rivest1, John F Kalaska, Yoshua Bengio.   

Abstract

Dopaminergic neuron activity has been modeled during learning and appetitive behavior, most commonly using the temporal-difference (TD) algorithm. However, a proper representation of elapsed time and of the exact task is usually required for the model to work. Most models use timing elements such as delay-line representations of time that are not biologically realistic for intervals in the range of seconds. The interval-timing literature provides several alternatives. One of them is that timing could emerge from general network dynamics, instead of coming from a dedicated circuit. Here, we present a general rate-based learning model based on long short-term memory (LSTM) networks that learns a time representation when needed. Using a naïve network learning its environment in conjunction with TD, we reproduce dopamine activity in appetitive trace conditioning with a constant CS-US interval, including probe trials with unexpected delays. The proposed model learns a representation of the environment dynamics in an adaptive biologically plausible framework, without recourse to delay lines or other special-purpose circuits. Instead, the model predicts that the task-dependent representation of time is learned by experience, is encoded in ramp-like changes in single-neuron activity distributed across small neural networks, and reflects a temporal integration mechanism resulting from the inherent dynamics of recurrent loops within the network. The model also reproduces the known finding that trace conditioning is more difficult than delay conditioning and that the learned representation of the task can be highly dependent on the types of trials experienced during training. Finally, it suggests that the phasic dopaminergic signal could facilitate learning in the cortex.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19847635     DOI: 10.1007/s10827-009-0191-1

Source DB:  PubMed          Journal:  J Comput Neurosci        ISSN: 0929-5313            Impact factor:   1.621


  59 in total

1.  Retrospective and prospective coding for predicted reward in the sensory thalamus.

Authors:  Y Komura; R Tamura; T Uwano; H Nishijo; K Kaga; T Ono
Journal:  Nature       Date:  2001-08-02       Impact factor: 49.962

2.  Neuronal activity in monkey ventral striatum related to the expectation of reward.

Authors:  W Schultz; P Apicella; E Scarnati; T Ljungberg
Journal:  J Neurosci       Date:  1992-12       Impact factor: 6.167

Review 3.  The connections of the dopaminergic system with the striatum in rats and primates: an analysis with respect to the functional and compartmental organization of the striatum.

Authors:  D Joel; I Weiner
Journal:  Neuroscience       Date:  2000       Impact factor: 3.590

Review 4.  What makes us tick? Functional and neural mechanisms of interval timing.

Authors:  Catalin V Buhusi; Warren H Meck
Journal:  Nat Rev Neurosci       Date:  2005-10       Impact factor: 34.870

Review 5.  Who's on first? What's on second? The time course of learning in corticostriatal systems.

Authors:  Mark Laubach
Journal:  Trends Neurosci       Date:  2005-10       Impact factor: 13.837

6.  Timing in the absence of clocks: encoding time in neural network states.

Authors:  Uma R Karmarkar; Dean V Buonomano
Journal:  Neuron       Date:  2007-02-01       Impact factor: 17.173

7.  Dorsal, ventral, and complete excitotoxic lesions of the hippocampus in rats failed to impair appetitive trace conditioning.

Authors:  Geneviève Thibaudeau; Olivier Potvin; Kevin Allen; François Y Doré; Sonia Goulet
Journal:  Behav Brain Res       Date:  2007-07-12       Impact factor: 3.332

8.  A spiking neural network model of an actor-critic learning agent.

Authors:  Wiebke Potjans; Abigail Morrison; Markus Diesmann
Journal:  Neural Comput       Date:  2009-02       Impact factor: 2.026

9.  Stimulus representation and the timing of reward-prediction errors in models of the dopamine system.

Authors:  Elliot A Ludvig; Richard S Sutton; E James Kehoe
Journal:  Neural Comput       Date:  2008-12       Impact factor: 2.026

Review 10.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

View more
  9 in total

Review 1.  Towards a unified model of pavlovian conditioning: short review of trace conditioning models.

Authors:  V I Kryukov
Journal:  Cogn Neurodyn       Date:  2012-02-22       Impact factor: 5.082

2.  Reinforcement learning with Marr.

Authors:  Yael Niv; Angela Langdon
Journal:  Curr Opin Behav Sci       Date:  2016-10

3.  Time-scale-invariant information-theoretic contingencies in discrimination learning.

Authors:  Abigail Kalmbach; Eileen Chun; Kathleen Taylor; Charles R Gallistel; Peter D Balsam
Journal:  J Exp Psychol Anim Learn Cogn       Date:  2019-04-25       Impact factor: 2.478

Review 4.  The role of working memory and declarative memory in trace conditioning.

Authors:  David A Connor; Thomas J Gould
Journal:  Neurobiol Learn Mem       Date:  2016-07-12       Impact factor: 2.877

5.  Functional coding variation in the presynaptic dopamine transporter associated with neuropsychiatric disorders drives enhanced motivation and context-dependent impulsivity in mice.

Authors:  Gwynne L Davis; Adele Stewart; Gregg D Stanwood; Raajaram Gowrishankar; Maureen K Hahn; Randy D Blakely
Journal:  Behav Brain Res       Date:  2017-09-28       Impact factor: 3.332

6.  Time representation in reinforcement learning models of the basal ganglia.

Authors:  Samuel J Gershman; Ahmed A Moustafa; Elliot A Ludvig
Journal:  Front Comput Neurosci       Date:  2014-01-09       Impact factor: 2.380

7.  Tamping Ramping: Algorithmic, Implementational, and Computational Explanations of Phasic Dopamine Signals in the Accumbens.

Authors:  Kevin Lloyd; Peter Dayan
Journal:  PLoS Comput Biol       Date:  2015-12-23       Impact factor: 4.475

8.  Functional Relevance of Different Basal Ganglia Pathways Investigated in a Spiking Model with Reward Dependent Plasticity.

Authors:  Pierre Berthet; Mikael Lindahl; Philip J Tully; Jeanette Hellgren-Kotaleski; Anders Lansner
Journal:  Front Neural Circuits       Date:  2016-07-21       Impact factor: 3.492

9.  Timing and expectation of reward: a neuro-computational model of the afferents to the ventral tegmental area.

Authors:  Julien Vitay; Fred H Hamker
Journal:  Front Neurorobot       Date:  2014-01-31       Impact factor: 2.650

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.