Literature DB >> 20858126

Internal-time temporal difference model for neural value-based decision making.

Hiroyuki Nakahara1, Sivaramakrishnan Kaveri.   

Abstract

The temporal difference (TD) learning framework is a major paradigm for understanding value-based decision making and related neural activities (e.g., dopamine activity). The representation of time in neural processes modeled by a TD framework, however, is poorly understood. To address this issue, we propose a TD formulation that separates the time of the operator (neural valuation processes), which we refer to as internal time, from the time of the observer (experiment), which we refer to as conventional time. We provide the formulation and theoretical characteristics of this TD model based on internal time, called internal-time TD, and explore the possible consequences of the use of this model in neural value-based decision making. Due to the separation of the two times, internal-time TD computations, such as TD error, are expressed differently, depending on both the time frame and time unit. We examine this operator-observer problem in relation to the time representation used in previous TD models. An internal time TD value function exhibits the co-appearance of exponential and hyperbolic discounting at different delays in intertemporal choice tasks. We further examine the effects of internal time noise on TD error, the dynamic construction of internal time, and the modulation of internal time with the internal time hypothesis of serotonin function. We also relate the internal TD formulation to research on interval timing and subjective time.

Entities:  

Mesh:

Year:  2010        PMID: 20858126     DOI: 10.1162/NECO_a_00049

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  8 in total

1.  Time-scale-invariant information-theoretic contingencies in discrimination learning.

Authors:  Abigail Kalmbach; Eileen Chun; Kathleen Taylor; Charles R Gallistel; Peter D Balsam
Journal:  J Exp Psychol Anim Learn Cogn       Date:  2019-04-25       Impact factor: 2.478

2.  Learning to represent reward structure: a key to adapting to complex environments.

Authors:  Hiroyuki Nakahara; Okihide Hikosaka
Journal:  Neurosci Res       Date:  2012-10-13       Impact factor: 3.304

3.  Positive temporal dependence of the biological clock implies hyperbolic discounting.

Authors:  Debajyoti Ray; Peter Bossaerts
Journal:  Front Neurosci       Date:  2011-01-28       Impact factor: 4.677

4.  Time representation in reinforcement learning models of the basal ganglia.

Authors:  Samuel J Gershman; Ahmed A Moustafa; Elliot A Ludvig
Journal:  Front Comput Neurosci       Date:  2014-01-09       Impact factor: 2.380

5.  Tamping Ramping: Algorithmic, Implementational, and Computational Explanations of Phasic Dopamine Signals in the Accumbens.

Authors:  Kevin Lloyd; Peter Dayan
Journal:  PLoS Comput Biol       Date:  2015-12-23       Impact factor: 4.475

6.  Computing reward-prediction error: an integrated account of cortical timing and basal-ganglia pathways for appetitive and aversive learning.

Authors:  Kenji Morita; Yasuo Kawaguchi
Journal:  Eur J Neurosci       Date:  2015-07-25       Impact factor: 3.386

7.  Formal comparison of dual-parameter temporal discounting models in controls and pathological gamblers.

Authors:  Jan Peters; Stephan Franz Miedl; Christian Büchel
Journal:  PLoS One       Date:  2012-11-30       Impact factor: 3.240

8.  Multiple modes of impulsivity in Parkinson's disease.

Authors:  Cristina Nombela; Timothy Rittman; Trevor W Robbins; James B Rowe
Journal:  PLoS One       Date:  2014-01-21       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.