Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Internal-time temporal difference model for neural value-based decision making.

Literature DB >> 20858126

Internal-time temporal difference model for neural value-based decision making.

Hiroyuki Nakahara¹, Sivaramakrishnan Kaveri.

Abstract

The temporal difference (TD) learning framework is a major paradigm for understanding value-based decision making and related neural activities (e.g., dopamine activity). The representation of time in neural processes modeled by a TD framework, however, is poorly understood. To address this issue, we propose a TD formulation that separates the time of the operator (neural valuation processes), which we refer to as internal time, from the time of the observer (experiment), which we refer to as conventional time. We provide the formulation and theoretical characteristics of this TD model based on internal time, called internal-time TD, and explore the possible consequences of the use of this model in neural value-based decision making. Due to the separation of the two times, internal-time TD computations, such as TD error, are expressed differently, depending on both the time frame and time unit. We examine this operator-observer problem in relation to the time representation used in previous TD models. An internal time TD value function exhibits the co-appearance of exponential and hyperbolic discounting at different delays in intertemporal choice tasks. We further examine the effects of internal time noise on TD error, the dynamic construction of internal time, and the modulation of internal time with the internal time hypothesis of serotonin function. We also relate the internal TD formulation to research on interval timing and subjective time.

Entities: Chemical Disease

Mesh：

Year: 2010 PMID： 20858126 DOI： 10.1162/NECO_a_00049

Source DB: PubMed Journal: Neural Comput ISSN： 0899-7667 Impact factor: 2.026

Keyword Cloud
Cited

8 in total

1. Time-scale-invariant information-theoretic contingencies in discrimination learning.

Authors: Abigail Kalmbach; Eileen Chun; Kathleen Taylor; Charles R Gallistel; Peter D Balsam
Journal: J Exp Psychol Anim Learn Cogn Date: 2019-04-25 Impact factor: 2.478

2. Learning to represent reward structure: a key to adapting to complex environments.

Authors: Hiroyuki Nakahara; Okihide Hikosaka
Journal: Neurosci Res Date: 2012-10-13 Impact factor: 3.304

3. Positive temporal dependence of the biological clock implies hyperbolic discounting.

Authors: Debajyoti Ray; Peter Bossaerts
Journal: Front Neurosci Date: 2011-01-28 Impact factor: 4.677

4. Time representation in reinforcement learning models of the basal ganglia.

Authors: Samuel J Gershman; Ahmed A Moustafa; Elliot A Ludvig
Journal: Front Comput Neurosci Date: 2014-01-09 Impact factor: 2.380

5. Tamping Ramping: Algorithmic, Implementational, and Computational Explanations of Phasic Dopamine Signals in the Accumbens.

Authors: Kevin Lloyd; Peter Dayan
Journal: PLoS Comput Biol Date: 2015-12-23 Impact factor: 4.475

6. Computing reward-prediction error: an integrated account of cortical timing and basal-ganglia pathways for appetitive and aversive learning.

Authors: Kenji Morita; Yasuo Kawaguchi
Journal: Eur J Neurosci Date: 2015-07-25 Impact factor: 3.386

7. Formal comparison of dual-parameter temporal discounting models in controls and pathological gamblers.

Authors: Jan Peters; Stephan Franz Miedl; Christian Büchel
Journal: PLoS One Date: 2012-11-30 Impact factor: 3.240

8. Multiple modes of impulsivity in Parkinson's disease.

Authors: Cristina Nombela; Timothy Rittman; Trevor W Robbins; James B Rowe
Journal: PLoS One Date: 2014-01-21 Impact factor: 3.240

8 in total