Reinforcement learning: computing the temporal difference of values via distinct corticostriatal pathways.

Kenji Morita, Mieko Morishima, Katsuyuki Sakai, Yasuo Kawaguchi.

Abstract

Midbrain dopamine neurons supposedly encode reward prediction error, but how error signals are computed remains elusive. Here, we propose a mechanism based on recent findings regarding corticostriatal circuits. Specifically, we propose that two distinct subpopulations of corticostriatal neurons differentially represent the animal's current and previous states/actions through unidirectional connectivity from one subpopulation to the other and strong recurrent excitation that exists only within the recipient subpopulation. These corticostriatal subpopulations selectively connect to the direct and indirect pathways of the basal ganglia, such that the temporal difference between the values of current and previous states/actions (the core of the error signal) can be computed. Our hypothesis suggests a unified view of basal ganglia functions and has important clinical implications.
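
The "temporal difference between the values of current and previous states/actions" can be illustrated with the textbook temporal-difference (TD) error from reinforcement learning. The sketch below is not the circuit model proposed in the paper; it only shows the arithmetic the abstract refers to. The function names, the discount factor `gamma`, the learning rate `alpha`, and the state labels are all assumptions made for illustration.

```python
def td_error(reward, value_current, value_previous, gamma=0.9):
    """Textbook TD error: reward plus discounted current value minus previous value.

    gamma is an assumed discount factor; the abstract does not specify one.
    """
    return reward + gamma * value_current - value_previous


def td_update(values, prev_state, curr_state, reward, alpha=0.1, gamma=0.9):
    """Nudge the stored value of the previous state by alpha times the TD error.

    values is a dict mapping hypothetical state labels to value estimates.
    Returns the TD error that drove the update.
    """
    delta = td_error(reward, values[curr_state], values[prev_state], gamma)
    values[prev_state] += alpha * delta
    return delta


# Hypothetical two-state example: a cue followed by a rewarded outcome.
values = {"cue": 0.0, "outcome": 0.5}
delta = td_update(values, prev_state="cue", curr_state="outcome", reward=1.0)
```

When the previous state's value already matches the reward plus the discounted current value, the error is zero and no update occurs, which is the sense in which the signal is a prediction error.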

Year: 2012 PMID: 22658226 DOI: 10.1016/j.tins.2012.04.009

Source DB: PubMed Journal: Trends Neurosci ISSN: 0166-2236 Impact factor: 13.837

Cited

38 in total

Review 1. Categorization = decision making + generalization.

Authors: Carol A Seger; Erik J Peterson
Journal: Neurosci Biobehav Rev Date: 2013-03-30 Impact factor: 8.989

2. Dopaminergic control of motivation and reinforcement learning: a closed-circuit account for reward-oriented behavior.

Authors: Kenji Morita; Mieko Morishima; Katsuyuki Sakai; Yasuo Kawaguchi
Journal: J Neurosci Date: 2013-05-15 Impact factor: 6.167

3. Reinforcement learning with Marr.

Authors: Yael Niv; Angela Langdon
Journal: Curr Opin Behav Sci Date: 2016-10

4. Methamphetamine-induced neurotoxicity disrupts pharmacologically evoked dopamine transients in the dorsomedial and dorsolateral striatum.

Authors: John D Robinson; Christopher D Howard; Elissa D Pastuzyn; Diane L Byers; Kristen A Keefe; Paul A Garris
Journal: Neurotox Res Date: 2014-02-22 Impact factor: 3.911

5. Dorsal striatum is necessary for stimulus-value but not action-value learning in humans.

Authors: Khoi Vo; Robb B Rutledge; Anjan Chatterjee; Joseph W Kable
Journal: Brain Date: 2014-10-01 Impact factor: 13.501

Review 6. The functional logic of corticostriatal connections.

Authors: Stewart Shipp
Journal: Brain Struct Funct Date: 2016-07-13 Impact factor: 3.270

Review 7. Dopamine Prediction Errors in Reward Learning and Addiction: From Theory to Neural Circuitry.

Authors: Ronald Keiflin; Patricia H Janak
Journal: Neuron Date: 2015-10-21 Impact factor: 17.173

8. GENSAT BAC cre-recombinase driver lines to study the functional organization of cerebral cortical and basal ganglia circuits.

Authors: Charles R Gerfen; Ronald Paletzki; Nathaniel Heintz
Journal: Neuron Date: 2013-12-18 Impact factor: 17.173

9. Distributed and Mixed Information in Monosynaptic Inputs to Dopamine Neurons.

Authors: Ju Tian; Ryan Huang; Jeremiah Y Cohen; Fumitaka Osakada; Dmitry Kobak; Christian K Machens; Edward M Callaway; Naoshige Uchida; Mitsuko Watabe-Uchida
Journal: Neuron Date: 2016-09-08 Impact factor: 17.173

10. Bipolar oscillations between positive and negative mood states in a computational model of Basal Ganglia.

Authors: Pragathi Priyadharsini Balasubramani; V Srinivasa Chakravarthy
Journal: Cogn Neurodyn Date: 2019-11-20 Impact factor: 5.082