Reinforcement learning: computing the temporal difference of values via distinct corticostriatal pathways.

Kenji Morita, Mieko Morishima, Katsuyuki Sakai, Yasuo Kawaguchi.

Abstract

Midbrain dopamine neurons supposedly encode reward prediction error, but how error signals are computed remains elusive. Here, we propose a mechanism based on recent findings regarding corticostriatal circuits. Specifically, we propose that two distinct subpopulations of corticostriatal neurons differentially represent the animal's current and previous states/actions through unidirectional connectivity from one subpopulation to the other and strong recurrent excitation that exists only within the recipient subpopulation. These corticostriatal subpopulations selectively connect to the direct and indirect pathways of the basal ganglia, such that the temporal difference between the values of current and previous states/actions (the core of the error signal) can be computed. Our hypothesis suggests a unified view of basal ganglia functions and has important clinical implications.
Copyright © 2012 Elsevier Ltd. All rights reserved.
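
For readers unfamiliar with the quantity the abstract refers to, the temporal difference of values can be illustrated with the textbook TD(0) error. This is a minimal sketch of the standard reinforcement-learning formula, not the circuit model proposed in the paper: the reward term, the discount factor `gamma`, the learning rate `alpha`, and the function and state names are all assumptions introduced here for illustration.

```python
def td_error(reward, value_current, value_previous, gamma=0.9):
    """Standard TD(0) error: reward plus discounted current value minus previous value.

    gamma (discount factor) is an assumed parameter, not taken from the abstract.
    """
    return reward + gamma * value_current - value_previous


def td_update(values, prev_state, curr_state, reward, alpha=0.1, gamma=0.9):
    """Move the previous state's value estimate toward the TD target.

    values is a dict mapping hypothetical state names to value estimates;
    alpha (learning rate) is an assumed parameter.
    """
    delta = td_error(reward, values[curr_state], values[prev_state], gamma)
    values[prev_state] += alpha * delta
    return delta


# Hypothetical two-state example: a cue followed by a rewarded outcome.
values = {"cue": 0.0, "outcome": 0.0}
delta = td_update(values, "cue", "outcome", reward=1.0)
```

In this sketch the error is positive when the outcome is better than the previous state's value predicted, and the previous state's value is nudged upward accordingly.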

Year:  2012        PMID: 22658226     DOI: 10.1016/j.tins.2012.04.009

Source DB:  PubMed          Journal:  Trends Neurosci        ISSN: 0166-2236            Impact factor:   13.837


  38 in total

1.  Categorization = decision making + generalization. (Review)

Authors:  Carol A Seger; Erik J Peterson
Journal:  Neurosci Biobehav Rev       Date:  2013-03-30       Impact factor: 8.989

2.  Dopaminergic control of motivation and reinforcement learning: a closed-circuit account for reward-oriented behavior.

Authors:  Kenji Morita; Mieko Morishima; Katsuyuki Sakai; Yasuo Kawaguchi
Journal:  J Neurosci       Date:  2013-05-15       Impact factor: 6.167

3.  Reinforcement learning with Marr.

Authors:  Yael Niv; Angela Langdon
Journal:  Curr Opin Behav Sci       Date:  2016-10

4.  Methamphetamine-induced neurotoxicity disrupts pharmacologically evoked dopamine transients in the dorsomedial and dorsolateral striatum.

Authors:  John D Robinson; Christopher D Howard; Elissa D Pastuzyn; Diane L Byers; Kristen A Keefe; Paul A Garris
Journal:  Neurotox Res       Date:  2014-02-22       Impact factor: 3.911

5.  Dorsal striatum is necessary for stimulus-value but not action-value learning in humans.

Authors:  Khoi Vo; Robb B Rutledge; Anjan Chatterjee; Joseph W Kable
Journal:  Brain       Date:  2014-10-01       Impact factor: 13.501

6.  The functional logic of corticostriatal connections. (Review)

Authors:  Stewart Shipp
Journal:  Brain Struct Funct       Date:  2016-07-13       Impact factor: 3.270

7.  Dopamine Prediction Errors in Reward Learning and Addiction: From Theory to Neural Circuitry. (Review)

Authors:  Ronald Keiflin; Patricia H Janak
Journal:  Neuron       Date:  2015-10-21       Impact factor: 17.173

8.  GENSAT BAC cre-recombinase driver lines to study the functional organization of cerebral cortical and basal ganglia circuits.

Authors:  Charles R Gerfen; Ronald Paletzki; Nathaniel Heintz
Journal:  Neuron       Date:  2013-12-18       Impact factor: 17.173

9.  Distributed and Mixed Information in Monosynaptic Inputs to Dopamine Neurons.

Authors:  Ju Tian; Ryan Huang; Jeremiah Y Cohen; Fumitaka Osakada; Dmitry Kobak; Christian K Machens; Edward M Callaway; Naoshige Uchida; Mitsuko Watabe-Uchida
Journal:  Neuron       Date:  2016-09-08       Impact factor: 17.173

10.  Bipolar oscillations between positive and negative mood states in a computational model of Basal Ganglia.

Authors:  Pragathi Priyadharsini Balasubramani; V Srinivasa Chakravarthy
Journal:  Cogn Neurodyn       Date:  2019-11-20       Impact factor: 5.082
