Literature DB >> 16987637

Heterarchical reinforcement-learning model for integration of multiple cortico-striatal loops: fMRI examination in stimulus-action-reward association learning.

Masahiko Haruno1, Mitsuo Kawato.   

Abstract

The brain's most difficult computation in decision-making learning is searching for essential information related to rewards among vast multimodal inputs and then integrating it into beneficial behaviors. Contextual cues consisting of limbic, cognitive, visual, auditory, somatosensory, and motor signals need to be associated with both rewards and actions by utilizing an internal representation such as reward prediction and reward prediction error. Previous studies have suggested that a suitable brain structure for such integration is the neural circuitry associated with multiple cortico-striatal loops. However, computational exploration still remains into how the information in and around these multiple closed loops can be shared and transferred. Here, we propose a "heterarchical reinforcement learning" model, where reward prediction made by more limbic and cognitive loops is propagated to motor loops by spiral projections between the striatum and substantia nigra, assisted by cortical projections to the pedunculopontine tegmental nucleus, which sends excitatory input to the substantia nigra. The model makes several fMRI-testable predictions of brain activity during stimulus-action-reward association learning. The caudate nucleus and the cognitive cortical areas are correlated with reward prediction error, while the putamen and motor-related areas are correlated with stimulus-action-dependent reward prediction. Furthermore, a heterogeneous activity pattern within the striatum is predicted depending on learning difficulty, i.e., the anterior medial caudate nucleus will be correlated more with reward prediction error when learning becomes difficult, while the posterior putamen will be correlated more with stimulus-action-dependent reward prediction in easy learning. Our fMRI results revealed that different cortico-striatal loops are operating, as suggested by the proposed model.

Mesh:

Substances:

Year:  2006        PMID: 16987637     DOI: 10.1016/j.neunet.2006.06.007

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  50 in total

Review 1.  From ventral-medial to dorsal-lateral striatum: neural correlates of reward-guided decision-making.

Authors:  Amanda C Burton; Kae Nakamura; Matthew R Roesch
Journal:  Neurobiol Learn Mem       Date:  2014-05-21       Impact factor: 2.877

Review 2.  Reward-guided learning beyond dopamine in the nucleus accumbens: the integrative functions of cortico-basal ganglia networks.

Authors:  Henry H Yin; Sean B Ostlund; Bernard W Balleine
Journal:  Eur J Neurosci       Date:  2008-09-10       Impact factor: 3.386

Review 3.  From 'understanding the brain by creating the brain' towards manipulative neuroscience.

Authors:  Mitsuo Kawato
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-06-27       Impact factor: 6.237

4.  fMRI and EEG predictors of dynamic decision parameters during human reinforcement learning.

Authors:  Michael J Frank; Chris Gagne; Erika Nyhus; Sean Masters; Thomas V Wiecki; James F Cavanagh; David Badre
Journal:  J Neurosci       Date:  2015-01-14       Impact factor: 6.167

Review 5.  Creating the brain and interacting with the brain: an integrated approach to understanding the brain.

Authors:  Jun Morimoto; Mitsuo Kawato
Journal:  J R Soc Interface       Date:  2015-03-06       Impact factor: 4.118

Review 6.  Neurocomputational models of basal ganglia function in learning, memory and choice.

Authors:  Michael X Cohen; Michael J Frank
Journal:  Behav Brain Res       Date:  2008-10-04       Impact factor: 3.332

7.  Abnormal ventromedial prefrontal cortex function in children with psychopathic traits during reversal learning.

Authors:  Elizabeth C Finger; Abigail A Marsh; Derek G Mitchell; Marguerite E Reid; Courtney Sims; Salima Budhani; David S Kosson; Gang Chen; Kenneth E Towbin; Ellen Leibenluft; Daniel S Pine; James R Blair
Journal:  Arch Gen Psychiatry       Date:  2008-05

Review 8.  Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action.

Authors:  Bernard W Balleine; John P O'Doherty
Journal:  Neuropsychopharmacology       Date:  2010-01       Impact factor: 7.853

9.  Reduced Activation in the Pallidal-Thalamic-Motor Pathway Is Associated With Deficits in Reward-Modulated Inhibitory Control in Adults With a History of Attention-Deficit/Hyperactivity Disorder.

Authors:  Neil P Jones; Amelia Versace; Rachel Lindstrom; Tracey K Wilson; Elizabeth M Gnagy; William E Pelham; Brooke S G Molina; Cecile D Ladouceur
Journal:  Biol Psychiatry Cogn Neurosci Neuroimaging       Date:  2020-06-30

Review 10.  Computational and dynamic models in neuroimaging.

Authors:  Karl J Friston; Raymond J Dolan
Journal:  Neuroimage       Date:  2009-12-28       Impact factor: 6.556

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.