
States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.

Jan Gläscher, Nathaniel Daw, Peter Dayan, John P O'Doherty.

Abstract

Reinforcement learning (RL) uses sequential experience with situations ("states") and outcomes to assess actions. Whereas model-free RL uses this experience directly, in the form of a reward prediction error (RPE), model-based RL uses it indirectly, building a model of the state transition and outcome structure of the environment, and evaluating actions by searching this model. A state prediction error (SPE) plays a central role, reporting discrepancies between the current model and the observed state transitions. Using functional magnetic resonance imaging in humans solving a probabilistic Markov decision task, we found the neural signature of an SPE in the intraparietal sulcus and lateral prefrontal cortex, in addition to the previously well-characterized RPE in the ventral striatum. This finding supports the existence of two unique forms of learning signal in humans, which may form the basis of distinct computational strategies for guiding behavior. Copyright 2010 Elsevier Inc. All rights reserved.
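The two error signals contrasted in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not state: a tabular SARSA-style rule for the model-free RPE, and a simple delta rule on estimated transition probabilities for the model-based SPE. The function names (`rpe_update`, `spe_update`), the learning rates (`alpha`, `eta`), the discount factor (`gamma`), and the toy state and action counts are all hypothetical and chosen only for illustration; this should not be read as the authors' exact model.

```python
def rpe_update(q, s, a, r, s_next, a_next, alpha=0.2, gamma=1.0):
    """Model-free step: update an action value from a reward prediction error.

    q is a table of action values, q[state][action]. Assumes a SARSA-style rule.
    """
    rpe = r + gamma * q[s_next][a_next] - q[s][a]  # outcome minus expectation
    q[s][a] += alpha * rpe
    return rpe


def spe_update(T, s, a, s_next, eta=0.2):
    """Model-based step: update transition estimates from a state prediction error.

    T[state][action][next_state] holds estimated transition probabilities.
    The SPE is large when the observed successor state was considered unlikely.
    """
    spe = 1.0 - T[s][a][s_next]
    # Shrink every successor estimate, then add eta to the observed one.
    # For the observed state this equals T + eta * spe, and the row still sums to 1.
    T[s][a] = [(1.0 - eta) * p for p in T[s][a]]
    T[s][a][s_next] += eta
    return spe


# Toy setting (hypothetical sizes): 3 states, 2 actions.
n_states, n_actions = 3, 2
q = [[0.0] * n_actions for _ in range(n_states)]
T = [[[1.0 / n_states] * n_states for _ in range(n_actions)]
     for _ in range(n_states)]

rpe = rpe_update(q, s=0, a=1, r=1.0, s_next=2, a_next=0)
spe = spe_update(T, s=0, a=1, s_next=2)
print("RPE:", rpe, "updated value:", q[0][1])
print("SPE:", spe, "updated transition row:", T[0][1])
```

In this toy run the RPE depends on the reward received, whereas the SPE depends only on how unexpected the observed state transition was, which is the dissociation the abstract describes.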

Year: 2010; PMID: 20510862; PMCID: PMC2895323; DOI: 10.1016/j.neuron.2010.04.016

Source DB: PubMed; Journal: Neuron; ISSN: 0896-6273; Impact factor: 17.173


