Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.

Literature DB >> 20510862

States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.

Jan Gläscher¹, Nathaniel Daw, Peter Dayan, John P O'Doherty.

Abstract

Reinforcement learning (RL) uses sequential experience with situations ("states") and outcomes to assess actions. Whereas model-free RL uses this experience directly, in the form of a reward prediction error (RPE), model-based RL uses it indirectly, building a model of the state transition and outcome structure of the environment, and evaluating actions by searching this model. A state prediction error (SPE) plays a central role, reporting discrepancies between the current model and the observed state transitions. Using functional magnetic resonance imaging in humans solving a probabilistic Markov decision task, we found the neural signature of an SPE in the intraparietal sulcus and lateral prefrontal cortex, in addition to the previously well-characterized RPE in the ventral striatum. This finding supports the existence of two unique forms of learning signal in humans, which may form the basis of distinct computational strategies for guiding behavior. Copyright 2010 Elsevier Inc. All rights reserved.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2010 PMID： 20510862 PMCID： PMC2895323 DOI： 10.1016/j.neuron.2010.04.016

Source DB: PubMed Journal: Neuron ISSN： 0896-6273 Impact factor: 17.173

51 in total

1. Tracking the hemodynamic responses to reward and punishment in the striatum.

Authors: M R Delgado; L E Nystrom; C Fissell; D C Noll; J A Fiez
Journal: J Neurophysiol Date: 2000-12 Impact factor: 2.714

2. Anticipation of increasing monetary reward selectively recruits nucleus accumbens.

Authors: B Knutson; C M Adams; G W Fong; D Hommer
Journal: J Neurosci Date: 2001-08-15 Impact factor: 6.167

3. Multiple model-based reinforcement learning.

Authors: Kenji Doya; Kazuyuki Samejima; Ken-ichi Katagiri; Mitsuo Kawato
Journal: Neural Comput Date: 2002-06 Impact factor: 2.026

Review 4. The primate basal ganglia: parallel and integrative networks.

Authors: Suzanne N Haber
Journal: J Chem Neuroanat Date: 2003-12 Impact factor: 3.052

5. Matching behavior and the representation of value in the parietal cortex.

Authors: Leo P Sugrue; Greg S Corrado; William T Newsome
Journal: Science Date: 2004-06-18 Impact factor: 47.728

6. Voluntary orienting is dissociated from target detection in human posterior parietal cortex.

Authors: M Corbetta; J M Kincade; J M Ollinger; M P McAvoy; G L Shulman
Journal: Nat Neurosci Date: 2000-03 Impact factor: 24.884

Review 7. A common framework for perceptual learning.

Authors: Aaron R Seitz; Hubert R Dinse
Journal: Curr Opin Neurobiol Date: 2007-02-20 Impact factor: 6.627

8. Novelty and target processing during an auditory novelty oddball: a simultaneous event-related potential and functional magnetic resonance imaging study.

Authors: Alexander Strobel; Stefan Debener; Bettina Sorger; Judith C Peters; Cornelia Kranczioch; Karsten Hoechstetter; Andreas K Engel; Burkhard Brocke; Rainer Goebel
Journal: Neuroimage Date: 2007-12-15 Impact factor: 6.556

9. Regulating the expectation of reward via cognitive strategies.

Authors: Mauricio R Delgado; M Meredith Gillis; Elizabeth A Phelps
Journal: Nat Neurosci Date: 2008-06-29 Impact factor: 24.884

Review 10. A neural substrate of prediction and reward.

Authors: W Schultz; P Dayan; P R Montague
Journal: Science Date: 1997-03-14 Impact factor: 47.728

400 in total

1. The prefrontal cortex and hybrid learning during iterative competitive games.

Authors: Hiroshi Abe; Hyojung Seo; Daeyeol Lee
Journal: Ann N Y Acad Sci Date: 2011-12 Impact factor: 5.691

2. The brain's rose-colored glasses.

Authors: Keise Izuma; Ralph Adolphs
Journal: Nat Neurosci Date: 2011-10-26 Impact factor: 24.884

3. With age comes wisdom: decision making in younger and older adults.

Authors: Darrell A Worthy; Marissa A Gorlick; Jennifer L Pacheco; David M Schnyer; W Todd Maddox
Journal: Psychol Sci Date: 2011-09-29

4. A pallidus-habenula-dopamine pathway signals inferred stimulus values.

Authors: Ethan S Bromberg-Martin; Masayuki Matsumoto; Simon Hong; Okihide Hikosaka
Journal: J Neurophysiol Date: 2010-06-10 Impact factor: 2.714

5. Character studies.

Authors: Ming Hsu; Adrianna C Jenkins
Journal: Nat Neurosci Date: 2015-09 Impact factor: 24.884

6. Habit Learning by Naive Macaques Is Marked by Response Sharpening of Striatal Neurons Representing the Cost and Outcome of Acquired Action Sequences.

Authors: Theresa M Desrochers; Ken-ichi Amemori; Ann M Graybiel
Journal: Neuron Date: 2015-08-19 Impact factor: 17.173

7. Reward processing deficits and impulsivity in high-risk offspring of alcoholics: A study of event-related potentials during a monetary gambling task.

Authors: Chella Kamarajan; Ashwini K Pandey; David B Chorlian; Niklas Manz; Arthur T Stimus; Lance O Bauer; Victor M Hesselbrock; Marc A Schuckit; Samuel Kuperman; John Kramer; Bernice Porjesz
Journal: Int J Psychophysiol Date: 2015-09-18 Impact factor: 2.997