Literature DB >> 22487039

Generalization of value in reinforcement learning by humans.

G Elliott Wimmer1, Nathaniel D Daw, Daphna Shohamy.   

Abstract

Research in decision-making has focused on the role of dopamine and its striatal targets in guiding choices via learned stimulus-reward or stimulus-response associations, behavior that is well described by reinforcement learning theories. However, basic reinforcement learning is relatively limited in scope and does not explain how learning about stimulus regularities or relations may guide decision-making. A candidate mechanism for this type of learning comes from the domain of memory, which has highlighted a role for the hippocampus in learning of stimulus-stimulus relations, typically dissociated from the role of the striatum in stimulus-response learning. Here, we used functional magnetic resonance imaging and computational model-based analyses to examine the joint contributions of these mechanisms to reinforcement learning. Humans performed a reinforcement learning task with added relational structure, modeled after tasks used to isolate hippocampal contributions to memory. On each trial participants chose one of four options, but the reward probabilities for pairs of options were correlated across trials. This (uninstructed) relationship between pairs of options potentially enabled an observer to learn about option values based on experience with the other options and to generalize across them. We observed blood oxygen level-dependent (BOLD) activity related to learning in the striatum and also in the hippocampus. By comparing a basic reinforcement learning model to one augmented to allow feedback to generalize between correlated options, we tested whether choice behavior and BOLD activity were influenced by the opportunity to generalize across correlated options. Although such generalization goes beyond standard computational accounts of reinforcement learning and striatal BOLD, both choices and striatal BOLD activity were better explained by the augmented model. Consistent with the hypothesized role for the hippocampus in this generalization, functional connectivity between the ventral striatum and hippocampus was modulated, across participants, by the ability of the augmented model to capture participants' choice. Our results thus point toward an interactive model in which striatal reinforcement learning systems may employ relational representations typically associated with the hippocampus.
© 2012 The Authors. European Journal of Neuroscience © 2012 Federation of European Neuroscience Societies and Blackwell Publishing Ltd.

Entities:  

Mesh:

Year:  2012        PMID: 22487039      PMCID: PMC3404618          DOI: 10.1111/j.1460-9568.2012.08017.x

Source DB:  PubMed          Journal:  Eur J Neurosci        ISSN: 0953-816X            Impact factor:   3.386


  94 in total

1.  Anticipation of increasing monetary reward selectively recruits nucleus accumbens.

Authors:  B Knutson; C M Adams; G W Fong; D Hommer
Journal:  J Neurosci       Date:  2001-08-15       Impact factor: 6.167

2.  Matching behavior and the representation of value in the parietal cortex.

Authors:  Leo P Sugrue; Greg S Corrado; William T Newsome
Journal:  Science       Date:  2004-06-18       Impact factor: 47.728

Review 3.  Basal ganglia and dopamine contributions to probabilistic category learning.

Authors:  D Shohamy; C E Myers; J Kalanithi; M A Gluck
Journal:  Neurosci Biobehav Rev       Date:  2007-08-10       Impact factor: 8.989

Review 4.  Model-based fMRI and its application to reward learning and decision making.

Authors:  John P O'Doherty; Alan Hampton; Hackjin Kim
Journal:  Ann N Y Acad Sci       Date:  2007-04-07       Impact factor: 5.691

Review 5.  A framework for studying the neurobiology of value-based decision making.

Authors:  Antonio Rangel; Colin Camerer; P Read Montague
Journal:  Nat Rev Neurosci       Date:  2008-06-11       Impact factor: 34.870

Review 6.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

7.  Distinct value signals in anterior and posterior ventromedial prefrontal cortex.

Authors:  David V Smith; Benjamin Y Hayden; Trong-Kha Truong; Allen W Song; Michael L Platt; Scott A Huettel
Journal:  J Neurosci       Date:  2010-02-17       Impact factor: 6.167

8.  Integrating memories in the human brain: hippocampal-midbrain encoding of overlapping events.

Authors:  Daphna Shohamy; Anthony D Wagner
Journal:  Neuron       Date:  2008-10-23       Impact factor: 17.173

Review 9.  A unified framework for addiction: vulnerabilities in the decision process.

Authors:  A David Redish; Steve Jensen; Adam Johnson
Journal:  Behav Brain Sci       Date:  2008-08       Impact factor: 21.357

10.  Striatal activity underlies novelty-based choice in humans.

Authors:  Bianca C Wittmann; Nathaniel D Daw; Ben Seymour; Raymond J Dolan
Journal:  Neuron       Date:  2008-06-26       Impact factor: 17.173

View more
  46 in total

1.  Changes in corticostriatal connectivity during reinforcement learning in humans.

Authors:  Guillermo Horga; Tiago V Maia; Rachel Marsh; Xuejun Hao; Dongrong Xu; Yunsuo Duan; Gregory Z Tau; Barbara Graniello; Zhishun Wang; Alayar Kangarlu; Diana Martinez; Mark G Packard; Bradley S Peterson
Journal:  Hum Brain Mapp       Date:  2014-11-12       Impact factor: 5.038

2.  Informatic parcellation of the network involved in the computation of subjective value.

Authors:  John A Clithero; Antonio Rangel
Journal:  Soc Cogn Affect Neurosci       Date:  2013-07-24       Impact factor: 3.436

3.  Pain and the PAG: learning from painful mistakes.

Authors:  Falk Eippert; Irene Tracey
Journal:  Nat Neurosci       Date:  2014-11       Impact factor: 24.884

4.  Structured, uncertainty-driven exploration in real-world consumer choice.

Authors:  Eric Schulz; Rahul Bhui; Bradley C Love; Bastien Brier; Michael T Todd; Samuel J Gershman
Journal:  Proc Natl Acad Sci U S A       Date:  2019-06-24       Impact factor: 11.205

5.  Habits without values.

Authors:  Kevin J Miller; Amitai Shenhav; Elliot A Ludvig
Journal:  Psychol Rev       Date:  2019-01-24       Impact factor: 8.934

6.  Hippocampal contributions to value-based learning: Converging evidence from fMRI and amnesia.

Authors:  Daniela J Palombo; Scott M Hayes; Allison G Reid; Mieke Verfaellie
Journal:  Cogn Affect Behav Neurosci       Date:  2019-06       Impact factor: 3.282

7.  How glitter relates to gold: similarity-dependent reward prediction errors in the human striatum.

Authors:  Thorsten Kahnt; Soyoung Q Park; Christopher J Burke; Philippe N Tobler
Journal:  J Neurosci       Date:  2012-11-14       Impact factor: 6.167

Review 8.  Decision making: from neuroscience to psychiatry.

Authors:  Daeyeol Lee
Journal:  Neuron       Date:  2013-04-24       Impact factor: 17.173

9.  Multiple memory systems as substrates for multiple decision systems.

Authors:  Bradley B Doll; Daphna Shohamy; Nathaniel D Daw
Journal:  Neurobiol Learn Mem       Date:  2014-05-15       Impact factor: 2.877

10.  Action selection in multi-effector decision making.

Authors:  Seth Madlon-Kay; Bijan Pesaran; Nathaniel D Daw
Journal:  Neuroimage       Date:  2012-12-07       Impact factor: 6.556

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.