Instructional control of reinforcement learning: a behavioral and neurocomputational investigation.

Bradley B Doll, W Jake Jacobs, Alan G Sanfey, Michael J Frank.

Abstract

Humans learn how to behave directly through environmental experience and indirectly through rules and instructions. Behavior analytic research has shown that instructions can control behavior, even when such behavior leads to sub-optimal outcomes (Hayes, S. (Ed.). 1989. Rule-governed behavior: cognition, contingencies, and instructional control. Plenum Press.). Here we examine the control of behavior through instructions in a reinforcement learning task known to depend on striatal dopaminergic function. Participants selected between probabilistically reinforced stimuli, and were (incorrectly) told that a specific stimulus had the highest (or lowest) reinforcement probability. Despite experience to the contrary, instructions drove choice behavior. We present neural network simulations that capture the interactions between instruction-driven and reinforcement-driven behavior via two potential neural circuits: one in which the striatum is inaccurately trained by instruction representations coming from prefrontal cortex/hippocampus (PFC/HC), and another in which the striatum learns the environmentally based reinforcement contingencies, but is "overridden" at decision output. Both models capture the core behavioral phenomena but, because they differ fundamentally on what is learned, make distinct predictions for subsequent behavioral and neuroimaging experiments. Finally, we attempt to distinguish between the proposed computational mechanisms governing instructed behavior by fitting a series of abstract "Q-learning" and Bayesian models to subject data. The best-fitting model supports one of the neural models, suggesting the existence of a "confirmation bias" in which the PFC/HC system trains the reinforcement system by amplifying outcomes that are consistent with instructions while diminishing inconsistent outcomes.
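The two candidate mechanisms described in the abstract can be contrasted in a small simulation. The following is a minimal sketch, not the authors' fitted model: the function names, the `gain`/`loss` scaling factors, the `bonus` term, and all parameter values (learning rate, softmax temperature, reward probabilities) are assumptions chosen for illustration, and it covers only the case where the instructed stimulus was described as the best one. In "bias" mode the learning rate for the instructed stimulus is scaled up for instruction-consistent outcomes and down for inconsistent ones; in "override" mode learning is unbiased and the instruction acts only at decision output.

```python
import math
import random


def choice_prob(q_inst, q_other, beta, bonus=0.0):
    """Softmax probability of choosing the instructed stimulus.

    `bonus` is a hypothetical term modelling an override at decision output.
    """
    return 1.0 / (1.0 + math.exp(-beta * (q_inst + bonus - q_other)))


def update(q, reward, alpha, gain=1.0, loss=1.0):
    """One Q-learning step; `gain`/`loss` rescale the learning rate.

    gain > 1 amplifies positive prediction errors, loss < 1 shrinks
    negative ones (an assumed form of the confirmation bias).
    """
    delta = reward - q
    scale = gain if delta > 0 else loss
    rate = min(alpha * scale, 1.0)  # keep the effective rate in [0, 1]
    return q + rate * delta


def simulate(mode, n_trials=200, p_inst=0.4, p_other=0.6,
             alpha=0.1, beta=5.0, seed=0):
    """Return (fraction of instructed choices, learned value of instructed stimulus).

    mode: "bias" (biased learning), "override" (biased choice), or "none".
    """
    rng = random.Random(seed)
    q_inst = q_other = 0.5
    n_inst = 0
    bonus = 0.3 if mode == "override" else 0.0
    gain, loss = (2.0, 0.5) if mode == "bias" else (1.0, 1.0)
    for _ in range(n_trials):
        if rng.random() < choice_prob(q_inst, q_other, beta, bonus):
            reward = 1.0 if rng.random() < p_inst else 0.0
            q_inst = update(q_inst, reward, alpha, gain, loss)
            n_inst += 1
        else:
            reward = 1.0 if rng.random() < p_other else 0.0
            q_other = update(q_other, reward, alpha)  # uninstructed: unbiased
    return n_inst / n_trials, q_inst


if __name__ == "__main__":
    for mode in ("none", "bias", "override"):
        frac, q = simulate(mode)
        print(f"{mode:8s} instructed choices={frac:.2f} learned Q={q:.2f}")
```

Returning the learned value alongside the choice fraction reflects the abstract's point that the two accounts differ in what is learned, not only in what is chosen.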

Year:  2009        PMID: 19595993      PMCID: PMC3050481          DOI: 10.1016/j.brainres.2009.07.007

Source DB:  PubMed          Journal:  Brain Res        ISSN: 0006-8993            Impact factor:   3.252


Cited by (81 in total):

Review 1.  Adaptation, expertise, and giftedness: towards an understanding of cortical, subcortical, and cerebellar network contributions.

Authors:  Leonard F Koziol; Deborah Ely Budding; Dana Chidekel
Journal:  Cerebellum       Date:  2010-12       Impact factor: 3.847

2.  Social stress reactivity alters reward and punishment learning.

Authors:  James F Cavanagh; Michael J Frank; John J B Allen
Journal:  Soc Cogn Affect Neurosci       Date:  2010-05-07       Impact factor: 3.436

Review 3.  Developmental perspectives on risky and impulsive choice.

Authors:  Gail M Rosenbaum; Catherine A Hartley
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2019-02-18       Impact factor: 6.237

4.  Probabilistic reinforcement learning in adults with autism spectrum disorders.

Authors:  Marjorie Solomon; Anne C Smith; Michael J Frank; Stanford Ly; Cameron S Carter
Journal:  Autism Res       Date:  2011-03-18       Impact factor: 5.216

5.  Modulation of the feedback-related negativity by instruction and experience.

Authors:  Matthew M Walsh; John R Anderson
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-07       Impact factor: 11.205

6.  Dorsolateral prefrontal cortex drives mesolimbic dopaminergic regions to initiate motivated behavior.

Authors:  Ian C Ballard; Vishnu P Murty; R McKell Carter; Jeffrey J MacInnes; Scott A Huettel; R Alison Adcock
Journal:  J Neurosci       Date:  2011-07-13       Impact factor: 6.167

7.  Risk-taking behavior: dopamine D2/D3 receptors, feedback, and frontolimbic activity.

Authors:  Milky Kohno; Dara G Ghahremani; Angelica M Morales; Chelsea L Robertson; Kenji Ishibashi; Andrew T Morgan; Mark A Mandelkern; Edythe D London
Journal:  Cereb Cortex       Date:  2013-08-21       Impact factor: 5.357

8.  Neural signatures of experience-based improvements in deterministic decision-making.

Authors:  Joshua J Tremel; Patryk A Laurent; David A Wolk; Mark E Wheeler; Julie A Fiez
Journal:  Behav Brain Res       Date:  2016-08-11       Impact factor: 3.332

9.  The Outcome-Representation Learning Model: A Novel Reinforcement Learning Model of the Iowa Gambling Task.

Authors:  Nathaniel Haines; Jasmin Vassileva; Woo-Young Ahn
Journal:  Cogn Sci       Date:  2018-10-05

Review 10.  Rapid instructed task learning: a new window into the human brain's unique capacity for flexible cognitive control.

Authors:  Michael W Cole; Patryk Laurent; Andrea Stocco
Journal:  Cogn Affect Behav Neurosci       Date:  2013-03       Impact factor: 3.282
