Literature DB >> 22487033

How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis.

Anne G E Collins1, Michael J Frank.   

Abstract

Instrumental learning involves corticostriatal circuitry and the dopaminergic system. This system is typically modeled in the reinforcement learning (RL) framework by incrementally accumulating reward values of states and actions. However, human learning also implicates prefrontal cortical mechanisms involved in higher level cognitive functions. The interaction of these systems remains poorly understood, and models of human behavior often ignore working memory (WM) and therefore incorrectly assign behavioral variance to the RL system. Here we designed a task that highlights the profound entanglement of these two processes, even in simple learning problems. By systematically varying the size of the learning problem and delay between stimulus repetitions, we separately extracted WM-specific effects of load and delay on learning. We propose a new computational model that accounts for the dynamic integration of RL and WM processes observed in subjects' behavior. Incorporating capacity-limited WM into the model allowed us to capture behavioral variance that could not be captured in a pure RL framework even if we (implausibly) allowed separate RL systems for each set size. The WM component also allowed for a more reasonable estimation of a single RL process. Finally, we report effects of two genetic polymorphisms having relative specificity for prefrontal and basal ganglia functions. Whereas the COMT gene coding for catechol-O-methyl transferase selectively influenced model estimates of WM capacity, the GPR6 gene coding for G-protein-coupled receptor 6 influenced the RL learning rate. Thus, this study allowed us to specify distinct influences of the high-level and low-level cognitive functions on instrumental learning, beyond the possibilities offered by simple RL models.
© 2012 The Authors. European Journal of Neuroscience © 2012 Federation of European Neuroscience Societies and Blackwell Publishing Ltd.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22487033      PMCID: PMC3390186          DOI: 10.1111/j.1460-9568.2011.07980.x

Source DB:  PubMed          Journal:  Eur J Neurosci        ISSN: 0953-816X            Impact factor:   3.386


  65 in total

1.  Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops.

Authors:  Saori C Tanaka; Kenji Doya; Go Okada; Kazutaka Ueda; Yasumasa Okamoto; Shigeto Yamawaki
Journal:  Nat Neurosci       Date:  2004-07-04       Impact factor: 24.884

Review 2.  Model-based fMRI and its application to reward learning and decision making.

Authors:  John P O'Doherty; Alan Hampton; Hackjin Kim
Journal:  Ann N Y Acad Sci       Date:  2007-04-07       Impact factor: 5.691

3.  Discrete fixed-resolution representations in visual working memory.

Authors:  Weiwei Zhang; Steven J Luck
Journal:  Nature       Date:  2008-04-02       Impact factor: 49.962

Review 4.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

5.  The number and quality of representations in working memory.

Authors:  Weiwei Zhang; Steven J Luck
Journal:  Psychol Sci       Date:  2011-10-10

6.  Effect of catechol-O-methyltransferase val158met genotype on attentional control.

Authors:  Giuseppe Blasi; Venkata S Mattay; Alessandro Bertolino; Brita Elvevåg; Joseph H Callicott; Saumitra Das; Bhaskar S Kolachana; Michael F Egan; Terry E Goldberg; Daniel R Weinberger
Journal:  J Neurosci       Date:  2005-05-18       Impact factor: 6.167

7.  Catechol-o-methyltransferase inhibition improves set-shifting performance and elevates stimulated dopamine release in the rat prefrontal cortex.

Authors:  E M Tunbridge; D M Bannerman; T Sharp; P J Harrison
Journal:  J Neurosci       Date:  2004-06-09       Impact factor: 6.167

8.  Dynamic shifts of limited working memory resources in human vision.

Authors:  Paul M Bays; Masud Husain
Journal:  Science       Date:  2008-08-08       Impact factor: 47.728

9.  Influence of COMT gene polymorphism on fMRI-assessed sustained and transient activity during a working memory task.

Authors:  Cindy M de Frias; Petter Marklund; Elias Eriksson; Anne Larsson; Lena Oman; Kristina Annerbrink; Lars Bäckman; Lars-Göran Nilsson; Lars Nyberg
Journal:  J Cogn Neurosci       Date:  2010-07       Impact factor: 3.225

10.  Impulsive personality predicts dopamine-dependent changes in frontostriatal activity during component processes of working memory.

Authors:  Roshan Cools; Margaret Sheridan; Emily Jacobs; Mark D'Esposito
Journal:  J Neurosci       Date:  2007-05-16       Impact factor: 6.167

View more
  105 in total

Review 1.  Motivational Deficits in Schizophrenia and the Representation of Expected Value.

Authors:  James A Waltz; James M Gold
Journal:  Curr Top Behav Neurosci       Date:  2016

2.  An Obesity-Predisposing Variant of the FTO Gene Regulates D2R-Dependent Reward Learning.

Authors:  Meltem Sevgi; Lionel Rigoux; Anne B Kühn; Jan Mauer; Leonhard Schilbach; Martin E Hess; Theo O J Gruendler; Markus Ullsperger; Klaas Enno Stephan; Jens C Brüning; Marc Tittgemeyer
Journal:  J Neurosci       Date:  2015-09-09       Impact factor: 6.167

3.  Feedback-driven trial-by-trial learning in autism spectrum disorders.

Authors:  Marjorie Solomon; Michael J Frank; J Daniel Ragland; Anne C Smith; Tara A Niendam; Tyler A Lesh; David S Grayson; Jonathan S Beck; John C Matter; Cameron S Carter
Journal:  Am J Psychiatry       Date:  2014-10-31       Impact factor: 18.112

4.  Selective maintenance of value information helps resolve the exploration/exploitation dilemma.

Authors:  Michael N Hallquist; Alexandre Y Dombrovski
Journal:  Cognition       Date:  2018-11-28

Review 5.  Reinforcement learning models and their neural correlates: An activation likelihood estimation meta-analysis.

Authors:  Henry W Chase; Poornima Kumar; Simon B Eickhoff; Alexandre Y Dombrovski
Journal:  Cogn Affect Behav Neurosci       Date:  2015-06       Impact factor: 3.282

6.  Dopaminergic Genetic Polymorphisms Predict Rule-based Category Learning.

Authors:  Kaileigh A Byrne; Tyler Davis; Darrell A Worthy
Journal:  J Cogn Neurosci       Date:  2016-02-26       Impact factor: 3.225

Review 7.  Age-related variability in decision-making: Insights from neurochemistry.

Authors:  Anne S Berry; William J Jagust; Ming Hsu
Journal:  Cogn Affect Behav Neurosci       Date:  2019-06       Impact factor: 3.282

Review 8.  Towards a better understanding of the cannabinoid-related orphan receptors GPR3, GPR6, and GPR12.

Authors:  Paula Morales; Israa Isawi; Patricia H Reggio
Journal:  Drug Metab Rev       Date:  2018-02-01       Impact factor: 4.518

9.  Taming the beast: extracting generalizable knowledge from computational models of cognition.

Authors:  Matthew R Nassar; Michael J Frank
Journal:  Curr Opin Behav Sci       Date:  2016-10

10.  The drift diffusion model as the choice rule in reinforcement learning.

Authors:  Mads Lund Pedersen; Michael J Frank; Guido Biele
Journal:  Psychon Bull Rev       Date:  2017-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.