Literature DB >> 33229518

Computational evidence for hierarchically structured reinforcement learning in humans.

Maria K Eckstein1, Anne G E Collins2.   

Abstract

Humans have the fascinating ability to achieve goals in a complex and constantly changing world, still surpassing modern machine-learning algorithms in terms of flexibility and learning speed. It is generally accepted that a crucial factor for this ability is the use of abstract, hierarchical representations, which employ structure in the environment to guide learning and decision making. Nevertheless, how we create and use these hierarchical representations is poorly understood. This study presents evidence that human behavior can be characterized as hierarchical reinforcement learning (RL). We designed an experiment to test specific predictions of hierarchical RL using a series of subtasks in the realm of context-based learning and observed several behavioral markers of hierarchical RL, such as asymmetric switch costs between changes in higher-level versus lower-level features, faster learning in higher-valued compared to lower-valued contexts, and preference for higher-valued compared to lower-valued contexts. We replicated these results across three independent samples. We simulated three models-a classic RL, a hierarchical RL, and a hierarchical Bayesian model-and compared their behavior to human results. While the flat RL model captured some aspects of participants' sensitivity to outcome values, and the hierarchical Bayesian model captured some markers of transfer, only hierarchical RL accounted for all patterns observed in human behavior. This work shows that hierarchical RL, a biologically inspired and computationally simple algorithm, can capture human behavior in complex, hierarchical environments and opens the avenue for future research in this field.

Entities:  

Keywords:  computational modeling; hierarchy; reinforcement learning; structure learning; task-sets

Year:  2020        PMID: 33229518      PMCID: PMC7703642          DOI: 10.1073/pnas.1912330117

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  45 in total

Review 1.  Hierarchical reinforcement learning and decision making.

Authors:  Matthew Michael Botvinick
Journal:  Curr Opin Neurobiol       Date:  2012-06-11       Impact factor: 6.627

2.  Dopamine-mediated reinforcement learning signals in the striatum and ventromedial prefrontal cortex underlie value-based choices.

Authors:  Gerhard Jocham; Tilmann A Klein; Markus Ullsperger
Journal:  J Neurosci       Date:  2011-02-02       Impact factor: 6.167

3.  Mechanisms of hierarchical reinforcement learning in cortico-striatal circuits 2: evidence from fMRI.

Authors:  David Badre; Michael J Frank
Journal:  Cereb Cortex       Date:  2011-06-21       Impact factor: 5.357

4.  Reinforcement learning in multidimensional environments relies on attention mechanisms.

Authors:  Yael Niv; Reka Daniel; Andra Geana; Samuel J Gershman; Yuan Chang Leong; Angela Radulescu; Robert C Wilson
Journal:  J Neurosci       Date:  2015-05-27       Impact factor: 6.167

5.  On The Necessity of Abstraction.

Authors:  George Konidaris
Journal:  Curr Opin Behav Sci       Date:  2018-12-14

6.  The successor representation in human reinforcement learning.

Authors:  I Momennejad; E M Russek; J H Cheong; M M Botvinick; N D Daw; S J Gershman
Journal:  Nat Hum Behav       Date:  2017-08-28

7.  Prefrontal cortex as a meta-reinforcement learning system.

Authors:  Jane X Wang; Zeb Kurth-Nelson; Dharshan Kumaran; Dhruva Tirumala; Hubert Soyer; Joel Z Leibo; Demis Hassabis; Matthew Botvinick
Journal:  Nat Neurosci       Date:  2018-05-14       Impact factor: 24.884

8.  The Cost of Structure Learning.

Authors:  Anne G E Collins
Journal:  J Cogn Neurosci       Date:  2017-03-30       Impact factor: 3.225

Review 9.  Is the rostro-caudal axis of the frontal lobe hierarchical?

Authors:  David Badre; Mark D'Esposito
Journal:  Nat Rev Neurosci       Date:  2009-08-12       Impact factor: 34.870

10.  Feature-based learning improves adaptability without compromising precision.

Authors:  Shiva Farashahi; Katherine Rowe; Zohra Aslami; Daeyeol Lee; Alireza Soltani
Journal:  Nat Commun       Date:  2017-11-24       Impact factor: 14.919

View more
  11 in total

1.  The brain produces mind by modeling.

Authors:  Richard M Shiffrin; Danielle S Bassett; Nikolaus Kriegeskorte; Joshua B Tenenbaum
Journal:  Proc Natl Acad Sci U S A       Date:  2020-11-24       Impact factor: 11.205

2.  The Role of Executive Function in Shaping Reinforcement Learning.

Authors:  Milena Rmus; Samuel D McDougle; Anne G E Collins
Journal:  Curr Opin Behav Sci       Date:  2020-11-14

3.  What do Reinforcement Learning Models Measure? Interpreting Model Parameters in Cognition and Neuroscience.

Authors:  Maria K Eckstein; Linda Wilbrecht; Anne G E Collins
Journal:  Curr Opin Behav Sci       Date:  2021-07-03

Review 4.  Advances in modeling learning and decision-making in neuroscience.

Authors:  Anne G E Collins; Amitai Shenhav
Journal:  Neuropsychopharmacology       Date:  2021-08-27       Impact factor: 7.853

5.  Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.

Authors:  Jaron T Colas; Neil M Dundon; Raphael T Gerraty; Natalie M Saragosa-Harris; Karol P Szymula; Koranis Tanwisuth; J Michael Tyszka; Camilla van Geen; Harang Ju; Arthur W Toga; Joshua I Gold; Dani S Bassett; Catherine A Hartley; Daphna Shohamy; Scott T Grafton; John P O'Doherty
Journal:  Hum Brain Mapp       Date:  2022-07-21       Impact factor: 5.399

Review 6.  Hierarchical Reinforcement Learning, Sequential Behavior, and the Dorsal Frontostriatal System.

Authors:  Miriam Janssen; Christopher LeWarne; Diana Burk; Bruno B Averbeck
Journal:  J Cogn Neurosci       Date:  2022-07-01       Impact factor: 3.420

7.  How the Mind Creates Structure: Hierarchical Learning of Action Sequences.

Authors:  Maria K Eckstein; Anne G E Collins
Journal:  Cogsci       Date:  2021-07

8.  Eye movements reveal spatiotemporal dynamics of visually-informed planning in navigation.

Authors:  Seren Zhu; Kaushik J Lakshminarasimhan; Nastaran Arfaei; Dora E Angelaki
Journal:  Elife       Date:  2022-05-03       Impact factor: 8.713

9.  Temporal and state abstractions for efficient learning, transfer, and composition in humans.

Authors:  Liyu Xia; Anne G E Collins
Journal:  Psychol Rev       Date:  2021-05-20       Impact factor: 8.247

10.  A First Principles Approach to Subjective Experience.

Authors:  Brian Key; Oressia Zalucki; Deborah J Brown
Journal:  Front Syst Neurosci       Date:  2022-02-16
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.