Literature DB >> 21791294

A neural signature of hierarchical reinforcement learning.

José J F Ribas-Fernandes1, Alec Solway, Carlos Diuk, Joseph T McGuire, Andrew G Barto, Yael Niv, Matthew M Botvinick.   

Abstract

Human behavior displays hierarchical structure: simple actions cohere into subtask sequences, which work together to accomplish overall task goals. Although the neural substrates of such hierarchy have been the target of increasing research, they remain poorly understood. We propose that the computations supporting hierarchical behavior may relate to those in hierarchical reinforcement learning (HRL), a machine-learning framework that extends reinforcement-learning mechanisms into hierarchical domains. To test this, we leveraged a distinctive prediction arising from HRL. In ordinary reinforcement learning, reward prediction errors are computed when there is an unanticipated change in the prospects for accomplishing overall task goals. HRL entails that prediction errors should also occur in relation to task subgoals. In three neuroimaging studies we observed neural responses consistent with such subgoal-related reward prediction errors, within structures previously implicated in reinforcement learning. The results reported support the relevance of HRL to the neural processes underlying hierarchical behavior.
Copyright © 2011 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21791294      PMCID: PMC3145918          DOI: 10.1016/j.neuron.2011.05.042

Source DB:  PubMed          Journal:  Neuron        ISSN: 0896-6273            Impact factor:   17.173


  48 in total

Review 1.  Model-based fMRI and its application to reward learning and decision making.

Authors:  John P O'Doherty; Alan Hampton; Hackjin Kim
Journal:  Ann N Y Acad Sci       Date:  2007-04-07       Impact factor: 5.691

2.  Prefrontal organization of cognitive control according to levels of abstraction.

Authors:  Kalina Christoff; Kamyar Keramatian; Alan M Gordon; Rachelle Smith; Burkhard Mädler
Journal:  Brain Res       Date:  2009-06-06       Impact factor: 3.252

3.  Event-related brain potentials following incorrect feedback in a time-estimation task: evidence for a "generic" neural system for error detection.

Authors:  W H Miltner; C H Braun; M G Coles
Journal:  J Cogn Neurosci       Date:  1997-11       Impact factor: 3.225

4.  Reinforcement learning and higher level cognition: introduction to special issue.

Authors:  Nathaniel D Daw; Michael J Frank
Journal:  Cognition       Date:  2009-10-12

5.  Evidence for hierarchical error processing in the human brain.

Authors:  O E Krigolson; C B Holroyd
Journal:  Neuroscience       Date:  2005-12-15       Impact factor: 3.590

6.  The Psychophysics Toolbox.

Authors:  D H Brainard
Journal:  Spat Vis       Date:  1997

Review 7.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

8.  BOLD Responses to Negative Reward Prediction Errors in Human Habenula.

Authors:  Ramiro Salas; Philip Baldwin; Mariella de Biasi; P Read Montague
Journal:  Front Hum Neurosci       Date:  2010-05-11       Impact factor: 3.169

9.  Motivation and cognitive control in the human prefrontal cortex.

Authors:  Frédérique Kouneiher; Sylvain Charron; Etienne Koechlin
Journal:  Nat Neurosci       Date:  2009-06-07       Impact factor: 24.884

10.  Hierarchical cognitive control deficits following damage to the human frontal lobe.

Authors:  David Badre; Joshua Hoffman; Jeffrey W Cooney; Mark D'Esposito
Journal:  Nat Neurosci       Date:  2009-03-01       Impact factor: 24.884

View more
  55 in total

1.  Evidence integration in model-based tree search.

Authors:  Alec Solway; Matthew M Botvinick
Journal:  Proc Natl Acad Sci U S A       Date:  2015-08-31       Impact factor: 11.205

Review 2.  Navigating complex decision spaces: Problems and paradigms in sequential choice.

Authors:  Matthew M Walsh; John R Anderson
Journal:  Psychol Bull       Date:  2013-07-08       Impact factor: 17.737

Review 3.  The expected value of control: an integrative theory of anterior cingulate cortex function.

Authors:  Amitai Shenhav; Matthew M Botvinick; Jonathan D Cohen
Journal:  Neuron       Date:  2013-07-24       Impact factor: 17.173

4.  On the value of information and other rewards.

Authors:  Yael Niv; Stephanie Chan
Journal:  Nat Neurosci       Date:  2011-08-26       Impact factor: 24.884

5.  Evolution of protolinguistic abilities as a by-product of learning to forage in structured environments.

Authors:  Oren Kolodny; Shimon Edelman; Arnon Lotem
Journal:  Proc Biol Sci       Date:  2015-07-22       Impact factor: 5.349

6.  Reward-based contextual learning supported by anterior cingulate cortex.

Authors:  Akina Umemoto; Azadeh HajiHosseini; Michael E Yates; Clay B Holroyd
Journal:  Cogn Affect Behav Neurosci       Date:  2017-06       Impact factor: 3.282

7.  How the inference of hierarchical rules unfolds over time.

Authors:  Maria K Eckstein; Ariel Starr; Silvia A Bunge
Journal:  Cognition       Date:  2019-02-01

8.  Neural Signatures of Prediction Errors in a Decision-Making Task Are Modulated by Action Execution Failures.

Authors:  Samuel D McDougle; Peter A Butcher; Darius E Parvin; Fasial Mushtaq; Yael Niv; Richard B Ivry; Jordan A Taylor
Journal:  Curr Biol       Date:  2019-05-02       Impact factor: 10.834

9.  Hierarchical learning induces two simultaneous, but separable, prediction errors in human basal ganglia.

Authors:  Carlos Diuk; Karin Tsai; Jonathan Wallis; Matthew Botvinick; Yael Niv
Journal:  J Neurosci       Date:  2013-03-27       Impact factor: 6.167

10.  Learning to represent reward structure: a key to adapting to complex environments.

Authors:  Hiroyuki Nakahara; Okihide Hikosaka
Journal:  Neurosci Res       Date:  2012-10-13       Impact factor: 3.304

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.