Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A neural signature of hierarchical reinforcement learning.

Literature DB >> 21791294

A neural signature of hierarchical reinforcement learning.

José J F Ribas-Fernandes¹, Alec Solway, Carlos Diuk, Joseph T McGuire, Andrew G Barto, Yael Niv, Matthew M Botvinick.

Abstract

Human behavior displays hierarchical structure: simple actions cohere into subtask sequences, which work together to accomplish overall task goals. Although the neural substrates of such hierarchy have been the target of increasing research, they remain poorly understood. We propose that the computations supporting hierarchical behavior may relate to those in hierarchical reinforcement learning (HRL), a machine-learning framework that extends reinforcement-learning mechanisms into hierarchical domains. To test this, we leveraged a distinctive prediction arising from HRL. In ordinary reinforcement learning, reward prediction errors are computed when there is an unanticipated change in the prospects for accomplishing overall task goals. HRL entails that prediction errors should also occur in relation to task subgoals. In three neuroimaging studies we observed neural responses consistent with such subgoal-related reward prediction errors, within structures previously implicated in reinforcement learning. The results reported support the relevance of HRL to the neural processes underlying hierarchical behavior.

Entities: Chemical Disease Gene Species

Mesh：

Substances：
Oxygen

Year: 2011 PMID： 21791294 PMCID： PMC3145918 DOI： 10.1016/j.neuron.2011.05.042

Source DB: PubMed Journal: Neuron ISSN： 0896-6273 Impact factor: 17.173

48 in total

Review 1. Model-based fMRI and its application to reward learning and decision making.

Authors: John P O'Doherty; Alan Hampton; Hackjin Kim
Journal: Ann N Y Acad Sci Date: 2007-04-07 Impact factor: 5.691

2. Prefrontal organization of cognitive control according to levels of abstraction.

Authors: Kalina Christoff; Kamyar Keramatian; Alan M Gordon; Rachelle Smith; Burkhard Mädler
Journal: Brain Res Date: 2009-06-06 Impact factor: 3.252

3. Event-related brain potentials following incorrect feedback in a time-estimation task: evidence for a "generic" neural system for error detection.

Authors: W H Miltner; C H Braun; M G Coles
Journal: J Cogn Neurosci Date: 1997-11 Impact factor: 3.225

A neural signature of hierarchical reinforcement learning.

Review 1. Model-based fMRI and its application to reward learning and decision making.

2. Prefrontal organization of cognitive control according to levels of abstraction.

3. Event-related brain potentials following incorrect feedback in a time-estimation task: evidence for a "generic" neural system for error detection.

4. Reinforcement learning and higher level cognition: introduction to special issue.

5. Evidence for hierarchical error processing in the human brain.

6. The Psychophysics Toolbox.

Review 7. A neural substrate of prediction and reward.

8. BOLD Responses to Negative Reward Prediction Errors in Human Habenula.

9. Motivation and cognitive control in the human prefrontal cortex.

10. Hierarchical cognitive control deficits following damage to the human frontal lobe.

1. Evidence integration in model-based tree search.

Review 2. Navigating complex decision spaces: Problems and paradigms in sequential choice.

Review 3. The expected value of control: an integrative theory of anterior cingulate cortex function.

4. On the value of information and other rewards.

5. Evolution of protolinguistic abilities as a by-product of learning to forage in structured environments.

6. Reward-based contextual learning supported by anterior cingulate cortex.

7. How the inference of hierarchical rules unfolds over time.

8. Neural Signatures of Prediction Errors in a Decision-Making Task Are Modulated by Action Execution Failures.

9. Hierarchical learning induces two simultaneous, but separable, prediction errors in human basal ganglia.

10. Learning to represent reward structure: a key to adapting to complex environments.