Literature DB >> 33479182

Human subjects exploit a cognitive map for credit assignment.

Rani Moran1,2, Peter Dayan3,4, Raymond J Dolan5,2.   

Abstract

An influential reinforcement learning framework proposes that behavior is jointly governed by model-free (MF) and model-based (MB) controllers. The former learns the values of actions directly from past encounters, and the latter exploits a cognitive map of the task to calculate these prospectively. Considerable attention has been paid to how these systems interact during choice, but how and whether knowledge of a cognitive map contributes to the way MF and MB controllers assign credit (i.e., to how they revaluate actions and states following the receipt of an outcome) remains underexplored. Here, we examine such sophisticated credit assignment using a dual-outcome bandit task. We provide evidence that knowledge of a cognitive map influences credit assignment in both MF and MB systems, mediating subtly different aspects of apparent relevance. Specifically, we show MF credit assignment is enhanced for those rewards that are related to a choice, and this contrasted with choice-unrelated rewards that reinforced subsequent choices negatively. This modulation is only possible based on knowledge of task structure. On the other hand, MB credit assignment was boosted for outcomes that impacted on differences in values between offered bandits. We consider mechanistic accounts and the normative status of these findings. We suggest the findings extend the scope and sophistication of cognitive map-based credit assignment during reinforcement learning, with implications for understanding behavioral control.
Copyright © 2021 the Author(s). Published by PNAS.

Entities:  

Keywords:  cognitive maps; decision making; model-based; model-free; reinforcement learning

Mesh:

Year:  2021        PMID: 33479182      PMCID: PMC7848688          DOI: 10.1073/pnas.2016884118

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   12.779


  40 in total

1.  Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning.

Authors:  Henry H Yin; Barbara J Knowlton; Bernard W Balleine
Journal:  Eur J Neurosci       Date:  2004-01       Impact factor: 3.386

2.  The role of the dorsomedial striatum in instrumental conditioning.

Authors:  Henry H Yin; Sean B Ostlund; Barbara J Knowlton; Bernard W Balleine
Journal:  Eur J Neurosci       Date:  2005-07       Impact factor: 3.386

Review 3.  Learning task-state representations.

Authors:  Yael Niv
Journal:  Nat Neurosci       Date:  2019-09-24       Impact factor: 24.884

4.  Old processes, new perspectives: Familiarity is correlated with (not independent of) recollection and is more (not equally) variable for targets than for lures.

Authors:  Rani Moran; Yonatan Goshen-Gottstein
Journal:  Cogn Psychol       Date:  2015-04-17       Impact factor: 3.468

Review 5.  What Is a Cognitive Map? Organizing Knowledge for Flexible Behavior.

Authors:  Timothy E J Behrens; Timothy H Muller; James C R Whittington; Shirley Mark; Alon B Baram; Kimberly L Stachenfeld; Zeb Kurth-Nelson
Journal:  Neuron       Date:  2018-10-24       Impact factor: 17.173

6.  States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.

Authors:  Jan Gläscher; Nathaniel Daw; Peter Dayan; John P O'Doherty
Journal:  Neuron       Date:  2010-05-27       Impact factor: 17.173

Review 7.  Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action.

Authors:  Bernard W Balleine; John P O'Doherty
Journal:  Neuropsychopharmacology       Date:  2010-01       Impact factor: 7.853

8.  Coordination of actions and habits in the medial prefrontal cortex of rats.

Authors:  Simon Killcross; Etienne Coutureau
Journal:  Cereb Cortex       Date:  2003-04       Impact factor: 5.357

9.  Speed/accuracy trade-off between the habitual and the goal-directed processes.

Authors:  Mehdi Keramati; Amir Dezfouli; Payam Piray
Journal:  PLoS Comput Biol       Date:  2011-05-26       Impact factor: 4.475

10.  Disruption of dorsolateral prefrontal cortex decreases model-based in favor of model-free control in humans.

Authors:  Peter Smittenaar; Thomas H B FitzGerald; Vincenzo Romei; Nicholas D Wright; Raymond J Dolan
Journal:  Neuron       Date:  2013-10-24       Impact factor: 17.173

View more
  6 in total

1.  Dopamine enhances model-free credit assignment through boosting of retrospective model-based inference.

Authors:  Lorenz Deserno; Rani Moran; Jochen Michely; Ying Lee; Peter Dayan; Raymond J Dolan
Journal:  Elife       Date:  2021-12-09       Impact factor: 8.140

2.  Dopaminergic challenge dissociates learning from primary versus secondary sources of information.

Authors:  Alicia J Rybicki; Sophie L Sowden; Bianca Schuster; Jennifer L Cook
Journal:  Elife       Date:  2022-03-15       Impact factor: 8.713

3.  Assigning the right credit to the wrong action: compulsivity in the general population is associated with augmented outcome-irrelevant value-based learning.

Authors:  Nitzan Shahar; Tobias U Hauser; Rani Moran; Michael Moutoussis; Edward T Bullmore; Raymond J Dolan
Journal:  Transl Psychiatry       Date:  2021-11-05       Impact factor: 7.989

4.  Model-based learning retrospectively updates model-free values.

Authors:  Max Doody; Maaike M H Van Swieten; Sanjay G Manohar
Journal:  Sci Rep       Date:  2022-02-11       Impact factor: 4.996

5.  Efficiency and prioritization of inference-based credit assignment.

Authors:  Rani Moran; Peter Dayan; Raymond J Dolan
Journal:  Curr Biol       Date:  2021-04-21       Impact factor: 10.834

6.  Executive Function Assigns Value to Novel Goal-Congruent Outcomes.

Authors:  Samuel D McDougle; Ian C Ballard; Beth Baribault; Sonia J Bishop; Anne G E Collins
Journal:  Cereb Cortex       Date:  2021-11-23       Impact factor: 4.861

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.