Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Human subjects exploit a cognitive map for credit assignment.

Literature DB >> 33479182

Human subjects exploit a cognitive map for credit assignment.

Rani Moran^1,2, Peter Dayan^3,4, Raymond J Dolan^5,2.

Abstract

An influential reinforcement learning framework proposes that behavior is jointly governed by model-free (MF) and model-based (MB) controllers. The former learns the values of actions directly from past encounters, and the latter exploits a cognitive map of the task to calculate these prospectively. Considerable attention has been paid to how these systems interact during choice, but how and whether knowledge of a cognitive map contributes to the way MF and MB controllers assign credit (i.e., to how they revaluate actions and states following the receipt of an outcome) remains underexplored. Here, we examine such sophisticated credit assignment using a dual-outcome bandit task. We provide evidence that knowledge of a cognitive map influences credit assignment in both MF and MB systems, mediating subtly different aspects of apparent relevance. Specifically, we show MF credit assignment is enhanced for those rewards that are related to a choice, and this contrasted with choice-unrelated rewards that reinforced subsequent choices negatively. This modulation is only possible based on knowledge of task structure. On the other hand, MB credit assignment was boosted for outcomes that impacted on differences in values between offered bandits. We consider mechanistic accounts and the normative status of these findings. We suggest the findings extend the scope and sophistication of cognitive map-based credit assignment during reinforcement learning, with implications for understanding behavioral control.

Entities: Chemical

Keywords: cognitive maps; decision making; model-based; model-free; reinforcement learning

Mesh：

Year: 2021 PMID： 33479182 PMCID： PMC7848688 DOI： 10.1073/pnas.2016884118

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN： 0027-8424 Impact factor: 12.779

40 in total

1. Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning.

Authors: Henry H Yin; Barbara J Knowlton; Bernard W Balleine
Journal: Eur J Neurosci Date: 2004-01 Impact factor: 3.386

2. The role of the dorsomedial striatum in instrumental conditioning.

Authors: Henry H Yin; Sean B Ostlund; Barbara J Knowlton; Bernard W Balleine
Journal: Eur J Neurosci Date: 2005-07 Impact factor: 3.386

Review 3. Learning task-state representations.

Authors: Yael Niv
Journal: Nat Neurosci Date: 2019-09-24 Impact factor: 24.884

4. Old processes, new perspectives: Familiarity is correlated with (not independent of) recollection and is more (not equally) variable for targets than for lures.

Authors: Rani Moran; Yonatan Goshen-Gottstein
Journal: Cogn Psychol Date: 2015-04-17 Impact factor: 3.468

Review 5. What Is a Cognitive Map? Organizing Knowledge for Flexible Behavior.

Authors: Timothy E J Behrens; Timothy H Muller; James C R Whittington; Shirley Mark; Alon B Baram; Kimberly L Stachenfeld; Zeb Kurth-Nelson
Journal: Neuron Date: 2018-10-24 Impact factor: 17.173

6. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning.

Authors: Jan Gläscher; Nathaniel Daw; Peter Dayan; John P O'Doherty
Journal: Neuron Date: 2010-05-27 Impact factor: 17.173

Review 7. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action.

Authors: Bernard W Balleine; John P O'Doherty
Journal: Neuropsychopharmacology Date: 2010-01 Impact factor: 7.853

8. Coordination of actions and habits in the medial prefrontal cortex of rats.

Authors: Simon Killcross; Etienne Coutureau
Journal: Cereb Cortex Date: 2003-04 Impact factor: 5.357

9. Speed/accuracy trade-off between the habitual and the goal-directed processes.

Authors: Mehdi Keramati; Amir Dezfouli; Payam Piray
Journal: PLoS Comput Biol Date: 2011-05-26 Impact factor: 4.475

10. Disruption of dorsolateral prefrontal cortex decreases model-based in favor of model-free control in humans.

Authors: Peter Smittenaar; Thomas H B FitzGerald; Vincenzo Romei; Nicholas D Wright; Raymond J Dolan
Journal: Neuron Date: 2013-10-24 Impact factor: 17.173

6 in total

1. Dopamine enhances model-free credit assignment through boosting of retrospective model-based inference.

Authors: Lorenz Deserno; Rani Moran; Jochen Michely; Ying Lee; Peter Dayan; Raymond J Dolan
Journal: Elife Date: 2021-12-09 Impact factor: 8.140

2. Dopaminergic challenge dissociates learning from primary versus secondary sources of information.

Authors: Alicia J Rybicki; Sophie L Sowden; Bianca Schuster; Jennifer L Cook
Journal: Elife Date: 2022-03-15 Impact factor: 8.713

3. Assigning the right credit to the wrong action: compulsivity in the general population is associated with augmented outcome-irrelevant value-based learning.

Authors: Nitzan Shahar; Tobias U Hauser; Rani Moran; Michael Moutoussis; Edward T Bullmore; Raymond J Dolan
Journal: Transl Psychiatry Date: 2021-11-05 Impact factor: 7.989