Literature DB >> 28731839

Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems.

Wouter Kool1, Samuel J Gershman1,2, Fiery A Cushman1.   

Abstract

Human behavior is sometimes determined by habit and other times by goal-directed planning. Modern reinforcement-learning theories formalize this distinction as a competition between a computationally cheap but inaccurate model-free system that gives rise to habits and a computationally expensive but accurate model-based system that implements planning. It is unclear, however, how people choose to allocate control between these systems. Here, we propose that arbitration occurs by comparing each system's task-specific costs and benefits. To investigate this proposal, we conducted two experiments showing that people increase model-based control when it achieves greater accuracy than model-free control, and especially when the rewards of accurate performance are amplified. In contrast, they are insensitive to reward amplification when model-based and model-free control yield equivalent accuracy. This suggests that humans adaptively balance habitual and planned action through on-line cost-benefit analysis.

Entities:  

Keywords:  cognitive control; decision making; open data; open materials; reinforcement learning

Mesh:

Year:  2017        PMID: 28731839     DOI: 10.1177/0956797617708288

Source DB:  PubMed          Journal:  Psychol Sci        ISSN: 0956-7976


  44 in total

1.  Assessing the role of reward in task selection using a reward-based voluntary task switching paradigm.

Authors:  David A Braun; Catherine M Arrington
Journal:  Psychol Res       Date:  2017-09-26

Review 2.  Mental labour.

Authors:  Wouter Kool; Matthew Botvinick
Journal:  Nat Hum Behav       Date:  2018-09-03

3.  The prevalence and importance of statistical learning in human cognition and behavior.

Authors:  Brynn E Sherman; Kathryn N Graves; Nicholas B Turk-Browne
Journal:  Curr Opin Behav Sci       Date:  2020-02-29

4.  Discovery of hierarchical representations for efficient planning.

Authors:  Momchil S Tomov; Samyukta Yagati; Agni Kumar; Wanqian Yang; Samuel J Gershman
Journal:  PLoS Comput Biol       Date:  2020-04-06       Impact factor: 4.475

5.  A Control Theoretic Model of Adaptive Learning in Dynamic Environments.

Authors:  Harrison Ritz; Matthew R Nassar; Michael J Frank; Amitai Shenhav
Journal:  J Cogn Neurosci       Date:  2018-06-07       Impact factor: 3.225

Review 6.  Substance use is associated with reduced devaluation sensitivity.

Authors:  Kaileigh A Byrne; A Ross Otto; Bo Pang; Christopher J Patrick; Darrell A Worthy
Journal:  Cogn Affect Behav Neurosci       Date:  2019-02       Impact factor: 3.282

7.  Dopamine and Proximity in Motivation and Cognitive Control.

Authors:  Andrew Westbrook; Michael Frank
Journal:  Curr Opin Behav Sci       Date:  2018-01-04

8.  The computational basis of following advice in adolescents.

Authors:  Julia M Rodriguez Buritica; Hauke R Heekeren; Wouter van den Bos
Journal:  J Exp Child Psychol       Date:  2019-01-02

9.  Humans primarily use model-based inference in the two-stage task.

Authors:  Carolina Feher da Silva; Todd A Hare
Journal:  Nat Hum Behav       Date:  2020-07-06

Review 10.  Reinforcement-learning in fronto-striatal circuits.

Authors:  Bruno Averbeck; John P O'Doherty
Journal:  Neuropsychopharmacology       Date:  2021-08-05       Impact factor: 7.853

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.