Literature DB >> 33811071

Two sides of the same coin: Beneficial and detrimental consequences of range adaptation in human reinforcement learning.

Sophie Bavard1,2,3, Aldo Rustichini4, Stefano Palminteri5,2,3.   

Abstract

Evidence suggests that economic values are rescaled as a function of the range of the available options. Although locally adaptive, range adaptation has been shown to lead to suboptimal choices, particularly notable in reinforcement learning (RL) situations when options are extrapolated from their original context to a new one. Range adaptation can be seen as the result of an adaptive coding process aiming at increasing the signal-to-noise ratio. However, this hypothesis leads to a counterintuitive prediction: Decreasing task difficulty should increase range adaptation and, consequently, extrapolation errors. Here, we tested the paradoxical relation between range adaptation and performance in a large sample of participants performing variants of an RL task, where we manipulated task difficulty. Results confirmed that range adaptation induces systematic extrapolation errors and is stronger when decreasing task difficulty. Last, we propose a range-adapting model and show that it is able to parsimoniously capture all the behavioral results.
Copyright © 2021 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).

Entities:  

Year:  2021        PMID: 33811071     DOI: 10.1126/sciadv.abe0340

Source DB:  PubMed          Journal:  Sci Adv        ISSN: 2375-2548            Impact factor:   14.136


  3 in total

1.  Human value learning and representation reflect rational adaptation to task demands.

Authors:  Keno Juechems; Tugba Altun; Rita Hira; Andreas Jarvstad
Journal:  Nat Hum Behav       Date:  2022-05-30

Review 2.  Decision neuroscience and neuroeconomics: Recent progress and ongoing challenges.

Authors:  Jeffrey B Dennison; Daniel Sazhin; David V Smith
Journal:  Wiley Interdiscip Rev Cogn Sci       Date:  2022-02-08

3.  Bumblebees retrieve only the ordinal ranking of foraging options when comparing memories obtained in distinct settings.

Authors:  Cwyn Solvi; Yonghe Zhou; Yunxiao Feng; Yuyi Lu; Mark Roper; Li Sun; Rebecca J Reid; Lars Chittka; Andrew B Barron; Fei Peng
Journal:  Elife       Date:  2022-09-27       Impact factor: 8.713

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.