Literature DB >> 34368809

Revisiting the Role of Uncertainty-Driven Exploration in a (Perceived) Non-Stationary World.

Dalin Guo1, Angela J Yu2.   

Abstract

Humans are often faced with an exploration-versus-exploitation trade-off. A commonly used paradigm, multi-armed bandit, has shown humans to exhibit an "uncertainty bonus", which combines with estimated reward to drive exploration. However, previous studies often modeled belief updating using either a Bayesian model that assumed the reward contingency to remain stationary, or a reinforcement learning model. Separately, we previously showed that human learning in the bandit task is best captured by a dynamic-belief Bayesian model. We hypothesize that the estimated uncertainty bonus may depend on which learning model is employed. Here, we re-analyze a bandit dataset using all three learning models. We find that the dynamic-belief model captures human choice behavior best, while also uncovering a much larger uncertainty bonus than the other models. More broadly, our results also emphasize the importance of an appropriate learning model, as it is crucial for correctly characterizing the processes underlying human decision making.

Entities:  

Keywords:  Bayesian modeling; decision making; multi-armed bandit; reinforcement learning

Year:  2021        PMID: 34368809      PMCID: PMC8341546     

Source DB:  PubMed          Journal:  Cogsci


  17 in total

1.  Relative and absolute strength of response as a function of frequency of reinforcement.

Authors:  R J HERRNSTEIN
Journal:  J Exp Anal Behav       Date:  1961-07       Impact factor: 2.468

2.  Sequential effects: Superstition or rational behavior?

Authors:  Angela J Yu; Jonathan D Cohen
Journal:  Adv Neural Inf Process Syst       Date:  2008

3.  Uncertainty and exploration in a restless bandit problem.

Authors:  Maarten Speekenbrink; Emmanouil Konstantinidis
Journal:  Top Cogn Sci       Date:  2015-04-20

4.  Humans use directed and random exploration to solve the explore-exploit dilemma.

Authors:  Robert C Wilson; Andra Geana; John M White; Elliot A Ludvig; Jonathan D Cohen
Journal:  J Exp Psychol Gen       Date:  2014-10-27

5.  Cortical substrates for exploratory decisions in humans.

Authors:  Nathaniel D Daw; John P O'Doherty; Peter Dayan; Ben Seymour; Raymond J Dolan
Journal:  Nature       Date:  2006-06-15       Impact factor: 49.962

Review 6.  Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.

Authors:  Jonathan D Cohen; Samuel M McClure; Angela J Yu
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2007-05-29       Impact factor: 6.237

7.  Validation of decision-making models and analysis of decision variables in the rat basal ganglia.

Authors:  Makoto Ito; Kenji Doya
Journal:  J Neurosci       Date:  2009-08-05       Impact factor: 6.167

8.  Learning the value of information and reward over time when solving exploration-exploitation problems.

Authors:  Irene Cogliati Dezza; Angela J Yu; Axel Cleeremans; William Alexander
Journal:  Sci Rep       Date:  2017-12-05       Impact factor: 4.379

9.  Dopamine blockade impairs the exploration-exploitation trade-off in rats.

Authors:  François Cinotti; Virginie Fresno; Nassim Aklil; Etienne Coutureau; Benoît Girard; Alain R Marchand; Mehdi Khamassi
Journal:  Sci Rep       Date:  2019-05-01       Impact factor: 4.379

10.  Devaluation of Unchosen Options: A Bayesian Account of the Provenance and Maintenance of Overly Optimistic Expectations.

Authors:  Corey Yishan Zhou; Dalin Guo; Angela J Yu
Journal:  Cogsci       Date:  2020 Jul-Aug
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.