Literature DB >> 33768122

Uncertainty and Exploration.

Samuel J Gershman1.   

Abstract

In order to discover the most rewarding actions, agents must collect information about their environment, potentially foregoing reward. The optimal solution to this "explore-exploit" dilemma is often computationally challenging, but principled algorithmic approximations exist. These approximations utilize uncertainty about action values in different ways. Some random exploration algorithms scale the level of choice stochasticity with the level of uncertainty. Other directed exploration algorithms add a "bonus" to action values with high uncertainty. Random exploration algorithms are sensitive to total uncertainty across actions, whereas directed exploration algorithms are sensitive to relative uncertainty. This paper reports a multi-armed bandit experiment in which total and relative uncertainty were orthogonally manipulated. We found that humans employ both exploration strategies, and that these strategies are independently controlled by different uncertainty computations.

Entities:  

Keywords:  Bayesian inference; explore-exploit dilemma; reinforcement learning

Year:  2018        PMID: 33768122      PMCID: PMC7989061          DOI: 10.1037/dec0000101

Source DB:  PubMed          Journal:  Decision (Wash D C )        ISSN: 2325-9965


  5 in total

1.  Impulsivity and risk-seeking as Bayesian inference under dopaminergic control.

Authors:  John G Mikhael; Samuel J Gershman
Journal:  Neuropsychopharmacology       Date:  2021-08-10       Impact factor: 7.853

2.  Trait somatic anxiety is associated with reduced directed exploration and underestimation of uncertainty.

Authors:  Haoxue Fan; Samuel J Gershman; Elizabeth A Phelps
Journal:  Nat Hum Behav       Date:  2022-10-03

3.  Using Computational Modeling to Capture Schizophrenia-Specific Reinforcement Learning Differences and Their Implications on Patient Classification.

Authors:  Andra Geana; Deanna M Barch; James M Gold; Cameron S Carter; Angus W MacDonald; J Daniel Ragland; Steven M Silverstein; Michael J Frank
Journal:  Biol Psychiatry Cogn Neurosci Neuroimaging       Date:  2021-04-18

4.  Time pressure changes how people explore and respond to uncertainty.

Authors:  Charley M Wu; Eric Schulz; Timothy J Pleskac; Maarten Speekenbrink
Journal:  Sci Rep       Date:  2022-03-08       Impact factor: 4.996

5.  Human Belief State-Based Exploration and Exploitation in an Information-Selective Symmetric Reversal Bandit Task.

Authors:  Lilla Horvath; Stanley Colcombe; Michael Milham; Shruti Ray; Philipp Schwartenbeck; Dirk Ostwald
Journal:  Comput Brain Behav       Date:  2021-08-02
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.