Deconstructing the human algorithms for exploration.

Samuel J Gershman

Abstract

The dilemma between information gathering (exploration) and reward seeking (exploitation) is a fundamental problem for reinforcement learning agents. How humans resolve this dilemma is still an open question, because experiments have provided equivocal evidence about the underlying algorithms used by humans. We show that two families of algorithms can be distinguished in terms of how uncertainty affects exploration. Algorithms based on uncertainty bonuses predict a change in response bias as a function of uncertainty, whereas algorithms based on sampling predict a change in response slope. Two experiments provide evidence for both bias and slope changes, and computational modeling confirms that a hybrid model is the best quantitative account of the data.
Copyright © 2017 Elsevier B.V. All rights reserved.
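
The distinction between bias and slope changes can be illustrated with a minimal sketch. It assumes a two-armed bandit with Gaussian posteriors (means mu1, mu2 and standard deviations sigma1, sigma2) and a probit choice rule. The function names and the parameters gamma (bonus weight) and lam (choice noise) are illustrative assumptions, not taken from the abstract, and the code is not a reproduction of the paper's fitted models.

```python
from math import erf, sqrt


def norm_cdf(x: float) -> float:
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))


def p_first_bonus(mu1, mu2, sigma1, sigma2, gamma=1.0, lam=1.0):
    """Uncertainty-bonus rule (assumed form): each arm's value is its mean
    plus gamma times its posterior standard deviation. The difference in
    uncertainty enters additively, so it shifts the choice curve (bias)."""
    return norm_cdf(((mu1 - mu2) + gamma * (sigma1 - sigma2)) / lam)


def p_first_sampling(mu1, mu2, sigma1, sigma2):
    """Posterior-sampling rule (assumed form): draw one value per arm from
    its Gaussian posterior and pick the larger. Total uncertainty divides
    the mean difference, so it flattens the choice curve (slope)."""
    return norm_cdf((mu1 - mu2) / sqrt(sigma1 ** 2 + sigma2 ** 2))


if __name__ == "__main__":
    # Equal means: only the bonus rule favors the more uncertain arm.
    print("bonus, equal means:", p_first_bonus(0.0, 0.0, 2.0, 1.0))
    print("sampling, equal means:", p_first_sampling(0.0, 0.0, 2.0, 1.0))
    # Same mean difference: more total uncertainty gives a shallower curve.
    print("sampling, low uncertainty:", p_first_sampling(1.0, 0.0, 1.0, 1.0))
    print("sampling, high uncertainty:", p_first_sampling(1.0, 0.0, 3.0, 3.0))
```

Under these assumptions, a hybrid account would combine both effects, with relative uncertainty moving the indifference point and total uncertainty scaling the sensitivity to the difference in means.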

Keywords:  Bayesian inference; Explore-exploit dilemma; Reinforcement learning

Year: 2017; PMID: 29289795; PMCID: PMC5801139; DOI: 10.1016/j.cognition.2017.12.014

Source DB: PubMed; Journal: Cognition; ISSN: 0010-0277


