Literature DB >> 25347535

Humans use directed and random exploration to solve the explore-exploit dilemma.

Robert C Wilson1, Andra Geana1, John M White2, Elliot A Ludvig2, Jonathan D Cohen1.   

Abstract

All adaptive organisms face the fundamental tradeoff between pursuing a known reward (exploitation) and sampling lesser-known options in search of something better (exploration). Theory suggests at least two strategies for solving this dilemma: a directed strategy in which choices are explicitly biased toward information seeking, and a random strategy in which decision noise leads to exploration by chance. In this work we investigated the extent to which humans use these two strategies. In our "Horizon task," participants made explore-exploit decisions in two contexts that differed in the number of choices that they would make in the future (the time horizon). Participants were allowed to make either a single choice in each game (horizon 1), or 6 sequential choices (horizon 6), giving them more opportunity to explore. By modeling the behavior in these two conditions, we were able to measure exploration-related changes in decision making and quantify the contributions of the two strategies to behavior. We found that participants were more information seeking and had higher decision noise with the longer horizon, suggesting that humans use both strategies to solve the exploration-exploitation dilemma. We thus conclude that both information seeking and choice variability can be controlled and put to use in the service of exploration. PsycINFO Database Record (c) 2014 APA, all rights reserved.

Entities:  

Mesh:

Year:  2014        PMID: 25347535      PMCID: PMC5635655          DOI: 10.1037/a0038199

Source DB:  PubMed          Journal:  J Exp Psychol Gen        ISSN: 0022-1015


  14 in total

1.  Decisions from experience and the effect of rare events in risky choice.

Authors:  Ralph Hertwig; Greg Barron; Elke U Weber; Ido Erev
Journal:  Psychol Sci       Date:  2004-08

Review 2.  An integrative theory of locus coeruleus-norepinephrine function: adaptive gain and optimal performance.

Authors:  Gary Aston-Jones; Jonathan D Cohen
Journal:  Annu Rev Neurosci       Date:  2005       Impact factor: 12.449

3.  Performance variability enables adaptive plasticity of 'crystallized' adult birdsong.

Authors:  Evren C Tumer; Michael S Brainard
Journal:  Nature       Date:  2007-12-20       Impact factor: 49.962

4.  Not noisy, just wrong: the role of suboptimal inference in behavioral variability.

Authors:  Jeffrey M Beck; Wei Ji Ma; Xaq Pitkow; Peter E Latham; Alexandre Pouget
Journal:  Neuron       Date:  2012-04-12       Impact factor: 17.173

Review 5.  The description-experience gap in risky choice.

Authors:  Ralph Hertwig; Ido Erev
Journal:  Trends Cogn Sci       Date:  2009-10-14       Impact factor: 20.229

Review 6.  Noise in the nervous system.

Authors:  A Aldo Faisal; Luc P J Selen; Daniel M Wolpert
Journal:  Nat Rev Neurosci       Date:  2008-04       Impact factor: 34.870

7.  A sensory source for motor variation.

Authors:  Leslie C Osborne; Stephen G Lisberger; William Bialek
Journal:  Nature       Date:  2005-09-15       Impact factor: 49.962

8.  Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings.

Authors:  Elise Payzan-LeNestour; Peter Bossaerts
Journal:  PLoS Comput Biol       Date:  2011-01-20       Impact factor: 4.475

9.  Vocal experimentation in the juvenile songbird requires a basal ganglia circuit.

Authors:  Bence P Olveczky; Aaron S Andalman; Michale S Fee
Journal:  PLoS Biol       Date:  2005-03-29       Impact factor: 8.029

10.  Do not Bet on the Unknown Versus Try to Find Out More: Estimation Uncertainty and "Unexpected Uncertainty" Both Modulate Exploration.

Authors:  Elise Payzan-Lenestour; Peter Bossaerts
Journal:  Front Neurosci       Date:  2012-10-16       Impact factor: 4.677

View more
  96 in total

1.  Selective maintenance of value information helps resolve the exploration/exploitation dilemma.

Authors:  Michael N Hallquist; Alexandre Y Dombrovski
Journal:  Cognition       Date:  2018-11-28

2.  Common neural code for reward and information value.

Authors:  Kenji Kobayashi; Ming Hsu
Journal:  Proc Natl Acad Sci U S A       Date:  2019-06-11       Impact factor: 11.205

3.  Structured, uncertainty-driven exploration in real-world consumer choice.

Authors:  Eric Schulz; Rahul Bhui; Bradley C Love; Bastien Brier; Michael T Todd; Samuel J Gershman
Journal:  Proc Natl Acad Sci U S A       Date:  2019-06-24       Impact factor: 11.205

4.  Optimal utility and probability functions for agents with finite computational precision.

Authors:  Keno Juechems; Jan Balaguer; Bernhard Spitzer; Christopher Summerfield
Journal:  Proc Natl Acad Sci U S A       Date:  2021-01-12       Impact factor: 11.205

Review 5.  A Primer on Foraging and the Explore/Exploit Trade-Off for Psychiatry Research.

Authors:  M A Addicott; J M Pearson; M M Sweitzer; D L Barack; M L Platt
Journal:  Neuropsychopharmacology       Date:  2017-05-29       Impact factor: 7.853

Review 6.  Time discounting and time preference in animals: A critical review.

Authors:  Benjamin Y Hayden
Journal:  Psychon Bull Rev       Date:  2016-02

Review 7.  Temporal trade-offs in psychophysics.

Authors:  David L Barack; Joshua I Gold
Journal:  Curr Opin Neurobiol       Date:  2016-02-26       Impact factor: 6.627

8.  Behavioural variability contributes to over-staying in patchy foraging.

Authors:  Tyler Cash-Padgett; Benjamin Hayden
Journal:  Biol Lett       Date:  2020-03-11       Impact factor: 3.703

9.  Attenuation of dopamine-modulated prefrontal value signals underlies probabilistic reward learning deficits in old age.

Authors:  Lieke de Boer; Jan Axelsson; Katrine Riklund; Lars Nyberg; Peter Dayan; Lars Bäckman; Marc Guitart-Masip
Journal:  Elife       Date:  2017-09-05       Impact factor: 8.140

10.  Pure correlates of exploration and exploitation in the human brain.

Authors:  Tommy C Blanchard; Samuel J Gershman
Journal:  Cogn Affect Behav Neurosci       Date:  2018-02       Impact factor: 3.282

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.