Literature DB >> 33184605

Balancing exploration and exploitation with information and randomization.

Robert C Wilson1,2,3, Elizabeth Bonawitz4, Vincent D Costa5, R Becket Ebitz6.   

Abstract

Explore-exploit decisions require us to trade off the benefits of exploring unknown options to learn more about them, with exploiting known options, for immediate reward. Such decisions are ubiquitous in nature, but from a computational perspective, they are notoriously hard. There is therefore much interest in how humans and animals make these decisions and recently there has been an explosion of research in this area. Here we provide a biased and incomplete snapshot of this field focusing on the major finding that many organisms use two distinct strategies to solve the explore-exploit dilemma: a bias for information ('directed exploration') and the randomization of choice ('random exploration'). We review evidence for the existence of these strategies, their computational properties, their neural implementations, as well as how directed and random exploration vary over the lifespan. We conclude by highlighting open questions in this field that are ripe to both explore and exploit.

Entities:  

Year:  2020        PMID: 33184605      PMCID: PMC7654823          DOI: 10.1016/j.cobeha.2020.10.001

Source DB:  PubMed          Journal:  Curr Opin Behav Sci        ISSN: 2352-1546


  62 in total

1.  Cognitive development. Observing the unexpected enhances infants' learning and exploration.

Authors:  Aimee E Stahl; Lisa Feigenson
Journal:  Science       Date:  2015-04-03       Impact factor: 47.728

Review 2.  The short-latency dopamine signal: a role in discovering novel actions?

Authors:  Peter Redgrave; Kevin Gurney
Journal:  Nat Rev Neurosci       Date:  2006-11-08       Impact factor: 34.870

Review 3.  Probabilistic models, learning algorithms, and response variability: sampling in cognitive development.

Authors:  Elizabeth Bonawitz; Stephanie Denison; Thomas L Griffiths; Alison Gopnik
Journal:  Trends Cogn Sci       Date:  2014-07-04       Impact factor: 20.229

4.  Pupil diameter predicts changes in the exploration-exploitation trade-off: evidence for the adaptive gain theory.

Authors:  Marieke Jepma; Sander Nieuwenhuis
Journal:  J Cogn Neurosci       Date:  2010-07-28       Impact factor: 3.225

5.  Variability in velocity profiles during free-air whisking behavior of unrestrained rats.

Authors:  R Blythe Towal; Mitra J Z Hartmann
Journal:  J Neurophysiol       Date:  2008-04-24       Impact factor: 2.714

6.  Uncertainty about mapping future actions into rewards may underlie performance on multiple measures of impulsivity in behavioral addiction: evidence from Parkinson's disease.

Authors:  Bruno B Averbeck; Atbin Djamshidian; Sean S O'Sullivan; Charlotte R Housden; Jonathan P Roiser; Andrew J Lees
Journal:  Behav Neurosci       Date:  2013-04       Impact factor: 1.912

7.  The role of the noradrenergic system in the exploration-exploitation trade-off: a psychopharmacological study.

Authors:  Marieke Jepma; Erik T Te Beek; Eric-Jan Wagenmakers; Joop M A van Gerven; Sander Nieuwenhuis
Journal:  Front Hum Neurosci       Date:  2010-08-26       Impact factor: 3.169

8.  Theory of choice in bandit, information sampling and foraging tasks.

Authors:  Bruno B Averbeck
Journal:  PLoS Comput Biol       Date:  2015-03-27       Impact factor: 4.475

9.  The hippocampus and exploration: dynamically evolving behavior and neural representations.

Authors:  Adam Johnson; Zachary Varberg; James Benhardus; Anthony Maahs; Paul Schrater
Journal:  Front Hum Neurosci       Date:  2012-07-25       Impact factor: 3.169

10.  Pupil size and social vigilance in rhesus macaques.

Authors:  R Becket Ebitz; John M Pearson; Michael L Platt
Journal:  Front Neurosci       Date:  2014-05-06       Impact factor: 4.677

View more
  21 in total

1.  Attenuated Directed Exploration during Reinforcement Learning in Gambling Disorder.

Authors:  A Wiehler; K Chakroun; J Peters
Journal:  J Neurosci       Date:  2021-02-02       Impact factor: 6.167

2.  Rules warp feature encoding in decision-making circuits.

Authors:  R Becket Ebitz; Jiaxin Cindy Tu; Benjamin Y Hayden
Journal:  PLoS Biol       Date:  2020-11-30       Impact factor: 8.029

3.  Sex differences in learning from exploration.

Authors:  Cathy S Chen; Evan Knep; Autumn Han; R Becket Ebitz; Nicola M Grissom
Journal:  Elife       Date:  2021-11-19       Impact factor: 8.140

Review 4.  From exploration to exploitation: a shifting mental mode in late life development.

Authors:  R Nathan Spreng; Gary R Turner
Journal:  Trends Cogn Sci       Date:  2021-09-27       Impact factor: 20.229

Review 5.  The population doctrine in cognitive neuroscience.

Authors:  R Becket Ebitz; Benjamin Y Hayden
Journal:  Neuron       Date:  2021-08-19       Impact factor: 18.688

6.  Trait somatic anxiety is associated with reduced directed exploration and underestimation of uncertainty.

Authors:  Haoxue Fan; Samuel J Gershman; Elizabeth A Phelps
Journal:  Nat Hum Behav       Date:  2022-10-03

7.  Balance between breadth and depth in human many-alternative decisions.

Authors:  Alice Vidal; Salvador Soto-Faraco; Rubén Moreno-Bote
Journal:  Elife       Date:  2022-09-15       Impact factor: 8.713

8.  Dynamic decision policy reconfiguration under outcome uncertainty.

Authors:  Krista Bond; Kyle Dunovan; Alexis Porter; Jonathan E Rubin; Timothy Verstynen
Journal:  Elife       Date:  2021-12-24       Impact factor: 8.140

9.  A Push For Examining Subjective Experience in Value-Based Decision-Making.

Authors:  Drew C Schreiner; Ege A Yalcinbas; Christina M Gremel
Journal:  Curr Opin Behav Sci       Date:  2021-04-15

10.  Revisiting the Role of Uncertainty-Driven Exploration in a (Perceived) Non-Stationary World.

Authors:  Dalin Guo; Angela J Yu
Journal:  Cogsci       Date:  2021-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.