
Generalization guides human exploration in vast decision spaces.

Charley M Wu, Eric Schulz, Maarten Speekenbrink, Jonathan D Nelson, Björn Meder.

Abstract

From foraging for food to learning complex games, many aspects of human behaviour can be framed as a search problem with a vast space of possible actions. Under finite search horizons, optimal solutions are generally unobtainable. Yet, how do humans navigate vast problem spaces, which require intelligent exploration of unobserved actions? Using various bandit tasks with up to 121 arms, we study how humans search for rewards under limited search horizons, in which the spatial correlation of rewards (in both generated and natural environments) provides traction for generalization. Across various different probabilistic and heuristic models, we find evidence that Gaussian process function learning, combined with an optimistic upper confidence bound sampling strategy, provides a robust account of how people use generalization to guide search. Our modelling results and parameter estimates are recoverable and can be used to simulate human-like performance, providing insights about human behaviour in complex environments.
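To make the modelling idea in the abstract concrete, the following is a minimal sketch of Gaussian process function learning paired with upper confidence bound (UCB) sampling on an 11 x 11 grid (121 arms). It is not the authors' implementation. The RBF kernel, the `length_scale`, `noise_var` and `beta` values, the synthetic reward surface, and all function names are assumptions chosen for illustration.

```python
import numpy as np


def make_grid(side=11):
    """Return the (side*side, 2) array of arm coordinates on a square grid."""
    xs, ys = np.meshgrid(np.arange(side), np.arange(side), indexing="ij")
    return np.column_stack([xs.ravel(), ys.ravel()]).astype(float)


def rbf_kernel(a, b, length_scale=2.0):
    """RBF kernel: nearby arms are assumed to have similar rewards."""
    sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-sq_dist / (2.0 * length_scale ** 2))


def gp_posterior(x_obs, y_obs, x_all, length_scale=2.0, noise_var=1e-2):
    """Posterior mean and standard deviation of a zero-mean GP at every arm."""
    k_oo = rbf_kernel(x_obs, x_obs, length_scale) + noise_var * np.eye(len(x_obs))
    k_ao = rbf_kernel(x_all, x_obs, length_scale)
    mean = k_ao @ np.linalg.solve(k_oo, y_obs)
    # The RBF prior variance is 1 at every point; subtract what the data explain.
    explained = np.einsum("ij,ji->i", k_ao, np.linalg.solve(k_oo, k_ao.T))
    sd = np.sqrt(np.clip(1.0 - explained, 0.0, None))
    return mean, sd


def ucb_choice(mean, sd, beta=0.5):
    """Optimistic choice: pick the arm maximising mean + beta * uncertainty."""
    return int(np.argmax(mean + beta * sd))


def run_search(rewards, x_all, horizon=20, beta=0.5, seed=0, **gp_kwargs):
    """Search for `horizon` trials; the first arm is picked at random."""
    rng = np.random.default_rng(seed)
    chosen = [int(rng.integers(len(x_all)))]
    for _ in range(horizon - 1):
        mean, sd = gp_posterior(x_all[chosen], rewards[chosen], x_all, **gp_kwargs)
        chosen.append(ucb_choice(mean, sd, beta))
    return chosen


# Hypothetical smooth reward surface (a single bump), centred around zero
# so that it roughly matches the zero-mean GP prior.
grid = make_grid(11)
bump = np.exp(-((grid[:, 0] - 8.0) ** 2 + (grid[:, 1] - 3.0) ** 2) / 18.0)
rewards = bump - 0.5
choices = run_search(rewards, grid, horizon=20)
print("mean reward obtained:", rewards[choices].mean())
```

In this sketch, generalization comes from the kernel: one observation shifts the predicted reward of neighbouring arms, so the learner does not need to sample all 121 options. The `beta` term adds an exploration bonus for arms whose reward is still uncertain.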

Year:  2018        PMID: 30988442     DOI: 10.1038/s41562-018-0467-4

Source DB:  PubMed          Journal:  Nat Hum Behav        ISSN: 2397-3374


