Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Generalization guides human exploration in vast decision spaces.

Literature DB >> 30988442

Generalization guides human exploration in vast decision spaces.

Charley M Wu¹, Eric Schulz², Maarten Speekenbrink³, Jonathan D Nelson^4,5, Björn Meder^6,5.

Abstract

From foraging for food to learning complex games, many aspects of human behaviour can be framed as a search problem with a vast space of possible actions. Under finite search horizons, optimal solutions are generally unobtainable. Yet, how do humans navigate vast problem spaces, which require intelligent exploration of unobserved actions? Using various bandit tasks with up to 121 arms, we study how humans search for rewards under limited search horizons, in which the spatial correlation of rewards (in both generated and natural environments) provides traction for generalization. Across various different probabilistic and heuristic models, we find evidence that Gaussian process function learning-combined with an optimistic upper confidence bound sampling strategy-provides a robust account of how people use generalization to guide search. Our modelling results and parameter estimates are recoverable and can be used to simulate human-like performance, providing insights about human behaviour in complex environments.

Entities: Species

Mesh：

Year: 2018 PMID： 30988442 DOI： 10.1038/s41562-018-0467-4

Source DB: PubMed Journal: Nat Hum Behav ISSN： 2397-3374

31 in total

1. Mastering the game of Go with deep neural networks and tree search.

Authors: David Silver; Aja Huang; Chris J Maddison; Arthur Guez; Laurent Sifre; George van den Driessche; Julian Schrittwieser; Ioannis Antonoglou; Veda Panneershelvam; Marc Lanctot; Sander Dieleman; Dominik Grewe; John Nham; Nal Kalchbrenner; Ilya Sutskever; Timothy Lillicrap; Madeleine Leach; Koray Kavukcuoglu; Thore Graepel; Demis Hassabis
Journal: Nature Date: 2016-01-28 Impact factor: 49.962

2. Uncertainty and exploration in a restless bandit problem.

Authors: Maarten Speekenbrink; Emmanouil Konstantinidis
Journal: Top Cogn Sci Date: 2015-04-20

3. Formalizing Neurath's ship: Approximate algorithms for online causal learning.

Authors: Neil R Bramley; Peter Dayan; Thomas L Griffiths; David A Lagnado
Journal: Psychol Rev Date: 2017-02-27 Impact factor: 8.934

4. Neural computations underlying arbitration between model-based and model-free learning.

Authors: Sang Wan Lee; Shinsuke Shimojo; John P O'Doherty
Journal: Neuron Date: 2014-02-05 Impact factor: 17.173

5. Humans use directed and random exploration to solve the explore-exploit dilemma.

Authors: Robert C Wilson; Andra Geana; John M White; Elliot A Ludvig; Jonathan D Cohen
Journal: J Exp Psychol Gen Date: 2014-10-27

6. Building machines that learn and think like people.

Authors: Brenden M Lake; Tomer D Ullman; Joshua B Tenenbaum; Samuel J Gershman
Journal: Behav Brain Sci Date: 2016-11-24 Impact factor: 12.579

7. Human-level control through deep reinforcement learning.

Authors: Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal: Nature Date: 2015-02-26 Impact factor: 49.962

Review 8. Reinforcement Learning and Episodic Memory in Humans and Animals: An Integrative Framework.

Authors: Samuel J Gershman; Nathaniel D Daw
Journal: Annu Rev Psychol Date: 2016-09-02 Impact factor: 24.137

9. Neural mechanisms of foraging.

Authors: Nils Kolling; Timothy E J Behrens; Rogier B Mars; Matthew F S Rushworth
Journal: Science Date: 2012-04-06 Impact factor: 47.728

10. Confirmation bias in human reinforcement learning: Evidence from counterfactual feedback processing.

Authors: Stefano Palminteri; Germain Lefebvre; Emma J Kilford; Sarah-Jayne Blakemore
Journal: PLoS Comput Biol Date: 2017-08-11 Impact factor: 4.475

29 in total

1. Structured, uncertainty-driven exploration in real-world consumer choice.

Authors: Eric Schulz; Rahul Bhui; Bradley C Love; Bastien Brier; Michael T Todd; Samuel J Gershman
Journal: Proc Natl Acad Sci U S A Date: 2019-06-24 Impact factor: 11.205

2. Discovery of hierarchical representations for efficient planning.

Authors: Momchil S Tomov; Samyukta Yagati; Agni Kumar; Wanqian Yang; Samuel J Gershman
Journal: PLoS Comput Biol Date: 2020-04-06 Impact factor: 4.475

3. Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.

Authors: Jaron T Colas; Neil M Dundon; Raphael T Gerraty; Natalie M Saragosa-Harris; Karol P Szymula; Koranis Tanwisuth; J Michael Tyszka; Camilla van Geen; Harang Ju; Arthur W Toga; Joshua I Gold; Dani S Bassett; Catherine A Hartley; Daphna Shohamy; Scott T Grafton; John P O'Doherty
Journal: Hum Brain Mapp Date: 2022-07-21 Impact factor: 5.399

4. Humans adaptively resolve the explore-exploit dilemma under cognitive constraints: Evidence from a multi-armed bandit task.

Authors: Vanessa M Brown; Michael N Hallquist; Michael J Frank; Alexandre Y Dombrovski
Journal: Cognition Date: 2022-07-30