Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Novelty and Inductive Generalization in Human Reinforcement Learning.

Literature DB >> 25808176

Novelty and Inductive Generalization in Human Reinforcement Learning.

Abstract

In reinforcement learning (RL), a decision maker searching for the most rewarding option is often faced with the question: What is the value of an option that has never been tried before? One way to frame this question is as an inductive problem: How can I generalize my previous experience with one set of options to a novel option? We show how hierarchical Bayesian inference can be used to solve this problem, and we describe an equivalence between the Bayesian model and temporal difference learning algorithms that have been proposed as models of RL in humans and animals. According to our view, the search for the best option is guided by abstract knowledge about the relationships between different options in an environment, resulting in greater search efficiency compared to traditional RL algorithms previously applied to human cognition. In two behavioral experiments, we test several predictions of our model, providing evidence that humans learn and exploit structured inductive knowledge to make predictions about novel options. In light of this model, we suggest a new interpretation of dopaminergic responses to novelty.

Entities: Chemical Disease Gene Species

Keywords: Bayesian inference; Exploration-exploitation dilemma; Neophilia; Neophobia; Reinforcement learning

Mesh：

Substances：
Dopamine

Year: 2015 PMID： 25808176 PMCID： PMC4537661 DOI： 10.1111/tops.12138

Source DB: PubMed Journal: Top Cogn Sci ISSN： 1756-8757

57 in total

Review 1. Conditioned place preference: what does it add to our preclinical understanding of drug reward?

Authors: M T Bardo; R A Bevins
Journal: Psychopharmacology (Berl) Date: 2000-12 Impact factor: 4.530

2. Learning, prediction and causal Bayes nets.

Authors: Clark Glymour
Journal: Trends Cogn Sci Date: 2003-01 Impact factor: 20.229

3. Distinguishing genuine from spurious causes: a coherence hypothesis.

Authors: Y Lien; P W Cheng
Journal: Cogn Psychol Date: 2000-03 Impact factor: 3.468

4. Failure to find a learned drive based on hunger; evidence for learning motivated by exploration.

Authors: A K MYERS; N E MILLER
Journal: J Comp Physiol Psychol Date: 1954-12

5. Categories and causality: the neglected direction.

Authors: Michael R Waldmann; York Hagmayer
Journal: Cogn Psychol Date: 2006-02-23 Impact factor: 3.468

6. The Psychophysics Toolbox.

Authors: D H Brainard
Journal: Spat Vis Date: 1997

7. Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat.

Authors: J C Horvitz; T Stewart; B L Jacobs
Journal: Brain Res Date: 1997-06-13 Impact factor: 3.252

Review 8. A neural substrate of prediction and reward.

Authors: W Schultz; P Dayan; P R Montague
Journal: Science Date: 1997-03-14 Impact factor: 47.728

9. A new one-trial test for neurobiological studies of memory in rats. 1: Behavioral data.

Authors: A Ennaceur; J Delacour
Journal: Behav Brain Res Date: 1988-11-01 Impact factor: 3.332

10. Striatal activity underlies novelty-based choice in humans.

Authors: Bianca C Wittmann; Nathaniel D Daw; Ben Seymour; Raymond J Dolan
Journal: Neuron Date: 2008-06-26 Impact factor: 17.173

20 in total

1. Structured, uncertainty-driven exploration in real-world consumer choice.

Authors: Eric Schulz; Rahul Bhui; Bradley C Love; Bastien Brier; Michael T Todd; Samuel J Gershman
Journal: Proc Natl Acad Sci U S A Date: 2019-06-24 Impact factor: 11.205

2. Causal Inference About Good and Bad Outcomes.

Authors: Hayley M Dorfman; Rahul Bhui; Brent L Hughes; Samuel J Gershman
Journal: Psychol Sci Date: 2019-02-13

3. Context-dependent learning and causal structure.

Authors: Samuel J Gershman
Journal: Psychon Bull Rev Date: 2017-04

4. Deconstructing the human algorithms for exploration.

Authors: Samuel J Gershman
Journal: Cognition Date: 2017-12-29

5. Impulsivity and risk-seeking as Bayesian inference under dopaminergic control.

Authors: John G Mikhael; Samuel J Gershman
Journal: Neuropsychopharmacology Date: 2021-08-10 Impact factor: 7.853

6. Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.

Authors: Jaron T Colas; Neil M Dundon; Raphael T Gerraty; Natalie M Saragosa-Harris; Karol P Szymula; Koranis Tanwisuth; J Michael Tyszka; Camilla van Geen; Harang Ju; Arthur W Toga; Joshua I Gold; Dani S Bassett; Catherine A Hartley; Daphna Shohamy; Scott T Grafton; John P O'Doherty
Journal: Hum Brain Mapp Date: 2022-07-21 Impact factor: 5.399