Literature DB >> 25808176

Novelty and Inductive Generalization in Human Reinforcement Learning.

Samuel J Gershman1, Yael Niv2.   

Abstract

In reinforcement learning (RL), a decision maker searching for the most rewarding option is often faced with the question: What is the value of an option that has never been tried before? One way to frame this question is as an inductive problem: How can I generalize my previous experience with one set of options to a novel option? We show how hierarchical Bayesian inference can be used to solve this problem, and we describe an equivalence between the Bayesian model and temporal difference learning algorithms that have been proposed as models of RL in humans and animals. According to our view, the search for the best option is guided by abstract knowledge about the relationships between different options in an environment, resulting in greater search efficiency compared to traditional RL algorithms previously applied to human cognition. In two behavioral experiments, we test several predictions of our model, providing evidence that humans learn and exploit structured inductive knowledge to make predictions about novel options. In light of this model, we suggest a new interpretation of dopaminergic responses to novelty.
Copyright © 2015 Cognitive Science Society, Inc.

Entities:  

Keywords:  Bayesian inference; Exploration-exploitation dilemma; Neophilia; Neophobia; Reinforcement learning

Mesh:

Substances:

Year:  2015        PMID: 25808176      PMCID: PMC4537661          DOI: 10.1111/tops.12138

Source DB:  PubMed          Journal:  Top Cogn Sci        ISSN: 1756-8757


  57 in total

Review 1.  Conditioned place preference: what does it add to our preclinical understanding of drug reward?

Authors:  M T Bardo; R A Bevins
Journal:  Psychopharmacology (Berl)       Date:  2000-12       Impact factor: 4.530

2.  Learning, prediction and causal Bayes nets.

Authors:  Clark Glymour
Journal:  Trends Cogn Sci       Date:  2003-01       Impact factor: 20.229

3.  Distinguishing genuine from spurious causes: a coherence hypothesis.

Authors:  Y Lien; P W Cheng
Journal:  Cogn Psychol       Date:  2000-03       Impact factor: 3.468

4.  Failure to find a learned drive based on hunger; evidence for learning motivated by exploration.

Authors:  A K MYERS; N E MILLER
Journal:  J Comp Physiol Psychol       Date:  1954-12

5.  Categories and causality: the neglected direction.

Authors:  Michael R Waldmann; York Hagmayer
Journal:  Cogn Psychol       Date:  2006-02-23       Impact factor: 3.468

6.  The Psychophysics Toolbox.

Authors:  D H Brainard
Journal:  Spat Vis       Date:  1997

7.  Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat.

Authors:  J C Horvitz; T Stewart; B L Jacobs
Journal:  Brain Res       Date:  1997-06-13       Impact factor: 3.252

Review 8.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

9.  A new one-trial test for neurobiological studies of memory in rats. 1: Behavioral data.

Authors:  A Ennaceur; J Delacour
Journal:  Behav Brain Res       Date:  1988-11-01       Impact factor: 3.332

10.  Striatal activity underlies novelty-based choice in humans.

Authors:  Bianca C Wittmann; Nathaniel D Daw; Ben Seymour; Raymond J Dolan
Journal:  Neuron       Date:  2008-06-26       Impact factor: 17.173

View more
  20 in total

1.  Structured, uncertainty-driven exploration in real-world consumer choice.

Authors:  Eric Schulz; Rahul Bhui; Bradley C Love; Bastien Brier; Michael T Todd; Samuel J Gershman
Journal:  Proc Natl Acad Sci U S A       Date:  2019-06-24       Impact factor: 11.205

2.  Causal Inference About Good and Bad Outcomes.

Authors:  Hayley M Dorfman; Rahul Bhui; Brent L Hughes; Samuel J Gershman
Journal:  Psychol Sci       Date:  2019-02-13

3.  Context-dependent learning and causal structure.

Authors:  Samuel J Gershman
Journal:  Psychon Bull Rev       Date:  2017-04

4.  Deconstructing the human algorithms for exploration.

Authors:  Samuel J Gershman
Journal:  Cognition       Date:  2017-12-29

5.  Impulsivity and risk-seeking as Bayesian inference under dopaminergic control.

Authors:  John G Mikhael; Samuel J Gershman
Journal:  Neuropsychopharmacology       Date:  2021-08-10       Impact factor: 7.853

6.  Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.

Authors:  Jaron T Colas; Neil M Dundon; Raphael T Gerraty; Natalie M Saragosa-Harris; Karol P Szymula; Koranis Tanwisuth; J Michael Tyszka; Camilla van Geen; Harang Ju; Arthur W Toga; Joshua I Gold; Dani S Bassett; Catherine A Hartley; Daphna Shohamy; Scott T Grafton; John P O'Doherty
Journal:  Hum Brain Mapp       Date:  2022-07-21       Impact factor: 5.399

7.  Pure correlates of exploration and exploitation in the human brain.

Authors:  Tommy C Blanchard; Samuel J Gershman
Journal:  Cogn Affect Behav Neurosci       Date:  2018-02       Impact factor: 3.282

Review 8.  Reinforcement Learning and Episodic Memory in Humans and Animals: An Integrative Framework.

Authors:  Samuel J Gershman; Nathaniel D Daw
Journal:  Annu Rev Psychol       Date:  2016-09-02       Impact factor: 24.137

9.  Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.

Authors:  He A Xu; Alireza Modirshanechi; Marco P Lehmann; Wulfram Gerstner; Michael H Herzog
Journal:  PLoS Comput Biol       Date:  2021-06-03       Impact factor: 4.475

10.  Distinct motivations to seek out information in healthy individuals and problem gamblers.

Authors:  Irene Cogliati Dezza; Xavier Noel; Axel Cleeremans; Angela J Yu
Journal:  Transl Psychiatry       Date:  2021-07-26       Impact factor: 6.222

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.