Literature DB >> 27618944

Reinforcement Learning and Episodic Memory in Humans and Animals: An Integrative Framework.

Samuel J Gershman1, Nathaniel D Daw2.   

Abstract

We review the psychology and neuroscience of reinforcement learning (RL), which has experienced significant progress in the past two decades, enabled by the comprehensive experimental study of simple learning and decision-making tasks. However, one challenge in the study of RL is computational: The simplicity of these tasks ignores important aspects of reinforcement learning in the real world: (a) State spaces are high-dimensional, continuous, and partially observable; this implies that (b) data are relatively sparse and, indeed, precisely the same situation may never be encountered twice; furthermore, (c) rewards depend on the long-term consequences of actions in ways that violate the classical assumptions that make RL tractable. A seemingly distinct challenge is that, cognitively, theories of RL have largely involved procedural and semantic memory, the way in which knowledge about action values or world models extracted gradually from many experiences can drive choice. This focus on semantic memory leaves out many aspects of memory, such as episodic memory, related to the traces of individual events. We suggest that these two challenges are related. The computational challenge can be dealt with, in part, by endowing RL systems with episodic memory, allowing them to (a) efficiently approximate value functions over complex state spaces, (b) learn with very little data, and (c) bridge long-term dependencies between actions and rewards. We review the computational theory underlying this proposal and the empirical evidence to support it. Our proposal suggests that the ubiquitous and diverse roles of memory in RL may function as part of an integrated learning system.

Entities:  

Keywords:  decision making; memory; reinforcement learning

Mesh:

Year:  2016        PMID: 27618944      PMCID: PMC5953519          DOI: 10.1146/annurev-psych-122414-033625

Source DB:  PubMed          Journal:  Annu Rev Psychol        ISSN: 0066-4308            Impact factor:   24.137


  88 in total

Review 1.  Hippocampal replay in the awake state: a potential substrate for memory consolidation and retrieval.

Authors:  Margaret F Carr; Shantanu P Jadhav; Loren M Frank
Journal:  Nat Neurosci       Date:  2011-02       Impact factor: 24.884

2.  Dynamic response-by-response models of matching behavior in rhesus monkeys.

Authors:  Brian Lau; Paul W Glimcher
Journal:  J Exp Anal Behav       Date:  2005-11       Impact factor: 2.468

3.  Solving the credit assignment problem: explicit and implicit learning of action sequences with probabilistic outcomes.

Authors:  Wai-Tat Fu; John R Anderson
Journal:  Psychol Res       Date:  2007-04-20

Review 4.  Context, learning, and extinction.

Authors:  Samuel J Gershman; David M Blei; Yael Niv
Journal:  Psychol Rev       Date:  2010-01       Impact factor: 8.934

Review 5.  The hippocampal-striatal axis in learning, prediction and goal-directed behavior.

Authors:  C M A Pennartz; R Ito; P F M J Verschure; F P Battaglia; T W Robbins
Journal:  Trends Neurosci       Date:  2011-09-01       Impact factor: 13.837

6.  Attention, similarity, and the identification-categorization relationship.

Authors:  R M Nosofsky
Journal:  J Exp Psychol Gen       Date:  1986-03

7.  The role of dopamine in cognitive sequence learning: evidence from Parkinson's disease.

Authors:  Daphna Shohamy; Catherine E Myers; Steven Grossman; Jacob Sage; Mark A Gluck
Journal:  Behav Brain Res       Date:  2005-01-30       Impact factor: 3.332

8.  Human-level control through deep reinforcement learning.

Authors:  Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

9.  Dissociating the role of the orbitofrontal cortex and the striatum in the computation of goal values and prediction errors.

Authors:  Todd A Hare; John O'Doherty; Colin F Camerer; Wolfram Schultz; Antonio Rangel
Journal:  J Neurosci       Date:  2008-05-28       Impact factor: 6.167

10.  Neuron-type-specific signals for reward and punishment in the ventral tegmental area.

Authors:  Jeremiah Y Cohen; Sebastian Haesler; Linh Vong; Bradford B Lowell; Naoshige Uchida
Journal:  Nature       Date:  2012-01-18       Impact factor: 49.962

View more
  66 in total

Review 1.  Age-related variability in decision-making: Insights from neurochemistry.

Authors:  Anne S Berry; William J Jagust; Ming Hsu
Journal:  Cogn Affect Behav Neurosci       Date:  2019-06       Impact factor: 3.282

2.  Sampling memory to make profitable choices.

Authors:  Brice A Kuhl; Nicole M Long
Journal:  Nat Neurosci       Date:  2017-06-27       Impact factor: 24.884

3.  The Successor Representation: Its Computational Logic and Neural Substrates.

Authors:  Samuel J Gershman
Journal:  J Neurosci       Date:  2018-07-13       Impact factor: 6.167

Review 4.  Predictive mechanisms linking brain opioids to chronic pain vulnerability and resilience.

Authors:  Anthony Kenneth Peter Jones; Christopher Andrew Brown
Journal:  Br J Pharmacol       Date:  2017-06-10       Impact factor: 8.739

5.  Ventral striatum lesions do not affect reinforcement learning with deterministic outcomes on slow time scales.

Authors:  Raquel Vicario-Feliciano; Elisabeth A Murray; Bruno B Averbeck
Journal:  Behav Neurosci       Date:  2017-08-14       Impact factor: 1.912

6.  Decision-making Increases Episodic Memory via Postencoding Consolidation.

Authors:  Vishnu P Murty; Sarah DuBrow; Lila Davachi
Journal:  J Cogn Neurosci       Date:  2018-07-31       Impact factor: 3.225

Review 7.  Surviving threats: neural circuit and computational implications of a new taxonomy of defensive behaviour.

Authors:  Joseph LeDoux; Nathaniel D Daw
Journal:  Nat Rev Neurosci       Date:  2018-03-29       Impact factor: 34.870

Review 8.  Believing in dopamine.

Authors:  Samuel J Gershman; Naoshige Uchida
Journal:  Nat Rev Neurosci       Date:  2019-09-30       Impact factor: 34.870

Review 9.  "Chasing the first high": memory sampling in drug choice.

Authors:  Aaron M Bornstein; Hanna Pickard
Journal:  Neuropsychopharmacology       Date:  2020-01-02       Impact factor: 7.853

10.  Stress Disrupts Human Hippocampal-Prefrontal Function during Prospective Spatial Navigation and Hinders Flexible Behavior.

Authors:  Thackery I Brown; Stephanie A Gagnon; Anthony D Wagner
Journal:  Curr Biol       Date:  2020-04-02       Impact factor: 10.834

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.