Literature DB >> 33362625

The Missing Link Between Memory and Reinforcement Learning.

Christian Balkenius1, Trond A Tjøstheim1, Birger Johansson1, Annika Wallin1, Peter Gärdenfors1,2.   

Abstract

Reinforcement learning systems usually assume that a value function is defined over all states (or state-action pairs) that can immediately give the value of a particular state or action. These values are used by a selection mechanism to decide which action to take. In contrast, when humans and animals make decisions, they collect evidence for different alternatives over time and take action only when sufficient evidence has been accumulated. We have previously developed a model of memory processing that includes semantic, episodic and working memory in a comprehensive architecture. Here, we describe how this memory mechanism can support decision making when the alternatives cannot be evaluated based on immediate sensory information alone. Instead we first imagine, and then evaluate a possible future that will result from choosing one of the alternatives. Here we present an extended model that can be used as a model for decision making that depends on accumulating evidence over time, whether that information comes from the sequential attention to different sensory properties or from internal simulation of the consequences of making a particular choice. We show how the new model explains both simple immediate choices, choices that depend on multiple sensory factors and complicated selections between alternatives that require forward looking simulations based on episodic and semantic memory structures. In this framework, vicarious trial and error is explained as an internal simulation that accumulates evidence for a particular choice. We argue that a system like this forms the "missing link" between more traditional ideas of semantic and episodic memory, and the associative nature of reinforcement learning.
Copyright © 2020 Balkenius, Tjøstheim, Johansson, Wallin and Gärdenfors.

Entities:  

Keywords:  accumulator model; decision making; episodic memory; memory model; semantic memory

Year:  2020        PMID: 33362625      PMCID: PMC7758424          DOI: 10.3389/fpsyg.2020.560080

Source DB:  PubMed          Journal:  Front Psychol        ISSN: 1664-1078


  34 in total

1.  Loss aversion and inhibition in dynamical models of multialternative choice.

Authors:  Marius Usher; James L McClelland
Journal:  Psychol Rev       Date:  2004-07       Impact factor: 8.934

2.  Purposive behavior and cognitive mapping: a neural network model.

Authors:  N A Schmajuk; A D Thieme
Journal:  Biol Cybern       Date:  1992       Impact factor: 2.086

3.  Patients with hippocampal amnesia cannot imagine new experiences.

Authors:  Demis Hassabis; Dharshan Kumaran; Seralynne D Vann; Eleanor A Maguire
Journal:  Proc Natl Acad Sci U S A       Date:  2007-01-17       Impact factor: 11.205

4.  Arousal-Biased Competition in Perception and Memory.

Authors:  Mara Mather; Matthew R Sutherland
Journal:  Perspect Psychol Sci       Date:  2011-03

5.  Synaptic depression and cortical gain control.

Authors:  L F Abbott; J A Varela; K Sen; S B Nelson
Journal:  Science       Date:  1997-01-10       Impact factor: 47.728

6.  Dynamics of pattern formation in lateral-inhibition type neural fields.

Authors:  S Amari
Journal:  Biol Cybern       Date:  1977-08-03       Impact factor: 2.086

7.  Varieties of attention-deficit/hyperactivity disorder-related intra-individual variability.

Authors:  F Xavier Castellanos; Edmund J S Sonuga-Barke; Anouk Scheres; Adriana Di Martino; Christopher Hyde; Judith R Walters
Journal:  Biol Psychiatry       Date:  2005-01-28       Impact factor: 13.382

8.  Episodic future thinking.

Authors:  Cristina M. Atance; Daniela K. O'Neill
Journal:  Trends Cogn Sci       Date:  2001-12-01       Impact factor: 20.229

9.  A neural model of the dynamic activation of memory.

Authors:  M Herrmann; E Ruppin; M Usher
Journal:  Biol Cybern       Date:  1993       Impact factor: 2.086

10.  Looking is buying. How visual attention and choice are affected by consumer preferences and properties of the supermarket shelf.

Authors:  Kerstin Gidlöf; Andrey Anikin; Martin Lingonblad; Annika Wallin
Journal:  Appetite       Date:  2017-04-19       Impact factor: 3.868

View more
  1 in total

1.  Direct Approach or Detour: A Comparative Model of Inhibition and Neural Ensemble Size in Behavior Selection.

Authors:  Trond A Tjøstheim; Birger Johansson; Christian Balkenius
Journal:  Front Syst Neurosci       Date:  2021-11-09
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.