Literature DB >> 35105677

Adaptive Learning through Temporal Dynamics of State Representation.

Niloufar Razmi1,2, Matthew R Nassar3,2.   

Abstract

People adjust their learning rate rationally according to local environmental statistics and calibrate such adjustments based on the broader statistical context. To date, no theory has captured the observed range of adaptive learning behaviors or the complexity of its neural correlates. Here, we attempt to do so using a neural network model that learns to map an internal context representation onto a behavioral response via supervised learning. The network shifts its internal context on receiving supervised signals that are mismatched to its output, thereby changing the "state" to which feedback is associated. A key feature of the model is that such state transitions can either increase learning or decrease learning depending on the duration over which the new state is maintained. Sustained state transitions that occur after changepoints facilitate faster learning and mimic network reset phenomena observed in the brain during rapid learning. In contrast, state transitions after one-off outlier events are short lived, thereby limiting the impact of outlying observations on future behavior. State transitions in our model provide the first mechanistic interpretation for bidirectional learning signals, such as the P300, that relate to learning differentially according to the source of surprising events and may also shed light on discrepant observations regarding the relationship between transient pupil dilations and learning. Together, our results demonstrate that dynamic latent state representations can afford normative inference and provide a coherent framework for understanding neural signatures of adaptive learning across different statistical environments.SIGNIFICANCE STATEMENT How humans adjust their sensitivity to new information in a changing world has remained largely an open question. Bridging insights from normative accounts of adaptive learning and theories of latent state representation, here we propose a feedforward neural network model that adjusts its learning rate online by controlling the speed of transitioning its internal state representations. Our model proposes a mechanistic framework for explaining learning under different statistical contexts, explains previously observed behavior and brain signals, and makes testable predictions for future experimental studies.
Copyright © 2022 the authors.

Entities:  

Keywords:  Bayesian; P300; adaptive learning; neural network; pupillometry; representation

Mesh:

Year:  2022        PMID: 35105677      PMCID: PMC8944236          DOI: 10.1523/JNEUROSCI.0387-21.2022

Source DB:  PubMed          Journal:  J Neurosci        ISSN: 0270-6474            Impact factor:   6.709


  53 in total

1.  Conflict monitoring and cognitive control.

Authors:  M M Botvinick; T S Braver; D M Barch; C S Carter; J D Cohen
Journal:  Psychol Rev       Date:  2001-07       Impact factor: 8.934

2.  Functionally dissociable influences on learning rate in a dynamic environment.

Authors:  Joseph T McGuire; Matthew R Nassar; Joshua I Gold; Joseph W Kable
Journal:  Neuron       Date:  2014-11-19       Impact factor: 17.173

3.  Designer receptor manipulations reveal a role of the locus coeruleus noradrenergic system in isoflurane general anesthesia.

Authors:  Elena M Vazey; Gary Aston-Jones
Journal:  Proc Natl Acad Sci U S A       Date:  2014-02-24       Impact factor: 11.205

Review 4.  Reward prediction errors create event boundaries in memory.

Authors:  Nina Rouhani; Kenneth A Norman; Yael Niv; Aaron M Bornstein
Journal:  Cognition       Date:  2020-06-17

5.  The influence of context boundaries on memory for the sequential order of events.

Authors:  Sarah DuBrow; Lila Davachi
Journal:  J Exp Psychol Gen       Date:  2013-08-19

6.  The stability flexibility tradeoff and the dark side of detail.

Authors:  Matthew R Nassar; Vanessa Troiani
Journal:  Cogn Affect Behav Neurosci       Date:  2020-11-24       Impact factor: 3.526

7.  Catecholaminergic Regulation of Learning Rate in a Dynamic Environment.

Authors:  Marieke Jepma; Peter R Murphy; Matthew R Nassar; Mauricio Rangel-Gomez; Martijn Meeter; Sander Nieuwenhuis
Journal:  PLoS Comput Biol       Date:  2016-10-28       Impact factor: 4.475

8.  Dynamic routing of task-relevant signals for decision making in dorsolateral prefrontal cortex.

Authors:  Christopher H Donahue; Daeyeol Lee
Journal:  Nat Neurosci       Date:  2015-01-12       Impact factor: 24.884

9.  Dissociable effects of surprise and model update in parietal and anterior cingulate cortex.

Authors:  Jill X O'Reilly; Urs Schüffelgen; Steven F Cuell; Timothy E J Behrens; Rogier B Mars; Matthew F S Rushworth
Journal:  Proc Natl Acad Sci U S A       Date:  2013-08-28       Impact factor: 11.205

10.  Flexible combination of reward information across primates.

Authors:  Shiva Farashahi; Christopher H Donahue; Benjamin Y Hayden; Daeyeol Lee; Alireza Soltani
Journal:  Nat Hum Behav       Date:  2019-09-09
View more
  1 in total

Review 1.  Filling the gaps: Cognitive control as a critical lens for understanding mechanisms of value-based decision-making.

Authors:  R Frömer; A Shenhav
Journal:  Neurosci Biobehav Rev       Date:  2021-12-10       Impact factor: 8.989

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.