Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Adaptive Learning through Temporal Dynamics of State Representation.

Literature DB >> 35105677

Adaptive Learning through Temporal Dynamics of State Representation.

Niloufar Razmi^1,2, Matthew R Nassar^3,2.

Abstract

People adjust their learning rate rationally according to local environmental statistics and calibrate such adjustments based on the broader statistical context. To date, no theory has captured the observed range of adaptive learning behaviors or the complexity of its neural correlates. Here, we attempt to do so using a neural network model that learns to map an internal context representation onto a behavioral response via supervised learning. The network shifts its internal context on receiving supervised signals that are mismatched to its output, thereby changing the "state" to which feedback is associated. A key feature of the model is that such state transitions can either increase learning or decrease learning depending on the duration over which the new state is maintained. Sustained state transitions that occur after changepoints facilitate faster learning and mimic network reset phenomena observed in the brain during rapid learning. In contrast, state transitions after one-off outlier events are short lived, thereby limiting the impact of outlying observations on future behavior. State transitions in our model provide the first mechanistic interpretation for bidirectional learning signals, such as the P300, that relate to learning differentially according to the source of surprising events and may also shed light on discrepant observations regarding the relationship between transient pupil dilations and learning. Together, our results demonstrate that dynamic latent state representations can afford normative inference and provide a coherent framework for understanding neural signatures of adaptive learning across different statistical environments.SIGNIFICANCE STATEMENT How humans adjust their sensitivity to new information in a changing world has remained largely an open question. Bridging insights from normative accounts of adaptive learning and theories of latent state representation, here we propose a feedforward neural network model that adjusts its learning rate online by controlling the speed of transitioning its internal state representations. Our model proposes a mechanistic framework for explaining learning under different statistical contexts, explains previously observed behavior and brain signals, and makes testable predictions for future experimental studies.

Entities: Chemical

Keywords: Bayesian; P300; adaptive learning; neural network; pupillometry; representation

Mesh：

Year: 2022 PMID： 35105677 PMCID： PMC8944236 DOI： 10.1523/JNEUROSCI.0387-21.2022

Source DB: PubMed Journal: J Neurosci ISSN： 0270-6474 Impact factor: 6.709

53 in total

1. Conflict monitoring and cognitive control.

Authors: M M Botvinick; T S Braver; D M Barch; C S Carter; J D Cohen
Journal: Psychol Rev Date: 2001-07 Impact factor: 8.934

2. Functionally dissociable influences on learning rate in a dynamic environment.

Authors: Joseph T McGuire; Matthew R Nassar; Joshua I Gold; Joseph W Kable
Journal: Neuron Date: 2014-11-19 Impact factor: 17.173

3. Designer receptor manipulations reveal a role of the locus coeruleus noradrenergic system in isoflurane general anesthesia.

Authors: Elena M Vazey; Gary Aston-Jones
Journal: Proc Natl Acad Sci U S A Date: 2014-02-24 Impact factor: 11.205

Review 1. Filling the gaps: Cognitive control as a critical lens for understanding mechanisms of value-based decision-making.

Authors: R Frömer; A Shenhav
Journal: Neurosci Biobehav Rev Date: 2021-12-10 Impact factor: 8.989

1 in total

Adaptive Learning through Temporal Dynamics of State Representation.

1. Conflict monitoring and cognitive control.

2. Functionally dissociable influences on learning rate in a dynamic environment.

3. Designer receptor manipulations reveal a role of the locus coeruleus noradrenergic system in isoflurane general anesthesia.

Review 4. Reward prediction errors create event boundaries in memory.

5. The influence of context boundaries on memory for the sequential order of events.

6. The stability flexibility tradeoff and the dark side of detail.

7. Catecholaminergic Regulation of Learning Rate in a Dynamic Environment.

8. Dynamic routing of task-relevant signals for decision making in dorsolateral prefrontal cortex.

9. Dissociable effects of surprise and model update in parietal and anterior cingulate cortex.

10. Flexible combination of reward information across primates.

Review 1. Filling the gaps: Cognitive control as a critical lens for understanding mechanisms of value-based decision-making.