Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A Control Theoretic Model of Adaptive Learning in Dynamic Environments.

Literature DB >> 29877769

A Control Theoretic Model of Adaptive Learning in Dynamic Environments.

Harrison Ritz¹, Matthew R Nassar¹, Michael J Frank¹, Amitai Shenhav¹.
1. Brown University.

Abstract

To behave adaptively in environments that are noisy and nonstationary, humans and other animals must monitor feedback from their environment and adjust their predictions and actions accordingly. An understudied approach for modeling these adaptive processes comes from the engineering field of control theory, which provides general principles for regulating dynamical systems, often without requiring a generative model. The proportional-integral-derivative (PID) controller is one of the most popular models of industrial process control. The proportional term is analogous to the "delta rule" in psychology, adjusting estimates in proportion to each error in prediction. The integral and derivative terms augment this update to simultaneously improve accuracy and stability. Here, we tested whether the PID algorithm can describe how people sequentially adjust their predictions in response to new information. Across three experiments, we found that the PID controller was an effective model of participants' decisions in noisy, changing environments. In Experiment 1, we reanalyzed a change-point detection experiment and showed that participants' behavior incorporated elements of PID updating. In Experiments 2-3, we developed a task with gradual transitions that we optimized to detect PID-like adjustments. In both experiments, the PID model offered better descriptions of behavioral adjustments than both the classical delta-rule model and its more sophisticated variant, the Kalman filter. We further examined how participants weighted different PID terms in response to salient environmental events, finding that these control terms were modulated by reward, surprise, and outcome entropy. These experiments provide preliminary evidence that adaptive learning in dynamic environments resembles PID control.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2018 PMID： 29877769 PMCID： PMC6432773 DOI： 10.1162/jocn_a_01289

Source DB: PubMed Journal: J Cogn Neurosci ISSN： 0898-929X Impact factor: 3.225

64 in total

A Control Theoretic Model of Adaptive Learning in Dynamic Environments.

1. Optimizing the use of information: strategic control of activation of responses.

Review 2. Decision making in recurrent neuronal circuits.

3. Behavioral and neural evidence for item-specific performance monitoring.

4. Functionally dissociable influences on learning rate in a dynamic environment.

5. Beyond trial-by-trial adaptation: A quantification of the time scale of cognitive control.

6. Importance of unpredictability for reward responses in primate dopamine neurons.

7. A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli.

8. Errors and error correction in choice-response tasks.

9. Taming the beast: extracting generalizable knowledge from computational models of cognition.

10. Fictive reward signals in the anterior cingulate cortex.

1. A generative learning model for saccade adaptation.

2. PID Control as a Process of Active Inference with Linear Generative Models.