Literature DB >> 29877769

A Control Theoretic Model of Adaptive Learning in Dynamic Environments.

Harrison Ritz1, Matthew R Nassar1, Michael J Frank1, Amitai Shenhav1.   

Abstract

To behave adaptively in environments that are noisy and nonstationary, humans and other animals must monitor feedback from their environment and adjust their predictions and actions accordingly. An understudied approach for modeling these adaptive processes comes from the engineering field of control theory, which provides general principles for regulating dynamical systems, often without requiring a generative model. The proportional-integral-derivative (PID) controller is one of the most popular models of industrial process control. The proportional term is analogous to the "delta rule" in psychology, adjusting estimates in proportion to each error in prediction. The integral and derivative terms augment this update to simultaneously improve accuracy and stability. Here, we tested whether the PID algorithm can describe how people sequentially adjust their predictions in response to new information. Across three experiments, we found that the PID controller was an effective model of participants' decisions in noisy, changing environments. In Experiment 1, we reanalyzed a change-point detection experiment and showed that participants' behavior incorporated elements of PID updating. In Experiments 2-3, we developed a task with gradual transitions that we optimized to detect PID-like adjustments. In both experiments, the PID model offered better descriptions of behavioral adjustments than both the classical delta-rule model and its more sophisticated variant, the Kalman filter. We further examined how participants weighted different PID terms in response to salient environmental events, finding that these control terms were modulated by reward, surprise, and outcome entropy. These experiments provide preliminary evidence that adaptive learning in dynamic environments resembles PID control.

Entities:  

Mesh:

Year:  2018        PMID: 29877769      PMCID: PMC6432773          DOI: 10.1162/jocn_a_01289

Source DB:  PubMed          Journal:  J Cogn Neurosci        ISSN: 0898-929X            Impact factor:   3.225


  64 in total

1.  Optimizing the use of information: strategic control of activation of responses.

Authors:  G Gratton; M G Coles; E Donchin
Journal:  J Exp Psychol Gen       Date:  1992-12

Review 2.  Decision making in recurrent neuronal circuits.

Authors:  Xiao-Jing Wang
Journal:  Neuron       Date:  2008-10-23       Impact factor: 17.173

3.  Behavioral and neural evidence for item-specific performance monitoring.

Authors:  Chris Blais; Silvia Bunge
Journal:  J Cogn Neurosci       Date:  2010-12       Impact factor: 3.225

4.  Functionally dissociable influences on learning rate in a dynamic environment.

Authors:  Joseph T McGuire; Matthew R Nassar; Joshua I Gold; Joseph W Kable
Journal:  Neuron       Date:  2014-11-19       Impact factor: 17.173

5.  Beyond trial-by-trial adaptation: A quantification of the time scale of cognitive control.

Authors:  Bart Aben; Tom Verguts; Eva Van den Bussche
Journal:  J Exp Psychol Hum Percept Perform       Date:  2017-01-12       Impact factor: 3.332

6.  Importance of unpredictability for reward responses in primate dopamine neurons.

Authors:  J Mirenowicz; W Schultz
Journal:  J Neurophysiol       Date:  1994-08       Impact factor: 2.714

7.  A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli.

Authors:  J M Pearce; G Hall
Journal:  Psychol Rev       Date:  1980-11       Impact factor: 8.934

8.  Errors and error correction in choice-response tasks.

Authors:  P M Rabbitt
Journal:  J Exp Psychol       Date:  1966-02

9.  Taming the beast: extracting generalizable knowledge from computational models of cognition.

Authors:  Matthew R Nassar; Michael J Frank
Journal:  Curr Opin Behav Sci       Date:  2016-10

10.  Fictive reward signals in the anterior cingulate cortex.

Authors:  Benjamin Y Hayden; John M Pearson; Michael L Platt
Journal:  Science       Date:  2009-05-15       Impact factor: 47.728

View more
  2 in total

1.  A generative learning model for saccade adaptation.

Authors:  Carlos R Cassanello; Florian Ostendorf; Martin Rolfs
Journal:  PLoS Comput Biol       Date:  2019-08-09       Impact factor: 4.475

2.  PID Control as a Process of Active Inference with Linear Generative Models.

Authors:  Manuel Baltieri; Christopher L Buckley
Journal:  Entropy (Basel)       Date:  2019-03-07       Impact factor: 2.524

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.