Literature DB >> 33372237

Meta-control of the exploration-exploitation dilemma emerges from probabilistic inference over a hierarchy of time scales.

Dimitrije Marković1, Thomas Goschke2,3, Stefan J Kiebel4,5.   

Abstract

Cognitive control is typically understood as a set of mechanisms that enable humans to reach goals that require integrating the consequences of actions over longer time scales. Importantly, using routine behaviour or making choices beneficial only at short time scales would prevent one from attaining these goals. During the past two decades, researchers have proposed various computational cognitive models that successfully account for behaviour related to cognitive control in a wide range of laboratory tasks. As humans operate in a dynamic and uncertain environment, making elaborate plans and integrating experience over multiple time scales is computationally expensive. Importantly, it remains poorly understood how uncertain consequences at different time scales are integrated into adaptive decisions. Here, we pursue the idea that cognitive control can be cast as active inference over a hierarchy of time scales, where inference, i.e., planning, at higher levels of the hierarchy controls inference at lower levels. We introduce the novel concept of meta-control states, which link higher-level beliefs with lower-level policy inference. Specifically, we conceptualize cognitive control as inference over these meta-control states, where solutions to cognitive control dilemmas emerge through surprisal minimisation at different hierarchy levels. We illustrate this concept using the exploration-exploitation dilemma based on a variant of a restless multi-armed bandit task. We demonstrate that beliefs about contexts and meta-control states at a higher level dynamically modulate the balance of exploration and exploitation at the lower level of a single action. Finally, we discuss the generalisation of this meta-control concept to other control dilemmas.

Entities:  

Keywords:  Active inference; Arbitration; Exploration-exploitation dilemma; Hierarchy of time scales; Meta-control

Year:  2020        PMID: 33372237     DOI: 10.3758/s13415-020-00837-x

Source DB:  PubMed          Journal:  Cogn Affect Behav Neurosci        ISSN: 1530-7026            Impact factor:   3.282


  58 in total

Review 1.  Deciding How To Decide: Self-Control and Meta-Decision Making.

Authors:  Y-Lan Boureau; Peter Sokol-Hessner; Nathaniel D Daw
Journal:  Trends Cogn Sci       Date:  2015-10-05       Impact factor: 20.229

Review 2.  A Primer on Foraging and the Explore/Exploit Trade-Off for Psychiatry Research.

Authors:  M A Addicott; J M Pearson; M M Sweitzer; D L Barack; M L Platt
Journal:  Neuropsychopharmacology       Date:  2017-05-29       Impact factor: 7.853

3.  Planning as inference.

Authors:  Matthew Botvinick; Marc Toussaint
Journal:  Trends Cogn Sci       Date:  2012-08-30       Impact factor: 20.229

4.  The computational and neural basis of cognitive control: charted territory and new frontiers.

Authors:  Matthew M Botvinick; Jonathan D Cohen
Journal:  Cogn Sci       Date:  2014-07-31

5.  A Large-Scale Circuit Mechanism for Hierarchical Dynamical Processing in the Primate Cortex.

Authors:  Rishidev Chaudhuri; Kenneth Knoblauch; Marie-Alice Gariel; Henry Kennedy; Xiao-Jing Wang
Journal:  Neuron       Date:  2015-10-01       Impact factor: 17.173

6.  Dynamic cognitive models of intertemporal choice.

Authors:  Junyi Dai; Timothy J Pleskac; Thorsten Pachur
Journal:  Cogn Psychol       Date:  2018-03-26       Impact factor: 3.468

Review 7.  Frontal Cortex and the Hierarchical Control of Behavior.

Authors:  David Badre; Derek Evan Nee
Journal:  Trends Cogn Sci       Date:  2017-12-08       Impact factor: 20.229

Review 8.  Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective.

Authors:  Matthew M Botvinick; Yael Niv; Andew G Barto
Journal:  Cognition       Date:  2008-10-15

9.  Learning the value of information in an uncertain world.

Authors:  Timothy E J Behrens; Mark W Woolrich; Mark E Walton; Matthew F S Rushworth
Journal:  Nat Neurosci       Date:  2007-08-05       Impact factor: 24.884

10.  Context-Dependent Risk Aversion: A Model-Based Approach.

Authors:  Darío Cuevas Rivera; Florian Ott; Dimitrije Markovic; Alexander Strobel; Stefan J Kiebel
Journal:  Front Psychol       Date:  2018-10-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.