| Literature DB >> 32210892 |
Randall C O'Reilly1,2, Ananta Nair2, Jacob L Russin1, Seth A Herd1,2.
Abstract
We address the distinction between habitual/automatic vs. goal-directed/controlled behavior, from the perspective of a computational model of the frontostriatal loops. The model exhibits a continuum of behavior between these poles, as a function of the interactive dynamics among different functionally-specialized brain areas, operating iteratively over multiple sequential steps, and having multiple nested loops of similar decision making circuits. This framework blurs the lines between these traditional distinctions in many ways. For example, although habitual actions have traditionally been considered purely automatic, the outer loop must first decide to allow such habitual actions to proceed. Furthermore, because the part of the brain that generates proposed action plans is common across habitual and controlled/goal-directed behavior, the key differences are instead in how many iterations of sequential decision-making are taken, and to what extent various forms of predictive (model-based) processes are engaged. At the core of every iterative step in our model, the basal ganglia provides a "model-free" dopamine-trained Go/NoGo evaluation of the entire distributed plan/goal/evaluation/prediction state. This evaluation serves as the fulcrum of serializing otherwise parallel neural processing. Goal-based inputs to the nominally model-free basal ganglia system are among several ways in which the popular model-based vs. model-free framework may not capture the most behaviorally and neurally relevant distinctions in this area.Entities:
Keywords: automatic processing; basal ganglia; computational modeling; controlled processing; frontal cortex; goals; habits
Year: 2020 PMID: 32210892 PMCID: PMC7076192 DOI: 10.3389/fpsyg.2020.00380
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
FIGURE 1Structure of Proposer-Predictor-Actor-Critic architecture (Herd et al., 2019) across frontal cortex and subcortical areas. We depict two parallel circuits with a hierarchical relationship. The top is a broad functional diagram, emphasizing the serially iterative and hierarchical nature of our proposed decision-making process. The bottom expands those functions, and identifies the brain areas that perform each function.
FIGURE 2Neural network implementation of the Proposer-Predictor-Actor-Critic theory. The model performs a three-factor task of choosing a Plan that accomplishes a current Goal in a current Situation. This abstract task can be conceptualized as navigation, social interaction, etc. The network’s Proposer component selects one Plan, based on pattern completion from inputs representing the current Situation and the current Goal. Each Plan deterministically produces an Outcome, each of which has one associated Result. The model is rewarded if that Result matches the current Goal. The Predictor component (when it is used) then predicts the resulting Outcome and Result (based on the proposed Plan and the current Situation), and the Actor component then uses that prediction as input to accept or reject that plan. If the plan is rejected, this computational cycle begins again with a new plan from the Proposer.
FIGURE 3Model’s simulation of habitization (from Herd et al., 2019). Performance grows faster with more training, but generalization is sacrificed. (A) Performance (% correct). The model with the Proposer component (Full model) performs worse at generalization (test – dashed lines). (B) The Proposer component learns correct behavior over time, with increasing probability of producing optimal plans for first consideration. (C) The Proposer’s learning reduces the total number of plans considered, by providing good options for first consideration, and thus also reduces total performance time. This may capture one factor in habitization in humans and animals.