| Literature DB >> 31110338 |
Christoph W Korn1,2,3, Dominik R Bach4,5,6.
Abstract
Jointly minimizing multiple threats over extended time horizons enhances survival. Consequently, many tests of approach-avoidance conflicts incorporate multiple threats for probing corollaries of animal and human anxiety. To facilitate computations necessary for threat minimization, the human brain may concurrently harness multiple decision policies and associated neural controllers, but it is unclear which. We combine a task that mimics foraging under predation with behavioural modelling and functional neuroimaging. Human choices rely on immediate predator probability-a myopic heuristic policy-and on the optimal policy, which integrates all relevant variables. Predator probability relates positively and the associated choice uncertainty relates negatively to activations in the anterior hippocampus, amygdala and dorsolateral prefrontal cortex. The optimal policy is positively associated with dorsomedial prefrontal cortex activity. We thus provide a decision-theoretic outlook on the role of the human hippocampus, amygdala and prefrontal cortex in resolving approach-avoidance conflicts relevant for anxiety and integral for survival.Entities:
Mesh:
Year: 2019 PMID: 31110338 PMCID: PMC6629544 DOI: 10.1038/s41562-019-0603-9
Source DB: PubMed Journal: Nat Hum Behav ISSN: 2397-3374
Figure 1Task outline.
Participants performed a sequential decision-making task that was framed as foraging under the dual threats of predation and starvation. Current energy was depicted as an energy bar. Participants were monetarily incentivized to keep their energy above zero and to attain the maximum energy state of five as often as possible (intermediate energy states did not directly translate into monetary payoffs). For each mini-block of trials, called “forest,” energy was reset and foraging options varied. During the initial forest phase (3.5 sec), participants were informed about their initial energy (2, 3, or 4 energy points) and the two possible foraging options in the current forest over five trials, called “days.” The number of days was not depicted and varied according to an exponential distribution (for fMRI design efficiency) but participants knew that their final payoffs would depend on a subset of forests for which they would complete exactly five days. Triangles of four different sizes showed the current predator probability (0.1, 0.2, 0.3, or 0.4). Green rectangles of four different sizes depicted the probability of foraging gain (0.2, 0.4, 0.6, or 0.8). The number of blue dots within the green rectangles showed gain magnitudes (varying from 0 to 4). Dark red dots showed losses, which were set to 1 for the waiting option (with a probability of 1) and set to 2 for the foraging option (with a probability of 1 - (probability of foraging gain)). After a fixation interval (3.5 sec) the choice phase started (2.0 sec): One of the two foraging options was presented on the screen (with a probability of 0.5) and participants had to make a decision between waiting and this foraging option. Sides were counterbalanced. If participants failed to respond, the words “Too slow” appeared. In the example shown, the participant chose foraging (indicated by an asterisk). After an interval of 1.0 sec, the outcome phase started (1.0 sec). In the current example, the predator attacked and all energy points were lost. After a variable fixation interval (between 0.5 and 3.8 sec), a new day or a new forest was depicted.
List of variables used in behavioural model comparisons, RT analyses, and fMRI analyses
| Variable | Description |
|---|---|
| #1 optimal policy | The optimal decision variable (DV) that takes into account all remaining time points (days), energy states, and the transition probabilities between them. Therefore, optimal policy cannot be directly inferred from information on the screen but necessitates rather complex computations. |
| #2 predator probability | The probability of the predator attacking when the foraging option is chosen. This probability is depicted by the size of a triangle on the screen and takes the values of 0.1, 0.2, 0.3, or 0.4. A predator attack leads to immediate death (i.e., an energy state of zero). |
| #3 probability of foraging gain | The probability of obtaining the depicted number of gains when the foraging option is chosen. This probability is depicted by the size of a green rectangle on the screen and takes the values of 0.2, 0.4, 0.6, or 0.8. |
| #4 gain magnitude | The number of points added to the energy state if foraging is successful and the predator does not attack. Gains vary from zero to four and are depicted by the number of blue dots within the green rectangle. Waiting entails a sure loss of one energy point and unsuccessful foraging always entails a constant loss magnitude of two points (depicted as red dots). |
| #5 continuous energy | Energy varies on a continuous scale from one to five in steps of one point. Zero energy corresponds to being dead (and thus no choice can be made in a zero energy state). |
| #6 binary energy | An energy state of one is special because in that state waiting leads to sure death. The “binary energy” variable therefore distinguishes between an energy state of one and all other energy states. |
| #7 expected energy | The foraging option entails an expected energy state, in the sense of expected value. This metric is thus calculated from the predator probability, the probability of foraging gain, the obtainable gain magnitude, and the continuous energy state (as well as the constant loss magnitude). The expected energy variable only takes one time step into account. |
| #8 expected energy change | The difference between the current energy state and the expected energy state for foraging (see #7). |
| #9 days past | The number of days (i.e., time steps) already spent in a given forest. The maximum number of days within a forest is always five. |
| #10 pseudo-optimal: horizon-1 | This policy is optimal in the final time step (i.e., when only one day is left within a forest). Otherwise it can be regarded as pseudo-optimal because it is too short-sighted. |
| #11 pseudo-optimal: starvation-only | This policy would be optimal if no predators were present. |
| #12 pseudo-optimal: predation-only | This policy would be optimal if starvation were not possible. |
| #13 pseudo-optimal: horizon-2.5 | This policy would be optimal if participants would not have be rewarded according to a full horizon of 5 days but according to an average horizon of 2.5 days, which was implemented to in the main experiment to enhance fMRI design efficiency. |
| #14 past energy change | The difference between the energy states between choice and outcome phases of the past trial. Due to the Markov property of the task this past change is irrelevant for the optimal policy. This same metric can be evaluated during the outcome phase of each trial and signals how many energy points are gained or lost in the given trial. This variables is thus included as a parametric modulator during the outcome phase. |
| #15 win-stay-lose-shift | This DV prescribes foraging if the energy state increased with respect to the past trial and waiting if the energy state decreased. Win-stay-lose-shift is a binarized version of “past energy change.” |
| #16 death in past forest | Binary variable indicating whether participants reached zero energy points in the forest immediately prior to the current forest. |
| #17 uncertainty of p predator | When the prescriptions of the employed
heuristic policy, i.e., of p predator, are closer to 0 (i.e., waiting)
or 1 (i.e., foraging) uncertainty is lower than when the prescriptions
lie in-between. Uncertainty is indexed by the derivative of the mean of
the logistic function for p predator (see |
| #18 uncertainty of optimal policy | When the absolute value differences according
to the optimal policy are large (i.e., either clearly prescribing
waiting or foraging) uncertainty is lower than when the absolute value
differences are small (i.e., the optimal policy is more or less
indifferent). Uncertainty is indexed by the derivative of the mean of
the logistic function for the optimal policy (see |
| #19 discrepancy in choice probabilities between p predator & optimal policy | In some cases, the heuristic policy of using p
predator and the optimal policy make quite distinct prescriptions (high
discrepancy), whereas in others they make quite similar prescriptions
(low discrepancy). Discrepancy is indexed by the absolute differences of
two logistic functions (see |
Variables #1 to #16 were used in models of choice behaviour. On the basis of the two variables in the behavioural model (i.e., on the basis of #1 and #2), we derived variables #17 to #19 and included these in RT and fMRI analyses.
Figure 2Models of choice data in the fMRI sample.
(a) Bayesian Model comparisons show that the probability of predator was the best single predictor of participants’ choices. The plot depicts fixed-effects analyses using relative log-group Bayes factors based on Bayesian Information Criterion (BIC) relative to model #1.
(b) A model that additionally included the optimal policy best explained remaining variance in participants’ choices.
(c) The winning model captures the relationship between participants’ average choices and the predator probability.
(d) The winning model captures the relationship between participants’ average choices and the optimal policy (binned value differences of foraging versus waiting).
Number of participants in the fMRI sample, n=24. Number of participants in the behavioural sample, n=23. Better fit is indicated by smaller log-group Bayes factors (i.e., larger negative values). In the right-hand panels error bars are SEM. Per data bin, circles depict mean empirical data points and lines and crosses depict mean model predictions (averaged for simulated data according to each participant’s model fit). See Table 1 for a list that specifies the considered decision variables (DVs) for the task and thus the models tested here. See Supplementary Tables 1-3 for detailed model comparisons in the fMRI sample and Supplementary Tables 4-6 for detailed model comparisons in the behavioural sample. Supplementary Tables 7-9 present shared variances between the DVs and confusion matrices. See Supplementary Figure 1 for the relationships among the 16 DVs included in the models. See Supplementary Figures 2, 3 for plots showing that the winning model captures the data split according to the other 14 DVs and that the winning model makes better qualitative predictions than the other models considered. Supplementary Figure 4 depicts individual variability in the fMRI sample.
Reaction time data: Linear mixed effects model
Log-transformed RTs were analysed using linear mixed effects models; as implemented in the R package lmer. P-values and degrees of freedom were derived using the R package lmerTest. Significant effects are printed in bold font. Log-likelihood differences were calculated between the models including all fixed effects relative to the models without the respective fixed effect (but with the same random-effects structure). Better fit is indicated by smaller log-likelihood differences (i.e., larger negative values).
| Predictor | estimates | degrees of freedom | t-values | p-values | log-likelihood difference | 95%-confidence interval | |
|---|---|---|---|---|---|---|---|
| lower limit | upper limit | ||||||
| fMRI sample (n=24) | |||||||
| Intercept | 6.66 | 24.45 | 6.56 | 6.75 | |||
| #1 p predator | 0.02 | 27.07 | 0.27 | 0.786 | -0.1 | -0.15 | 0.20 |
| #2 optimal policy | -0.06 | 30.67 | -4.0 | -0.10 | -0.02 | ||
| #3 uncertainty of p predator | 0.55 | 25.17 | -4.8 | 0.22 | 0.87 | ||
| #4 uncertainty of optimal policy | 0.12 | 43.17 | 1.73 | 0.091 | -1.4 | -0.02 | 0.27 |
| #5 discrepancy between p predator & optimal policy | 0.20 | 28.85 | -10.1 | 0.13 | 0.27 | ||
| Behavioural sample (n=23) | |||||||
| Intercept | 6.69 | 23.05 | 6.58 | 6.80 | |||
| #1 p predator | -0.00 | 25.00 | -0.04 | 0.970 | 0.0 | -0.17 | 0.16 |
| #2 optimal policy | -0.05 | 35.64 | -4.1 | -0.09 | -0.02 | ||
| #3 uncertainty of p predator | 0.31 | 28.31 | -3.2 | 0.07 | 0.54 | ||
| #4 uncertainty of optimal policy | -0.07 | 31.67 | -0.85 | 0.400 | -1.1 | -0.22 | 0.09 |
| #5 discrepancy between p predator & optimal policy | 0.12 | 22.34 | -4.9 | 0.05 | 0.20 | ||
Figure 3FMRI results during the choice phase.
(a) Predator probability showed a positive relation within a cluster spanning right anterior hippocampus and amygdala as well as within bilateral DLPFC.
(b) The choice uncertainty according to the predator probability showed a negative relation in right anterior hippocampus extending into amygdala as well as bilateral DLPFC and right lateral IFG.
(c) The optimal policy showed a positive relation within DMPFC, extending into pre-SMA and ACC, and thalamus among other regions.
(d) The choice uncertainty according to the optimal policy showed a positive relation in DMPFC extending into pre-SMA.
Number of participants, n=24. Overlay on group average T1-weighted image in MNI space; clusters are whole-brain family-wise error (FWE) corrected for multiple comparisons at p < 0.05 with a cluster-defining threshold of p < 0.001. See Table 3 for a list of all clusters. See Supplementary Table 11 for the relationships among the variables included as parametric modulators during the choice phase. See Figure 4 for the overlap of the hippocampus/amygdala clusters depicted in (a) and (b). Supplementary Figure 5 visualizes the overlap of all clusters. See Supplementary Table 12 and Supplementary Figure 6 for fMRI results during the outcome phase. See Supplementary Table 13 for the results of a secondary model that additionally included participants’ choices as a parametric modulator. See Supplementary Table 14 for the results of a tertiary model in which choice uncertainties were derived from the behavioural sample. Supplementary Table 15 and Supplementary Figure 8 present the results of a covariate analyses testing for inter-individual differences. Supplementary Table 16 and Supplementary Figures 8, 9 present non-independent region of interest analyses. See Supplementary Table 17 for contrasts testing for interaction effects. Supplementary Figure 10 visualizes the overlap between clusters from the current study and clusters from a related previous study 2.
FMRI results during choice phase (primary GLM)
Clusters are whole-brain FWE corrected for multiple comparisons at p <0.05 with a cluster-defining threshold of p < 0.001. Number of participants, n=24.
| Side | Peak voxel MNI coordinates (mm) | Cluster size (Voxel) | Peak t score | |||
|---|---|---|---|---|---|---|
| X | Y | Z | ||||
| Dorsolateral prefrontal cortex (DLPFC) | R | 30 | 36 | 48 | 1325 | 6.69 |
| Medial occipital cortex | L | -14 | -74 | -9 | 1112 | 6.17 |
| Anterior hippocampus extending into amygdala | R | 26 | -6 | -23 | 340 | 6.17 |
| DLPFC | L | -29 | 33 | 50 | 194 | 4.75 |
| Inferior occipital gyrus | L | -29 | -87 | -12 | 462 | 6.48 |
| Inferior occipital gyrus | R | 21 | -92 | -9 | 294 | 6.05 |
| Thalamus | L | -8 | -21 | 6 | 250 | 7.92 |
| Inferior frontal gyrus (IFG) extending into insula | R | 32 | 23 | 5 | 950 | 7.58 |
| Anterior cingulate cortex | L | -9 | 32 | 20 | 260 | 6.59 |
| Medial occipital cortex | L | -23 | -72 | -9 | 417 | 6.25 |
| Thalamus | R | 11 | -27 | -6 | 408 | 5.69 |
| Posterior dorsal medial prefrontal cortex (DMPFC) extending into supplementary motor area | L/R | 8 | 23 | 53 | 1393 | 5.57 |
| IFG extending into insula | L | -30 | 21 | -8 | 152 | 4.56 |
| Superior parietal gyrus | L | -17 | -57 | 69 | 472 | 7.41 |
| None | ||||||
| DLPFC extending into DMPFC | R | 14 | 38 | 54 | 1977 | 6.74 |
| Posterior middle temporal gyrus | R | 59 | -56 | -5 | 197 | 6.10 |
| Lateral IFG | R | 53 | 39 | -6 | 572 | 6.06 |
| Anterior hippocampus extending into amygdala | R | 21 | -9 | -18 | 672 | 5.94 |
| Medial occipital cortex | L | -14 | -87 | 32 | 145 | 5.00 |
| DLPFC | L | -32 | 29 | 51 | 149 | 4.83 |
| DLPFC & DMPFC | R | 17 | 12 | 57 | 2484 | 6.96 |
| Insula | R | 21 | 23 | -5 | 707 | 6.37 |
| DLPFC | R | 39 | 35 | 41 | 935 | 6.16 |
| DLPFC extending lateral IFG | R | 41 | 54 | 12 | 800 | 5.36 |
| Posterior cingulate cortex | L | -8 | -51 | 33 | 149 | 4.84 |
| Thalamus | L | -9 | -9 | 2 | 164 | 5.34 |
| None | ||||||
Figure 4Visualization of the clusters in the hippocampus extending into the amygdala.
(a) Overlap of the functional clusters for predator probability per se (red) and for the choice uncertainty of predator probability (yellow; see also depiction of these clusters in Figure 3a, 3b). Comparing these functional clusters to anatomical masks of the Automated Anatomical Labeling (AAL) atlas showed that of the 340 voxels in the cluster for predator probability per se 194 were in the hippocampus and 113 in the amygdala. Of the 672 voxels in the cluster for uncertainty of predator probability 294 were in the hippocampus and 270 in the amygdala.
(b) Visualization of the relation between the parameter estimates in the overlap region of interest (ROI) depicted in (a). Parameter estimates were derived from a follow-up GLM, in which the four levels of predator probability were modelled as separate onset regressors for the choice phase. Notably, the predator probabilities used here cover a different range than that used in a previous study18 which showed a purely linear impact of predator probability on threat probability (with threat probabilities: 0.2, 0.5, 0.8). Please note that the number of data points was not equally distributed for the four bins. Parameter estimates were extracted using the toolbox marsbar.
Number of participants, n=24.