| Literature DB >> 25234119 |
Tobias U Hauser1, Reto Iannaccone2, Susanne Walitza3, Daniel Brandeis4, Silvia Brem5.
Abstract
Adolescence is associated with quickly changing environmental demands which require excellent adaptive skills and high cognitive flexibility. Feedback-guided adaptive learning and cognitive flexibility are driven by reward prediction error (RPE) signals, which indicate the accuracy of expectations and can be estimated using computational models. Despite the importance of cognitive flexibility during adolescence, only little is known about how RPE processing in cognitive flexibility deviates between adolescence and adulthood. In this study, we investigated the developmental aspects of cognitive flexibility by means of computational models and functional magnetic resonance imaging (fMRI). We compared the neural and behavioral correlates of cognitive flexibility in healthy adolescents (12-16years) to adults performing a probabilistic reversal learning task. Using a modified risk-sensitive reinforcement learning model, we found that adolescents learned faster from negative RPEs than adults. The fMRI analysis revealed that within the RPE network, the adolescents had a significantly altered RPE-response in the anterior insula. This effect seemed to be mainly driven by increased responses to negative prediction errors. In summary, our findings indicate that decision making in adolescence goes beyond merely increased reward-seeking behavior and provides a developmental perspective to the behavioral and neural mechanisms underlying cognitive flexibility in the context of reinforcement learning.Entities:
Keywords: Adolescence; Cognitive flexibility; Development; Functional magnetic resonance imaging (fMRI); Reward prediction errors
Mesh:
Year: 2014 PMID: 25234119 PMCID: PMC4330550 DOI: 10.1016/j.neuroimage.2014.09.018
Source DB: PubMed Journal: Neuroimage ISSN: 1053-8119 Impact factor: 6.556
Fig. 1Probabilistic reversal learning task. On each trial (average duration: 9000 ms), two stimuli were simultaneously presented. The participant had to select one of the stimuli within 1500 ms. The selected stimulus was highlighted until the end of the stimulus presentation (2500 ms). After a jittered interstimulus interval (2000–4000 ms), the outcome was displayed for 1000 ms. Rewards were indicated by a framed coin whereas punishments were depicted by a crossed coin. Between trials, a jittered fixation cross was shown (2000–4000 ms).
Reward prediction errors in cognitive flexibility. Regions which correlate with RPEs across all subjects (p < .05 FWE; only clusters with k > 29 are listed). All coordinates are reported in MNI space. RPE: increasing activity with increasing RPEs; − RPE: decreasing RPEs elicit more activity; aIns: anterior insula; amygd: amygdala; dmPFC: dorsomedial prefrontal cortex; dlPFC: dorsolateral prefrontal cortex; IPL: inferior prefrontal cortex; mPFC: medial prefrontal cortex; PCC: posterior cingulate cortex; SFG: superior frontal gyrus; vmPFC: ventromedial prefrontal cortex.
| Contrast | Region | Hemisphere | Cluster size (voxels) | ||||
|---|---|---|---|---|---|---|---|
| RPE | amygd | Right | 95 | 18 | − 7.5 | − 18 | 6.74 |
| Left | 69 | − 27 | − 9 | − 19.5 | 6.03 | ||
| putamen | Left | 99 | − 27 | − 13.5 | 1.5 | 6.40 | |
| mPFC | Left | 132 | − 9 | 55.5 | 18 | 6.05 | |
| IPL | Left | 64 | − 48 | − 63 | 22.5 | 5.97 | |
| SFG | Left | 30 | − 18 | 30 | 45 | 5.93 | |
| PCC | Left | 133 | − 6 | − 54 | 12 | 5.91 | |
| precentral | Right | 50 | 55.5 | 0 | 6 | 5.89 | |
| vmPFC | Left | 189 | − 10.5 | 42 | − 10.5 | 5.83 | |
| − RPE | dmPFC | Bilateral | 1712 | 1.5 | 28.5 | 39 | 7.15 |
| aIns | Right | 622 | 36 | 18 | − 1.5 | 6.90 | |
| Left | 326 | − 34.5 | 16.5 | − 6 | 6.52 | ||
| dlPFC | Right | 196 | 25.5 | 48 | 27 | 6.45 | |
| 163 | 39 | 31.5 | 33 | 5.82 | |||
| IPL | Right | 112 | 55.5 | − 42 | 43.5 | 6.24 | |
| 35 | 37.5 | − 42 | 42 | 5.91 | |||
| Left | 89 | − 36 | − 46.5 | 40.5 | 5.95 | ||
| Precuneus | Bilateral | 65 | 7.5 | − 66 | 48 | 6.14 |
Results of the model comparison. Model comparison clearly revealed that the RSAV model has a better model fit than the Rescorla–Wagner and the risk-sensitive model in both groups (mean ± SD). logL: maximum log-Likelihood, AIC: Akaike Information Criterion, px: exceedance probability (probability that the given model fits data better than the other models).
| Model | All subjects | Adolescents | Adults | ||||||
|---|---|---|---|---|---|---|---|---|---|
| logL | AIC | px | logL | AIC | px | logL | AIC | px | |
| Rescorla–Wagner | − 0.98 ± 0.12 | 1.999 ± 0.248 | 0 | − 0.98 ± 0.14 | 1.997 ± 0.271 | 0 | − 0.98 ± 0.11 | 2.001 ± 0.229 | 0 |
| Risk–sensitive | − 0.97 ± 0.13 | 1.985 ± 0.250 | 0 | − 0.97 ± 0.14 | 1.988 ± 0.282 | 0 | − 0.97 ± 0.11 | 1.981 ± 0.219 | 0 |
| RSAV | − 0.66 ± 0.21 | 1.407 ± 0.411 | − 0.67 ± 0.23 | 1.424 ± 0.464 | − 0.65 ± 0.18 | 1.387 ± 0.356 | |||
Fig. 2Learning rate differences between adolescents and adults. The parameters from the RSAV model show an increased learning rate for negative RPEs in chosen stimuli (α−). The other learning rates did not significantly differ. *: p < .05, multiple comparison corrected.
Fig. 3Differences between adolescents and adults in the RPE network. (A) A network containing the dmPFC (upper panel) and the aIns (lower panel) shows increased activation for decreasing RPEs among all subjects. (B) A group comparison between the adolescents and adults reveals a significant activation difference in the right aIns. (C) Subsequent exploratory analysis revealed that this group difference was mainly driven by an increased activation for negative RPEs in adolescents. ***: p < .001.