| Literature DB >> 24291614 |
Florian Schlagenhauf1, Quentin J M Huys2, Lorenz Deserno3, Michael A Rapp4, Anne Beck5, Hans-Joachim Heinze6, Ray Dolan7, Andreas Heinz8.
Abstract
Subjects with schizophrenia are impaired at reinforcement-driven reversal learning from as early as their first episode. The neurobiological basis of this deficit is unknown. We obtained behavioral and fMRI data in 24 unmedicated, primarily first episode, schizophrenia patients and 24 age-, IQ- and gender-matched healthy controls during a reversal learning task. We supplemented our fMRI analysis, focusing on learning from prediction errors, with detailed computational modeling to probe task solving strategy including an ability to deploy an internal goal directed model of the task. Patients displayed reduced functional activation in the ventral striatum (VS) elicited by prediction errors. However, modeling task performance revealed that a subgroup did not adjust their behavior according to an accurate internal model of the task structure, and these were also the more severely psychotic patients. In patients who could adapt their behavior, as well as in controls, task solving was best described by cognitive strategies according to a Hidden Markov Model. When we compared patients and controls who acted according to this strategy, patients still displayed a significant reduction in VS activation elicited by informative errors that precede salient changes of behavior (reversals). Thus, our study shows that VS dysfunction in schizophrenia patients during reward-related reversal learning remains a core deficit even when controlling for task solving strategies. This result highlights VS dysfunction is tightly linked to a reward-related reversal learning deficit in early, unmedicated schizophrenia patients.Entities:
Keywords: Computational modeling; Imaging; Reversal learning; Reward; Schizophrenia; Ventral striatum
Mesh:
Year: 2013 PMID: 24291614 PMCID: PMC3991847 DOI: 10.1016/j.neuroimage.2013.11.034
Source DB: PubMed Journal: Neuroimage ISSN: 1053-8119 Impact factor: 6.556
Group description.
| Schizophrenia patients | Healthy controls | Sig. | |
|---|---|---|---|
| Age (years) | 27.5 ± 5.2 (21–40) | 27.2 ± 4.9 (20–41) | n.s. |
| Gender | 22 males, 2 females | 22 males, 2 females | |
| Edinburgh handedness inventory | 84.6 ± 27.2 (n = 23) | 90.7 ± 17.6 | n.s. |
| Verbal IQ (WST) | 97.7 ± 12.6 | 103.6 ± 8.6 (n = 23) | n.s. |
| Age of onset (years) | 25.4 ± 5.8 | ||
| Duration of illness (years) | 2.4 ± 2.2 | ||
| Number of episodes | 1.7 ± 1.1 | ||
| PANSS total | 85.6 ± 16.2 | ||
| PANSS positive | 22.2 ± 5.8 | ||
| PANSS negative | 21.6 ± 5.9 | ||
| PANSS general | 41.8 ± 10.0 |
Fig. 1Structure of the reversal learning task. Subjects chose one of two abstract stimuli as quickly as possible by pressing the left or right button (maximum decision time: 2 s). Then, a blue box surrounding their chosen target and feedback (either a green smiley face for positive feedback or a red frowny face for negative feedback) was simultaneously shown for 1 s. “Correct” here indicates choice of the more often rewarded (80%) and “incorrect” of the less rewarded stimulus (20%) during that block. A misleading trial is a trial on which a probabilistic punishment was received although the correct (e.g. more rewarded) stimulus was chosen (shown), or one on which a probabilistic reward was obtained despite the incorrect choice (not shown). Note that subjects did not know whether feedbacks were truly informative or not, as they did not know the underlying state of the task. Participants were only able to label feedback as informative or misleading based on their own beliefs about the state of the task, i.e. about which response is more rewarded.
Fig. 2Prediction error signal in the ventral striatum. A: Prediction error signal in 24 healthy controls. B: Stronger PE signal in healthy controls (n = 24) compared to unmedicated schizophrenia patients (n = 22).
Fig. 3Model comparison. A: Model Bayesian Information Criterion (∆BICint) scores (compared to the best model). The best model has the lowest score (∆BICint = 0). The red line shows the random effects threshold. B: Model fit to individual participants (black dots). Red crosses indicate participants not fitted better than chance. Red dashed lines show group means for participants fitted better than chance. C: Average learning curves after reversals for participants fitted worse than chance (red), and for HC and SZ fitted better than chance (blue and green, respectively). Dashed lines show action choices generated from the model: after fitting the parameters to each subject's data, the model was run on the same task and surrogate choices were generated. D: Model ∆BICint scores for poor-fit schizophrenia patients (fitted worse than chance). Asterisks indicate the best fitting model. For further details see Supplementary Results. Abbreviations: SA = stimulus-action; DSA = Double stimulus-action, double update model; HMM = Hidden Markov Model; R/P = models with separate reward and punishment effects.
Fig. 4Group differences for HMM parameters. Comparison of model parameters between healthy controls (blue) and schizophrenia patients (green) for stay probability parameter γ (A) and reward sensitivity c (B). Stay probability was significantly different between HC and all SZ (n = 22), and remained so when excluding participants fitted worse than chance. Reward sensitivity, on the other hand, did not differ between HC and SZ who were well fitted, but did indeed differ if the poorly fitted subjects were included.
Fig. 5Group difference for the contrast ‘informative punishment – informative reward’. A + B: Healthy controls compared to schizophrenia patients (combining patients with good and bad model fit) displayed stronger activation in the bilateral ventral striatum (VS) and right ventrolateral prefrontal cortex (vlPFC) for the contrast ‘informative punishment – informative reward’ derived from the HMM model. C: Plots of parameter estimates revealed that patients with good and bad model fit showed reduced VS activation compared to healthy controls (upper and middle panel), while only patients with bad model fit showed reduced activation in the right vlPFC (lower panel) (for post-hoc t-tests — see results section).