| Literature DB >> 29540863 |
Poornima Kumar1,2, Franziska Goer3, Laura Murray3, Daniel G Dillon3,4, Miranda L Beltzer3, Andrew L Cohen3, Nancy H Brooks3, Diego A Pizzagalli5,6,7.
Abstract
Anhedonia (hyposensitivity to rewards) and negative bias (hypersensitivity to punishments) are core features of major depressive disorder (MDD), which could stem from abnormal reinforcement learning. Emerging evidence highlights blunted reward learning and reward prediction error (RPE) signaling in the striatum in MDD, although inconsistencies exist. Preclinical studies have clarified that ventral tegmental area (VTA) neurons encode RPE and habenular neurons encode punishment prediction error (PPE), which are then transmitted to the striatum and cortex to guide goal-directed behavior. However, few studies have probed striatal activation, and functional connectivity between VTA-striatum and VTA-habenula during reward and punishment learning respectively, in unmedicated MDD. To fill this gap, we acquired fMRI data from 25 unmedicated MDD and 26 healthy individuals during a monetary instrumental learning task and utilized a computational modeling approach to characterize underlying neural correlates of RPE and PPE. Relative to controls, MDD individuals showed impaired reward learning, blunted RPE signal in the striatum and overall reduced VTA-striatal connectivity to feedback. Critically, striatal RPE signal was increasingly blunted with more major depressive episodes (MDEs). No group differences emerged in PPE signals in the habenula and VTA or in connectivity between these regions. However, PPE signals in the habenula correlated positively with number of MDEs. These results highlight impaired reward learning, disrupted RPE signaling in the striatum (particularly among individuals with more lifetime MDEs) as well as reduced VTA-striatal connectivity in MDD. Collectively, these findings highlight reward-related learning deficits in MDD and their underlying pathophysiology.Entities:
Mesh:
Year: 2018 PMID: 29540863 PMCID: PMC5983542 DOI: 10.1038/s41386-018-0032-x
Source DB: PubMed Journal: Neuropsychopharmacology ISSN: 0893-133X Impact factor: 7.853
Demographics of participants in the final sample. Mean with standard deviations are reported
| Controls | MDD | ||
|---|---|---|---|
| Sample size | 26 | 25 | N/A |
| Age | 26.31 ± 7.96 | 25.25 ± 5.46 | >0.5 |
| Gender | 19f, 7m | 19f, 6m | >0.5 |
| Caucasian | 17 (65.4%) | 16 (64%) | >0.5 |
| Years of education | 15.44 ± 1.80 | 15.80 ± 2.31 | >0.5 |
| BDI | 0.44 ± 0.71 | 26.26 ± 9.21 | <0.001 |
| SHAPS (Anhedonia) | 18.6 ± 4.49 | 33.40 ± 4.22 | <0.001 |
| HDRS | 0.42 ± 0.94 | 17.27 ± 3.99 | <0.001 |
| Number of lifetime MDEs | N/A | 3.72 ± 3.06 | N/A |
| Age of initial onset | N/A | 17.63 ± 5.34 | N/A |
| Length of current episode (in months) | N/A | 11.52 ± 15.41 | N/A |
Note: BDI-II: Beck Depression Inventory-II [36], SHAPS: Snaith Hamilton Pleasure Scale [37]; HDRS: Hamilton Depression Rating Scale (17-items; [35]). Missing data: 2 controls for race, 1 control for BDI and Anhedonia; 3 MDD for length of current episode and HDRS; 1 MDD for age of initial onset; 7 MDD for number of episodes.
Fig. 1Behavioral results: Observed behavioral choices for reward [green—healthy controls; Red—MDD] and punishments [orange—healthy controls; purple—MDD] in controls vs. MDD. The learning curves depict, trial-by-trial, averaged proportion of “correct” stimulus (associated with a probability of 0.8 of winning money) in the gain condition (upper graph), and the “incorrect” stimulus (associated with a probability of 0.8 of losing money) in the loss condition (lower graph) chosen by participants. Error bars represent standard error.
Summary of task performance (choice accuracy as a function of trial type) in the healthy control (N = 26) and MDD (N = 25) groups. Mean with standard deviations are reported.
| Variable | Reward | Punishment | Neutral | |||
|---|---|---|---|---|---|---|
| Controls | MDD | Controls | MDD | Controls | MDD | |
| Choice accuracy (%) | 87.82 ± 12.51 | 80.37 ± 19.18 | 83.11 ± 10.11 | 83.70 ± 8.46 | 49.39 ± 20.68 | 54.20 ± 18.53 |
| Number correct (%) | 71.06 ± 7.75 | 66.73 ± 12.07 | 67.31 ± 6.61 | 68.23 ± 5.01 | 49.84 ± 11.71 | 52.57 ± 10.43 |
Fig. 2ROI results: Parameter estimates reflecting reward and punishment prediction errors extracted from the right (a) and left (b) striatum, habenula (c), right insula (d) and ventral tegmental area (VTA) (e) in healthy controls and MDD. Error bars represent standard error. *p < 0.05
Fig. 3Correlation between number of depressive episodes, and (a) reward prediction error in the right striatum and (b) punishment prediction error in the habenula, in the MDD group. [Note: To evaluate the effect of number of MDEs on PE without potential confounds, these analyses were conducted while adjusting for length of current episode. Unstandardized residuals are shown in the figure; correlation plots with raw scores (with no covariates) are shown in the Supplementary Fig S6]. These correlations were significant even after controlling for both length of current episode and current depression severity (BDI scores) [right striatum: r = −0.60; p = 0.011 and habenula: r = 0.56; p = 0.018]. Information about number of episodes was missing for seven MDD individuals, so the sample size for this correlational analysis was N = 18. PE - prediction error.
Fig. 4VTA-Right Striatum (a), VTA-Left Striatum (b), VTA-Habenula (c), and VTA-Right Insula (d) connectivity values in healthy controls and MDD. Error bars represent standard error. §p < 0.1