| Literature DB >> 28824511 |
Yudai Iijima1, Keisuke Takano2, Yannick Boddez2, Filip Raes2, Yoshihiko Tanno3.
Abstract
Learning theories of depression have proposed that depressive cognitions, such as negative thoughts with reference to oneself, can develop through a reinforcement learning mechanism. This negative self-reference is considered to be positively reinforced by rewarding experiences such as genuine support from others after negative self-disclosure, and negatively reinforced by avoidance of potential aversive situations. The learning account additionally predicts that negative self-reference would be maintained by an inability to adjust one's behavior when negative self-reference no longer leads to such reward. To test this prediction, we designed an adapted version of the reversal-learning task. In this task, participants were reinforced to choose and engage in either negative or positive self-reference by probabilistic economic reward and punishment. Although participants were initially trained to choose negative self-reference, the stimulus-reward contingencies were reversed to prompt a shift toward positive self-reference (Study 1) and a further shift toward negative self-reference (Study 2). Model-based computational analyses showed that depressive symptoms were associated with a low learning rate of negative self-reference, indicating a high level of reward expectancy for negative self-reference even after the contingency reversal. Furthermore, the difficulty in updating outcome predictions of negative self-reference was significantly associated with the extent to which one possesses negative self-images. These results suggest that difficulty in adjusting action-outcome estimates for negative self-reference increases the chance to be faced with negative aspects of self, which may result in depressive symptoms.Entities:
Keywords: Q-learning model; depression; reinforcement learning; rumination; self-reference
Year: 2017 PMID: 28824511 PMCID: PMC5539182 DOI: 10.3389/fpsyg.2017.01333
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Mean negative log likelihood and AIC across participants for the single and double update models.
| Negative log likelihood | AIC | N of participants with AIC as Double < Single | ||||
|---|---|---|---|---|---|---|
| Double | Single | Double | Single | (N/total sample size) | ||
| Study 1 | 21.14 | 22.00 | 48.28 | 48.00 | 12/39 | |
| Study 2 | 23.02 | 24.00 | 52.03 | 51.98 | 13/44 | |
Means and SDs of choice frequencies of negative self-reference and learning parameters in the baseline, emotional, and neutral reversal learning (RL) tasks for Studies 1 and 2.
| Study 1 ( | Study 2 ( | |||
|---|---|---|---|---|
| Baseline task—choice frequency | 0.40 | 0.23 | 0.42 | 0.24 |
| Emotional RL task | ||||
| Choice frequency | ||||
| Acquisition | 0.81 | 0.11 | 0.78 | 0.15 |
| First reversal | 0.21 | 0.19 | 0.23 | 0.12 |
| Second reversal | – | – | 0.78 | 0.11 |
| Learning rate—negative ( | 0.60 | 0.28 | 0.70 | 0.26 |
| Learning rate—positive ( | 0.58 | 0.30 | 0.71 | 0.24 |
| Exploitation ( | 12.06 | 11.26 | 8.33 | 8.20 |
| Neutral RL task | ||||
| Choice frequency | ||||
| Acquisition | 0.77 | 0.14 | 0.79 | 0.16 |
| First reversal | 0.21 | 0.14 | 0.23 | 0.15 |
| Second reversal | – | – | 0.79 | 0.14 |
| Learning rate—B ( | 0.69 | 0.28 | 0.74 | 0.23 |
| Learning rate—A ( | 0.52 | 0.30 | 0.66 | 0.30 |
| Exploitation ( | 11.60 | 11.75 | 15.53 | 13.09 |
Multiple regressions predicting the severity of depressive symptoms (log-transformed) by learning parameters of the emotional and neutral reversal learning (RL) tasks (Model 1) and rating scores of self-attributes (Model 2) in Study 1.
| Model 1 | Model 2 | |||||
|---|---|---|---|---|---|---|
| B | 95% CI | B | 95% CI | |||
| Emotional RL task | ||||||
| Learning rate—negative ( | -1.03 | [-1.97, -0.08] | 0.03 | -0.58 | [-1.37, 0.20] | 0.14 |
| Learning rate—positive ( | 0.45 | [-0.39, 1.30] | 0.28 | 0.14 | [-0.55, 0.84] | 0.68 |
| Exploitation ( | -0.01 | [-0.03, 0.01] | 0.43 | 0.00 | [-0.02, 0.02] | 0.91 |
| Neutral RL task | ||||||
| Learning rate—B ( | -0.36 | [-1.21, 0.50] | 0.41 | -0.25 | [-0.95, 0.46] | 0.48 |
| Learning rate—A ( | 0.06 | [-0.79, 0.91] | 0.88 | 0.22 | [-0.47, 0.91] | 0.52 |
| Exploitation ( | 0.01 | [-0.01, 0.03] | 0.27 | 0.00 | [-0.01, 0.02] | 0.78 |
| Applicability ratings | ||||||
| Negative self-attributes | – | – | – | 0.67 | [0.29, 1.05] | <0.01 |
| Positive self-attributes | – | – | – | -0.05 | [-0.43, 0.33] | 0.80 |
| 0.22 | 0.53 | |||||
Multiple regressions predicting the severity of depressive symptoms (log-transformed) by learning parameters of the emotional and neutral reversal learning (RL) tasks (Model 1) and rating scores of self-attributes (Model 2) in Study 2.
| Model 1 | Model 2 | |||||
|---|---|---|---|---|---|---|
| B | 95% CI | B | 95% CI | |||
| Emotional RL task | ||||||
| Learning rate—negative ( | -1.43 | [-2.78, -0.09] | 0.04 | -0.44 | [-1.46, 0.58] | 0.39 |
| Learning rate—positive ( | 0.41 | [-0.97, 1.79] | 0.55 | 0.30 | [-0.70, 1.3] | 0.55 |
| Exploitation ( | 0.00 | [-0.02, 0.03] | 0.74 | 0.01 | [-0.02, 0.03] | 0.62 |
| Neutral RL task | ||||||
| Learning rate—B ( | -0.25 | [-1.15, 0.65] | 0.58 | -0.48 | [-1.13, 0.17] | 0.14 |
| Learning rate—A ( | 0.99 | [-0.36, 2.34] | 0.14 | 0.70 | [-0.27, 1.67] | 0.15 |
| Exploitation ( | -0.01 | [-0.03, 0.02] | 0.63 | 0.00 | [-0.02, 0.01] | 0.55 |
| Ratings | ||||||
| Negative self-attributes | – | – | – | 0.70 | [0.34, 1.06] | <0.01 |
| Positive self-attributes | – | – | – | -0.40 | [-0.75, -0.06] | 0.02 |
| 0.15 | 0.59 | |||||