| Literature DB >> 26728563 |
R C Kessler1, H M van Loo2, K J Wardenaar2, R M Bossarte3, L A Brenner4, T Cai5, D D Ebert1,6, I Hwang1, J Li5, P de Jonge2, A A Nierenberg7, M V Petukhova1, A J Rosellini1, N A Sampson1, R A Schoevers2, M A Wilcox8, A M Zaslavsky1.
Abstract
Heterogeneity of major depressive disorder (MDD) illness course complicates clinical decision-making. Although efforts to use symptom profiles or biomarkers to develop clinically useful prognostic subtypes have had limited success, a recent report showed that machine-learning (ML) models developed from self-reports about incident episode characteristics and comorbidities among respondents with lifetime MDD in the World Health Organization World Mental Health (WMH) Surveys predicted MDD persistence, chronicity and severity with good accuracy. We report results of model validation in an independent prospective national household sample of 1056 respondents with lifetime MDD at baseline. The WMH ML models were applied to these baseline data to generate predicted outcome scores that were compared with observed scores assessed 10-12 years after baseline. ML model prediction accuracy was also compared with that of conventional logistic regression models. Area under the receiver operating characteristic curve based on ML (0.63 for high chronicity and 0.71-0.76 for the other prospective outcomes) was consistently higher than for the logistic models (0.62-0.70) despite the latter models including more predictors. A total of 34.6-38.1% of respondents with subsequent high persistence chronicity and 40.8-55.8% with the severity indicators were in the top 20% of the baseline ML-predicted risk distribution, while only 0.9% of respondents with subsequent hospitalizations and 1.5% with suicide attempts were in the lowest 20% of the ML-predicted risk distribution. These results confirm that clinically useful MDD risk-stratification models can be generated from baseline patient self-reports and that ML methods improve on conventional methods in developing such models.Entities:
Mesh:
Year: 2016 PMID: 26728563 PMCID: PMC4935654 DOI: 10.1038/mp.2015.198
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Distributions and Polychoric/Tetrachoric correlations among the outcomes in the Survey 1–2 panel (N =1,056)
| Distribution
| Correlations with indications of severity
| ||||
|---|---|---|---|---|---|
| Est | (se) | Hospitalized | Suicide attempt | Disability | |
|
|
|
|
| ||
| I. Number of years since Survey 1 with episodes lasting 2+ weeks | |||||
| Any (%) | 37.9 | (1.7) | .49 | .34 | .49 |
| Number (mean) | 2.0 | (0.2) | .46 | .38 | .49 |
| High persistence (90th percentile [9+ years]) (%) | 9.7 | (1.5) | .46 | .47 | .53 |
| II. Number of years since NCS with episodes lasting most days | |||||
| Any (%) | 16.7 | (1.5) | .23 | .22 | .49 |
| Number (mean) | 0.8 | (0.1) | .30 | .30 | .53 |
| High chronicity (90th percentile [4+ years]) (%) | 8.4 | (1.0) | .29 | .29 | .58 |
| III. Severity | |||||
| Hospitalized for MDD since Survey 1 (%) | 5.8 | (1.1) | -- | .84 | .76 |
| Suicide attempt since Survey 1 (%) | 4.5 | (0.8) | .84 | -- | .51 |
| Disabled due to MDD at Survey 2 (%) | 3.2 | (0.6) | .76 | .51 | -- |
Area under the receiver operating characteristic curve (AUC) of Survey 1 risk scores based on machine learning (ML) models and logistic regression models predicting Survey 2 outcomes (N =1,056)
| AUC of risk scores based on
| ||
|---|---|---|
| ML models
| Logistic models
| |
| High persistence | .71 | .68 |
| High chronicity | .63 | .62 |
| Hospitalization | .73 | .65 |
| Disability | .74 | .69 |
| Suicide attempt | .76 | .70 |
Sensitivity, positive predictive value, and likelihood-ratio positive of Survey 1 risk scores based on machine learning (ML) models in the upper and lower 20% and 33% of the risk distribution predicting Survey 2 outcomes (N =1,056)
| High Persistence | High Chronicity | Hospitalization | Disability | Suicide attempt | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Est | (se) | Est | (se) | Est | (se) | Est | (se) | Est | (se) | |
|
|
|
|
|
| ||||||
| Highest 20% | 38.1 | (4.2) | 34.6 | (7.3) | 40.8 | (6.8) | 55.8 | (6.6) | 55.8 | (6.9) |
| Highest 33% | 62.2 | (4.8) | 49.7 | (6.7) | 66.6 | (3.2) | 68.3 | (5.8) | 70.7 | (3.8) |
| Lowest 33% | 10.5 | (3.7) | 16.7 | (5.9) | 11.8 | (4.2) | 9.7 | (2.9) | 10.6 | (2.7) |
| Lowest 20% | 5.6 | (1.8) | 15.9 | (5.8) | 0.9 | (0.9) | 7.4 | (3.0) | 1.5 | (0.7) |
| Highest 20% | 18.3 | (3.5) | 14.4 | (3.2) | 13.1 | (3.0) | 8.8 | (1.8) | 12.5 | (2.7) |
| Highest 33% | 17.5 | (2.8) | 12.2 | (2.3) | 12.5 | (2.6) | 6.3 | (1.5) | 9.3 | (1.7) |
| Lowest 33% | 3.1 | (1.3) | 4.2 | (1.5) | 2.3 | (1.1) | 0.9 | (0.5) | 1.4 | (0.7) |
| Lowest 20% | 2.7 | (1.0) | 6.7 | (2.4) | 0.3 | (0.3) | 1.2 | (0.7) | 0.3 | (0.2) |
| Highest 20% | 2.1 | (0.4) | 1.8 | (0.4) | 2.2 | (0.5) | 2.9 | (0.6) | 3.0 | (0.5) |
| Highest 33% | 2.0 | (0.2) | 1.5 | (0.2) | 2.1 | (0.2) | 2.1 | (0.3) | 2.2 | (0.2) |
| Lowest 33% | 0.3 | (0.1) | 0.5 | (0.2) | 0.3 | (0.1) | 0.3 | (0.1) | 0.3 | (0.2) |
| Lowest 20% | 0.3 | (0.1) | 0.8 | (0.3) | 0.0 | (0.0) | 0.4 | (0.2) | 0.1 | (0.0) |