| Literature DB >> 32555144 |
Jonas Busk1,2, Maria Faurholt-Jepsen3, Mads Frost4, Jakob E Bardram5, Lars Vedel Kessing3,6, Ole Winther7,8,9.
Abstract
Currently, the golden standard for assessing the severity of depressive and manic symptoms in patients with bipolar disorder (BD) is clinical evaluations using validated rating scales such as the Hamilton Depression Rating Scale 17-items (HDRS) and the Young Mania Rating Scale (YMRS). Frequent automatic estimation of symptom severity could potentially help support monitoring of illness activity and allow for early treatment intervention between outpatient visits. The present study aimed (1) to assess the feasibility of producing daily estimates of clinical rating scores based on smartphone-based self-assessments of symptoms collected from a group of patients with BD; (2) to demonstrate how these estimates can be utilized to compute individual daily risk of relapse scores. Based on a total of 280 clinical ratings collected from 84 patients with BD along with daily smartphone-based self-assessments, we applied a hierarchical Bayesian modelling approach capable of providing individual estimates while learning characteristics of the patient population. The proposed method was compared to common baseline methods. The model concerning depression severity achieved a mean predicted R2 of 0.57 (SD = 0.10) and RMSE of 3.85 (SD = 0.47) on the HDRS, while the model concerning mania severity achieved a mean predicted R2 of 0.16 (SD = 0.25) and RMSE of 3.68 (SD = 0.54) on the YMRS. In both cases, smartphone-based self-reported mood was the most important predictor variable. The present study shows that daily smartphone-based self-assessments can be utilized to automatically estimate clinical ratings of severity of depression and mania in patients with BD and assist in identifying individuals with high risk of relapse.Entities:
Mesh:
Year: 2020 PMID: 32555144 PMCID: PMC7303106 DOI: 10.1038/s41398-020-00867-6
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
Fig. 1Distributions of clinical ratings of symptom severity of depression (HDRS) and mania (YMRS) and smartphone-based self-reported mood scores.
A negative mood score is expected to indicate a high HDRS score and a positive mood score is expected to indicate a high YMRS score. The HDRS and YMRS scores are rarely high at the same time (indicating mixed mood). Thus, data is expected to primarily occupy the white background areas of the scatter plots.
Weight table showing the population-level parameters in the HDRS total model (top) and the YMRS total model (bottom).
| HDRS | |||||||
|---|---|---|---|---|---|---|---|
| Predictor | Mean (SD) | 95% CIc | | | Mean ( | 95% CIc | ||
| Intercept | 6.43 (0.66) | 5.13 | 7.73 | 9.67 | 4.10 (0.50) | 3.23 | 5.19 |
| Mood negative | −9.11 (1.40) | −11.94 | −6.43 | 6.51 | 0.56 (0.40) | 0.02 | 1.50 |
| Sleep negative | −6.48 (1.66) | −9.72 | −3.19 | 3.89 | 0.42 (0.31) | 0.02 | 1.16 |
| Mixed Mood | 2.11 (0.67) | 0.79 | 3.42 | 3.15 | 0.44 (0.32) | 0.02 | 1.20 |
| Anxiety | 2.26 (0.86) | 0.58 | 3.96 | 2.63 | 0.38 (0.28) | 0.02 | 1.06 |
| Medicine changed | −1.81 (0.71) | −3.19 | −0.40 | 2.55 | 0.35 (0.27) | 0.01 | 0.99 |
| Cognitive difficulty | 1.09 (0.73) | −0.35 | 2.48 | 1.50 | 0.43 (0.32) | 0.02 | 1.19 |
| Mood positive | −2.80 (1.94) | −6.59 | 0.94 | 1.44 | 0.42 (0.32) | 0.02 | 1.19 |
| Sleep positive | 2.83 (2.05) | −1.09 | 6.90 | 1.38 | 0.41 (0.31) | 0.02 | 1.15 |
| Activity | 0.53 (0.61) | −0.66 | 1.71 | 0.88 | 0.50 (0.35) | 0.02 | 1.29 |
| Stress | 0.56 (0.73) | −0.86 | 1.99 | 0.76 | 0.50 (0.36) | 0.02 | 1.32 |
| Alcohol | 0.59 (1.01) | −1.39 | 2.54 | 0.59 | 0.41 (0.31) | 0.02 | 1.15 |
| Medicine omitted | 0.52 (0.97) | −1.38 | 2.42 | 0.54 | 0.37 (0.28) | 0.01 | 1.04 |
| Irritable | 0.05 (0.74) | −1.41 | 1.49 | 0.06 | 0.59 (0.42) | 0.02 | 1.57 |
The population-level regression weight means, μ, are summarized in the leftmost columns and sorted by variable importance computed as the absolute t-statistic of the mean parameter. The corresponding variances, τ, are summarized in the columns to the right and can be interpreted as the amount of pooling of the given variable in the hierarchical model.
aPopulation-level regression weight means.
bPopulation-level variance can be interpreted as the amount of pooling of the given variable in the hierarchical model.
cCredible interval.
dAbsolute t-statistic of the mean parameter indicating variable importance.
Results of K = 100 cross-validation experiments with the HDRS total score (left columns) and the YMRS total score (right columns) models based on all, mandatory and mood self-assessment items, respectively.
| HDRS total score | YMRS total score | |||
|---|---|---|---|---|
| Model | RMSE (SD) ↓b | RMSE ( | ||
| Pooled naïve mean | −0.02 (0.03) | 5.99 (0.37) | −0.04 (0.05) | 4.18 (0.70) |
| Pooled Ridge | 0.37 (0.10) | 4.68 (0.48) | 0.02 (0.15) | 4.03 (0.60) |
| Pooled XGBoost | 0.44 (0.10) | 4.40 (0.41) | −0.04 (0.21) | 4.11 (0.53) |
| Pooled Bayesian | 0.36 (0.12) | 4.72 (0.51) | 0.00 (0.21) | 4.04 (0.56) |
| Separate naïve mean | 0.47 (0.11) | 4.29 (0.47) | −0.00 (0.33) | 4.00 (0.53) |
| Separate Ridge | 0.47 (0.12) | 4.30 (0.49) | 0.04 (0.30) | 3.92 (0.54) |
| Separate XGBoost | 0.27 (0.15) | 5.03 (0.49) | −0.38 (0.50) | 4.64 (0.45) |
| Hierarchical Bayesian | ||||
| Pooled naïve mean | −0.02 (0.03) | 5.94 (0.37) | −0.04 (0.06) | 4.25 (0.71) |
| Pooled Ridge | 0.21 (0.07) | 5.24 (0.34) | 0.01 (0.09) | 4.12 (0.65) |
| Pooled XGBoost | 0.37 (0.12) | 4.63 (0.39) | −0.06 (0.18) | 4.23 (0.57) |
| Pooled Bayesian | 0.21 (0.10) | 5.22 (0.37) | 0.03 (0.13) | 4.08 (0.61) |
| Separate naïve mean | 0.46 (0.16) | 4.28 (0.57) | −0.01 (0.30) | 4.08 (0.54) |
| Separate Ridge | 0.46 (0.16) | 4.29 (0.57) | 0.00 (0.29) | 4.06 (0.54) |
| Separate XGBoost | 0.25 (0.18) | 5.06 (0.54) | −0.34 (0.39) | 4.68 (0.42) |
| Hierarchical Bayesian | ||||
| Pooled naïve mean | −0.02 (0.02) | 5.91 (0.41) | −0.05 (0.05) | 4.20 (0.77) |
| Pooled Ridge | 0.21 (0.06) | 5.19 (0.35) | 0.02 (0.07) | 4.05 (0.70) |
| Pooled XGBoost | 0.34 (0.11) | 4.75 (0.35) | 0.01 (0.18) | 4.03 (0.54) |
| Pooled Bayesian | 0.20 (0.12) | 5.23 (0.45) | 0.04 (0.12) | 4.00 (0.63) |
| Separate naïve mean | 0.44 (0.15) | 4.31 (0.47) | 0.02 (0.27) | 3.98 (0.59) |
| Separate Ridge | 0.45 (0.15) | 4.29 (0.48) | 0.03 (0.27) | 3.96 (0.59) |
| Separate XGBoost | 0.42 (0.15) | 4.42 (0.42) | −0.04 (0.34) | 4.05 (0.51) |
| Hierarchical Bayesian | ||||
The hierarchical Bayesian model achieved the best overall performance in every case and could predict the clinical severity ratings within 4 points of RMSE on the original rating scales. The best HDRS total result was achieved using all self-assessment items while the best YMRS total result was achieved using only the mood self-assessment item.
Bold values indicates the best results within each set of self-assessment items.
aCoefficient of determination. Higher is better.
bRoot Mean Square Error. Lower is better.
Results of K = 100 cross-validation experiments with the HDRS item 1 score (left columns) and YMRS item 1 score (right columns) models based on all, mandatory and mood self-assessment items, respectively.
| HDRS item 1 score | YMRS item 1 score | |||
|---|---|---|---|---|
| Model | RMSE (SD) ↓b | RMSE (SD) ↓b | ||
| Pooled naïve mean | −0.03 (0.04) | 0.95 (0.07) | −0.05 (0.07) | 0.61 (0.10) |
| Pooled Ridge | 0.41 (0.08) | 0.71 (0.06) | −0.09 (0.13) | 0.62 (0.09) |
| Pooled XGBoost | −0.17 (0.20) | 0.64 (0.09) | ||
| Pooled Bayesian | 0.38 (0.14) | 0.73 (0.10) | −0.16 (0.20) | 0.63 (0.09) |
| Separate naïve mean | 0.35 (0.15) | 0.75 (0.08) | −0.27 (0.33) | 0.66 (0.08) |
| Separate Ridge | 0.37 (0.15) | 0.73 (0.07) | −0.23 (0.30) | 0.65 (0.08) |
| Separate XGBoost | 0.18 (0.17) | 0.84 (0.07) | −0.35 (0.34) | 0.68 (0.06) |
| Hierarchical Bayesian | 0.40 (0.12) | 0.72 (0.06) | −0.07 (0.24) | 0.61 (0.08) |
| Pooled naïve mean | −0.03 (0.04) | 0.93 (0.06) | −0.04 (0.06) | 0.60 (0.08) |
| Pooled Ridge | 0.32 (0.07) | 0.75 (0.05) | 0.01 (0.10) | 0.59 (0.08) |
| Pooled XGBoost | 0.39 (0.13) | 0.71 (0.07) | −0.17 (0.22) | 0.63 (0.08) |
| Pooled Bayesian | 0.33 (0.13) | 0.75 (0.09) | −0.03 (0.17) | 0.59 (0.08) |
| Separate naïve mean | 0.35 (0.13) | 0.73 (0.08) | −0.25 (0.27) | 0.65 (0.08) |
| Separate Ridge | 0.37 (0.13) | 0.72 (0.08) | −0.22 (0.25) | 0.64 (0.08) |
| Separate XGBoost | 0.14 (0.14) | 0.84 (0.07) | −0.36 (0.36) | 0.67 (0.06) |
| Hierarchical Bayesian | 0.00 (0.22) | 0.58 (0.08) | ||
| Pooled naïve mean | −0.03 (0.04) | 0.94 (0.07) | −0.07 (0.15) | 0.61 (0.09) |
| Pooled Ridge | 0.34 (0.07) | 0.75 (0.05) | 0.01 (0.16) | 0.58 (0.09) |
| Pooled XGBoost | 0.40 (0.12) | 0.71 (0.07) | −0.04 (0.27) | 0.59 (0.09) |
| Pooled Bayesian | 0.33 (0.12) | 0.76 (0.09) | 0.02 (0.21) | 0.58 (0.08) |
| Separate naïve mean | 0.34 (0.12) | 0.75 (0.07) | −0.36 (0.62) | 0.66 (0.08) |
| Separate Ridge | 0.36 (0.12) | 0.74 (0.07) | −0.35 (0.61) | 0.66 (0.08) |
| Separate XGBoost | 0.37 (0.13) | 0.73 (0.07) | −0.21 (0.51) | 0.63 (0.08) |
| Hierarchical Bayesian | −0.08 (0.45) | 0.59 (0.08) | ||
The best HDRS item 1 result was achieved using the XGBoost model with all self-assessment items while YMRS item 1 could not be estimated significantly better than the naïve baseline models.
Bold values indicates the best results within each set of self-assessment items.
aCoefficient of determination. Higher is better.
bRoot Mean Square Error. Lower is better.
Fig. 2Results of predicting relapse risk scores evaluated as a binary classification problem and presented in receiver operating characteristic (ROC) curves.
In both the HDRS case (left) and the YMRS case (right), the hierarchical Bayesian regression model outperforms naïve pooled and separate mean models.