| Literature DB >> 32234702 |
Jonas Busk1, Maria Faurholt-Jepsen2, Mads Frost3, Jakob E Bardram4, Lars Vedel Kessing2,5, Ole Winther1,6,7.
Abstract
BACKGROUND: Bipolar disorder is a prevalent mental health condition that is imposing significant burden on society. Accurate forecasting of symptom scores can be used to improve disease monitoring, enable early intervention, and eventually help prevent costly hospitalizations. Although several studies have examined the use of smartphone data to detect mood, only few studies deal with forecasting mood for one or more days.Entities:
Keywords: Bayesian analysis; bipolar disorder; digital phenotyping; early medical intervention; forecasting; machine learning; mood
Mesh:
Year: 2020 PMID: 32234702 PMCID: PMC7367518 DOI: 10.2196/15028
Source DB: PubMed Journal: JMIR Mhealth Uhealth ISSN: 2291-5222 Impact factor: 4.773
Items of the daily self-assessment questionnaire.
| Attribute | Description | Value |
| Activity | Level of physical activity | −3 to 3 |
| Alcohol | Alcoholic drinks consumed | 0 to 10+ |
| Anxiety | Level of anxiety | 0 to 2 |
| Irritable | Level of irritability | 0 to 2 |
| Cognitive difficulty | Level of cognitive discomfort | 0 to 2 |
| Medicine | Medicine adherence | 0 to 2 |
| Mixed mood | Experienced mixed mood | 0 to 1 |
| Mood | Experienced mood | −3 to 3 |
| Sleep | Hours of sleep | 0 to 24 |
| Stress | Level of stress | 0 to 2 |
Figure 1Forecasting is the task of predicting the future, given all relevant information from the past and the present. The window size, w, is the size of history defining the predictor variables and the horizon, h, is how far in the future the target variable is predicted.
Figure 2A Bayesian network of a hierarchical linear regression model. Individual regression intercept αj and weights βj are drawn from population distributions parameterized by μα, τα and μβ, τβ. This allows the model to account for individual differences while constraining individual parameters to be similar across the population.
Figure 3Distribution of all self-reported mood scores (left) and individual mean mood scores (right). The mood scores are generally close to zero indicating neutral mood with only a few exceptions indicating depressed or elevated mood.
Figure 4The mean of individual correlations of self-assessment items and mood lagged up to 7 days. Nonzero correlations indicate that items have some relation to mood on subsequent days that can be utilized for mood forecasting.
Figure 5Window size selection results. The root mean squared error (RMSE) was evaluated in time-series cross-validation experiments for w=1 through 7 and h=1. The lowest RMSE was achieved by the hierarchical linear model at w=4.
Predictor variables sorted by overall feature importance measured by the mean absolute t-statistic of the individual-level regression parameters in the hierarchical Bayesian linear regression model for w=4 and h=1. Self-reported mood is the most important variable for predicting mood on the following day.
| Predictor | | | |||
|
|
|
|
|
|
| Mood | 4.53 (3.35) | 2.34 (0.55) | 0.47 (0.28) | 2.78 (0.18) |
| Anxiety | 2.78 (0.05) | 0.71 (0.02) | 1.29 (0.01) | 0.76 (0.00) |
| Irritable | 2.74 (0.11) | 1.22 (0.01) | 0.95 (0.01) | 1.30 (0.00) |
| Mixed mood | 2.09 (0.06) | 2.51 (0.02) | 1.96 (0.01) | 0.52 (0.01) |
| Medicine changed | 0.36 (0.10) | 0.08 (0.01) | 2.15 (0.01) | 0.64 (0.00) |
| Sleep positive | 1.65 (0.01) | 0.72 (0.00) | 0.37 (0.00) | 0.16 (0.00) |
| Cognitive difficulty | 1.48 (0.09) | 0.58 (0.02) | 0.19 (0.00) | 1.57 (0.00) |
| Alcohol | 0.67 (0.02) | 0.77 (0.01) | 1.56 (0.01) | 0.87 (0.00) |
| Medicine omitted | 0.13 (0.01) | 1.31 (0.00) | 0.60 (0.00) | 0.14 (0.00) |
| Stress | 1.22 (0.12) | 0.91 (0.02) | 0.71 (0.01) | 0.28 (0.01) |
| Activity | 1.04 (0.03) | 1.14 (0.02) | 0.49 (0.01) | 1.14 (0.01) |
| Sleep negative | 0.41 (0.01) | 0.52 (0.00) | 0.48 (0.00) | 0.52 (0.00) |
Results of the leave-all-out time-series cross-validation (left) and leave-one-out time-series cross-validation (right) experiments. The hierarchical Bayesian linear regression model achieves the best results. The pooled models are better than the separate models, overall.
| Model | Leave-all-out | Leave-one-out | ||
|
|
| RMSEb |
| RMSEb |
| Last observed | 0.342 | 0.376 | 0.151 | 0.385 |
| Pooled mean | −0.007 | 0.465 | −0.009 | 0.419 |
| Pooled ridge | 0.450 | 0.344 | 0.340 | 0.339 |
| Pooled XGBoost | 0.455 | 0.342 | 0.343 | 0.338 |
| Separate mean | 0.213 | 0.412 | −0.443 | 0.502 |
| Separate ridge | 0.345 | 0.375 | −0.471 | 0.506 |
| Separate XGBoost | 0.302 | 0.388 | −0.682 | 0.541 |
| Hierarchical Bayesian linear | 0.511 | 0.324 | 0.347 | 0.337 |
| Hierarchical Bayesian ordinal | 0.495 | 0.330 | 0.343 | 0.339 |
aCoefficient of determination (R): higher is better.
bRoot mean squared error (RMSE): lower is better.
Figure 6Results of forecasting mood for up to seven days. The root mean squared error (RMSE) was evaluated in time-series cross-validation experiments for w=4 and h=1 through 7. As expected, the RMSE increases when forecasting further ahead. The proposed hierarchical models achieved consistently lower RMSEs than the baseline models.
Figure 7Examples of 7-day mood forecasts produced by the hierarchical linear regression model. The forecasted mood values are shown with 95% CI uncertainties and compared with observed data. The forecast to the left is rather accurate despite variation in the data, whereas the forecast to the right fails to anticipate future mood changes.