| Literature DB >> 30135774 |
Adam Mikus1, Mark Hoogendoorn1, Artur Rocha2, Joao Gama3, Jeroen Ruwaard4, Heleen Riper4.
Abstract
Technology driven interventions provide us with an increasing amount of fine-grained data about the patient. This data includes regular ecological momentary assessments (EMA) but also response times to EMA questions by a user. When observing this data, we see a huge variation between the patterns exhibited by different patients. Some are more stable while others vary a lot over time. This poses a challenging problem for the domain of artificial intelligence and makes on wondering whether it is possible to predict the future mental state of a patient using the data that is available. In the end, these predictions could potentially contribute to interventions that tailor the feedback to the user on a daily basis, for example by warning a user that a fall-back might be expected during the next days, or by applying a strategy to prevent the fall-back from occurring in the first place. In this work, we focus on short term mood prediction by considering the adherence and usage data as an additional predictor. We apply recurrent neural networks to handle the temporal aspects best and try to explore whether individual, group level, or one single predictive model provides the highest predictive performance (measured using the root mean squared error (RMSE)). We use data collected from patients from five countries who used the ICT4Depression/MoodBuster platform in the context of the EU E-COMPARED project. In total, we used the data from 143 patients (with between 9 and 425 days of EMA data) who were diagnosed with a major depressive disorder according to DSM-IV. Results show that we can make predictions of short term mood change quite accurate (ranging between 0.065 and 0.11). The past EMA mood ratings proved to be the most influential while adherence and usage data did not improve prediction accuracy. In general, group level predictions proved to be the most promising, however differences were not significant. Short term mood prediction remains a difficult task, but from this research we can conclude that sophisticated machine learning algorithms/setups can result in accurate performance. For future work, we want to use more data from the mobile phone to improve predictive performance of short term mood.Entities:
Keywords: Depression; Machine learning; Prediction; Short term mood
Year: 2017 PMID: 30135774 PMCID: PMC6096213 DOI: 10.1016/j.invent.2017.10.001
Source DB: PubMed Journal: Internet Interv ISSN: 2214-7829
The EMA measures that are present in the dataset.
| Abbreviation | EMA question |
|---|---|
| Mood | How is your mood right now? |
| Worry | How much do you worry about things at the moment? |
| Self-esteem | How good do you feel about yourself right now? |
| Sleep | How did you sleep tonight? |
| Activities done | To what extent have you carried out enjoyable activities today? |
| Enjoyed activities | How much have you enjoyed the days activities? |
| Social contact | How much have you been involved in social interactions today? |
Description of the pre-processed input features.
| Input feature | Description |
|---|---|
| EMA mood ratings | Averages of the patients daily mood ratings |
| Additional mood ratings | Indicates whether patient rated mood more than two times (more than the protocol requires) per day |
| Nationality | The country of origin of the patient |
| Module completions | Whether a therapeutic module has been completed |
| Treatment state | The current treatment state of the patient: can be active (before finishing the final module), done (after having finished the final module but still able to access the system) and archived (no longer having access to the system) |
| Day of the week | Current day of the week |
| Number of answered questions per day | Number of questions the patient rated a day |
| EMA ratings | Averages of patients daily EMA ratings (except mood) |
| Number of treatment days | Total number of days in the treatment |
| Number of exchanged messages with therapist | The number of messages that have been sent by the therapist |
| Number of exchanged characters with therapist | The number of characters contained in the messages exchanged with the therapist |
| Number of messages | Number of feedback messages generated by the MoodBuster application (motivational messages and reminders) |
| Number of patient messages | Number of messages sent by the patient |
| Number of web sessions | Number of web sessions |
| Number of pages in module | The number of pages viewed in the modules |
| Mood Response time | The response time to an EMA request to rate the mood |
| Module duration | Duration of the current module since the patient started |
Description of statistical attributes used for clustering patients.
| Feature | Description |
|---|---|
| Count | Number days with answers (average mood ratings) |
| Mean | Mean of the daily average mood ratings in the examined period |
| Standard deviation | Standard deviation of the daily average mood ratings in the examined period |
| Maximum | Maximum rated value of the daily average mood ratings in the examined period |
| Minimum | Minimum rated value of the daily average mood ratings in the examined period |
| Median | Median of the daily average mood ratings in the examined period |
| Q1 | First quartile of the daily average mood ratings in the examined period |
| Q3 | Third quartile of the daily average mood ratings in the examined period |
| Rating ratio | Number of days with mood ratings divided by all days in the examined period |
| Maximum rating time Difference | Largest difference in days between two consecutive rated days in the examined period |
Fig. 1Dendrogram of the results of hierarchical clustering (the numbers express the IDs of the patients).
Results of benchmark SVR.
| Set-up | RMSE results (standard deviation) |
|---|---|
| Single | 0.090 (0.00) |
| Clustered layer | 0.077 (0.026) |
| Individual | 0.098 (0.051) |
| Single | 0.100 (0.00) |
| Clustered layer | 0.100 (0.028) |
Results of best predictive models (note: mood has been scaled between 0 and 1, and the RMSE should also be interpreted on that scale).
| Set-up | Best model | RMSE result (standard deviation) |
|---|---|---|
| Single | GRU 2 layer/GRUP | 0.070 (0.00) |
| Clustered | GRUP | 0.066 (0.023) |
| Individual | LSTMP | 0.086 (0.047) |
| Single | LSTM 1 layer/ LSTMP | 0.070 (0.00) |
| Clustered | GRUP | 0.075 (0.026) |
Fig. 2Single predictive model using 1 layer GRU. Prediction on the independent test set.
Fig. 3Individual predictive model using GRU with recurrent projection layer. Prediction on the independent test set.
Fig. 4GRU with recurrent projection layer clustered predictive model. Prediction on the independent test set.
Fig. 5LSTM with recurrent projection layer clustered predictive model. Prediction on the independent test set.
Results single generic model.
| Network | RMSE train score | RMSE test score | SD train scores | SD test scores |
|---|---|---|---|---|
| LSTM 1 layer | 0.091 | 0.073 | 0.001 | 0.001 |
| LSTM 2 layer | 0.090 | 0.074 | 0.002 | 0.003 |
| LSTMP | 0.089 | 0.073 | 0.000 | 0.001 |
| GRU 1 layer | 0.092 | 0.076 | 0.005 | 0.006 |
| GRU 2 layer | 0.090 | 0.070 | 0.00 | 0.00 |
| GRUP | 0.090 | 0.070 | 0.00 | 0.00 |
| LSTM 1 layer | 0.090 | 0.070 | 0.00 | 0.00 |
| LSTM 2 layer | 0.090 | 0.074 | 0.00 | 0.006 |
| LSTMP | 0.090 | 0.070 | 0.00 | 0.00 |
| GRU 1 layer | 0.090 | 0.074 | 0.00 | 0.006 |
| GRU 2 layer | 0.092 | 0.076 | 0.005 | 0.009 |
| GRUP | 0.090 | 0.076 | 0.000 | 0.006 |
Results clustered models.
| Network | RMSE train score | RMSE test score | SD train scores | SD test scores |
|---|---|---|---|---|
| LSTM 1 layer | 0.082 | 0.076 | 0.054 | 0.054 |
| LSTM 2 layer | 0.073 | 0.069 | 0.037 | 0.025 |
| LSTMP | 0.074 | 0.069 | 0.038 | 0.026 |
| GRU 1 layer | 0.076 | 0.070 | 0.031 | 0.021 |
| GRU 2 layer | 0.072 | 0.067 | 0.034 | 0.025 |
| GRUP | 0.073 | 0.066 | 0.035 | 0.023 |
| LSTM 1 layer | 0.085 | 0.079 | 0.027 | 0.023 |
| LSTM 2 layer | 0.081 | 0.079 | 0.030 | 0.030 |
| LSTMP | 0.079 | 0.078 | 0.031 | 0.030 |
| GRU 1 layer | 0.083 | 0.078 | 0.027 | 0.022 |
| GRU 2 layer | 0.079 | 0.079 | 0.027 | 0.024 |
| GRUP | 0.078 | 0.075 | 0.028 | 0.026 |
Results individual models.
| Network | RMSE train score | RMSE test score | SD train scores | SD test scores |
|---|---|---|---|---|
| LSTM 1 layer | 0.107 | 0.113 | 0.100 | 0.107 |
| LSTM 2 layer | 0.085 | 0.088 | 0.044 | 0.050 |
| LSTMP | 0.083 | 0.086 | 0.041 | 0.047 |
| GRU 1 layer | 0.087 | 0.092 | 0.038 | 0.047 |
| GRU 2 layer | 0.080 | 0.090 | 0.038 | 0.047 |
| GRUP | 0.083 | 0.088 | 0.039 | 0.046 |