| Literature DB >> 27025287 |
Joost Asselbergs1, Jeroen Ruwaard, Michal Ejdys, Niels Schrader, Marit Sijbrandij, Heleen Riper.
Abstract
BACKGROUND: Ecological momentary assessment (EMA) is a useful method to tap the dynamics of psychological and behavioral phenomena in real-world contexts. However, the response burden of (self-report) EMA limits its clinical utility.Entities:
Keywords: affect; data mining; ecological momentary assessment; experience sampling; mobile phone sensing
Mesh:
Year: 2016 PMID: 27025287 PMCID: PMC4829730 DOI: 10.2196/jmir.5505
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Overview of study setup.
Mood prediction feature set.
| Raw data and feature | Variables, n | Range | |
|
|
|
| |
|
| Caller top 5 contact frequency, 3-day window, normalized | 5 | 0-1 |
|
| Caller top 5 contact duration, 3-day window, normalized | 5 | 0-1 |
|
|
|
| |
|
| SMS text message top 5 contact frequency, 3-day window, normalized | 5 | 0-1 |
|
|
|
| |
|
| Percentage of high activity | 1 | 0-1 |
|
|
|
| |
|
| Frequency of screen-on events (normalized within participant data) | 1 | –3 to 3a |
|
| Total screen duration events (normalized within participant data) | 1 | –3 to 3a |
|
|
|
| |
|
| Top 5 apps usage frequency, normalized | 5 | 0-1 |
|
| Top 5 apps usage duration, normalized | 5 | 0-1 |
|
| Categorized apps, usage frequency, normalized | 11 | 0-1 |
|
| Categorized apps, usage duration, normalized | 11 | 0-1 |
|
|
|
| |
|
| Number of images taken (normalized within participant data) | 1 | 0-1 |
|
|
|
| |
|
| Mood of yesterday, standardized | 1 | –3 to 3a |
|
| Mood of day before yesterday, standardized | 1 | –3 to 3a |
a Standard normal distribution (ie, 99.7% of values ranging between –3 and 3).
Figure 2Predictive model building algorithm: forward stepwise regression with leave-one-out cross-validation.
Participant demographics, study adherence, and EMA summary statistics (N=27).
| Measurements | Descriptive statistics | ||
|
|
| ||
|
| Sex (female), n (%) | 22 (78) | |
|
| Age (years), mean (SD) | 21.1 (2.2) | |
|
| CES-Da baseline score, mean (SD) | 9.4 (5.8) | |
|
|
| ||
|
| Number of days in study, mean (SD) | 35.5 (3.8) | |
|
| Last day rated, mean (SD) | 40.3 (3.3) | |
|
| Up to 42 days in study, n (%) | 18 (67) | |
|
|
|
| |
|
|
| 1 | 9 (1) |
|
|
| 2 | 25 (3) |
|
|
| 3 | 44 (5) |
|
|
| 4 | 228 (24) |
|
|
| 5 | 653 (68) |
|
|
| ||
|
| One-dimensional mood | 7.0 (0.95) | |
|
| Circumplex: valence | 0.7 (0.63) | |
|
| Circumplex: arousal | –0.1 (1.00) | |
a CES-D: Center for Epidemiologic Studies Depression Scale (clinical cut-off: 16).
b One-dimensional mood: 1-10 scale; circumplex-based mood: –2 to 2 scale.
Figure 3Observed versus predicted daily (one-dimensional) mood mean (range 1-10) for one participant with a personalized model trained on data including the predicted day (top) or excluding the predicted day (bottom) from the training procedure (ie, in-sample vs out-of-sample performance, respectively).
Figure 4Predictive performance (mean squared error and % correct predictions) of the personalized models as observed for the prediction of the one-dimensional EMA mood measure for each participant (N=27) during cross-validated forward selection regression (stepCV).
Predictive performance of personalized models and naive benchmark models.a
| Modelb | One-dimensional mood, mean (95% CI)c | Multidimensional mood (circumplex), mean (95% CI)c | ||||
|
| Correct | MSE | Valence | Arousal | ||
|
|
|
| Correct | MSE | Correct | MSE |
| Step CV | 57% (50%-64%) | 0.67 (0.35-0.98) | 76% (71%-81%) | 0.22 (0.17-0.27) | 54% (47%-61%) | 0.58 (0.40-0.76) |
| Step AIC | 55% (49%-61%) | 0.58 (0.41-0.75) | 76% (71%-81%) | 0.23 0.17-0.29) | 55% (49%-61%) | 0.58 (0.42-0.74) |
| Mean | 62% (56%-68%) | 0.41 (0.30-0.52) | 85% (81%-89%) | 0.15 0.12-0.18) | 63% (57%-69%) | 0.34 (0.27-0.41) |
| History | 64% (58%-70%) | 0.40 (0.29-0.51) | 83% (79%-87%) | 0.15 (0.12-0.18) | 63% (58%-68%) | 0.33 (0.27-0.39) |
a Results shown are those obtained with 42 days of training data for N=27 participants.
b In personalized prediction models, Step CV and Step AIC, multiple regression models were constructed through stepwise forward variable selection based on cross-validated MSE (see Figure 2) and the Akaike information criterion (AIC), respectively. The mean model included the intercept only and the history model included the intercept and mood at T1 and T2.
c The MSE column shows the mean of the (cross-validated) squared prediction residuals, and the correct column shows the percentage of predictions that fell within the tolerated error margin around the observed score (ie, cross-validated residual ≤0.5). All differences between the performance criteria of the personalized model approaches and the benchmark models were significant (Wilcoxon signed rank tests: P<.02).
Relative predictive performance of the personalized models compared to the intercept-only benchmark regression model.a
| Measure | MSE, b (SE) | % Correct, b (SE) | |||
|
| Interceptb | Study day | Interceptb | Study day | |
|
|
|
|
|
| |
|
| stepAIC | 0.82 (0.20) | –0.0126 (0.0100) | –18.9 (4.7) | 0.52 (0.24)c |
|
| stepCV | 0.50 (0.17) | –0.0045 (0.0086) | –14.7 (4.2) | 0.46 (0.21)c |
|
|
|
|
|
| |
|
| stepAIC | 0.94 (0.40) | –0.0261 (0.0201) | –21.2 (3.1) | 0.34 (0.16)c |
|
| stepCV | 0.25 (0.08) | –0.0041 (0.0042) | –12.1 (3.0) | 0.03 (0.15) |
|
|
|
|
|
| |
|
| stepAIC | 0.49 (0.12) | –0.0003 (0.0061) | –14.8 (3.0) | 0.29 (0.15) |
|
| stepCV | 0.47 (0.13) | 0.0002 (0.0064) | –10.2 (3.6) | 0.02 (0.18) |
a Results show the estimated parameters of the linear regression model (ie, mood ~1 + “study day”); MSE: mean squared error; b: regression estimate (unstandardized); SE: standard error of regression estimate.
b All intercept estimates were significant at α=.05.
c These study day estimates were significant.
Figure 5Relative predictive performance of stepAIC and stepCV (in comparison to the intercept-only model) as a function of increasing training data size (one-dimensional mood).