| Literature DB >> 35420993 |
Soumya Choudhary1, Nikita Thomas2, Janine Ellenberger1, Girish Srinivasan1, Roy Cohen1.
Abstract
BACKGROUND: Depression is a major global cause of morbidity, an economic burden, and the greatest health challenge leading to chronic disability. Mobile monitoring of mental conditions has long been a sought-after metric to overcome the problems associated with the screening, diagnosis, and monitoring of depression and its heterogeneous presentation. The widespread availability of smartphones has made it possible to use their data to generate digital behavioral models that can be used for both clinical and remote screening and monitoring purposes. This study is novel as it adds to the field by conducting a trial using private and nonintrusive sensors that can help detect and monitor depression in a continuous, passive manner.Entities:
Keywords: depression; digital mental health; digital phenotyping; mobile phone
Year: 2022 PMID: 35420993 PMCID: PMC9152726 DOI: 10.2196/37736
Source DB: PubMed Journal: JMIR Form Res ISSN: 2561-326X
Figure 1The Behavidence app screen showing the daily Mental Health Similarity Score.
Demographic distribution showing the numbers for age, gender, and language of the answered Patient Health Questionnaire-9 (N=558).
| Variable | Value, n (%) | ||
|
| |||
|
| 18-25 | 474 (84.9) | |
|
| 26-35 | 29 (5.2) | |
|
| 36-55 | 42 (7.5) | |
|
| 56-64 | 10 (1.8) | |
|
| >64 | 3 (0.5) | |
|
| |||
|
| Male | 254 (45.5) | |
|
| Female | 286 (51.3) | |
|
| Prefer not to say | 18 (3.2) | |
|
| |||
|
| Korean | 487 (87.3) | |
|
| English | 71 (12.7) | |
Distribution of the participants’ PHQ-9a scores (N=558).
| PHQ-9 score category and score | Participants, n (%) | ||
|
|
| ||
|
| 0 | 20 (3.6) | |
|
| 1 | 6 (1.1) | |
|
| 2 | 12 (2.2) | |
|
| 3 | 7 (1.3) | |
|
| 4 | 18 (3.2) | |
|
|
| ||
|
| 5 | 13 (2.3) | |
|
| 6 | 33 (5.9) | |
|
| 7 | 23 (4.1) | |
|
| 8 | 37 (6.6) | |
|
| 9 | 18 (3.2) | |
|
|
| ||
|
| 10 | 23 (4.1) | |
|
| 11 | 28 (5) | |
|
| 12 | 29 (5.2) | |
|
| 13 | 43 (7.7) | |
|
| 14 | 39 (7) | |
|
|
| ||
|
| 15 | 29 (5.2) | |
|
| 16 | 31 (5.6) | |
|
| 17 | 26 (4.7) | |
|
| 18 | 31 (5.6) | |
|
| 19 | 17 (3) | |
|
|
| ||
|
| 20 | 16 (2.9) | |
|
| 21 | 16 (2.9) | |
|
| 22 | 13 (2.3) | |
|
| 23 | 6 (1.1) | |
|
| 24 | 10 (1.8) | |
|
| 25 | 5 (0.9) | |
|
| 26 | 1 (0.2) | |
|
| 27 | 8 (1.4) | |
aPHQ-9: Patient Health Questionnaire-9.
Distribution of gender among the PHQ-9a scoring categories (N=558).
| PHQ-9 category | Male, n (%) | Female, n (%) | Other or prefer not to answer, n (%) |
| None | 41 (65.6) | 21 (32.8) | 1 (1.6) |
| Mild | 69 (56) | 52 (41.6) | 3 (2.4) |
| Moderate | 73 (45.1) | 82 (50.6) | 7 (4.3) |
| Moderately severe | 48 (35.8) | 82 (61.3) | 4 (2.9) |
| Severe | 24 (32.1) | 47 (62.8) | 4 (5.1) |
aPHQ-9: Patient Health Questionnaire-9.
Distribution of individuals who self-reported “no previous diagnosis” among the PHQ-9a scoring categories (N=499).
| PHQ-9 category | Participants, n (%) |
| None | 63 (12.6) |
| Mild | 113 (22.6) |
| Moderate | 145 (29.2) |
| Moderately severe | 119 (23.8) |
| Severe | 59 (11.8) |
aPHQ-9: Patient Health Questionnaire-9.
The t test (1-tailed) results of the none versus severe cohorts with P values and Cohen d statistic.
| Feature | Cohort, mean (SD) | Cohen | |||||||||
|
| None | Severe |
|
| |||||||
|
| |||||||||||
|
| Mean session time | 1.1 (0.5) | 2.5 (4.8) | <.001 | 0.4257 | ||||||
|
| Total session | 300.0 (145.0) | 416.7 (233.3) | <.001 | 0.5811 | ||||||
|
| Number of opens | 305.7 (137.9) | 240.8 (169.6) | <.001 | 0.4248 | ||||||
|
| Sleep | 266.7 (183.3) | 300.0 (216.7) | <.001 | 0.1947 | ||||||
|
| Average gap | 3.2 (3.5) | 4.3 (6.2) | <.001 | 0.2495 | ||||||
|
| |||||||||||
|
| Average activity | 28.5 (67.5) | 57.0 (101.0) | <.001 | 0.2053 | ||||||
|
| Average gap activity | 7.6 (13.9) | 23.8 (47.5) | <.001 | 0.3191 | ||||||
|
| Total activity | 1181.6 (436.7) | 1165.0 (446.7) | .68 | 0.0859 | ||||||
Accuracy metrics of the 3 classification algorithms tested in this study: random forest, extreme gradient boosting (XGBoost), and a support vector machine with radial basis function kernel.
| Metric and cohort | Random forest model (%) | XGBoost model (%) | SVMa model (%) | |
| Accuracy | 87 | 86 | 44 | |
|
| ||||
|
| None | 89 | 82 | 44 |
|
| Depression | 85 | 90 | 0 |
|
| ||||
|
| None | 85 | 90 | 100 |
|
| Depression | 89 | 81 | 0 |
|
| ||||
|
| None | 87 | 86 | 61 |
|
| Depression | 87 | 85 | 0 |
aSVM: support vector machine.
Accuracy metrics of all models trained in this study.
| Metric and cohort | PHQ-9a binary nonsensor model (%) | PHQ-9 binary gyroscope (sensor) model (%) | PHQ-9 three-class model (%) | PHQ-9 questions model (%) | |
| Accuracy | 87 | 76 | 78 | 78 | |
|
| |||||
|
| None | 89 | 78 | 75 | 80 |
|
| Moderate | N/Ab | N/A | 86 | N/A |
|
| Severe | 85 | 74 | 74 | 76 |
|
| |||||
|
| None | 85 | 67 | 76 | 75 |
|
| Moderate | N/A | N/A | 83 | N/A |
|
| Severe | 89 | 83 | 76 | 81 |
|
| |||||
|
| None | 87 | 72 | 75 | 78 |
|
| Moderate | N/A | N/A | 84 | N/A |
|
| Severe | 87 | 78 | 75 | 89 |
aPHQ-9: Patient Health Questionnaire-9.
bN/A: not applicable.
Figure 2Feature importance plot of the Patient Health Questionnaire-9 binary nonsensor model achieving 87% accuracy. The x-axis represents the feature importance metric, Gini Impurity, which can range from 0.0 to 0.5. The y-axis represents the list of features ordered from greatest to least importance.
Mean values of each cohort and P values from 1-tailed t tests of the top 5 important features in the Patient Health Questionnaire-9 binary nonsensor model.
| Feature | Cohort, mean (SD) | ||
|
| None | Severe |
|
| Mean session time: average session length that a user interacts with their mobile device within a 24-hour period (minutes) | 1.07 (3.58) | 2.49 (4.97) | <.001 |
| App 1: average time a user spent on apps that fall into app category 1—social interaction apps—within a 24-hour period (minutes) | 1.41 (4.40) | 3.58 (1.16) | <.001 |
| App 11: average time a user spent on apps that fall into app category 11—miscellaneous and additional passive recreational apps—within a 24-hour period (minutes) | 1.56 (1.94) | 3.37 (5.11) | <.001 |
| App 3 opens: number of times a user opened apps that fall into app category 3—active messaging and communication apps—within a 24-hour period (counts) | 110.49 (70.10) | 74.45 (71.05) | <.001 |
| App 0: average time a user spent on apps that fall into app category 0—nonofficial or unregulated apps—within a 24-hour period (minutes) | 0.34 (0.60) | 0.83 (2.34) | <.001 |
The PHQ-9a binary nonsensor-based model validation results showing the majority of days with high MHSSb across all days of data collected.
| PHQ-9 severity | Participants, n (%) | Majority of days of data with MHSSs >50% (%)c |
| None | 38 (18) | 15.84 |
| Moderate | 116 (55.5) | 75.02 |
| Severe | 55 (26) | 95.82 |
aPHQ-9: Patient Health Questionnaire-9.
bMHSS: Mental Health Similarity Score.
cEach participant has a different total number of days of data collected. Hence, each PHQ-9 group has a different total number of days. Therefore, the majority of days mentioned is the total percentage of days that group participants had MHSS greater that 50%.
Correlation analysis within the severe cohort between baseline per-item scores and Mental Health Similarity Scores on the day of baseline assessment.
| PHQ-9a item | Pearson correlation | Spearman correlation |
| Question 1: “Little interest or pleasure in doing things?” | −0.078 | −0.075 |
| Question 2: “Feeling down, depressed, or hopeless?” | 0.596 | 0.607 |
| Question 3: “Trouble falling or staying asleep, or sleeping too much?” | 0.004 | 0.0 |
| Question 4: “Feeling tired or having little energy?” | −0.101 | −0.045 |
| Question 5: “Poor appetite or overeating?” | −0.017 | 0.059 |
| Question 6: “Feeling bad about yourself—or that you are a failure or have let yourself or your family down?” | 0.492 | 0.543 |
| Question 7: “Trouble concentrating on things, such as reading the newspaper or watching television?” | −0.214 | −0.213 |
| Question 8: “Moving or speaking so slowly that other people could have noticed? Or the opposite—being so fidgety or restless that you have been moving around a lot more than usual?” | 0.064 | 0.093 |
| Question 9: “Thoughts that you would be better off dead, or of hurting yourself in some way?” | 0.479 | 0.447 |
| Question 1+Question 2+Question 6+Question 9 | 0.655 | 0.580 |
| Question 2+Question 6+Question 9 | 0.727 | 0.698 |
aPHQ-9: Patient Health Questionnaire-9.
Figure 3Feature importance plot of the random forest model for Patient Health Questionnaire-9 questions 2, 6, and 9. The x-axis represents the feature importance metric, Gini Impurity, which can range from 0.0 to 0.5. The y-axis represents the list of features ordered from greatest to least importance.
Mean values of each cohort and P values from 1-tailed t tests of the top 5 important features in the random forest model for Patient Health Questionnaire-9 questions 2, 6, and 9.
| Feature | Cohort, mean (SD) | |||
|
| None | Severe |
| |
| App 3 opens: number of times a user opened apps that fall into app category 3—active messaging and communication apps—within a 24-hour period (counts) | 110.49 (73.47) | 74.45 (77.25) | <.001 | |
| App 3 upper: number of times a user opened apps that fall into app category 3—active messaging and communication apps—and had session times greater than the average session time of that app category within a 24-hour period (counts) | 6.47 (5.77) | 3.25 (3.98) | <.001 | |
| App 2 opens: number of times a user opened apps that fall into app category 2—passive information and consumption apps—within a 24-hour period (counts) | 2.13 (5.24) | 0.46 (1.2) | <.001 | |
| App 6: average time a user spent on apps that fall into app category 6—general utilities apps—within a 24-hour period (minutes) | 0.40 (2.11) | 0.57 (0.48) | <.001 | |
| App 2: average time a user spent on apps that fall into app category 2—passive information and consumption apps—within a 24-hour period (minutes) | 0.29 (0.70) | 0.18 (0.20) | .008 | |