| Literature DB >> 35885716 |
Juyoung Hong1, Jiwon Kim1, Sunmi Kim2, Jaewon Oh2, Deokjong Lee2,3, San Lee2,3, Jinsun Uh4, Juhong Yoon5, Yukyung Choi1.
Abstract
With the impact of the COVID-19 pandemic, the number of patients suffering from depression is rising around the world. It is important to diagnose depression early so that it may be treated as soon as possible. The self-response questionnaire, which has been used to diagnose depression in hospitals, is impractical since it requires active patient engagement. Therefore, it is vital to have a system that predicts depression automatically and recommends treatment. In this paper, we propose a smartphone-based depression prediction system. In addition, we propose depressive features based on multimodal sensor data for predicting depressive mood. The multimodal depressive features were designed based on depression symptoms defined in the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). The proposed system comprises a "Mental Health Protector" application that collects data from smartphones and a big data-based cloud platform that processes large amounts of data. We recruited 106 mental patients and collected smartphone sensor data and self-reported questionnaires from their smartphones using the proposed system. Finally, we evaluated the performance of the proposed system's prediction of depression. As the test dataset, 27 out of 106 participants were selected randomly. The proposed system showed 76.92% on an f1-score for 16 patients with depression disease, and in particular, 15 patients, 93.75%, were successfully predicted. Unlike previous studies, the proposed method has high adaptability in that it uses only smartphones and has a distinction of evaluating prediction accuracy based on the diagnosis.Entities:
Keywords: depression prediction; depressive symptoms feature; machine learning; smartphone
Year: 2022 PMID: 35885716 PMCID: PMC9318674 DOI: 10.3390/healthcare10071189
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
Figure 1Proposed Architecture. This illustrates the architecture of the proposed depression prediction system. The “Mental Health Protector” application installed on the participant’s smartphone collects multimodal-based passive sensor data and active data, consisting of survey results. The collected data are transmitted to a cloud-based platform for data processing and storage. The participant’s depression is predicted and monitored using the data transmitted to IRIS.
Figure 2Data Collection Procedure. Among psychiatric outpatients, the data collection procedure was informed to users who agree to participate in the study (V1). When the application installation was completed, data were collected for two weeks (M1–M2) after conducting a survey related to the user’s demographic information and mental health (M1). After the first data collection period, a middle test (M2) was conducted, and the second data collection (M2–M3) was performed for two weeks again. When the second data collection is completed, the final test (M3) was conducted, and the participant’s role in the study ended.
Figure 3User Interface of “Mental Health Protector” application. The image shows a part of the data collection application titled “Mental Health Protector” execution screen. The image on the left depicts the initial execution screen, the image in the center depicts the screen that responds to the survey, and the image on the right represents the screen that can check the response status.
Data table list collected from “Mental Health Protector” application. Multimodal sensor data collected for each user are transmitted to a cloud-based big data platform according to a predefined table. Data information about table names can be found in the description.
| Number | Table Name | Description |
|---|---|---|
| 1 | TB_WEB_USR_INFO | User’s web information |
| 2 | TB_API_USER_INFO | API user information |
| 3 | CALL_LOGS | Call incoming/outgoing history |
| 4 | SMS_LOGS | SMS incoming/outgoing history |
| 5 | CELL_INFO | Network signal level and quality information |
| 6 | BATTERY_INFO | Battery power level data for user’s device |
| 7 | SENSORS | Sensor data for user’s device |
| 8 | BLUETOOTH_DEVICES | Bluetooth device data near user’s device |
| 9 | LOCATIONS | GPS for user’s device |
| 10 | WIFI_INFO | Wi-fi data near user’s device |
| 11 | SCREEN_ONOFF | Screen on/off data for user’s device |
| 12 | FACE_LANDMARK | 3D facial landmarks coordinate and expression feature |
| 13 | ACTIVITY_TRANSITION | The type of behavior of user |
| 14 | BLUETOOTH_TYPE_MAP | Bluetooth device type mapping table |
| 15 | SENSOR_MAP | Sensor type mapping table |
Participants’ Demographics. This shows the distribution of demographic information and responses to self-report questionnaires for the participants. Statistics for participants on “age”, “gender”, “depression severity for response scores”, and “depressed mood by cut-point” are provided.
| Variable | Value, n(%) | |
|---|---|---|
| Gender | Male | 43 (40.57) |
| Female | 63 (59.43) | |
| Year of birth | 1960 | 4 (3.77) |
| 1965 | 6 (5.66) | |
| 1970 | 13 (12.26) | |
| 1975 | 17 (16.04) | |
| 1980 | 14 (13.21) | |
| 1985 | 7 (6.60) | |
| 1990 | 14 (13.21) | |
| 1995 | 15 (14.15) | |
| 2000 | 16 (15.09) | |
| Depression severity (PHQ-9) | severe | 5 (4.72) |
| moderately severe | 8 (7.55) | |
| moderate | 20 (18.87) | |
| minor | 22 (20.75) | |
| minimal | 31 (29.25) | |
| normal | 20 (18.87) | |
| Depressed mood (CESD-R) | Depressive | 84 (79.25) |
| Non-Depressive | 22 (20.75) |
Figure 4Overview of the Smartphone-Based Depressed Mood Prediction System. The system for predicting depression consists of three parts (Data Collection, Multimodal Feature Extraction, and Depression Prediction). In the Data Collection, the “Mental Health Protector” program collects multimodal sensor data from smartphone. In the Multimodal Feature Extraction, derived features are extracted from the collected data. In the Depression Prediction, a machine learning-based classifier is used to the derived features to predict the user’s depression.
Figure 5Architecture for Facial Expression Feature. This represents the training process and loss for deep learning-based facial expression feature design.
Multimodal Derived Features. We present multimodal derived features for predicting depression in participants. For the proposed derived features, screen on/off, GPS, accelerometer, gyroscope, and facial image are utilized. Consequently, 33 features are designed to predict depression.
| Derived Feature | Sensor Data | Features |
|---|---|---|
| Amount of sleep | Screen on/off | Statistical features of estimated sleep time per day (maximum, minimum, mean, SD, Q1, Q2, Q3) |
| Quality of sleep | Statistical features of smartphone usage (maximum, minimum, mean, SD, Q1, Q2, Q3) | |
| Location variance | GPS |
|
| Entropy |
| |
| Physical activity per day | Accelerometer, | Average daily physical activity time via Google Activity Recognition Transition API |
| Facial expression | Facial image | Facial expression features |
Accuracy by Derived Features. The table shows the accuracy of the test data for each derived feature we designed. When all the derived features are combined, 74.07% accuracy is obtained.
| Derived Feature | Accuracy | |
|---|---|---|
| Sleep | Amount of sleep | 55.56 |
| Quality of sleep | ||
| Activity | Location Variance | 59.26 |
| Entropy | ||
| Activity | ||
| Facial Expression | Facial Expression | 66.67 |
| Total Derived Features |
| |
Random Forest Classification Result Compared to Diagnosis. The table shows the result of comparing the participant’s diagnosis with depression predictions. The “Depression” row represents the performance predicted to be depressing for 16 depressed patients. When CESD-R is used for depressed patients, the recall rate is 93.75% and the precision rate is 65.21%. Therefore, it shows that CESD-R is more effective than PHQ-9 in predicting depression in patients with depression.
| Survey | Precision | Recall | F1-Score | |
|---|---|---|---|---|
| Depression | PHQ-9 | 64.71 | 68.75 | 66.67 |
| CESD-R |
|
|
| |
| Total | PHQ-9 | 58.71 | 59.26 | 58.91 |
| CESD-R |
|
|
|