| Literature DB >> 35493532 |
Tomoki Saito1, Hikaru Suzuki1, Akifumi Kishi2.
Abstract
The prevention and treatment of mental illness is a serious social issue. Prediction and intervention, however, have been difficult because of lack of objective biomarkers for mental illness. The objective of this study was to use biometric data acquired from wearable devices as well as medical examination data to build a predictive model that can contribute to the prevention of the onset of mental illness. This was an observational study of 4,612 subjects from the health database of society-managed health insurance in Japan provided by JMDC Inc. The inputs to the predictive model were 3-months of continuous wearable data and medical examinations within and near that period; the output was the presence or absence of mental illness over the following month, as defined by insurance claims data. The features relating to the wearable data were sleep, activity, and resting heart rate, measured by a consumer-grade wearable device (specifically, Fitbit). The predictive model was built using the XGBoost algorithm and presented an area-under-the-receiver-operating-characteristic curve of 0.712 (SD = 0.02, a repeated stratified group 10-fold cross validation). The top-ranking feature importance measure was wearable data, and its importance was higher than the blood-test values from medical examinations. Detailed verification of the model showed that predictions were made based on disrupted sleep rhythms, mild physical activity duration, alcohol use, and medical examination data on disrupted eating habits as risk factors. In summary, the predictive model showed useful accuracy for grouping the risk of mental illness onset, suggesting the potential of predictive detection, and preventive intervention using wearable devices. Sleep abnormalities in particular were detected as wearable data 3 months prior to mental illness onset, and the possibility of early intervention targeting the stabilization of sleep as an effective measure for mental illness onset was shown.Entities:
Keywords: mHealth; machine learning; medical examination; mental illness; physical activity; predictive detection; sleep; wearable data
Year: 2022 PMID: 35493532 PMCID: PMC9046696 DOI: 10.3389/fdgth.2022.861808
Source DB: PubMed Journal: Front Digit Health ISSN: 2673-253X
Features used in building the predictive model.
|
|
|
|
|---|---|---|
|
| ||
| RHR | bpm | monthly average of daily RestingHeartRate |
|
| ||
| veryActiveMinutes | min | monthly average of daily veryActiveMinutes |
| fairlyActiveMinutes | min | monthly average of daily fairlyActiveMinutes |
| lightlyActiveMinutes | min | monthly average of daily lightlyActiveMinutes |
| Steps | steps | monthly average of daily steps measured |
| log_count.active | days | monthly number of days for activity data linkage |
|
| ||
| notMainSleep.minutes | min | monthly average of daily minutes of sleep determined to be “isMainSleep=False” |
| notMainSleep.counts | times | monthly counts of sleep determined to be “isMainSleep=False” |
| log_count.sleep | days | monthly number of days for activity data linkage |
| timeInBed | min | monthly average of daily timeInBed |
| minutesAsleep | min | monthly average of daily minutesAsleep |
| minutesToFallAsleep | min | monthly average of daily minutesToFallAsleep |
| minutesAfterWakeup | min | monthly average of daily minutesAfterWakeup |
| deepMinutes | min | monthly average of daily deepMinutes |
| remMinutes | min | monthly average of daily remMinutes |
| lightMinutes | min | monthly average of daily lightMinutes |
| wakeMinutes | min | monthly average of daily wakeMinutes |
| startTime.sleep | [hhmmss] | monthly average of daily bedtime |
| SC_lag | min | monthly social jetlag |
| chronotype | [hhmmss] | monthly chronotype |
| SRI | - | monthly Sleep Regularity Index |
|
| ||
| BMI | kg/m2 | Body Mass Index |
| SBP | mmHg | Systolic Blood Pressure |
| DBP | mmHg | Diastolic Blood Pressure |
| TG | mg/dl | Triglyceride |
| HDL | mg/dl | HDL cholesterol |
| LDL | mg/dl | LDL cholesterol |
| AST | U/l | Aspartate aminotransferase |
| ALT | U/l | Alanine aminotransferase |
| GT | U/l | Gamma glutamyl transferase |
| FBS | mg/dl | Fasting blood sugar |
| HBA1C | % (NGSP) | HbA1c |
| US | - | Urinary sugar |
| UP | - | Uric protein qualitative |
|
| ||
| SMOKE | - | Do you habitually smoke? |
| DRINK | - | How often do you drink (sake, distilled spirit, beer, liquor)? |
| AMOUNT_DRINK | - | Amount of drinking per day on days when you drink. |
| FITNESS | - | Do you exercise more than 30 min, more than twice in a week and continue this exercise habit more than 1 year? |
| WALK | - | Do you walk or perform same level of physical activity as walk more than 1 h per day? |
|
| ||
| GENDER | - | Gender |
| AGE | years | Age at the end of the 3 months |
| YM | [yyyymm] | the end period of 3 months |
| YEAR | - | the Year of YM |
| MONTH | - | the Month of YM |
Figure 1Overview of the predictive model construction and validation. (A) The predictive model was built using the XGBoost algorithm. The inputs to the model were 3-month's continuous wearable data and those of the medical examinations closest to (i.e., within or prior to) that period; the output was the presence or absence of mental illness over the following month, which was defined based on insurance claims data. (B) Predictive performance was evaluated using a repeated stratified 10-fold cross validation (CV). Because the dataset included different time series from a single person, partition division was conducted to avoid including the same person in the model training and testing data in each fold (group CV).
Figure 2Flow diagram detailing subject inclusion. PHRs, personal health records.
Demographic data of the subjects (N = 4,612).
| Age (years), mean ( | 45.9 (9.1) |
| Gender (male), | 3,289 (71.3) |
| Body mass index (kg/m2), mean ( | 23.2 (3.3) |
N, number of subjects; SD, standard deviation.
Figure 3Receiver-operating-characteristic (ROC) for merged validation data created by 10-fold cross validation. Area-under-the-curve (AUC) = 0.711. The point closest to the top left (0,1) was (0.23, 0.67), and the corresponding cut-off value was 0.9%.
Figure 4Area-under-the-curve (AUC) when the class-weight was moved from 1 to 20 in one-value increments (10-fold cross validation). Other hyperparameters were fixed at the final parameter.
Figure 5Density estimation curve of the onset probability output by the predictive model for the merged validation data created by 10-fold cross validation. The solid line (FLAG = 1) corresponds to individuals with mental illness onset, and the dashed line (FLAG = 0) corresponds to those without mental illness onset.
Figure 6Top-10 features in feature importance (gain) of the XGBoost model built using all of the training data.
Mode values of branches in the most important features and significant differences in onset probability owing to related groups.
|
|
|
|
| ||
|---|---|---|---|---|---|
|
|
|
|
| ||
| notMainSleep.counts1 | 5 | 100% (62/62) | 0.02% (4/20,471) | 0.16% (17/10,453) | <0.001 |
| lightlyActiveMinutes3 | 260 | 100% (20/20) | 0.04% (13/30,008) | 0.14% (11/7,848) | 0.005 |
| SC_lag1 | 105 | 41% (7/17) | 0.06% (20/34,264) | 0.11% (4/3,556) | 0.278 |
| minutesAsleep1 | 419 | 92% (23/25) | 0.06% (21/36,840) | 0.30% (3/1,016) | 0.026 |
| remMinutes1 | 75 | 88% (15/17) | 0.15% (7/4,553) | 0.03% (2/7,326) | 0.032 |
| SRI1 | 0.92 | 54% (7/13) | 0.09% (22/23,637) | 0.01% (2/14,219) | 0.002 |
| log_count.sleep2 | 30 | 100% (14/14) | 0.09% (21/22,381) | 0.02% (3/15,475) | 0.006 |
| DRINK | 2.5 | 100% (9/9) | 0.08% (21/25,439) | 0.02% (2/8,946) | 0.059 |
| TG | 608 | 54% (13/24) | 0.06% (22/37,776) | 2.50% (2/80) | 0.001 |
| GT | 19 | 91% (10/11) | 0.01% (1/10,431) | 0.08% (23/27,425) | 0.010 |
“Mode split cover” is the percentage of the mode in all branches used by the applicable feature. “Probability” is the onset probability of being less than or more than the mode split value relative to the entire training data. Records missing a feature value were excluded. The “p-value” was calculated using Fisher's exact test. GT, glutamyl transferase; SRI, sleep regularity index; TG, triglyceride.