| Literature DB >> 35069283 |
Abstract
This study provided baseline data for preventing depression in female older adults living alone by understanding the degree of their depressive disorders and factors affecting these depressive disorders by analyzing epidemiological survey data representing South Koreans. To achieve the study objective, this study explored the main risk factors of depressive disorders using the stacking ensemble machine technique. Moreover, this study developed a nomogram that could help primary physicians easily interpret high-risk groups of depressive disorders in primary care settings based on the major predictors derived from machine learning. This study analyzed 582 female older adults (≥60 years old) living alone. The depressive disorder, a target variable, was measured using the Korean version of Patient Health Questionnaire-9. This study developed five single predictive models (GBM, Random Forest, Adaboost, SVM, XGBoost) and six stacking ensemble models (GBM + Bayesian regression, RandomForest + Bayesian regression, Adaboost + Bayesian regression, SVM + Bayesian regression, XGBoost + Bayesian regression, GBM + RandomForest + Adaboost + SVM + XGBoost + Bayesian regression) to predict depressive disorders. The naive Bayesian nomogram confirmed that stress perception, subjective health, n-6 fatty acid, n-3 fatty acid, mean hours of sitting per day, and mean daily sleep hours were six major variables related to the depressive disorders of female older adults living alone. Based on the results of this study, it is required to evaluate the multiple risk factors for depression including various measurable factors such as social support.Entities:
Keywords: depressive disorders; extreme gradient boosting (XGBoost); multiple risk factors; naive Bayesian nomogram; stacking ensemble; synthetic minority oversampling technique (SMOTE)
Year: 2022 PMID: 35069283 PMCID: PMC8777037 DOI: 10.3389/fpsyt.2021.773290
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 4.157
Figure 1The concept of a stacking ensemble.
Figure 2Process flow diagram for depressive disorders of female older adults living alone.
The procedure for performing SMOTE.
|
| |
|---|---|
| Step 1 | It calculates K-nearest neighbors for |
| Step 2 | It randomly selects one neighbor |
| Step 3 | It generates a new sample ( |
|
| |
| Step 4 | The new sample ( |
Figure 3The concept of the LOOCV method.
Figure 4Example of a naive Bayesian nomogram.
Data, measurement unit of female older adults living alone in South Korea, mean ± SD/n (%).
|
|
|
|---|---|
| Age | 72.9 ± 6.20 |
|
| |
| No | 214 (37.2) |
| Yes | 364 (62.8) |
|
| |
| No | 535 (92.7) |
| Yes | 42 (7.3) |
|
| |
| No | 552 (90.6) |
| Yes | 54 (9.4) |
|
| |
| No | 456 (78.5) |
| Yes | 125 (21.5) |
|
| |
| Very thin | 43 (7.4) |
| Slightly skinny | 55 (9.5) |
| Average | 226 (39.1) |
| Slightly obese | 192 (33.2) |
| Very obese | 62 (10.7) |
|
| |
| Try to lose weight | 176 (30.4) |
| Try to maintain weight | 72 (12.5) |
| Try to gain weight | 33 (5.7) |
| Never tried to control weight | 297 (51.4) |
|
| |
| <KRW 1.5 million | 514 (88.6) |
| ≥KRW 1.5 million | 66 (11.4) |
|
| |
| Middle school graduation or below | 503 (86.4) |
| High school graduation or above | 78 (13.4) |
|
| |
| I feel stressed very much | 30 (5.2) |
| I feel stressed a lot | 82 (14.2) |
| I feel stressed a little | 246 (42.7) |
| I hardly feel stressed | 218 (37.8) |
|
| |
| Underweight (<18.5 kg/m2) | 14 (2.5) |
| Normal weight (≥18.5 kg/m2 and <23 kg/m2) | 228 (40.1) |
| Pre-obesity stage (≥23 kg/m2 and <25 kg/m2) | 193 (33.9) |
| Stage 1 obesity (≥25 kg/m2 and <30 kg/m2) | 104 (18.3) |
| Stage 2 obesity (≥30 kg/m2 and <35 kg/m2) | 29 (5.1) |
| Stage 3 obesity (≥35 kg/m2) | 1 (0.2) |
|
| |
| 5–7 times per week | 480 (88.9) |
| 3–4 times per week | 25 (4.6) |
| 1–2 times per week | 13 (2.4) |
| Rarely | 22 (4.1) |
|
| |
| Yes | 13 (2.2) |
| No | 569 (97.8) |
|
| |
| Good | 79 (13.6) |
| Okay | 271 (46.6) |
| Bad | 232 (39.9) |
|
| |
| Normal | 94 (16.2) |
| Prehypertension | 102 (17.5) |
| Hypertension | 386 (66.3) |
|
| |
| Normal | 243 (45.3) |
| Impaired fasting glucose | 148 (27.6) |
| Diabetes | 145 (27.1) |
|
| |
| No | 423 (87.4) |
| Yes | 61 (12.6) |
|
| |
| No | 318 (59.3) |
| Yes | 218 (40.7) |
| Waist Circumference (cm) | 84.6 ± 9.8 |
| N-3 fatty acid intake per day (g) | 1.35 ± 1.8 |
| N-6 fatty acid intake per day (g) | 5.47 ± 6.7 |
| Vitamin c intake per day(g) | 50.1 ± 57.6 |
| Energy intake per day (Kcal) | 1,416.8 ± 603.9 |
| Water intake per day (g) | 693.5 ± 497.7 |
| Protein intake per day (g) | 45.6 ± 25.0 |
| Cholesterol intake per day (g) | 113.6 ± 174.9 |
| Carbohydrate intake per day (g) | 254.6 ± 109.0 |
| Calcium intake (mg) | 386.6 ± 264.8 |
| Vitamin A intake per day (ug) | 432.9 ± 483.4 |
| Usual hours of sitting per day | 9.0 ± 4.0 |
|
| |
| <5 | 49 (8.5) |
| 5–6 | 72 (12.4) |
| 6–7 | 108 (18.7) |
| 7–8 | 159 (27.5) |
| 8–9 | 116 (20.0) |
| >9 | 75 (13.0) |
|
| |
| No | 509 (87.5) |
| Yes | 73 (12.5) |
Figure 5The comparison of 11 machine learning models (accuracy, precision, recall, F1-score) for predicting the depressive disorders of female older adults living alone.
Figure 6The feature importance (best 6) of final model.
Figure 7The naive Bayesian nomogram for predicting the depressive disorders of female older adults living alone in South Korea.
Figure 8ROC of the Bayesian nomogram for predicting the depressive disorders of female older adults living alone.
Figure 9Classification accuracy of the Bayesian nomogram for predicting the depressive disorders of female older adults living alone.
Figure 10F1-score of the Bayesian nomogram for predicting the depressive disorders of female older adults living alone.
Figure 11Calibration plot of the Bayesian nomogram for predicting the depressive disorders of female older adults living alone.