| Literature DB >> 30301301 |
Seunghyong Ryu1, Hyeongrae Lee1, Dong-Kyun Lee1, Kyeongwoo Park1.
Abstract
OBJECTIVE: In this study, we aimed to develop a model predicting individuals with suicide ideation within a general population using a machine learning algorithm.Entities:
Keywords: Machine learning algorithm; Prediction; Public health data; Suicide ideation
Year: 2018 PMID: 30301301 PMCID: PMC6258996 DOI: 10.30773/pi.2018.08.27
Source DB: PubMed Journal: Psychiatry Investig ISSN: 1738-3684 Impact factor: 2.505
Characteristics of suicide ideators (N=5,814) and non-suicide ideators (N=29,302)
| Suicide ideator[ | Non-suicide ideator[ | Statistics[ | |
|---|---|---|---|
| Age, years | 54.13 (17.73) | 49.00 (16.26) | T=20.43, p<0.01 |
| Sex | χ2=553.10, p<0.01 | ||
| Male | 1,654 (28.4) | 13,225 (45.1) | |
| Female | 4,160 (71.6) | 16,077 (54.9) | |
| Education | χ2=1,345.13, p<0.01 | ||
| Village school | 41 (0.7) | 72 (0.2) | |
| Uneducated | 834 (14.4) | 1,451 (5.0) | |
| Elementary school | 1,585 (27.4) | 4,896 (16.7) | |
| Middle school | 673 (11.6) | 3,419 (11.7) | |
| High school | 1,342 (23.2) | 8,587 (29.4) | |
| Two- or three-year college | 471 (8.1) | 3,497 (12.0) | |
| Four-year university | 738 (12.7) | 6,221 (21.3) | |
| Graduate school | 107 (1.8) | 1,108 (3.8) | |
| Reasons for unemployment | χ2=1,296.16, p<0.01 | ||
| Do not feel the need | 297 (5.1) | 2,054 (7.0) | |
| Schooling | 119 (2.1) | 774 (2.6) | |
| Retired | 83 (1.4) | 846 (2.9) | |
| Having health problems | 1,471 (25.4) | 2,674 (9.2) | |
| Looking for a job | 350 (6.1) | 1514 (5.2) | |
| Parenting or nursing | 507 (8.8) | 2,818 (9.6) | |
| etc. | 171 (3.0) | 755 (2.6) | |
| Employed | 2,787 (48.2) | 17,773 (60.8) | |
| Average work week, hours | 24.28 (26.86) | 29.73 (25.88) | T=-14.15, p<0.01 |
| Subjective health status | χ2=2,340.76, p<0.01 | ||
| Very good | 126 (2.2) | 1,432 (4.9) | |
| Good | 1,154 (19.9) | 10,041 (34.3) | |
| Fair | 1,982 (34.2) | 12,571 (43.0) | |
| Poor | 1,857 (32.0) | 4,552 (15.6) | |
| Very poor | 679 (11.7) | 657 (2.2) | |
| Days of feeling sick or discomfort, days | 4.46 (6.08) | 1.91 (4.39) | T=30.43, p<0.01 |
| Limitation of daily life and social activities | χ2=1,585.42, p<0.01 | ||
| Yes | 1,858 (32.1) | 3,399 (11.6) | |
| No | 3,937 (67.9) | 25,854 (88.4) | |
| EQ-5D: mobility | χ2=1,574.23, p<0.01 | ||
| No problems | 3,760 (64.9) | 25,099 (85.8) | |
| Some problems | 1,889 (32.6) | 4,046 (13.8) | |
| Confined to bed | 148 (2.6) | 110 (0.4) | |
| EQ-5D: usual activities | χ2=1,910.65, p<0.01 | ||
| No problems | 4,191 (72.3) | 26,825 (91.7) | |
| Some problems | 1,325 (22.9) | 2,225 (7.6) | |
| Unable to perform | 278 (4.8) | 203 (0.7) | |
| EQ-5D: pain/discomfort | χ2=1,812.66, p<0.01 | ||
| No | 3,148 (54.3) | 22,789 (77.9) | |
| Moderate | 2,058 (35.5) | 5,862 (20.0) | |
| Extreme | 590 (10.2) | 603 (2.1) | |
| EQ-5D: anxiety/depression | χ2=3,746.10, p<0.01 | ||
| No | 3,647 (62.9) | 26,866 (91.8) | |
| Moderate | 1,887 (32.6) | 2,280 (7.8) | |
| Extreme | 262 (4.5) | 109 (0.4) | |
| EQ-VAS | 63.76 (21.81) | 75.03 (16.55) | T=-37.125, p<0.01 |
| Depressed mood over 2 weeks | χ2=6,316.11, p<0.01 | ||
| Yes | 2,802 (48.2) | 2,321 (7.9) | |
| No | 3,011 (51.8) | 26,980 (92.1) | |
| Stress level in daily life | χ2=3,295.15, p<0.01 | ||
| Extremely | 837 (14.4) | 844 (2.9) | |
| Stressful | 2,429 (41.8) | 5,524 (18.9) | |
| Moderately | 2,085 (35.9) | 17,541 (59.9) | |
| Minimally | 457 (7.9) | 5,389 (18.4) |
N (%) or mean±SD,
chi-square test or independent t-test.
EQ-5D: EuroQoL-5D, EQ-VAS: EuroQoL-Visual Analogue Scale
Figure 1.A plot of recursive feature elimination with feature selection in the test set.
Figure 2.Scheme of prediction model development.
Figure 3.Receiver operating characteristic (ROC) curves. *15-feature model, †39-feature model. AUC: Area under ROC curve.
Confusion matrix and prediction scores
| Test set (N=1,162) | Entire population (N=35,116) | |
|---|---|---|
| True positive | 448 | 4,860 |
| True negative | 460 | 23,641 |
| False positive | 121 | 5,661 |
| False negative | 133 | 954 |
| Accuracy | 0.781 | 0.821 |
| Sensitivity | 0.771 | 0.836 |
| Specificity | 0.792 | 0.807 |
| Positive predictive value | 0.787 | 0.462 |
| Negative predictive value | 0.776 | 0.961 |