| Literature DB >> 35168594 |
Lloyd D Balbuena1, Marilyn Baetz2, Joseph Andrew Sexton3, Douglas Harder4, Cindy Xin Feng5, Kerstina Boctor6, Candace LaPointe4, Elizabeth Letwiniuk4, Arash Shamloo6, Hemant Ishwaran7, Ann John8, Anne Lise Brantsæter9.
Abstract
BACKGROUND: Machine learning (ML) is increasingly used to predict suicide deaths but their value for suicide prevention has not been established. Our first objective was to identify risk and protective factors in a general population. Our second objective was to identify factors indicating imminent suicide risk.Entities:
Keywords: machine learning; prediction; primary prevention; secondary prevention; suicide
Mesh:
Year: 2022 PMID: 35168594 PMCID: PMC8848909 DOI: 10.1186/s12888-022-03702-y
Source DB: PubMed Journal: BMC Psychiatry ISSN: 1471-244X Impact factor: 3.630
Demographic characteristics of the study participants
| Cohort of Norway | Saskatoon | |||
|---|---|---|---|---|
| N | 173,224 | 13,892 | ||
| Sex | ||||
| Female | 89,074 | 51% | 7101 | 51% |
| Male | 84,150 | 49% | 6791 | 49% |
| Marital status | ||||
| Married / Partnered | 104,665 | 60% | 1413 | 10% |
| Single | 38,470 | 22% | 1120 | 8% |
| Divorced/separated/widowed | 29,415 | 17% | ||
| Missing | 674 | 0% | 11,359 | 82% |
| Suicide deaths | 319 | 0% | 80 | 1% |
| Mean age at entry (sd) | 50 | 16 | 37 | 20 |
| Mean follow-up time in years (sd) | 16.68 | 4.71 | 3.11 | 1.19 |
Notes: For Saskatoon, the figures for Single are combined with Divorced/separated/widowed; Cell entries are frequencies (percentages) except for age and follow-up time.
Multivariable Cox Model for Females in the Cohort of Norway Training Data (n=235)
| Variable | HR | 95% CI | |
|---|---|---|---|
| Age | 1.02 | 1.00-1.03 | 0.01 |
| Married | 0.64 | 0.44-0.94 | 0.02 |
| Lives with spouse/partner | 0.71 | 0.50-1.01 | 0.06 |
| Proportion of county with low income | 1.09 | 1.00-1.17 | .04 |
| Waist-hip ratio | 0.98 | 0.66-14.70 | 0.99 |
| Triglycerides | 1.06 | 0.93-1.19 | 0.38 |
| Daily smoking | 1.52 | 1.08-2.14 | 0.02 |
| Second hand smoke exposure in childhood | 1.09 | 0.77-1.54 | 0.63 |
| Daily hours spent in smoke-filled rooms | 1.04 | 1.00-1.07 | 0.05 |
| Alcohol use | 1.02 | 0.88-1.18 | 0.81 |
| Frequency of hard physical activity at leisure time | 1.09 | 0.94-1.25 | 0.24 |
| Mood symptoms | 1.11 | 1.02-1.21 | 0.01 |
Multivariable Cox Model for Males in the Cohort of Norway Training Data (n=305)
| Variable | HR | 95% CI | |
|---|---|---|---|
| Age | 1.01 | 0.99-1.02 | 0.48 |
| Married | 1.27 | 0.71-2.25 | 0.42 |
| Lives with spouse/partner | 0.61 | .35-1.05 | 0.07 |
| Proportion of county with low income | 1.23 | 1.08-1.40 | 0.001 |
| Waist-hip ratio | 0.01 | 0.00-.66 | 0.03 |
| Triglycerides | 1.28 | 1.00-1.64 | 0.05 |
| Daily smoking | 2.06 | 1.21-3.51 | 0.02 |
| Second hand smoke exposure in childhood | 0.89 | 0.50-1.59 | 0.69 |
| Daily hours spent in smoke-filled rooms | 0.96 | 0.90-1.03 | 0.29 |
| Alcohol use | 1.22 | 0.97-1.53 | 0.09 |
| Frequency of hard physical activity at leisure time | 0.98 | 0.78-1.23 | 0.83 |
| Mood symptoms | 1.18 | 1.05-1.32 | 0.02 |
Fig. 1Predicted survival curves based on the Cox Model for Females (Cohort of Norway) A: Mood symptoms score: 7 (solid line) vs Mood symptoms score: 0 (dashed line) B: 8.3% low-income residents (solid line) vs. 3.0% low-income residents (dashed line) C: Daily smoker (solid line) vs Not daily smoker (dashed line, obscured by solid blue line)
Fig. 2Predicted survival curves based on the Cox Model for Males (Cohort of Norway) A: Mood symptoms score: 7 (solid line) vs Mood symptoms score: 0 (dashed line) B: 8.3% low-income residents (solid line) vs. 3.0% low-income residents (dashed line) C: Daily smoker (solid line) vs Not daily smoker (dashed line, obscured by solid blue line)
Top Predictors in the Random Survival Forest Model fitted to the Cohort of Norway Training Data
| Combined Sexes ( | Males Only ( | Females Only ( | ||||
|---|---|---|---|---|---|---|
| Variable | Importance | Importance | Importance | |||
| Male | 1.45 | <0.01 | — | — | — | — |
| Proportion of county with low income | 0.66 | <0.01 | 0.50 | 0.03 | 0.63 | 0.01 |
| Lives with spouse/partner | 0.56 | <0.01 | 0.58 | 0.01 | — | — |
| Mood symptoms | 0.56 | <0.01 | 0.62 | 0.01 | 0.69 | 0.01 |
| Daily hours spent in smoke-filled rooms | 0.50 | <0.01 | 0.57 | 0.03 | — | — |
| Daily smoking | 0.44 | <0.01 | 0.36 | 0.01 | 0.56 | <0.01 |
| Waist-hip ratio | 0.32 | 0.02 | — | — | — | — |
| Married | 0.27 | 0.01 | 0.71 | <0.01 | — | — |
| Alcohol use | 0.20 | 0.05 | — | — | — | — |
| Takes blood pressure medications | – | – | 0.14 | 0.05 | — | — |
Note: Importance is determined by Altmann’s permutation method [57] in which the permuted values of a variable are compared with the true values. Greater decreases in prediction accuracy reflect higher importance.
Univariate survival models in the Saskatoon Training data (n unique people = 134, n records = 777)
| Variable | OR | 95% CI | |
|---|---|---|---|
| Age at index | 1.02 | 1.00-1.03 | 0.01 |
| Male | 1.05 | 0.62-1.78 | 0.86 |
| RESH score | 0.95 | 0.71-1.27 | 0.72 |
| Number of community mental health visits | 1.16 | 0.88-1.51 | 0.29 |
Note: Each row is a single predictor in addition to interval (6-month periods, in days).
Top Predictors in the Historical Random Forest model fitted to the Saskatoon Training data (n unique people = 134, n records = 777)
| Predictor | Increase in Model Error if Predictor is marginalized | Z-score |
|---|---|---|
| Age at index | .009 | -0.217 |
| Male | .003 | 0.110 |
Note: Predictors are entered simultaneously in the model.