| Literature DB >> 33302948 |
Matthijs Blankers1,2,3, Louk F M van der Post4, Jack J M Dekker4,5.
Abstract
BACKGROUND: Accurate prediction models for whether patients on the verge of a psychiatric criseis need hospitalization are lacking and machine learning methods may help improve the accuracy of psychiatric hospitalization prediction models. In this paper we evaluate the accuracy of ten machine learning algorithms, including the generalized linear model (GLM/logistic regression) to predict psychiatric hospitalization in the first 12 months after a psychiatric crisis care contact. We also evaluate an ensemble model to optimize the accuracy and we explore individual predictors of hospitalization.Entities:
Keywords: Acute psychiatry; Machine learning; Prognostic modeling; Psychiatric hospitalization
Year: 2020 PMID: 33302948 PMCID: PMC7731561 DOI: 10.1186/s12911-020-01361-1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Predictor variables organized in three main themes
| Sociodemographics | SPI items | Psychiatric care |
|---|---|---|
| Gender (cat) | Suicide risk (cat) | Patients’ informal social support system involved (cat) |
| Age (num) | Danger to others (cat) | Patient referrer (cat) |
| Living situation (cat) | Severity of psychiatric symptoms (cat) | Number of previous face-to-face treatment contacts up to 2 weeks/1 month/3 months/6 months/12 months before the index crisis care contact (num) |
| Marital status (cat) | Problems with self-care (cat) | Number of previous psychiatric hospitalizations (last 12 months and last 5 years) (num) |
| Cultural background (cat) | Substance misuse (cat) | Number of previous psychiatric day care treatments (last 12 months and last 5 years) (num) |
| Psychiatric diagnosis (cat) | Medical condition(s) (cat) | Number of involuntary treatments/hospitalizations (last 12 months and last 5 years) (num) |
| Global Assessment of Functioning (GAF) score (num) | Disturbances in patients’ family connectedness (cat) | Days of psychiatric hospitalization (last 12 months) (num) |
| Professional functioning (cat) | Any earlier psychiatric care referrals (> 1 year and > 5 years before current contact) (num) | |
| Stability of patients’ living situation (cat) | ||
| Patient is motivated to receive treatment (cat) | ||
| Prescription medication compliance (cat) | ||
| Anosognosia (cat) | ||
| Patients’ family involvement in informal care (cat) | ||
| Symptom persistence (cat) |
Data types cat categorical data, num numerical data
Descriptive statistics for the 2084 patients in the first year after a psychiatric crisis care contact
| Variable | All participants (n = 2084) | Hospitalized (n = 710) | Not hospitalized (n = 1374) | X2 ( | |
|---|---|---|---|---|---|
| M (SD)|n (%) | M (SD)|n (%) | M (SD)|n (%) | |||
| Age | |||||
| Years | 40.8 (15.1) | 41.0 (13.8) | 40.7 (15.7) | 0.94 (1) | 0.33 |
| Sex | |||||
| Male | 1083 (52.0%) | 405 (57.0%) | 678 (49.3%) | 10.81 (1) | 0.001 |
| Female | 1001 (48.0%) | 305 (43.0%) | 696 (50.7%) | ||
| Diagnosis | 120.2 (5) | < 0.0001 | |||
| Psychotic | 807 (38.7%) | 373 (52.5%) | 434 (31.6%) | ||
| Depressive | 285 (13.7%) | 98 (13.8%) | 187 (13.6%) | ||
| Substance related | 239 (11.5%) | 84 (11.8%) | 155 (11.3%) | ||
| Manic/bipolar | 34 (1.6%) | 12 (1.7%) | 22 (1.6%) | ||
| Other | 561 (26.9%) | 103 (14.5%) | 458 (33.3%) | ||
| No or deferred | 158 (7.6%) | 40 (5.6%) | 118 (8.6%) | ||
| Living situation | 35.35 (5) | < 0.0001 | |||
| Alone | 1018 (48.8%) | 385 (54.2%) | 633 (46.1%) | ||
| With partner/other(s) | 564 (27.1%) | 142 (20.0%) | 422 (30.7%) | ||
| With parents | 235 (11.3%) | 73 (10.3%) | 162 (11.8%) | ||
| Homeless | 96 (4.6%) | 42 (5.9%) | 54 (3.9%) | ||
| Institutionalized | 68 (3.3%) | 31 (4.4%) | 37 (2.7%) | ||
| Other | 103 (4.9%) | 37 (5.2%) | 66 (4.8%) | ||
| Cultural background | 12.16 (5) | 0.033 | |||
| Dutch | 1151 (55.2%) | 409 (57.6%) | 742 (54.0%) | ||
| Surinamese/Antilles | 303 (14.5%) | 124 (17.5%) | 189 (13.8%) | ||
| Moroccan | 145 (7.0%) | 44 (6.2%) | 101 (7.4%) | ||
| Turkish | 82 (4.0%) | 22 (3.1%) | 60 (4.4%) | ||
| Other non-western | 243 (11.7%) | 78 (11.0%) | 165 (12.0%) | ||
| Other western | 160 (7.7%) | 43 (6.1%) | 117 (8.5%) | ||
Fig. 1Comparison of AUC scores for the ten machine learning based models. Note AUC (or c-statistic) indicates the performance of the different machine learning based models. The error bars indicate ± 1 standard error intervals
Key performance statistics of the trained models
| ML algorithm | AUC | Sensitivity | Specificity | Accuracy |
|---|---|---|---|---|
| Gradient boosting | 0.774 | 0.455 | 0.894 | 0.744 |
| Oblique random forest | 0.762 | 0.509 | 0.847 | 0.732 |
| DeepBoost | 0.760 | 0.461 | 0.871 | 0.731 |
| Random forest | 0.757 | 0.478 | 0.864 | 0.732 |
| GLM (logistic regression) | 0.756 | 0.444 | 0.876 | 0.729 |
| Support vector machines | 0.751 | 0.370 | 0.917 | 0.731 |
| Naive Bayes | 0.751 | 0.455 | 0.861 | 0.723 |
| Neural network | 0.749 | 0.528 | 0.828 | 0.726 |
| Keras/TensorFlow | 0.741 | 0.465 | 0.850 | 0.719 |
| K-nearest neighbors | 0.702 | 0.356 | 0.879 | 0.701 |
The base rate of (non-)hospitalization = 0.659. The accuracy of each model was tested against this base rate, all p < 0.00001, based on 2-sided z-tests; hence each model led to a significant improvement in classification accuracy compared to an intercept only model
Fig. 2Overall variable importance plot for the machine learning based models. Note This plot presents the 39 predictors (before dummy-recoding) in descending order of unique predictive value. (n) indicates a numeric variable, (cat) indicates the variable is categorical, (SPI) indicates the variable is part of the SPI instrument, (M/F) and (Y/N) variables are dichotomous. Psychiatric care register data have a 5-year time horizon unless otherwise indicated