| Literature DB >> 32028934 |
J Wolff1,2, A Gary3, D Jung4, C Normann5, K Kaier6, H Binder6, K Domschke5, A Klimke7,8, M Franz9,10.
Abstract
BACKGROUND: A common problem in machine learning applications is availability of data at the point of decision making. The aim of the present study was to use routine data readily available at admission to predict aspects relevant to the organization of psychiatric hospital care. A further aim was to compare the results of a machine learning approach with those obtained through a traditional method and those obtained through a naive baseline classifier.Entities:
Keywords: Decision support techniques; Health services administration; Hospitals; Machine learning; Psychiatry
Mesh:
Year: 2020 PMID: 32028934 PMCID: PMC7006066 DOI: 10.1186/s12911-020-1042-2
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Characteristics of inpatient episodes
| 2017 | 2018 | |||
|---|---|---|---|---|
| Number of Episodes (n) | 20,283 | 20,331 | ||
| Age (years, mean & SD) | 48 | 19 | 48 | 19 |
| Female (n & %) | 8872 | 44 | 8869 | 44 |
| GAF Admission (mean & SD) | 35 | 12 | 35 | 12 |
| Length of Stay (days, median & IQR) | 16 | 8–29 | 16 | 8–29 |
| Basic Diagnostic Grouping (n & %) | ||||
| F0/G3 | 2044 | 10.1 | 2099 | 10.3 |
| F1 | 7485 | 36.9 | 7649 | 37.6 |
| F2 | 2929 | 14.4 | 3047 | 15.0 |
| F3 | 5566 | 27.4 | 5365 | 26.4 |
| Others | 2259 | 11.1 | 2171 | 10.7 |
| Study site (n & %) | ||||
| Site 1 | 3564 | 17.6 | 3716 | 18.3 |
| Site 2 | 1313 | 6.5 | 1502 | 7.4 |
| Site 3 | 2436 | 12.0 | 2548 | 12.5 |
| Site 4 | 2115 | 10.4 | 1983 | 9.8 |
| Site 5 | 2159 | 10.6 | 2284 | 11.2 |
| Site 6 | 3854 | 19.0 | 3656 | 18.0 |
| Site 7 | 1493 | 7.4 | 1446 | 7.1 |
| Site 8 | 1636 | 8.1 | 1662 | 8.2 |
| Site 9 | 1713 | 8.4 | 1534 | 7.5 |
| 1:1 Observation (n & %) | 265 | 1.3 | 265 | 1.3 |
| Crisis Intervention (n & %) | 219 | 1.1 | 192 | 0.9 |
| Non-Response (n & %) | 5108 | 25.2 | 4617 | 22.7 |
| Coercive Treatment (n & %) | 1306 | 6.9 | 1382 | 6.8 |
SD Standard deviation, GAF Global Assessment of Functioning, IQR Interquartile range, F0/G3 Organic mental disorders, F1 Substance-related mental disorders, F2 Schizophrenia, schizotypal and delusional disorders, F3 Affective Disorders
Fig. 1Receiver Operating Characteristic Curves, A = Precision at least 33%, B = Precision at least 25%, C=Precision at least 20%, CI = 95% Confidence Interval. Crossed circles show cut-off values that maximise sensitivity at different minimum thresholds of precision. Grey areas are not clinically meaningful because of a sensitivity of less than 0.2. Cut-off points in grey areas are not shown
Fig. 2Precision and Recall Plot, A = Precision at least 33%, B = Precision at least 25%, C=Precision at least 20%. Dashed horizontal line shows the prevalence of the outcome. Crossed circles show cut-off values that maximise sensitivity at different minimum thresholds of precision. Grey areas are not clinically meaningful because of a precision or recall of less than 0.2. Cut-off points in grey areas are not shown. Actual precision could be more than minimum precision
Perfomance Measures
| Sensitivity / Recall | Specificity | Positive Predictive Value / Precision | Negative Predictive Value | Prevalence | Detection Prevalence | Balanced Accuracy | |
|---|---|---|---|---|---|---|---|
| Precision at least 20% | |||||||
| Non-Response | 1.00 | 0.00 | 0.23 | 1.00 | 0.23 | 1.00 | 0.50 |
| Coercive Treatment | 0.73 | 0.78 | 0.20 | 0.97 | 0.07 | 0.26 | 0.76 |
| Long LOS | 0.98 | 0.28 | 0.20 | 0.99 | 0.16 | 0.76 | 0.63 |
| Short LOS | 0.83 | 0.37 | 0.20 | 0.92 | 0.16 | 0.66 | 0.60 |
| Precision at least 25% | |||||||
| Non-Response | 0.96 | 0.15 | 0.25 | 0.93 | 0.23 | 0.87 | 0.56 |
| Coercive Treatment | 0.48 | 0.89 | 0.25 | 0.96 | 0.07 | 0.13 | 0.69 |
| Long LOS | 0.94 | 0.48 | 0.25 | 0.98 | 0.16 | 0.58 | 0.71 |
| Short LOS | 0.61 | 0.65 | 0.25 | 0.90 | 0.16 | 0.39 | 0.63 |
| Precision at least 33% | |||||||
| Non-Response | 0.52 | 0.69 | 0.33 | 0.83 | 0.23 | 0.36 | 0.61 |
| Coercive Treatment | 0.23 | 0.97 | 0.33 | 0.94 | 0.07 | 0.05 | 0.60 |
| Long LOS | 0.49 | 0.82 | 0.33 | 0.90 | 0.16 | 0.23 | 0.65 |
| Short LOS | 0.41 | 0.84 | 0.33 | 0.88 | 0.16 | 0.20 | 0.62 |
Outcomes without clinically meaningful operational points are not shown (Crisis Intervention & 1:1 Observation). Actual precision could be more than minimum precision. TP True Positive, FP False Positive, TN True negative, FN False Negative, Sensitivity = TP/(TP+ FN), Specificity = TN/(TN + FP), Positive Predictive Value = TP/(TP + FP), Negative Predictive Value = TN/(TN + FN), Prevalence = (TP + FN)/(TP + FP + TN + FN), Detection Prevalence = (TP + FP)/(TP + FP + TN + FN), Balanced Accuracy = (Sensitivity+Specificity)/2
Fig. 3Performance in different study sites. One point represents one study site. The diamond represents the mean using the sites as units. ROC = Receiver operating characteristic. AUC = Area under the curve. LOS = Length of stay
Fig. 4Importance of variables in predictions. F0/G3 = Organic mental disorders, F1 = Substance-related mental disorders, F2 = Schizophrenia, schizotypal and delusional disorders, F3 = Affective Disorders. GAF = Global Assessment of Functioning, Adm. = Admission, GP = General Practitioner