| Literature DB >> 31401594 |
Mathias Carl Blom1, Awais Ashfaq2,3, Anita Sant'Anna2, Philip D Anderson4,5, Markus Lingman6,7.
Abstract
OBJECTIVES: The aim of this work was to train machine learning models to identify patients at end of life with clinically meaningful diagnostic accuracy, using 30-day mortality in patients discharged from the emergency department (ED) as a proxy.Entities:
Keywords: advance care planning; emergency medicine; machine learning; mortality
Mesh:
Year: 2019 PMID: 31401594 PMCID: PMC6701621 DOI: 10.1136/bmjopen-2018-028015
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Descriptive statistics
| Variable | Complete data set* | Validation set n=55 164 | Development set n=65 776 | |||
| N missing (%) | % exposed† | % exposed | % experiencing outcome in exposed | % experiencing outcome in unexposed | P value‡ | |
| Female | 0 (0.0) | 49.5 | 49.0 | 0.19 | 0.22 | 0.48 |
| Arrived by ambulance | 0 (0.0)§ | 13.6 | 11.1 | 0.87 | 0.12 | <0.001 |
| Referred by physician | 0 (0.0) | 14.0 | 10.1 | 0.36 | 0.19 | 0.006 |
| Triage priority 1 | 0 (0.0) | 0.8 | 0.9 | 1.48 | 0.19 | <0.001 |
| Triage priority 2 | 0 (0.0) | 13.1 | 14.8 | 0.41 | 0.17 | <0.001 |
| Radiology order in ED | 0 (0.0)¶ | 18.1 | 12.8 | 0.27 | 0.20 | 0.19 |
| Left against medical advice | 0 (0.0) | 5.0 | 5.1 | 0.09 | 0.21 | 0.18 |
| Discharged night-time | 0 (0.0) | 30.4 | 33.5 | 0.18 | 0.22 | 0.36 |
| Discharged weekend | 0 (0.0) | 31.0 | 33.0 | 0.17 | 0.23 | 0.12 |
| Discharged summer | 0 (0.0) | 15.2 | 14.7 | 0.11 | 0.22 | 0.04 |
| Discharged winter | 0 (0.0) | 23.3 | 23.4 | 0.22 | 0.20 | 0.73 |
| Male provider | 3385 (2.73) | 44.2 | 43.9 | 0.24 | 0.18 | 0.09 |
| Junior physician | 3385 (2.73) | 22.5 | 25.2 | 0.25 | 0.19 | 0.22 |
| Non-physician provider | 3385 (2.73) | 7.1 | 14.3 | 0.11 | 0.22 | 0.03 |
| Mortality | 0 (0.0) | 0.15 | 0.21 | N/A | N/A | N/A |
| Median (IQR) | Median (IQR) | Median (IQR) in subjects experiencing outcome | Median (IQR) in subjects not experiencing outcome | P** | ||
| Age (years) | 0 (0.0) | 42.0 | 31.0 | 81.0 | 31.0 | <0.001 |
| Comorbidity score | 3035 (2.45) | 0.0 | 0.0 | 2.0 | 0.0 | <0.001 |
| ED census (N) | 0 (0.0) | 29.0 | 30.0 | 33.0 | 30.0 | 0.02 |
| Hospital bed occupancy (%) | 0 (0.0) | 92.0 | 89.1 | 90.1 | 89.1 | 0.87 |
*N before excluding missing values.
†Proportion of subjects sharing characteristic indicated in ‘variable’ column.
‡P-value for difference in outcome, exposed vs unexposed, non-adjusted, development set. Arrived by ambulance, referred by physician, triage priority 1 and 2, discharged summer, non-physician provider with p<0.05.
§Database-linkage between source table and ambulance dispatches for 14 918 (12.0%) subjects.
¶Database-linkage between source table and radiology orders for 18 435 (14.9%) subjects.
**P-value for difference in predictor distribution, subjects experiencing outcome vs subjects not experiencing outcome, non-adjusted, development set. Age, comorbidity score and ED census with p<0.05.
ED, emergency department.
Exclusion analysis
| Change (N) | Cohort size (N) | |
| All ED visits 2015–2016 in database | N/A | 177 833 |
| Including all ED visits with discharge destination ‘home’ | +109 745 | 109 745 |
| Including all ED visits with discharge destination ‘referred’ | +8070 | 117 815 |
| Including all ED visits with discharge destination ‘LAMA’ | +6644 | 124 459 |
| Excluding ED visits with discharge destination ‘admitted to hospital’ | −112 | 124 347 |
| Excluding visits to odontology | −339 | 124 008 |
| Excluding ED visits with where patient has unknown gender | −7 | 124 001 |
| Excluding ED visits where patient age is not >0.00 years | −26 | 123 975 |
| Excluding missing values | −3035 | 120 940 |
| Final sample | N/A | 120 940 |
ED, emergency department; LAMA, leave against medical advice; N/A, not applicable.
Algorithm performance (development and validation set)
| Development set | Validation set | |||||
| ROC–AUC | Sensitivity | Specificity | ROC–AUC | Sensitivity | Specificity | |
| KNN | 0.923 | 0.856 | 0.850 | 0.925 | 0.891 | 0.844 |
| SVM | 0.944 | 0.921 | 0.854 | 0.945 | 0.869 | 0.858 |
| MLP | 0.975 | 1.00 | 0.922 | 0.867 | 0.500 | 0.925 |
| RF | 0.962 | 0.750 | 0.954 | 0.934 | 0.737 | 0.907 |
| AB | 1.000 | 1.000 | 1.000 | 0.499 | 0.000 | 0.999 |
| LR | 0.940 | 0.714 | 0.944 | 0.942 | 0.890 | 0.861 |
AB, boosted gradient trees; KNN, K-nearest neighbours; LR, logistic regression; MLP, neural network; RF, random forests; SVM, support vector machine.
Figure 1Algorithm performance (development and validation set). AB, boosted gradient trees; KNN, K-nearest neighbours; LR, logistic regression; MLP, neural network; RF, random forests; SVM, support vector machine.
Figure 2Variable importance using the RF algorithm. ED, emergency department; LAMA, leave against medical advice; RF, random forests.