| Literature DB >> 31872067 |
Lin Wang1, Zhong Xue1, Chika F Ezeana2, Mamta Puppala2, Shenyi Chen2, Rebecca L Danforth1, Xiaohui Yu2, Tiancheng He2, Mark L Vassallo3, Stephen T C Wong1,2.
Abstract
Patient falls during hospitalization can lead to severe injuries and remain one of the most vexing patient-safety problems facing hospitals. They lead to increased medical care costs, lengthened hospital stays, more litigation, and even death. Existing methods and technology to address this problem mostly focus on stratifying inpatients at risk, without predicting fall severity or injuries. Here, a retrospective cohort study was designed and performed to predict the severity of inpatient falls, based on a machine learning classifier integrating multi-view ensemble learning and model-based missing data imputation method. As input, over two thousand inpatient fall patients' demographic characteristics, diagnoses, procedural data, and bone density measurements were retrieved from the HMH clinical data warehouse from two separate time periods. The predictive classifier developed based on multi-view ensemble learning with missing values (MELMV) outperformed other three baseline models; achieved a cross-validated AUC of 0.713 (95% CI, 0.701-0.725), an AUC of 0.808 (95% CI, 0.740-0.876) on the separate testing set. Our studies show the efficacy of integrative machine-learning based classifier model in dealing with multi-source patient data, which in this case delivers robust predictive performance on the severity of patient falls. The severe fall index provided by the MELMV classifier is calculated to identify inpatients who are at risk of having severe injuries if they fall, thus triggering additional steps of intervention to prevent a harmful fall, beyond the standard-of-care procedure for all high-risk fall patients.Entities:
Keywords: Disease prevention; Risk factors
Year: 2019 PMID: 31872067 PMCID: PMC6908660 DOI: 10.1038/s41746-019-0200-3
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Characteristics and variables of the 1837 training data and 306 testing data.
| Characteristics | Training set ( | Testing set ( |
|---|---|---|
| No. Unique Patients | 1692 | 275 |
| Age, Mean (SD) | 63.0 (15.7) | 64.6 (15.8) |
| Sex, No. (%) | ||
| Female | 942 (51.3) | 148 (48.4) |
| Male | 895 (48.7) | 158 (51.6) |
| Race, No. (%) | ||
| Asian | 32 (1.7) | 7 (2.3) |
| Black | 432 (23.5) | 73 (23.9) |
| Caucasian | 1120 (61.0) | 213 (69.6) |
| Hispanic | 30 (1.6) | 0 (0) |
| Indian, American | 5 (0.3) | 4 (1.3) |
| Other or unknown | 218 (11.9) | 9 (2.9) |
| Bone density | ||
| Type: Dual Femur, No. (%) | 137 (7.5) | 27 (8.8) |
| Bone Mass Density, mean (SD), g/cm2 | 0.538 (0.183) | 0.926 (0.199) |
| Bone Density T-Score, mean (SD) | −1.154 (1.312) | −1.011 (1.438) |
| Type: forearm, No. (%) | 92 (5.0) | 7 (2.3) |
| Bone Mass Density, mean (SD), g/cm2 | 0.526 (0.252) | 0.745 (0.221) |
| Bone Density T-Score, mean (SD) | −1.540 (1.768) | −1.629 (1.428) |
| Disease | ||
| ICD-9, Distinct No. | 3170 | 1891 |
| Disease category, Distinct No. | 723 | 442 |
| Procedural | ||
| No. (%) | 559 (30.4) | 97 (31.7) |
| CPT code, Distinct No. | 202 | 39 |
| Severe falls, No. (%) | 297 (16.2) | 33 (10.8) |
Model performance comparison with different imputation methods for missing data.
| Modeling approach | Training set AUC (95% CI) | Testing set AUC (95% CI) |
|---|---|---|
| MELMV Model with Model-based Imputation | 0.713 (0.701–0.725) | 0.808 (0.740–0.876) |
| Random forest with Model-based Imputation | 0.679 (0.664–0.695) | 0.753 (0.665–0.841) |
| Model of ensemble of all classifiers | 0.628 (0.616–0.640) | 0.706 (0.622–0.790) |
| Single Logistic Regression Model with multivariable imputation | 0.668 (0.653–0.682) | 0.728 (0.726–0.730) |
| Single Support Vector Machine Model with multivariable imputation | 0.618 (0.607–0.628) | 0.596 (0.592–0.600) |
| Model of ensemble single view of LG and SVM | 0.619 (0.605–0.634) | 0.645 (0.643–0.648) |
Fig. 1The performance of the multi-view ensemble learning with missing data classifier (MELMV) on the testing set.
Model performance was evaluated by the receiver operating characteristic curve on a prospective testing set. The 95% CIs of specificity were also showed at shaded band. ROC: receiver operating characteristic curve; AUC: area under the receiver operating characteristic curve; sn: sensitivity; sp: specificity; MELMV: multi-view ensemble learning with missing value classifier.
Fig. 2The comparison of the performance of the multi-view ensemble learning with missing data classifier (MELMV) and Hester Davis score on the high risk fall patients in the testing data.
a Performance of the MELMV model evaluated by the receiver operating characteristic curve on the testing set with high risk of fall based on the Hester Davis score. b Performance of the Hester Davis score as severity fall classifier evaluated by the receiver operating characteristic curve on the testing set. ROC: receiver operating characteristic curve; AUC: area under the receiver operating characteristic curve; sn: sensitivity; sp: specificity; MELMV: multi-view ensemble learning with missing value classifier.
List of disease areas of the training dataset whose injury scores are significantly higher than the population mean, with statistical power ≥80%, p-value < 0.05.
| Ave injury scores | No of cases | Disease area | |
|---|---|---|---|
| 0.01 | 5.30 | 10 | Occlusion and Stenosis of multiple and bilateral precerebral |
| 0.02 | 5.29 | 7 | Rhinovirus Infection in conditions classified elsewhere |
| 0.02 | 5.17 | 18 | Venous (Peripheral) insufficiency, unspecified |
| 0.00 | 5.02 | 46 | Mixed acid-base balance disorder |
| 0.01 | 4.91 | 33 | Other diuretics causing adverse effects in therapeutic use |
| 0.03 | 4.90 | 31 | Hepatorenal Syndrome |
| 0.04 | 4.89 | 35 | Acute and chronic respiratory failure |
| 0.04 | 4.88 | 24 | Temporary tracheostomy |
| 0.04 | 4.85 | 27 | Kidney replaced by transplant |
| 0.03 | 4.83 | 35 | Hypovolemia |
| 0.02 | 4.80 | 35 | Cachexia |
| 0.00 | 4.80 | 65 | Cardiac pacemaker in situ |
| 0.04 | 4.78 | 49 | Encounter for palliative care |
| 0.03 | 4.71 | 58 | Closed (Endoscopic) biopsy of bronchus |
| 0.03 | 4.70 | 54 | Body mass index less than 19, adult |
| 0.04 | 4.68 | 50 | Occlusion and stenosis of carotid artery without mention of cerebral infarction |
| 0.03 | 4.66 | 99 | Other ascites |
| 0.02 | 4.62 | 111 | Other fluid overload |
| 0.03 | 4.59 | 138 | Long-Term (current) use of steroids |