| Literature DB >> 34813632 |
Jennifer A Bishop1, Hamza A Javed1, Rasheed El-Bouri1, Tingting Zhu1, Thomas Taylor1, Tim Peto2, Peter Watkinson2, David W Eyre2,3, David A Clifton1.
Abstract
BACKGROUND: Delays in patient flow and a shortage of hospital beds are commonplace in hospitals during periods of increased infection incidence, such as seasonal influenza and the COVID-19 pandemic. The objective of this study was to develop and evaluate the efficacy of machine learning methods at identifying and ranking the real-time readiness of individual patients for discharge, with the goal of improving patient flow within hospitals during periods of crisis. METHODS AND PERFORMANCE: Electronic Health Record data from Oxford University Hospitals was used to train independent models to classify and rank patients' real-time readiness for discharge within 24 hours, for patient subsets according to the nature of their admission (planned or emergency) and the number of days elapsed since their admission. A strategy for the use of the models' inference is proposed, by which the model makes predictions for all patients in hospital and ranks them in order of likelihood of discharge within the following 24 hours. The 20% of patients with the highest ranking are considered as candidates for discharge and would therefore expect to have a further screening by a clinician to confirm whether they are ready for discharge or not. Performance was evaluated in terms of positive predictive value (PPV), i.e., the proportion of these patients who would have been correctly deemed as 'ready for discharge' after having the second screening by a clinician. Performance was high for patients on their first day of admission (PPV = 0.96/0.94 for planned/emergency patients respectively) but dropped for patients further into a longer admission (PPV = 0.66/0.71 for planned/emergency patients still in hospital after 7 days).Entities:
Mesh:
Year: 2021 PMID: 34813632 PMCID: PMC8610279 DOI: 10.1371/journal.pone.0260476
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistical analysis of data set.
| Patient admission cohort LOS statistics | Planned admissions | Emergency admissions (all) | Emergency admissions (with infection) |
|---|---|---|---|
| Total no. admissions in dataset | 11,574 | 38,258 | 4,438 |
| Mean LOS (days) | 2.3 | 4.7 | 5.6 |
| Median LOS (days) | 1.9 | 3.2 | 4.9 |
| Min LOS (days) | 0.25 | 0.25 | 0.25 |
| Max LOS (days) | 132 | 195 | 195 |
| Standard deviation in LOS (days) | 5.5 | 8.3 | 9.3 |
| IQR in LOS (days) | 5.2 | 6.7 | 8.4 |
Total number of admissions and the corresponding statistical summary of LOS for each patient admission.
Fig 1Patient sub-dataset diagram.
An illustration of how the sub-datasets were stratified. The figure contains three patients with emergency admissions who had stays that lasted at least 1 day (IDs = 1, 3, 4); at day t = 3 only two of the example patients remained (IDs = 3, 4); and on day t = 7, only one of these patients remained in hospital (ID = 4), therefore we would only be able to make an 8th day discharge prediction for this remaining patient. A comparable example is also displayed for planned admissions.
Patient sub-dataset sizes.
|
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|---|
|
| 11206 | 5524 | 1184 | 972 | 718 | 572 | 478 | 424 |
|
| 21636 | 14010 | 8464 | 5400 | 3998 | 2970 | 2590 | 2038 |
Total amount of unique patient admissions to hospital within each subset of data Dpt or Det, in which t denotes the time passed in days since the patient’s admission.
Model hyperparameters.
Table detailing the LR, RF, SVM and DNN models’ hyperparameters.
| Model Type | Hyper-parameter | Selection |
|---|---|---|
| LR | Norm penalization | l1 |
| LR | Inverse regularisation strength, C | 103 |
| RF | Number of trees | 75 |
| RF | Minimum samples for node split | 2 |
| RF | Minimum samples for leaf node | 1 |
| SVM | Kernel function | Non-linear—radial basis function (RBF) |
| SVM | Soft-margin regularizer, C | 105 |
| SVM | Inverse of the variance of the RBF kernel, γ | 10−4 |
| DNN | Hidden layers activation function | ReLu |
| DNN | Final layer activation function | Sigmoid |
| DNN | Hidden layers | 2 |
| DNN | Nodes per hidden layer | 100 |
| DNN | Dropout rates | 0.3 |
| DNN | Weight-initialization | Random, normally distributed weights. |
| DNN | Optimization algorithm | RMSprop, learning rate of 10−3 |
| DNN | Epoch number | 150 |
| DNN | Batch-size | 20 |
| DNN | Batch normalization | Implemented |
| DNN | Weight matrix norm constraint | 4 |
Table of features.
| Category | Features | Total # | |
|---|---|---|---|
| Demographic | I. Age | II. Charlson Comorbidity Index (CCI) | 2 |
| Seasonal | III. Monday | IV. Tuesday | 7 |
| V. Wednesday | VI. Thursday | ||
| VII. Friday | VIII. Saturday | ||
| IX. Sunday | |||
| ICU | X. ICU Ward: CTTC | XI. ICU Ward: AICU | 14 |
| XII. ICU Ward: CICU | XIII. Non-surgical ICU admission | ||
| XIV. Surgical ICU admission | XV. Planned ICU admission | ||
| XVI. Unplanned ICU admission | XVII. Reparation ICU admission | ||
| XVIII. Local ICU admission | IXX. Time elapse since ICU admission | ||
| XX. Time elapsed since ICU discharge | XXI. Number of ICU admissions | ||
| XXII. ICU LOS | XXIII. Patient is currently in ICU | ||
| Procedures | XXIV. Time elapsed since last theatre visit | XXV. Number of theatre visits | 8 |
| XXVII. Time elapsed since last procedure | |||
| XXVI. Patient has been to theatre | XXIX. Patient had a procedure | ||
| XXVIII. No. of procedures since admission | |||
| XXX. Patient had radiology-based procedure | XXXI. Time since last radiology procedure | ||
| Bloods | XXXII. Any blood tests since admission | XXXIII. Time elapsed since last blood test | 20 |
| XXXIV. Albumin BT taken in <48 hrs | XXXV. BT taken in <48 hrs | ||
| XXXVII. Creatinine BT taken in<48 hrs | |||
| XXXVI. Any BT in <48 hrs abnormal | XXXIX. Potassium BT taken in <48 hrs | ||
| XLI. White blood cell count BT taken in <48 hrs | |||
| XLIII. Albumin BT taken in <24 hrs | |||
| XXXVIII. Sodium BT taken in <48 hrs | |||
| XL. Urea BT taken in <48 hrs | XLV. Any BT in <24 hrs abnormal | ||
| XLII. Haemoglobin BT taken in <48 hrs | XLVII. Sodium BT taken in <24 hrs | ||
| XLIX. Urea BT taken in <24 hrs | |||
| XLVI. Creatinine BT taken in <24 hrs | LI. Haemoglobin BT taken in <24 hrs | ||
| XLIV. Any BT taken in <24 hrs | |||
| XLVIII. Potassium BT taken in <24 hrs | |||
| L. White blood cell count BT taken in <24 hrs | |||
| NEWS | LII. Mean NEWS since admission | LIII. Max NEWS since admission | 26 |
| LIV. Min NEWS since admission | LV. Variability in NEWS since admission | ||
| LVI. Most recent NEWS | LVII. First NEWS on admission | ||
| LVIII. 72–96 hr mean NEWS | LIX. 72–96 hr max NEWS | ||
| LX. 72–96 hr min NEWS | LXI. 72–96 hr variability in NEWS | ||
| LXII. 72–96 hr # of observation sets | LXIII. 48–72 hr mean NEWS | ||
| LXIV. 48–72 hr max NEWS | LXV. 48–72 hr min NEWS | ||
| LXVI. 48–72 hr variability NEWS | LXVII. 48–72 hr # of observation sets | ||
| LXVIII. 24–48 hr mean NEWS | LXIX. 24–48 hr max NEWS | ||
| LXX. 24–48 hr min NEWS | LXXI. 24–48 hr variability in NEWS | ||
| LXXII. 24–48 hr # of observation sets | LXXIII. 0–24 hr mean NEWS | ||
| LXXIV. 0–24 hr max NEWS | LXXV. 0–24 hr min NEWS | ||
| LXXVI. 0–24 hr variability in NEWS | LXXVII. 0–24 hr # of observation sets | ||
| Diagnosis | LXXVIII. Mean LOS of patients in the same CCS category | LXXIX. Variance in LOS of people in the same CCS category | 2 |
A table of all features engineered from the data within the EHR. All features were included in the LR, RF and DNN models. Feature selection was carried out to determine the features to be included in the SVM models from this set. Additional detail about each feature’s data type can be found in Appendix B in (S1 File).
Fig 2Normalised SURF feature selection scores.
The normalised scores resulting from our feature selection methodology, using the SURF feature selection algorithm, for each feature in the datasets. The features on the x-axis of this plot are summarised in Table 4.
Fig 3PPV results.
Scatter plots giving the PPV, calculated for the 20% of patients with the highest positive classification scores, for LR, RF, SVM and DNN models, trained and tested on the different sub-datasets which reflect the patient cohorts at different points in their hospital stays. Planned admissions are represented by D, emergency admissions D and emergency admissions with an infectious disease D*.
PPV results.
| Dataset—Models |
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| Mean SD | 0.83 0.0077 | 0.84 0.0091 | 0.81 0.012 | 0.77 0.018 | 0.75 0.025 | 0.71 0.021 | 0.69 0.055 | 0.63 0.079 |
|
| Mean SD | 0.91 0.012 | 0.92 0.014 | 0.85 0.032 | 0.82 0.046 | 0.76 0.032 | 0.71 0.06 | 0.68 0.094 | ||
|
| Mean SD | 0.92 0.0088 | 0.89 0.0098 | 0.86 0.0084 | 0.81 0.017 | 0.82 0.024 | 0.75 0.023 | 0.65 0.14 | ||
|
| Mean SD | 0.73 0.049 | ||||||||
|
|
| Mean SD | 0.82 0.095 | 0.78 0.016 | 0.77 0.013 | 0.74 0.024 | 0.73 0.034 | 0.71 0.031 | 0.72 0.047 | 0.67 0.045 |
|
| Mean SD | 0.87 0.012 | 0.86 0.014 | 0.83 0.018 | 0.81 0.023 | 0.76 0.019 | 0.77 0.028 | 0.74 0.034 | 0.70 0.038 | |
|
| Mean SD | 0.90 0.0059 | 0.85 0.0094 | 0.83 0.012 | 0.80 0.019 | 0.78 0.022 | 0.75 0.016 | |||
|
| Mean SD | 0.71 0.029 | ||||||||
|
| Mean SD | 0.76 0.018 | 0.72 0.022 | 0.73 0.036 | 0.70 0.051 | 0.68 0.047 | 0.65 0.045 | 0.66 0.061 | 0.61 0.067 | |
|
| Mean SD | 0.83 0.013 | 0.80 0.010 | 0.78 0.029 | 0.75 0.036 | 0.73 0.051 | 0.70 0.057 | 0.68 0.052 | ||
|
| Mean SD | 0.84 0.014 | 0.74 0.029 | 0.71 0.031 | 0.69 0.034 | 0.65 0.032 | ||||
|
| Mean SD | 0.82 0.012 | 0.79 0.021 | 0.66 0.037 | ||||||
Table detailing the mean PPV and 1-standard deviation for the LR, RF, SVM and DNN models trained and tested on the different sub-datasets containing planned admissions, D, emergency admissions, D and emergency admissions with an infection, D*.
NPV results.
| Dataset—Models NPV |
|
|
|
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|---|---|
|
|
| Mean SD | 0.86 0.0093 | 0.87 0.017 | 0.84 0.079 | 0.79 0.048 | 0.72 0.055 | 0.73 0.042 | 0.70 0.065 | 0.63 0.071 |
|
| Mean SD | 0.94 0.022 | 0.95 0.027 | 0.88 0.019 | 0.83 0.031 | 0.81 0.052 | 0.75 0.077 | 0.72 0.089 | ||
|
| Mean SD | 0.94 0.0093 | 0.90 0.011 | 0.87 0.021 | 0.84 0.017 | 0.80 0.025 | 0.76 0.082 | 0.69 0.095 | ||
|
| Mean SD | 0.81 0.023 | 0.73 0.094 | |||||||
|
|
| Mean SD | 0.81 0.0021 | 0.80 0.032 | 0.78 0.027 | 0.77 0.032 | 0.75 0.044 | 0.72 0.039 | 0.74 0.041 | 0.65 0.048 |
|
| Mean SD | 0.90 0.0072 | 0.85 0.013 | 0.86 0.011 | 0.79 0.023 | 0.76 0.031 | 0.75 0.032 | 0.69 0.035 | ||
|
| Mean SD | 0.92 0.013 | 0.84 .0011 | 0.86 0.015 | 0.81 0.012 | 0.81 0.017 | 0.77 0.023 | 0.74 0.028 | 0.75 0.031 | |
|
| Mean SD | 0.83 0.017 | ||||||||
|
| Mean SD | 0.78 0.026 | 0.73 0.018 | 0.72 0.033 | 0.70 0.045 | 0.69 0.055 | 0.64 0.057 | 0.65 0.042 | 0.62 0.063 | |
|
| Mean SD | 0.84 0.021 | 0.78 0.023 | 0.81 0.028 | 0.77 0.032 | 0.74 0.029 | 0.70 0.038 | 0.69 0.045 | ||
|
| Mean SD | 0.86 0.018 | 0.83 0.013 | 0.79 0.028 | 0.76 0.024 | 0.70 0.033 | 0.67 0.039 | 0.68 0.041 | ||
|
| Mean SD | 0.78 0.019 | 0.65 0.044 | |||||||
Table detailing the mean NPV and 1-standard deviation for the LR, RF, SVM and DNN models trained and tested on the different sub-datasets containing planned admissions, D, emergency admissions, D and emergency admissions with an infection, D*.