| Literature DB >> 34083734 |
Amara Tariq1, Leo Anthony Celi2,3,4, Janice M Newsome5, Saptarshi Purkayastha6, Neal Kumar Bhatia7, Hari Trivedi8,5, Judy Wawira Gichoya5, Imon Banerjee8,5.
Abstract
The strain on healthcare resources brought forth by the recent COVID-19 pandemic has highlighted the need for efficient resource planning and allocation through the prediction of future consumption. Machine learning can predict resource utilization such as the need for hospitalization based on past medical data stored in electronic medical records (EMR). We conducted this study on 3194 patients (46% male with mean age 56.7 (±16.8), 56% African American, 7% Hispanic) flagged as COVID-19 positive cases in 12 centers under Emory Healthcare network from February 2020 to September 2020, to assess whether a COVID-19 positive patient's need for hospitalization can be predicted at the time of RT-PCR test using the EMR data prior to the test. Five main modalities of EMR, i.e., demographics, medication, past medical procedures, comorbidities, and laboratory results, were used as features for predictive modeling, both individually and fused together using late, middle, and early fusion. Models were evaluated in terms of precision, recall, F1-score (within 95% confidence interval). The early fusion model is the most effective predictor with 84% overall F1-score [CI 82.1-86.1]. The predictive performance of the model drops by 6 % when using recent clinical data while omitting the long-term medical history. Feature importance analysis indicates that history of cardiovascular disease, emergency room visits in the past year prior to testing, and demographic factors are predictive of the disease trajectory. We conclude that fusion modeling using medical history and current treatment data can forecast the need for hospitalization for patients infected with COVID-19 at the time of the RT-PCR test.Entities:
Year: 2021 PMID: 34083734 PMCID: PMC8175333 DOI: 10.1038/s41746-021-00461-0
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Fig. 1Study design.
a Proposed AI model decision point shows the prediction of two patients with distinct outcomes. b CONSORT diagram for Cohort selection process including decision nodes and a number of excluded cases.
Performance for binary classification models with hospitalization and non-hospitalization as two targets, in terms of class-wise and aggregated (weighted average) precision, recall, and F-score; C.I. (95% confidence) was computed using bootstrapping over 1000 iterations with random samples.
| Precision | Recall | F1-score | Number of samples | ||
|---|---|---|---|---|---|
| Demographics | Non hospitalization | 68 | 63 | 65 | 277 |
| Hospitalization | 71 | 74 | 72 | 328 | |
| Overall | 69 | 69 | 69 | ||
| C.I. | 66.8–71.6 | 66.9–71.7 | 66.8–71.6 | ||
| Prescriptions | Non hospitalization | 62 | 86 | 72 | 277 |
| Hospitalization | 82 | 55 | 66 | 328 | |
| Overall | 73 | 69 | 69 | ||
| C.I. | 71.2–75.5 | 67.2–71.9 | 66.7–71.4 | ||
| ICD-9 | Non hospitalization | 74 | 84 | 79 | 277 |
| Hospitalization | 85 | 75 | 80 | 328 | |
| Overall | 80 | 79 | 79 | ||
| C.I. | 78.1–82.0 | 77.1–81.4 | 77.2–81.4 | ||
| CPT | Non hospitalization | 74 | 83 | 78 | 277 |
| Hospitalization | 84 | 75 | 79 | 328 | |
| Overall | 79 | 79 | 79 | ||
| C.I. | 77.1–81.6 | 76.5–81.0 | 76.6–81.1 | ||
| Laboratory test results | Non hospitalization | 72 | 78 | 75 | 277 |
| Hospitalization | 80 | 75 | 77 | 328 | |
| Overall | 76 | 76 | 76 | ||
| C.I. | 74.1–78.8 | 74.0–78.8 | 74.4–79.1 | ||
| Late fusion | Non hospitalization | 84 | 77 | 81 | 277 |
| Hospitalization | 82 | 88 | 85 | 328 | |
| Overall | 83 | 83 | 83 | ||
| C.I. | 81.3–85.3 | 81.1–85.2 | 81.0–85.2 | ||
| Early fusion | Non hospitalization | 83 | 82 | 82 | 277 |
| Hospitalization | 85 | 86 | 85 | 328 | |
| Overall | 84 | 84 | 84 | ||
| C.I. | 82.1–86.1 | 82.1–86.1 | 82.1–86.1 | ||
| Middle fusion | Non hospitalization | 82 | 78 | 80 | 277 |
| Hospitalization | 82 | 86 | 84 | 328 | |
| Overall | 82 | 82 | 82 | ||
| C.I. | 79.9–84.0 | 79.8–84.0 | 79.8–83.9 | ||
| Late fusion – w/o ‘history’ interval | Non hospitalization | 76 | 75 | 75 | 277 |
| Hospitalization | 79 | 80 | 79 | 328 | |
| Overall | 78 | 78 | 78 | ||
| C.I. | 76.5–79.8 | 75.6–79.7 | 75.5–79.7 | ||
| Early fusion – w/o ‘history’ interval | Non hospitalization | 75 | 79 | 77 | 277 |
| Hospitalization | 82 | 77 | 79 | 328 | |
| Overall | 78 | 78 | 78 | ||
| C.I. | 76.3–80.5 | 76.0–80.2 | 76.1–80.3 | ||
| Middle fusion – w/o ‘history’ interval | Non hospitalization | 69 | 83 | 76 | 277 |
| Hospitalization | 83 | 69 | 75 | 328 | |
| Overall | 77 | 75 | 75 | ||
| C.I. | 74.7–79.0 | 73.2–77.6 | 73.2–77.6 |
Fig. 2Statistical analysis of the models.
a PR (left) and ROC (right) curves for model distinguishing between self-isolation and hospitalization outcomes. Each colored line represents a separate model and the color scheme is consistent between PR and ROC curves. Feature importance from (b) early fusion—shows the importance of top 25 individual EMR data component, c late Fusion model—shows the importance of individual EMR data sources. The standard deviation bar (red) is generated via 10-fold cross-validation on the training data. d Calibration curve for early, late and middle fusion models along with Brier scores for each calibrated model.
Fig. 3Performance stratification of the best performaing model based on Early fusion.
a stratification based on race and ethnicity, b stratification based on gender, c stratification based on age.
Stratified patient characteristics.
| Variables | Total cohort (2844 patients) | Train (2275 patients) | Test (569 patients) |
|---|---|---|---|
| AGE, mean(SD) | 55.6 (17.9) | 55.5 (18.0) | 55.7 (17.9) |
| GENDER [mean age/std] | |||
| Male | 1470 (46%) [56.7 (16.8)] | 1115 (46%) [56.8 (17.0)] | 254 (42%) [56.5 (16.2)] |
| Female | 1719 (54%) [54.5 (18.8)] | 1298 (54%) [54.5 (18.8)] | 351 (58%) [55.2 (19.1)] |
| Race | |||
| African American | 1678 (56.4%) | 1357 (56.1%) | 321 (54.4%) |
| Caucasian/White | 593 (19.7%) | 474 (19.6%) | 119 (19.7%) |
| Asian | 79 (2.6%) | 62 (2.6%) | 17 (2.8%) |
| American Indian or Alaska Native | 11 (0.4%) | 6 (0.3%) | 5(0.8%) |
| Multiple | 10 (0.3%) | 6 (0.3%) | 4 (0.7%) |
| Native Hawaiian Pacific Islander | 6 (0.2%) | 2 (0.1%) | 4 (0.7%) |
| Unknown | 638 (21.1%) | 511 (21.1%) | 127 (21.0%) |
| Ethnic group | |||
| Hispanic or Latino | 233 (7.7%) | 188 (7.8%) | 45 (7.4%) |
| Non-Hispanic or Latino | 2131 (70.5%) | 1706 (70.6%) | 425 (70.3%) |
| Unknown | 659 (21.8) | 524 (21.7%) | 135 (22.3%) |
| Comorbidities | |||
| Respiratory disease | 1799 (59.5%) | 1435 (59.3%) | 364 (60.2%) |
| Hypertension | 1372 (45.4%) | 1092 (45.2%) | 280 (46.3%) |
| Renal disease | 1016 (33.6%) | 806 (33.3%) | 210 (34.7%) |
| Diabetes | 467 (15.4%) | 380 (15.7%) | 87 (14.4%) |
Fig. 4Patients characteristics as heatmaps.
Heatmaps of a common comorbidities in our patient population according to different age groups, b relation between race and comorbidities, c relation between ethnic group and comorbidities. The value represented as % and darker color represents higher value.
Fig. 5Proposed fusion AI model architectures.
a Early fusion, b late fusion, c middle fusion/branched NN model.