| Literature DB >> 33083565 |
Narges Razavian1,2,3, Vincent J Major1, Mukund Sudarshan4, Jesse Burk-Rafel5, Peter Stella6, Hardev Randhawa7, Seda Bilaloglu1, Ji Chen1, Vuthy Nguy1, Walter Wang1, Hao Zhang1, Ilan Reinstein8, David Kudlowitz5, Cameron Zenger5, Meng Cao5, Ruina Zhang5, Siddhant Dogra5, Keerthi B Harish1, Brian Bosworth5,9, Fritz Francois5,9, Leora I Horwitz1,2,5, Rajesh Ranganath1,3,4, Jonathan Austrian5,7, Yindalon Aphinyanaphongs1,2.
Abstract
The COVID-19 pandemic has challenged front-line clinical decision-making, leading to numerous published prognostic tools. However, few models have been prospectively validated and none report implementation in practice. Here, we use 3345 retrospective and 474 prospective hospitalizations to develop and validate a parsimonious model to identify patients with favorable outcomes within 96 h of a prediction, based on real-time lab values, vital signs, and oxygen support variables. In retrospective and prospective validation, the model achieves high average precision (88.6% 95% CI: [88.4-88.7] and 90.8% [90.8-90.8]) and discrimination (95.1% [95.1-95.2] and 86.8% [86.8-86.9]) respectively. We implemented and integrated the model into the EHR, achieving a positive predictive value of 93.3% with 41% sensitivity. Preliminary results suggest clinicians are adopting these scores into their clinical workflows.Entities:
Keywords: Health care; Prognosis; Viral infection
Year: 2020 PMID: 33083565 PMCID: PMC7538971 DOI: 10.1038/s41746-020-00343-x
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Demographics, outcomes, biomarkers, and vital signs of retrospective cohort.
| Patient Characteristics | All Cohort (% of | With Adverse (% of | Without Adverse (% of | |
|---|---|---|---|---|
| Demographics | ||||
| Age, mean (sd) | 63.5 (16.5) | 65.3 (15.8) | 61.5 (17.0) | <0.0001 |
| Sex, | <0.0001 | |||
| Female | 1275 (38.4%) | 571 (33.4%) | 704 (43.9%) | |
| Male | 2042 (61.6%) | 1141 (66.6%) | 901 (56.1%) | |
| Race, | <0.0001 | |||
| White | 1481 (44.6%) | 794 (46.4%) | 687 (42.8%) | |
| Black | 508 (15.3%) | 204 (11.9%) | 304 (18.9%) | |
| Asian | 246 (7.4%) | 141 (8.2%) | 105 (6.5%) | |
| Other Race | 916 (27.6%) | 478 (27.9%) | 438 (27.3%) | |
| Unknown | 164 (4.9%) | 84 (4.9%) | 80 (5.0%) | |
| Adverse Event Outcomes, | ||||
| Mortality (For all time) | 702 (21.2%) | |||
| Hospice Discharge | 102 (3.1%) | |||
| ICU Admission | 673 (20.3%) | |||
| O2 Support Devices Beyond Nasal Cannula | 1513 (45.6%) | |||
| O2 Flow Rate > 6 L/min on Nasal Cannula | 365 (11.0%) | |||
| Readmission within 96 h of discharge | 20 (0.60%) | |||
| Biomarkers, first value measured, mean (sd) | ||||
| Neutrophils Count (103/uL) | 6.2 (5.4) | 7.3 (6.7) | 4.9 (3.0) | <0.0001 |
| Neutrophils Percent | 74.8 (12.8) | 79.3 (11.2) | 70.0 (12.8) | <0.0001 |
| Lymphocytes Count (103/uL) | 1.1 (1.7) | 1.1 (2.3) | 1.2 (0.74) | 0.014 |
| Lymphocytes Percent | 16.0 (10.4) | 12.6 (8.7) | 19.7 (10.8) | <0.0001 |
| Eosinophils Count (103/uL) | 0.03 (0.12) | 0.02 (0.11) | 0.05 (0.12) | <0.0001 |
| Eosinophils Percent | 0.49 (1.2) | 0.28 (1.0) | 0.70 (1.4) | <0.0001 |
| Platelet Count (103/uL) | 225.9 (98.45) | 222.0 (95.6) | 230.1 (101.2) | 0.017 |
| Blood Urea Nitrogen (mg/dL) | 24.7 (22.8) | 27.5 (23.8) | 21.7 (21.2) | <0.0001 |
| Creatinine (mg/dL) | 1.5 (1.8) | 1.6 (1.7) | 1.4 (1.9) | 0.027 |
| C-Reactive Protein (mg/L) | 124.4 (86.3) | 149.0 (87.8) | 97.0 (75.8) | <0.0001 |
| D-Dimer (ng/mL DDU) | 1295.7 (3582.4) | 1573.6 (4101.2) | 987.0 (2869.9) | <0.0001 |
| Ferritin (ng/mL) | 1324.0 (2315.4) | 1609.4 (2767.8) | 1009.2 (1624.3) | <0.0001 |
| Lactate Dehydrogenase (U/L) | 399.3 (243.9) | 457.0 (279.5) | 337.0 (178.8) | <0.0001 |
| Troponin I (ng/mL) | 0.28 (2.7) | 0.41 (3.5) | 0.13 (1.3) | 0.0032 |
| Vital signs, first 12 h, mean (sd) | ||||
| HR max | 93.7 (17.9) | 96.5 (19.4) | 90.8 (15.7) | <0.0001 |
| Resp max | 23.8 (7.1) | 25.8 (8.3) | 21.6 (4.7) | <0.0001 |
| SpO2 max (%) | 96.3 (2.4) | 96.0 (2.6) | 96.6 (2.1) | <0.0001 |
| Temp max (F) | 99.8 (1.5) | 99.9 (1.6) | 99.6 (1.4) | <0.0001 |
| HR min | 80.2 (14.2) | 80.9 (14.8) | 79.5 (13.5) | 0.0045 |
| Resp min | 18.9 (3.6) | 19.3 (4.2) | 18.5 (2.6) | <0.0001 |
| SpO2 min (%) | 92.4 (4.9) | 91.0 (5.8) | 93.9 (2.9) | <0.0001 |
| Temp min (F) | 98.4 (0.97) | 98.5 (1.0) | 98.4 (0.88) | 0.12 |
aThe With Adverse and Without Adverse groups are compared with: (1) two-sided Welch’s t test for age, biomarkers, vital signs, and days since admission and (2) Pearson’s χ2 test for sex, ethnicity and race.
Distillation of a parsimonious model as a combination of conditionally independent variables.
| Variable Explanation | Conditional Independence | Used in Final Model | Final Model Coefficient (+toward a favorable outcome) | |
|---|---|---|---|---|
| Model Intercept | +1.43 | |||
| 1 | Age | 0.016 | X | |
| 2 | Oxygen support device greater than nasal cannula | 0.016 | ✓ | −7.31 |
| 3 | Respiratory rate, maximum in last 12 h | 0.016 | ✓ | −1.23 |
| 4 | Oxygen saturation, maximum in last 12 h | 0.016 | X | 0 |
| 5 | Oxygen support device of nasal cannula | 0.016 | ✓ | −0.816 |
| 6 | Nasal cannula oxygen flow rate, maximum value in last 12 h | 0.016 | ✓ | 0 if flow > 3 L/min +1.12 if 0 < flow < = 3 L/min +0.424 if flow = 0 |
| 7 | Oxygen saturation, minimum value in last 12 h | 0.016 | ✓ | +1.52 |
| 8 | Temperature, maximum value in last 12 h | 0.016 | ✓ | −0.439 |
| 9 | Lactate dehydrogenase, most recent value | 0.016 | ✓ | −0.168 |
| 10 | Platelet count, most recent value | 0.016 | ✓ | +0.755 |
| 11 | Blood urea nitrogen, most recent value | 0.016 | ✓ | −1.30 |
| 12 | C-reactive protein, most recent value | 0.016 | ✓ | −0.558 |
| 13 | Heart rate, minimum value in last 12 h | 0.033 | ✓ | −0.437 |
| 14 | Respiratory rate, minimum value in last 12 h | 0.033 | ✓ | −0.407 |
| 15 | Eosinophils percent, most recent value | 0.148 | ✓ | +0.916 |
| 16 | Body mass index, maximum value in last 12 h | 0.148 | X | |
| 17 | No oxygen support device (i.e. room air) | 0.803 | X | |
| 18 | Heart rate, maximum value in last 12 h | 0.967 | X | |
| 19 | Neutrophil count, most recent value | 0.967 | X | |
| 20 | Temperature, minimum value in last 12 h | 0.984 | X | |
| 21 | Eosinophil count, most recent value | 0.984 | X | |
| 22 | Weight, maximum value in last 12 h | 0.984 | X | |
| 23 | Mean platelet volume, most recent value | 0.984 | X | |
| 24 | Categorical variable of historical smoking behavior: e.g. non-smoker or smoker | 1.000 | X | |
| 25 | Lymphocyte count, most recent value | 1.000 | X | |
| 26 | Female sex | 1.000 | X | |
| 27 | Number of days since admission | 1.000 | X | |
| 28 | Lymphocytes percent, most recent value | 1.000 | X | |
| 29 | Categorical variable of current smoking behavior: e.g. never, former, current smoker | 1.000 | X | |
| 30 | Troponin I, most recent value | 1.000 | X | |
| 31 | Neutrophils percent, most recent value | 1.000 | X | |
| 32 | Body mass index, minimum value in last 12 h | 1.000 | X | |
| 33 | Creatinine, most recent value | 1.000 | X | |
| 34 | D-dimer, most recent value | 1.000 | X | |
| 35 | Ferritin, most recent value | 1.000 | X | |
| 36 | Weight, minimum value in last 12 h | 1.000 | X | |
| 37 | Categorical variable of patient race and ethnicity | 1.000 | X |
Fig. 1Predictive performance of the blackbox and parsimonious models on retrospective held-out set.
Model performance in an unseen 20% sample of data including 664 unique patients and a total of 5,914 prediction instances. (a) precision recall curve (PRC) for all patients, and (b) receiver operating characteristic (ROC) curve for all patients. (c) PRC for patients at times when patient does not need O2 support beyond nasal cannula at 6 L/min (d) ROC curve for patients at times when patient does not need O2 support beyond nasal cannula of 6 L/min. (e) PRC for patients transferred out of ICU, (f) ROC curve for patients transferred out of ICU. The shaded areas around each curve depict the empirical bounds of one standard deviation computed with a bootstrap procedure with 100 iterations where, in each iteration, 50% of the held-out set is sampled with replacement.
Fig. 2Timing of the first ‘green’ prediction for patients discharged alive from the retrospective held-out set.
(a) Time from admission to the first green score. (b) Time from the first green score to discharge. This analysis includes all held-out set patients with at least one green score who were discharged alive (n = 361) and stratifies that group into patients that received some of their care in an ICU (n = 31) and those who received no ICU care (n = 330).
Fig. 3Electronic Health Record integration and visualization of predictions.
Provider-facing view showing: (1) a patient list column, (2) displaying model scores for a clinician’s list of patients. Hovering over the score triggers a dialog box (3) displaying model scores along with (4) an explanation of contributing factors and (5) a trend line of recent scores. To reduce potential for confusion by clinicians, we display the inverse of the model prediction raw score (i.e 1 - score) and scale the score from 0–100. Consequently, lower scores represent patients at lower risk of adverse outcomes. Negative feature contributions are protective. Note, in the first prediction, the variable “Nasal cannula O2 flow rate Max in last 12 h” has a value of “N/A” because their O2 device is greater than Nasal cannula.
Fig. 4Prospective deployment and evaluation on real-time predictions.
A total of 109,913 predictions were generated on 30-min intervals for 445 patients and 474 admissions. (a) Precision recall curve. (b) Receiver operating characteristic curve. The shaded areas around each curve depict the empirical bounds of one standard deviation computed with a bootstrap procedure with 100 iterations, where in each iteration, 50% of the held-out set is sampled with replacement. Note: the shaded standard deviation of Fig. 4 are present but very small as the many predictions made at a 30-min frequency decreases variance.
Fig. 5Display of model scores to users within the EHR.
Model scores can be shown to users in two different displays that correspond to alternative clinical workflows. (a) Patient list (Fig. 3) display report indicated the number of times users navigated to a patient list that includes our model scores. (b) COVID-19 report describes the number of times a user navigated to a summary report that contained various COVID-19 specific components including our model scores.