| Literature DB >> 34601240 |
Thomas W Campbell1, Melissa P Wilson2, Heinrich Roder3, Samantha MaWhinney4, Robert W Georgantas3, Laura K Maguire3, Joanna Roder3, Kristine M Erlandson5.
Abstract
RATIONALE: Prognostic tools for aiding in the treatment of hospitalized COVID-19 patients could help improve outcome by identifying patients at higher or lower risk of severe disease. The study objective was to develop models to stratify patients by risk of severe outcomes during COVID-19 hospitalization using readily available information at hospital admission.Entities:
Keywords: COVID-19; Clinical decision support systems; Machine learning; Prognostic models; Risk assessment
Mesh:
Year: 2021 PMID: 34601240 PMCID: PMC8459591 DOI: 10.1016/j.ijmedinf.2021.104594
Source DB: PubMed Journal: Int J Med Inform ISSN: 1386-5056 Impact factor: 4.046
Patient characteristics for the development and independent validation cohorts.
| Development Cohort (N = 229) | Validation Cohort (N = 330) | ||
|---|---|---|---|
| Categorical Attribute | Class | n (%) | n (%) |
| Race | White | 41 (17.9) | 113 (34.2) |
| Black/African American | 52 (22.7) | 23 (7.0) | |
| Hispanic/Latino | 94 (41.0) | 164 (49.7) | |
| Other | 31 (13.5) | 25 (7.6) | |
| Unknown | 11 (4.8) | 5 (1.5) | |
| Sex | Male | 124 (54.1) | 172 (52.1) |
| Female | 105 (45.9) | 158 (47.9) | |
| eGFR | ≥60 mL/min/1.73 m2 | 180 (78.6) | 244 (73.9) |
| 30 ≥ but < 60 mL/min/1.73 m2 | 34 (14.8) | 61 (18.5) | |
| <30 mL/min/1.73 m2 | 15 (6.6) | 25 (7.6) | |
| Hypertension | Yes | 100 (43.7) | NA |
| No | 126 (56.3) | NA | |
| Diabetes | Yes | 82 (35.8) | NA |
| No | 147 (64.2) | NA | |
| BMI | 30 (27–36) | ||
| Age | 57 (43–68) | 57 (44–70) | |
| Temperature | 37 (37–38) | 37 (37–38) | |
| Heart Rate | 98 (84–110) | 98 (85–110) | |
| Systolic | 130 (120–150) | 130 (120–140) | |
| Diastolic BP | 74 (65–83) | 74 (65–84) | |
| Respiratory Rate | 20 (18–24) | 20 (18–24) | |
| Oxygen Saturation | 92 (87–94) | 92 (88–95) | |
| Weight | 82 (72–100) | 85 (73–100) | |
| QTc | 440 (420–460) | 440 (420–470) | |
| Sodium | 140 (130–140) | 140 (130–140) | |
| Potassium | 3.8 (3.4–4.0) | 3.9 (3.6–4.2) | |
| Carbon dioxide | 23 (21–25) | 23 (21–25) | |
| BUN | 13 (10–20) | 15 (10–22) | |
| Creatinine | 0.94 (0.69–1.2) | 0.87 (0.72–1.2) | |
| Anion Gap | 12 (10–13) | 11 (9–13) | |
| WBC Count | 6.8 (5.3–8.9) | 7 (5.5–9.1) | |
| Hemoglobin | 15 (13–16) | 14 (13–15) | |
| Hematocrit | 44 (40–47) | 42 (38–46) | |
| Platelet Count | 210 (160–260) | 200 (160–260) | |
| LDH* U/L | 320 (260–420) | 450 (290–750) | |
| D-Dimer | 860 (530–1500) | 740 (410–1500) | |
| CRP | 83 (41–150) | 79 (37–160) | |
| Ferritin | 360 (170–730) | 310 (130–600) | |
Definition of abbreviations: eGFR = estimated glomerular filtration rate; BP = blood pressure; QTc = corrected QT interval; BUN = blood urea nitrogen; CO2 = carbon dioxide; WBC = white blood cell; LDH = lactate dehydrogenase; CRP = C-reactive protein; BMI = body mass index.
Used for classification.
Only complete for 205 out 229 patients.
Fig. 1Consort diagrams of patient selection down to development (A) and validation (B) cohorts.
Fig. 2Hierarchical Configuration of Classifiers used for Risk Assessment for Each Endpoint. A Diagnostic Cortex model with in-bag decision tree model (represented by the top box) was used to stratify the entire development cohort into a higher and lower risk group for each endpoint. Diagnostic Cortex models (middle boxes) without trees were used to split the resulting two groups further according to one of the two schemas. (Schema A was used for the tests predicting risk of any complication and intubation. Schema B was used for the tests predicting risk of ARDS and admission to the ICU.)
Fig. 3Time from data collection to admission to the ICU for the 85 patients admitted to the ICU in the validation cohort indicating potential utility for ICU admission risk assessment at hospital admission. A time in the [0, 1] bin indicates the patient was admitted on the same day as the data was collected.
Fig. 4Performance Flow Chart for the Risk Assessment Test for ICU Admission for (A) the Development Cohort and (B) the Validation Cohort. Each uncolored box represents a classifier with the contents reflecting the set of patients to be classified by the classifier. The colored boxes represent the final risk groups with the contents reflecting composition of the groups and test performance. Bootstrap 95% confidence intervals for performance metrics are given in the supplement. Pos = Positive (Admitted to ICU); Neg = Negative (Not Admitted to ICU), PPV = Positive Predictive Value.
Fig. 5Performance Flow Chart for the Test Predicting Risk of Developing ARDS in (A) the Development Cohort, (B) the Validation Cohort. Each uncolored box represents a classifier with the contents reflecting the set of patients to be classified by the classifier. The colored boxes represent the final risk groups with the contents reflecting composition of the groups and test performance. Bootstrap 95% confidence intervals for performance metrics are given in the supplement. Pos = Positive (Developed ARDS); Neg = Negative (Did not develop ARDS), PPV = Positive Predictive Value.
Fig. 6Performance Flow Chart for the Test Assessing Risk of Intubation for (A) the Development Cohort and (B) the Validation Cohort. Each uncolored box represents a classifier with the contents reflecting the set of patients to be classified by the classifier. The colored boxes represent the final risk groups with the contents reflecting composition of the groups and test performance. Bootstrap 95% confidence intervals for performance metrics are given in the supplement. Pos = Positive (Intubated); Neg = Negative (Not intubated), PPV = Positive Predictive Value.
Basic lab results and other clinical and demographic attributes have predictive power in predicting risk of adverse events during COVID-19 hospitalization These include things like age, BMI, general inflammatory markers, and others Machine learning can be used to combine these attributes to make reliable predictions of COVID-19 prognosis |
Robust, validated, multi class machine learning risk prediction for 3 endpoints during COVID-19 hospitalization using only easily collectable attributes that are common in EHRs Rigorous explainability analysis to describe which attributes contributed more to the machine learning algorithms’ assignment of risk not found in any similar work |