| Literature DB >> 34002167 |
Daniel Rozenbaum1, Jacob Shreve1, Nathan Radakovich2, Abhijit Duggal3, Lara Jehi4, Aziz Nazha1,2,5.
Abstract
OBJECTIVE: To develop predictive models for in-hospital mortality and length of stay (LOS) for coronavirus disease 2019 (COVID-19)-positive patients. PATIENTS AND METHODS: We performed a multicenter retrospective cohort study of hospitalized COVID-19-positive patients. A total of 764 patients admitted to 14 different hospitals within the Cleveland Clinic from March 9, 2020, to May 20, 2020, who had reverse transcriptase-polymerase chain reaction-proven coronavirus infection were included. We used LightGBM, a machine learning algorithm, to predict in-hospital mortality at different time points (after 7, 14, and 30 days of hospitalization) and in-hospital LOS. Our final cohort was composed of 764 patients admitted to 14 different hospitals within our system.Entities:
Keywords: ANC, absolute neutrophil count; AST, aspartate aminotransferase; BMI, body mass index; CK, creatinine kinase; COVID-19, coronavirus disease 2019; CRP, C-reactive protein; CXR, chest radiograph; D1, day 1; ICU, intensive care unit; INR, international normalized ratio; LDH, lactate dehydrogenase; LOS, length of stay; LightGBM, Light Gradient Boosting Machine; NC, nasal cannula; Nan, missing value; PTT, partial thromboplastin time; Q, quartile; ROC AUC, area under the receiver operating characteristics curve; SHAP, SHapley Additive exPlanations; SUN, serum urea nitrogen
Year: 2021 PMID: 34002167 PMCID: PMC8114764 DOI: 10.1016/j.mayocpiqo.2021.05.001
Source DB: PubMed Journal: Mayo Clin Proc Innov Qual Outcomes ISSN: 2542-4548
Patients’ Characteristicsa,b
| Characteristic | All Patients (n = 764) | Death or Hospice (n=116) | Survived (n=648) | |
|---|---|---|---|---|
| Demographic characteristic | ||||
| Race, no. (%) | ||||
| White | 433 (56.7) | 82 (70.7) | 351 (54.2) | .001 |
| African American | 277 (36.3) | 30 (25.9) | 247 (38.1) | .02 |
| Asian | 10 (1.3) | 1 (0.9) | 9 (1.4) | >.99 |
| Multiracial | 28 (3.7) | 1 (0.9) | 27 (4.2) | .11 |
| Ethnicity, no. (%) | ||||
| Non-Hispanic | 705 (94.8) | 109 (96.5) | 596 (94.5) | .51 |
| Hispanic | 39 (5.2) | 4 (3.5) | 35 (5.5) | |
| Age (y), median (Q1, Q3) | 64 (53, 76) | 80 (72, 84) | 62 (52, 73) | <.001 |
| Sex, no. (%) | ||||
| Female | 366 (47.9) | 57 (49.1) | 309 (47.7) | .85 |
| Male | 398 (52.1) | 59 (50.9) | 339 (52.3) | .85 |
| Body mass index (kg/m2), median (Q1, Q3) | 30.1 (25.9, 35.4) | 30.3 (26.5, 35.6) | 28.6 (22.9, 32.7) | <.001 |
| Previous medical history, no. (%) | ||||
| Chronic obstructive pulmonary disease | 95 (13.5) | 17 (16.2) | 78 (13.0) | .47 |
| Asthma | 156 (22.1) | 16 (15.1) | 140 (23.3) | .08 |
| Diabetes | 284 (39.9) | 50 (46.3) | 234 (38.7) | .17 |
| Hypertension | 528 (72.3) | 96 (83.5) | 432 (70.2) | .005 |
| Coronary artery disease | 152 (21.6) | 44 (40.4) | 108 (18.1) | <.001 |
| Heart failure | 139 (19.6) | 44 (40.0) | 95 (15.9) | <.001 |
| Any cancer | 142 (19.4) | 35 (31.5) | 107 (17.3) | .001 |
| Laboratory parameters, median (Q1, Q3) | ||||
| Metabolic indexes | ||||
| Sodium (mEq/L) | 137.0 (134.0, 139.0) | 138.0 (134.0, 141.0) | 137.0 (134.0, 139.0) | .02 |
| Potassium (mEq/L) | 4.0 (3.7, 4.4) | 4.2 (3.8, 4.5) | 4.0 (3.7, 4.3) | <.001 |
| Creatinine (mg/dL) | 1.0 (0.8, 1.4) | 1.6 (1.1, 2.3) | 1.0 (0.8, 1.3) | <.001 |
| Lactate (mg/dL) | 1.4 (1.0, 1.8) | 1.5 (1.2, 2.1) | 1.3 (1.0, 1.8) | .02 |
| Hepatic indexes | ||||
| Alanine aminotransferase (U/L) | 24.0 (15.0, 40.0) | 27.0 (15.0, 41.0) | 23.0 (15.0, 39.0) | 0.38 |
| Aspartate aminotransferase (U/L) | 34.0 (24.0, 52.0) | 43.0 (32.0, 79.0) | 32.0 (23.0, 49.0) | <.001 |
| Total bilirubin (mg/dL) | 0.4 (0.3, 0.6) | 0.5 (0.3, 0.7) | 0.4 (0.3, 0.6) | .05 |
| Alkaline phosphatase (U/L) | 72.0 (57.5, 94.5) | 82.0 (63.5, 104.0) | 71.0 (57.0, 93.2) | .01 |
| Albumin (g/dL) | 3.7 (3.4, 4.0) | 3.4 (3.0, 3.8) | 3.7 (3.4, 4.0) | <.001 |
| Hematologic indexes | ||||
| Hemoglobin (g/dL) | 13.1 (11.6, 14.5) | 11.9 (9.9, 13.8) | 13.3 (11.9, 14.6) | <.001 |
| White blood cell count (k/μL) | 6.4 (4.8, 8.5) | 7.7 (5.4, 10.9) | 6.3 (4.8, 8.2) | <.001 |
| Platelet count (k/μL) | 207.0 (160.0, 267.0) | 198.5 (144.2, 245.2) | 209.0 (163.0, 268.0) | .04 |
| Coagulation indexes | ||||
| International normalized ratio | 1.0 (1.0, 1.1) | 1.1 (1.0, 1.2) | 1.0 (1.0, 1.1) | .04 |
| Partial thromboplastin time (s) | 29.6 (27.1, 33.4) | 30.8 (27.0, 33.7) | 29.4 (27.1, 33.2) | .50 |
| D-Dimer (ng/mL) | 840.0 (490.0, 1615.0) | 1470.0 (825.0, 3380.0) | 780.0 (470.0, 1390.0) | <.001 |
| Inflammatory indexes | ||||
| Lactate dehydrogenase (U/L) | 299.0 (229.8, 401.0) | 400.0 (308.0, 531.0) | 288.0 (223.5, 366.5) | <.001 |
| C-Reactive protein (mg/dL) | 6.5 (3.0, 12.2) | 11.9 (5.7, 17.5) | 5.9 (2.5, 11.3) | <.001 |
| Procalcitonin (ng/mL) | 0.1 (0.1, 0.4) | 0.3 (0.2, 1.4) | 0.1 (0.1, 0.3) | <.001 |
| Ferritin (ng/mL) | 511.4 (255.3, 1009.2) | 852.9 (351.9, 1747.5) | 485.5 (235.1, 893.2) | <.001 |
| Cardiac enzymes | ||||
| Troponin T (ng/mL) | 0.0 (0.0, 0.1) | 0.1 (0.0, 0.2) | 0.0 (0.0, 0.1) | .06 |
| Creatine kinase (U/L) | 135.0 (69.5, 297.0) | 242.0 (105.0, 753.0) | 115.0 (65.8, 228.2) | .001 |
| Treatment-related variables, no. (%) | ||||
| Intensive care unit on admission | 147 (19.2) | 48 (41.4) | 99 (15.3) | <.001 |
| Need for noninvasive mechanical ventilation | 96 (12.6) | 34 (29.3) | 62 (9.6) | <.001 |
| Mechanical ventilation on d 1 | 74 (9.7) | 34 (29.3) | 40 (6.2) | <.001 |
| Mechanical ventilation during stay | 133 (17.4) | 59 (74.7) | 74 (27.4) | <.001 |
| Use of hydroxychloroquine | 293 (52.6) | 39 (48.1) | 254 (53.4) | .45 |
| Use of tocilizumab | 50 (9.0) | 8 (9.9) | 42 (8.8) | .92 |
| New use of steroids | 94 (12.3) | 32 (27.6) | 62 (9.6) | <.001 |
Q, quartile.
SI conversion factors: To convert sodium and potassium values to mmol/L, multiply by 1.0; to convert creatinine values to μmol/L, multiply by 88.4; to convert lactate values to mmol/L, multiply by 0.111; to convert total bilirubin values to μmol/L, multiply by 17.104; to convert albumin and hemoglobin values to g/L, multiply by 10; to convert white blood cell values to ×109/L, multiply by 1; to convert platelet values to ×109/L, multiply by 1; to convert D-dimer values to nmol/L, multiply by 5.476; to convert C-reactive protein values to mg/L, multiply by 10; to convert ferritin values to μg/L, multiply by 1; to convert troponin T values to μg/L, multiply by 1.0.
For the categorical variables, percentages are calculated out of non-missing data points instead of out of 764.
Figure1Ten most important variables for each model. Bar plots show the 10 most important variables for each model based on their SHapley Additive exPlanations (SHAP) values (values generated using the SHAP algorithm indicating how much a variable contributed to the model’s decisions). ANC, absolute neutrophil count; AST, aspartate aminotransferase; BMI, body mass index; CK, creatinine kinase; COVID-19, coronavirus disease 2019; CRP, C-reactive protein; CXR, chest radiograph; D1, day 1; ICU, intensive care unit; INR, international normalized ratio; LDH, lactate dehydrogenase; LOS, length of stay; NC, nasal cannula; PTT, partial thromboplastin time; SUN, serum urea nitrogen.
Figure 2Personalized prediction of mortality and length of stay (LOS). Decision plots show how the probability of the outcome (7-day mortality on the left and LOS >7 days on the right) shifts as each variable is considered for 3 different patients on each side. The starting point in the bottom of each graph is the pre-test probability (ie, overall percentage of patients who died within 7 days or whose LOS was >7 days). For instance, in the top panel left, the probability of dying goes from about 40% to 90% as the patient’s age (of 85 years) is considered by the algorithm. On the left, the 3 patients depicted had similar ages but different outcomes (top 1 died and the other 2 survived), all of which were correctly predicted by the model. On the right, from top to bottom, LOS was 5, 8, and 24 days. BMI, body mass index; CRP, C-reactive protein; D1, day 1; LDH, lactate dehydrogenase; Nan, missing value; NC, nasal cannula; PTT, partial thromboplastin time; SUN, serum urea nitrogen.