| Literature DB >> 34545350 |
Carlo Lombardi1, Elena Roca1, Barbara Bigni1, Bruno Bertozzi1, Camillo Ferrandina1, Alberto Franzin1, Oscar Vivaldi1, Marcello Cottini2, Andrea D'Alessio3, Paolo Del Poggio3, Gian Marco Conte4, Alvise Berti5.
Abstract
Early prediction of COVID-19 in-hospital mortality relies usually on patients' preexisting comorbidities and is rarely reproducible in independent cohorts. We wanted to compare the role of routinely measured biomarkers of immunity, inflammation, and cellular damage with preexisting comorbidities in eight different machine-learning models to predict mortality, and evaluate their performance in an independent population. We recruited and followed-up consecutive adult patients with SARS-Cov-2 infection in two different Italian hospitals. We predicted 60-day mortality in one cohort (development dataset, n = 299 patients, of which 80% was allocated to the development dataset and 20% to the training set) and retested the models in the second cohort (external validation dataset, n = 402). Demographic, clinical, and laboratory features at admission, treatments and disease outcomes were significantly different between the two cohorts. Notably, significant differences were observed for %lymphocytes (p < 0.05), international-normalized-ratio (p < 0.01), platelets, alanine-aminotransferase, creatinine (all p < 0.001). The primary outcome (60-day mortality) was 29.10% (n = 87) in the development dataset, and 39.55% (n = 159) in the external validation dataset. The performance of the 8 tested models on the external validation dataset were similar to that of the holdout test dataset, indicating that the models capture the key predictors of mortality. The shap analysis in both datasets showed that age, immune features (%lymphocytes, platelets) and LDH substantially impacted on all models' predictions, while creatinine and CRP varied among the different models. The model with the better performance was model 8 (60-day mortality AUROC 0.83 ± 0.06 in holdout test set, 0.79 ± 0.02 in external validation dataset). The features that had the greatest impact on this model's prediction were age, LDH, platelets, and %lymphocytes, more than comorbidities or inflammation markers, and these findings were highly consistent in both datasets, likely reflecting the virus effect at the very beginning of the disease.Entities:
Keywords: COVID-19; CRP; Coronavirus; In-hospital death; LDH; Lymphocytes; Platelets; SARS-CoV-2
Year: 2021 PMID: 34545350 PMCID: PMC8444380 DOI: 10.1016/j.crimmu.2021.09.001
Source DB: PubMed Journal: Curr Res Immunol ISSN: 2590-2555
Baseline demographics, comorbidities, clinical features at presentation, treatments and outcomes of hospitalized patients with COVID-19 in the development dataset and external validation dataset. The variables used as input variables of the models are marked asb. Comparisons were performed with either X2 test or Fisher exact tests for categorical variables, and Student's t-test for continuous variables.
| Characteristics | Development dataset | External validation dataset | p value |
|---|---|---|---|
| N. | 299 | 402 | |
| | 68.79 (11.65) | 70.21 (13.17) | 0.1384 |
| | 69.57% (208) | 67.41% (271) | 0.5446 |
| | 19.40% (58) | 5.22% (21) | <.0001 |
| | 99.33% (297) | 100% (402) | 0.1816 |
| | 15.39% (46) | 3.48% (14) | <.0001 |
| | 19.39% (58) | 19.90% (80) | 0.8686 |
| | 53.51% (160) | 46.77% (188) | 0.0773 |
| | 28.09% (84) | 24.13% (97) | 0.2356 |
| | 36.12% (108) | 7.46% (30) | <.0001 |
| | 6.35% (19) | 9.70 (39) | 0.1116 |
| | 5.69% (17) | 6.22% (25) | 0.7686 |
| | 3.34% (10) | 0.50% (2) | 0.0041 |
| | 85.62% (256) | 98.01% (394) | <.0001 |
| | 51.51% (154) | NA | – |
| | 50.17% (150) | 96.52% (388) | <.0001 |
| | NA | 95.27% (383) | – |
| | 6.02% (18) | 4.48% (18) | 0.3602 |
| | 4.01% (12) | NA | – |
| | 2.68% (8) | NA | – |
| | 96.66% (289) | 95.52% (384) | 0.4486 |
| | 248.9 (73.6) | 355.6 (116.1) | <.0001 |
| | 7.89 (4.35) | 8.13 (4.32) | 0.4637 |
| | 14.75 (9.45) | 13.28 (7.73) | 0.0235 |
| | 187.000 (82.000) | 225.000 (98.000) | <.0001 |
| | 126.3 (88.58) | 122.8 (95.7) | 0.6260 |
| | 395 [305.75–530] | 405 [304–524] | 0.9897 |
| | 53 [38–75] | 50 [36–74.25] | 0.1225 |
| | 32 [20–57] | 41 [27.75–62] | <.0001 |
| | 1.01 [0.96–1.12] | 1.04 [0.99–1.12] | 0.0018 |
| | 1.26 (0.94) | 1.53 (1.13) | 0.0011 |
| | 83.28% (249) | 28.61% (115) | <.0001 |
| | 22.75% (68) | 5.72% (23) | <.0001 |
| | 21.07% (63) | 0% (0) | <.0001 |
| | 34.45% (103) | 0.75% (3) | <.0001 |
| | 4.01% (12) | 0% (0) | <.0001 |
| | 48.16% (144) | 35.57% (143) | 0.008 |
| | 13.04% (39) | 19.65% (79) | 0.0207 |
| | 10.03% (30) | 10.70% (43) | 0.7762 |
| | 29.10% (87) | 39.55% (159) | 0.0041 |
Abbreviations: HTN: Blood hypertension, BMI: body mass index; Cardiovascular Disease: chronic heart failure, myocardial infarction, atrial fibrillation; CKD: chronic kidney disease, stage III correspond to estimated glomerular filtration rate <60 mL/min; COPD: Chronic obstructive pulmonary disease; WBC: White blood cells, PLT: platlets, CRP: C-reactive protein, LDH: lactic dehydrogenase, AST: aspartate aminotransferase; ALT: alanine aminotransferase; INR: international normalized ratio; sCr: serum Creatinine;; Antibiotics: oral Cefixime: 400 mg/day for ≥5 days; oral Azithromycin 500 mg/day for ≥5 days; oral Claritromicin 250 mg x 2/day for ≥5 days endovenoous Ceftriaxon 2 g/day for ≥5 days; endovenous piperacillina/tazobactam 4.5 mg x 3 or 4/day for ≥5 da; oral or endovenous Levofloxacin 500 mg/day for ≥5 days. HCQ: hydroxychloroquine, 200 mg 12 h apart for the first 2 doses, then 200 mg/day for ≥5 days; Oral Prednisolone or equivalents: range 5–25 mg/day for ≥5 days. NIV: Non-invasive ventilation; ICU: intesive care unit. SD = standard deviation.
Thoracic X-ray as a screening test, followed by CT-scan in doubtful cases.
O2 therapy: administered when saturation were ≤92% at resting in ambient air; required nasal canula or Venturi mask; NIV: required non-inviasive ventilation.
NIV: patients non-responsive to high-flow O2-therapy, requiring.
ICU with intubation: required intensive care unit hospitalization with intubation.
Fig. 1Clinical and laboratory features of the development dataset (A an C, in the blue panels) and of the validation dataset (B and D, in the white panels) by outcomes. * <0.05, ** <0.01, ***<0.001. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)
Mean F1-score and AUROC obtained in the cross-validation on the training set (N = 239).
| F1-score (mean ± SD) | AUROC (mean ± SD) | |
|---|---|---|
| 0.60 ± 0.06 | 0.74 ± 0.11 | |
| 0.68 ± 0.07 | 0.83 ± 0.07 | |
| 0.66 ± 0.06 | 0.84 ± 0.05 | |
| 0.69 ± 0.04 | 0.88 ± 0.04 | |
| 0.69 ± 0.15 | 0.86 ± 0.07 | |
| 0.69 ± 0.05 | 0.87 ± 0.04 | |
| 0.72 ± 0.05 | 0.87 ± 0.04 | |
| 0.68 ± 0.03 | 0.87 ± 0.03 | |
Fig. 2The impact of the input features on predictions. The shap analysis on the model with the best performance (Model 8), in the development test set (A) and the external validation dataset (B). The model includes both continuous and binary input features. Continuous features vary from low to high values, whereas binary features are either present or absent. Each dot represents the impact of a feature on the mortality prediction for one patient at entrance. The color indicates the level of contribution of each variable (with red indicating a higher impact on the prediction) and the direction the prediction towards death (right) or survival (left). (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)