| Literature DB >> 34671777 |
Guillermo Carbonell1,2,3,4, Diane Marie Del Valle5,6,7, Edgar Gonzalez-Kozlova5,6,7,8, Brett Marinelli1, Emma Klein2, Maria El Homsi1,9, Daniel Stocker2,10, Michael Chung1, Adam Bernheim1, Nicole W Simons8, Jiani Xiang11, Sharon Nirenberg8,11, Patricia Kovatch8,11, Sara Lewis1,2, Miriam Merad5,6,7, Sacha Gnjatic5,6,7,12, Bachir Taouli1,2,7.
Abstract
Despite extraordinary international efforts to dampen the spread and understand the mechanisms behind SARS-CoV-2 infections, accessible predictive biomarkers directly applicable in the clinic are yet to be discovered. Recent studies have revealed that diverse types of assays bear limited predictive power for COVID-19 outcomes. Here, we harness the predictive power of chest CT in combination with plasma cytokines using a machine learning approach for predicting death during hospitalization and maximum severity degree in COVID-19 patients. Patients (n=152) from the Mount Sinai Health System in New York with plasma cytokine assessment and a chest CT within 5 days from admission were included. Demographics, clinical, and laboratory variables, including plasma cytokines (IL-6, IL-8, and TNF-α) were collected from the electronic medical record. We found that chest CT combined with plasma cytokines were good predictors of death (AUC 0.78) and maximum severity (AUC 0.82), whereas CT quantitative was better at predicting severity (AUC 0.81 vs 0.70) while cytokine measurements better predicted death (AUC 0.70 vs 0.66). Finally, we provide a simple scoring system using plasma IL-6, IL-8, TNF-α, GGO to aerated lung ratio and age as novel metrics that may be used to monitor patients upon hospitalization and help physicians make critical decisions and considerations for patients at high risk of death for COVID-19.Entities:
Year: 2021 PMID: 34671777 PMCID: PMC8528085 DOI: 10.1101/2021.10.11.21264709
Source DB: PubMed Journal: medRxiv
Patient demographics, clinical and outcome data (n=152).
Data are numbers of patients with percentages between parentheses.
|
| 61 (48 – 71) |
|
| 90 (59.2%) |
|
| |
| - Hispanic | 37/152 (24.3%) |
| - African American | 35/152 (23.0%) |
| - Asian | 12/152 (8.0%) |
| - White | 35/152 (23.0%) |
| - Other | 70/152 (46.1%) |
| 51 (33.6%) | |
|
| |
| - Normal (395%) | 50/152 (32.9) |
| - Abnormal (<95%) | 102/152 (67.1) |
|
| |
| - Asthma | 16/151 (10.6%) |
| - Atrial Fibrillation | 12/151 (7.94%) |
| - Cancer (active) | 26/151 (17.2%) |
| - Chronic Kidney Disease | 16/151 (10.6%) |
| - Congestive Heart Failure | 14/151 (9.27%) |
| - COPD | 13/151 (8.61%) |
| - Diabetes | 43/151 (28.5%) |
| - HIV | 6/151 (4.00%) |
| - Hypertension | 52/151 (34.4%) |
| - Obstructive Sleep Apnea | 5/151 (3.31%) |
|
| |
| - Current | 12 (9.09%) |
| - History | 39 (61.4%) |
|
| |
| - Anosmia/Ageusia | 1/152 (0.658%) |
| - Congestion/Runny Nose | 10/152 (6.58%) |
| - Cough | 78/152 (51.3%) |
| - Diarrhea | 26/152 (17.1%) |
| - Fatigue | 49/152 (32.2%) |
| - Fever | 83/152 (55%) |
| - Headache | 9/152 (5.92%) |
| - Myalgias | 20/152 (13.2%) |
| - Nausea/Vomiting | 42/152 (27.6%) |
| - Shortness of breath | 105/152 (69.1%) |
| - Sore throat | 4/152 (2.63%) |
|
| |
| - Mild (3–4) | 78/150 (52.0%) |
| - Severe (5–7) | 72/150 (48%) |
|
| |
| - ICU admission | 43/152 (65.4%) |
| - Acute Respiratory Distress Syndrome | 10/151 (15.1%) |
| - Died during hospital admission | 26/152 (17.1%) |
Cytokine assessment, CT qualitative score and CT quantitative analysis. Data are medians with interquartile ranges (IQR) in parentheses. GGO: ground-glass opacities.
|
| |
| IL-6 (pg/mL) | 61.0 (22.8–146.3) |
| IL-8 (pg/mL) | 35.0 (20.0–59.9) |
| TNF-α (pg/mL) | 18.5 (13.0–27.4) |
|
| |
|
| 9 (5–14) |
|
| |
|
| |
| Total lung volume (mL) | 2713 (2081–3505) |
| Well aerated lung volume (mL) | 1982 (1420–2903) |
| GGO volume (mL) | 344.5 (208.2–509.6) |
| Consolidation volume (mL) | 98.20 (49.68–252.6) |
| GGO/well-aerated lung ratio | 0.1774 (0.0767–0.3673) |
Figure 2.Key differences between patients who died and patients who survived.
We conducted a Wilcoxon rank test for the variables collected. The number of * indicates significance (*<0.05. **<0.01, ***<0.001, ****<0.0001).
Figure 1.CT qualitative score and CT quantitative analysis.
Male patient with COVID-19. A) CT demonstrates multifocal ground-glass opacities and regions of consolidation in the right lower. The qualitative score established by a radiologist is based on the percentage of lung involvement per lobe (shown on the right, range 0–20). B) CT quantitative analysis using segmentation software. Quantitative analysis extracts volumetric measurements (shown on the right) representing the aerated lung, the ground glass opacities (GGO) volume, the consolidation volume and the GGO to aerated lung ratio. RUL: right upper lobe; RLL: right lower lobe; ML: middle lobe; LUL: left upper lobe; LLL: left lower lobe.
Figure 3.Data analysis overview.
The flowchart shows how we start from 4 different scenarios (cytokines, CT qualitative, CT quantitative, combined) with the endpoints of death and maximum severity. First, we explore potential biases in our dataset by testing with several statistical approaches such as correlations, Fisher exact and independence, and Wilcoxon rank test. Next, we evaluate the probability of survival using Cox proportional hazard models to identify potential markers. Then, we use elastic net regression to explore further the predictive capabilities to separate patients that survive or not per scenario. To ensure we correctly test our hypothesis per scenario, we perform a combination of random testing/training sets and cross-fold validation to identify the predictive value of each scenario. Then, we use a coefficient-based selection to filter the significant (adjusted p-value<0.05) models and select the variables relevant for predicting. Finally, we use the variables from our best minimal model to build a risk prediction nomogram.
Figure 4.Power of chest CT and cytokines for prediction of death and maximum severity score.
We tested the 5 scenarios to evaluate their relevance for prediction of death and maximum severity. A) Average ROC curves derived using risk of death are showed for each scenario. B) Boxplot showing the AUC values for all significant (p-value<0.05) models build per scenario for risk of death. C) Average ROC curves derived for maximum severity per scenario. D) Boxplot showing the AUC values for all significant (p-value<0.05) models build per scenario for severity.
Finally, to simplify our findings we took advantage of our elastic net regression interpretability to distillate probabilistic model for scoring risk or nomogram. The nomogram uses the glmnet selected variables GGO to aerated lung ratio, age, TNF-α, IL-6 and IL-8 to provide a score for risk of death (Suppl. Figure 11)[18 20].