| Literature DB >> 35487339 |
Shivang Bhakta1, Devang K Sanghavi2, Patrick W Johnson3, Katie L Kunze4, Matthew R Neville5, Hani M Wadei6, Wendelyn Bosch7, Rickey E Carter3, Sadia Z Shah6, Benjamin D Pollock5, Sven P Oman8, Leigh Speicher9, Jason Siegel10, Claudia R Libertin7, Mark W Matson11, Pablo Moreno Franco12, Jennifer B Cowart13.
Abstract
OBJECTIVES: The emergence of SARS-CoV-2 variants of concern has led to significant phenotypical changes in transmissibility, virulence, and public health measures. Our study used clinical data to compare characteristics between a Delta variant wave and a pre-Delta variant wave of hospitalized patients.Entities:
Keywords: COVID-19; Delta variant; Genomics; Gradient boosting model; Machine learning; Variants of concern
Mesh:
Year: 2022 PMID: 35487339 PMCID: PMC9040426 DOI: 10.1016/j.ijid.2022.04.050
Source DB: PubMed Journal: Int J Infect Dis ISSN: 1201-9712 Impact factor: 12.074
Figure 1Epidemiologic curve showing 2 COVID-19 disease admission waves in Mayo Clinic, Florida. The number of new COVID-19 cases (y-axis) is shown as the absolute number of admissions per day. The blue-colored first wave comprises a heterogeneous array of SARS-CoV-2 variants, named Wave 1. The red-colored second wave shows Wave 2. A buffer period of 12.2 weeks was established after the end of the first wave.
Patient characteristics, comorbidities, and outcomes stratified by wave.
| Wave 1 (n=653) | Wave 2 (n=665) | Total (N=1318) | Standardized difference | |
|---|---|---|---|---|
| Age (years) | 67 (20, 103) | 60 (21, 101) | 64 (20, 103) | |
| Sex (male) | 392 (60.0%) | 388 (58.3%) | 780 (59.2%) | 3.4% |
| Race | ||||
| American Indian/Alaskan Native | 1 (0.2%) | 2 (0.3%) | 3 (0.2%) | |
| Asian | 38 (5.8%) | 27 (4.1%) | 65 (4.9%) | |
| Black or African American | 61 (9.3%) | 83 (12.5%) | 144 (10.9%) | |
| Pacific Islander | 1 (0.2%) | 0 (0.0%) | 1 (0.1%) | |
| White | 533 (81.6%) | 524 (78.8%) | 1057 (80.2%) | |
| Other/Unknown | 19 (2.9%) | 29 (4.4%) | 48 (3.6%) | |
| Ethnicity | 9.7% | |||
| Hispanic | 27 (4.1%) | 37 (5.6%) | 64 (4.9%) | |
| Non-Hispanic | 618 (94.6%) | 614 (92.3%) | 1232 (93.5%) | |
| Unknown | 8 (1.2%) | 14 (2.1%) | 22 (1.7%) | |
| Chronic kidney disease | 64 (9.8%) | 44 (6.6%) | 108 (8.2%) | |
| Chronic lung disease | 391 (59.9%) | 507 (76.2%) | 898 (68.1%) | |
| Congenital heart disease | 7 (1.1%) | 6 (0.9%) | 13 (1.0%) | 1.7% |
| Congestive heart failure | 104 (15.9%) | 72 (10.8%) | 176 (13.4%) | |
| Coronary artery disease | 171 (26.2%) | 119 (17.9%) | 290 (22.0%) | |
| Diabetes mellitus | 174 (26.6%) | 162 (24.4%) | 336 (25.5%) | 5.2% |
| Hypertension | 433 (66.3%) | 342 (51.4%) | 775 (58.8%) | |
| Immunosuppression | 125 (19.1%) | 84 (12.6%) | 209 (15.9%) | |
| Overall COVID-19 risk of complications score | 4 (0, 10) | 3 (0, 9) | 4 (0, 10) | |
| End stage renal disease | 50 (7.7%) | 38 (5.7%) | 88 (6.7%) | 7.8% |
| Monoclonal antibodies | 33 (5.1%) | 32 (4.8%) | 65 (4.9%) | 1.1% |
| Dialysis | 23 (3.5%) | 13 (2.0%) | 36 (2.7%) | 9.6% |
| Transplant patient | 92 (14.1%) | 72 (10.8%) | 164 (12.4%) | 9.9% |
| Solid organ transplant | 68 (10.4%) | 42 (6.3%) | 110 (8.3%) | |
| Solid organ transplant type | ||||
| Heart | 7 (10.3%) | 4 (9.5%) | 11 (10.0%) | |
| Kidney | 35 (51.5%) | 24 (57.1%) | 59 (53.6%) | |
| Liver | 10 (14.7%) | 6 (14.3%) | 16 (14.5%) | |
| Lung | 15 (22.1%) | 8 (19.0%) | 23 (20.9%) | |
| Pancreas | 1 (1.5%) | 0 (0.0%) | 1 (0.9%) | |
| Vaccination status | ||||
| Unvaccinated | 619 (94.8%) | 474 (71.3%) | 1093 (82.9%) | |
| Partially vaccinated | 29 (4.4%) | 40 (6.0%) | 69 (5.2%) | |
| Breakthrough | 5 (0.8%) | 151 (22.7%) | 156 (11.8%) | |
| Vaccine type at first immunization | ||||
| Johnson & Johnson | 1 (2.9%) | 13 (6.8%) | 14 (6.2%) | |
| Moderna | 14 (41.2%) | 57 (29.8%) | 71 (31.6%) | |
| Pfizer | 19 (55.9%) | 121 (63.4%) | 140 (62.2%) | |
| Reason for testing | ||||
| N-Miss | 468 | 164 | 632 | |
| Asymptomatic | 44 (23.8%) | 19 (3.8%) | 63 (9.2%) | |
| Symptomatic | 141 (76.2%) | 482 (96.2%) | 623 (90.8%) | |
| Anti-spike antibody test | ||||
| N-Miss | 534 | 43 | 577 | |
| Negative (< 0.8 U/mL) | 31 (26.1%) | 268 (43.1%) | 299 (40.4%) | |
| Positive (≥ 0.8 U/mL) | 88 (73.9%) | 354 (56.9%) | 442 (59.6%) | |
| Positive anti-nucleocapsid antibody | ||||
| N-Miss | 101 | 206 | 307 | |
| Negative | 358 (64.9%) | 334 (72.8%) | 692 (68.4%) | |
| Positive | 194 (35.1%) | 125 (27.2%) | 319 (31.6%) | |
| Critical care services | 177 (27.1%) | 261 (39.2%) | 438 (33.2%) | |
| Mechanical ventilation | 52 (8.0%) | 70 (10.5%) | 122 (9.3%) | 8.9% |
| Length of stay (days) | ||||
| N-Miss | 0 | 2 | 2 | |
| Median (range) | 5 (1–193) | 5 (1–155) | 5 (1–193) | |
| Deceased | 109 (16.7%) | 85 (12.8%) | 194 (14.7%) |
Categorical data are shown as count (percent). Numeric data are presented as median (range).
Standardized difference = difference in proportions divided by standard error; imbalance defined as absolute value greater than 10% (text in bold formatting).
Immunosuppression status was attributed to the following patients: diagnosed with human immunodeficiency virus infection, actively receiving chemotherapy, receiving immunosuppressive medications, or diagnosed with iatrogenic immunosuppression.
Figure 2Patient characteristics and outcomes between Wave 1 and Wave 2. (a) Age; (b) chronic kidney disease; (c) chronic lung disease; (d) hypertension; (e) death; (f) intensive care unit; (g) mechanical ventilation; (h) length of hospital stay.
Laboratory assays stratified by wave.
| Wave 1 (n=653) | Wave 2 (n=665) | Total (N=1318) | ||
|---|---|---|---|---|
| Activated partial thromboplastin Time | ||||
| N | 334 | 469 | 803 | |
| Median (range) | 31.0 (17.0–225.0) | 30.0 (17.0–300.0) | 30.0 (17.0–300.0) | |
| C-reactive protein | ||||
| N | 617 | 639 | 1256 | |
| Median (range) | 53.8 (1.5–400.0) | 71.2 (1.5–400.0) | 63.2 (1.5–400.0) | |
| Creatinine | ||||
| N | 626 | 623 | 1249 | |
| Median (range) | 1.0 (0.3–12.5) | 0.9 (0.2–20.3) | 1.0 (0.2–20.3) | |
| D-dimer | 0.79 | |||
| N | 615 | 636 | 1251 | |
| Median (range) | 825.0 (110.0–42000.0) | 841.5 (110.0–42000.0) | 831.0 (110.0–42000.0) | |
| Ferritin | ||||
| N | 604 | 611 | 1215 | |
| Median (range) | 365.5 (9.0–17569.0) | 513.0 (5.0–30714.0) | 433.0 (5.0–30714.0) | |
| Fibrinogen | ||||
| N | 391 | 514 | 905 | |
| Median (range) | 513.0 (108.0–1000.0) | 567.0 (76.0–1000.0) | 543.0 (76.0–1000.0) | |
| Interleukin-6 | ||||
| N | 541 | 599 | 1140 | |
| Median (range) | 38.0 (1.0–3543.0) | 44.0 (1.0–4500.0) | 41.0 (1.0–4500.0) | |
| International normalized ratio | ||||
| N | 586 | 619 | 1205 | |
| Median (range) | 1.2 (0.8–5.4) | 1.2 (0.9–5.2) | 1.2 (0.8–5.4) | |
| Lactate dehydrogenase | ||||
| N | 597 | 621 | 1218 | |
| Median (range) | 268.0 (87.0–25000.0) | 347.0 (65.0–3360.0) | 299.0 (65.0–25000.0) | |
| Lymphocytes, absolute | 0.18 | |||
| N | 587 | 619 | 1206 | |
| Median (range) | 0.9 (0.1–94.1) | 0.9 (0.1–105.1) | 0.9 (0.1–105.1) | |
| Mean platelet volume | 0.27 | |||
| N | 605 | 643 | 1248 | |
| Median (range) | 10.3 (8.0–14.2) | 10.2 (8.0–14.7) | 10.2 (8.0–14.7) | |
| Neutrophils, percentage | ||||
| N | 587 | 619 | 1206 | |
| Median (range) | 74.8 (5.7–96.2) | 78.3 (3.7–96.6) | 76.6 (3.7–96.6) | |
| Neutrophils, absolute | ||||
| N | 587 | 619 | 1206 | |
| Median (range) | 4.5 (0.3–32.3) | 5.2 (0.6–23.1) | 4.8 (0.3–32.3) | |
| Platelet count | 0.18 | |||
| N | 612 | 647 | 1259 | |
| Median (range) | 189.0 (2.0–1120.0) | 195.0 (4.0–667.0) | 193.0 (2.0–1120.0) | |
| Procalcitonin | 0.42 | |||
| N | 590 | 631 | 1221 | |
| Median (range) | 0.1 (0.0–96.9) | 0.1 (0.0–140.6) | 0.1 (0.0–140.6) | |
| Prothrombin time | ||||
| N | 586 | 619 | 1205 | |
| Median (range) | 13.2 (9.5–62.4) | 12.6 (9.7–58.0) | 12.9 (9.5–62.4) |
Laboratory assays at the first test during a patient's admission. For values below the lower limit of detection, values were imputed to half the distance between 0 and the lower limit. For values above the upper limit of detection, values were winsorized at the upper limit.
P-values arise from Kruskal-Wallis rank-sum tests. Values in bold formatting are statistically significant (p < 0.05).
Figure 3Shapley additive explanations (SHAP) plot for the gradient boosting model. (a) The figure plots every patient in the analysis as a point. The y-axis lists the input variables. The x-axis is a metric of the SHAP value associated with each variable and patient within the dataset (i.e., points plotted for each case based on the impact on prediction). The points plotted on the far-left have a greater impact on Wave 1 prediction and points plotted on the right have a greater impact on Wave 2 prediction. The normalized value of observation is color-based (red = higher values; blue = lower values). (b) The bar graph shows the input variables’ importance for wave prediction. The scaled importance is color-based (red = higher importance; blue = lower importance).
Figure 4Model performance and diagnostic summaries for the gradient boosting model.
The left panel shows a receiver operating characteristic curve along with associated diagnostic metrics. The blue circle represents the selected threshold, which was determined by the optimal F1 score in the training data. The right panel is a confusion matrix displaying false and true negatives/positives and associated metrics of specificity, sensitivity, negative predictive probability (NPV), and positive predictive probability (PPV). In all cases, metrics are associated with test data that were not used during model development or selection.