| Literature DB >> 33739635 |
Subhanik Purkayastha1, Yanhe Xiao2, Zhicheng Jiao3, Rujapa Thepumnoeysuk1, Kasey Halsey1,4, Jing Wu2, Thi My Linh Tran1,4, Ben Hsieh1,4, Ji Whae Choi1,4, Dongcui Wang2, Martin Vallières5, Robin Wang3, Scott Collins1, Xue Feng6, Michael Feldman7, Paul J Zhang7, Michael Atalay1, Ronnie Sebro3, Li Yang2, Yong Fan3, Wei Hua Liao8, Harrison X Bai1,9.
Abstract
OBJECTIVE: To develop a machine learning (ML) pipeline based on radiomics to predict Coronavirus Disease 2019 (COVID-19) severity and the future deterioration to critical illness using CT and clinical variables.Entities:
Keywords: COVID-19; CT; Machine learning; Radiomics; Severity
Mesh:
Year: 2021 PMID: 33739635 PMCID: PMC8236359 DOI: 10.3348/kjr.2020.1104
Source DB: PubMed Journal: Korean J Radiol ISSN: 1229-6929 Impact factor: 3.500
Fig. 1Illustration of patient inclusion and exclusion.
Adapted from Zhang et al. Cell 2020;181:1423-1433.e11 [16]. HUP = Hospital of the University of Pennsylvania, RIH = Rhode Island Hospital, RT-PCR = reverse transcriptase-polymerase chain reaction
Fig. 2Illustration of our analysis pipeline.
A. Radiomics feature representation. For each patient, 1583 radiomics features were extracted from automatically segmented lung regions. B. Radiomics based severity prediction. Binary classifiers were applied to classify the patients into severe or non-severe classes based on the radiomics features. C. Radiomics based progression prediction. A random survival forest model was optimized based on the 1583 radiomics features to assign risk scores to different subjects. D. Clinically based progression prediction. Fifteen clinical variables extracted from demographic recordings were input to another survival forest model to assign risk scores to different subjects. Finally, for each patient, the deep learning-based and clinical-based predictions were added with two balanced weights to obtain the combined progression risk score.
Comparison of Patient Characteristics Across the Training, Validation, and Test Sets
| Training Set (n = 687) | Validation Set (n = 97) | Test Set (n = 197) | |||
|---|---|---|---|---|---|
| Age, year | 0.393 | ||||
| Median ± interquartile range | 49 ± 24 (range of 0–92) | 48 ± 27 (range of 0–85) | 49 ± 28 (range of 0–87) | ||
| < 20 | 24 (3) | 4 (4) | 10 (5) | ||
| 20–39 | 169 (25) | 29 (30) | 57 (29) | ||
| 40–59 | 298 (43) | 34 (35) | 70 (36) | ||
| ≥ 60 | 196 (29) | 30 (31) | 60 (30) | ||
| Sex | 0.954 | ||||
| Male | 351 (51) | 49 (51) | 103 (52) | ||
| Female | 332 (48) | 47 (48) | 93 (47) | ||
| Presence of fever | 0.942 | ||||
| Fever | 297 (43) | 38 (39) | 87 (42) | ||
| No fever | 118 (17) | 15 (15) | 35 (15) | ||
| White blood cell count | 0.397 | ||||
| Elevated | 45 (7) | 9 (9) | 12 (6) | ||
| Normal | 370 (54) | 45 (46) | 108 (55) | ||
| Lymphocyte count | 0.613 | ||||
| Normal | 182 (26) | 30 (31) | 55 (28) | ||
| Decreased | 254 (37) | 32 (33) | 74 (38) | ||
| Comorbidities | |||||
| Cardiovascular disease | 50 (7) | 11 (11) | 15 (8) | 0.316 | |
| Hypertension | 94 (14) | 14 (14) | 32 (16) | 0.817 | |
| COPD | 20 (3) | 3 (3) | 7 (4) | 0.946 | |
| Diabetes | 48 (7) | 10 (10) | 24 (12) | 0.076 | |
| Chronic liver disease | 18 (3) | 2 (2) | 4 (2) | 0.822 | |
| Chronic kidney disease | 16 (2) | 2 (2) | 7 (4) | 0.689 | |
| Malignant tumor | 13 (2) | 2 (2) | 2 (1) | 0.628 | |
| HIV | 0 (0) | 0 (0) | 0 (0) | 1.000 | |
| Outcomes* | |||||
| Ventilator | 64 (9) | 9 (9) | 20 (10) | 0.991 | |
| Intensive care unit | 76 (11) | 11 (11) | 25 (13) | 0.924 | |
| Death | 20 (3) | 1 (1) | 3 (2) | 0.312 | |
| Unknown critical† | 104 (15) | 13 (13) | 27 (14) | 0.882 | |
| Discharged | 235 (34) | 30 (31) | 70 (36) | 0.833 | |
| Progression to critical event, days | 0.149 | ||||
| Median | 0.72 (range of 0–21) | 0.59 (range of 0–30) | 0.08 (range of 0–13) | ||
| Day 1 | 111 (16) | 14 (14) | 38 (19) | ||
| Day 2 | 16 (2) | 6 (6) | 3 (2) | ||
| Day 3 | 8 (1) | 2 (2) | 2 (1) | ||
| ≥ Day 4 | 56 (8) | 5 (5) | 12 (6) | ||
| Progression to discharge, days | 0.244 | ||||
| Median | 12 (range of 0–46) | 11 (range of 0.2–31) | 11.6 (range of 0–38) | ||
| 0–4 | 33 (5) | 7 (7) | 17 (9) | ||
| 5–9 | 89 (13) | 10 (10) | 25 (13) | ||
| 10–14 | 146 (21) | 24 (25) | 41 (21) | ||
| ≥ 15 | 150 (22) | 16 (16) | 33 (17) | ||
| Epidemiologic contact | |||||
| Epicenter‡ | 129 (19) | 9 (9) | 30 (15) | 0.031 | |
| COVID-19 patient | 87 (13) | 13 (13) | 31 (16) | 0.762 | |
Unless specified otherwise, data are number of patients with the percentage in parentheses. *Patients with multiple critical outcomes may be counted in multiple categories, †For patients from public data source (Adapted from Zhang et al. Cell 2020;181:1423-1433.e11 [16]), the type of critical condition was not specified, ‡Epidemiologic contact with epicenter includes patients who have visited Wuhan, China and New York, NY, USA. COPD = chronic obstructive pulmonary disease, HIV = human immunodeficiency virus
Clinical Characteristics of Critical and Non-Critical COVID-19 Patients
| Critical (n = 274) | Non-Critical (n = 707) | |||
|---|---|---|---|---|
| Age, year | < 0.001 | |||
| Median ± interquartile range | 57.5 ± 23.8 (range of 0 to 92) | 46 ± 22.5 (range of 0 to 84) | ||
| < 20 | 18 (7) | 20 (3) | ||
| 20–39 | 29 (11) | 226 (32) | ||
| 40–59 | 100 (36) | 302 (43) | ||
| ≥ 60 | 127 (46) | 159 (22) | ||
| Sex | 0.273 | |||
| Male | 148 (54) | 355 (50) | ||
| Female | 124 (45) | 348 (49) | ||
| Presence of fever | < 0.001 | |||
| Fever | 103 (38) | 319 (45) | ||
| No fever | 20 (7) | 148 (21) | ||
| White blood cell count | < 0.001 | |||
| Elevated | 45 (16) | 21 (3) | ||
| Normal | 79 (29) | 444 (63) | ||
| Lymphocyte count | 0.001 | |||
| Normal | 78 (28) | 189 (27) | ||
| Decreased | 45 (16) | 215 (30) | ||
| Comorbidities | ||||
| Cardiovascular disease | 42 (15) | 34 (5) | < 0.001 | |
| Hypertension | 62 (23) | 78 (11) | < 0.001 | |
| COPD | 15 (5) | 15 (2) | < 0.001 | |
| Diabetes | 36 (13) | 46 (7) | < 0.001 | |
| Chronic liver disease | 6 (2) | 18 (3) | 0.495 | |
| Chronic kidney disease | 19 (7) | 6 (1) | < 0.001 | |
| Malignant tumor | 9 (3) | 8 (1) | < 0.001 | |
| HIV | 0 (0) | 0 (0) | 1.000 | |
| Outcomes* | ||||
| Ventilator | 93 (34) | N/A | ||
| Intensive care unit | 112 (41) | N/A | ||
| Death | 24 (9) | N/A | ||
| Unknown critical† | 144 (53) | N/A | ||
| Progression to critical event, days | ||||
| Median | 0.3 (range of 0 to 30) | N/A | ||
| Day 1 | 163 (59) | N/A | ||
| Day 2 | 15 (5) | N/A | ||
| Day 3 | 12 (4) | N/A | ||
| > Day 3 | 73 (27) | N/A | ||
| Epidemiologic Contact | ||||
| Epicenter‡ | 14 (5) | 154 (22) | < 0.001 | |
| COVID-19 patients | 26 (9) | 105 (15) | 0.662 | |
Unless specified otherwise, data are number of patients with the percentage in parentheses. *Patients with multiple critical outcomes may be counted in multiple categories, †For patients from public data source (Adapted from Zhang et al. Cell 2020;181:1423-1433.e11 [16]), the type of critical condition was not specified, ‡Epidemiologic contact with epicenter includes patients who have visited Wuhan, China and New York, NY, USA. COPD = chronic obstructive pulmonary disease, HIV = human immunodeficiency virus
Performance Metrics of Our Manually Optimized ML Pipelines Predicting Severity on the Test Set Using Radiomics Features Alone, Clinical Variables Alone, Combined Radiomics and Clinical Variables, and Visual CT Severity Score and Clinical Variables
| Dataset | Pipeline | AUC | Accuracy | PPV | NPV | Sensitivity | Specificity | |
|---|---|---|---|---|---|---|---|---|
| Radiomics | TSCR + KNN | 0.74 | 0.79 | 0.68 | 0.84 | 0.62 | 0.85 | 0.147 |
| Lower 95% CI | 0.72 | 0.77 | 0.66 | 0.83 | 0.60 | 0.82 | - | |
| Upper 95% CI | 0.75 | 0.81 | 0.70 | 0.86 | 0.65 | 0.87 | - | |
| Clinical | CHSQ + BY | 0.70 | 0.68 | 0.61 | 0.78 | 0.73 | 0.67 | 0.023 |
| Lower 95% CI | 0.67 | 0.66 | 0.57 | 0.76 | 0.71 | 0.65 | - | |
| Upper 95% CI | 0.72 | 0.71 | 0.63 | 0.80 | 0.75 | 0.70 | - | |
| Radiomics + clinical | CHSQ + KNN | 0.76 | 0.80 | 0.69 | 0.87 | 0.62 | 0.87 | - |
| Lower 95% CI | 0.73 | 0.77 | 0.65 | 0.85 | 0.59 | 0.85 | - | |
| Upper 95% CI | 0.79 | 0.82 | 0.72 | 0.89 | 0.65 | 0.89 | - | |
| Visual CT severity score + clinical | CHSQ + BST | 0.70 | 0.77 | 0.60 | 0.79 | 0.56 | 0.85 | 0.023 |
| Lower 95% CI | 0.67 | 0.74 | 0.57 | 0.77 | 0.53 | 0.83 | - | |
| Upper 95% CI | 0.73 | 0.79 | 0.62 | 0.82 | 0.59 | 0.87 | - |
*P value in comparison with the radiomics + clinical model AUC. AUC = area under the curve, BST = boosting, BY = bayesian, CHSQ = chi-square score, CI = confidence interval, KNN = k-nearest neighbors, NPV = negative predictive value, PPV = positive predictive value, TSCR = t test score
Performance Metrics of Our Radiomics-Based, Clinical-Based, Combined Radiomics and Clinical-Based, Visual CT Severity Score, and Combined Clinical and Visual CT Severity Score-Based Progression Prediction Models
| Metric | Clinical | Radiomics | Clinical + Radiomics | Visual CT Severity Score | Clinical + Visual CT Severity Score |
|---|---|---|---|---|---|
| iAUC | 0.814 | 0.775 | 0.829 | 0.740 | 0.829 |
| Standard error | 0.023 | 0.028 | 0.023 | 0.030 | 0.017 |
| Lower 95% CI | 0.768 | 0.720 | 0.784 | 0.682 | 0.795 |
| Upper 95% CI | 0.859 | 0.829 | 0.873 | 0.799 | 0.863 |
| C-index | 0.847 | 0.767 | 0.868 | 0.742 | 0.860 |
| Standard error | 0.023 | 0.031 | 0.020 | 0.034 | 0.020 |
| Lower 95% CI | 0.803 | 0.706 | 0.830 | 0.676 | 0.820 |
| Upper 95% CI | 0.892 | 0.828 | 0.907 | 0.809 | 0.900 |
| 3-day ROC AUC | 0.874 | 0.792 | 0.897 | 0.807 | 0.910 |
| Standard error | 0.029 | 0.040 | 0.025 | 0.041 | 0.023 |
| Lower 95% CI | 0.816 | 0.714 | 0.848 | 0.726 | 0.865 |
| Upper 95% CI | 0.931 | 0.870 | 0.947 | 0.888 | 0.955 |
| 5 day ROC AUC | 0.918 | 0.812 | 0.933 | 0.783 | 0.932 |
| Standard error | 0.022 | 0.037 | 0.019 | 0.041 | 0.018 |
| Lower 95% CI | 0.875 | 0.739 | 0.896 | 0.702 | 0.896 |
| Upper 95% CI | 0.961 | 0.884 | 0.971 | 0.864 | 0.968 |
| 7-day ROC AUC | 0.897 | 0.817 | 0.927 | 0.764 | 0.907 |
| Standard error | 0.025 | 0.036 | 0.020 | 0.041 | 0.025 |
| Lower 95% CI | 0.847 | 0.746 | 0.888 | 0.683 | 0.858 |
| Upper 95% CI | 0.946 | 0.888 | 0.966 | 0.845 | 0.956 |
AUC = area under the curve, CI = confidence interval, iAUC = incremental AUC, ROC = receiver operating characteristic
Fig. 3Time-dependent ROC curves and AUCs for days 3, 5, and 7 for three progression models.
A–C. The results for the three models are shown: one trained on radiomics features, one trained on clinical variables, and one trained on the combination of radiomics features and clinical variables. The x-axis represents the false-positive rate and the y-axis represents the true-positive rate. AUC = area under the curve, ROC = receiver operating characteristic