| Literature DB >> 35378436 |
Isaac Shiri1, Yazdan Salimi1, Masoumeh Pakbin2, Ghasem Hajianfar3, Atlas Haddadi Avval4, Amirhossein Sanaat1, Shayan Mostafaei5, Azadeh Akhavanallaf1, Abdollah Saberi1, Zahra Mansouri1, Dariush Askari6, Mohammadreza Ghasemian7, Ehsan Sharifipour8, Saleh Sandoughdaran9, Ahmad Sohrabi10, Elham Sadati11, Somayeh Livani12, Pooya Iranpour13, Shahriar Kolahi14, Maziar Khateri15, Salar Bijari11, Mohammad Reza Atashzar16, Sajad P Shayesteh17, Bardia Khosravi18, Mohammad Reza Babaei19, Elnaz Jenabi20, Mohammad Hasanian21, Alireza Shahhamzeh22, Seyaed Yaser Foroghi Ghomi23, Abolfazl Mozafari24, Arash Teimouri13, Fatemeh Movaseghi24, Azin Ahmari25, Neda Goharpey26, Rama Bozorgmehr27, Hesamaddin Shirzad-Aski28, Roozbeh Mortazavi29, Jalal Karimi30, Nazanin Mortazavi31, Sima Besharat32, Mandana Afsharpad10, Hamid Abdollahi33, Parham Geramifar20, Amir Reza Radmard18, Hossein Arabi1, Kiara Rezaei-Kalantari3, Mehrdad Oveisi34, Arman Rahmim35, Habib Zaidi36.
Abstract
BACKGROUND: We aimed to analyze the prognostic power of CT-based radiomics models using data of 14,339 COVID-19 patients.Entities:
Keywords: COVID-19; Machine learning; Prognosis; Radiomics; X-ray CT
Mesh:
Year: 2022 PMID: 35378436 PMCID: PMC8964015 DOI: 10.1016/j.compbiomed.2022.105467
Source DB: PubMed Journal: Comput Biol Med ISSN: 0010-4825 Impact factor: 6.698
Fig. 1The flowchart of our study represents the different radiomic steps. CT images of 14,339 COVID-19 patients with overall survival outcome were collected from 19 medical centers. Whole lung segmentations were performed automatically using a deep learning-based model. The models were evaluated using 10 different splitting and cross-validation strategies, including different types of test datasets and the different metrics reported for models evaluation.”
Fig. 2Inclusion and exclusion criteria adopted in this study. Overall, 24,478 patients with CT images suspected of SAR-CoV2 infection from 19 centers were enrolled in this study and 10,139 patients were excluded based on exclusion criteria. Among the 14,339 patients with SARS-CoV2 infection confirmation used for further analysis, 12,761 and 1,578 cases were alive and deceased, respectively. The fraction of deceased patients is overrepresented in our data due to our exclusion criteria.
Demographics and data acquisition parameters across the different centers.
| Center | Number (Deceased %) | PCR | Gender | CTDIvol | Tube current (mAs) | Age | Slice thickness | |
|---|---|---|---|---|---|---|---|---|
| % of patients with available PCR result | Male | Female | Mean ± SD | Mean ± SD | Mean ± SD | Mean ± SD | ||
| 152 (26.3%) | 100% | 87 | 65 | 1.57 ± 0.32 | 45 ± 47.8 | 62.1 ± 17.0 | 1.50 ± 0.50 | |
| 264 (41.6%) | 99% | 120 | 144 | 8.47 ± 0.53 | 186.2 ± 34.0 | 71.8 ± 12.4 | 3.44 ± 0.95 | |
| 319 (1.5%) | 100% | 168 | 151 | 8.55 ± 4.61 | 185 ± 65.3 | 51.6 ± 17.9 | 2.82 ± 0.99 | |
| 373 (22.5%) | 77% | 157 | 216 | 7.52 ± 1.23 | 147.1 ± 43.2 | 54.1 ± 19.3 | 5.22 ± 0.46 | |
| 382 (19.1%) | 100% | 215 | 167 | 6.37 ± 0.67 | 128.3 ± 4.5 | 58.2 ± 18.25 | 7.67 ± 1.7 | |
| 394 (38.6%) | 100% | 257 | 137 | 5.8 ± 2.52 | 149 ± 48.3 | 56.8 ± 16.8 | 2.34 ± 0.24 | |
| 492 (5.2%) | 100% | 276 | 216 | 9.68 ± 2.91 | 349 ± 181 | 33.0 ± 5.66 | 2.57 ± 1.36 | |
| 539 (19.1%) | 94% | 287 | 252 | 4.10 ± 4.57 | 167 ± 39.1 | 54.1 ± 17.4 | 7.71 ± 0.77 | |
| 608 (3.0%) | 100% | 274 | 334 | 6.61 ± 4.87 | 164 ± 128 | 55.2 + 15.99 | 2.10 ± 1.21 | |
| 685 (14.8%) | 89% | 348 | 337 | 5.01 ± 3.57 | 171 ± 33.3 | 56.4 ± 18.7 | 7.16 ± 0.81 | |
| 866 (13.2%) | 100% | 520 | 346 | 3.40 ± 0.42 | 123.6 ± 9.6 | 55.4 ± 17.9 | 2.04 ± 0.23 | |
| 883 (11.2%) | 62% | 448 | 435 | 4.30 ± 1.54 | 166.2 ± 51.3 | 55.4 ± 15.6 | 4.99 ± 0.16 | |
| 949 (10.9%) | 76% | 568 | 381 | 5.57 ± 1.21 | 210.7 ± 39.6 | 52.6 ± 19.3 | 2.17 ± 1.51 | |
| 1022 (9.7%) | 100% | 526 | 496 | 5.01 ± 3.19 | 91.6 ± 48.6 | 56.2 ± 18.8 | 4.93 ± 0.49 | |
| 1024 (1.0%) | 29% | 542 | 482 | 6.82 ± 1.02 | 228.4 ± 27.2 | 48.3 ± 14.5 | 2.48 ± 1.96 | |
| 1180 (12.7%) | 84% | 578 | 602 | 5.63 ± 2.98 | 172 ± 34.1 | 55.4 ± 16.7 | 6.38 ± 0.82 | |
| 1323 (12.7%) | 100% | 680 | 643 | 6.14 ± 1.41 | 175 ± 38.2 | 57.4 ± 18.7 | 5.66 ± 0.82 | |
| 1355 (2.5%) | 70% | 752 | 603 | 4.21 ± 3.18 | 98.4 ± 41.1 | 55.1 ± 19.1 | 4.75 ± 0.69 | |
| 1529 (5.6%) | 93% | 814 | 715 | 8.92 ± 1.40 | 220.7 ± 27.9 | 54.5 ± 19.0 | 6.96 ± 0.34 | |
Fig. 3Different strategies implemented in this study for model evaluation. In strategies 1–6, the data were split into 70%/30% train/test sets. In strategy 7, the centers were split into 70%/30% train/test sets. In strategy 8, the model was trained on one center and evaluated on 18 centers and repeated for all centers. In strategy 9, on each of the 19 iterations, 18 centers were used as the training set, and one as the external validation set. In strategy 10, the models were built and evaluated on each center separately using data splitting of 70%/30% train/test sets, respectively.
Fig. 4Radiomics feature maps for ten different cases (5 alive and 5 deceased). Rows represent different radiomics feature map and columns represent different cases. We represented one feature from each feature sets including, 10Percentile from first order, Joint Entropy from GLCM, Dependence Variance from GLDM, Short Run High Gray Level Emphasis from GLRLM, Size Zone Non-Uniformity Normalized from GLSZM, and Strength from NGTDM.
Fig. 5Cluster heat map of radiomic features in non-harmonized data set. This illustration demonstrates that clustering cannot depict significant correlation or cluster in alive and deceased cases.
Fig. 6Radiomic features correlation using Pearson correlation in non-harmonized data set. Highly correlated features (both positive and negative) were eliminated from further analysis.
Fig. 7Heat map of AUC for cross combination of feature selectors and classifiers for the defined ten strategies. The heat map rows represent twenty-eighth cross combination of feature selectors and classifiers, whereas the columns depict strategies 1–10.
Fig. 8Heat map of sensitivity for cross combination of feature selectors and classifiers for the defined ten strategies. Heat map rows represent twenty-eighth cross combination of feature selectors and classifiers, whereas the columns depict strategies 1–10.
Fig. 9Heat map of specificity for cross combination of feature selectors and classifiers for the defined ten strategies. Heat map rows represent twenty-eighth cross combination of feature selectors and classifiers, whereas the columns depict strategies 1–10.
Fig. 10ROC curve for test sets in strategies 1–7. Strategy 1: AUC 0.84 ± 0.01 (a), Strategy 2: AUC 0.84 ± 0.01 (b), Strategy 3: AUC 0.83 ± 0.01 (c), Strategy 4: AUC 0.83 ± 0.01 (d), Strategy 5: AUC 0.83 ± 0.01 (e), Strategy 6: AUC 0.83 ± 0.01 (f), Strategy 7: AUC 0.79 ± 0.01 (g) and different strategies comparison (k). P-values for all ROCs were significant (p-value<0.05).