| Literature DB >> 33177576 |
Subhanik Purkayastha1, Yijun Zhao2, Jing Wu2, Rong Hu3, Aidan McGirr4, Sukhdeep Singh4, Ken Chang5, Raymond Y Huang6, Paul J Zhang7, Alvin Silva4, Michael C Soulen8, S William Stavropoulos8, Zishu Zhang2, Harrison X Bai9.
Abstract
Pre-treatment determination of renal cell carcinoma aggressiveness may help guide clinical decision-making. We aimed to differentiate low-grade (Fuhrman I-II) from high-grade (Fuhrman III-IV) renal cell carcinoma using radiomics features extracted from routine MRI. 482 pathologically confirmed renal cell carcinoma lesions from 2008 to 2019 in a multicenter cohort were retrospectively identified. 439 lesions with information on Fuhrman grade from 4 institutions were divided into training and test sets with an 8:2 split for model development and internal validation. Another 43 lesions from a separate institution were set aside for independent external validation. The performance of TPOT (Tree-Based Pipeline Optimization Tool), an automatic machine learning pipeline optimizer, was compared to hand-optimized machine learning pipeline. The best-performing hand-optimized pipeline was a Bayesian classifier with Fischer Score feature selection, achieving an external validation ROC AUC of 0.59 (95% CI 0.49-0.68), accuracy of 0.77 (95% CI 0.68-0.84), sensitivity of 0.38 (95% CI 0.29-0.48), and specificity of 0.86 (95% CI 0.78-0.92). The best-performing TPOT pipeline achieved an external validation ROC AUC of 0.60 (95% CI 0.50-0.69), accuracy of 0.81 (95% CI 0.72-0.88), sensitivity of 0.12 (95% CI 0.14-0.30), and specificity of 0.97 (95% CI 0.87-0.97). Automated machine learning pipelines can perform equivalent to or better than hand-optimized pipeline on an external validation test non-invasively predicting Fuhrman grade of renal cell carcinoma using conventional MRI.Entities:
Year: 2020 PMID: 33177576 PMCID: PMC7658976 DOI: 10.1038/s41598-020-76132-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Patient demographics, clinical features and tumor characteristics for overall cohort in training, validation, and test sets.
| Training set | Validation set | Test set | ||
|---|---|---|---|---|
| N = 351 | N = 88 | N = 43 | ||
| Age, median, range (years) | 60.0 (27–92) | 62.5 (30–81) | 64.0 (38–85) | 0.004* |
| Gender | 0.092 | |||
| Male | 236 (67.2%) | 61 (69.3%) | 42 (82.4%) | |
| Female | 115 (32.8%) | 27 (30.7%) | 9 (17.6%) | |
| Race | 0.034* | |||
| White | 241 (68.7%) | 55 (62.5%) | 45 (88.2%) | |
| Black | 60 (17.1%) | 21 (23.9%) | 6 (11.8%) | |
| Asian | 32 (9.1%) | 9 (10.2%) | 0 (18.1%) | |
| Unknown | 18 (5.1%) | 3 (3.4%) | 0 (4.1%) | |
| Von Hipple– Lindau syndrome | 6 (1.7%) | 1 (1.1%) | 0 (0%) | 0.425 |
| Subtype | < 0.001* | |||
| Clear cell | 249 (71.0%) | 66 (75.0%) | 22 (43.1%) | |
| Papillary | 78 (22.2%) | 9 (10.2%) | 22 (43.1%) | |
| Chromophobe | 1 (0.3%) | 3 (3.4%) | 5 (9.8%) | |
| Clear cell papillary | 15 (4.3%) | 8 (9.1%) | 0 (0%) | |
| Multilocular cystic | 3 (0.9%) | 1 (1.1%) | 2 (3.9%) | |
| Unclassified | 5 (1.4%) | 1 (1.1%) | 0 (0%) | |
| Laterality | 0.820 | |||
| Left | 166 (47.3%) | 40 (45.5%) | 26 (51.0%) | |
| Right | 185 (52.7%) | 48 (54.5%) | 25 (49.0%) | |
| Location | 0.367 | |||
| Upper | 116(33.0%) | 36 (40.9%) | 13 (25.5%) | |
| Interpole | 142 (40.5%) | 33 (37.5%) | 21 (41.2%) | |
| Lower | 93 (26.5%) | 19 (21.6%) | 17 (33.3%) | |
| Tumor size, median, range (cm) | 3.5 (0.9–18.7) | 3.3 (1.0–17.2) | 3.0 (0.2–15.5) | 0.693 |
| Renal vein invasion | 36 (10.3%) | 9 (10.2%) | 0 (13.3%) | 0.006* |
| Histological grade | 0.351 | |||
| Low grade | 226 (64.4%) | 59 (67.0%) | 38 (74.5%) | |
| High grade | 125 (35.6%) | 29 (33.0%) | 13 (25.5%) | |
| T stage | 0.330 | |||
| T1a | 174 (49.6%) | 48 (54.5%) | 33 (64.7%) | |
| T1b | 66 (18.8%) | 15 (17.0%) | 11 (21.6%) | |
| T2a | 11 (3.1%) | 2 (2.3%) | 1 (2.0%) | |
| T2b | 5 (1.4%) | 2 (2.3%) | 1 (2.0%) | |
| T3a | 48 (13.7%) | 11 (12.5%) | 1 (2.0%) | |
| T3b | 10 (2.8%) | 1 (1.1%) | 0 (0%) | |
| T3c | 0 (0%) | 0 (0%) | 0 (0%) | |
| T4 | 0 (0%) | 1 (1.1%) | 0 (0%) | |
| Unavailable | 37 (10.5%) | 8 (9.1%) | 4 (7.8%) | |
| Lymph node metastasis | 5 (1.4%) | 0 (0%) | 0 (0%) | 0.764 |
| Distant metastasis | 14 (4.0%) | 5 (5.7%) | 0 (0%) | 0.094 |
| Institution | < 0.001* | |||
| HUP | 299 (85.2%) | 75 (85. 2%) | 0 (0%) | |
| SXY | 8 (2.3%) | 3 (3.4%) | 0 (0%) | |
| PPH | 12 (3.4%) | 3 (3.4%) | 0 (0%) | |
| TCIA | 32 (9.1%) | 7 (8.0%) | 0 (0%) | |
| MAY | 0 (0%) | 0 (0%) | 51 (100.0%) |
*Statistically significant.
Figure 1Heatmap of ROC-AUCs on internal validation set of classifier and feature selection combinations for 50 selected features.
Comparison results of 10 TPOT models.
| Model index | AUC | Accuracy | Sensitivity | Specificity | Precision | Hamming loss | Kappa |
|---|---|---|---|---|---|---|---|
| 1 | 0.52 | 0.58 | 0.11 | 0.92 | 0.43 | 0.42 | 0.03 |
| 2 | 0.65 | 0.73 | 0.46 | 0.85 | 0.43 | 0.27 | 0.33 |
| 3 | 0.65 | 0.75 | 0.38 | 0.92 | 0.44 | 0.25 | 0.34 |
| 4 | 0.65 | 0.75 | 0.38 | 0.92 | 0.44 | 0.25 | 0.34 |
| 5 | 0.63 | 0.73 | 0.38 | 0.88 | 0.41 | 0.28 | 0.29 |
| 6a | 0.67 | 0.76 | 0.42 | 0.92 | 0.47 | 0.24 | 0.38 |
| 7 | 0.61 | 0.71 | 0.35 | 0.86 | 0.38 | 0.29 | 0.23 |
| 8 | 0.55 | 0.71 | 0.15 | 0.95 | 0.35 | 0.29 | 0.13 |
| 9 | 0.65 | 0.75 | 0.38 | 0.92 | 0.44 | 0.25 | 0.34 |
| 10 | 0.61 | 0.73 | 0.31 | 0.92 | 0.40 | 0.27 | 0.26 |
aModel 6 was selected as the final TPOT model for further external validation.
Figure 2ROC curve plotted for the hand-optimized radiomics pipeline and the TPOT pipeline on the external test set.