| Literature DB >> 35873825 |
Asma Shaheen1,2, Syed Talha Bukhari2, Maria Nadeem2, Stefano Burigat1, Ulas Bagci3,4,5, Hassan Mohy-Ud-Din2.
Abstract
Radiomics-guided prediction of overall survival (OS) in brain gliomas is seen as a significant problem in Neuro-oncology. The ultimate goal is to develop a robust MRI-based approach (i.e., a radiomics model) that can accurately classify a novel subject as a short-term survivor, a medium-term survivor, or a long-term survivor. The BraTS 2020 challenge provides radiological imaging and clinical data (178 subjects) to develop and validate radiomics-based methods for OS classification in brain gliomas. In this study, we empirically evaluated the efficacy of four multiregional radiomic models, for OS classification, and quantified the robustness of predictions to variations in automatic segmentation of brain tumor volume. More specifically, we evaluated four radiomic models, namely, the Whole Tumor (WT) radiomics model, the 3-subregions radiomics model, the 6-subregions radiomics model, and the 21-subregions radiomics model. The 3-subregions radiomics model is based on a physiological segmentation of whole tumor volume (WT) into three non-overlapping subregions. The 6-subregions and 21-subregions radiomic models are based on an anatomical segmentation of the brain tumor into 6 and 21 anatomical regions, respectively. Moreover, we employed six segmentation schemes - five CNNs and one STAPLE-fusion method - to quantify the robustness of radiomic models. Our experiments revealed that the 3-subregions radiomics model had the best predictive performance (mean AUC = 0.73) but poor robustness (RSD = 1.99) and the 6-subregions and 21-subregions radiomics models were more robust (RSD 1.39) with lower predictive performance (mean AUC 0.71). The poor robustness of the 3-subregions radiomics model was associated with highly variable and inferior segmentation of tumor core and active tumor subregions as quantified by the Hausdorff distance metric (4.4-6.5mm) across six segmentation schemes. Failure analysis revealed that the WT radiomics model, the 6-subregions radiomics model, and the 21-subregions radiomics model failed for the same subjects which is attributed to the common requirement of accurate segmentation of the WT volume. Moreover, short-term survivors were largely misclassified by the radiomic models and had large segmentation errors (average Hausdorff distance of 7.09mm). Lastly, we concluded that while STAPLE-fusion can reduce segmentation errors, it is not a solution to learning accurate and robust radiomic models.Entities:
Keywords: MRI; brain tumor segmentation; deep learning; glioblastoma; machine learning; radiomics; survival prediction
Year: 2022 PMID: 35873825 PMCID: PMC9301117 DOI: 10.3389/fnins.2022.911065
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
Summary of training and testing cohorts (A and B) used in the overall survival classification task.
| Characteristics | Training cohort | Testing cohort A | Testing cohort B |
|
| |||
| No. of patients |
|
|
|
| Patient distribution | |||
| CBICA UPenn |
| – |
|
| TCIA |
|
| – |
| Others |
| – |
|
|
| |||
| 3D multiparametric MRI scans | √ | √ | √ |
| Ground truth Segmentation masks | √ | × | × |
|
| |||
|
| |||
| Range | 27.8−86.6 | 17.0−80.0 | 21.7−85.6 |
| Mean | 61.9 | 58.4 | 57.3 |
| Median | 63.5 |
|
|
| 1-Standard deviation | 12.0 | 15.5 | 14.3 |
|
| |||
| Range (days) |
|
| – |
| Mean (days) | 446.4 | 390.8 | – |
| Median (days) | 374.5 | 293.7 | – |
| 1-Standard deviation (days) | 343.8 | 314.4 | – |
| Short-term [10months] |
|
| – |
| Medium-term [10−15months] |
|
| – |
| Long-term [ > 15months] |
|
| – |
Configuration and hyperparameters of five CNNs used for automatic segmentation of brain tumor volume.
| Network | Dong 2D U-Net | Wang 2.5D CNN | Isensee 3D U-Net | HDC-Net | E1D3 3D U-Net |
| Architecture | 2D U-Net | Three 2.5D Anisotropic CNNs (W-Net, T-Net, and E-Net) in cascade | 3D U-Net with | 2.5D U-Net | 3D U-Net |
| Activation | ReLU | P-ReLU | Leaky-ReLU (0.01) | ReLU | Leaky-ReLU (0.01) |
| Batch size | 10 | 5 | 2 | 8 | 2 |
| Initialization | He-normal | Truncated Normal | He-normal | He-normal | He-normal |
| Input size/Output size | 240 | W-Net: 19×144 | 1283/1283 | 1283/1283 | 963/963 |
| Learning Rate policy | Polynomial Decay (batch-wise) | Constant (10−3) | Polynomial decay | Polynomial decay | Polynomial decay (epoch-wise) |
| Optimizer | Adam | Adam | SGD + Nesterov | Adam (AMSGrad variant) | SGD + Nesterov |
| Loss | Soft Dice | Soft Dice | Soft Dice + Cross Entropy | Generalized Soft Dice | Soft Dice + Cross Entropy |
| Regularization | – | ||||
| Total Training iterations (Gradient-Decent updates) | 50k | 20k (per network) | 250k | 37.35k | 125k |
| # Parameters | 34.5 | W-Net: 0.21 | 31.2 | 0.29 | 34.9million |
| Training Time | ∼110 h | W-Net (single-view): ∼84 h | ∼101 h | ∼110 h | ∼48 h |
| Test-time Augmentation | √ | × | √ | √ | √ |
| Morphological Post-processing | Morphological closing, cluster thresholding | × | × | × | √ |
Summary of radiomic features extracted for four radiomic models, namely, the WT radiomics model, the 3-subregions radiomics model, the 6-subregions radiomics model, and the 21-subregions radiomics model.
| Feature types | Feature names | No of features |
| Clinical features | Age | 1 |
| Spatial features | Centroid of the WT, (Euclidean) Distance between the (centroid of) WT and the (centroid of) the brain | 4 |
| Shape features (WT radiomics model) | Volume and Surface Area of Whole Tumor | 2 |
| Shape features ( | Volume and Surface Area of Peritumoral Edema (PTE), Enhancing Core (ENC), and Non-Enhancing Core (NEC) | 6 (2 features × 3 subregions) |
| Shape features ( | Volume and Surface Area of Right Cerebral Cortex (RCC), Left Cerebral Cortex (LCC), Left Lateral Ventricle (LLV), Right Lateral Ventricle (RLV), Left Cerebral White Matter (LCWM), Right Cerebral White Matter (RCWM) | 12 (2 features × 6 subregions) |
| Shape features ( | Volume and Surface Area of 21 Subcortical Regions defined by a registered Harvard-Oxford subcortical atlas (see | 42 (2 features × 21 subregions) |
Performance of six segmentation schemes, including five CNNs and one STAPLE-fusion method, on testing cohorts A and B (60 subjects).
| Segmentation network | Dice similarity coefficient (%) | Hausdorff distance (mm) | Final Ranking Score (FRS) | ||||
|
|
| ||||||
| WT | TC | EC | WT | TC | EC | ||
| Dong 2D U-Net | 90.4 ± 6.5 | 87.3 ± 9.9 | 84.1 ± 9.4 | 5.8 ± 9.0 | 6.5 ± 8.6 | 3.2 ± 5.5 | 6 |
| Wang 2.5D CNN | 90.6 ± 5.5 | 89.3 ± 8.7 | 85.2 ± 10.0 | 6.6 ± 10.0 | 5.7 ± 8.7 | 2.9 ± 4.8 | 5 |
| Isensee 3D U-Net | 4.4 ± 5.6 | 4.4 ± 8.5 | 2.1 ± 1.9 | 1 | |||
| HDC-Net | 90.8 ± 5.4 | 90.1 ± 7.3 | 85.9 ± 8.4 | 4.3 ± 4.4 | 4.5 ± 8.0 | 2.1 ± 1.3 | 3 |
| E1D3 3D U-Net | 91.4 ± 4.9 | 89.7 ± 9.0 | 85.9 ± 9.1 | 5.5 ± 7.8 | 5.6 ± 10.0 | 3.4 ± 6.3 | 4 |
| STAPLE-Fusion | 91.4 ± 4.8 | 90.6 ± 7.6 | 86.7;±7.7 | 2 | |||
Bold font indicates best scores for overlapping subregions (WT, TC, and EC).
**Indicates that the segmentation network is ranked significantly lower (p < 0.001) in comparison to the top-ranked method Isensee 3D U-Net (FRS = 1).
FIGURE 1Automatically segmented tumor subregions from six segmentation schemes including five CNNs and one STAPLE fusion method. Peritumoral edema, enhancing core, non-enhancing core. peritumoral edema (Green) enhancing core (Yellow) and non-enhancing core (Orange).
Quantitative analysis of four radiomic models, namely, the WT radiomics model, the 3-subregions radiomics model, the 6-subregions radiomics model, and the 21-subregions radiomics model on testing cohort A (31 subjects).
| Segmentation network | Performance metric | WT radiomics model | |||
| Dong 2D U-Net | AUC | 0.71(0.77, 0.48, 0.70) | 0.70(0.68, 0.43, 0.78) | ||
| AUPRC | 0.58 | 0.66 | 0.51 | 0.57 | |
| Wang 2.5D CNN | AUC | 0.68(0.68, 0.44, 0.7) | 0.70(0.71, 0.42, 0.74) | 0.70(0.65, 0.43, 0.78) | |
| AUPRC | 0.53 | 0.68 | 0.51 | 0.56 | |
| Isensee 3D U-Net | AUC | 0.71(0.7, 0.31, 0.75) | |||
| AUPRC | 0.57 | 0.62 | 0.56 | 0.61 | |
| HDC-Net | AUC | 0.69(0.68, 0.45, 0.72) | 0.73(0.67, 0.53, 0.82) | 0.71(0.71, 0.45, 0.75) | 0.71(0.67, 0.45, 0.78) |
| AUPRC | 0.54 | 0.61 | 0.53 | 0.58 | |
| E1D3 3D U-Net | AUC | 0.67(0.68, 0.36,0.69) | 0.72(0.71, 0.35, 0.8) | 0.71(0.73, 0.42, 0.76) | |
| AUPRC | 0.54 | 0.64 | 0.57 | 0.60 | |
| STAPLE Fusion | AUC | 0.68(0.69, 0.42, 0.71) | 0.74(0.7, 0.45, 0.82) | 0.70(0.75, 0.40, 0.74) | 0.71(0.69, 0.43, 0.77) |
| AUPRC | 0.55 | 0.64 | 0.51 | 0.59 |
Bold font indicates the best performance achieved for each radiomics model.
The micro-AUC of the three classes is displayed as an ordered triplet (short-term survivor, medium-term survivor, and long-term survivor) below the weighted average AUC value.
FIGURE 2Distribution of misclassified subjects in (A) the WT radiomics model, the 3-subregions radiomics model, and the 6-subregions radiomics model (B) the WT radiomics model, the 3-subregions radiomics model, and the 21-subregions radiomics model (C) the WT radiomics model, the 6-subregions radiomics model, and the 21-subregions radiomics model, on testing cohort A (31 subjects). For instance, 11 subjects were misclassified by the WT, 6-subregions, and 21-subregions radiomics models.
Quantitative analysis of four radiomic models, namely, the WT radiomics model, the 3-subregions radiomics model, the 6-subregions radiomics model, and the 21-subregions radiomics model, on testing cohort B (29 subjects).
| Segmentation network | Accuracy (%) | |||
|
| ||||
| WT | ||||
| Dong 2D U-Net | 44.8 | 41.4 | 48.3 | 51.7 |
| Wang 2.5D CNN | 44.8 | 41.4 | 41.4 | 37.9 |
| Isensee 3D U-Net | 44.8 | 41.4 | 48.3 | 44.8 |
| HDC-Net | 48.3 | 37.9 | 44.8 | 44.8 |
| E1D3 3D U-Net | 48.3 | 44.8 | 48.3 | 51.7 |
| STAPLE Fusion | 48.3 | 41.4 | 48.3‘ | 41.4 |
The 21 subregions defined by the Harvard-Oxford subcortical atlas are tabulated below.
| Label | Anatomical Region |
| 1 | Left Cerebral White Matter |
| 2 | Left Cerebral Cortex |
| 3 | Left Lateral Ventricle |
| 4 | Left Thalamus |
| 5 | Left Caudate |
| 6 | Left Putamen |
| 7 | Left Pallidum |
| 8 | Brainstem |
| 9 | Left Hippocampus |
| 10 | Left Amygdala |
| 11 | Left Accumbens |
| 12 | Right Cerebral White Matter |
| 13 | Right Cerebral Cortex |
| 14 | Right Lateral Ventricle |
| 15 | Right Thalamus |
| 16 | Right Caudate |
| 17 | Right Putamen |
| 18 | Right Pallidum |
| 19 | Right Hippocampus |
| 20 | Right Amygdala |
| 21 | Right Accumbens |