| Literature DB >> 33937440 |
João Santinha1,2, Celso Matos3, Mário Figueiredo2, Nikolaos Papanikolaou1.
Abstract
Purpose: Radiogenomics offers a potential virtual and noninvasive biopsy. However, radiogenomics models often suffer from generalizability issues, which cause a performance degradation on unseen data. In MRI, differences in the sequence parameters, manufacturers, and scanners make this generalizability issue worse. Such image acquisition information may be used to define different environments and select robust and invariant radiomic features associated with the clinical outcome that should be included in radiomics/radiogenomics models. Approach: We assessed 77 low-grade gliomas and glioblastomas multiform patients publicly available in TCGA and TCIA. Radiomics features were extracted from multiparametric MRI images (T1-weighted, contrast-enhanced T1-weighted, T2-weighted, and fluid-attenuated inversion recovery) and different regions-of-interest (enhancing tumor, nonenhancing tumor/necrosis, and edema). A method developed to find variables that are part of causal structures was used for feature selection and compared with an embedded feature selection approach commonly used in radiomics/radiogenomics studies, across two different scenarios: (1) leaving data from a center as an independent held-out test set and tuning the model with the data from the remaining centers and (2) use stratified partitioning to obtain the training and the held-out test sets.Entities:
Keywords: IDH1/2 mutation status; generalizability; invariance; multicentric; radiogenomics; robust
Year: 2021 PMID: 33937440 PMCID: PMC8082292 DOI: 10.1117/1.JMI.8.3.031905
Source DB: PubMed Journal: J Med Imaging (Bellingham) ISSN: 2329-4302
Fig. 1Example of segmentation of the different regions used to extract the radiomic features (TCIA case ID – TCGA-14-3477) overlaid in the cT1w image (a). Enhanced tumor region (L1) is shown in green, nonenhanced tumor and necrosis (L2) are shown in yellow, and edema (L3) is shown in brown. (b) The cT1w image without segmentation overlay; (c) FLAIR image; (d) T2w image.
List of hospitals and corresponding total number of cases and number of IDH1/2 mutated and wild-type cases with respective tumor grades (II, III, and IV).
| Hospital (center ID) | No. of cases | No. of IDH1/2 mutant (grades) | No. of IDH1/2 wild-type (grades) |
|---|---|---|---|
| Case Western (1) | 7 | 4 (4 – II) | 3 (1 – III; 2 – IV) |
| Case Western – St. Joes (2) | 12 | 11 (5 – II; 6 – III) | 1 (1 – II) |
| Henry Ford Hospital (3) | 36 | 29 (12 – II; 15 – III; 2 – IV) | 7 (5 – III; 2 – IV) |
| Thomas Jefferson University (4) | 22 | 7 (2 – II; 5 – III) | 15 (1 – II; 3 – III; 11 – IV) |
List of feature extraction parameters used for each MRI sequence.
| Feature extraction parameters | cT1w | T1w | T2w | FLAIR |
|---|---|---|---|---|
| Normalization scale | 100 | 100 | 100 | 100 |
| Voxel array shift | 300 | 300 | 300 | 300 |
| Resampled pixel spacing (mm) | ||||
| Resegment range (mode: sigma) | ||||
| Bin width | 5 | 2 | 5 | 5 |
| LoG sigma (mm) | [1.016, 2, 3] | [1.016, 2, 3] | [1.016, 2, 3] | [1.016, 2, 3] |
| Wavelet number of levels | 2 | 2 | 2 | 2 |
Performance metrics of the different models, A1, A2, and B, when trained with the different combinations of three out of the four centers and tested on the held-out center. AUC, area under the receiver operating characteristic curve; Acc, accuracy; Sens, sensitivity; Spec, specificity; MCC, Matthews correlation coefficient.
| Model A1 | Model A2 | Model B | |||||
|---|---|---|---|---|---|---|---|
| CV [95% CI] | Test [95% CI] | CV [95% CI] | Test [95% CI] | CV [95% CI] | Test [95% CI] | ||
| Train with centers 2, 3, and 4; test with center 1 | AUC | 0.95 [0.94; 0.97] | 0.75 [0.25; 1.00] | 0.93 [0.91; 0.95] | 0.67 [0.25; 1.00] | 0.81 [0.78; 0.84] | 0.83 [0.50; 1.00] |
| Acc | 0.90 [0.88; 0.92] | 0.71 [0.35; 0.91] | 0.89 [0.86; 0.91] | 0.71 [0.35; 0.91] | 0.75 [0.72; 0.78] | 0.71 [0.35; 0.91] | |
| Sens | 0.84 [0.78; 0.90] | 0.67 [0.20; 0.93] | 0.86 [0.80; 0.92] | 0.67 [0.20; 0.93] | 0.55 [0.45; 0.65] | 0.67 [0.20; 0.93] | |
| Spec | 0.93 [0.91; 0.95] | 0.75 [0.30; 0.95] | 0.90 [0.87; 0.93] | 0.75 [0.30; 0.95] | 0.85 [0.82; 0.88] | 0.75 [0.30; 0.95] | |
| MCC | 0.77 [0.72; 0.82] | 0.42 [0.19; 0.68] | 0.75 [0.70; 0.81] | 0.42 [0.19; 0.68] | 0.42 [0.33; 0.51] | 0.42 [0.19; 0.68] | |
| Train with centers 1, 3, and 4; test with center 2 | AUC | 0.92 [0.90; 0.94] | 0.64 [0.36; 0.91] | 0.91 [0.89; 0.94] | 0.82 [0.55; 1.00] | 0.84 [0.81; 0.86] | 0.55 [0.27; 0.82] |
| Acc | 0.85 [0.82; 0.87] | 0.75 [0.46; 0.91] | 0.85 [0.82; 0.87] | 0.67 [0.39; 0.86] | 0.77 [0.75; 0.80] | 0.83 [0.55; 0.95] | |
| Sens | 0.79 [0.75; 0.83] | 0.00 [0.00; 0.73] | 0.76 [0.69; 0.83] | 0.00 [0.00; 0.73] | 0.59 [0.53; 0.65] | 0.00 [0.00; 0.73] | |
| Spec | 0.87 [0.84; 0.91] | 0.82 [0.52; 0.94] | 0.89 [0.87; 0.91] | 0.73 [0.43; 0.90] | 0.87 [0.84; 0.90] | 0.91 [0.62; 0.98] | |
| MCC | 0.67 [0.61; 0.72] | -0.13 [-0.38; -0.04] | 0.66 [0.60; 0.72] | -0.17 [-0.41; -0.06] | 0.48 [0.42; 0.54] | -0.09 [-0.38; -0.02] | |
| Train with centers 1, 2, and 4; test with center 3 | AUC | 0.91 [0.89; 0.93] | 0.80 [0.62; 0.95] | 0.91 [0.89; 0.93] | 0.85 [0.38; 0.86] | 0.83 [0.80; 0.86] | 0.89 [0.56; 0.96] |
| Acc | 0.84 [0.80; 0.88] | 0.75 [0.58; 0.86] | 0.85 [0.83; 0.88] | 0.78 [0.61; 0.88] | 0.76 [0.72; 0.79] | 0.81 [0.64; 0.90] | |
| Sens | 0.75 [0.66; 0.83] | 0.86 [0.48; 0.97] | 0.82 [0.74; 0.89] | 0.57 [0.25; 0.84] | 0.54 [0.44; 0.64] | 0.86 [0.48; 0.97] | |
| Spec | 0.88 [0.84; 0.93] | 0.72 [0.54; 0.85] | 0.87 [0.83; 0.91] | 0.83 [0.65; 0.92] | 0.87 [0.82; 0.91] | 0.79 [0.61; 0.90] | |
| MCC | 0.64 [0.55; 0.73] | 0.47 [0.41; 0.53] | 0.69 [0.63; 0.74] | 0.36 [0.30; 0.42] | 0.44 [0.34; 0.54] | 0.55 [0.48; 0.60] | |
| Train with centers 1, 2, and 3; test with center 4 | AUC | 0.87 [0.84; 0.89] | 0.77 [0.51; 1.00] | 0.84 [0.81; 0.87] | 0.72 [0.44; 0.93] | 0.82 [0.78; 0.85] | 0.75 [0.47; 0.97] |
| Acc | 0.82 [0.79; 0.85] | 0.59 [0.38; 0.76] | 0.83 [0.80; 0.86] | 0.36 [0.19; 0.57] | 0.77 [0.74; 0.80] | 0.27 [0.13; 0.48] | |
| Sens | 0.59 [0.50; 0.67] | 0.53 [0.30; 0.75] | 0.62 [0.53; 0.71] | 0.13 [0.03; 0.37] | 0.56 [0.46; 0.65] | 0.00 [0.00; 0.15] | |
| Spec | 0.93 [0.90; 0.97] | 0.71 [0.35; 0.91] | 0.94 [0.91; 0.96] | 0.86 [0.48; 0.97] | 0.88 [0.84; 0.92] | 0.86 [0.48; 0.97] | |
| MCC | 0.59 [0.52; 0.66] | 0.23 [0.16; 0.31] | 0.61 [0.55; 0.68] | -0,01 [-0.07; 0.00] | 0.47 [0.39; 0.55] | -0.32 [-0.46; -0.20] | |
Confusion matrix of the different models, A1, A2, and B, on each of the corresponding independent held-out test sets (WT denotes wild-type).
| Reference | ||||
|---|---|---|---|---|
| Prediction | Mutant | WT | ||
| Test using center 1 | Model A1 | Mutant | 3 | 1 |
| WT | 1 | 2 | ||
| Model A2 | Mutant | 3 | 1 | |
| WT | 1 | 2 | ||
| Model B | Mutant | 3 | 1 | |
| WT | 1 | 2 | ||
| Test using center 2 | Model A1 | Mutant | 9 | 1 |
| WT | 2 | 0 | ||
| Model A2 | Mutant | 8 | 1 | |
| WT | 3 | 0 | ||
| Model B | Mutant | 10 | 1 | |
| WT | 1 | 0 | ||
| Test using center 3 | Model A1 | Mutant | 21 | 1 |
| WT | 8 | 6 | ||
| Model A2 | Mutant | 24 | 3 | |
| WT | 5 | 4 | ||
| Model B | Mutant | 23 | 1 | |
| WT | 6 | 6 | ||
| Test using center 4 | Model A1 | Mutant | 5 | 7 |
| WT | 2 | 8 | ||
| Model A2 | Mutant | 6 | 13 | |
| WT | 1 | 2 | ||
| Model B | Mutant | 6 | 15 | |
| WT | 1 | 0 | ||
Fig. 2Sensitivity and specificity of cross-validation versus testing for different cases. (a), (b) Sensitivity and specificity of model A1;(c), (d) sensitivity and specificity of model A2; (e), (f) sensitivity and specificity of model B.
Performance metrics of the different models, A1, A2, and B, on the cross-validation (CV) and the held-out test (Test) set, corresponding to 30% of the whole data set and obtained through stratified partitioning based on IDH1/2 mutation status. AUC, area under the receiver operating characteristic curve; Acc, accuracy; Sens, sensitivity; Spec, specificity; MCC, Matthews correlation coefficient; CI, confidence interval.
| Model A1 | Model A2 | Model B | ||||
|---|---|---|---|---|---|---|
| CV [95% CI] | Test [95% CI] | CV [95% CI] | Test [95% CI] | CV [95% CI] | Test [95% CI] | |
| AUC | 0.95 [0.94; 0.97] | 0.80 [0.58; 0.95] | 0.93 [0.91; 0.95] | 0.89 [0.71; 1.00] | 0.81 [0.78; 0.84] | 0.79 [0.58; 0.94] |
| Acc | 0.90 [0.88; 0.92] | 0.64 [0.42; 0.80] | 0.89 [0.86; 0.91] | 0.86 [0.66; 0.95] | 0.75 [0.72; 0.78] | 0.73 [0.51; 0.86] |
| Sens | 0.84 [0.78; 0.90] | 0.43 [0.15; 0.74] | 0.86 [0.80; 0.92] | 0.86 [0.48; 0.97] | 0.55 [0.45; 0.65] | 0.43 [0.15; 0.74] |
| Spec | 0.93 [0.91; 0.95] | 0.73 [0.48; 0.89] | 0.90 [0.87; 0.93] | 0.87 [0.62; 0.96] | 0.85 [0.82; 0.88] | 0.87 [0.62; 0.96] |
| MCC | 0.77 [0.72; 0.82] | 0.16 [0.10; 0.24] | 0.75 [0.70; 0.81] | 0.70 [0.60; 0.77] | 0.42 [0.33; 0.51] | 0.33 [0.24; 0.42] |
Confusion matrix of the different models, A1, A2, and B, on held-out test set.
| Prediction | Reference | ||
|---|---|---|---|
| Mutant | WT | ||
| Model A1 | Mutant | 11 | 4 |
| WT | 4 | 3 | |
| Model A2 | Mutant | 13 | 1 |
| WT | 2 | 6 | |
| Model B | Mutant | 13 | 4 |
| WT | 2 | 3 | |
-values of one-sided tests to assess if accuracy was greater than the NIR.
| Model A1 | Model A2 | Model B | |
|---|---|---|---|
| Test 1 | 0.359 | 0.359 | 0.359 |
| Test 2 | 0.986 | 0.998 | 0.928 |
| Test 3 | 0.854 | 0.745 | 0.599 |
| Test 4 | 0.873 | 1.000 | 1.000 |
| Mix | 0.758 | 0.048 | 0.420 |
List of features selected by each method (A1, A2, and B) for each scenario (individual centers as independent test set or stratified partitioning of data from all four centers into training and test sets. Note: “3D” within features named log.sigma.< sigma_value>.mm.3D.< feature_class_feature_name> does not indicate the extraction of the feature in 3D but that the LoG is a 3D filter.
| Test 1 | Test 2 | Test 3 | Test 4 | Test mix | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A1 | A2 | B | A1 | A2 | B | A1 | A2 | B | A1 | A2 | B | A1 | A2 | B | Total | |
| wavelet.HL_firstorder_Mean_cT1_L2 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 8 |
| wavelet.HH_gldm_DependenceVariance_T2_L2 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 6 |
| wavelet.HL_glszm_SmallAreaLowGrayLevelEmphasis_FLAIR_L3 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 6 |
| log.sigma.2.mm.3D_firstorder_90Percentile_cT1_L2 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 4 |
| original_shape_MinorAxisLength_T1_L2 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 4 |
| wavelet2.LL_firstorder_Mean_T2_L3 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 4 |
| wavelet2.HL_glszm_LargeAreaHighGrayLevelEmphasis_T1_L2 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
| wavelet.LH_gldm_LargeDependenceHighGrayLevelEmphasis_T1_L2 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| wavelet2.HH_glcm_Idm_T2_L2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 3 |
| original_firstorder_Mean_T2_L3 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| wavelet.HL_firstorder_10Percentile_T2_L2 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| wavelet2.LH_gldm_LowGrayLevelEmphasis_FLAIR_L3 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| log.sigma.3.mm.3D_firstorder_MeanAbsoluteDeviation_cT1_L3 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
| wavelet.HL_firstorder_RootMeanSquared_cT1_L2 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 3 |
| log.sigma.3.mm.3D_ngtdm_Strength_T2_L2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 3 |
| wavelet2.HH_glrlm_GrayLevelNonUniformity_T2_L2 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet2.HL_gldm_LargeDependenceHighGrayLevelEmphasis_T1_L2 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet.HL_glszm_GrayLevelNonUniformity_T2_L2 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| log.sigma.1.015625.mm.3D_firstorder_Mean_T2_L1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet2.HH_gldm_LargeDependenceLowGrayLevelEmphasis_T1_L2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet2.HL_gldm_LargeDependenceLowGrayLevelEmphasis_T1_L2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet.HH_glcm_SumEntropy_T2_L2 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet2.LL_glcm_MCC_T2_L2 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet.LH_glcm_Correlation_cT1_L3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| wavelet2.HL_glszm_ZoneEntropy_T1_L2 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet.HL_glszm_ZoneEntropy_T1_L2 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet2.HH_glcm_InverseVariance_T2_L2 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| wavelet2.HH_glszm_LargeAreaLowGrayLevelEmphasis_T1_L1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| wavelet2.LL_glcm_MaximumProbability_FLAIR_L3 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| wavelet2.LL_glcm_ClusterShade_cT1_L3 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| wavelet.HL_gldm_DependenceVariance_FLAIR_L3 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet.LH_glszm_SizeZoneNonUniformity_FLAIR_L1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| wavelet.HL_glszm_LargeAreaHighGrayLevelEmphasis_T1_L2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| wavelet.HL_glcm_Correlation_FLAIR_L1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 2 |
| wavelet2.HH_firstorder_10Percentile_T2_L2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 2 |
| log.sigma.1.015625.mm.3D_ngtdm_Contrast_T2_L2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 2 |
| log.sigma.3.mm.3D_gldm_LargeDependenceLowGrayLevelEmphasis_FLAIR_L3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 2 |
| wavelet2.HL_glcm_ClusterShade_cT1_L3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 2 |
| log.sigma.1.015625.mm.3D_glrlm_ShortRunLowGrayLevelEmphasis_T1_L3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 2 |
| log.sigma.3.mm.3D_glrlm_GrayLevelVariance_cT1_L3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 2 |
| original_shape_LeastAxisLength_T2_L2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 2 |
Fig. 3Representation of coefficients of selected features for models (a) A1, (b) A2, and (3) B when tuned with data from centers 2, 3, and 4.
Fig. 4Representation of coefficients of selected features for models (a) A1, (b) A2, and (c) B when tuned with data from centers 1, 3, and 4.
Fig. 5Representation of coefficients of selected features for models (a) A1, (b) A2, and (c) B when tuned with data from centers 1, 2, and 4.
Fig. 6Representation of coefficients of selected features for models (a) A1, (b) A2, and (c) B when tuned with data from centers 1, 2, and 3.
Fig. 7Representation of coefficients of selected features for models (a) A1, (b) A2, and (c) B when tuned with 70% of data from centers 1, 2, 3, and 4.