| Literature DB >> 31507398 |
Abstract
Purpose: Predicting patients' survival outcomes is recognized of key importance to clinicians in oncology toward determining an ideal course of treatment and patient management. This study applies radiomics analysis on pre-operative multi-parametric MRI of patients with glioblastoma from multiple institutions to identify a signature and a practical machine learning model for stratifying patients into groups based on overall survival.Entities:
Keywords: MRI; glioblastoma multiforme; machine learning; patient's survival prediction; precision oncology; radiomics analysis
Year: 2019 PMID: 31507398 PMCID: PMC6718726 DOI: 10.3389/fncom.2019.00058
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 2.380
Figure 1Glioblastoma multiforme sub-regions segmentation masks generated by experts annotated in the different MRI sequences. (A) the whole tumor (yellow) visible in T2-FLAIR, (B) the tumor core (light blue) visible in T2, and (C) the active tumor structures (purple) visible in T1-Gd. Combination of three segmentation labels overlaid on T2-FLAIR MRI producing (D) the final labels of the tumor sub-structures: peritumoral edema [ED] (yellow), non-enhancing solid core tumor [NET] (light blue), necrosis [NCR], and enhancing tumor core (purple).
Demographic and clinical characteristics data of GBM patients in discovery, validation, and combined sets.
| Patient distribution | 109 (67%) | 54 (33%) | 163 |
| -CBICA UPenn | – | – | 85 (52%) |
| -TCIA | – | – | 76 (47%) |
| - MGH, HU, DU, and BU | – | – | 2 (1%) |
| - Data set of T1, T1-Gd, T2, and T2-FLAIR MRI sequences with tumor sub-structures “ground truth” segmentation labels | – | – | 163 |
| - Range | 18.97–84.84 | 33.88–85.76 | 18.97–85.76 |
| - Mean | 59.73 | 61.55 | 60.33 |
| - Median | 60.94 | 62.36 | 61.17 |
| −1 Standard deviation | 12.23 | 11.81 | 12.03 |
| - Range | 5–1767 | 22–1731 | 5–1767 |
| - Mean | 421.37 | 426.18 | 422.96 |
| - Median | 362.00 | 364.50 | 362.00 |
| -1 Standard deviation | 350.00 | 352.31 | 349.67 |
| - Short-term survivors [ <10 months] | 44 | 21 | 65 (40%) |
| - Medium-term survivors [10–15 months] | 28 | 14 | 42 (26%) |
| - Long-term survivors [>15 months] | 37 | 19 | 56 (34%) |
| - Gross total resection | 36 | 23 | 59 (36%) |
| - Subtotal resection | 19 | 5 | 24 (15%) |
| -Missing information | 54 | 26 | 80 (49%) |
CBICA UPenn, Center for Biomedical Image Computing and Analytics at the University of Pennsylvania; TCIA, The Cancer Imaging Archive; BU, Bern University; DU, Debrecen University; HU, Heidelberg University; MGH, Massachusetts General Hospital.
Data in parentheses are P-value.
Figure 2Multi-parametric MRI sequences before and after intensity normalization with 256 scale (8-bit depth). All normalized images have the same scale.
A summary of radiomics features extracted from the tumor sub-regions (WT, TC, and AT) in multi-parametric MR images (T1-Gd, T2, and T2-FLAIR).
| Sub-regions ( | Whole tumor (WT), tumor core (TC), and active tumor (AT). |
| Shape features ( | Volume [tumor, brain], volume ratio [tumor/brain, AT/WT |
| Intensity features ( | Minimum value, maximum value, median value, mean value, range, variance, moment 2nd-order, moment 3rd-order, entropy, kurtosis, root mean square (RMS), skewness, standard deviation, mean absolute deviation (MAD). |
| Texture features: GLCM ( | Contrast, correlation, energy, homogeneity, (sum) variance, (sum) average, (mean) variance, (mean) autocorrelation, entropy, (sum) entropy2, (difference) entropy2, (sum) variance2, (difference) variance2, range of all GLCM features. |
| HOG features ( | Sum HOG, median HOG, standard deviation HOG. |
| LBP features ( | Sum LBP, mean LBP, standard deviation LBP. |
All features were extracted from a 2D image except those indicated as volumetric features (3D).
Unless noted with a strike (*), each feature was individually extracted from the “whole tumor” area on T2-FLAIR MRI, tumor core area on T2 MRI, and active tumor area on T1-Gd MRI.
These features were calculated as combined features from joint of WT, TC, and AT sub-structures.
Features indicated with (2) were derived from GLCM calculated horizontally (0-degree) and 45-degree rotations.
Figure 3The workflow of radiomics analysis used in this study. The overall procedure of identifying a mpMRI radiomics signature model and a practical ML model for stratifying the GBM patient's prognoses based on overall survival.
The subset of nine imaging features selected by the LASSO model and the clinical factors with their median, non-zero coefficients determined with Cox regression, and P-value for constructing the mpMRI radiomics signature in the discovery data set.
| T2-FLAIR_WT_DifferenceVariance2_F41 | 132670 | 1.8000e-06 | 9.7500e-04 |
| T2_TC_TumorToBrainVolumeRation_F79 | 0.0057 | 27.0110 | 0.0028 |
| T2_TC_MinimumTumorIntensity_F111 | 123.9908 | −0.0066 | 0.0030 |
| T2_TC_Range_F115 | 121.5823 | 0.0063 | 0.0030 |
| T2_TC_SumHOG _F139 | 244.4848 | 0.0025 | 0.0040 |
| T2-FLAIR_WT_ SumEntropy2_F38 | 1.9066 | −1.4337 | 0.0152 |
| T1-GD_AT_Energy_F_79 | 0.2027 | −3.1019 | 0.0175 |
| T2-FLAIR_WT_MedianHOG _F44 | 0.1107 | 17.1896 | 0.0185 |
| T2_TC_Moment3rd_F121 | −5324.8 | 5.8300e-06 | 0.0203 |
| Age (years) | 61.17 | – | 3.3700e-04 |
| Resection status (GTR, STR, NA) | – | – | 0.9720 |
They were ordered by their association with survival (P-value).
Figure 4The optimal λ selection by cross-validated deviance of LASSO fit. The partial likelihood deviance plotted vs. λ. The green dotted vertical line was plotted at the optimal λ (36.50) and the blue dotted at λ + 1 STD (84.33) as shown in the plot.
Figure 5The survival stratification created using the constructed radiomics signature. The signature performance in stratifying the survival into short-, medium-, and long-survivors on the (A) discovery and (B) validation sets.
Figure 6The heat map of the LASSO selected radiomics features that used to discover the signature. The rows demonstrate the subset of nine selected features, while the columns indicate the patients (both discovery and validation data sets). The color map shows the z-score difference of each radiomics feature.
AUC and overall accuracy of several trained ML models' performance in classifying GBM patients survival into three groups as a function of choice of features.
| •Imaging features | 0.67 (0.69) | 0.52 (0.60) | 0.61 (0.59) | 47.2 (50.3) |
| •Imaging features + clinical factors | 0.74 | 0.51 | 0.67 | 53.4 |
| •Imaging features (LASSO) | 0.72 (0.74) | 0.31 (0.37) | 0.68 (0.73) | 50.9 (56.4) |
| •Imaging features (LASSO) + clinical factors | 0.80 (0.81) | 0.51 (0.53) | 0.68 (0.73) | 54.0 (55.2) |
| •Imaging features | 0.64 | 0.48 | 0.60 | 46.0 |
| •Imaging features + clinical factors | 0.68 | 0.46 | 0.67 | 50.1 |
| •Imaging (LASSO) features | 0.73 (0.72) | 0.47 (0.45) | 0.72 (0.67) | 47.2 (50.3) |
| •Imaging (LASSO) features + clinical factors | 0.79 (0.78) | 0.44 (0.55) | 0.70 (0.66) | 47.9 (50.9) |
| •Imaging features | 0.67 | 0.52 | 0.61 | 47.2 |
| •imaging features + clinical factors | 0.72 | 0.48 | 0.67 | 49.1 |
| •Imaging (LASSO) features | 0.74 | 0.45 | 0.72 | 56.4 |
| •Imaging (LASSO) features + clinical factors | 0.79 | 0.49 | 0.71 | 53.4 |
| •Imaging (LASSO) features | 0.75 | 0.42 | 0.71 | 57.1 |
| •Imaging (LASSO) features + clinical factors | ||||
Values in brackets are the performance of SVM Linear classifier.
Values in brackets are the performance of SVM Coarse Gaussian classifier.
Values in brackets are the performance of KNN Cosine classifier.
Values in brackets are the performance of KNN Medium classifier.
The overall best classification results are listed in bold.
Figure 7The AUC plot of the three best overall ML classifier invariants in each machine learning category: SVM (Coarse Gaussian), KNN (Medium), and Ensemble (Subspace Discriminant) in classifying OS into three classes using the best feature combination.
The comparison of this study's findings with similarly published works for GBM patients stratification based on survival with radiomics analysis.
| Yang et al. ( | T1 and T2-FLAIR | Ensemble (random forest) learning | 12-months survival | – | 0.67 | – |
| Macyszyn et al. ( | T1, T1-Gd, T2, T2-FLAIR, DTI, and DSC | SVMs | Short- (<6 months), medium- (6–18 months), and long-term (>18 months) | 80.0% | – | – |
| This work | T1, T1-Gd, T2, and T2-FLAIR | LASSO and Cox regression, ensemble (subspace discriminant) learning | Short- (<10 months), medium- (10–15 months), and long-term (>15 months) | 57.8% | 0.81, 0.47, 0.72 | Discovery ( |
| Sanghani et al. ( | T1, T1-Gd, T2, and T2-FLAIR | SVMs | Short- (<10 months), medium- (10–15 months), and long-term (>15 months) | 88.95% | – | – |
| Liu et al. ( | T1, T1-Gd, T2, and T2-FLAIR | SVMs | Short- (<12 months) vs. long-term (≥12 months) | 80.7% | 0.79 | – |
| Chen et al. ( | T1-Gd | LASSO Cox regression | Short- (<12 months) vs. long-term (≥12 months) | 85.1% | 0.81 | Discovery ( |
| Chaddad et al. ( | T1-Gd and T2-FLAIR | Random forest | Short- (<12 months) vs. long-term (>12 months) | – | 0.78 | – |
| Zong et al. ( | T1, T1-Gd, T2, and T2-FLAIR | CNNs | Short- (<6 months), medium- (6–18 months), and long-term (>18 months) | 64.3%, | – | – |
| Rathore et al. ( | T1, T1-Gd, T2, T2-FLAIR, DSC-MRI, and DTI | K-means clustering, Cox regression | Worst (MS = 6 months), intermediate (MS = 12 months), and longest survival (MS = 19 months) | – | – | Validation ( |
DTI, Diffusion Tensor Imaging; DSC, Dynamic Susceptibility Contrast-Enhanced; CNNs, Convolutional Neural Networks; MS, Median Survival.