| Literature DB >> 32942564 |
Nguyen Quoc Khanh Le1,2, Duyen Thi Do3, Fang-Ying Chiu2, Edward Kien Yee Yapp4, Hui-Yuan Yeh5, Cheng-Yu Chen1,2,6,7.
Abstract
Approximately 96% of patients with glioblastomas (GBM) have IDH1 wildtype GBMs, characterized by extremely poor prognosis, partly due to resistance to standard temozolomide treatment. O6-Methylguanine-DNA methyltransferase (MGMT) promoter methylation status is a crucial prognostic biomarker for alkylating chemotherapy resistance in patients with GBM. However, MGMT methylation status identification methods, where the tumor tissue is often undersampled, are time consuming and expensive. Currently, presurgical noninvasive imaging methods are used to identify biomarkers to predict MGMT methylation status. We evaluated a novel radiomics-based eXtreme Gradient Boosting (XGBoost) model to identify MGMT promoter methylation status in patients with IDH1 wildtype GBM. This retrospective study enrolled 53 patients with pathologically proven GBM and tested MGMT methylation and IDH1 status. Radiomics features were extracted from multimodality MRI and tested by F-score analysis to identify important features to improve our model. We identified nine radiomics features that reached an area under the curve of 0.896, which outperformed other classifiers reported previously. These features could be important biomarkers for identifying MGMT methylation status in IDH1 wildtype GBM. The combination of radiomics feature extraction and F-core feature selection significantly improved the performance of the XGBoost model, which may have implications for patient stratification and therapeutic strategy in GBM.Entities:
Keywords: F-score feature selection; IDH1 wildtype; O6-methylguanine-DNA methyltransferase; XGBoost; concomitant adjuvant temozolomide; glioblastoma; machine learning; molecular subtype; noninvasive imaging biomarker; radiogenomics
Year: 2020 PMID: 32942564 PMCID: PMC7563334 DOI: 10.3390/jpm10030128
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Patient characteristics (n = 53).
| Methylated ( | Unmethylated ( | |
|---|---|---|
| Age (mean ± SD, years) | 53 ± 16.43 | 60.83 ± 12.12 |
| Gender | ||
| Male | 8 | 19 |
| Female | 18 | 8 |
| TCGA subtype | ||
| Classical | 6 | 8 |
| Mesenchymal | 9 | 8 |
| Neural | 1 | 2 |
| Proneural | 6 | 9 |
| Methylation class | ||
| CL_1 | 1 | 4 |
| CL_2 | 8 | 9 |
| CL_3 | 6 | 5 |
| CL_4 | 4 | 5 |
| CL_6 | 3 | 4 |
Figure 1Recursive feature elimination (RFE) curve for predicting O6-Methylguanine-DNA methyltransferase (MGMT) methylation status in patients with IDH1 wildtype glioblastomas (GBM) by using different features. Nine features were used as optimal cutoff points.
Top nine radiomics features in our model.
| No. | Feature Name | Modality | Matrix | Type |
|---|---|---|---|---|
| 1 | HISTO_ET_T2_Bin6 | T2 | First Order | Histogram |
| 2 | TEXTURE_GLRLM_ED_T2_GLV | T2 | GLRLM | Texture |
| 3 | TEXTURE_GLSZM_NET_FLAIR_ZP | FLAIR | GLSZM | Wavelet Texture |
| 4 | TEXTURE_GLSZM_NET_FLAIR_SZE | FLAIR | GLSZM | Wavelet Texture |
| 5 | TEXTURE_GLSZM_NET_FLAIR_ZSN | FLAIR | GLSZM | Wavelet Texture |
| 6 | TEXTURE_GLSZM_NET_T1_ZSN | T1 | GLSZM | Wavelet Texture |
| 7 | TEXTURE_GLSZM_NET_T1_SZE | T1 | GLSZM | Wavelet Texture |
| 8 | HISTO_ED_T2_Bin5 | T2 | First Order | Histogram |
| 9 | TEXTURE_GLSZM_NET_T1_ZP | T1 | GLSZM | Wavelet Texture |
GLRLM: Gray Level Run Length Matrix, GLSZM: Gray Level Size Zone Matrix.
Hyperparameter search range and optimal values for each learning algorithm.
| Learning Algorithm | Hyperparameter Range | Optimal Value |
|---|---|---|
| K-nearest neighbors (kNN) | n_neighbors = [1, 2, 3, .., 30] | 1 |
| weights = [uniform, distance] | uniform | |
| metric = [euclidean, manhattan, minkowski] | minkowski | |
| Random Forest | max_depth = [10, 20, 30, 40, 50, …, 100, 110, None] | None |
| max_features = [‘auto’, ‘sqrt’] | sqrt | |
| min_samples_leaf = [1, 2, 3, 4, 5] | 1 | |
| min_samples_split = [2, 4, 6, 8, 10, 12] | 10 | |
| n_estimators = [100, 200, 300, 400, 500, 600, …, 2000] | 2000 | |
| Support vector machine (SVM) | C = [0.001, 0.01, 0.1, 1, 10] | 0.001 |
| gamma = [0.001, 0.01, 0.1, 1] | 0.1 | |
| kernel = [rbf, linear, poly, sigmoid] | poly | |
| eXtreme Gradient Boosting (XGBoost) | min_child_weight = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] | 1 |
| gamma = [0.5, 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5] | 2.5 | |
| subsample = [0, 0.2, 0.4, 0.6, 0.8, 1] | 0.6 | |
| colsample_bytree = [0, 0.2, 0.4, 0.6, 0.8, 1] | 0.4 | |
| max_depth = [10, 20, 30, 40, 50, …, 100, 110, None] | 50 | |
| n_estimators = [100, 200, 300, 400, 500, 600, …, 2000] | 200 |
Figure 2Performance results of different machine learning classifiers in predicting MGMT methylation status. (A) All 704 radiomics features. (B) Nine top-rank radiomics features.
Comparison between our models and previously published works in predicting MGMT mutation status in GBMs.
| Biomarkers | Classifiers | Sensitivity | Specificity | Accuracy | |
|---|---|---|---|---|---|
| Jiang et al. | 15 features | Mann–Whitney test | 0.821 | 0.857 | 0.886 |
| Levner et al. | 8 features | L1-regularized neural networks | 0.854 | 0.9 | 0.877 |
| Xi et al. | 63 features | Support vector machine | 0.888 | 0.838 | 0.866 |
| Crisi et al. | 14 features | Multilayer perceptron | 0.75 | 0.85 | - |
| Ahn et al. | - | Mann–Whitney | 0.563 | 0.852 | - |
| Korfiatis et al. | 4 features | Support vector machine | 0.803 | 0.813 | - |
| Sasaki et al. | 5 features | LASSO | 0.67 | 0.66 | 0.67 |
| Our study | 9 features | XGBoost | 0.88 | 0.887 | 0.887 |
LASSO: least absolute shrinkage and selection operator, “-” indicates that this value has not been reported in the original paper.
Figure 3An illustrative example of how to segment the MRI scan of a patient with GBM into three different regions (non-enhanced part of tumor core (NET): green, enhanced part of tumor core (ET): blue, peritumoral edema (ED): yellow, GT: ground truth).