| Literature DB >> 31464116 |
Yae Won Park1,2, Yoon Seong Choi3, Sung Soo Ahn2, Jong Hee Chang4, Se Hoon Kim5, Seung Koo Lee2.
Abstract
OBJECTIVE: To assess whether radiomics features derived from multiparametric MRI can predict the tumor grade of lower-grade gliomas (LGGs; World Health Organization grade II and grade III) and the nonenhancing LGG subgroup.Entities:
Keywords: Grade; Lower-grade glioma; Magnetic resonance imaging; Radiomics; The Cancer Genome Atlas
Mesh:
Year: 2019 PMID: 31464116 PMCID: PMC6715562 DOI: 10.3348/kjr.2018.0814
Source DB: PubMed Journal: Korean J Radiol ISSN: 1229-6929 Impact factor: 3.500
Fig. 1Patient enrollment process for entire LGG group and nonenhancing LGG subgroup in (A) institutional cohort and (B) TCGA cohort.
FLAIR = fluid-attenuated inversion recovery, LGG = lower-grade glioma, TCGA = The Cancer Genome Atlas, TCIA = The Cancer Imaging Archive, T1C = contrast-enhanced T1-weighted
Fig. 2Workflow of image processing, radiomics feature extraction, and machine learning.
GLCM = gray level co-occurrence matrix, GLRLM = gray level run-length matrix, GLSZM = gray level size zone matrix, ROC = receiver operating characteristic
Patient Characteristics in Entire LGG Group
| Variables | Institutional Cohort (n = 204) | TCGA Validation Set (n = 99) | |||
|---|---|---|---|---|---|
| Training Set (n = 136) | Test Set (n = 68) | Total (n = 204) | |||
| Age (mean ± SD) | 44.99 ± 12.94 | 44.00 ± 12.33 | 44.66 ± 12.74 | 46.96 ± 13.95 | 0.154 |
| Sex | 0.222 | ||||
| Male | 65 (47.8) | 43 (63.2) | 108 (52.9) | 55 (55.6) | |
| Female | 71 (52.2) | 25 (36.8) | 96 (47.1) | 54 (44.4) | |
| Grade | 0.075 | ||||
| II | 81 (59.6) | 40 (58.8) | 121 (59.3) | 48 (48.5) | |
| III | 55 (40.4) | 28 (41.2) | 83 (40.7) | 51 (51.5) | |
Data are number of patients. Numbers in parentheses are percentage. *p values were calculated using Student's t test for continuous variables and chi-square test for categorical variables, to compare patient characteristics of institutional cohort (n = 204) and TCGA validation set (n = 99). LGG = lower-grade glioma, SD = standard deviation, TCGA = The Cancer Genome Atlas
Patient Characteristics in Nonenhancing LGG Subgroup
| Variables | Institutional Cohort (n = 110) | TCGA Validation Set (n = 37) | |||
|---|---|---|---|---|---|
| Training Set (n = 73) | Test Set (n = 37) | Total (n = 110) | |||
| Age (mean ± SD) | 43.18 ± 11.76 | 43.32 ± 10.28 | 44.24 ± 11.34 | 43.56 ± 15.13 | 0.805 |
| Sex | 0.152 | ||||
| Male | 39 (53.4) | 14 (37.8) | 53 (48.2) | 17 (45.9) | |
| Female | 34 (46.6) | 23 (62.2) | 57 (51.8) | 20 (54.1) | |
| Grade | 0.247 | ||||
| II | 61 (83.6) | 31 (83.8) | 92 (83.6) | 27 (73.0) | |
| III | 12 (16.4) | 6 (16.2) | 18 (16.4) | 10 (27.0) | |
Data are number of patients. Numbers in parentheses are percentage. *p values were calculated using Student's t test for continuous variables and chi-square test for categorical variables, to compare patient characteristics of institutional cohort (n = 110) and TCGA validation set (n = 37).
Fig. 3Heatmap of AUC values.
Heat map of AUC values from machine learning classifier to predict grade (A) in entire LGG group in internal validation for institutional test set (n = 136) after training on institutional training set (n = 68) and entire LGG group in external validation for TCGA validation set (n = 99) after training on entire institutional cohort (n = 204); and (B) in nonenhancing LGG subgroup in internal validation for institutional test set (n = 73) after training on institutional training set (n = 37) and nonenhancing LGG subgroup in external validation on TCGA cohort (n = 37) after training on entire nonenhancing institutional cohort (n = 110). AUC = area under curve, GBM = gradient boosting machine, LDA = linear discriminant analysis, RF = random forest, RFE = recursive feature elimination, ROSE = random over-sampling examples, SMOTE = synthetic minority over-sampling technique
Performance of Best Machine Learning Classifiers in Grade Prediction for Entire LGG Group and Nonenhancing LGG Subgroup in Internal and External Validations
| Cohort | Validation | Subsampling | AUC (95% CI) | Accuracy | Sensitivity | Specificity | NIR | |
|---|---|---|---|---|---|---|---|---|
| Entire LGG | Internal | 0.85 (0.76–0.94) | 79.4% | 92.9% | 70.0% | 58.8% | < 0.001 | |
| Entire LGG | TCGA | 0.72 (0.62–0.82) | 66.7% | 72.6% | 60.4% | 51.5% | 0.002 | |
| Nonenhancing LGG | Internal | 0.82 (0.66–0.97) | 78.4% | 83.3% | 77.4% | 83.8% | 0.866 | |
| Nonenhancing LGG | TCGA | 0.68 (0.49–0.87) | 72.2% | 55.6% | 77.8% | 75.0% | 0.725 |
AUC = area under curve, CI = confidence interval, GBM = gradient boosting machine, NIR = no-information rate, RF = random forest, RFE = recursive feature elimination, ROSE = random over-sampling examples