| Literature DB >> 35127472 |
Chen Sun1, Liyuan Fan2, Wenqing Wang1, Weiwei Wang3, Lei Liu4, Wenchao Duan1, Dongling Pei1, Yunbo Zhan1, Haibiao Zhao1, Tao Sun1, Zhen Liu1, Xuanke Hong1, Xiangxiang Wang1, Yu Guo1, Wencai Li3, Jingliang Cheng5, Zhicheng Li4, Xianzhi Liu1, Zhenyu Zhang1, Jing Yan5.
Abstract
BACKGROUND: Isocitrate dehydrogenase (IDH) mutation and 1p19q codeletion status have been identified as significant markers for therapy and prognosis in lower-grade glioma (LGG). The current study aimed to construct a combined machine learning-based model for predicting the molecular subtypes of LGG, including (1) IDH wild-type astrocytoma (IDHwt), (2) IDH mutant and 1p19q non-codeleted astrocytoma (IDHmut-noncodel), and (3) IDH-mutant and 1p19q codeleted oligodendroglioma (IDHmut-codel), based on multiparametric magnetic resonance imaging (MRI) radiomics, qualitative features, and clinical factors.Entities:
Keywords: Visually Accessible Rembrandt Images; lower-grade glioma; machine learning; molecular subtypes; radiomics
Year: 2022 PMID: 35127472 PMCID: PMC8814098 DOI: 10.3389/fonc.2021.756828
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1The workflow of this study.
Distribution of patient characteristics in the training cohort and testing cohort.
| Characteristic | Overall (n = 335) | Training Cohort (n = 269) | Testing Cohort (n = 66) | p-Value |
|---|---|---|---|---|
| Gender | 0.9417 | |||
| Male | 189 (53.24%) | 151 (53.94%) | 38 (64.20%) | |
| Female | 146 (43.58%) | 118 (46.06%) | 28 (35.80%) | |
| Age(year)* | 44.93 ± 12.47 | 44.59 ± 12.27 | 46.27 ± 13.24 | 0.3519 |
| Molecular subtypes | 0.7837 | |||
| IDHwt | 94 (28.06%) | 76(29.13%) | 18(24.69%) | |
| IDHmut-noncodel | 110 (32.84%) | 86 (31.89%) | 24 (35.80%) | |
| IDHmut-codel | 131 (39.10%) | 107 (38.98%) | 24 (39.51%) |
Unless otherwise noted, data are numbers of patients, with percentages in parentheses.
*Data are means ± standard deviations.
Selected radiomics features for predicting the molecular subtypes of lower-grade glioma patients.
| No. | Selected Features | Type | Sequence | Filter | pFDR |
|---|---|---|---|---|---|
| f1 | Interquartile range | Intensity | ADC | Original | <0.001 |
| f2 | Skewness | Intensity | ADC | Original | <0.001 |
| f3 | NGTDM Complexity | Texture | ADC | log-sigma-3-0-mm | <0.001 |
| f4 | GLCM ClusterShade | Texture | ADC | log-sigma-5-0-mm | <0.001 |
| f5 | GLRLM RunVariance | Texture | ADC | log-sigma-5-0-mm | <0.001 |
| f6 | Median | Intensity | ADC | Wavelet. HLL | <0.001 |
| f7 | GLCM ClusterShade | Texture | ADC | Wavelet. HLL | <0.001 |
| f8 | GLCM Imc1 | Texture | FLAIR | log-sigma-3-0-mm | <0.001 |
| f9 | GLRLM RunVariance | Texture | FLAIR | log-sigma-4-0-mm | <0.001 |
| f10 | Skewness | Intensity | FLAIR | Wavelet. LHL | <0.001 |
| f11 | GLRLM GrayLevelNonUniformityNormalized | Texture | T1WI | Wavelet. LLH | <0.001 |
| f12 | GLRLM RunVariance | Texture | T1WI | Wavelet. LHH | <0.001 |
| f13 | GLCM SumEntropy | Texture | T1WI | Wavelet. HLL | <0.001 |
| f14 | GLDM LargeDependenceEmphasis | Texture | T1WI | Wavelet. HLL | <0.001 |
| f15 | Skewness | Intensity | T1WI | Wavelet. LLL | <0.001 |
| f16 | GLRLM LongRunHighGrayLevelEmphasi | Texture | T1WI | Wavelet. LLL | <0.001 |
| f17 | Skewness | Intensity | CE-T1WI | Original | <0.001 |
H and L were high- and low-pass filters in wavelet transform, respectively. pFDR is short for false discovery rate-adjusted p-value.
Figure 2The receiving operating characteristics (ROC) curves of the combined model, the radiomics model, the clinical model, and the qualitative model on the (A–C) training cohort and (D–F) testing cohort.
Summary of the subtype-specific classification performance of the radiomics model.
| Molecular subgroups | Cohorts | AUC | BAL_ACC | SEN | SPE |
|---|---|---|---|---|---|
| IDHwt | Training | 0.8121 (0.7559-0.8682) | 0.7782 (0.6989-0.8401) | 0.7895 (0.6444-0.8947) | 0.7668 (0.6528-0.8912) |
| Testing | 0.6557 (0.5084-0.8029) | 0.6806 (0.4394-0.8181) | 0.7778 (0.2778-100.00) | 0.5833 (0.2292-0.9792) | |
| IDHmt-noncodel | Training | 0.7384 (0.6739-0.8030) | 0.7052 (0.5462-0.7510) | 0.8256 (0.5462-0.9186) | 0.5847 (0.4699-0.8306) |
| Testing | 0.6830 (0.5478-0.8183) | 0.7232 (0.5758-0.8333) | 0.7083 (0.5000-0.9177) | 0.7381 (0.3810-0.8810) | |
| IDHmt-codel | Training | 0.7905 (0.7351-0.8459) | 0.7595 (0.6989-0.8104) | 0.7103 (0.5888-0.8598) | 0.8086 (0.6420-0.8827) |
| Testing | 0.7579 (0.6359-0.8799) | 0.7500 (0.6212-0.8636) | 0.6667 (0.4583-0.9594) | 0.8333 (0.4524-0.9762) |
BAL_ACC, SEN, and SPE are short for balanced accuracy, sensitivity, and specificity, respectively. The 95% confidence interval for each index is shown.
Summary of the subtype-specific classification performance of the combined model.
| Molecular subtypes | Cohorts | AUC | BAL_ACC | SEN | SPE |
|---|---|---|---|---|---|
| IDHwt | Training | 0.8414 (0.7906-0.8922) | 0.7911 (0.6876-0.8476) | 0.7895 (0.6184-0.9342) | 0.7927 (0.6062-0.9119) |
| Testing | 0.8623 (0.7453-0.9792) | 0.8924 (0.8182-0.9697) | 0.8889 (0.7222-1.0000) | 0.8958 (0.7917-0.9792) | |
| IDHmt-noncodel | Training | 0.8190 (0.7604-0.8776) | 0.7761 (0.7212-0.8699) | 0.6395 (0.5000-0.8488) | 0.9126 (0.6632-0.9781) |
| Testing | 0.8056 (0.6833-0.9278) | 0.8066 (0.7424-0.9091) | 0.7083 (0.5000-0.9167) | 0.9048 (0.7381-1.0000) | |
| IDHmt-codel | Training | 0.8193 (0.7695-0.8692) | 0.7552 (0.6877-0.8104) | 0.7944 (0.5514-0.9159) | 0.7160 (0.5802-0.9198) |
| Testing | 0.8036 (0.6898-0.9173) | 0.8095 (0.7121-0.8939) | 0.8333 (0.5833-0.9583) | 0.7857 (0.6667-0.9524) |
BAL_ACC, SEN, and SPE are short for balanced accuracy, sensitivity, and specificity, respectively. The 95% confidence interval for each index is shown.
Figure 3Heat map of the subtype-specific importance of all features used in subtype classification.