| Literature DB >> 35645718 |
Yukun Liu1, Tianshi Li1, Ziwen Fan1, Yiming Li1, Zhiyan Sun2, Shaowu Li2, Yuchao Liang1, Chunyao Zhou1, Qiang Zhu1, Hong Zhang1, Xing Liu2, Lei Wang1, Yinyan Wang1,3,4.
Abstract
Purpose: The majority of solitary brain metastases appear similar to glioblastomas (GBMs) on magnetic resonance imaging (MRI). This study aimed to develop and validate an MRI-based model to differentiate intracranial metastases from GBMs using automated machine learning. Materials andEntities:
Keywords: automated machine learning; glioblastoma; image-based differentiation; intracranial metastasis; prediction
Year: 2022 PMID: 35645718 PMCID: PMC9133479 DOI: 10.3389/fnins.2022.855990
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
FIGURE 1Study flowchart. TCGA, the cancer genome atlas; TPOT, tree-based pipeline optimization tool.
FIGURE 2The process of automating machine learning and the predictive performance of the models it builds. (A) The two-layer loops for 10-fold nested cross-validation. (B) The process of automating machine learning modeling and validation using TPOT. (C) AUCs for the models built by the 10-fold nested cross-validation during training, testing, and validation. AUC, areas under the receiver operating characteristic curve; TPOT, tree-based pipeline optimization tool.
Age and sex distribution.
| Overall | Metastases | Glioblastomas |
| Training and testing | Validation |
| |
| Age (year ± SD) | 53.2 ± 13.2 | 54.3 ± 10.1 | 52.1 ± 15.8 | 0.01 | 52.3 ± 13.2 | 56.3 ± 12.7 | <0.01 |
| Sex ( | 0.34 | 0.25 | |||||
| Male | 523 (0.56) | 261 (0.55) | 262 (0.57) | 403 (0.57) | 120 (0.53) | ||
| Female | 412 (0.44) | 217 (0.45) | 195 (0.43) | 305 (0.43) | 107 (0.47) |
SD, standard deviation.
Areas under the receiver operating characteristic curves for the 30 predictive models during training, testing, and validation.
| Radiomics features | Cohort | Models trained by different training groups | Average | |||||||||
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |||
|
| ||||||||||||
| CE | Training | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
|
| ||||||||||||
| Testing | 0.817 | 0.841 | 0.835 | 0.848 | 0.911 | 0.854 | 0.914 | 0.816 | 0.879 | 0.848 | 0.856 | |
|
| ||||||||||||
| Validation | 0.661 | 0.671 | 0.700 | 0.665 | 0.692 | 0.663 | 0.679 | 0.691 | 0.673 | 0.698 | 0.679 | |
|
| ||||||||||||
| T2 | Training | 1.000 | 1.000 | 1.000 | 0.999 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
|
| ||||||||||||
| Testing | 0.959 | 0.998 | 0.994 | 0.969 | 0.982 | 0.977 | 0.988 | 0.964 | 0.958 | 0.973 | 0.976 | |
|
| ||||||||||||
| Validation | 0.802 | 0.839 | 0.812 | 0.837 | 0.830 | 0.838 | 0.824 | 0.822 | 0.840 | 0.837 | 0.828 | |
|
| ||||||||||||
| CE and T2 | Training | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
|
| ||||||||||||
| Testing | 0.973 | 1.000 | 0.998 | 0.986 | 0.986 | 0.986 | 0.999 | 0.975 | 0.984 | 0.990 | 0.988 | |
|
| ||||||||||||
| Validation | 0.817 | 0.848 | 0.864 | 0.847 | 0.838 | 0.882 | 0.834 | 0.835 | 0.864 | 0.826 | 0.846 | |
CE, contrast-enhanced T1-weighted.
FIGURE 3Receiver operating characteristic curves for the models during the corresponding testing groups.
FIGURE 4Receiver operating characteristic curves for the groups of models during validation.
The predicting performance of the groups of models in the validation cohort.
| Models based on | Accuracy | Precision | Specificity | Sensitivity | AUC |
| CE features | 0.656 | 0.698 | 0.660 | 0.653 | 0.687 |
| T2 features | 0.749 | 0.768 | 0.718 | 0.774 | 0.831 |
| CE and T2 features | 0.784 | 0.805 | 0.767 | 0.798 | 0.867 |
CE, contrast-enhanced T1-weighted.
The distribution of features included in all model of the best algorithms.
| Features | CE | T2-weighted | Total | ||
| Shape | 2 | 1 | 3 | ||
| Original | Derived | Original | Derived | 0 | |
| First-order | 3 | 23 | 0 | 22 | 48 |
| Gray-level co-occurrence matrix (GLCM) | 0 | 7 | 1 | 26 | 34 |
| Gray-level run-length matrix (GLRLM) | 0 | 7 | 3 | 17 | 27 |
| Gray-level size zone matrix (GLSZM) | 0 | 2 | 0 | 18 | 20 |
| Gray-level dependence matrix (GLDM) | 0 | 2 | 0 | 15 | 17 |
| Total | 46 | 103 | 149 | ||
CE, contrast-enhanced T1-weighted.
FIGURE 5Heatmap of the 149 key features. The values of the features were normalized using min–max normalization. The values of contrast-enhanced T1-weighted features are represented by white to orange (0–1), and values of T2-weighted features are represented by white to blue. CE, contrast-enhanced T1-weighted; TCGA, the cancer genome atlas.