| Literature DB >> 35795472 |
Rachel Madhogarhia1, Debanjan Haldar2, Sina Bagheri3, Ariana Familiar3, Hannah Anderson3, Sherjeel Arif4, Arastoo Vossough3, Phillip Storm2, Adam Resnick3, Christos Davatzikos4, Anahita Fathi Kazerooni4, Ali Nabavizadeh3.
Abstract
The current era of advanced computing has allowed for the development and implementation of the field of radiomics. In pediatric neuro-oncology, radiomics has been applied in determination of tumor histology, identification of disseminated disease, prognostication, and molecular classification of tumors (ie, radiogenomics). The field also comes with many challenges, such as limitations in study sample sizes, class imbalance, generalizability of the methods, and data harmonization across imaging centers. The aim of this review paper is twofold: first, to summarize existing literature in radiomics of pediatric neuro-oncology; second, to distill the themes and challenges of the field and discuss future directions in both a clinical and technical context.Entities:
Keywords: brain tumors; neuro-oncology; pediatrics; radiogenomics; radiomics
Year: 2022 PMID: 35795472 PMCID: PMC9252112 DOI: 10.1093/noajnl/vdac083
Source DB: PubMed Journal: Neurooncol Adv ISSN: 2632-2498
Figure 1.Overview of a typical radiomics workflow.
First, the dataset of images and any relevant clinical or genomic data is gathered. Here, T1w, T1wCE, T2w, and FLAIR images are shown. Next, the images are preprocessed through steps such as co-registration and skull stripping. Here, the processed images are shown. Then, tumors are usually segmented by experienced radiologists. Here, the different colors showcase different segmented components on the processed images. Various features are then extracted from each image, usually on the order of hundreds of features per patient. Here, a sample histogram is shown to represent histogram-based features (created in MATLAB). From these features, various models are built, and feature reduction/selection is typically performed to find the most predictive features to prevent overfitting. Random Forest is diagrammed here. Each model built can vary along a few parameters, such as: algorithm (eg, SVM vs RF vs kNN) as well as parameters, feature reduction/selection method, data included (eg, image only vs combined image and clinical data), or image modalities used (eg, T1w vs T2w vs T1w and T2w combined). Finally, each model’s performance is evaluated on a validation and/or external test set based on various metrics such as area under the curve. Here, a sample ROC curve is shown (created in MATLAB).
Summary of Radiomic Papers from in Pediatric Neuro-Oncology
| Paper | Relevant Tumors | Patient Age Group (Years) | # Subjects and Breakdown, If applicable | Single or Multi- Institutional (# Centers) | Radiomics Endpoint | Images Used | Segmentation Method used | Feature Selection/Reduction Method(s) | Model(s) | Performance of Best Model(s), Generally in Terms of AUC |
|---|---|---|---|---|---|---|---|---|---|---|
| Fetit et al.[ | Pilocytic astrocytoma | n/a | 48 Total breakdown: | Single | Differentiate between PA, MB, EP, comparing 2D and 3D texture analysis | T1w | Semi- automatic | Entropy-MDL discretization and PCA | NB, kNN, classification tree, SVM, ANN, LR | Best models were LR and ANN on 3D texture features with entropy-MDL-based selection. AUC was 99% (leave-one-out cross- validation and 10-fold cross-validation) |
| Fetit et al.[ | Pilocytic astrocytoma | n/a | 121 Total breakdown: | Multi (3) | Differentiate between PA, MB, EP with 3D texture analysis | T1w | Semi- automatic | Entropy-MDL discretization | C-SVM | Overall AUC: 85% (leave-one-out cross- validation) |
| Fetit et al.[ | Pilocytic astrocytoma | n/a | 134 Total breakdown: 71 PA | multi (3) | Differentiate between PA, MB, EP | T1w | Semi- automatic | Relief, entropy-MDL, and combination of Relief and entropy- MDL | C-SVM. Included oversampling on EP. | Mean AUC: 76% (on unseen test set from a different center in pairwise testing). AUC on combined dataset: 86% (before oversampling on EP) and 92% (after oversampling on EP) (leave-one-out cross- validation). |
| Hara et al.[ | Embryonal brain tumors | Median: 6.9 | 34 Total | Single | Differentiate between various histologies, identify tumors with neuraxis metastases, identify patients at risk of recurrence, and predict survival outcomes from preoperative imaging | T1wCE | Manual | Selected features that had the largest observed variance within the cohort, and selected ones with physician- defined prognostic value for further analysis | LR to predict sex and M status, multinomial LR for histology, and Cox regression for recurrence and survival outcomes | For histology, key features included size and texture features. For neuraxis metastases, predictive features included tumor diameter (AUC = 0.74) and neighborhood gray tone coarseness (AUC = 0.7). For recurrence, AUC was 0.7 for predictive features such as tumor volume and neighborhood gray tone coarseness. |
| Dasgupta et al.[ | Medulloblastoma | Median: 9range 2–48 | 111 Total | Multi (n/a) | Predict MB molecular subgroup from preoperative imaging | Multiparametric MRI, includingT1wCE, T2w | n/a | Pearson chi-square test and Fisher’s exact test (on features extracted by observers, such as tumor location, maximum tumor size, and contrast enhancement characteristics, amongst others) | Logistic regression to develop binary nomograms for each subgroup | WNT: AUC of 0.693. |
| Goya Outi et al.[ | Diffuse intrinsic pontine glioma | Mean: 7.4 | 38 Total breakdown: | Single | Predict H3 mutation status | T1w | n/a | Multi-level feature selection, including intra-class correlation coefficient, AUC, and hierarchical clustering using spearman’s correlation coefficient | SVM, kNN, RF. Included oversampling on minority class (H3.1) | Best model was SVM with combined imaging and clinical features. This model had F1-weighted score of 0.84 (leave-one-out cross-validation) |
| Iv et al.[ | Medulloblastoma | Range: 1 to 18, mean: 8.56 | 109 Total | Multi (3) | Predict MB molecular subgroup | T1wCE | Manual | Wilcoxon rank sum test | SVM | For double 10-fold cross-validation on combined data, best model used both T1wCE and T2w. AUC: 0.79 (SHH), 0.45 (WNT), 0.70 (Group 3), 0.83 (Group 4).T2w-only model had slightly better performance on WNT (0.63). |
| Zhou et al.[ | Pilocytic astrocytoma | Range: 0.25–18, mean: 8.6 | 288 Total | multi (4) | Differentiate between PA, MB, EP | T1wCE | Manual | Chi-squared score, analysis of variance, T-test, Fisher, Relief, Wilcoxon, mutual information, minimum redundancy/ maximum relevance, conditional infomax, joint mutual information, conditional mutual information maximization, interaction capping, double input symmetric relevance, mutual information maximization | Neural network, decision tree, boosting, Bayesian, bagging, RF, SVM, linear discriminant analysis, kNN, generalized linear model. Compared automated optimization of pipeline (with TPOT) with manual optimization of feature selection and classification. | Multiclass classification: micro-averaged AUC was 0.91 from TPOT and was 0.92(chi- squared + generalized linear model) from manual expert optimization (test set). Binary classifiers from TPOT had AUC: 0.94 (MB), 0.84 (EP), 0.94 (PA) (test set). Binary classifiers from manual expert optimization had AUC: 0.98 (MB), 0.70 (EP), 0.93 (PA) (test set). |
| Grist et al.[ | Pilocytic Astrocytoma | n/a | 49 Total breakdown: | Multi (4) | Differentiate between low- grade (PA) and high-grade (EP, MB), as well as differentiate between PA, MB, and EP | T1w | n/a | PCA and UA | Single layer Neural Network, AdaBoost, RF, SVM, and kNN. Tried oversampling on EP. | AdaBoost with univariate reduction achieved 85% balanced accuracy (3-fold cross-validation) |
| Li et al.[ | Ependymoma | Range: 0–14 mean: 7 | 45 Patients, 135 slices total | Single | Differentiate between EP and PA | T1w | Manual | KWT | SVM | AUC = 0.88 (validation set) |
| Quon et al.[ | Diffuse midline glioma | Range: 0.21–34 | 617 Total breakdown: | Multi (5) | Detection and classification of posterior fossa tumors | T1wCE | Manual (identification of tumor vs. no tumor on slices) | n/a | Deep-learning architectures (ResNet, ResNeXt, DenseNet, InceptionV3). Used transfer learning. Final prediction made from aggregate slice-level predictions from ensemble of 5 models | 2D ResNeXt-50-32x4d trained on T2w features had AUC of 0.99 for tumor detection. For classification, accuracy was 92% and F1 was 0.80 (held-out test set). Model’s tumor detection accuracy was similar to 4 radiologists; model’s classification accuracy and F1 score was higher than 2/4 radiologists |
| Pisapia et al.[ | Optic pathway gliomas | Range: 2–18 | 38 Total breakdown: | Single | Predict progression (defined as radiographic tumor growth or vision decline) | T1w | Manual | n/a | SVM | Model that included features defined as the change in features between pairwise combinations of imaging studies done before progression scan had accuracy: 86% (leave-out-two cross- validation) |
| Prince et al.[ | Adamantinomatous craniopharyngioma | n/a | 39 Total | Multi (18) | Identify ACP | T1wCT | n/a | n/a | Various pretrained deep- learning neural networks; used transfer learning, genetic algorithm to optimize, and data augmentation | Accuracy of 87.8% for model using features from both MRI and CT, 83.3% for MRI only, and 85.3 for CT only (test set). Model performed on par with average of two human specialists. |
| Tam et al.[ | Diffuse intrinsic pontine glioma | Range: 1.58– 19.08mean: 6.67 | 177 Total | Multi (11) | Prognostication (predict overall survival) | T1wCE | Manual | Features chosen based on lambda value with minimum cross-validated error across 100 repetitions of 10-fold cross-validation of fitting a Cox regression model | Cox proportional hazards model | Model using both radiomic and clinical features: concordance was 0.70 (training set) and 0.59 (testing set) |
| Wagner et al.[ | Low-grade gliomas | Mean: 9.21 | 115 Total | Multi (2) | Predict BRAF molecular status (fusion and V600E point mutation) from imaging | FLAIR | Semi- automatic | n/a | RF | AUC = 0.75 (internal 4-fold cross-validation) and 0.85 (external validation on a cohort from a separate center than training data) |
| Novak et al.[ | Pilocytic astrocytoma | Range: 1.0–16.3 | 124 Total breakdown:36 PA | Multi (12) | Differentiate between posterior fossa tumors | T1w | Manual | PCA | NB, RF | Overall classification accuracy: 86.3% for RF and 84.6% for NB (10- fold cross-validation). NB classified more EP and PA cases correctly than RF, while RF classified more MB cases correctly than NB |
| Dong et al.[ | Ependymoma | Range: 0–15 | 51 Total breakdown: | Single | Distinguish between EP and MB | T1wCE | Semi- automatic | UA, UAS, MLR | kNN, AdaBoost, RF, SVM | Best model was RF with multivariable logistic regression for feature selection. AUC = 0.91 (10-fold cross- validation) |
| Zheng et al.[ | Medulloblastoma | Mean: 5.6 | 124 Total breakdown: | Single | Predict CSF dissemination | T1w (both head and spine) | Manual | mRMR and LASSO | Multivariable logistic regression | Best model used combined clinical and radiomic features. AUC: 0.87 (internal validation cohort) and 0.73 (external validation cohort) |
Abbreviations: ADC, Apparent Diffusion Coefficient; CSF, Cerebrospinal Fluid; DTI, Diffusion Tensor Imaging; DWI, Diffusion Weighted Imaging; FLAIR, Fluid-attenuated inversion recovery.
an/a indicates that the information was not clearly specified (eg, age was not specified for cohort or segmentation methodology was not identified) or the column was not relevant to the particular study (eg, segmentation was not performed).
Abbreviations
| Brain tumors | |
| LGG | Low-Grade Glioma |
| HGG | High-Grade Glioma |
| PA | Pilocytic Astrocytoma |
| MB | Medulloblastoma |
| EP | Ependymoma |
| DIPG | Diffuse Intrinsic Pontine Glioma |
| DMG | Diffuse Midline Glioma |
| ACP | Adamantinomatous Craniopharyngioma |
| OPG | Optic Pathway Glioma |
| Classification algorithms | |
| NB | Naïve Bayes |
| RF | Random Forest |
| kNN | k-Nearest Neighbors |
| SVM | Support Vector Machine |
| C-SVM | Cost-Based Support Vector Machine |
| CNN | Convolutional Neural Network |
| GBT | Gradient Boosted Trees |
| LR | Logistic Regression |
| Feature reduction/selection methods | |
| PCA | Principal Component Analysis |
| UA | Univariate Analysis |
| UAS | Univariate Analysis Screening |
| MLR | Multivariate Logistic Regression |
| KWT | Kruskal-Wallis Test |
Figure 2.Data platforms for pediatrics.
In CAVATICA, researchers can rapidly run computational analysis on datasets. In PedcBioPortal, users can view analytics on datasets without needing any knowledge of programming or bioinformatics tools. In Flywheel, users can manage and process imaging data.