Literature DB >> 35795472

Radiomics and radiogenomics in pediatric neuro-oncology: A review.

Rachel Madhogarhia¹, Debanjan Haldar², Sina Bagheri³, Ariana Familiar³, Hannah Anderson³, Sherjeel Arif⁴, Arastoo Vossough³, Phillip Storm², Adam Resnick³, Christos Davatzikos⁴, Anahita Fathi Kazerooni⁴, Ali Nabavizadeh³.

Abstract

The current era of advanced computing has allowed for the development and implementation of the field of radiomics. In pediatric neuro-oncology, radiomics has been applied in determination of tumor histology, identification of disseminated disease, prognostication, and molecular classification of tumors (ie, radiogenomics). The field also comes with many challenges, such as limitations in study sample sizes, class imbalance, generalizability of the methods, and data harmonization across imaging centers. The aim of this review paper is twofold: first, to summarize existing literature in radiomics of pediatric neuro-oncology; second, to distill the themes and challenges of the field and discuss future directions in both a clinical and technical context.

Entities: Chemical

Keywords: brain tumors; neuro-oncology; pediatrics; radiogenomics; radiomics

Year: 2022 PMID： 35795472 PMCID： PMC9252112 DOI： 10.1093/noajnl/vdac083

Source DB: PubMed Journal: Neurooncol Adv ISSN： 2632-2498

The current era of advanced computing has led to the emergence of the field of radiomics.[1] Radiomics, in brief, is the “high-throughput extraction of large amounts of imaging features” from clinically acquired radiologic images.[2] This data is then compiled into mineable databases and can be utilized in a variety of applications, ranging from hypothesis generation to predictive modeling to clinical decision making.[3] The large-scale impact of the application of this technology becomes clear when one considers the vast amount of underutilized radiologic data generated through routine patient care.[1] Within the field of neuro-oncology, patient tumors are diagnosed and followed through a series of cross-sectional imaging including, but not limited to, CT, MRI, and PET scans. Although each of these scans can contain millions of voxels worth of data, their analysis is usually limited to qualitative assessments performed by individual neuroradiologists.[1,3] This practice leaves much of the generated data underutilized in the clinical setting and creates a niche that can be filled through the implementation of radiomics. Radiogenomics, as it is referred to in this review, or imaging genomics, is defined as the integration of radiomics with alterations in molecular and genomic data or using machine learning (ML) methods based on radiomic features to find noninvasive and in vivo signatures to predict molecular alterations in tumors.[4] This technology has shown great potential in the field of neuro-oncology as many disease subtypes have been defined based on their genetic and molecular profiles by the 2021 WHO guidelines.[5] Some theoretical applications of this technology could allow for the development of noninvasive, “virtual biopsies” which can bridge gaps in histopathological sampling that occur after initial diagnosis when disease progresses or in situations where sophisticated molecular analyses are not readily available.[4] These technologies have been increasingly applied to a variety of fields and have shown clinical utility in neuro-oncological applications.[6-8] For example, in adult glioma populations, radiomics have been applied to help to characterize molecular subtypes of tumors, differentiate between treatment effects and tumor recurrence,[9] and allow for more accurate survival stratification.[10-13] Furthermore, radiomic profiles of disease could be utilized by clinicians to optimize therapy plans within this group of tumors.[14] Relative to its adult counterpart, the field of pediatric neuro-oncology has a relative dearth of radiomic studies. The cause of this disparity is unclear and likely multifactorial. One explanation for this could be the simple fact that, among oncological patients, there are far fewer children than adults, and therefore there is less data available upon which to build radiomic models. According to the Central Brain Tumor Registry of the United States (CBTRUS), the incidence rate of brain and other CNS tumors from 2013 to 2017 was about five times higher in adults than in children.[15] Despite the relative difference in volumes of patients between adult and pediatric populations, childhood brain and other CNS cancers surpass all other cancers as the primary reason for cancer mortality in children.[15] Thus, there is a clear need for further work within these fields. Figure 1 provides an overview of the workflow commonly used in radiomic studies. We refer the readers interested in learning more about principles of radiomics and radiogenomics studies and their applications to some of the existing excellent review papers on this topic.[1,2,6] The aim of this review paper is twofold: first, to summarize existing literature in radiomics/radiogenomics of pediatric neuro-oncology; second, to distill the themes and challenges of the field and discuss future directions in both a clinical and technical context.

Figure 1.

Overview of a typical radiomics workflow.

First, the dataset of images and any relevant clinical or genomic data is gathered. Here, T1w, T1wCE, T2w, and FLAIR images are shown. Next, the images are preprocessed through steps such as co-registration and skull stripping. Here, the processed images are shown. Then, tumors are usually segmented by experienced radiologists. Here, the different colors showcase different segmented components on the processed images. Various features are then extracted from each image, usually on the order of hundreds of features per patient. Here, a sample histogram is shown to represent histogram-based features (created in MATLAB). From these features, various models are built, and feature reduction/selection is typically performed to find the most predictive features to prevent overfitting. Random Forest is diagrammed here. Each model built can vary along a few parameters, such as: algorithm (eg, SVM vs RF vs kNN) as well as parameters, feature reduction/selection method, data included (eg, image only vs combined image and clinical data), or image modalities used (eg, T1w vs T2w vs T1w and T2w combined). Finally, each model’s performance is evaluated on a validation and/or external test set based on various metrics such as area under the curve. Here, a sample ROC curve is shown (created in MATLAB).

Overview of a typical radiomics workflow. First, the dataset of images and any relevant clinical or genomic data is gathered. Here, T1w, T1wCE, T2w, and FLAIR images are shown. Next, the images are preprocessed through steps such as co-registration and skull stripping. Here, the processed images are shown. Then, tumors are usually segmented by experienced radiologists. Here, the different colors showcase different segmented components on the processed images. Various features are then extracted from each image, usually on the order of hundreds of features per patient. Here, a sample histogram is shown to represent histogram-based features (created in MATLAB). From these features, various models are built, and feature reduction/selection is typically performed to find the most predictive features to prevent overfitting. Random Forest is diagrammed here. Each model built can vary along a few parameters, such as: algorithm (eg, SVM vs RF vs kNN) as well as parameters, feature reduction/selection method, data included (eg, image only vs combined image and clinical data), or image modalities used (eg, T1w vs T2w vs T1w and T2w combined). Finally, each model’s performance is evaluated on a validation and/or external test set based on various metrics such as area under the curve. Here, a sample ROC curve is shown (created in MATLAB).

Pediatric Neuro-Oncology

WHO Classification of Brain Tumors

For all tumors of the central nervous system, the WHO CNS5 promotes the use of layered and integrated diagnoses to more comprehensively characterize lesions and capture the depth of information that makes them distinct clinical entities.[5] This layered structure includes an integrated diagnosis, comprised of histopathological classification, CNS WHO grade, and molecular information specific to the subtype. For example, a tumor could have an integrated diagnosis of “‘Diffuse low-grade glioma, MAPK pathway-altered’ subtype: Diffuse low-grade glioma, FGFR1 TKD-duplicated,” a histopathological classification of “Oligodendroglioma,” a CNS WHO grade of “Not assigned,” and molecular information that includes the specific molecular alteration and method of detection.[5] This tiered approach enables precision in both research and clinical settings and highlights the importance of the multimodal approaches to tumor characterization that have been implemented in recent years. It is likely that with advancements in radiomic and radiogenomic fields, features from these modalities will contribute to these layered classification systems, and so by becoming familiar with the current layout, researchers can find opportunities to better differentiate and classify tumors.

Literature Search Strategy

PubMed and Google Scholar databases were electronically searched to identify relevant studies published prior to May 2021 that included keywords in the title/abstract in the following categories: radiomics (such as “radiomics” or “radiogenomics”), brain tumors (such as “glioma” or “medulloblastoma” or “astrocytoma” or “ependymoma”), and pediatric (such as “pediatric” or “childhood”). Some terms were also searched in medical subject headings, and no limitations were included on the year of publication. Following the initial search, the articles listed in the references of identified studies were evaluated. The search returned 139 articles, 108 of which were excluded after title/abstract review. The remaining articles were reviewed, and studies that met any of the following criteria were excluded: fewer than 75% of patients were pediatric, ML was not used, brain MRI was not the main modality (eg, brain CT only or spine MRI only), no radiomic features were used (ie, studies only using MRS data were excluded), or the tumor was not in the brain (eg, studies on tumors in the spinal cord or elsewhere in the nervous system were excluded). Ultimately, 18 studies were included in the final analysis.

Summary of Radiomics Studies in Pediatric Neuro-Oncology

In this section, we will review radiomics and radiogenomics studies in pediatric neuro-oncology (summarized in Table 1). We have grouped studies by their endpoint, resulting in four categories: applications in determining tumor histology, applications in identifying disseminated diseases, applications in prognostication, and applications in molecular classification (ie, radiogenomics).

Table 1.

Summary of Radiomic Papers from in Pediatric Neuro-Oncology

Paper	Relevant Tumors	Patient Age Group (Years)	# Subjects and Breakdown, If applicable	Single or Multi- Institutional (# Centers)	Radiomics Endpoint	Images Used	Segmentation Method used	Feature Selection/Reduction Method(s)	Model(s)	Performance of Best Model(s), Generally in Terms of AUC
Fetit et al.[16]	Pilocytic astrocytoma Medulloblastoma Ependymoma	n/a^a	48 Total breakdown: 20 PA 21 MB 7 EPs	Single	Differentiate between PA, MB, EP, comparing 2D and 3D texture analysis	T1w T2w	Semi- automatic	Entropy-MDL discretization and PCA	NB, kNN, classification tree, SVM, ANN, LR	Best models were LR and ANN on 3D texture features with entropy-MDL-based selection. AUC was 99% (leave-one-out cross- validation and 10-fold cross-validation)
Fetit et al.[17]	Pilocytic astrocytoma Medulloblastoma Ependymoma	n/a	121 Total breakdown: 61 PA 42 MB 18 EP	Multi (3)	Differentiate between PA, MB, EP with 3D texture analysis	T1w T2w	Semi- automatic	Entropy-MDL discretization	C-SVM	Overall AUC: 85% (leave-one-out cross- validation)
Fetit et al.[18]	Pilocytic astrocytoma Medulloblastoma Ependymoma	n/a	134 Total breakdown: 71 PA 45 MB 18 EP	multi (3)	Differentiate between PA, MB, EP	T1w T2w	Semi- automatic	Relief, entropy-MDL, and combination of Relief and entropy- MDL	C-SVM. Included oversampling on EP.	Mean AUC: 76% (on unseen test set from a different center in pairwise testing). AUC on combined dataset: 86% (before oversampling on EP) and 92% (after oversampling on EP) (leave-one-out cross- validation).
Hara et al.[19]	Embryonal brain tumors	Median: 6.9	34 Total	Single	Differentiate between various histologies, identify tumors with neuraxis metastases, identify patients at risk of recurrence, and predict survival outcomes from preoperative imaging	T1wCE FLAIR	Manual	Selected features that had the largest observed variance within the cohort, and selected ones with physician- defined prognostic value for further analysis	LR to predict sex and M status, multinomial LR for histology, and Cox regression for recurrence and survival outcomes	For histology, key features included size and texture features. For neuraxis metastases, predictive features included tumor diameter (AUC = 0.74) and neighborhood gray tone coarseness (AUC = 0.7). For recurrence, AUC was 0.7 for predictive features such as tumor volume and neighborhood gray tone coarseness.
Dasgupta et al.[20]	Medulloblastoma	Median: 9range 2–48	111 Total Breakdown: 17 WNT 44 SHH 27 Group 3 23 Group 4	Multi (n/a)	Predict MB molecular subgroup from preoperative imaging	Multiparametric MRI, includingT1wCE, T2w	n/a	Pearson chi-square test and Fisher’s exact test (on features extracted by observers, such as tumor location, maximum tumor size, and contrast enhancement characteristics, amongst others)	Logistic regression to develop binary nomograms for each subgroup	WNT: AUC of 0.693. SHH: AUC of 0.991. Group 3: AUC of 0.600. Group 4: AUC of 0.788. (validation cohort)
Goya Outi et al.[21]	Diffuse intrinsic pontine glioma	Mean: 7.4	38 Total breakdown: 9 H3.1 mutation22 H3.3 mutation 4 WT3 unknown	Single	Predict H3 mutation status	T1w T2w T1wCE FLAIR	n/a	Multi-level feature selection, including intra-class correlation coefficient, AUC, and hierarchical clustering using spearman’s correlation coefficient	SVM, kNN, RF. Included oversampling on minority class (H3.1)	Best model was SVM with combined imaging and clinical features. This model had F1-weighted score of 0.84 (leave-one-out cross-validation)
Iv et al.[22]	Medulloblastoma	Range: 1 to 18, mean: 8.56	109 Total breakdown: 30 SHH19 WNT 24 Group 336 Group 4	Multi (3)	Predict MB molecular subgroup	T1wCE T2w	Manual	Wilcoxon rank sum test	SVM	For double 10-fold cross-validation on combined data, best model used both T1wCE and T2w. AUC: 0.79 (SHH), 0.45 (WNT), 0.70 (Group 3), 0.83 (Group 4).T2w-only model had slightly better performance on WNT (0.63). For 3-dataset cross- validation, T1wCE & T2w model had AUC: 0.80 (Group 4), 0.70 (SHH), 0.45 (WNT), 0.39 (Group 3). T1wCE-only model had slightly higher AUC for SHH (0.73).T2w-only model had highest AUC for WNT (0.72) and Group 3 (0.57).
Zhou et al.[23]	Pilocytic astrocytoma Medulloblastoma Ependymoma	Range: 0.25–18, mean: 8.6	288 Total breakdown:107 PA 111 MB70 EP	multi (4)	Differentiate between PA, MB, EP	T1wCE T2wADC	Manual	Chi-squared score, analysis of variance, T-test, Fisher, Relief, Wilcoxon, mutual information, minimum redundancy/ maximum relevance, conditional infomax, joint mutual information, conditional mutual information maximization, interaction capping, double input symmetric relevance, mutual information maximization	Neural network, decision tree, boosting, Bayesian, bagging, RF, SVM, linear discriminant analysis, kNN, generalized linear model. Compared automated optimization of pipeline (with TPOT) with manual optimization of feature selection and classification.	Multiclass classification: micro-averaged AUC was 0.91 from TPOT and was 0.92(chi- squared + generalized linear model) from manual expert optimization (test set). Binary classifiers from TPOT had AUC: 0.94 (MB), 0.84 (EP), 0.94 (PA) (test set). Binary classifiers from manual expert optimization had AUC: 0.98 (MB), 0.70 (EP), 0.93 (PA) (test set).
Grist et al.[24]	Pilocytic Astrocytoma Medulloblastoma Ependymoma	n/a	49 Total breakdown: 22 PA 17 MB 10 EP	Multi (4)	Differentiate between low- grade (PA) and high-grade (EP, MB), as well as differentiate between PA, MB, and EP	T1w T1wCE T2w FLAIR DWI DSC	n/a	PCA and UA	Single layer Neural Network, AdaBoost, RF, SVM, and kNN. Tried oversampling on EP.	AdaBoost with univariate reduction achieved 85% balanced accuracy (3-fold cross-validation)
Li et al.[25]	Ependymoma Pilocytic Astrocytoma	Range: 0–14 mean: 7	45 Patients, 135 slices total breakdown: 81 slices EP 54 slices PA	Single	Differentiate between EP and PA	T1w T2w	Manual	KWT	SVM	AUC = 0.88 (validation set)
Quon et al.[26]	Diffuse midline glioma Medulloblastoma Pilocytic astrocytoma Ependymoma	Range: 0.21–34 median: 6.75	617 Total breakdown: 122 DMG 272 MB 135 PA 88 EP (+199 control)	Multi (5)	Detection and classification of posterior fossa tumors	T1wCE T2w ADC	Manual (identification of tumor vs. no tumor on slices)	n/a	Deep-learning architectures (ResNet, ResNeXt, DenseNet, InceptionV3). Used transfer learning. Final prediction made from aggregate slice-level predictions from ensemble of 5 models	2D ResNeXt-50-32x4d trained on T2w features had AUC of 0.99 for tumor detection. For classification, accuracy was 92% and F1 was 0.80 (held-out test set). Model’s tumor detection accuracy was similar to 4 radiologists; model’s classification accuracy and F1 score was higher than 2/4 radiologists
Pisapia et al.[27]	Optic pathway gliomas	Range: 2–18	38 Total breakdown: 19 with progression19 without progression	Single	Predict progression (defined as radiographic tumor growth or vision decline)	T1w T1wCE T2w FLAIRDTI (FA, RAD, TR)	Manual	n/a	SVM	Model that included features defined as the change in features between pairwise combinations of imaging studies done before progression scan had accuracy: 86% (leave-out-two cross- validation)
Prince et al.[28]	Adamantinomatous craniopharyngioma	n/a	39 Total	Multi (18)	Identify ACP	T1wCT	n/a	n/a	Various pretrained deep- learning neural networks; used transfer learning, genetic algorithm to optimize, and data augmentation	Accuracy of 87.8% for model using features from both MRI and CT, 83.3% for MRI only, and 85.3 for CT only (test set). Model performed on par with average of two human specialists.
Tam et al.[29]	Diffuse intrinsic pontine glioma	Range: 1.58– 19.08mean: 6.67	177 Total	Multi (11)	Prognostication (predict overall survival)	T1wCE T2w	Manual	Features chosen based on lambda value with minimum cross-validated error across 100 repetitions of 10-fold cross-validation of fitting a Cox regression model	Cox proportional hazards model	Model using both radiomic and clinical features: concordance was 0.70 (training set) and 0.59 (testing set)
Wagner et al.[30]	Low-grade gliomas	Mean: 9.21	115 Total	Multi (2)	Predict BRAF molecular status (fusion and V600E point mutation) from imaging	FLAIR	Semi- automatic	n/a	RF	AUC = 0.75 (internal 4-fold cross-validation) and 0.85 (external validation on a cohort from a separate center than training data)
Novak et al.[31]	Pilocytic astrocytoma Medulloblastoma Ependymoma	Range: 1.0–16.3	124 Total breakdown:36 PA 55 MB26 EP 7 other (ATRTs + other low-grade tumors not included in classification analysis)	Multi (12)	Differentiate between posterior fossa tumors	T1w T2w T1wCE DWI	Manual	PCA	NB, RF	Overall classification accuracy: 86.3% for RF and 84.6% for NB (10- fold cross-validation). NB classified more EP and PA cases correctly than RF, while RF classified more MB cases correctly than NB
Dong et al.[32]	Ependymoma Medulloblastoma	Range: 0–15	51 Total breakdown: 24 EP27 MB	Single	Distinguish between EP and MB	T1wCE DWI	Semi- automatic	UA, UAS, MLR	kNN, AdaBoost, RF, SVM	Best model was RF with multivariable logistic regression for feature selection. AUC = 0.91 (10-fold cross- validation)
Zheng et al.[33]	Medulloblastoma	Mean: 5.6	124 Total breakdown: 44 with CSF dissemination 80 without CSF dissemination.	Single	Predict CSF dissemination	T1w (both head and spine)	Manual	mRMR and LASSO	Multivariable logistic regression	Best model used combined clinical and radiomic features. AUC: 0.87 (internal validation cohort) and 0.73 (external validation cohort)

Abbreviations: ADC, Apparent Diffusion Coefficient; CSF, Cerebrospinal Fluid; DTI, Diffusion Tensor Imaging; DWI, Diffusion Weighted Imaging; FLAIR, Fluid-attenuated inversion recovery.

an/a indicates that the information was not clearly specified (eg, age was not specified for cohort or segmentation methodology was not identified) or the column was not relevant to the particular study (eg, segmentation was not performed).

Summary of Radiomic Papers from in Pediatric Neuro-Oncology Abbreviations: ADC, Apparent Diffusion Coefficient; CSF, Cerebrospinal Fluid; DTI, Diffusion Tensor Imaging; DWI, Diffusion Weighted Imaging; FLAIR, Fluid-attenuated inversion recovery. an/a indicates that the information was not clearly specified (eg, age was not specified for cohort or segmentation methodology was not identified) or the column was not relevant to the particular study (eg, segmentation was not performed).

Applications in Determining Tumor Histology

Determination of tumor histology is a key step in diagnosis and treatment of pediatric brain tumors. Radiomics has shown promise in aiding in the determination of histology even prior to biopsy. This early information can help with surgical planning and better inform caregivers when counseling patients.

Posterior fossa tumors

—A common goal of radiomic studies in pediatric neuro-oncology has been to distinguish between posterior fossa tumors, particularly Ependymoma (EP), Medulloblastoma (MB), and Pilocytic Astrocytoma (PA)—refer to Table 2 for other abbreviations commonly used throughout this review. These tumors are all located in the posterior fossa, have some similar radiologic characteristics, and patients with these tumors present with similar symptoms. It is clinically important to differentiate between these tumors as this diagnosis can help in guiding surgical or treatment planning, as well as informing prognosis.[34,35] Currently, pathological analysis of the biopsied tissue is the gold standard for diagnosis,[36] but this process is invasive. An accurate radiomic model could allow for a fast, noninvasive method for presurgical and pretreatment diagnosis. Radiomics studies have shown promise in achieving this goal.[16-18,23-26,31,32]

Table 2.

Abbreviations

Brain tumors
LGG	Low-Grade Glioma
HGG	High-Grade Glioma
PA	Pilocytic Astrocytoma
MB	Medulloblastoma
EP	Ependymoma
DIPG	Diffuse Intrinsic Pontine Glioma
DMG	Diffuse Midline Glioma
ACP	Adamantinomatous Craniopharyngioma
OPG	Optic Pathway Glioma
Classification algorithms
NB	Naïve Bayes
RF	Random Forest
kNN	k-Nearest Neighbors
SVM	Support Vector Machine
C-SVM	Cost-Based Support Vector Machine
CNN	Convolutional Neural Network
GBT	Gradient Boosted Trees
LR	Logistic Regression
Feature reduction/selection methods
PCA	Principal Component Analysis
UA	Univariate Analysis
UAS	Univariate Analysis Screening
MLR	Multivariate Logistic Regression
KWT	Kruskal-Wallis Test

Abbreviations Several of these studies found that classifiers were least accurate in classifying EP. One deep-learning study found that EP had the least distinctive learned feature space amongst these tumors,[26] which may partially explain why posterior fossa classification models often are least accurate at classifying EP among the tumor types. Adding EP patients may help address these difficulties (see Class Imbalance section in the Discussion for further discussion on this topic). Quantitative information from Diffusion Weighted Imaging (DWI) (Apparent Diffusion Coefficient (ADC) maps) and/or DSC images (unnormalized cerebral blood volume, corrected cerebral blood volume, and/or K2 maps) have been shown to be predictive of classification of posterior fossa tumors.[23,24,31,32] For example, Dong et al. found that several ADC-derived features distinguished EP from MB, which is consistent with the clinical understanding that MB, by virtue of its high cellular density, often restricts diffusion to a greater extent than EP tumors.[32] In particular, texture-based features derived from T1w and T2w images have been shown to be predictive of posterior fossa tumor classification in both single and multi-institutional studies.[16-18,25] Most radiomic studies select imaging features from the region of interest drawn around the tumor. However, one study found that imaging features extracted from the whole brain, when combined with imaging features calculated from the region of interest, can improve classification accuracy for posterior fossa tumors.[24] While the mechanism of this improved accuracy is not fully elucidated in the mentioned study, it implies that peri-tumoral regions harbor data that can be meaningful in the clinical context.

craniopharyngioma

—Correct diagnosis of adamantinomatous craniopharyngioma (ACP) is essential because how ACP is treated and managed differs drastically from other tumors that are considered in the differential diagnosis.[28] For example, germinomas, which are often in the same radiologic differential as ACPs, are effectively treated without surgical intervention whereas ACP often involves aggressive surgical resection and external beam radiation.[28] ACP is accurately diagnosed in 64%–87% of cases.[37] ACP is rare and there is limited available data upon which to build radiomic models. Prince et al. utilized transfer learning, data augmentation, and genetic algorithms to build models from a dataset of 39 patients to identify ACP. They also found that combining CT with MRI scans outperformed either modality alone.[28]

Embryonal tumors

—Hara et al. found that histological subgroups of embryonal tumors, namely MB, pineoblastoma, and supratentorial primitive neuroectodermal tumors (sPNET), could be differentiated based on texture features as well as quantitative markers of tumor size.[19] Specifically, tumor volume and maximum 3D diameter were associated with specific histologies, with sPNETs being the largest and pineoblastomas being smallest of the three subtypes. This is consistent with clinical reasoning as sPNETs likely go undetected for longer periods of time due to a decreased likelihood of causing hydrocephalus. The study also identified eight textural features that quantified intratumor heterogeneity that were markedly different between tumor histologies, with pineoblastomas being the most heterogenous and MBs being the most homogeneous.[19]

Applications in Identifying Disseminated Disease

The presence of metastasis is important in the staging of many oncologic processes including some primary brain tumors.[38] However, detection of metastatic progression is sometimes difficult and requires a wide array of diagnostic testing some of which can be expensive and invasive. In embryonal tumors, Hara et al. found that radiomic features can discriminate between localized and disseminated disease.[19] Larger primary tumor size was found to be associated with a higher likelihood of neuraxis metastases, and likelihood of neuraxis metastases also trended toward a decrease in primary tumor heterogeneity based on texture features (although these results did not reach statistical significance).[19] In MB patients, Zheng et al. found a clinical-radiomic model to be predictive of preoperative Cerebrospinal Fluid (CSF) dissemination (as determined by head and spine MRI, as well as no subsequent dissemination at the 1-year follow-up for the non-CSF dissemination group).[33]

Applications in Prognostication

Tumor prognostication, both in terms of survival and risk of recurrence after treatment, is a topic of great interest within the field. While molecular data acquired from biopsy is a current mainstay in tumor classification and prognostication, noninvasive methods that can predict risk of recurrence and overall survival would be hugely beneficial to patients, particularly in cases where biopsy is difficult or unfeasible. Radiomic features have been shown to be useful in predicting prognosis in optic pathway gliomas (OPGs) and diffuse intrinsic pontine gliomas (DIPGs).[27,29] In developing a model to predict OPG progression, which was defined by radiographic tumor growth and/or vision loss on follow-up scans, Pisapia et al. included data derived from Diffusion Tensor Imaging (DTI), including fractional anisotropy (FA), trace, and radial diffusivity, in addition to features from T1w, T1wCE, T2w, and Fluid-attenuated inversion recovery (FLAIR) sequences.[27] The imaging was acquired as part of a well-defined surveillance protocol. FA features in the optic radiations were found to be among the most predictive features. These results suggest that DTI can measure the changes in the microstructure of white matter that follows tumor growth.[27] For DIPGs, which are progressive and lethal pediatric cancers, intensity-based and texture-based features from wavelet-transformed images have been found to be predictive of overall survival.[29] In embryonal tumors, larger tumor size and decreased heterogeneity, as measured by their quantitative features, trended toward recurrence of tumors, but these trends did not reach statistical significance based on univariate analyses.[19]

Radiogenomics for Molecular Classification of Tumors

Radiogenomics, as it is referred to in this review or imaging genomics, is defined as the integration of radiomics with alterations in molecular and genomic data or using ML methods based on radiomic features to find noninvasive and in vivo signatures to predict molecular alterations in tumors.[4] This technology has great promise in the field of neuro-oncology as many disease subtypes have been defined based on their genetic and molecular profiles by WHO guidelines.[5,8] In pediatrics, models have been developed to predict the molecular subtype of H3 histone mutations in DIPG,[21] molecular subtype in MBs,[20,22] and BRAF mutation.[30]

H3 molecular subtype in DIPG

—There are two types of H3 mutations frequently associated with diffuse midline gliomas of the pons (generically called DIPG): H3.1K27M and H3.3K27M. A patient’s H3 mutation status is clinically important as it guides treatment of DIPG patients and is included in the classification schema of the new WHO CNS5 guidelines.[5] In one model developed for prediction of H3 mutation subtype among patients with DIPG, three of the six selected imaging features were from FLAIR scans, indicating that the differences in H3 molecular subtype affect some aspect of tumor biology that is reflected in FLAIR scans.[21] This study also found that the combined model of clinical and radiomic features was more accurate than the model built on image data alone.

MB molecular subtype

—In the recent WHO CNS5 guidelines, MB has been classified into four distinct subgroups based on molecular identity: wingless (WNT)-activated, sonic hedgehog (SHH)-activated and TP53-wildtype, SHH-activated and TP53-mutant, and non-WNT/non-SHH.[5] However, radiogenomic studies have utilized prior guidelines and so discussion of these entities here will have the following identities—WNT, SHH, Group 3, and Group 4 (Groups 3 and 4 are now classified as non-WNT/non-SHH[39]). Radiomic models generally perform better at predicting SHH and Group 4 than Group 3 and WNT, especially WNT.[20,22] Association has been shown between the presence of peri-tumoral edema, especially moderate to severe edema, and SHH MBs.[20] The location of SHH tumors also has been shown to vary significantly between pediatric patients and adult patients, with them being more commonly located in the midline in the former and lateralized in the latter.[20] One contributing factor to the worse performance of models for prediction of WNT subtype might be that it is the least common subtype of MB, and there were not enough subjects for the predictive models to train on. Iv et al. found that for a dataset that combined data from three institutions into one set, models built from T1wCE and T2w images outperformed models built from either modality alone for all subgroups except WNT. They also explored training models on data from two institutions and testing on the third, and they found that some of the most predictive features that were repeatedly selected in all three loops included features related to lesion area, intensity-based histograms, tumor edge-sharpness, and local area integral invariant.[22]

BRAF mutations

—Pediatric low-grade gliomas (pLGGs) represent a particularly large and heterogeneous group of tumors where molecular subtyping is a key part of diagnosis and treatment planning. The type of BRAF alteration may play a particularly important role in the prognosis of these tumors. Differentiation between BRAF fusion tumors and BRAF V600E mutated tumors is becoming necessary prior to initiating treatment with certain precision chemotherapeutic agents and has been shown to be important in predicting the clinical course of patients.[40] Wagner et al. developed a model that predicted BRAF status in patients with pLGGs.[30] Features were extracted from FLAIR sequences and included histogram, shape, and texture-based features. Features extracted from 3D wavelet transforms, as well as location of the tumor and patient age at presentation, were especially predictive. Several of the aforementioned radiogenomic studies that built models on conventional MRI modalities noted that the integration of data from DWI or the resulting ADC maps could improve predictive performances,[20,22,30] but the data were not always routinely available.[20]

Challenges and Future Directions

There are a number of important challenges that face radiomic studies. In this section, we will discuss these challenges as well as potential solutions and overall trajectory of the field.

Sample Size

Small sample size is a widespread challenge

—Many studies had relatively small sample sizes, which most authors noted as a limitation of their models and expressed a desire to increase their patient cohort size in the future. The sample size of the papers summarized in Table 1 ranged from 34 to 617 brain tumor patients, with a median of 110 patients. One of the reasons contributing to small cohort sizes might be a simple lack of pediatric data. For instance, among oncological patients, there are far fewer children than adults, and therefore there is less data available upon which to build radiomic models specifically for pediatrics compared to adults. In fact, the incidence rate of brain and other CNS tumors from 2013 to 2017 was about five times higher in adults than in children.[15] One group of authors noted that even though EP and MB data have been collected in pediatrics for nearly a decade, their study included a small sample due to the rarity of brain tumors in children overall.[32] Consequences of small sample size include decreased generalizability, increased risk of overfitting, and lack of a separate test cohort.

Increasing cohort sizes through collaboration and consortia

—A long-term solution to small sample size is to use data from multiple institutions through either direct collaboration or utilization of a collective online database. Just under half of the radiomic papers summarized in Table 1 used data from a single institution. Some authors noted that moving forward, multicenter data could be used to increase sample size and evaluate model generalizability.[32,33] There exist a few consortiums to facilitate data collection and sharing,[41,42] such as the Children’s Brain Tumor Network , which has several cloud-based platforms, outlined in Figure 2, where researchers can access the data. Other databases relevant for radiomic studies include the National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) and the Pediatric Brain Tumor Consortium.

Figure 2.

Data platforms for pediatrics.

In CAVATICA, researchers can rapidly run computational analysis on datasets. In PedcBioPortal, users can view analytics on datasets without needing any knowledge of programming or bioinformatics tools. In Flywheel, users can manage and process imaging data.

Data platforms for pediatrics. In CAVATICA, researchers can rapidly run computational analysis on datasets. In PedcBioPortal, users can view analytics on datasets without needing any knowledge of programming or bioinformatics tools. In Flywheel, users can manage and process imaging data. One example of how multi-institutional databases can be helpful is the previously discussed study on ACP, which is a rare tumor. The authors used data from the Advancing Treatment for Pediatric Craniopharyngioma consortium, which provides data from 17 North American centers, along with some data from St. Jude’s. This allowed them to build a dataset of 39 patients for a rare brain tumor.[28] One solution for building such databases is for the radiologists to prospectively capture imaging data and annotate them with a more standardized lexicon.[1] This will help researchers use larger patient cohorts and multicenter data. Another approach to multi-institutional collaboration is federated learning, which bypasses many of the challenges associated with data sharing.[43,44] In this approach, individual institutions train models on their institutional data. The trained models are then aggregated across institutions into one model. This training process is repeated until a final model is obtained.[44] Recently, federated learning has shown potential for adult brain tumor segmentation from MRI scans.[43]

Algorithmic solutions to small sample sizes

—A shorter-term solution to having a small data set is to employ transfer learning or data augmentation. Transfer learning allows one to combat the risk of overfitting a model to a small data set by taking an already trained model and using it as the starting point for training on another data set. For example, one study, discussed earlier for work on posterior fossa tumors, used transfer learning from a model that had been pretrained on over 1 million images.[26] Another study used data augmentation, the process of adding synthetic data samples by transforming the original data, in addition to transfer learning.[28] Moving forward, studies with small sample sizes should consider employing these learning techniques to increase the power and generalizability of any findings.

Class Imbalance

—Many studies summarized in Table 1 faced the challenge of class imbalance, which occurs when the sizes of the classification categories in the training dataset are skewed. In radiomics, class imbalance is often a problem when training a classifier on several cancers where one is rarer than the others, and as a result, there are less samples available for some classes compared to others. To discuss class imbalance, we will use the studies that worked on differentiating between three posterior fossa tumors—MB, PA, and EP—as our example. PA (and/or sometimes MB) is usually the largest class size, and EP is the smallest. This imbalance makes sense given the tumor incidence rates. According to the CBTRUS age-adjusted incidence rates from 2013 to 2017, in cases per 100,000 persons aged 0 to 19, the incidence rate was 0.92 for PA, 0.40 for MB, and only 0.29 for EP. In fact, PA is the most common brain or CNS tumor in persons aged 0 to 19 years, making up 14.9% of all brain or CNS tumors in pediatrics.[15] The rarity of EP contributes to its disproportionally small class size in radiomic studies, a trend that can be seen in many of the previously discussed studies.[16-18,23,24,31] Imbalanced multiclass data can be one of the causes of worse performance and lower sensitivity of a classifier on the minority class (here, EP). One strategy for overcoming class imbalance is balancing the training cohort through undersampling of majority class(es) or oversampling of minority class(es).[45] However, some implementations of these methods, such as random under or oversampling, might reduce the accuracy on the majority class or cause overfitting to the minority class.[45] One study with a training cohort of 71 PA, 45 MB, and 18 EP tumors, created additional synthetic EP samples using the synthetic minority oversampling technique (SMOTE) on the extracted features, which improved their overall classification performance and increased sensitivity of the classifier in discrimination of EP from other posterior fossa tumors. However, it was noted that their generated synthetic samples were included in the leave-one-out cross-validation loops, whereas ideally, performance evaluation would only be based on original samples.[18] Another study outside of those summarized in Table 1 found improved accuracy when combining ensemble learning algorithms with an oversampling technique, SMOTE, for tumor classification from MRS data.[46]

Multicenter Data and Generalizability

Multicenter studies to incorporate technological variation

—Using data from multiple centers not only increases sample size, but also aids model generalizability. A problem with radiomic studies is reproducibility.[47] Scanning hardware and scanning protocol differences affect radiomic features to various extents. As such, scanner field strength, manufacturer, family and specific type of MRI sequences, addition of inversion recovery or fat suppression pulses, sequence acquisition parameters, spatial resolution, k-space readout schemes, and scanner image filters might change texture features that are explored in radiomic analyses.[48-52] Several authors discussed previously have noted that incorporating images from various institutions is important because it incorporates a wider range of scanner hardware and protocol differences.[22,30] This variety makes models more robust against these differences, and therefore more generalizable. Although this variety might reduce model accuracy, it is important if a radiomic model is ever to be used beyond a particular set of data. Several studies have investigated the transferability of radiomic models across institutions with different scanners and/or acquisition protocols by training models on datasets from one or multiple institutions, then testing that model on unseen data from one or multiple different institutions.[18,22] Reported findings indicated that models trained and tested on data combined from all institutions sometimes have slightly higher performance metrics than models trained and tested on data from different institutions, but the latter models still performed well, indicating the potential for successful use of radiomic models across institutions.

Data harmonization and standardization

—Using multicenter training data presents an additional challenge: data harmonization. Harmonization has been defined as the “explicit removal of site-related effects in multi-site data”.[53] Without harmonization during pre-processing, radiomic models may not be successful when applied to images that were taken with different MRI protocols. Striking a balance between scan protocol variation in the real world and relative harmonization to maximize both accuracy and generalizability is one of the challenges in the field. Several harmonization methods have been developed for MRI, but further research is needed because there is not yet consensus within the radiomics field on what the optimal harmonization method is.[53,54] The harmonization problem can be mitigated with the standardization of image acquisition protocols. In 2020, the Response Assessment in Pediatric Neuro-Oncology (RAPNO) working group was established to develop recommendations to guide and standardize response assessment in clinical trials for tumor types. Their guidelines on imaging sequences and parameters can also be used in general imaging practices.[55] As more institutions adopt these recommendations, the resulting standardization may make extracted features more reproducible, which should make it easier for radiomic studies to be more generalizable and useful.

Auto-segmentation pipelines to increase generalizability

—Many radiomic studies in the literature have segmented brain tumors from the MRI scans before feature extraction. Full volumetric manual segmentation can be subject to inter-reviewer variability. For example, one study reported that Dice scores between human raters in segmenting sub-regions of gliomas ranged from 74% to 85%.[56] Some studies address this problem by having multiple reviewers delineate tumor regions and reach a consensus about the final segmentation.[29] Inconsistent segmentation guidelines and manual review processes might prohibit the generalizability of radiomic models due to a mismatch in the resulting segmented tumors and their components. In addition, it is time-consuming to segment manually. One potential solution is to develop an automatic segmentation module for pediatric brain cancer, thereby allowing for more consistent segmentations across radiomic studies, a reduction in segmentation time, and easier translation of radiomic models to clinical settings.[57] Additionally, some studies outside of those in Table 1 used reproducibility or feature stability analysis as a first step in their feature reduction process by removing features that were not reproducible across feature extractions from segmentations performed by different radiologists.[58,59] Future radiomic studies should consider employing similar analyses as this may aid in generalizability of subsequent models.

Future Directions

Combination of multi-omic approaches with radiomics

—While radiomics can provide a plethora of mineable data, their contribution in a clinical context will likely be synergistic with a variety of other systems-based tools. One combination of particular interest is the incorporation of radiomic features with liquid biopsy data for tumor characterization and monitoring of progression. Liquid biopsy refers to the extraction of tumor molecular data from commonly collected/easily accessible bodily fluids. In practice, cell-free DNA/RNA have shown the most promise and have been used to identify independent prognostic factors in adult pancreatic, lung, and prostate cancer.[60] While neuro-oncology is lagging behind in terms of these advances, there is increasing interest in the use of cell free DNA to characterize and follow primary brain tumors, namely in adult glioblastoma multiforme.[61] Mutual characteristics (such as noninvasive nature, ease of repeatability, and ease of clinical implementation) of liquid biopsy information and radiomics make them particularly well suited for combination. Over the past several decades, the paradigm of cancer research/treatment has been shifting from static, “one-size” characterization of oncologic entities to granular, dynamical methods. Thus, there is a need for easily implemented and repeatable tests that can more accurately represent the heterogenous and progressive nature of many primary CNS tumors.

Biological meaning of radiomic features

—In their 2021 paper, Tomaszewski and Gillies emphasized the importance of attaching biological meaning to radiomic findings as the field moves forward.[62] Most studies validate their proposed radiomic signatures using an independent test set. However, biological validation is critical to increase the practical value of these studies and integrate them into clinical decision making. Additionally, providing biological context can help move the field towards acceptance as a standalone method for diagnostics or prognostication. Four classes of biological data that can be correlated to radiomic signatures include: gene expression data, protein expression data from immunohistochemistry, data from local pathologic analysis, and data from habitat imaging. Attaching biological meaning to radiomics can be performed in two ways: either radiomics can be used to predict a biological phenomenon that can then be related to a patient outcome, or the radiomic signature is created to predict a patient outcome and consequently investigated for its association with biologic data.[62]

Image-localized biopsies

—Given that tumors are often heterogenous and harbor varying genetic and phenotypic aberrations, it is imperative, for the sake of accuracy, that radiogenomic studies correlate imaging findings with the location of biopsy from which molecular information is derived. However, this is seldom the case in retrospective radiogenomic studies. While there has been work done to quantify the spatial uncertainty in radiogenomic models,[63] future endeavors should attempt to co-register image-localized biopsies and imaging pathology. This practice could enable better characterizations of tumors and lead to pipelines that can help guide biopsies to maximize yield of genetic and molecular information.

The future: an open-science platform for clinicians and researchers

—Currently, most radiomic models cannot be used after publication. This is at least in part due to a lack of interoperability and reproducibility, despite the individual models showing potential for helping clinicians in many different ways. Trained radiomic models should be shared in a centralized platform where they can ultimately be used routinely for clinical applications across institutions. Challenges to this ideal include imaging data storage and management and a lack of standardized file structures and image acquisition protocols across sites. Ongoing efforts to develop systems for data management will help enable efficient workflows across institutions. Software (eg, CaPTk[64]) for processing multimodal data from end-to-end will be essential for widespread implementation of radiomic models. Finally, multiple expert groups are moving towards standardization of the reporting of radiomic studies, similar to what has been done previously for traditional studies of diagnostic or prognostic performance of imaging exams.[65] This can help improve the quality, repeatability, and reproducibility of radiomic studies and bring them closer to finding clinical applications.

Conclusion

Radiomics and radiogenomics have been increasingly applied to the field of pediatric neuro-oncology, but many challenges need to be overcome to pave the way for meaningful clinical application. Utilization of multi-institutional databases will be key to the advancement of this field. Despite these unique hurdles, the works reviewed in this article demonstrate the utility of these technologies in aiding in diagnosis, classification, and prognostication of pediatric brain tumors.

60 in total

1. Evaluation of radiomic texture feature error due to MRI acquisition and reconstruction: A simulation study utilizing ground truth.

Authors: Fei Yang; Nesrin Dogan; Radka Stoyanova; John Chetley Ford
Journal: Phys Med Date: 2018-05-22 Impact factor: 2.685

2. Imaging patterns predict patient survival and molecular subtype in glioblastoma via machine learning techniques.

Authors: Luke Macyszyn; Hamed Akbari; Jared M Pisapia; Xiao Da; Mark Attiah; Vadim Pigrish; Yingtao Bi; Sharmistha Pal; Ramana V Davuluri; Laura Roccograndi; Nadia Dahmane; Maria Martinez-Lage; George Biros; Ronald L Wolf; Michel Bilello; Donald M O'Rourke; Christos Davatzikos
Journal: Neuro Oncol Date: 2015-07-16 Impact factor: 12.300

Review 3. Neuroimaging of pediatric posterior fossa tumors including review of the literature.

Authors: Andrea Poretti; Avner Meoded; Thierry A G M Huisman
Journal: J Magn Reson Imaging Date: 2011-10-11 Impact factor: 4.813

Review 4. Treatment of posterior fossa tumors in children.

Authors: Dattatraya Muzumdar; Enrique C G Ventureyra
Journal: Expert Rev Neurother Date: 2010-04 Impact factor: 4.618

Review 5. Imaging signatures of glioblastoma molecular characteristics: A radiogenomics review.

Authors: Anahita Fathi Kazerooni; Spyridon Bakas; Hamidreza Saligheh Rad; Christos Davatzikos
Journal: J Magn Reson Imaging Date: 2019-08-27 Impact factor: 4.813

6. Clinical Applications of Quantitative 3-Dimensional MRI Analysis for Pediatric Embryonal Brain Tumors.

Authors: Jared H Hara; Ashley Wu; Javier E Villanueva-Meyer; Gilmer Valdes; Vikas Daggubati; Sabine Mueller; Timothy D Solberg; Steve E Braunstein; Olivier Morin; David R Raleigh
Journal: Int J Radiat Oncol Biol Phys Date: 2018-06-08 Impact factor: 7.038

7. The Biological Meaning of Radiomic Features.

Authors: Michal R Tomaszewski; Robert J Gillies
Journal: Radiology Date: 2021-05 Impact factor: 11.105

8. Robust deep learning classification of adamantinomatous craniopharyngioma from limited preoperative radiographic images.

Authors: Eric W Prince; Ros Whelan; David M Mirsky; Nicholas Stence; Susan Staulcup; Paul Klimo; Richard C E Anderson; Toba N Niazi; Gerald Grant; Mark Souweidane; James M Johnston; Eric M Jackson; David D Limbrick; Amy Smith; Annie Drapeau; Joshua J Chern; Lindsay Kilburn; Kevin Ginn; Robert Naftel; Roy Dudley; Elizabeth Tyler-Kabara; George Jallo; Michael H Handler; Kenneth Jones; Andrew M Donson; Nicholas K Foreman; Todd C Hankinson
Journal: Sci Rep Date: 2020-10-09 Impact factor: 4.379

9. Uncertainty quantification in the radiogenomics modeling of EGFR amplification in glioblastoma.

Authors: Leland S Hu; Lujia Wang; Kristin R Swanson; Jing Li; Andrea Hawkins-Daarud; Jennifer M Eschbacher; Kyle W Singleton; Pamela R Jackson; Kamala Clark-Swanson; Christopher P Sereduk; Sen Peng; Panwen Wang; Junwen Wang; Leslie C Baxter; Kris A Smith; Gina L Mazza; Ashley M Stokes; Bernard R Bendok; Richard S Zimmerman; Chandan Krishna; Alyx B Porter; Maciej M Mrugala; Joseph M Hoxworth; Teresa Wu; Nhan L Tran
Journal: Sci Rep Date: 2021-02-16 Impact factor: 4.379

10. MRI-based radiomics for prognosis of pediatric diffuse intrinsic pontine glioma: an international study.

Authors: Lydia T Tam; Kristen W Yeom; Jason N Wright; Alok Jaju; Alireza Radmanesh; Michelle Han; Sebastian Toescu; Maryam Maleki; Eric Chen; Andrew Campion; Hollie A Lai; Azam A Eghbal; Ozgur Oztekin; Kshitij Mankad; Darren Hargrave; Thomas S Jacques; Robert Goetti; Robert M Lober; Samuel H Cheshier; Sandy Napel; Mourad Said; Kristian Aquilina; Chang Y Ho; Michelle Monje; Nicholas A Vitanza; Sarah A Mattonen
Journal: Neurooncol Adv Date: 2021-03-05