| Literature DB >> 35486171 |
Kavi Fatania1,2,3, Farah Mohamud4, Anna Clark5, Michael Nix5, Susan C Short6,7, James O'Connor8,9,10, Andrew F Scarsbrook11,6, Stuart Currie11,6.
Abstract
OBJECTIVES: Radiomics is a promising avenue in non-invasive characterisation of diffuse glioma. Clinical translation is hampered by lack of reproducibility across centres and difficulty in standardising image intensity in MRI datasets. The study aim was to perform a systematic review of different methods of MRI intensity standardisation prior to radiomic feature extraction.Entities:
Keywords: Glioma; Magnetic resonance imaging; Reproducibility of results
Mesh:
Year: 2022 PMID: 35486171 PMCID: PMC9474349 DOI: 10.1007/s00330-022-08807-2
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 7.034
Fig. 1MR imaging in three different examples of adult-type diffuse gliomas
Fig. 2PRISMA flowchart illustrating the study selection for the systematic review of intensity normalisation in diffuse glioma radiomic studies
Summary of the risk of bias and applicability concerns for the 12 studies
| Study | Risk of bias | Applicability concerns | |||||
|---|---|---|---|---|---|---|---|
| Patient selection | Index test | Reference standard | Flow and timing | Patient selection | Index test | Reference standard | |
| Chen et al 2019 [ | Unclear | Low | Low | Low | Low | Low | Low |
| Zhao et al 2020 [ | Unclear | Low | Low | Low | Low | Low | Low |
| Reuze et al 2018 [ | Unclear | Low | Low | Low | Low | High | Low |
| Um et al 2019 [ | Unclear | Low | Low | Low | Low | Low | Low |
| Upadhaya et al 2016 [ | Unclear | Low | Low | Low | Low | High | Low |
| Florez et al 2018 [ | Unclear | Low | Low | Low | Low | Low | Low |
| Florez et al 2018 [ | Unclear | Low | Low | Low | Low | Low | Low |
| Hu et al 2021 [ | Unclear | Low | Low | Low | Low | Low | Low |
| Hoebel et al 2021 [ | Low | Low | Low | Low | Low | Low | Low |
| Vils et al 2021 [ | Low | Low | Low | Low | Low | Low | Low |
| Carré et al 2020 [ | Unclear | Low | Low | Low | Low | Low | Low |
| Orlhac et al 2020 [ | Unclear | Low | Low | Low | Low | Low | Low |
Summary of key features from the included studies (n = 12)
| Study | Aims | Patients (train:test seta) | MRI sequences examined | Normalisation method | Pre-processing | Segmentation method | Radiomics software | Results | Conclusion |
|---|---|---|---|---|---|---|---|---|---|
| Chen et al 2019 [ | To improve prediction of glioma grade using radiomics and the HSASR method of normalisation | 521 (416:105) | T1Gd | HSASR method | Skull stripping and resampling | Manual | Pyradiomics | Highest AUC was 0.9934 for glioma grading with processing compared to 0.8512 without. The AUC after processing generally increased by more than 15% | Multicentre data processed by this method have good adaptability, which improves grading results and has value for clinical prediction |
| Zhao et al 2020 [ | To examine the impact standardising MRI images with the HS-GS method has on using radiomics to predict glioma grades | 693 (554:139) | T1Gd | HS-GS method | Skull stripping and resampling | Manual | Pyradiomics | The AUC of the predicted classification after HG-GS processing is 0.956 which is 26.96% higher than not performing a standardisation method | The results show that by adding HS-GS method to standard pre-processing, the diagnostic performance of using radiomics for glioma grading improves with respect to AUC, ACC, sensitivity, and specificity |
| Reuze et al 2018 [ | To assess the effect of intensity rescaling on radiomic analysis of multicentre cohorts and the impact on the robustness of radiomic features | 190 (n/a) | T1Gd | Intensity rescaling | Spatial resampling and discretisation of grey levels | Manual | LIFEx freeware | Out of the 31 textural features that were extracted, only 11 were deemed to be robust after the harmonisation method | Overall, the efficiency of the harmonisation method differed between devices, therefore it was not deemed to be a sufficient method to correct the differences between images |
| Um et al 2019 [ | To determine the utility of a set of pre-processing methods on improving MRI radiomic feature robustness across multi-institutional datasets | 161 (111:47) | FLAIR, T1W, and T1Gd | Histogram standardisation | Co-registration | Semi-automatic | Computational Environment for Radiotherapy Research (CERR) | From all of the pre-processing methods, histogram standardisation had a superior performance at reducing covariate shift, as Haralick, Soebel, and Laplacian of Gaussian features returned a significant decrease of Matthews correlation coefficient to 0.191, 0.170, and 0.140 respectively ( | From all the pre-processing methods, histogram standardisation contributes the most at the investigated measures such as feature dependence on scanner variability and covariate shift |
| Upadhaya et al 2016 [ | To identify the impact of adding several pre-processing steps on the accuracy of the prognostic model which identifies patients above and below a median survival of 12 months | 58 (58:58b) | T1W, T2W, T1Gd, and FLAIR | Dynamics intensity limitation | Bias field correction, skull stripping, co-registration, spatial resampling, and intensity quantisation | Automatic | Not identified | The additional pre-processing steps improved the prognostic model from sensitivity and specificity of 79% and 86% respectively to a sensitivity and specificity of 93% | The addition of investigated pre-processing methods highlights how various acquisition methods from different MR scanners can influence the accuracy of prognostic models |
| Florez et al 2018 [ | To assess the ability of radiomic feature, to differentiate gross tumour volume (GTV) from oedema and differentiate vasogenic from tumour cell infiltration oedema | 17 (17;n/a) | T1W, T1Gd, T2W, FLAIR and apparent diffusion coefficient (ADC) | 1%-99% normalisation | Segmentation | Semi-automatic | MatLab version 2016a | Out of all of the sequences examined, T1Gd with 1–99% normalisation was the model best at classifying tumours with an AUC > 0.97 | From the several hundred of radiomic feature extracted, only a small subset showed excellent ability to classify tumour tissue |
| Florez et al 2018 [ | To assess the ability of radiomic features to distinguish oedema and infiltrative tumour based on FLAIR sequence | 20 (20;n/a) | FLAIR | 1–99% normalisation | Segmentation | Semi-automatic | MatLab version 2016a | Performance using single best discriminator reduced with addition of normalisation (AUC 0.87 vs 0.84) in patients with GBM | Small subset of texture features shows the ability to discriminate oedema from tumour |
| Hu et al 2021 [ | To evaluate the impact MIL normalisation has on segmentation and feature extraction which allows the prediction of pathological grading and | 800 (533:267) | T1W, T1Gd, and FLAIR for all of the datasets (and T2W for the BraTs dataset, | CycleGAN | Modality normalisation, layer spacing normalisation | Automatic | Not identified | MIL normalisation improved the AUC of pathological grading and IDH1 status prediction by 32% and 25% ( | MIL normalisation can produce high-quality standardised data which is imperative for radiomic analysis |
| Hoebel et al 2021 [ | To assess the impact of intensity normalisation methods (z-score normalisation and histogram matching) and intensity quantisation methods has on the repeatability and reproducibility of features extracted from a scan-rescan glioblastoma cohort. | 48 (n/a) | T1Gd and FLAIR | Segmentation, registration, bias field correction, and whole-brain extraction | Manual | Pyradiomics | For intensity features, both methods improved the repeatability on FLAIR images when compared to non-normalised baseline ( | Both normalisation methods showed better repeatability for FLAIR images than T1Gd images, which may be a consequence of variations in contrast administration and timing of image acquisition after contrast administration | |
| Vils et al 2021 [ | To evaluate the association between radiomic features, clinical outcome, and molecular characteristic such as MGMT status | 118 (69:49) | T1Gd | Linear intensity interpolation | Segmentation and manual extraction of brain tissue | Manual | Z-Rad | Regarding radiomic models capable of predicting MGMT status, images where the features were extracted from tumoural volumes of interest and normalised with linear interpolation were the only images validated in an independent cohort with an AUC of 0.670 (95% CI 0.5341–0.8056) | The proposed model may be a non-invasive approach to predict patient response to chemotherapy |
| Carré et al 2020 [ | To assess the impact of three intensity normalisation methods coupled with grey level discretisation on the task of tumour grade classification in two independent cohorts | 263 (195:48) | T1Gd and FLAIR | Nyul, WhiteStripe, and | Bias field correction, spatially resampled, skull-stripping, co-registration and segmentation | Manual | Pyradiomics | Significantly higher Jenson-Shannon divergence values were found on histogram and first-order features when comparing images with and without normalisation ( | A combination of |
| Orlhac et al 2020 [ | To assess the impact of intensity normalisation and post-extraction realignment (ComBat) on the statistical distribution of radiomics from diffuse gliomas | 18 | T1Gd and FLAIR | Hybrid WhiteStripe (and ComBat) | Co-registration, bias field correction, spatial resampling | Manual | LIFEx freeware | 69% of normal white matter, and 60% of tumour radiomics were significantly different following WhiteStripe (88 and 98% without WhiteStripe, respectively) | Intensity standardisation results in similar intensity values in images, but significant scanner-dependent changes require further correction with ComBat |
HSASR histogram specification with automatic selection of reference, HS-GS histogram specification grid search
aTrain/test numbers are only stated for any predictive model developed in the study; ‘n/a’ stated if no model was developed
bModel developed using leave one out cross-validation, according to stated references in the study
Limitations of the current literature and opportunities for the future
| Limitation | Opportunity |
|---|---|
| 1. Assessing the effect of multiple preprocessing steps simultaneously | Effects of preprocessing steps presented independently of others so their effect on the result can be determined |
| 2. Investigating the effect of only one intensity standardisation technique | Impact of more than one standardisation method on a predictive model or feature robustness should be evaluated |
| 3. Lack of scan-rescan data used to test the repeatability of radiomic features | Increased availability of datasets that have rescanned a patient with a diffuse glioma within a short time interval (i.e. days) in public databases |
| 4. Single-centre studies used to assess standardisation techniques | Use of multi-centre datasets in assessing the efficacy of standardisation techniques and repeatability of radiomic features |