| Literature DB >> 35882891 |
Alexandre Carré1,2, Enzo Battistella1,3,4, Stephane Niyoteka1,2, Roger Sun1,2, Eric Deutsch1,2, Charlotte Robert5,6.
Abstract
The use of multicentric data is becoming essential for developing generalizable radiomic signatures. In particular, Magnetic Resonance Imaging (MRI) data used in brain oncology are often heterogeneous in terms of scanners and acquisitions, which significantly impact quantitative radiomic features. Various methods have been proposed to decrease dependency, including methods acting directly on MR images, i.e., based on the application of several preprocessing steps before feature extraction or the ComBat method, which harmonizes radiomic features themselves. The ComBat method used for radiomics may be misleading and presents some limitations, such as the need to know the labels associated with the "batch effect". In addition, a statistically representative sample is required and the applicability of a signature whose batch label is not present in the train set is not possible. This work aimed to compare a priori and a posteriori radiomic harmonization methods and propose a code adaptation to be machine learning compatible. Furthermore, we have developed AutoComBat, which aims to automatically determine the batch labels, using either MRI metadata or quality metrics as inputs of the proposed constrained clustering. A heterogeneous dataset consisting of high and low-grade gliomas coming from eight different centers was considered. The different methods were compared based on their ability to decrease relative standard deviation of radiomic features extracted from white matter and on their performance on a classification task using different machine learning models. ComBat and AutoComBat using image-derived quality metrics as inputs for batch assignment and preprocessing methods presented promising results on white matter harmonization, but with no clear consensus for all MR images. Preprocessing showed the best results on the T1w-gd images for the grading task. For T2w-flair, AutoComBat, using either metadata plus quality metrics or metadata alone as inputs, performs better than the conventional ComBat, highlighting its potential for data harmonization. Our results are MRI weighting, feature class and task dependent and require further investigations on other datasets.Entities:
Mesh:
Year: 2022 PMID: 35882891 PMCID: PMC9325761 DOI: 10.1038/s41598-022-16609-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Institutional information of patients of Bakas et al.[24] and patients selected in our study.
| Collection | Institutions | N | N selected | TCGA-ID |
|---|---|---|---|---|
| TCGA-GBM | Henry Ford Hospital, Detroit, MI | 46 | 46 | TCGA-06 |
| CWRU School of Medicine, Cleveland, OH | 9 | 9 | TCGA-19 | |
| University of California, San Francisco, CA | 22 | 22 | TCGA-08 | |
| Emory University, Atlanta, GA | 6 | 0 | TCGA-14 | |
| MD Anderson Cancer Center, Houston, TX | 25 | 25 | TCGA-02 | |
| Duke University School of Medicine, Durham, NC | 10 | 9 | TCGA-12 | |
| Thomas Jefferson University, Philadelphia, PA | 14 | 14 | TCGA-76 | |
| Fondazione IRCCSInstituto Neuroligico C. Besta, Milan, Italy | 3 | 0 | TCGA-27 | |
| TCGA-LGG | St Joseph Hospital/Medical Center, Phoenix, AZ | 29 | 29 | TCGA-HT |
| Henry Ford Hospital, Detroit, MI | 52 | 52 | TCGA-DU | |
| Case Western Reserve University, Cleveland, OH | 10 | 10 | TCGA-FG | |
| Thomas Jefferson University, Philadelphia, PA | 16 | 16 | TCGA-CS | |
| University of North Carolina, Chapel Hill, NC | 1 | 0 | TCGA-EZ | |
| Total | 243 | 232 |
TCGA The Tumor Genome Atlas.
Summary table of metadata and quality metrics extracted from the raw DICOM files.
| Type | Name | Description | Tags |
|---|---|---|---|
| Metadata | Rows | Number of rows in the image | 0028,0010 |
| Columns | Number of columns in the image | 0028,0011 | |
| Vox_X | Voxel resolution in x plane | 0028,0030 | |
| Vox_Y | Voxel resolution in y plane | 0028,0030 | |
| Vox_Z | Voxel resolution in z plane | 0018,0050 | |
| PixelBandwidth | Reciprocal of the total sampling period, in hertz per pixel | 0018,0095 | |
| Manufacturer | Manufacturer of the equipment | 0008,0070 | |
| ModelName | Model name of the manufacturer of the equipment | 0008,1090 | |
| MagneticField | Nominal field strength of the MR magnet, in Tesla | 0018,0087 | |
| EchoNumbers | Echo number used to generate the image | 0018,0086 | |
| EchoTime | Time in ms between the middle of the excitation pulse and the peak of the echo produced (kx=0) | 0018,0081 | |
| EchoTrainLength | Number of lines in k-space acquired per excitation per image | 0018,0091 | |
| InversionTime | Time in ms between the middle of the inverting RF pulse and the middle of the excitation pulse to detect the amount of longitudinal magnetization | 0018,0082 | |
| RepetitionTime | The period of time in ms between the beginning of a pulse sequence and the beginning of the succeeding (essentially identical) pulse sequence | 0018,0080 | |
| FlipAngle | Steady state angle in degrees by which the magnetic vector is flipped with respect to the magnetic vector of the primary field | 0018,1314 |
F is Foreground intensity voxels () with foreground voxels. B is Background intensity voxels () with background voxels.
is Foreground random patch voxels (n = 5000, with a 5 × 5 × 5 patch-size). is Background random patch voxels (n = 5000, with a 5 × 5 × 5 patch-size).
Figure 1Study design.
Figure 2Workflows for the ComBat and AutoComBat computation models. These workflows are performed after extraction of the radiomic features.
Figure 3Parallel coordinate plots per center of the information extracted from the dataset for the T1w-gd and T2w-flair MRI. (a) Information extracted from the header of the DICOM files. (b) Measurement of quality metrics.
Counts (%) of features for each harmonization method with a RSD (95% CI) lower than the one corresponding to the raw images for the T1w-gd and T2w-flair MRI on the test set.
| MRI | Method | Feature class | (n = 91) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| First-order(n=18) | glcm(n=22) | gldm(n=14) | glrlm(n=16) | glszm (n=16) | ngtdm(n=5) | ||||
| Total | |||||||||
| Top | Vs. raw | ||||||||
| Preprocess | 4 | 5 | 7 | 12 | 7 | 2 | 37 (41%) | 75 (82%) | |
| ComBat | 3 | 1 | 3 | 6 | 2 | 0 | 15 (16%) | 62 (68%) | |
| AutoComBat | |||||||||
| All | 2 | 2 | 0 | 1 | 5 | 1 | 11 (12%) | 49 (54%) | |
| Metadata | 3 | 3 | 1 | 2 | 5 | 1 | 15 (16%) | 56 (62%) | |
| QM | 16 | 20 | 8 | 10 | 10 | 4 | 68 (75%) | 68 (75%) | |
| Preprocess | 13 | 20 | 11 | 10 | 13 | 4 | 71 (78%) | 71 (78%) | |
| ComBat | 10 | 14 | 11 | 9 | 11 | 4 | 59 (65%) | 72 (79%) | |
| AutoComBat | |||||||||
| All | 6 | 5 | 8 | 7 | 8 | 2 | 36 (40%) | 47 (52%) | |
| Metadata | 12 | 12 | 6 | 4 | 10 | 1 | 45 (49%) | 55 (60%) | |
| QM | 2 | 0 | 2 | 0 | 1 | 0 | 5 (5%) | 7 (8%) | |
The main part of the table gives the number of features for which the considered method is evaluated as the best one, which is called “Top”. Total vs. Raw gives the total number of features for each method that are significantly better compared to Raw.
For AutoComBat, “All” means the use of Metadata and Quality Metrics. QM Quality Metrics.
Figure 4Harmonization strength evaluated on the WM radiomic features for the T1w-gd MRI on the test set. Points represent the RSD values and error bars are the 95% CI.
Figure 5Harmonization strength evaluated on the WM radiomic features for the T2w-flair MRI on the test set. Points represent the RSD values and error bars are the 95% CI.
Figure 6Clustering interpretation of AutoComBat for the T1w-gd images. (A, C) correspond to normalized feature importance in the final clusterings and (B, D) are visual representations of the proposed clusters. In both cases, a feature reduction strategy was retained in AutoComBat. (A, B) AutoComBat using all features as inputs - UMAP feature reduction, (C, D) AutoComBat using QM - PCA feature reduction.
Figure 7Balanced accuracy for the tumor grading task for the 5 machine learning models (RF, SVC, XGBoost, KNN, LR) and the different MRI images (T1w, T1w-gd, T2w, T2w-flair) on the test set for the first, second and first & second-order feature types depending on the harmonization method. Each color corresponds to a harmonization method. Each dot indicates the performance of one ML algorithm, and the vertical dashed line is the median value of the performance of the 5 ML algorithms.