| Literature DB >> 35869125 |
Víctor M Campello1, Carlos Martín-Isla2, Cristian Izquierdo2, Andrea Guala3,4, José F Rodríguez Palomares3,4,5,6, David Viladés4,7, Martín L Descalzo7, Mahir Karakas8, Ersin Çavuş9,10, Zahra Raisi-Estabragh11,12, Steffen E Petersen11,12,13,14, Sergio Escalera2,15, Santi Seguí2, Karim Lekadir2.
Abstract
Radiomics is an emerging technique for the quantification of imaging data that has recently shown great promise for deeper phenotyping of cardiovascular disease. Thus far, the technique has been mostly applied in single-centre studies. However, one of the main difficulties in multi-centre imaging studies is the inherent variability of image characteristics due to centre differences. In this paper, a comprehensive analysis of radiomics variability under several image- and feature-based normalisation techniques was conducted using a multi-centre cardiovascular magnetic resonance dataset. 218 subjects divided into healthy (n = 112) and hypertrophic cardiomyopathy (n = 106, HCM) groups from five different centres were considered. First and second order texture radiomic features were extracted from three regions of interest, namely the left and right ventricular cavities and the left ventricular myocardium. Two methods were used to assess features' variability. First, feature distributions were compared across centres to obtain a distribution similarity index. Second, two classification tasks were proposed to assess: (1) the amount of centre-related information encoded in normalised features (centre identification) and (2) the generalisation ability for a classification model when trained on these features (healthy versus HCM classification). The results showed that the feature-based harmonisation technique ComBat is able to remove the variability introduced by centre information from radiomic features, at the expense of slightly degrading classification performance. Piecewise linear histogram matching normalisation gave features with greater generalisation ability for classification ( balanced accuracy in between 0.78 ± 0.08 and 0.79 ± 0.09). Models trained with features from images without normalisation showed the worst performance overall ( balanced accuracy in between 0.45 ± 0.28 and 0.60 ± 0.22). In conclusion, centre-related information removal did not imply good generalisation ability for classification.Entities:
Mesh:
Year: 2022 PMID: 35869125 PMCID: PMC9307565 DOI: 10.1038/s41598-022-16375-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Percentage of first and second order features below the 0.01 JSD threshold for healthy subjects. Results are averaged over centre pairs and ROI and presented separately for ED and ES frames. Only features with square cross-correlation below 0.9 were considered. The black lines represent the standard deviation. O original images (without normalisation), R image intensity rescaling, N image intensity normalisation, HM histogram matching and PLHM piecewise linear histogram matching. An “R.” in front of a method means that it is applied at ROI level.
Figure 2Percentage of first and second order features below the 0.01 JSD threshold for healthy subjects after the application of the feature-based harmonisation tool ComBat. Results are averaged over centre pairs and ROI and presented separately for ED and ES frames. Only features with square cross-correlation below 0.9 were considered. The black lines represent the standard deviation. O original images (without normalisation), R image intensity rescaling, N image intensity normalisation, HM histogram matching and PLHM: piecewise linear histogram matching. An “R.” in front of a method means that it is applied at ROI level.
Mean and standard deviation (in parenthesis) for JSD for distributions of features obtained after the application of R.PLHM normalisation on healthy patients. Results are presented separately for ED and ES frames and for each feature family before and after the application of ComBat harmonisation. Only features with square cross-correlation below 0.9 were considered. Values are averaged over ROI. Numbers in blue stand for non-significant differences in the JSD distributions when compared to first order features according to the Mann–Whitney U test at the 0.01 level.
| Family | Without combat | With combat | ||
|---|---|---|---|---|
| ED | ES | ED | ES | |
| 1st order | 0.009 (0.009) | 0.008 (0.007) | 0.012 (0.011) | 0.011 (0.008) |
| GLCM | 0.008 (0.007) | 0.009 (0.009) | 0.013 (0.012) | 0.012 (0.013) |
| GLDM | 0.011 (0.010) | 0.010 (0.008) | 0.013 (0.013) | 0.011 (0.010) |
| GLRLM | 0.012 (0.011) | 0.010 (0.007) | 0.011 (0.010) | 0.011 (0.009) |
| GLSZM | 0.011 (0.011) | 0.011 (0.010) | 0.013 (0.012) | 0.011 (0.010) |
Figure 3Balanced accuracy of random forest models when predicting the centre of origin of healthy subjects for first and second order texture features before and after the application of ComBat harmonisation. The row above corresponds to image preprocessing techniques applied at the whole image level, while in the row below they are applied at the ROI level. O original images (without normalisation), R image intensity rescaling, N image intensity normalisation, HM histogram matching, PLHM piecewise linear histogram matching. An “R.” in front of a method means that it is applied at ROI level.
Figure 4Balanced accuracy of random forest models on unseen centres for classification of HCM versus healthy patients. All models were trained with a combination of first and second order texture features from all ROIs. The first column corresponds to models trained with features extracted from Vall d’Hebron studies, while models in the second column were trained with features from Sagrada Familia studies. The row above corresponds to image preprocessing techniques applied at the whole image level, while in the row below they are applied at the ROI level. HCM hypertrophic cardiomyopathy, O original images (without normalisation), R image intensity rescaling, N image intensity normalisation, HM histogram matching, PLHM piecewise linear histogram matching. An “R.” in front of a method means that it is applied at ROI level.
Figure 5Balanced accuracy of random forest models on the validation set (same domain) versus the testing set (unseen centres) for classification of HCM versus healthy patients. Results are presented without ComBat harmonisation. All models were trained with a combination of first and second order texture features from all ROIs. The first column corresponds to models trained with features extracted from Vall d’Hebron studies, while models in the second column were trained with features from Sagrada Familia studies. The row above corresponds to image preprocessing techniques applied at the whole image level, while in the row below they are applied at the ROI level. HCM hypertrophic cardiomyopathy, O original images (without normalisation), R image intensity rescaling, N image intensity normalisation, HM histogram matching, PLHM piecewise linear histogram matching. An “R.” in front of a method means that it is applied at ROI level.
Distribution of diseases per centre considered in the analysis.
| Centre | Creu Blanca | Dexeus | Sagrada Familia | Universitätsklinikum Hamburg-Eppendorf | Vall d’Hebron | Total |
|---|---|---|---|---|---|---|
| Canon | General Electric | Philips | Philips | Siemens | ||
| Healthy | 14 | 11 | 33 | 32 | 22 | 112 |
| HCM | 15 | 5 | 37 | 14 | 25 | 106 |
Average specifications for the studies acquired in the five different centres.
| Centre | Vendor | Model | In-plane resolution (mm) | Slice thickness (mm) | Number of slices | Intensities range |
|---|---|---|---|---|---|---|
| Vall d’Hebron | Siemens | Magnetom Avanto | 1.32 | 9.2 | 12 | 0–1193 |
| Sagrada Familia | Philips | Achieva | 1.20 | 9.9 | 10 | 0–357 |
| Universitätsklinikum Hamburg-Eppendorf | Philips | Achieva | 1.45 | 9.9 | 11 | 0–3725 |
| Dexeus | General Electric | Signa Excite | 1.36 | 10 | 12 | 0–3030 |
| Creu Blanca | Canon | Vantage Orian | 0.85 | 10 | 13 | 0–14,442 |