| Literature DB >> 36202934 |
M G Poirot1,2, M W A Caan3, H G Ruhe4,5,6, A Bjørnerud7,8,9, I Groote9,10, L Reneman4, H A Marquering4,3.
Abstract
Radiomics in neuroimaging uses fully automatic segmentation to delineate the anatomical areas for which radiomic features are computed. However, differences among these segmentation methods affect radiomic features to an unknown extent. A scan-rescan dataset (n = 46) of T1-weighted and diffusion tensor images was used. Subjects were split into a sleep-deprivation and a control group. Scans were segmented using four segmentation methods from which radiomic features were computed. First, we measured segmentation agreement using the Dice-coefficient. Second, robustness and reproducibility of radiomic features were measured using the intraclass correlation coefficient (ICC). Last, difference in predictive power was assessed using the Friedman-test on performance in a radiomics-based sleep deprivation classification application. Segmentation agreement was generally high (interquartile range = 0.77-0.90) and median feature robustness to segmentation method variation was higher (ICC > 0.7) than scan-rescan reproducibility (ICC 0.3-0.8). However, classification performance differed significantly among segmentation methods (p < 0.001) ranging from 77 to 84%. Accuracy was higher for more recent deep learning-based segmentation methods. Despite high agreement among segmentation methods, subtle differences significantly affected radiomic features and their predictive power. Consequently, the effect of differences in segmentation methods should be taken into account when designing and evaluating radiomics-based research methods.Entities:
Mesh:
Year: 2022 PMID: 36202934 PMCID: PMC9537186 DOI: 10.1038/s41598-022-20703-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Pairwise segmentation agreement matrix. Dice-coefficients between each pair of segmentation methods for each subcortical area. Left–Right averages are shown where possible. Clarification of abbreviations: WM white matter, DC diencephalon, CSF cerebral spinal fluid, Inf Lat Vent inferior lateral ventricle.
Figure 2Reproducibility and robustness for each class of radiomic features. For an explanation of abbreviations see Supplementary Note S6.
Classification performance of pipelines for each segmentation method. Median performance of cross-validation folds and IQR are shown. Best performance shown in bold.
| BCE loss | Accuracy | |
|---|---|---|
| SAMSEG | 0.53 (0.37–0.70) | 77% (60–89%) |
| MED-DEEPBRAIN | 0.50 (0.30–0.69) | 82% (62–91%) |
| ASEG | 0.60 (0.47–0.72) | 71% (62–82%) |
| FASTSURFER |