| Literature DB >> 35710850 |
Xinzhi Teng1, Jiang Zhang1, Alex Zwanenburg2,3,4,5, Jiachen Sun1, Yuhua Huang1, Saikit Lam1, Yuanpeng Zhang1, Bing Li1, Ta Zhou1, Haonan Xiao1, Chenyang Liu1, Wen Li1, Xinyang Han1, Zongrui Ma1, Tian Li1, Jing Cai6.
Abstract
Radiomic model reliability is a central premise for its clinical translation. Presently, it is assessed using test-retest or external data, which, unfortunately, is often scarce in reality. Therefore, we aimed to develop a novel image perturbation-based method (IPBM) for the first of its kind toward building a reliable radiomic model. We first developed a radiomic prognostic model for head-and-neck cancer patients on a training (70%) and evaluated on a testing (30%) cohort using C-index. Subsequently, we applied the IPBM to CT images of both cohorts (Perturbed-Train and Perturbed-Test cohort) to generate 60 additional samples for both cohorts. Model reliability was assessed using intra-class correlation coefficient (ICC) to quantify consistency of the C-index among the 60 samples in the Perturbed-Train and Perturbed-Test cohorts. Besides, we re-trained the radiomic model using reliable RFs exclusively (ICC > 0.75) to validate the IPBM. Results showed moderate model reliability in Perturbed-Train (ICC: 0.565, 95%CI 0.518-0.615) and Perturbed-Test (ICC: 0.596, 95%CI 0.527-0.670) cohorts. An enhanced reliability of the re-trained model was observed in Perturbed-Train (ICC: 0.782, 95%CI 0.759-0.815) and Perturbed-Test (ICC: 0.825, 95%CI 0.782-0.867) cohorts, indicating validity of the IPBM. To conclude, we demonstrated capability of the IPBM toward building reliable radiomic models, providing community with a novel model reliability assessment strategy prior to prospective evaluation.Entities:
Mesh:
Year: 2022 PMID: 35710850 PMCID: PMC9203573 DOI: 10.1038/s41598-022-14178-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Changes in the training and validation C-indexes with respect to feature numbers in the step-wise backward feature elimination method under three-fold cross-validation, repeated 10 times. The points indicate the averaged C-index over cross-validation folds, and the shaded area indicates the range of one standard deviation (std).
The characteristics of selected features for model building.
| Features | C-index | ICC | |
|---|---|---|---|
| Log-sigma-6-0-mm-3D_gldm_LargeDependenceLowGrayLevelEmphasis_64_binCount | 0.619 | 0.045 | 0.747 |
| Wavelet-HHL_glrlm_LongRunLowGrayLevelEmphasis_128_binCount | 0.587 | 0.169 | 0.454 |
| Original_glszm_LargeAreaLowGrayLevelEmphasis_128_binCount | 0.614 | 0.066 | 0.610 |
| Wavelet-LLL_glrlm_RunEntropy_128_binCount | 0.608 | 0.064 | 0.900 |
| Wavelet-LHL_glszm_LowGrayLevelZoneEmphasis_64_binCount | 0.572 | 0.091 | 0.491 |
| Wavelet-HLL_glszm_SmallAreaHighGrayLevelEmphasis_128_binCount | 0.604 | 0.085 | 0.542 |
The univariate C-index, p-value, and ICC were tabulated. Feature names indicate the feature, the bin count (if applicable), and the image used to compute it.
Figure 2Visualization of model performance on the original and perturbed data.
Figure 3The feature map of wavelet-LLL_glrlm_RunEntropy (left) and wavelet-HLL_glszm_SmallAreaHighGrayLevelEmphasis_128_binCount (right). The window is fixed between 1 and 99 percentile of the feature map to eliminate the effects of noise.
The characteristics of selected features for model building in sensitivity analysis.
| Feature names | C-index | ICC | |
|---|---|---|---|
| Log-sigma-6-0-mm-3D_glszm_GrayLevelNonUniformity_128_binCount | 0.653 | 0.00001 | 0.97 |
| Wavelet-HHL_glszm_GrayLevelNonUniformity_64_binCount | 0.656 | 0.00001 | 0.91 |
The univariate C-index, p-value, and ICC were tabulated. Feature names indicate the feature, the bin count (if applicable), and the image used to compute it.
The model performance in discrimination and reliability.
| Training C-index | Testing C-index | Model robustness ICC | |
|---|---|---|---|
| Original features | 0.67 | 0.71 | 0.72 |
| Log-sigma features | 0.72 | 0.54 | 0.59 |
| Wavelet features | 0.80 | 0.56 | 0.51 |
| Original features | 0.65 | 0.74 | 0.85 |
| Log-sigma features | 0.58 | 0.54 | 0.91 |
| Wavelet features | 0.62 | 0.55 | 0.89 |
An improvement in model reliability is observed after removing non-robust radiomic features.
Figure 4The general workflow of the study.