| Literature DB >> 35302238 |
Thibault Escobar1,2, Sébastien Vauclin2, Fanny Orlhac1, Christophe Nioche1, Pascal Pineau2, Laurence Champion1,3, Hervé Brisse1,4, Irène Buvat1.
Abstract
BACKGROUND: Translation of predictive and prognostic image-based learning models to clinical applications is challenging due in part to their lack of interpretability. Some deep-learning-based methods provide information about the regions driving the model output. Yet, due to the high-level abstraction of deep features, these methods do not completely solve the interpretation challenge. In addition, low sample size cohorts can lead to instabilities and suboptimal convergence for models involving a large number of parameters such as convolutional neural networks.Entities:
Keywords: interpretability; machine learning; radiomics; sub-region; voxel-wise
Mesh:
Year: 2022 PMID: 35302238 PMCID: PMC9325536 DOI: 10.1002/mp.15603
Source DB: PubMed Journal: Med Phys ISSN: 0094-2405 Impact factor: 4.506
FIGURE 1Example of engineered radiomic feature maps. As the models were trained by taking the mean values inside the region of interest as inputs, there was no need to resample all the feature maps on a common grid at this stage. The feature maps then had different spatial resolution (3 mm × 3mm × 3 mm for positron emission tomography (PET), 1 mm × 1 mm × 1 mm for computed tomography (CT) and magnetic resonance imaging (MRI)). (a) CT first‐order entropy, (b) PET gray‐level co‐occurrence matrix (GLCM) contrast, (c) T1 gray‐level dependence matrix (GLDM) gray‐level non‐uniformity (GLNU), and (d) fat‐suppressed T2 gray‐level run length matrix (GLRLM) long run high gray‐level emphasis (LRHGLE)
FIGURE 2Average stratified Brier score permutation test distribution for M1 and M2 model building settings. (a) Positron emission tomography (PET)/computed tomography (CT) and (b) magnetic resonance imaging (MRI)
Cross‐validated performance for the grid‐search forward selection and least absolute shrinkage and selection operator (LASSO) C parameter optimization
| Model building settings | M1 | M2 |
|---|---|---|
|
| 2.2 | 2.2 |
| Number of selected features | 5 (1 shape feature) | 5 (1 shape feature) |
| ASB (±1 SD) | 0.872 ± 0.056 | 0.838 ± 0.065 |
| Brier score loss (±1 SD) | 0.133 ± 0.057 | 0.167 ± 0.068 |
| ROC AUC (±1 SD) | 0.910 ± 0.094 | 0.853 ± 0.115 |
Abbreviations: ASB, average stratified Brier score; AUC, area under the curve; ROC, receiver operating characteristic; SD, standard deviation.
FIGURE 3Probability density function of the out‐of‐bag (OOB) receiver operating characteristic (ROC) area under the curve (AUC) for M1 (positron emission tomography (PET)/computed tomography (CT)), M2 (magnetic resonance imaging (MRI)), anatomical tumor volume (ATV), SUVmax, metabolic tumor volume (MTV), and total lesion glycolysis (TLG). The average ROC curves associated with these distributions are reported in the left sub‐figure
Bootstrap out‐of‐bag (OOB) receiver operating characteristic (ROC) area under the curve (AUC) for M1 (positron emission tomography (PET)/computed tomography (CT)), M2 (magnetic resonance imaging (MRI)), anatomical tumor volume (ATV), SUVmax, metabolic tumor volume (MTV), and total lesion glycolysis (TLG)
| OOB ROC AUC | M1 | M2 | ATV |
|---|---|---|---|
| Mean (±1 SD) | 0.883 ± 0.086 | 0.840 ± 0.090 | 0.691 ± 0.107 |
| 95% CI | [0.660, 1.000] | [0.622, 0.974] | [0.472, 0.890] |
| Maximum PDF (mode) | 0.908 | 0.858 | 0.703 |
Abbreviations: CI, confidence interval; PDF, probability density function; SD, standard deviation.
FIGURE 4Slice examples of radiomic decision maps DVM1 (a) and DVM2 (b), positron emission tomography (PET) (c), computed tomography (CT) (d), T1 (e), and fat‐suppressed T2 (f) images for six patients
FIGURE 5Joint scatter and kernel density estimation plot comparing M1ʹ and M1 probabilistic outputs on the whole dataset. The color of the dots represents the true label of the corresponding patients (blue: no lung metastasis occurrence, red: lung metastasis occurrence)