| Literature DB >> 35814761 |
Turkey Refaee1,2, Zohaib Salahuddin1, Anne-Noelle Frix3, Chenggong Yan1,4, Guangyao Wu5, Henry C Woodruff1,6, Hester Gietema6, Paul Meunier7, Renaud Louis3, Julien Guiot3, Philippe Lambin1,6.
Abstract
Purpose: To develop handcrafted radiomics (HCR) and deep learning (DL) based automated diagnostic tools that can differentiate between idiopathic pulmonary fibrosis (IPF) and non-IPF interstitial lung diseases (ILDs) in patients using high-resolution computed tomography (HRCT) scans. Material andEntities:
Keywords: artificial intelligence (AI); computed tomography; idiopathic pulmonary fibrosis; interpretability; interstitial lung disease; radiomics
Year: 2022 PMID: 35814761 PMCID: PMC9259876 DOI: 10.3389/fmed.2022.915243
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
FIGURE 1The flowchart diagram shows the patient selection process. IPF, idiopathic pulmonary fibrosis, ILDs, non-IPF interstitial lung diseases.
FIGURE 2Radiomics Pipeline for Lung disease classification from CT images. The same 12 radiomics features from both lungs after feature selection are concatenated and fed to the Random Forest classifier. Post-hoc SHAP analysis is performed for interpretability.
FIGURE 3Figure shows different steps in the deep learning pipeline for the prediction of lung diseases in CT scans.
Demographic and clinical information of the study participants.
| Variables | Site 1 | Database A | |
| n | 365 | 109 | −− |
| Age [mean(SD)] | 64.10 (9.57) | 63.61 (14.17) | 0.8 |
| Sex = M (%) | 213 (87) | 74 (67.9) | 0.09 |
| FEV1 [mean (SD)] | 80.42 (21.47) | 69.60 (20.67) | <0.001 |
| FVC [mean(SD)] | 80.52 (21.25) | 67.35 (21.37) | <0.001 |
| DLCO [mean(SD)] | 51.32 (24.99) | 29.84 (5.36) | <0.001 |
| BMI [mean(SD)] | 25.48 (6.45) | 29.55 (5.21) | <0.001 |
BMI, body mass index, FEV, forced expiratory volume, FVC, forced vital capacity, and diffusion capacity of the lungs for carbon monoxide (DLCO) are shown in the table for different patients along with their mean and standard deviation (SD).
FIGURE 4Receiver operating characteristics (ROC) curves for five-fold cross-validation (A) and external test dataset (B) for the classification of IPF and non-IPF ILDs using handcrafted radiomics (HCR), deep learning (DL), and ensemble (HCR + DL) models.
Precision and recall metrics for five-fold cross-validation using handcrafted radiomics (HCR), deep learning (DL), and an ensemble of HCR and DL models.
| Model | Accuracy | Sensitivity | Specificity | Positive predictive value (PPV) | Negative predictive value (NPV) |
| Handcrafted radiomics (HCR) | 0.762 ± 0.068 | 0.816 ± 0.094 | 0.745 ± 0.065 | 0.506 ± 0.084 | 0.923 ± 0.040 |
| Deep learning (DL) | 0.779 ± 0.046 | 0.711 ± 0.10 | 0.800 ± 0.075 | 0.541 ± 0.074 | 0.901 ± 0.025 |
| Ensemble (HCR + DL) |
|
|
|
|
|
Comparison of diagnostic performance on the external test dataset for HCR, DL, an ensemble of HCR and DL, and in-silico trial with clinicians.
| Model | Accuracy | Sensitivity | Specificity | Positive predictive value (PPV) | Negative predictive value (NPV) |
| Handcrafted radiomics (HCR) | 0.761 | 0.698 | 0.821 | 0.787 | 0.741 |
| Deep learning (DL) | 0.779 | 0.792 | 0.768 | 0.763 | 0.796 |
| Ensemble (HCR + DL) |
|
|
|
|
|
| 0.66 ± 0.067 | 0.572 ± 0.186 | 0.750 ± 0.0525 | 0.680 ± 0.042 | 0.669 ± 0.100 |
FIGURE 5Global SHAP summary plots (A) demonstrate the impact of the top 20 features on the model output in terms of SHAP values and the corresponding feature values. SHAP dependence plots (B–D), and (E) show the effect of a particular feature value on the SHAP value and its interaction with another feature.
FIGURE 6GradCAM heatmaps for post-hoc interpretability of IPF and non-IPF ILDs HRCT scans to understand the predictions made by the Densenet-121.