| Literature DB >> 35903037 |
Shuwen Wang1, Leilei Zhou1, Xiaoran Li2, Jie Tang3, Jing Wu1, Xindao Yin1, Yu-Chen Chen1, Lingquan Lu1.
Abstract
BACKGROUND In this study we aimed to establish a new transfer learning model based on noncontrast and thin-layer computed tomography (CT) scans to distinguish between malignant and benign solid lung nodules. MATERIAL AND METHODS CT images from 202 patients with 210 lesions (malignant: 127, benign: 83) manifesting as solid lung nodules from January 2016 to December 2020 from 3 institutions were retrospectively collected, and each nodule was histopathologically confirmed. Two experienced thoracic radiologists reviewed all images and determined the regions of interest (ROIs) in the three-dimensional (3D) images layer-by-layer. We divided the lesions and images into training and testing sets at a ratio of 7: 3. The Inception V3 model was pretrained by the training dataset. Five-fold cross-validation was used to choose the optimal model. Receiver operator characteristic curves (ROC curves) for methods to evaluate the performance of the models were drafted. RESULTS In the validation set, the AUC, accuracy, sensitivity, and specificity of Inception V3 model (lesion-level) were 0.999, 0.989, 0.983, and 1.0, respectively, which is higher than the image-level (0.997, 0.933, 0.922, and 0.948, respectively). The Inception V3 model (lesion-level) performed better than the image-level but there was no significant difference between the models (P>0.05). The ResNet50 model based on image level achieved AUC, accuracy, sensitivity, and specificity of 0.963, 0.926, 0.916, and 0.944, respectively, which is lower than that of Inception V3. CONCLUSIONS Our study developed a novel deep learning model based on noncontrast and thin-layer CT scans to classify benign vs malignant lung nodules, and the Inception V3 model greatly improved the differentiation accuracy and specificity.Entities:
Mesh:
Year: 2022 PMID: 35903037 PMCID: PMC9344882 DOI: 10.12659/MSM.936830
Source DB: PubMed Journal: Med Sci Monit ISSN: 1234-1010
Figure 1The workflow of the collection of patients. Created using Microsoft Office Visio 2016, China.
Image data of patients.
| Patients | Lesions | Images | |
|---|---|---|---|
| Benign | 76 | 83 | 2120 |
| Malignant | 126 | 127 | 2888 |
| Total | 202 | 210 | 5008 |
Lesions – pulmonary nodules; Images – handcraft-annotation nodule CT images.
Figure 2CT images with discordant interpretations between deep learning models and radiologists. (A) Model False-Positives: a 63-year-old male patient with histologically-confirmed benign nodule when the model predicts a malignant nodule. (B) Model False-Negatives: a 47-year-old male patient with histological-confirmed malignant nodule when the model predicts a benign nodule.
Characteristics of solid nodules in the lung.
| Malignant nodules (n=127) | Benign nodules (n=83) | ||
|---|---|---|---|
|
| 0.078 | ||
| Male | 67 (52.8) | 54 (65.1) | |
| Female | 60 (47.2) | 29 (34.9) | |
|
| 65.17±8.09 | 62.31±11.6 | 0.053 |
|
| 0.172 | ||
| Left upper lobe | 30 (23.6) | 25 (30.1) | |
| Left lower lobe | 19 (15.0) | 11 (13.3) | |
| Left mixed | 2 (1.6) | 0 | |
| Right upper lobe | 36 (28.3) | 26 (31.3) | |
| Right middle lobe | 9 (7.1) | 0 | |
| Right lower lobe | 27 (21.3) | 21 (25.3) | |
| Right mixed | 4 (3.1) | 0 | |
|
| <0.001 | ||
| Absent | 29 (22.8) | 47 (56.6) | |
| Present | 98 (77.2) | 36 (43.4) | |
|
| <0.001 | ||
| Absent | 15 (11.8) | 43 (51.8) | |
| Present | 112 (88.2) | 40 (48.2) | |
|
| 0.904 | ||
| 6–10 | 11 (8.7) | 8 (9.6) | |
| 11–30 | 75 (59.1) | 52 (62.6) | |
| >30 | 41 (32.3) | 23 (27.7) | |
|
| 26.65 (7–100) | 24.36 (6–67) | |
|
| / | ||
| ADC | 104 (81.9) | ||
| SCC | 23 (18.1) | ||
| OP | 3 (3.6) | ||
| TB | 41 (49.4) | ||
| PCP | 13 (15.6) | ||
| IPT | 6 (7.2) | ||
| Granuloma | 20 (24.1) |
ADC – adenocarcinoma; SCC – squamous cell carcinoma; OP – organizing pneumonia; TB – tuberculosis; PCP – pulmonary cryptococcosis; IPT – inflammatory pseudotumor; Granulomas – histopathology-confirmed granulomas but unknown to the specific disease.
P<0.05.
Results of Inception V3 model.
| Models | Training set | Validation set | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | Accuracy | Sensitivity | Specificity | PPV | NPV | AUC | Accuracy | Sensitivity | Specificity | PPV | NPV | |
| Lesion-level | 0.998 | 0.984 | 0.974 | 1.0 | 1.0 | 0.962 | 0.999 | 0.989 | 0.983 | 1.0 | 1.0 | 0.969 |
| Image-level | 0.978 | 0.936 | 0.913 | 0.964 | 0.969 | 0.899 | 0.977 | 0.933 | 0.922 | 0.948 | 0.960 | 0.901 |
PPV – positive predictive value; NPV – negative predictive value; AUC – area under the curve.
Five-fold cross-validation results of InceptionV3 and ResNet50 models (image-level).
| AUC | Accuracy | Sensitivity | Specificity | PPV | NPV | |
|---|---|---|---|---|---|---|
| Inception V3 | 0.977 | 0.933 | 0.922 | 0.948 | 0.960 | 0.901 |
| ResNet50 | 0.963 | 0.926 | 0.916 | 0.944 | 0.957 | 0.892 |
PPV – positive predictive value; NPV – negative predictive value; AUC – area under the curve.
Figure 3(A, B) Receiver operating characteristic (ROC) curves for methods to predict pulmonary nodules of the 2 transfer learning models (ResNet50 and Inception V3) (image-level). The x-axis represents the false-positive rate (FPR) and the y-axis represents the true-positive rate (TPR). Created using matplotlib version 2.2.2 (Python 3.6).