| Literature DB >> 34994091 |
Zhixin Qiu1, Qingxia Wu2, Shuo Wang3,4, Zhixia Chen5, Feng Lin6, Yuyan Zhou1, Jing Jin1, Jinghong Xian7, Jie Tian2,3,4, Weimin Li1.
Abstract
BACKGROUND: Early identification of the malignant propensity of pulmonary ground-glass nodules (GGNs) can relieve the pressure from tracking lesions and personalized treatment adaptation. The purpose of this study was to develop a deep learning-based method using sequential computed tomography (CT) imaging for diagnosing pulmonary GGNs.Entities:
Keywords: deep learning; ground-glass nodules; multiple timepoints; sequential
Mesh:
Year: 2022 PMID: 34994091 PMCID: PMC8841714 DOI: 10.1111/1759-7714.14305
Source DB: PubMed Journal: Thorac Cancer ISSN: 1759-7706 Impact factor: 3.500
FIGURE 1The workflow of this study. This study included the following six parts: (a) baseline CT and follow‐up CT acquisition and ROI (the green box) delineation, (b) image preprocessing, (c) building a DL model that was pretrained in ImageNet and fine‐tuned with our CT images, (d) constructing DL‐features by initial CT and follow‐up CT, (e) building individualized GGN prediction models by the DL‐feature, and (f) comparing the DL model with radiologists
Patient characteristics in the primary and validation cohorts
| Characteristics | Training set ( |
| Testing set ( |
|
| ||
|---|---|---|---|---|---|---|---|
| Benign 108 21.0) | Malignant 400 (79.0) | Benign 46 (22.7) | Malignant 208 (77.3) | 0.35 | |||
| Age, years | < 0.001 | 0.007 | 0.08 | ||||
| Mean ± SD | 49.6 ± 12.4 | 54.4 ± 11.2 | 50.7 ± 11.4 | 55.8 ± 10.9 | |||
| Sex (No. %) | 0.34 | 0.75 | 0.67 | ||||
| Male | 34 (31.5) | 105 (26.2) | 12 (26.1) | 62 (29.8) | |||
| Female | 74 (68.5) | 295 (73.8) | 34 (73.9) | 146 (70.2) | |||
| Smoking (No. %) | 0.77 | 0.22 | 0.25 | ||||
| Yes | 12 (11.1) | 51 (12.8) | 4 (8.7) | 36 (17.3) | |||
| No | 96 (88.9) | 349 (87.2) | 42 (91.3) | 172 (82.7) | |||
| Extrapulmonary cancer history (No. %) | 0.006 | 0.37 | 0.37 | ||||
| Yes | 2 (1.9) | 44 (11.0) | 3 (6.5) | 26 (12.5) | |||
| No | 106 (98.1) | 356 (89.0) | 43 (93.5) | 182 (87.5) | |||
| Family cancer history (No. %) | 0.17 | 0.06 | 0.30 | ||||
| Yes | 10 (9.3) | 60 (15.0) | 3 (6.5) | 40 (19.2) | |||
| No | 98 (90.7) | 340 (85.0) | 43 (93.5) | 168 (80.8) | |||
| Nodule size (No. %) | < 0.001 | < 0.001 | 0.78 | ||||
| ≤ 10 mm | 73 (67.6) | 180 (45.0) | 36 (78.3) | 94 (45.2) | |||
| > 10 mm | 35 (32.4) | 220 (55.0) | 10 (21.7) | 114 (54.8) | |||
| Type (No. %) | 0.13 | 0.85 | 0.25 | ||||
| pGGN | 59 (54.6) | 183 (45.8) | 23 (50.0) | 110 (52.9) | |||
| mGGN | 49 (45.4) | 217 (54.2) | 23 (50.0) | 98 (47.1) | |||
| Location (No. %) | 0.38 | 0.46 | 0.61 | ||||
| RUL | 45 (41.7) | 152 (38) | 16 (8.7) | 93 (44.7) | |||
| RML | 11 (10.2) | 24 (6) | 5 (10.9) | 13 (6.2) | |||
| RLL | 18 (16.7) | 63 (15.8) | 5 (10.9) | 25 (12.0) | |||
| LUL | 24 (22.2) | 115 (28.7) | 16 (34.8) | 53 (25.5) | |||
| LLL | 10 (9.3) | 46 (11.5) | 4 (8.7) | 24 (11.5) | |||
| Time interval (No. %) | < 0.001 | < 0.001 | 0.89 | ||||
| ≤ 90 days | 46 (42.6) | 305 (76.2) | 11 (23.9) | 163 (78.4) | |||
| (90, 180) days | 25 (23.1) | 68 (17.0) | 19 (41.3) | 26 (12.5) | |||
| >180 days | 37 (34.3) | 27 (6.8) | 16 (34.8) | 19 (9.1) | |||
| Pathology | |||||||
| Inflammatory | 73 | 37 | |||||
| Granuloma | 3 | 0 | |||||
| Fibrosis | 10 | 5 | |||||
| Interstitial hyperplasia | 19 | 4 | |||||
| Hamartoma | 1 | 0 | |||||
| Sclerosing | 1 | 0 | |||||
| Tuberculosis | 1 | 0 | |||||
| Invasive adenocarcinoma | 325 | 180 | |||||
| Preinvasive adenocarcinoma | 73 | 28 | |||||
| Squamous carcinoma | 2 | 0 | |||||
Note: Cancer (IASLC)
Abbreviations: LLL, left lower lobe; LUL, left upper lobe; mGGN, mixed ground‐glass nodule; pGGN, pure ground‐glass nodule; RLL, right lower lobe; RML, right middle lobe; RUL, right upper lobe.
p is derived from the univariable association analyses of each clinicopathological variable. Between patients with benign and malignant GGN in the training and testing set, respectively.
p represents the difference of each clinicopathological variable between the training and testing set.
Invasive adenocarcinoma includes minimally invasive adenocarcinoma and invasive pulmonary adenocarcinoma according to The International Association for the Study of Lung.
Preinvasive adenocarcinoma includes atypical adenomatous hyperplasia and adenocarcinomas in situ according to IASLC.
FIGURE 2Performance of the various models. (a) Receiver operating characteristic curves of the clinical model, DL model using initial CT, DL model using follow‐up CT, DL model using initial CT + follow‐up CT, and the combined model using initial CT+ follow‐up CT+ clinical in the training and testing sets. (b) Receiver operating characteristic curves of the various models and two radiologists. (c and d) DL‐score from the initial CT + follow‐up CT within the histological subgroup
Diagnostic performance of the models
| Training set | Testing set | |||||||
|---|---|---|---|---|---|---|---|---|
| AUC (95% CI) | ACC (%) | SEN (%) | SPE (%) | AUC (95%CI) | ACC (%) | SEN (%) | SPE (%) | |
| Clinical model | ||||||||
| Age + extrapulmonary cancer history + nodule size | 0.673 (0.617–0.729) | 64.57 (60.23–68.73) | 64.25 (59.34–68.95) | 65.74 (55.99–74.60) | 0.702 (0.619–0.784) | 66.54 (60.37–72.31) | 65.87 (58.99–72.80) | 69.57 (54.25–82.60) |
| DL model | ||||||||
| Initial CT | 0.741 (0.688–0.793) | 68.11 (63.86–72.15) | 67.25 (62.41–71.83) | 71.30 (61.80–79.59) | 0.776 (0.704–0.848) | 64.96 (58.75–70.82) | 60.58 (53.58–67.26) | 84.78 (71.13–93.66) |
| Follow‐up CT | 0.764 (0.713–0.815) | 64.57 (60.23–68.73) | 60.50 (55.52–65.32) | 79.63 (70.80–86.70) | 0.744 (0.670–0.818) | 67.32 (61.18–73.06) | 64.90 (58.00–71.38) | 78.26 (63.64–89.05) |
| Initial + follow‐up CT | 0.856 (0.821–0.890) | 74.21 (70.18–77.96) | 71.50 (66.80–75.88) | 84.26 (76.00–90.55) | 0.841 (0.777–0.904) | 75.59 (69.83–80.74) | 72.60 (66.00–78.54) | 89.13 (76.43–96.38) |
| Combined model | ||||||||
| Initial + follow‐up CT + age + extrapulmonary cancer history + nodule size | 0.867 (0.834–0.900) | 76.18 (72.23–79.82) | 72.00 (67.32–76.35) | 91.67 (84.77–96.12) | 0.827 (0.763–0.891) | 77.17 (71.50–82.18) | 75.48 (69.05–81.17) | 84.78 (71.13–93.66) |
| Junior radiologist | ||||||||
| Initial CT | 60.14 | 53.81 | 85.00 | |||||
| Initial + follow‐up CT | 66.89 | 65.68 | 71.67 | |||||
| Senior radiologist | ||||||||
| Initial CT | 73.31 | 76.27 | 61.67 | |||||
| Initial + follow‐up CT | 77.03 | 83.90 | 50.00 | |||||
Abbreviations: AUC, area under the receiver operating characteristic curve; CI, confidence interval; ACC, accuracy; SEN, sensitivity; SPE, specificity.
FIGURE 3Performance of the DL model using initial and follow‐up CT and radiologists within different subgroups in the testing set. (a) Receiver operating characteristic curves of the DL model and radiologists within tumor diameter subgroups. (b) Receiver operating characteristic curves of the DL model and radiologists within the time interval of two CT scanning subgroups. (c) Receiver operating characteristic curves of the DL model and radiologists within type subgroups
Diagnostic performance of the DL model and radiologists within different subgroups
| Training set | Testing set | Junior radiologist | Senior radiologist | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC (95% CI) | ACC (%) | SEN (%) | SPE (%) | AUC (95% CI) | ACC (%) | SEN (%) | SPE (%) | ACC (%) | SEN (%) | SPE (%) | ACC (%) | SEN (%) | SPE (%) | |
| Nodule size | ||||||||||||||
| ≤ 10 mm | 0.831 (0.781–0.882) | 75.10 | 74.44 | 76.71 | 0.778 (0.690–0.865) | 70.00 | 62.80 | 88.90 | 57.69 | 50.86 | 77.50 | 70.51 | 75.86 | 55.00 |
| > 10 mm | 0.872 (0.820–0.924) | 81.57 | 80.45 | 88.57 | 0.841 (0.678–0.999) | 80.65 | 79.82 | 90.00 | 77.14 | 80.00 | 60.00 | 84.29 | 91.67 | 40.00 |
| Time interval | ||||||||||||||
| ≤ 90 days | 0.829 (0.775–0.883) | 73.22 | 71.80 | 82.61 | 0.81 3(0.703–0.923) | 70.69 | 70.55 | 72.73 | 69.59 | 45.00 | 72.41 | 82.47 | 86.78 | 45.00 |
| (90, 180] days | 0.834 (0.752–0.916) | 74.19 | 72.06 | 80.00 | 0.854 (0.733–0.975) | 84.44 | 76.92 | 94.74 | 61.02 | 45.95 | 86.36 | 62.71 | 70.27 | 50.00 |
| > 180 days | 0.895 (0.815–0.975) | 84.38 | 74.07 | 91.89 | 0.908 (0.785–0.999) | 91.43 | 94.70 | 87.50 | 62.79 | 48.00 | 83.33 | 72.09 | 84.00 | 55.56 |
| Type | ||||||||||||||
| pGGN | 0.861 (0.814–0.909) | 73.55 | 70.49 | 83.05 | 0.808 (0.714–0.903) | 72.18 | 70.00 | 82.61 | 62.82 | 62.90 | 62.50 | 73.72 | 81.45 | 43.75 |
| mGGN | 0.853 (0.804–0.903) | 75.56 | 72.35 | 89.80 | 0.881 (0.801–0.962) | 81.82 | 81.63 | 82.61 | 71.43 | 68.75 | 82.14 | 80.71 | 86.61 | 57.14 |
Abbreviations: ACC, accuracy; AUC, area under the receiver operating characteristic curve; CI, confidence interval; mGGN, mixed ground‐glass opacity; pGGN, pure ground‐glass opacity; SEN, sensitivity; SPE, specificity.
FIGURE 4Representative prediction results from the testing set