| Literature DB >> 32006165 |
Guangyao Wu1,2, Henry C Woodruff3, Sebastian Sanduleanu3, Turkey Refaee3, Arthur Jochems3, Ralph Leijenaar3, Hester Gietema4, Jing Shen5, Rui Wang6, Jingtong Xiong7, Jie Bian7, Jianlin Wu8, Philippe Lambin3.
Abstract
OBJECTIVES: Develop a CT-based radiomics model and combine it with frozen section (FS) and clinical data to distinguish invasive adenocarcinomas (IA) from preinvasive lesions/minimally invasive adenocarcinomas (PM).Entities:
Keywords: Adenocarcinoma of lung; Carcinoma, non-small-cell lung; Frozen sections; Machine learning; Tomography, spiral computed
Mesh:
Year: 2020 PMID: 32006165 PMCID: PMC7160197 DOI: 10.1007/s00330-019-06597-8
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 5.315
Fig. 1Flowchart for patient selection from three hospitals
Fig. 2Flowchart showing the process of radiomics
Demographic and clinical characteristics of patients on different datasets
| Variable | Training ( | Testing ( | Validation ( | |
|---|---|---|---|---|
| Age, | ||||
| ≤ 60 | 172 (52.0) | 74 (51.7) | 67 (45.0) | 0.34 |
| > 60 | 159 (48.0) | 69 (48.3) | 82 (55.0) | |
| Gender, | ||||
| Male | 111 (33.5) | 45 (31.5) | 62 (41.6) | 0.14 |
| Female | 220 (66.5) | 98 (68.5) | 87 (58.4) | |
| Smoking, | ||||
| Yes | 46 (13.9) | 19 (13.3) | 30 (20.1) | 0.16 |
| No | 285 (86.1) | 124 (86.7) | 119 (79.9) | |
| Family history, | ||||
| Yes | 9 (2.7) | 6 (4.2) | 7 (4.7) | 0.49 |
| No | 322 (97.3) | 137 (95.8) | 142 (95.3) | |
| Final pathology, | ||||
| IA | 200 (60.4) | 86 (60.1) | 105 (70.5) | 0.08 |
| PM | 131 (39.6) | 57 (39.9) | 44 (29.5) | |
| Diameter (cm), | ||||
| ≤ 1 | 132 (39.9) | 54 (37.8) | 52 (34.9) | 0.54 |
| 1.1–2.0 | 131 (39.6) | 51 (35.7) | 63 (42.3) | |
| 2.1–3.0 | 68 (20.5) | 38 (26.6) | 34 (22.8) | |
| Location, | ||||
| LUL | 78 (23.6) | 31 (21.7) | 26 (17.4) | 0.37 |
| LLL | 49 (14.8) | 28 (19.6) | 22 (14.8) | |
| RUL | 127 (38.4) | 51 (35.7) | 51 (34.2) | |
| RML | 23 (6.9) | 10 (7.0) | 14 (9.4) | |
| RLL | 54 (16.3) | 23 (16.1) | 36 (24.2) | |
| Nodule type, | ||||
| pGGN | 124 (37.5) | 49 (34.3) | 37 (24.8) | 0.09 |
| PSN | 150 (45.3) | 71 (49.7) | 79 (53.0) | |
| Solid | 57 (17.2) | 23 (16.1) | 33 (22.1) | |
| Volume (mm3), | ||||
| < 500 | 115 (34.7) | 47 (32.9) | 43 (28.9) | 0.60 |
| 500–1000 | 67 (20.2) | 29 (20.3) | 27 (18.1) | |
| > 1000 | 149 (45.0) | 67 (46.9) | 79 (53.0) | |
| Surgical type, | ||||
| Lobectomy | 256 (77.3) | 103 (72.0) | 104 (69.8) | 0.17 |
| Limited resection | 75 (22.7) | 40 (28.0) | 45 (30.2) | |
IA, invasive adenocarcinoma; PM, preinvasive lesions/minimally invasive adenocarcinomas; LUL, left upper lobe; LLL, left lower lobe; RUL, right upper lobe; RML, right middle lobe; RLL, right lower lobe; pGGN, pure ground-glass nodule; PSN, part-solid nodule. p values calculated using Pearson’s chi-squared test
Demographic and clinical characteristics of patients on IA and PM groups
| Variable | IA ( | PM ( | |
|---|---|---|---|
| Age, | < 0.01* | ||
| ≤ 60 | 168 (43.0) | 145 (62.5) | |
| > 60 | 223 (57.0) | 87 (37.5) | |
| Gender, | < 0.01* | ||
| Male | 155 (39.6) | 63 (27.2) | |
| Female | 236 (60.4) | 169 (72.8) | |
| Smoking, | 0.089 | ||
| Yes | 67 (17.1) | 28 (12.1) | |
| No | 324 (82.9) | 204 (87.9) | |
| Family history, | 0.06 | ||
| Yes | 18 (4.6) | 4 (1.7) | |
| No | 373 (95.4) | 228 (98.3) | |
| Diameter (cm), | < 0.01* | ||
| ≤ 1 | 57 (14.6) | 181 (78.0) | |
| 1.1–2.0 | 211 (54.0) | 34 (14.7) | |
| 2.1–3.0 | 123 (31.5) | 17 (7.3) | |
| Location, | 0.27 | ||
| LUL | 89 (22.8) | 46 (19.8) | |
| LLL | 57 (14.6) | 42 (18.1) | |
| RUL | 139 (35.5) | 90 (38.8) | |
| RML | 27 (6.9) | 20 (8.6) | |
| RLL | 79 (20.2) | 34 (14.7) | |
| Nodule type, | < 0.01* | ||
| pGGNs | 62 (15.9) | 148 (63.8) | |
| PSN | 217 (55.5) | 83 (35.8) | |
| Solid | 112 (28.6) | 1 (0.4) | |
| Volume (mm3), | < 0.01* | ||
| < 500 | 47 (12.0) | 158 (68.1) | |
| 500–1000 | 81 (20.7) | 42 (18.1) | |
| > 1000 | 263 (67.3) | 32 (13.8) | |
| Surgical type, | < 0.01* | ||
| Lobectomy | 377 (96.4) | 86 (37.1) | |
| Limited resection | 14 (3.6) | 146 (62.9) |
IA, invasive adenocarcinoma; PM, preinvasive lesions/minimally invasive adenocarcinomas; LUL, left upper lobe; LLL, left lower lobe; RUL, right upper lobe; RML, right middle lobe; RLL, right lower lobe; pGGN, pure ground-glass nodule; PSN, part-solid nodule. *p < 0.05. p values calculated using Pearson’s chi-squared test
The predictive performance of individual feature on clinical, semantic, and radiomics model on the training dataset
| Individual features | ACC (95% CI) | Sensitivity | Specificity | AUC (95% CI) |
|---|---|---|---|---|
| Clinical | ||||
| Age | 0.57 (0.51–0.62) | 0.28 | 0.76 | 0.63 (0.57–0.69) |
| Gender | 0.60 (0.55–0.66) | 0 | 1 | 0.56 (0.51–0.61) |
| Smoking | 0.60 (0.55–0.66) | 0 | 1 | 0.55 (0.51–0.58) |
| Family history | 0.60 (0.55–0.66) | 0 | 1 | 0.50 (0.49–0.52) |
| Semantic | ||||
| Diameter | 0.71 (0.66–0.76) | 0.60 | 0.78 | 0.81 (0.77–0.86) |
| Location | 0.57 (0.52–0.63) | 0.02 | 0.94 | 0.51 (0.44–0.57) |
| Nodule type | 0.77 (0.72–0.82) | 0.69 | 0.83 | 0.80 (0.76–0.84) |
| Radiomics | ||||
| LocInt_peakLocal | 0.68 (0.64–0.73) | 0.60 | 0.74 | 0.83 (0.80–0.87) |
| Wavelet_HLL_Stats_max | 0.68 (0.63–0.72) | 0.59 | 0.73 | 0.85 (0.82–0.89) |
| GLRLM_LGRE | 0.76 (0.72–0.80) | 0.68 | 0.81 | 0.90 (0.87–0.92) |
| Wavelet_LLL_Stats_cov | 0.73 (0.68–0.77) | 0.63 | 0.79 | 0.87 (0.84–0.90) |
ACC, accuracy; AUC, area under curve; CI, confidence interval
Fig. 3a CT axial view of a pulmonary nodule; b zoomed in view; c feature map overlaid on the zoomed in CT
The detailed AUC values and p values among models on the three datasets
| Dataset | Model | AUC | 95% CI | P1 | P2 | P3 |
|---|---|---|---|---|---|---|
| Training | Clinical | 0.58 | 0.52–0.65 | < 0.01* | ||
| Semantic | 0.85 | 0.81–0.89 | 0.01* | |||
| Volume | 0.84 | 0.80–0.88 | 0.01* | |||
| FS | 0.90 | 0.87–0.93 | 0.82 | 0.58 | < 0.01* | |
| Radiomics | 0.89 | 0.86–0.93 | ||||
| RV | 0.90 | 0.87–0.94 | 0.18 | < 0.01* | ||
| CSRV | 0.91 | 0.88–0.94 | ||||
| FSV | 0.94 | 0.91–0.96 | 0.06 | |||
| FSRV | 0.96 | 0.94–0.98 | ||||
| CSFSRV | 0.96 | 0.94–0.98 | < 0.01* | 0.74 | ||
| Testing | Clinical | 0.55 | 0.45–0.65 | < 0.01* | ||
| Semantic | 0.85 | 0.78–0.92 | 0.28 | |||
| Volume | 0.87 | 0.81–0.93 | 0.54 | |||
| FS | 0.93 | 0.88–0.97 | 0.20 | 0.21 | 0.01* | |
| Radiomics | 0.89 | 0.83–0.94 | ||||
| RV | 0.88 | 0.82–0.93 | 0.10 | < 0.01* | ||
| CSRV | 0.89 | 0.84–0.94 | 0.21 | |||
| FSV | 0.98 | 0.96–1 | 0.50 | |||
| FSRV | 0.97 | 0.94–1 | ||||
| CSFSRV | 0.97 | 0.94–0.99 | < 0.01* | 0.25 | ||
| Validation | Clinical | 0.61 | 0.51–0.72 | < 0.01* | ||
| Semantic | 0.87 | 0.81–0.92 | 0.75 | |||
| Volume | 0.93 | 0.88–0.98 | 0.16 | |||
| FS | 0.92 | 0.87–0.96 | 0.29 | 0.97 | 0.01* | |
| Radiomics | 0.88 | 0.81–0.94 | ||||
| RV | 0.91 | 0.86–0.96 | 0.71 | 0.03* | ||
| CSRV | 0.92 | 0.87–0.96 | ||||
| FSV | 0.97 | 0.94–0.99 | 0.62 | |||
| FSRV | 0.96 | 0.93–0.99 | ||||
| CSFSRV | 0.96 | 0.94–0.99 | 0.01* | 0.30 |
FS, frozen section; RV, radiomics combining with volume; CSRV, radiomics combing with clinical, semantic, and volume; FSV, frozen section combining with volume; FSRV, frozen section combining with radiomics and volume; CSFSRV, radiomics combining with clinical, semantic, volume, and frozen section; AUC, the area under the curve; CI, confidence interval. *p < 0.05; P1 = p values between radiomics and other models; P2 = p values between CSRV and other models; P3 = p values between FSRV and other models. p values calculated using roc test by Delong method
The detailed diagnosis values of models on three datasets
| Dataset | Model | Accuracy (95% CI) | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|---|
| Training | Clinical | 0.63 (0.58–0.68) | 0.31 | 0.84 | 0.56 | 0.65 |
| Semantic | 0.79 (0.74–0.83) | 0.79 | 0.79 | 0.71 | 0.85 | |
| Volume | 0.66 (0.61–0.71) | 0.61 | 0.69 | 0.56 | 0.73 | |
| FS | 0.90 (0.86–0.93) | 0.91 | 0.89 | 0.84 | 0.94 | |
| Radiomics | 0.82 (0.78–0.86) | 0.79 | 0.85 | 0.77 | 0.86 | |
| RV | 0.83 (0.79–0.87) | 0.79 | 0.86 | 0.78 | 0.86 | |
| CSRV | 0.83 (0.78–0.87) | 0.80 | 0.85 | 0.77 | 0.87 | |
| FSV | 0.89 (0.85–0.92) | 0.89 | 0.89 | 0.84 | 0.93 | |
| FSRV | 0.91 (0.88–0.94) | 0.90 | 0.92 | 0.88 | 0.93 | |
| CSFSRV | 0.91 (0.87–0.94) | 0.89 | 0.92 | 0.88 | 0.93 | |
| Testing | Clinical | 0.63 (0.54–0.71) | 0.23 | 0.90 | 0.59 | 0.64 |
| Semantic | 0.78 (0.71–0.85) | 0.68 | 0.85 | 0.75 | 0.80 | |
| Volume | 0.69 (0.60–0.76) | 0.61 | 0.73 | 0.60 | 0.74 | |
| FS | 0.92 (0.87–0.96) | 0.95 | 0.91 | 0.87 | 0.96 | |
| Radiomics | 0.79 (0.71–0.85) | 0.70 | 0.85 | 0.75 | 0.81 | |
| RV | 0.80 (0.73–0.87) | 0.74 | 0.85 | 0.76 | 0.83 | |
| CSRV | 0.83 (0.75–0.88) | 0.77 | 0.86 | 0.79 | 0.85 | |
| FSV | 0.92 (0.87–0.96) | 0.95 | 0.91 | 0.87 | 0.96 | |
| FSRV | 0.94 (0.88–0.97) | 0.86 | 0.99 | 0.98 | 0.91 | |
| CSFSRV | 0.92 (0.87–0.96) | 0.84 | 0.98 | 0.96 | 0.90 | |
| Validation | Clinical | 0.70 (0.62–0.77) | 0.18 | 0.91 | 0.47 | 0.73 |
| Semantic | 0.79 (0.72–0.85) | 0.68 | 0.84 | 0.64 | 0.86 | |
| Volume | 0.79 (0.72–0.85) | 0.66 | 0.85 | 0.64 | 0.86 | |
| FS | 0.90 (0.84–0.94) | 0.95 | 0.88 | 0.76 | 0.98 | |
| Radiomics | 0.85 (0.78–0.90) | 0.61 | 0.94 | 0.82 | 0.85 | |
| RV | 0.87 (0.81–0.92) | 0.68 | 0.95 | 0.86 | 0.88 | |
| CSRV | 0.87 (0.81–0.92) | 0.73 | 0.93 | 0.82 | 0.89 | |
| FSV | 0.91 (0.85–0.95) | 0.95 | 0.89 | 0.78 | 0.98 | |
| FSRV | 0.93 (0.87–0.96) | 0.93 | 0.92 | 0.84 | 0.97 | |
| CSFSRV | 0.91 (0.86–0.95) | 0.89 | 0.92 | 0.83 | 0.95 |
FS, frozen section; RV, radiomics combining with volume; CSRV, radiomics combining with clinical, semantic, and volume; FSV, frozen section combining with volume; FSRV, frozen section combining with radiomics and volume; CSFSRV, radiomics combining with clinical, semantic, volume, and frozen section; PPV, positive predictive values; NPV, negative predictive values; CI, confidence interval
Fig. 4The calibration plots of the single and complex models on the validation dataset
Fig. 5The decision curve of models performed on the validation dataset