| Literature DB >> 35003258 |
Chengdi Wang1, Xiuyuan Xu2, Jun Shao1, Kai Zhou2, Kefu Zhao2, Yanqi He1, Jingwei Li1, Jixiang Guo2, Zhang Yi2, Weimin Li1.
Abstract
OBJECTIVE: The detection of epidermal growth factor receptor (EGFR) mutation and programmed death ligand-1 (PD-L1) expression status is crucial to determine the treatment strategies for patients with non-small-cell lung cancer (NSCLC). Recently, the rapid development of radiomics including but not limited to deep learning techniques has indicated the potential role of medical images in the diagnosis and treatment of diseases.Entities:
Year: 2021 PMID: 35003258 PMCID: PMC8741343 DOI: 10.1155/2021/5499385
Source DB: PubMed Journal: J Oncol ISSN: 1687-8450 Impact factor: 4.375
Figure 1The framework of deep learning model for gene mutation classification and prognosis prediction. The deep learning model is composed of 3D convolutional neural network (CNN) for classifying the EGFR and PD-L1 expression status, and the prognostic model based on clinical metadata and deep learning features was also implemented.
Demographic and clinical characteristics of included NSCLC patients.
| Characteristics | Total ( | Training set ( | Validation set ( | Test set ( |
|
|---|---|---|---|---|---|
| Age at diagnosis (year) | |||||
| Mean ± SD | 57.70 ± 10.50 | 57.69 ± 10.27 | 57.48 ± 9.70 | 57.85 ± 10.27 | 0.95 |
| Sex | |||||
| Male | 620 (49.13) | 438 (49.66) | 66 (52.80) | 116 (45.49) | 0.35 |
| Female | 642 (50.87) | 444 (50.34) | 59 (47.20) | 139 (54.51) | |
| Smoking | |||||
| Current/former | 452 (35.82) | 323 (36.62) | 45 (36.00) | 84 (32.94) | 0.18 |
| Never | 749 (59.35) | 520 (58.96) | 77 (61.60) | 152 (59.61) | |
| Unknown | 61 (4.83) | 39 (4.42) | 3 (2.40) | 19 (7.45) | |
| Gene mutation status | |||||
| EGFR(−) PD-L1(−) | 276 (21.87) | 193 (21.88) | 28 (22.40) | 55 (21.57) | 1.00 |
| EGFR(−) PD-L1(+) | 290 (22.98) | 203 (23.02) | 28 (22.40) | 59 (23.14) | |
| EGFR(+) PD-L1(−) | 502 (39.78) | 350 (39.68) | 50 (40.00) | 102 (40.00) | |
| EGFR(+) PD-L1(+) | 194 (15.37) | 136 (15.42) | 19 (15.20) | 39 (15.29) | |
| EGFR-TKI-targeted therapy | |||||
| Yes | 391 (30.98) | 265 (30.05) | 45 (36.00) | 81 (31.76) | 0.39 |
| No | 871 (69.02) | 617 (69.95) | 80 (64.00) | 174 (68.24) | |
| ICI therapy | |||||
| Yes | 15 (1.19) | 12 (1.36) | 0 (0) | 3 (1.18) | 0.42 |
| No | 1247 (98.81) | 870 (98.64) | 125 (100.00) | 252 (98.82) | |
| Histopathology | |||||
| LUAD | 1185 (93.90) | 824 (93.42) | 119 (95.20) | 242 (94.90) | 0.82 |
| LUSC | 53 (4.20) | 41 (4.65) | 4 (3.20) | 8 (3.14) | |
| Others | 24 (1.90) | 17 (1.93) | 2 (1.60) | 5 (1.96) | |
| Tumor stage | |||||
| I | 529 (41.91) | 378 (42.86) | 43 (34.40) | 108 (42.35) | 0.22 |
| II | 96 (7.61) | 60 (6.80) | 9 (7.20) | 27 (10.59) | |
| III | 236 (18.70) | 160 (18.14) | 30 (24.00) | 46 (18.04) | |
| IV | 367 (29.08) | 262 (29.71) | 37 (29.60) | 68 (26.67) | |
| Unknown | 34 (2.69) | 22 (2.49) | 6 (4.80) | 6 (2.35) | |
| Follow-up | |||||
| Median follow-up time (month, 95% CI) | 31 (30–31) | 30 (30–31) | 31 (29–35) | 32 (30,33) | 0.09 |
| Overall survival | |||||
| Death | 412 (32.65) | 283 (32.08) | 51 (40.80) | 78 (30.59) | 0.20 |
| Median OS (month, 95% CI) | 44 (42–49) | 43 (41–49) | 41 (33-NA) | 46 (40-NA) | |
LUAD: lung adenocarcinoma; LUSC: lung squamous cell carcinoma; SD: standard deviation; OS: overall survival; CI: confidence interval.
Predictive performance in predicting EGFR mutation and PD-L1 expression status.
| EGFR | PD-L1 | ACC (95%CI) | AUC (95%CI) | Sensitivity (95%CI) | Specificity (95%CI) | |
|---|---|---|---|---|---|---|
| Training set | − | − | 0.92 (0.91–0.93) | 0.97 (0.96–0.97) | 0.87 (0.83–0.9) | 0.93 (0.92–0.95) |
| − | + | 0.91 (0.89–0.93) | 0.96 (0.95–0.97) | 0.75 (0.7–0.78) | 0.97 (0.95–0.98) | |
| + | − | 0.87 (0.85–0.89) | 0.96 (0.95–0.97) | 0.96 (0.94–0.98) | 0.81 (0.79–0.84) | |
| + | + | 0.91 (0.90–0.92) | 0.95 (0.94–0.96) | 0.39 (0.33–0.46) | 1 (0.99–1) | |
| Average | 0.90 (0.86–0.93) | 0.96 (0.94–0.98) | 0.74 (0.31–1) | 0.93 (0.79–1) | ||
|
| ||||||
| Validation set | − | − | 0.78 (0.72–0.72) | 0.82 (0.75–0.88) | 0.57 (0.44–0.7) | 0.85 (0.79–0.9) |
| − | + | 0.76 (0.71–0.71) | 0.78 (0.71–0.84) | 0.45 (0.33–0.57) | 0.86 (0.81–0.91) | |
| + | − | 0.74 (0.68–0.68) | 0.85 (0.79–0.89) | 0.78 (0.69–0.87) | 0.71 (0.64–0.78) | |
| + | + | 0.84 (0.79–0.79) | 0.75 (0.66–0.82) | 0.13 (0.03–0.26) | 0.95 (0.92–0.98) | |
| Average | 0.78 (0.70–0.86) | 0.80 (0.72–0.88) | 0.48 (0.01–0.95) | 0.84 (0.66–1) | ||
|
| ||||||
| Test set | − | − | 0.74 (0.7–0.7) | 0.76 (0.72–0.81) | 0.45 (0.36–0.54) | 0.82 (0.79–0.86) |
| − | + | 0.68 (0.64–0.64) | 0.66 (0.61–0.71) | 0.28 (0.21–0.36) | 0.83 (0.79–0.87) | |
| + | − | 0.73 (0.69–0.69) | 0.79 (0.75–0.83) | 0.82 (0.77–0.88) | 0.68 (0.63–0.73) | |
| + | + | 0.83 (0.79–0.79) | 0.69 (0.63–0.75) | 0.15 (0.07–0.23) | 0.95 (0.93–0.97) | |
| Average | 0.75 (0.65–0.85) | 0.73 (0.63–0.83) | 0.43 (0–0.92) | 0.82 (0.62–1) | ||
Abbreviation: ACC: accuracy; AUC: area under the ROC curve.
Figure 2Performance of the deep learning model for the prediction of EGFR and PD-L1 expression status in training (a), validation (b), and test (c) sets by receiver operating characteristic (ROC) curves.
Figure 3Confusion matrix of prediction model in training (a), validation (b), and test cohorts (c), respectively. The micro-average accuracies (ACCs) were 0.90 (95% CI: 0.86–0.93), 0.78 (95% CI: 0.70–0.86), and 0.75 (95% CI: 0.65–0.85) in the training, validation, and test cohorts, respectively.
Figure 4Attention map of suspicious tumor area using CAM. Suspicious tumor areas were indicated according to the attention map of the deep learning model.
Figure 5Kaplan–Meier curves in the high-risk and low-risk groups stratified by confusion prognostic prediction model in training (a), validation (b), and test sets (c). When the patients were stratified into high-risk and low-risk groups, Kaplan–Meier curves of progression to poor prognosis showed a distinct difference in survival probability in this cohort.
Recent representative studies using deep learning to predict gene status in lung cancer patients on CT images.
| Author | Year | Design | Dataset | Training cohort | Validation cohort | Test cohort | Model | Outcome | Performance reported |
|---|---|---|---|---|---|---|---|---|---|
| Baihua Zhang | 2021 | Retrospective multicenter on CT | 914 LUAD | 638 | NA | 71 internal; 205 external | SE-CNN + radiomics mapping | EGFR mutation | AUC 0.910 and 0.841 in internal and external test cohorts, respectively |
|
| |||||||||
| Wei Mu | 2020 | Retrospective multicenter on PET/CT | 681 NSCLCs | 429 | 187 | 65 external | CNN | EGFR mutation treatment response | AUC 0.86, 0.83, and 0.81 in the training, internal validation, and external test cohorts, respectively |
|
| |||||||||
| Shuo Wang | 2019 | Retrospective multicenter on CT | 844 LUAD | 603 | Five-fold cross validation; 241 independent | NA | CNN | EGFR mutation | AUC 0.85 in the primary cohort; AUC 0.81 in the independent validation cohort |
|
| |||||||||
| Wei Zhao | 2019 | Retrospective multicenter on CT | 616 LUAD | 348 | 116 | 115 internal; 37 public | CNN 3D DenseNets | EGFR mutation | AUC 0.758 and 0.750 in the internal test set and public test set |
|
| |||||||||
| Junfeng Xiong | 2018 | Retrospective single-center on CT | 503 LUAD | 345 | 158 | NA | CNN | EGFR mutation | An AUC (CNN) of 0.776 and an AUC (a fusion model of CNNs and clinical features) of 0.838 in the validation set |
|
| |||||||||
| Panwen Tian | 2021 | Retrospective multicenter on CT | 939 NSCLCs | 750 | 93 | 96 | KNN | PD-L1 expression treatment response | AUC 0.78, 0.71, and 0.76 in the training, validation, and test cohorts |
|
| |||||||||
| Ying Zhu | 2020 | Retrospective single-center on CT | 127 LUAD | NA | Five-fold cross validation | NA | CNN 3D DenseNets | PD-L1 expression | AUC more than 0.750 |
|
| |||||||||
| Zhengbo Song | 2020 | Retrospective multicenter on CT | 1028 NSCLCs | 651 | 286 | 91 | CNN 3D ResNet10 | ALK fusion status | AUC(CNN) 0.8046 and 0.7754 in the primary and validation cohorts, AUC (trained by both CT images and clinicopathological information) 0.8540 and 0.8481 in the primary and validation cohorts |
LUAD: lung adenocarcinoma; NSCLC: non-small-cell lung cancer; CNN: convolutional neural network; KNN: k-nearest neighbor; NA: not applicable.