| Literature DB >> 34367993 |
Guotao Yin1, Ziyang Wang1, Yingchao Song2, Xiaofeng Li1, Yiwen Chen1, Lei Zhu1, Qian Su1, Dong Dai1, Wengui Xu1.
Abstract
OBJECTIVE: The purpose of this study was to develop a deep learning-based system to automatically predict epidermal growth factor receptor (EGFR) mutant lung adenocarcinoma in 18F-fluorodeoxyglucose (FDG) positron emission tomography/computed tomography (PET/CT).Entities:
Keywords: adenocarcinoma of lung; deep learning; epidermal growth factor receptor; fluorodeoxyglucose F18; positron emission tomography computed tomography
Year: 2021 PMID: 34367993 PMCID: PMC8340023 DOI: 10.3389/fonc.2021.709137
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1The process of the data set establishment. Long interval: exceeding 2 weeks. Corrupted image data: the CT or PET data that cannot open.
Figure 2The architecture of the SE-ResNet. (A) The structure of the SE-Residual module. The structure in the blue dashed line is F, the structure in the orange dashed line is the squeezing step, the structure in the green dashed line is the excitation step. (B) The composition of the SE-ResNet. The SE-ResNet consists of 4 basic modules. Each basic module is composed of 3 SE-Residual modules. A fully connected layer was attached to the end of the model.
Figure 3The pipeline of this study. The CT and PET images were first resampled, and the ROIs centered the primary lung tumor were manually selected and normalized. Then SECT and SEPET served as base classifiers and were trained on training data set through fivefold cross-validation to get the EGFR mutation probabilities of training data set. Simultaneously, these models were tested on testing data set for five times. The predictive probabilities of SECT and SEPET for training data set were combined and used for the training of SVM, which served as meta-classifier. And the five times predictive probabilities of SECT and SEPET for testing data set was averaged respectively and combined for the testing of SVM. Finally, the performance of multi-modal stacking model and single-modal deep learning models was compared through ROC curve analysis.
Clinical characteristics of patients.
| Training data set | Testing data set | |||||
|---|---|---|---|---|---|---|
| Mut (n=102) | WT (n=96) | Mut (n=51) | WT (n=52) | |||
| Sex | 0.0091 | 0.0045 | ||||
| Male | 47 (46.08) | 62 (64.58) | 19 (37.25) | 34 (65.38) | ||
| Female | 55 (53.92) | 34 (35.42) | 32 (62.75) | 18 (34.62) | ||
| Age (median (range)) | 63 (37-75) | 63.5 (28-74) | 0.30 | 60 (43-86) | 60 (47-77) | 0.89 |
| Tumor Location | 0.23 | 0.62 | ||||
| Left lobes | 71 (69.61) | 59 (61.46) | 21 (41.18) | 24 (46.15) | ||
| Right lobes | 31 (30.39) | 37 (38.54) | 30 (58.82) | 28 (53.85) | ||
| Smoking History | 0.0049 | 0.044 | ||||
| Yes | 30 (29.41) | 47 (48.96) | 12 (23.53) | 22 (42.31) | ||
| No | 72 (70.59) | 49 (51.04) | 39 (76.47) | 30 (57.69) | ||
| Tumor size | 2.76 ± 1.00 | 2.97 ± 1.30 | 0.21 | 2.59 ± 0.63 | 2.88 ± 1.05 | 0.10 |
| Stage | 0.47 | 0.48 | ||||
| I | 58 (56.86) | 45 (46.88) | 33 (64.70) | 27 (51.93) | ||
| II | 11 (10.78) | 14 (14.58) | 7 (13.73) | 8 (15.38) | ||
| III | 9 (8.82) | 13 (13.54) | 4 (7.84) | 4 (7.69) | ||
| IV | 24 (23.54) | 24 (25.00) | 7 (13.73) | 13 (25.00) | ||
Categorical variables are presented as n (%).
Predictive performance of different models in the training data set.
| AUC (95% CI) | Sensitivity (%) | Specificity (%) | Accuracy (%) | |
|---|---|---|---|---|
| StackPET-CT |
| 71.75 |
|
|
| SECT | 0.74 (0.67-0.80) | 82.35 | 53.12 | 67.17 |
| SEPET | 0.75 (0.69-0.81) |
| 56.25 | 72.22 |
| Clinical model | 0.64 (0.57-0.71) | 65.69 | 62.50 | 60.10 |
The bold values represented the highest one of the evaluation indices.
Predictive performance of different models in the testing data set.
| AUC (95% CI) | Sensitivity (%) | Specificity (%) | Accuracy (%) | |
|---|---|---|---|---|
| StackPET-CT |
| 80.39 |
|
|
| SECT | 0.72 (0.62-0.80) | 68.63 | 69.23 | 68.93 |
| SEPET | 0.74 (0.65-0.82) | 76.47 | 69.23 | 67.96 |
| Clinical model | 0.64 (0.54-0.73) |
| 40.38 | 59.22 |
The bold values represented the highest one of the evaluation indices.
Figure 4Predictive performance of SECT, SEPET, StackPET-CT, and clinical model. (A) The performance of different models in the training data set. (B) The performance of the models in the testing data set. StackPET-CT had the highest AUC in the training and testing data set.
Figure 5Suspicious areas generated by SECT and SEPET. The first column is the original PET or CT image; the second column is the attention map for classifying EGFR mutation status; the third column is the image fusing original image and the attention map. (A) CT images predicted as EGFR mutation by the SECT. (B) CT images predicted as wild-type EGFR by the SECT. (C) PET images predicted as EGFR mutation by the SEPET. (D) PET images predicted as wild-type EGFR by the SEPET.