| Literature DB >> 31074592 |
Wei Zhao1,2, Jiancheng Yang3,4,5, Bingbing Ni3,4, Dexi Bi6, Yingli Sun1, Mengdi Xu5, Xiaoxia Zhu1, Cheng Li1, Liang Jin1, Pan Gao1, Peijun Wang7, Yanqing Hua1, Ming Li1,2,8.
Abstract
To develop a deep learning system based on 3D convolutional neural networks (CNNs), and to automatically predict EGFR-mutant pulmonary adenocarcinoma in CT images. A dataset of 579 nodules with EGFR mutation status labels of mutant (Mut) or wild-type (WT) was retrospectively analyzed. A deep learning system, namely 3D DenseNets, was developed to process 3D patches of nodules from CT data, and learn strong representations with supervised end-to-end training. The 3D DenseNets were trained with a training subset of 348 nodules and tuned with a development subset of 116 nodules. A strong data augmentation technique, mixup, was used for better generalization. We evaluated our model on a holdout subset of 115 nodules. An independent public dataset of 37 nodules from the cancer imaging archive (TCIA) was also used to test the generalization of our method. Conventional radiomics analysis was also performed for comparison. Our method achieved promising performance on predicting EGFR mutation status, with AUCs of 75.8% and 75.0% for our holdout test set and public test set, respectively. Moreover, strong relations were found between deep learning feature and conventional radiomics, while deep learning worked through an enhanced radiomics manner, that is, deep learned radiomics (DLR), in terms of robustness, compactness and expressiveness. The proposed deep learning system predicts EGFR-mutant of lung adenocarcinomas in CT images noninvasively and automatically, indicating its potential to help clinical decision-making by identifying eligible patients of pulmonary adenocarcinoma for EGFR-targeted therapy.Entities:
Keywords: zzm321990EGFRzzm321990; convolutional neural networks; deep learning; mixup training technique; radiomics
Mesh:
Substances:
Year: 2019 PMID: 31074592 PMCID: PMC6601587 DOI: 10.1002/cam4.2233
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Figure 1Overall pipeline for this study. A local CT dataset (HdH Dataset) and a public dataset selected from TCIA database (TCIA Dataset) of lung adenocarcinoma patients with EGFR mutation testing were used. Nodules were manually localized, segmented, and labelled as EGFR mutant (Mut)/wild‐type (WT). For deep learning, 3D DenseNets were trained using the training subset. A strong data augmentation technique, mixup, was used for better regularization. Expressive representations, that is, deep learned radiomics (DLR), for the nodules were end‐to‐end learned during the training procedure. Meanwhile, conventional radiomics analysis following the common practice was carried out for performance comparison, and association study between the 3D deep leaning and conventional radiomics was performed by calculating the pairwise correlation coefficients
Characteristics of patients and lesions in HdH dataset
| Characteristics | Number | Percentage |
|---|---|---|
| Gender | ||
| Male | 245 | 42.3 |
| Female | 334 | 57.6 |
| Mean age (range) (y) | ||
| Male | 61.8 ± 11.6 (29‐85) | ‐ |
| Female | 58.4 ± 11.9 (22‐85) | ‐ |
| Total | 59.8 ± 11.9 (22‐85) | ‐ |
| Mean size (range) (cm) | 1.8 (0.3‐8.6) | ‐ |
| Location | ||
| Right lobe | 342 | 59.1 |
| Left lobe | 237 | 40.9 |
| Pathology | ||
| Adenocarcinoma in situ | 31 | 5.4 |
| Minimally invasive adenocarcinoma | 157 | 27.1 |
| Invasive adenocarcinoma | 391 | 67.5 |
| TMN classification (eighth edition) | ||
| 0 | 31 | 5.4 |
| I A‐B | 356 | 61.5 |
| II A‐B | 7 | 1.2 |
| III A‐C | 10 | 1.7 |
| IV A‐B | 175 | 30.2 |
|
| 308 | 53.2 |
3D DenseNet architectures for EGFR mutation classification
| Layer | Tensor size | Building blocks |
|---|---|---|
| Input | 48 × 48 × 48 × 1 | |
| Convolution | 48 × 48 × 48 × 32 | 3 × 3 × 3 conv |
| Pooling | 24 × 24 × 24 × 32 | 2 × 2 × 2 average pool |
| Dense Block (1) | 24 × 24 × 24 × 80 |
|
| Compression and Pooling (1) | 12 × 12 × 12 × 40 |
|
| Dense Block (2) | 12 × 12 × 12 × 136 |
|
| Compression and Pooling (2) | 6 × 6 × 6 × 68 |
|
| Dense Block (3) | 6 × 6 × 6 × 132 |
|
| Compression and Pooling (3) | 3 × 3 × 3 × 66 |
|
| Dense Block (4) | 3 × 3 × 3 × 114 |
|
| Global Pooling ( | 114 | 3 × 3 × 3 average pool |
| Output | 1 | sigmoid |
Presentation of our method and several previous studies in terms of methods, datasets, and resulting classification AUCs
| Method | Training #Patients | Test #Patients | # | AUC (%) |
|---|---|---|---|---|
| Radiomics | 353 | 352 | 183 (24.0%) | 69 |
| + clinical information | 75 | |||
| Radiomics | 298 | NA | 137 (46.0%) | 64.7 |
| + clinical information | 70.9 | |||
| Radiomics | 47 | NA | 19 (40.4%) | 67.0 |
| Radiomics (This study) | 464 (HdH train‐dev Dataset) | 115 (HdH test Dataset) | 62 (53.9%) | 64.5 |
| 3D DenseNets w/ | 348 (HdH training Dataset) | 115 (HdH test Dataset) | 62 (53.9%) | 75.8 |
| Radiomics (this study) | 464 (HdH train‐dev Dataset) | 37 (TCIA Dataset) | 9 (24.3%) | 68.7 |
| 3D DenseNets w/ | 348 (HdH training Dataset) | 37 (TCIA Dataset) | 9 (24.3%) | 75.0 |
Shown as the number of cases (percentage).
Estimated using the proportion of EGFR Mut on the entire data set, rather than the test set.
The evaluation results are based on multivariate statistical analysis, rather than the practice of training – validation (development) – test in machine learning. Since the prior studies listed in the above table used nonshared datasets independently, the results are for reference only.
Figure 2Visualization of conventional radiomics, deep learned radiomics (DLR) and their associations. A, Cluster map of conventional radiomics, with 115 nodules on the x‐axis and 401 radiomic features on the y‐axis. Each feature was normalized into zero mean and unit standard variance. The nodules of a same cluster (adjacent columns) shared similar radiomic features in Euclidean space. The semantic label EGFR Mut/WT of each nodule was shown on the black‐grey bar below the x‐axis. B, Correlation coefficient matrix for conventional radiomics and DLR. Note the radiomic features (y‐axis in A) and the DLR features (x‐axis in D) were both aligned with the correlation coefficient matrix. C, The classification ROC curves of our radiomics‐based and DLR‐based methods. The brighter (red or blue) blocks show the higher correlation. The black denotes no correlation. D, Cluster map of DRL with 115 nodules on the y‐axis and 114 radiomic features on the x‐axis. Each feature was normalized into zero mean and unit standard variance. The nodules of a same cluster (adjacent rows) shared similar DLR in Euclidean space. The semantic label EGFR Mut/WT of each nodule was shown on the black‐grey bar on the left of the y‐axis
Dataset summary and prediction performance of deep learning systems on HdH Dataset (training, development and test) and TCIA Dataset
| Dataset | #Patients | EGFR+
| AUC (w/ | AUC (w/ | AUC (w/o | AUC (w/o |
|---|---|---|---|---|---|---|
| HdH training dataset | 348 | 185 (53.2%) | 76.7 | 76.0 | 71.0 | 70.1 |
| HdH development dataset | 116 | 61 (52.6%) | 74.1 | 74.6 | 69.2 | 70.4 |
| HdH test dataset | 115 | 62 (53.9%) | 75.8 | 76.8 | 67.9 | 67.9 |
| TCIA dataset | 37 | 9 (24.3%) | 75.0 | 68.3 | 70.6 | 71.4 |
Shown as the number of cases (percentage).
Figure 3The learning curves of the best models with mixup training and those without, in terms of binary cross‐entropy loss. The losses on the HdH training, development (val) set and test Dataset were shown on the figures. “epochs” on the x‐axis means the training consumes once the entire training set
Figure 4t‐SNE visualization of the deep learned radiomics in a 2D space on the HdH test Dataset. A, The t‐SNE visualization scatter plot colored by the EGFR labels. Clusters can be found labeled by the ground truth. B, The t‐SNE visualization scatter plot colored by the EGFR probability score predicted by the 3D DenseNets. The prediction scores are consistent with ground truth