| Literature DB >> 32296645 |
Xianwu Xia1, Jing Gong2,3, Wen Hao2,3, Ting Yang1, Yeqing Lin1, Shengping Wang2,3, Weijun Peng2,3.
Abstract
For stage-I lung adenocarcinoma, the 5-years disease-free survival (DFS) rates of non-invasive adenocarcinoma (non-IA) is different with invasive adenocarcinoma (IA). This study aims to develop CT image based artificial intelligence (AI) schemes to classify between non-IA and IA nodules, and incorporate deep learning (DL) and radiomics features to improve the classification performance. We collect 373 surgical pathological confirmed ground-glass nodules (GGNs) from 323 patients in two centers. It involves 205 non-IA (including 107 adenocarcinoma in situ and 98 minimally invasive adenocarcinoma), and 168 IA. We first propose a recurrent residual convolutional neural network based on U-Net to segment the GGNs. Then, we build two schemes to classify between non-IA and IA namely, DL scheme and radiomics scheme, respectively. Third, to improve the classification performance, we fuse the prediction scores of two schemes by applying an information fusion method. Finally, we conduct an observer study to compare our scheme performance with two radiologists by testing on an independent dataset. Comparing with DL scheme and radiomics scheme (the area under a receiver operating characteristic curve (AUC): 0.83 ± 0.05, 0.87 ± 0.04), our new fusion scheme (AUC: 0.90 ± 0.03) significant improves the risk classification performance (p < 0.05). In a comparison with two radiologists, our new model yields higher accuracy of 80.3%. The kappa value for inter-radiologist agreement is 0.6. It demonstrates that applying AI method is an effective way to improve the invasiveness risk prediction performance of GGNs. In future, fusion of DL and radiomics features may have a potential to handle the classification task with limited dataset in medical imaging.Entities:
Keywords: CT scan; deep learning; ground-glass nodule; invasiveness risk; lung adenocarcinoma; radiomics
Year: 2020 PMID: 32296645 PMCID: PMC7136522 DOI: 10.3389/fonc.2020.00418
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Demographic characteristics of 323 patients with 373 GGNs in two datasets.
| 119 | 127 | 86 | 41 | ||||
| Sex | Male | 40 | 42 | 0.15 | 19 | 16 | 0.15 |
| Female | 73 | 74 | 43 | 16 | |||
| Age (mean ± SD, year) | 56.5 ± 11.8 | 59.7 ± 10.3 | 0.03 | 51.8 ± 12.1 | 58.1 ± 8.6 | 0.03 | |
| Location | RUL | 48 (19.5%) | 52 (21.1%) | 0.64 | 28 (22.0%) | 18 (14.2%) | 0.13 |
| RML | 6 (2.4%) | 9 (3.7%) | 6 (4.7%) | 3 (2.4%) | |||
| RLL | 17 (6.9%) | 19 (7.7%) | 15 (11.8%) | 7 (5.5%) | |||
| LUL | 34 (13.8%) | 32 (13.0%) | 25 (19.7%) | 7 (5.5%) | |||
| LLL | 14 (5.7%) | 15 (6.1%) | 12 (9.4%) | 6 (4.7%) | |||
| Diameter (mm) | (3, 10) | 72 (29.3%) | 42 (17.1%) | 0.004 | 67 (52.8%) | 8 (6.3%) | <0.0001 |
| (10, 20) | 39 (15.9%) | 68 (27.6%) | 19 (15.0%) | 22 (17.3%) | |||
| (20, 30) | 8 (3.3%) | 17 (6.9%) | 0 (0%) | 11 (8.7%) | |||
| Type | pGGN | 88 (35.8%) | 65 (26.4%) | 0.0002 | 78 (61.4%) | 18 (14.2%) | <0.0001 |
| sGGN | 31 (12.6%) | 62 (25.2%) | 8 (6.3%) | 23 (18.1%) | |||
IA, invasive adenocarcinoma; pGGO, pure ground glass nodule; sGGN, part-solid ground glass nodule.
Figure 1Flowchart of the proposed scheme.
Figure 2Segmentation results of a GGN. From top to bottom: original CT images, heat map of CNN features, and segment masks of the GGN.
Figure 3The architectures of Recurrent Residual Convolutional Neural Network (RRCNN) based on U-Net model and the transfer learning method based risk prediction model.
Figure 4Boxplots of the mean CT value of IA and non-IA GGNs in our dataset. (A) Illustrates boxplot of the training and validation dataset. (B) Shows boxplot of the testing dataset.
Figure 5Heat map of the 20 imaging features selected in the radiomics based model.
AUC values and the corresponding 95% CI generated by different methods with 127 GGNs in testing dataset.
| Deep learning based scheme | 0.83 ± 0.05 | [0.75, 0.90] |
| Radiomics feature based scheme | 0.87 ± 0.04 | [0.80, 0.93] |
| Minimum | 0.83 ± 0.05 | [0.75, 0.90] |
| Maximum | 0.90 ± 0.03 | [0.84, 0.95] |
| 0.1 × Radiomics | 0.85 ± 0.04 | [0.77, 0.91] |
| 0.2 × Radiomics+0.8 × DL | 0.86 ± 0.04 | [0.78, 0.92] |
| 0.3 × Radiomics+0.7 × DL | 0.87 ± 0.04 | [0.80, 0.93] |
| 0.4 × Radiomics+0.6 × DL | 0.88 ± 0.04 | [0.81, 0.94] |
| 0.5 × Radiomics+0.5 × DL | 0.89 ± 0.04 | [0.83, 0.95] |
| 0.6 × Radiomics+0.4 × DL | 0.90 ± 0.04 | [0.83, 0.95] |
| 0.7 × Radiomics+0.3 × DL | 0.90 ± 0.04 | [0.83, 0.90] |
| 0.8 × Radiomics+0.2 × DL | 0.90 ± 0.04 | [0.83, 0.88] |
| 0.9 × Radiomics+0.1 × DL | 0.89 ± 0.03 | [0.83, 0.94] |
Radiomics: prediction scores generated by radiomics feature based scheme.
DL: prediction scores generated by deep learning based scheme.
Figure 6Performance comparisons of three models and radiologists. (A) Shows scatter plots of prediction score distributions of non-IA and IA nodules. Left to right: prediction scores generated by DL and radiomics models for non-IA and IA nodules in testing dataset, respectively. (B) Shows ROC curves of the three models and the prediction scores of two radiologists.
The comparison of classification performance tested on 127 GGNs in independent testing dataset, in terms of accuracy (ACC), F1 score, weighted average F1 score, and Matthews correlation coefficient (MCC), respectively.
| Senior radiologist | 67.7 | 64.3 | 68.5 | 44.8 |
| Junior radiologist | 70.9 | 63.4 | 71.8 | 42.6 |
| Our fusion model | 80.3 | 75.2 | 80.9 | 62.8 |
Comparison of dataset, methods, and AUC values reported in different studies.
| Wang et al. ( | 1,545 nodules | Deep learning | 0.892 |
| Zhao et al. ( | 651 nodules | Deep learning | 0.880 |
| Gong et al. ( | 828 nodules | Deep learning | 0.92 ± 0.03 |
| Our study | 373 nodules | Fusion of deep learning and radiomics | 0.90 ± 0.03 |