| Literature DB >> 31466261 |
Nasrullah Nasrullah1,2,3, Jun Sang4,5, Mohammad S Alam6, Muhammad Mateen1,2, Bin Cai1,2, Haibo Hu1,2.
Abstract
Lung cancer is one of the major causes of cancer-related deaths due to its aggressive nature and delayed detections at advanced stages. Early detection of lung cancer is very important for the survival of an individual, and is a significant challenging problem. Generally, chest radiographs (X-ray) and computed tomography (CT) scans are used initially for the diagnosis of the malignant nodules; however, the possible existence of benign nodules leads to erroneous decisions. At early stages, the benign and the malignant nodules show very close resemblance to each other. In this paper, a novel deep learning-based model with multiple strategies is proposed for the precise diagnosis of the malignant nodules. Due to the recent achievements of deep convolutional neural networks (CNN) in image analysis, we have used two deep three-dimensional (3D) customized mixed link network (CMixNet) architectures for lung nodule detection and classification, respectively. Nodule detections were performed through faster R-CNN on efficiently-learned features from CMixNet and U-Net like encoder-decoder architecture. Classification of the nodules was performed through a gradient boosting machine (GBM) on the learned features from the designed 3D CMixNet structure. To reduce false positives and misdiagnosis results due to different types of errors, the final decision was performed in connection with physiological symptoms and clinical biomarkers. With the advent of the internet of things (IoT) and electro-medical technology, wireless body area networks (WBANs) provide continuous monitoring of patients, which helps in diagnosis of chronic diseases-especially metastatic cancers. The deep learning model for nodules' detection and classification, combined with clinical factors, helps in the reduction of misdiagnosis and false positive (FP) results in early-stage lung cancer diagnosis. The proposed system was evaluated on LIDC-IDRI datasets in the form of sensitivity (94%) and specificity (91%), and better results were obatined compared to the existing methods.Entities:
Keywords: clinical biomarkers; deep convolutional neural networks; internet of things; pulmonary nodules; wireless body area networks
Year: 2019 PMID: 31466261 PMCID: PMC6749467 DOI: 10.3390/s19173722
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Categories of lung nodules in a CT scan; benign, primary malignant, and metastatic malignant (from left to right).
Relative contribution of risk factors with clinical biomarkers [3].
| Factor | Sensitivity (%age) | Specificity (%age) | NPV (%age) |
|---|---|---|---|
| Proteins LG3BP/C163A | 97 | 13 | 95 |
| Smoking history | 97 | <5 | - |
| Age | 97 | 8 | 92 |
| Nodule size | 100 | 13 | 100 |
| Nodule location | 97 | <5 | - |
| Nodule speculation | 97 | <5 | - |
| Physiological symptoms | 97 | <5 | - |
Figure 2Overview structure of the proposed lung nodule detection and classification system.
Figure 3Block diagram of the proposed lung nodule detection and classification system.
Symptoms appear at different stages in lung cancer.
| Physiological Symptoms | Stage 1 (%age) | Stage 2 (%age) | Stage 3 (%age) | Stage 4 (%age) |
|---|---|---|---|---|
| Body temperature | 56–66 | 32–78 | 93–97 | 95–99 |
| Breathlessness | 5–54 | 41–86 | 89–94 | 95–100 |
| Pain | 26–43 | 30–63 | 34–76 | 43–82 |
| Anxiety | 35–47 | 48–65 | 66–76 | 78–95 |
| Irregular heart rate | 12–63 | 19–74 | 74–96 | 97–98 |
| Blood pressure | 27–42 | 63–86 | 87–91 | 92–94 |
| Fatigue | 17–39 | 28–48 | 67–78 | 79–89 |
| Insomnia | 37–47 | 48–62 | 63–87 | 88–92 |
| Body weight loss | 34–64 | 44–60 | 89–93 | 93–98 |
| Depression | 19–31 | 22–46 | 37–78 | 47–83 |
| Constipation | 10–20 | 18–25 | 26–43 | 44–60 |
| Anorexia | - | - | 36–68 | 67–78 |
Figure 4Modern CNN connection architecture.
Figure 5Inner link module (left), outer link module (middle), and mixed inner–outer module (right).
Output feature maps at different stages of our CMixNet architecture.
| Stage | Output | Weights |
|---|---|---|
| Initial input size | 96 × 96 × 96, 24 | 3 × 3 × 3, 24 |
| First CMixNet block | 48 × 48 × 48, 48 |
|
| Second CMixNet block | 24 × 24 × 24, 72 |
|
| Third CMixNet block | 12 × 12 × 12, 96 |
|
| Fourth CMixNet block | 6 × 6 × 6, 120 |
|
| Upsampling/Deconv. 1 | 12 × 12 × 12, 216 | 2 × 2 × 2, 216 |
| Fifth CMixNet block | 12 × 12 × 12, 152 |
|
| Upsampling/Deconv. 2 | 24 × 24 × 24, 224 | 2 × 2 × 2, 152 |
| Sixth CMixNet block | 24 × 24 × 24, 248 |
|
| Output | 24 × 24 × 24, 15 | Dropout, p = 0.5 |
Figure 6Feature extraction using CMixNet with U-Net-like encoder–decoder architecture for nodule detection. The numbers in the boxes represent the sizes of the feature maps in the sequence #slices, #rows, #cols, and #maps. The numbers outside the boxes are in sequence #filters, #slices, #rows, and #cols.
Figure 7Nodule detection through Faster R-CNN.
Figure 8Nodule classification using CMixNet and GBM. The numbers in the boxes represent the sizes of the feature maps in the sequence #slices, #rows, #cols, and #maps. The numbers outside the boxes are in sequence #filters, #slices, #rows, and #cols.
Important clinical biomarkers with sensitivity and specificity for the presence of cancer.
| Biomarkers | Sensitivity | Specificity |
|---|---|---|
| CYFRA 21-1 (Cytokeratins) [ | 43 | 89 |
| CEACAM (Carcinoembryonic antigen) [ | 69 | 68 |
| ProGRP (Pro-gastrin-releasing peptide) [ | 84 | 95 |
| Carbohydrate Antigen 125 (CA125) [ | 80 | 40 |
| Carbohydrate Antigen-19.9 (CA-19.9) and Ferritin | 78 | 20 |
| Neuron-Specific Enolase (NSE) [ | 38 | 40 |
| Squamous Cell Carcinoma Antigen (SCC) | 80 | 67 |
| Proteins LG3BP/C163A [ | 97 | 13 |
Figure 9Comparison of different FROC curves obtained from deep learning models.
Accuracy comparison of nodule classification on public dataset.
| Models | Accuracy (%) | Year |
|---|---|---|
| Multi-scale CNN [ | 86.84 | 2015 |
| Nodule level 2D CNN [ | 87.30 | 2016 |
| Slice level 2D CNN [ | 86.70 | 2016 |
| Multi-crop CNN [ | 87.14 | 2017 |
| Vanilla 3D CNN [ | 87.40 | 2016 |
| Deep 3D DPN [ | 88.74 | 2017 |
| Deep 3D DPN + GBM [ | 90.44 | 2017 |
| 3D MixNet [ | 88.83 | 2019 |
| 3D MixNet + GBM [ | 90.57 | 2019 |
| 3D CMixNet + GBM | 91.13 | 2019 |
| 3D CMixNet + GBM + Biomarkers | 94.17 | 2019 |
Performance comparison of TumorNet [34], DFCNet [14], and CMixNet.
| Dataset | Method | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|
| LIDC-IDRI test data | TumorNet | 81.70 | 85.17 | 87.41 |
| DFCNet | 80.91 | 83.22 | 86.02 | |
| CMixNet | 93.97 | 89.83 | 88.79 | |
| Hospital data | TumorNet | 81.49 | 89.94 | 81.11 |
| DFCNet | 83.67 | 96.17 | 96.33 | |
| CMixNet | 98.00 | 94.35 | 94.17 |
Training computational cost of our detection and classification models.
| Mode | Initialization | Optimizer | Regularizer | Learning Rate | Epochs | Training Time |
|---|---|---|---|---|---|---|
| Detection | Random | GSD | BN, Dropout | 0.01 | 100 | 68 h |
| Detection | Random | GSD | BN, Dropout | 0.001 | 30 | 36 h |
| Detection | Random | GSD | BN, Dropout | 0.0001 | 20 | 24 h |
| Classification | Random | GSD | BN, Dropout | 0.01 | 700 | 52 h |
| Classification | Random | GSD | BN, Dropout | 0.001 | 200 | 31 h |
| Classification | Random | GSD | BN, Dropout | 0.0001 | 100 | 23 h |