| Literature DB >> 33459996 |
Muhammad Ayaz1, Furqan Shaukat2, Gulistan Raja3.
Abstract
Tuberculosis (TB) remains one of the major health problems in modern times with a high mortality rate. While efforts are being made to make early diagnosis accessible and more reliable in high burden TB countries, digital chest radiography has become a popular source for this purpose. However, the screening process requires expert radiologists which may be a potential barrier in developing countries. A fully automatic computer-aided diagnosis system can reduce the need of trained personnel for early diagnosis of TB using chest X-ray images. In this paper, we have proposed a novel TB detection technique that combines hand-crafted features with deep features (convolutional neural network-based) through Ensemble Learning. Handcrafted features were extracted via Gabor Filter and deep features were extracted via pre-trained deep learning models. Two publicly available datasets namely (i) Montgomery and (ii) Shenzhen were used to evaluate the proposed system. The proposed methodology was validated with a k-fold cross-validation scheme. The area under receiver operating characteristics curves of 0.99 and 0.97 were achieved for Shenzhen and Montgomery datasets respectively which shows the superiority of the proposed scheme.Entities:
Keywords: Computer aided diagnosis; Convolutional neural network; Ensemble learning; Tuberculosis
Year: 2021 PMID: 33459996 PMCID: PMC7812355 DOI: 10.1007/s13246-020-00966-0
Source DB: PubMed Journal: Phys Eng Sci Med ISSN: 2662-4729
Fig. 1Sample CXR images where (a) shows a normal CXR and (b)–(d) show different manifestations of TB. (a) Normal, (b) pleural effusion, (c) infiltrates, (d) cavity lung lesion
Input image sizes of different CNN architectures
| CNN | Input image size (pixels) |
|---|---|
| Inception v3 [ | 299 × 299 × 3 |
| InceptionResnetv2 [ | 299 × 299 × 3 |
| Vgg16 [ | 224 × 224 × 3 |
| Vgg19 [ | 224 × 224 × 3 |
| MobileNet [ | 224 × 224 × 3 |
| ResNet50 [ | 224 × 224 × 3 |
| Xception [ | 299 × 299 × 3 |
Fig. 2Gabor filter ouput on sample CXR images. (a) Normal CXR, (b) Gabor output at λ = 2, (c) Gabor output at λ = 4, (d) infected CXR, (e) Gabor output at λ = 2, (f) Gabor output at λ = 4
Fig. 3Block diagram of the proposed method
Fig. 4Gabor output for TB infected CXR images. (a) Infected CXR, (b) Gabor output at λ = 2, (c) Gabor output at λ = 4, (d) infected CXR, (e) Gabor output at λ = 2, (f) Gabor output at λ = 4, (g) Infected CXR, (h) Gabor output at λ = 2, and (i) Gabor output at λ = 4
Classification results for Montgomery dataset
| Ensemble | Inception v3 | InceptionResnetv2 | Vgg16 | Vgg19 | MobileNet | ResNet50 | Xception | Gabor (λ = 2) | Gabor (λ = 4) | |
|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy (%) | 86.23 | 82.60 | 81.88 | 82.60 | 83.33 | 73.91 | 75.36 | 83.33 | 86.96 | |
| AUC | 0.90 | 0.90 | 0.89 | 0.92 | 0.93 | 0.79 | 0.84 | 0.89 | 0.93 |
Classification results for Shenzhen dataset using Logistic Regression as level 0 classifier
| Ensemble | Inception v3 | InceptionResnetv2 | Vgg16 | Vgg19 | MobileNet | ResNet50 | Xception | Gabor (λ = 2) | Gabor (λ = 4) | |
|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy (%) | 87.31 | 86.24 | 87.60 | 83.99 | 87.30 | 80.67 | 83.36 | 80.67 | 79.46 | |
| AUC | 0.93 | 0.93 | 0.93 | 0.90 | 0.86 | 0.91 | 0.85 | 0.86 |
Classification results for Shenzhen dataset using CNN as level 0 classifier
| Ensemble | Inception v3 | InceptionResnetv2 | Vgg16 | Vgg19 | MobileNet | ResNet50 | Xception | Gabor (λ = 2) | Gabor (λ = 4) | |
|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy (%) | 89.88 | 89.12 | 83.98 | 83.37 | 91.85 | 79.44 | 86.56 | 85.04 | 85.03 | |
| AUC | 0.92 | 0.92 | 0.86 | 0.86 | 0.94 | 0.81 | 0.89 | 0.87 | 0.87 |
Fig. 5ROC curves of different classifiers for Montgomery dataset
Fig. 6ROC curves of different classifiers for Shenzhen dataset
Performance comparison of different CAD systems
| CAD System | Year | Dataset | Extracted features | Classifier | Accuracy (%) | AUC |
|---|---|---|---|---|---|---|
| Santosh et al. [ | 2016 | Montgomery dataset (MC), Shenzhen dataset (SZ) | Thoracic edge map encoding | Neural network | 79.23 (MC) 86.36 (SZ) | 0.88 (MC) 0.93 (SZ) |
| Lopes et al. [ | 2017 | MC dataset, SZ dataset | CNN (GoogleNet, ResNet and VggNet ) transfer learning | SVM | 82.6 (MC) 84.7 (SZ) | 0.926 (MC) 0.926 (SZ) |
| Santosh et al. [ | 2018 | MC dataset, SZ dataset, Indian (IN) dataset | Texture, Shape, Edge, Symmetry | Bayesian network, Neural network, Random forest (RF) | 83 (MC) 86 (IN) 91 (SZ) | 0.90 (MC) 0.94 (IN) 0.96 (SZ) |
| Vajda et al. [ | 2018 | MC dataset, SZ dataset | Set A: IH, GM, SD, CD, HOG, LBP Set B: Color, intensity, edge, shape, texture Set C: Eccentricity, centroid, bounding box, orientation, extent, size | Neural network | 78.3 (MC) 95.57 (SZ) | 0.87 (MC) 0.99 (SZ) |
| Rajaraman et al. [ | 2018 | SZ MC Kenya (K) India (I) | Ensemble (pretrained CNNs, HOG, GIST, SURF) | SVM, logistic regression | 0.934 (SZ) 0.875 (MC) 0.776 (K) 0.960 (I) | 0.991 (SZ) 0.962 (MC) 0.826 (K) 0.965 (I) |
| Pasa et al. [ | 2019 | MC dataset, SZ dataset, Combined (MC and SZ) dataset, Belarus dataset | Custom CNN | 79 (MC) 84.4 (SZ) 86.2 (combined) | 0.811 (MC) 0.90 (SZ) 0.925 (combined) | |
| Govindarajan et al. [ | 2019 | MC dataset | Bag of features (BoF) approach with speeded-up Robust feature (SURF) descriptor | Multilayer perceptron | 87.8 | 0.94 |
| Kyung et al. [ | 2020 | Chest X-ray 14 dataset (for training/validation), MC dataset (for testing), SZ dataset (for Testing), Johns Hopkins Hospital dataset (JHH) (for testing) | ResNet-50 (transfer learning) | CNN | 0.91 (SZ) 0.87 (JHH) | |
| Sahlol et al. [ | 2020 | SZ dataset dataset 2 | MobileNet, Feature selection by AEO | CNN | 90.2 (SZ) 94.1 (dataset 2) | |
| Proposed method | 2020 | MC dataset, SZ dataset | Ensemble (pre trained CNN, Gabor filter) | Logistic regression, CNN | 93.47 (MC) 97.59 (SZ) | 0.97 (MC) 0.99 (SZ) |
Abbreviations: Intensity Histogram (IH), Gradient Magnitude Histogram (GM), Shape Descriptor Histogram (SD), Curvature Descriptor Histogram (CD), Histogram of Oriented Gradient (HOG), Local Binary Pattern (LBP), First order statistical feature (FOSF), Gray level co-occurrence matrix (GLCM) features, Artiicial Ecosystem-based Optimization (AEO)
Fig. 7ROC curves of ensemble learning