| Literature DB >> 35204418 |
Grace Ugochi Nneji1, Jingye Cai1, Jianhua Deng1, Happy Nkanta Monday2, Edidiong Christopher James1, Chiagoziem Chima Ukwuoma1.
Abstract
Pneumonia is a prevalent severe respiratory infection that affects the distal and alveoli airways. Across the globe, it is a serious public health issue that has caused high mortality rate of children below five years old and the aged citizens who must have had previous chronic-related ailment. Pneumonia can be caused by a wide range of microorganisms, including virus, fungus, bacteria, which varies greatly across the globe. The spread of the ailment has gained computer-aided diagnosis (CAD) attention. This paper presents a multi-channel-based image processing scheme to automatically extract features and identify pneumonia from chest X-ray images. The proposed approach intends to address the problem of low quality and identify pneumonia in CXR images. Three channels of CXR images, namely, the Local Binary Pattern (LBP), Contrast Enhanced Canny Edge Detection (CECED), and Contrast Limited Adaptive Histogram Equalization (CLAHE) CXR images are processed by deep neural networks. CXR-related features of LBP images are extracted using shallow CNN, features of the CLAHE CXR images are extracted by pre-trained inception-V3, whereas the features of CECED CXR images are extracted using pre-trained MobileNet-V3. The final feature weights of the three channels are concatenated and softmax classification is utilized to determine the final identification result. The proposed network can accurately classify pneumonia according to the experimental result. The proposed method tested on publicly available dataset reports accuracy of 98.3%, sensitivity of 98.9%, and specificity of 99.2%. Compared with the single models and the state-of-the-art models, our proposed network achieves comparable performance.Entities:
Keywords: COVID-19; contrast enhanced canny edge detection (CECED); contrast limited adaptive histogram equalization (CLAHE); deep learning; image identification; local binary pattern (LBP); pneumonia disease
Year: 2022 PMID: 35204418 PMCID: PMC8870748 DOI: 10.3390/diagnostics12020325
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Summary of the Related Works.
| Authors | Year | Mode of Imaging | DL Techniques | Classification Task | Evaluation Results |
|---|---|---|---|---|---|
| Cicero et al. [ | 2017 | X-ray Image | GoogLeNet is utilized to classify two classes - normal and abnormal images | Binary class | For normal class: |
| Guendel et al. [ | 2018 | X-ray Image | Used location-aware dense networks technique to identify anomalies in chest X-rays | Multiple class | PLCO dataset, |
| Rajaraman et al. [ | 2018 | X-ray Image | A modified VGG16 is employed for the identification and detection of viral and bacterial pneumonia | Binary class | ACC (within 91.8% to 96.2%) |
| Correa et al. [ | 2018 | Ultrasound Image | Detection of pneumonia using 3 layers feed-forward neural network | Binary class | SEN = 90.9% |
| Ke et al. [ | 2019 | X-ray Image | Detection of lung diseases using an approach called neuroheuristic | Multiple class | Sensitivity = 84.22%, |
| Saraiva et al. [ | 2019 | X-ray Image | A CNN model was applied on a dataset of 5863 images and cross-validation was used for the validation of the model | Binary class | Accuracy = 95.30% |
| Sirazitdinov et al. [ | 2019 | X-ray Image | An emsemble of RetinaNet and Mask RCNN was applied | Binary class | Precision = 75.0%, |
| Liang and Zheng [ | 2020 | X-ray Image | A modified 49 convolutional and 2 fully connected layer of a CNN model was used for the classification of children’s lung regions | Binary class | F1-score = 92.7% |
| Apostolopoulos et al. [ | 2020 | X-ray Image | Different fine-tuning approaches were evaluated for the automatic detection of pneumonia | Binary class | VGG19 has the highest value of: |
| Xu et al. [ | 2020 | X-ray Image | Multiple CNN models were compared in order to categorize the classes of CT scans | Multiple class | Accuracy = 86.7% |
| Habib et al. [ | 2020 | X-ray Image | Detection of pneumonia using an ensemble of VGG-19 and CheXNet for the extraction of features and random forest as the classifier | Binary class | Accuracy = 98.93% |
| Chouhan et al. [ | 2020 | X-ray Image | A transfer learning technique is applied for the detection of pneumonia | Binary class | Accuracy = 96.4% |
| El Asnaoui et al. [ | 2020 | X-ray Image | A fine-tuned of eight different models for the detection and classification of pneumonia | Binary class | Highest accuracy is the fine-tubed ResNet50 (>96%) |
| El Asnaoui et al. [ | 2020 | X-ray Image | A comparative findings of seven DL models for the classification and detection of pneumonia (including COVID-19) | Multiple class | Accuracy Evaluations: |
Description of the Dataset.
| Dataset | Pneumonia Category | Value | Selected Amount Used |
|---|---|---|---|
| Kaggle database of RSNA [ | Bacterial | 3029 | 1000 |
| Viral | 2983 | 1000 | |
| Normal | 8851 | 1000 | |
| Rahman et al. [ | COVID-19 | 3616 | 1000 |
Figure 1LBP coding and calculation illustration.
Figure 2LBP pre-processing.
Figure 3CLAHE Pre-processing.
Figure 4CECED Pre-processing.
Figure 5Shallow CNN structure applied for the feature extraction of LBP CXR images.
Parameter for the modified MobileNet-V3. bneck represents bottleneck convolution, SE depicts whether there is a Squeeze-and-Excite in that block, NL represents the type of non-linearity utilized, HS represents h-swish, RE denotes ReLU and S represents stride.
| Input | Operator | Expansion | Output | SE | NL | Stride |
|---|---|---|---|---|---|---|
| 224 × 224 × 3 | Conv2d, 3 × 3 | - | 16 | No | HS | 2 |
| 112 × 112 × 16 | bneck, 3 × 3 | 16 | 16 | Yes | RE | 2 |
| 56 × 56 × 16 | bneck, 3 × 3 | 72 | 24 | No | RE | 2 |
| 28 × 28 × 24 | bneck, 3 × 3 | 86 | 24 | No | RE | 1 |
| 28 × 28 × 24 | bneck, 5 × 5 | 96 | 40 | Yes | HS | 2 |
| 14 × 14 × 40 | bneck, 5 × 5 | 240 | 40 | Yes | HS | 1 |
| 14 × 14 × 40 | bneck, 5 × 5 | 240 | 40 | Yes | HS | 1 |
| 14 × 14 × 40 | bneck, 5 × 5 | 120 | 48 | Yes | HS | 1 |
| 14 × 14 × 48 | bneck, 5 × 5 | 144 | 48 | Yes | HS | 1 |
| 7 × 7 × 96 | bneck, 5 × 5 | 288 | 96 | Yes | HS | 2 |
| 7 × 7 × 96 | bneck, 5 × 5 | 576 | 96 | Yes | HS | 1 |
| 7 × 7 × 96 | bneck, 5 × 5 | 576 | 96 | Yes | HS | 1 |
| 7 × 7 × 256 | Conv2d, 1 × 1 | - | 256 | Yes | HS | 1 |
| 1 × 1 × 256 | Avg pool, 7 × 7 | - | - | No | - | 1 |
| 1 × 1 × 512 | Conv2d, 1 × 1 | - | 512 | No | HS | 1 |
Figure 6Framework of the pretrained MobileNet-V3 utilized for the features extraction of CECED CXR images.
Figure 7Framework of the pretrained Inception-V3 utilized for the features extraction of CLAHE CXR images.
Figure 8Our proposed multi-channel scheme for pneumonia identification.
Comparison of our proposed model with single channels and dual channels.
| Model | ACC (%) | SEN (%) | SPE (%) | PRE (%) | F1-s (%) | Time (min) |
|---|---|---|---|---|---|---|
| LBP-Channel Shallow CNN (LCSC) | 90.9 | 92.3 | 93.1 | 91.2 | 92.7 | 3.2 |
| CECED-Channel MobileNet-V3 (CCM) | 93.7 | 95.4 | 95.7 | 94.3 | 95.5 | 18.6 |
| CLAHE-Channel Inception-V3 (CCI) | 95.6 | 94.9 | 96.2 | 95.8 | 95.3 | 21.8 |
| LBP-Channel Shallow CNN + CECED-channel MobileNet-V3 (LCSC + CCM) | 92.2 | 93.7 | 94.5 | 92.7 | 94.3 | 23.4 |
| LBP-Channel Shallow CNN + CLAHE-channel Inception-V3 (LCSC + CCI) | 94.4 | 95.5 | 96.8 | 95.1 | 96.6 | 22.7 |
| CLAHE-Channel inception-V3 + CECED-channel MobileNet-V3 (CCI + CCM) | 97.5 | 97.3 | 98.3 | 97.8 | 98.1 | 26.8 |
| LBP-Channel Shallow CNN + CLAHE-channel Inception-V3 + CECED-channel MobileNet-V3 (LCSC + CCI + CCM) | 98.3 | 98.9 | 99.2 | 98.8 | 99.0 | 30.3 |
Figure 9Performance of the proposed multi-channel in comparison with the single channels and dual channels across the different evaluation metrics.
Figure 10Accuracy curves for the proposed multi-channel in comparison with the single-channels and dual-channels.
Figure 11Loss curves for the proposed multi-channel in comparison with the single-channels and dual-channels.
Figure 12ROC curves for the proposed multi-channel in comparison with the single-channels and dual-channels.
Figure 13Precision–recall curves for the proposed multi-channel in comparison with the single-channels and dual-channels.
Result comparison of our proposed model with state-of-the-art methods for pneumonia classification.
| Authors | ACC (%) | SEN (%) | SPE (%) |
|---|---|---|---|
| Cicero et al. [ | 91.0 | 91.0 | 91.0 |
| Correa et al. [ | - | 90.9 | 100.0 |
| Apostolopoulos et al. [ | 98.0 | 92.9 | 98.8 |
| Xu et al. [ | 86.7 | 86.9 | - |
| Habib et al. [ | 98.93 | - | - |
| Chouchan et al. [ | 96.4 | 99.6 | - |
| Yamaç et al. [ | 86.5 | 79.2 | 90.7 |
| Wang et al. [ | 93.3 | 90.7 | 95.5 |
| Li et al. [ | 96.9 | 97.8 | 94.9 |
| J.K. K. Singh and A. Singh [ | 95.8 | 96.1 | 95.7 |
| Yang et al. [ | 88.4 | 64.7 | 92.9 |
| Wang et al. [ | 94.5 | 94.7 | 97.3 |
| Alsharif et al. [ | 99.7 | 99.7 | 99.8 |
| Alqudah et al. [ | 93.9 | 93.2 | 96.6 |
| Alquran et al. [ | 93.1 | 92.9 | 96.4 |
| Masad et al. [ | 98.9 | 98.3 | 99.2 |
| Our Model | 98.3 | 98.9 | 99.2 |
Comparison table for the selected state of the art models using the same dataset.
| Model | ACC (%) | SEN (%) | SPE (%) |
|---|---|---|---|
| Xu et al. [ | 91.2 | 91.8 | 93.1 |
| Wang et al. [ | 94.0 | 92.9 | 94.2 |
| Li et al. [ | 96.1 | 94.4 | 95.5 |
| Yang et al. [ | 93.5 | 92.4 | 94.7 |
| Wang et al. [ | 95.8 | 95.4 | 96.4 |
| Our Model | 98.3 | 98.9 | 99.2 |
Figure 14Performance evaluation for some selected state of the art models using the same dataset.
Figure 15Accuracy performance for some selected state of the art models using the same dataset.
Results obtained on our dataset using different pretrained models on our proposed model.
| Model | LBP-Based Channel | CECED-Based Channel | CLAHE-Based Channel | ||||||
|---|---|---|---|---|---|---|---|---|---|
| ACC (%) | SEN (%) | SPE (%) | ACC (%) | SEN (%) | SPE (%) | ACC (%) | SEN (%) | SPE (%) | |
| AlexNet | 89.2 | 91.4 | 92.6 | 89.3 | 87.5 | 90.2 | 92.5 | 94.3 | 93.7 |
| VGG-16 | 88.9 | 90.7 | 91.4 | 90.9 | 90.1 | 91.8 | 91.4 | 92.8 | 91.9 |
| ResNet-152 | 84.6 | 86.2 | 87.9 | 91.4 | 92.3 | 93.1 | 87.8 | 88.1 | 87.6 |
| MobileNet-V3 | 87.7 | 89.4 | 90.5 | 93.7 | 95.4 | 95.7 | 90.4 | 91.6 | 90.8 |
| DenseNet-121 | 85.3 | 87.1 | 88.7 | 92.8 | 92.8 | 93.3 | 88.4 | 89.2 | 88.7 |
| Inception-V3 | 86.3 | 88.6 | 89.4 | 93.1 | 91.5 | 93.7 | 95.6 | 94.9 | 96.2 |
| Shallow CNN | 90.9 | 92.3 | 93.1 | 87.2 | 86.1 | 88.4 | 85.9 | 86.2 | 85.7 |
Performance evaluation of our proposed model based on different hyperparameter tuning on our dataset with Adam optimizer.
| Hyperparameters | (LCSC + Adam) | (CCI + Adam) | (CCM + Adam) | (LCSC + CCI + Adam) | (LCSC + CCM + Adam) | (CCI + CCM + Adam) | (LCSC + CCI + CCM + Adam) |
|---|---|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| LR (0.1) + Dropout (0.25) | 87.5 | 81.6 | 89.6 | 87.3 | 85.6 | 90.2 | 88.1 |
| LR (0.1) + Dropout (0.50) | 86.9 | 87.3 | 90.7 | 89.6 | 87.4 | 89.5 | 87.6 |
| LR (0.1) + Dropout (0.75) | 83.6 | 82.6 | 87.9 | 89.9 | 89.3 | 91.7 | 89.2 |
| LR (0.01) + Dropout (0.25) | 81.4 | 85.1 | 91.1 | 90.4 | 91.7 | 90.4 | 87.1 |
| LR (0.01) + Dropout (0.50) | 89.8 | 84.9 | 86.3 | 91.1 | 90.8 | 85.5 | 88.4 |
| LR (0.01) + Dropout (0.75) | 84.7 | 90.7 | 92.8 | 85.7 | 89.2 | 87.8 | 89.7 |
| LR (0.001) + Dropout (0.25) | 82.2 | 95.6 | 88.2 | 84.6 | 92.1 | 92.1 | 91.5 |
| LR (0.001) + Dropout (0.50) | 80.7 | 91.3 | 93.4 | 86.5 | 89.7 | 94.7 | 92.6 |
| LR (0.001) + Dropout (0.75) | 88.3 | 92.7 | 93.7 | 91.4 | 90.8 | 89.6 | 93.8 |
| LR (0.0001) + Dropout (0.25) | 85.9 | 83.4 | 85.6 | 87.9 | 92.4 | 96.3 | 97.4 |
| LR (0.0001) + Dropout (0.50) | 90.9 | 80.6 | 87.1 | 94.4 | 92.2 | 97.5 | 98.3 |
| LR (0.0001) + Dropout (0.75) | 79.5 | 86.2 | 88.9 | 89.8 | 93.3 | 95.7 | 95.9 |
Performance evaluation of our proposed model based on different hyperparameter tuning on our dataset with RMSProp optimizer.
| Hyperparameters | (LCSC + RMSProp) | (CCI + RMSProp) | (CCM + RMSProp) | (LCSC + CCI + RMSProp) | (LCSC + CCM + RMSProp) | (CCI + CCM + RMSProp) | (LCSC + CCI + CCM + RMSProp) |
|---|---|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| LR (0.1) + Dropout (0.25) | 88.5 | 89.1 | 88.9 | 87.6 | 90.7 | 89.7 | 90.1 |
| LR (0.1) + Dropout (0.50) | 89.7 | 87.4 | 89.5 | 87.2 | 89.5 | 90.3 | 91.3 |
| LR (0.1) + Dropout (0.75) | 86.3 | 90.7 | 88.1 | 89.3 | 89.6 | 91.6 | 89.8 |
| LR (0.01) + Dropout (0.25) | 87.7 | 88.2 | 89.6 | 88.4 | 88.4 | 92.5 | 91.6 |
| LR (0.01) + Dropout (0.50) | 81.4 | 91.5 | 91.2 | 90.6 | 90.9 | 90.9 | 93.4 |
| LR (0.01) + Dropout (0.75) | 81.1 | 89.8 | 90.4 | 89.7 | 91.6 | 92.1 | 92.8 |
| LR (0.001) + Dropout (0.25) | 83.8 | 92.3 | 92.2 | 88.9 | 90.1 | 91.4 | 94.4 |
| LR (0.001) + Dropout (0.50) | 86.5 | 89.6 | 94.3 | 87.3 | 93.7 | 93.7 | 96.5 |
| LR (0.001) + Dropout (0.75) | 84.2 | 90.9 | 93.5 | 90.4 | 92.4 | 95.3 | 95.9 |
| LR (0.0001) + Dropout (0.25) | 85.9 | 91.1 | 94.7 | 91.8 | 93.6 | 94.5 | 97.4 |
| LR (0.0001) + Dropout (0.50) | 88.6 | 89.5 | 93.9 | 90.6 | 92.9 | 93.9 | 96.2 |
| LR (0.0001) + Dropout (0.75) | 85.3 | 88.9 | 94.6 | 89.7 | 91.5 | 92.6 | 95.5 |
Performance evaluation of our proposed model based on different hyperparameter tuning on our dataset with SGD optimizer.
| Hyperparameters | (LCSC + SGD) | (CCI + SGD) | (CCM + SGD) | (LCSC + CCI + SGD) | (LCSC + CCM + SGD) | (CCI + CCM + SGD) | (LCSC + CCI + CCM + SGD) |
|---|---|---|---|---|---|---|---|
| Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | Accuracy (%) | |
| LR (0.1) + Dropout (0.25) | 87.2 | 88.8 | 87.7 | 86.1 | 87.1 | 90.8 | 89.7 |
| LR (0.1) + Dropout (0.50) | 85.5 | 89.3 | 88.1 | 88.5 | 89.6 | 89.1 | 90.5 |
| LR (0.1) + Dropout (0.75) | 87.9 | 97.5 | 89.5 | 87.9 | 88.2 | 90.9 | 91.6 |
| LR (0.01) + Dropout (0.25) | 89.1 | 90.9 | 91.9 | 89.7 | 89.7 | 91.7 | 89.1 |
| LR (0.01) + Dropout (0.50) | 82.3 | 92.3 | 90.3 | 91.5 | 91.4 | 92.3 | 91.3 |
| LR (0.01) + Dropout (0.75) | 83.6 | 88.7 | 89.5 | 90.3 | 90.1 | 90.8 | 93.5 |
| LR (0.001) + Dropout (0.25) | 84.9 | 91.9 | 91.7 | 89.1 | 91.2 | 92.5 | 92.7 |
| LR (0.001) + Dropout (0.50) | 85.7 | 90.4 | 92.4 | 90.5 | 92.5 | 91.6 | 95.9 |
| LR (0.001) + Dropout (0.75) | 83.5 | 91.6 | 94.1 | 91.9 | 93.8 | 94.9 | 94.4 |
| LR (0.0001) + Dropout (0.25) | 86.3 | 89.3 | 93.3 | 89.7 | 92.9 | 95.3 | 96.2 |
| LR (0.0001) + Dropout (0.50) | 87.1 | 90.8 | 94.6 | 91.5 | 93.6 | 92.5 | 95.6 |
| LR (0.0001) + Dropout (0.75) | 88.8 | 89.2 | 93.0 | 90.3 | 92.3 | 93.7 | 96.8 |
Figure 16Accuracy results using the raw chest X-ray images.
Figure 17Performance evaluations using the raw chest X-ray images.
Performance evaluation of the proposed model on the single and ensemble models using raw CXR image.
| Model | ACC (%) | SEN (%) | SPE (%) | PRE (%) | F1 (%) |
|---|---|---|---|---|---|
| Raw image Shallow CNN (RISC) | 82.1 | 83.6 | 80.8 | 82.9 | 83.3 |
| Raw image MobileNet-V3 (RIM) | 88.0 | 89.2 | 90.5 | 89.7 | 89.0 |
| Raw image Inception-V3 (RII) | 93.34 | 91.5 | 92.7 | 93.0 | 91.9 |
| Raw image Shallow CNN + Raw image Inception-V3 (RISC + RII) | 91.7 | 89.4 | 91.0 | 90.1 | 91.5 |
| Raw image Shallow CNN + Raw image MobileNet-V3 (RISC + RIM) | 85.8 | 86.1 | 87.3 | 89.6 | 88.9 |
| Raw image Inception-V3 + Raw image MobileNet-V3 (RII + RIM) | 95.2 | 94.6 | 96.1 | 94.2 | 95.7 |
| Raw image Shallow CNN + Raw image Inception-V3 + Raw image MobileNet-V3 (RISC + RII + RIM) | 96.9 | 96.0 | 95.4 | 96.5 | 95.0 |
Performance results obtained using the raw chest X-ray images on different pretrained models on our proposed model.
| Model | Raw Image | ||
|---|---|---|---|
| ACC (%) | SEN (%) | SPE (%) | |
| AlexNet | 90.9 | 89.1 | 91.0 |
| VGG-16 | 89.6 | 90.3 | 89.2 |
| ResNet-152 | 90.2 | 88.5 | 89.0 |
| MobileNet-V3 | 88.0 | 89.2 | 90.5 |
| DenseNet-121 | 87.7 | 89.1 | 88.3 |
| Inception-V3 | 93.3 | 91.5 | 92.7 |
| Shallow CNN | 82.1 | 83.6 | 80.8 |