| Literature DB >> 32989379 |
R Karthik1,2, R Menaka1,2, Hariharan M1,2.
Abstract
COVID-19 is a deadly viral infection that has brought a significant threat to human lives. Automatic diagnosis of COVID-19 from medical imaging enables precise medication, helps to control community outbreak, and reinforces coronavirus testing methods in place. While there exist several challenges in manually inferring traces of this viral infection from X-ray, Convolutional Neural Network (CNN) can mine data patterns that capture subtle distinctions between infected and normal X-rays. To enable automated learning of such latent features, a custom CNN architecture has been proposed in this research. It learns unique convolutional filter patterns for each kind of pneumonia. This is achieved by restricting certain filters in a convolutional layer to maximally respond only to a particular class of pneumonia/COVID-19. The CNN architecture integrates different convolution types to aid better context for learning robust features and strengthen gradient flow between layers. The proposed work also visualizes regions of saliency on the X-ray that have had the most influence on CNN's prediction outcome. To the best of our knowledge, this is the first attempt in deep learning to learn custom filters within a single convolutional layer for identifying specific pneumonia classes. Experimental results demonstrate that the proposed work has significant potential in augmenting current testing methods for COVID-19. It achieves an F1-score of 97.20% and an accuracy of 99.80% on the COVID-19 X-ray set.Entities:
Keywords: CNN; COVID-19; Chest X-ray; Deep learning; Pneumonia
Year: 2020 PMID: 32989379 PMCID: PMC7510455 DOI: 10.1016/j.asoc.2020.106744
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 6.725
Fig. 1Schematic diagram of the proposed Channel Shuffled Dual-Branched CNN.
Fig. 2Architectural sketch of the proposed CSDB CNN with Distinctive Filter Learning (DFL).
Data collection from different sources. Provided are the count of patients studied and samples curated in each category of pneumonia.
| S.No | Source | Details | Category | Number of patients | Sample count |
|---|---|---|---|---|---|
| 1 | Joseph Cohen Dataset | Open-source data maintained by the University of Montreal | COVID-19 | 227 | 356 |
| 2 | Radiopaedia | An open database compiled by radiologists and clinicians | COVID-19 | 35 | 61 |
| 3 | AG Chung Dataset | University of Waterloo, Canada | COVID-19 | 31 | 35 |
| 4 | ActualMed Dataset | University of Waterloo, Canada | COVID-19 | 51 | 58 |
| 5 | SIRM | Italian Society of Medical and Interventional Radiology | COVID-19 | 48 | 48 |
| 6 | RSNA Challenge | Public data provided by the National Institutes of Health Clinical Center | Normal | 8851 | 8851 |
| 7 | Paul Mooney dataset | Guangzhou Women and Children’s Medical Centre, Guangzhou | Normal | 584 | 1583 |
| Bacterial Pneumonia | 1437 | 2780 | |||
| Viral Pneumonia | 1216 | 1493 | |||
Performance of the CSDB CNN on each validation fold.
| Folds | Precision | Recall | F1-score | Accuracy | Specificity | AUC |
|---|---|---|---|---|---|---|
| Fold1 | 90.46 | 93.53 | 91.73 | 93.94 | 97.78 | 95.66 |
| Fold2 | 88.90 | 91.97 | 90.03 | 92.47 | 97.22 | 94.60 |
| Fold3 | 89.34 | 92.69 | 90.65 | 93.09 | 97.52 | 95.10 |
| Fold4 | 89.58 | 94.63 | 91.87 | 94.07 | 97.88 | 96.25 |
| 88.60 | 94.06 | 91.07 | 93.55 | 97.66 | 95.86 | |
| 89.38 | 93.38 | 91.07 | 93.42 | 97.61 | 95.49 |
Observations of the cross-validation process (learning curves) recorded by the CSDB CNN with the DFL model. The rows present the evolution of prediction accuracy with epochs, degeneration of model loss, ROC curve respectively.
Accuracy, Loss values for the proposed DFL embedded CSDB CNN observed on the training and validation data across 5 folds. Also, the Macro averaged Area under ROC (AUC) and AUC for the COVID-19 class are shown for each fold.
| Fold | Accuracy | Loss | Area under ROC (%) | |||
|---|---|---|---|---|---|---|
| Training | Validation | Training | Validation | COVID-19 | Macro-average | |
| Fold1 | 99.70 | 98.23 | 0.0667 | 0.167 | 98.51 | 97.75 |
| Fold2 | 99.88 | 96.79 | 0.0512 | 0.237 | 97.65 | 97.27 |
| Fold3 | 99.34 | 97.61 | 0.082 | 0.179 | 98.26 | 98.16 |
| Fold4 | 99.50 | 98.33 | 0.088 | 0.159 | 98.77 | 99.09 |
| Fold5 | 99.81 | 98.72 | 0.0710 | 0.127 | 98.75 | 97.75 |
Fig. 3Fold-wise Confusion Matrix for the proposed DFL-enabled CSDB CNN. The values were recorded on the validation set under each fold.
Proposed DFL augmented CSDB CNN performance on the validation set for the four target classes (values are averaged across 5 folds). Also, the macro-averaged scores for each metric (computed from all classes per validation fold) are aggregated over all 5 folds.
| Classes | Precision | Recall | F1-score | Accuracy | Specificity | AUC |
|---|---|---|---|---|---|---|
| COVID-19 | 98.36 | 96.07 | 97.20 | 99.80 | 99.94 | 98.01 |
| Normal | 99.54 | 97.89 | 98.71 | 98.25 | 99.03 | 98.46 |
| Bacterial Pneumonia | 95.43 | 98.74 | 97.05 | 98.91 | 98.94 | 98.84 |
| Viral Pneumonia | 92.03 | 97.46 | 94.66 | 98.92 | 99.08 | 98.27 |
| 96.34 | 97.54 | 96.90 | 97.94 | 99.25 | 98.39 |
Classification performance of the CSDB CNN and DFL over a baseline CNN with 7 convolutional layers connected in a feedforward manner. Class-wise F1-scores and AUC values are reported for the ablation experiments. It is assumed that ‘n’ denotes the number of convolutional layers on CNN.
| Experiments | F1-score | AUC | Accuracy | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| COVID-19 | Normal | Bacterial Pneumonia | Viral Pneumonia | Macro average | COVID-19 | Normal | Bacterial Pneumonia | Viral Pneumonia | Macro AUC | ||
| Baseline 7-layer feedforward CNN | 73.60 | 84.48 | 69.26 | 64.78 | 73.56 | 90.29 | 84.49 | 84.75 | 83.62 | 85.78 | 79.96 |
| Baseline CNN with DFL at n-1 conv layer | 81.78 | 89.90 | 76.02 | 72.89 | 80.14 | 94.51 | 88.16 | 88.45 | 88.24 | 89.84 | 84.87 |
| CSDB CNN | 94.26 | 95.62 | 90.15 | 83.18 | 76.85 | 95.21 | 94.88 | 95.60 | 94.78 | 87.81 | 93.42 |
| CSDB CNN with DFL at n-2, n-1 conv layers | 96.86 | 98.28 | 95.50 | 94.74 | 96.34 | 98.16 | 97.99 | 97.93 | 99.10 | 98.23 | 97.35 |
| CSDB CNN with DFL at n-1 conv layer | 97.20 | 98.71 | 97.05 | 94.66 | 96.90 | 98.01 | 98.46 | 98.84 | 98.27 | 98.39 | 97.94 |
Quantitative performance validation results of multiple standard CNN architectures on the COVID-19 X-ray dataset. The enhancements from DFL are evaluated by applying it to the penultimate Convolutional layer for each CNN.
| Methods | Number of parameters | Macro average precision | Macro average recall | Macro average F1-score | Accuracy | AUC |
|---|---|---|---|---|---|---|
| SqueezeNet | 728K | 67.33 | 76.91 | 70.26 | 74.92 | 84.00 |
| SqueezeNet + DFL | 72.15 | 81.10 | 75.16 | 79.34 | 86.85 | |
| VGG16 | 134M | 73.06 | 84.17 | 77.13 | 81.40 | 88.79 |
| VGG16 + DFL | 77.49 | 84.86 | 80.11 | 85.20 | 89.77 | |
| ResNet50 | 23M | 85.50 | 91.68 | 88.03 | 90.93 | 94.25 |
| ResNet50 + DFL | 91.49 | 95.15 | 93.15 | 94.79 | 96.62 | |
| ResNeXt 32 × 4d | 23M | 86.91 | 92.87 | 89.46 | 92.24 | 95.07 |
| ResNeXt 32 × 4d + DFL | 91.69 | 95.35 | 93.34 | 94.96 | 96.76 | |
| DenseNet161 | 26M | 88.96 | 93.64 | 90.98 | 93.25 | 95.62 |
| DenseNet161 + DFL | 95.11 | 97.04 | 96.03 | 97.15 | 97.99 | |
| 15M | 82.97 | 89.28 | 85.37 | 88.42 | 92.57 | |
| 96.34 | 97.54 | 96.90 | 97.94 | 98.39 | ||
Visualization of the most salient regions on the X-ray using three gradient-based input response weighing methods. Samples on the top left, top right, bottom left, bottom right correspond to COVID-19, normal, bacterial, and viral pneumonia classes respectively. The color bars were suitably chosen to best project the sensitivities of these heat maps in the most characteristic way.
Fig. 4Visualization of the template patterns captured by a distinct set of filters for each class. (For visual purposes, top 500 filters are shown for every target class).
Performance Analysis of the proposed work with current research works that have utilized the same chest X-ray data sources for COVID-19, Pneumonia images as the current work.
| S No | Source | Methodology | Class | Number of COVID-19 Test samples | Approx. parameters | Overall | Overall accuracy (%) | F1-score for COVID-19 | COVID-19 class accuracy (%) |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Ozturk et al. | DarkNet-19 based CNN | 3 | 1.164M | 87.40 | 87.02 | 88.00 | 87.02 | |
| 2 | Mangal et al. | CheXNet based CNN | 4 | 30 | 26M | 92.30 | 87.2 | 96.77 | 99.6 |
| 3 | Khan et al. | Transfer learning with Xception net | 4 | 33M | 89.8 | 89.6 | 95.61 | 96.6 | |
| 4 | Wang and Wong | Customized CNN architecture | 3 | 100 | 11.75M | 93.13 | 93.33 | 94.78 | 96.67 |
| 5 | Apostolopoulos and Mpesiana | Transfer learning with MobileNetV2 | 4 | 222 | 3.4M | 93.80 | 94.72 | 90.50 | 96.80 |
| 6 | Farooq and Hafeez | ResNet50 based CNN | 4 | 8 | 25.6M | 96.88 | 96.23 | 100.0 | 100.0 |
| 7 | Proposed Work | Customized CNN with distinctive filter learning module | 4 | 112 |