| Literature DB >> 33564340 |
Himadri Mukherjee1, Subhankar Ghosh2, Ankita Dhar1, Sk Md Obaidullah3, K C Santosh4, Kaushik Roy1.
Abstract
Among radiological imaging data, Chest X-rays (CXRs) are of great use in observing COVID-19 manifestations. For mass screening, using CXRs, a computationally efficient AI-driven tool is the must to detect COVID-19-positive cases from non-COVID ones. For this purpose, we proposed a light-weight Convolutional Neural Network (CNN)-tailored shallow architecture that can automatically detect COVID-19-positive cases using CXRs, with no false negatives. The shallow CNN-tailored architecture was designed with fewer parameters as compared to other deep learning models. The shallow CNN-tailored architecture was validated using 321 COVID-19-positive CXRs. In addition to COVID-19-positive cases, another set of non-COVID-19 5856 cases (publicly available, source: Kaggle) was taken into account, consisting of normal, viral, and bacterial pneumonia cases. In our experimental tests, to avoid possible bias, 5-fold cross-validation was followed, and both balanced and imbalanced datasets were used. The proposed model achieved the highest possible accuracy of 99.69%, sensitivity of 1.0, where AUC was 0.9995. Furthermore, the reported false positive rate was only 0.0015 for 5856 COVID-19-negative cases. Our results stated that the proposed CNN could possibly be used for mass screening. Using the exact same set of CXR collection, the current results were better than other deep learning models and major state-of-the-art works. © Springer Science+Business Media, LLC, part of Springer Nature 2021.Entities:
Keywords: COVID-19; Chest X-rays; Convolutional neural network; Deep learning; Mass screening
Year: 2021 PMID: 33564340 PMCID: PMC7863062 DOI: 10.1007/s12559-020-09775-9
Source DB: PubMed Journal: Cognit Comput ISSN: 1866-9956 Impact factor: 4.890
Fig. 1CXR samples: a–c COVID-19-positive cases and d–f non-COVID-19 cases (pneumonia)
Fig. 2Architecture of the proposed CNN
Number of generated parameters by the proposed network for a 25×25 image
| Layer | Parameters |
|---|---|
| Convolution | 280 |
| Dense 1 | 310016 |
| Dense 2 | 514 |
| Total | 310810 |
Fig. 3Feature map visualization: a COVID-19-positive case and its corresponding b–d feature maps; and e pneumonia-positive case and its corresponding f, g feature maps
Confusion matrices for different batch sizes
| Batch size: 20 | Batch size: 25 (best result) | ||||
| COVID-19 | Non- COVID-19 | COVID-19 | Non- COVID-19 | ||
| COVID-19 | 321 | 0 | COVID-19 | 321 | 0 |
| Non-COVID-19 | 3 | 318 | Non-COVID-19 | 2 | 319 |
| Batch size: 50 | Batch size: 75 | ||||
| COVID-19 | Non- COVID-19 | COVID-19 | Non- COVID-19 | ||
| COVID-19 | 319 | 2 | COVID-19 | 320 | 1 |
| Non-COVID-19 | 3 | 318 | Non-COVID-19 | 4 | 317 |
| Batch size:100 | Batch size:125 | ||||
| COVID-19 | Non-COVID-19 | COVID-19 | Non-COVID-19 | ||
| COVID-19 | 320 | 1 | COVID-19 | 320 | 1 |
| Non-COVID-19 | 3 | 318 | Non-COVID-19 | 3 | 318 |
Performance metrics for different batch sizes
| Metrics | Batch size | |||||
|---|---|---|---|---|---|---|
| 20 | 25 | 50 | 75 | 100 | 125 | |
| Sensitivity | 1 | 1 | 0.9938 | 0.9969 | 0.9969 | 0.9969 |
| Specificity | 0.9907 | 0.9938 | 0.9907 | 0.9875 | 0.9907 | 0.9907 |
| Precision | 0.9907 | 0.9938 | 0.9907 | 0.9877 | 0.9907 | 0.9907 |
| False positive rate | 0.0093 | 0.0062 | 0.0093 | 0.0125 | 0.0093 | 0.0093 |
| False negative rate | 0 | 0 | 0.0062 | 0.0031 | 0.0031 | 0.0031 |
| Accuracy (%) | 99.53 | 99.69 | 99.22 | 99.22 | 99.38 | 99.38 |
| F1 Score | 0.9953 | 0.9969 | 0.9922 | 0.9922 | 0.9938 | 0.9938 |
| AUC | 0.9997 | 0.9995 | 0.9993 | 0.9991 | 0.9993 | 0.9996 |
Fig. 4ROC curves for different batch sizes: a 20 batch size; b 25 batch size; c 50 batch size; d 75 batch size; e 100 batch size; and f 125 batch size. The batch size of 50 was found to be the best of all
Performance comparison with other deep learning models for balanced dataset
| Metrics | InceptionV3 | MobileNet | ResNet50 | Proposed CNN |
|---|---|---|---|---|
| Sensitivity | 1.0000 | 1.0000 | 0.9252 | |
| Specificity | 0.9751 | 0.9938 | 0.9751 | |
| Precision | 0.9757 | 0.9938 | 0.9738 | |
| False positive rate | 0.0249 | 0.0062 | 0.0249 | |
| False negative rate | 0.0000 | 0.0000 | 0.0748 | |
| Accuracy (%) | 98.75 | 99.69 | 95.02 | |
| F1 Score | 0.9877 | 0.9969 | 0.9489 | |
| AUC | 0.9877 | 0.9969 | 0.9355 | |
| Parameters | 26,522,146 | 7,423,938 | 49,278,594 |
Bold entries denote best result
Fig. 5ROC curves for a InceptionV3, b MobileNet, and c ResNet50
Confusion matrices for imbalanced dataset
| COVID-19 | Non-COVID-19 | |
|---|---|---|
| COVID-19 | 311 | 10 |
| Non-COVID-19 | 9 | 5847 |
Performance metrics for imbalanced data
Bold entries denote best result
Fig. 6ROC curves for a InceptionV3, b MobileNet, c ResNet50, and d Proposed network
Confusion matrices for extended dataset
Performance metrics for extended data
Fig. 7ROC curve for extended dataset
Comparative study. Index: Li et al. (2020) [14], Abbas et al. (2020)[12], Luz et al. (2020)[20], Wang et al. (2020) [24], Sethy and Behera (2020) [13], Zhang et al. (2020) [22], and Apostolopoulos and Mpesiana (2020) [28]. Since authors did not report results for several different metrics, there exists symbol “–” in the table
| Metrics | Li et al. [ | Abbas et al. [ | Luz et al. [ | Wang et al. [ | Sethy & Behera [ | Zhang et al. [ | Aposto-lopoulos & Mpesiana [ | Shallow CNN (proposed) |
|---|---|---|---|---|---|---|---|---|
| Dataset (No. of tested COVID-19-positive cases) | 36 | 32 | 31 | 31 | 5 | 100 | 224 | 321 |
| Sensitivity | – | 0.9791 | 0.968 | 0.871 | 0.9729 | 0.9600 | 0.9866 | 1.0000 |
| Specificity | – | 0.9187 | – | – | 97.4705 | 0.7065 | 0.9646 | 0.9938 |
| Precision | – | 0.9336 | – | – | – | – | – | 0.9938 |
| False positive rate | – | – | – | – | – | – | – | 0.0062 |
| False negative rate | – | – | – | – | – | – | – | 0.0000 |
| Accuracy (%) | 93.5 | 95.12 | 93.9 | 92.6 | 95.38 | 83.34 | 96.78 | 99.69 |
| F1 score | – | – | – | – | 0.9552 | – | – | 0.9969 |
| AUC | 0.992 | – | – | – | – | – | – | 0.9995 |