| Literature DB >> 32588200 |
Dipayan Das1, K C Santosh2, Umapada Pal3.
Abstract
Since December 2019, the Coronavirus Disease (COVID-19) pandemic has caused world-wide turmoil in a short period of time, and the infection, caused by SARS-CoV-2, is spreading rapidly. AI-driven tools are used to identify Coronavirus outbreaks as well as forecast their nature of spread, where imaging techniques are widely used, such as CT scans and chest X-rays (CXRs). In this paper, motivated by the fact that X-ray imaging systems are more prevalent and cheaper than CT scan systems, a deep learning-based Convolutional Neural Network (CNN) model, which we call Truncated Inception Net, is proposed to screen COVID-19 positive CXRs from other non-COVID and/or healthy cases. To validate our proposal, six different types of datasets were employed by taking the following CXRs: COVID-19 positive, Pneumonia positive, Tuberculosis positive, and healthy cases into account. The proposed model achieved an accuracy of 99.96% (AUC of 1.0) in classifying COVID-19 positive cases from combined Pneumonia and healthy cases. Similarly, it achieved an accuracy of 99.92% (AUC of 0.99) in classifying COVID-19 positive cases from combined Pneumonia, Tuberculosis, and healthy CXRs. To the best of our knowledge, as of now, the achieved results outperform the existing AI-driven tools for screening COVID-19 using the acquired CXRs, and proves the viability of using the proposed Truncated Inception Net as a screening tool.Entities:
Keywords: CNN; COVID-19; Chest X-rays; Deep learning; Inception net; Pneumonia; Tuberculosis
Mesh:
Year: 2020 PMID: 32588200 PMCID: PMC7315909 DOI: 10.1007/s13246-020-00888-x
Source DB: PubMed Journal: Phys Eng Sci Med ISSN: 2662-4729
Fig. 1COVID-19 pneumonia is characterized primarily by patches of Ground-Glass Opacity (GGO) and consolidations. In these CXRs, the GGO areas, in early stages of COVID-19, are identified/annotated with white arrows. These annotations were made in the original dataset, which solely attribute the clinical implications
Fig. 2(Above) The original architecture of the Inception Net V3 model, which was implemented for classifying images of the ImageNet database [27]. (Below) The Truncated Inception Net model, which is proposed in our work for screening COVID-19 positive CXRs. The model retains 3 inception modules and 1 grid size reduction module from the original architecture (given above)
Fig. 3Block diagram: The block diagram presents the internal pipeline of an Inception module, which forms the building block of the InceptionNet. Multiple sized kernels (e.g. 33 and 55) are used to convolve with the input image, to extract features of varied spatial resolution. Finally, the activation maps obtained from the parallel computations are stacked depth-wise to form the output
Fig. 4The learning rate is reduced every time the validation loss does not improve for more than the specified patience factor, which is 3 epochs (empirically designed)
Data collection (publicly available)
| Collection | # of positive cases | # of negative cases |
|---|---|---|
| C1: COVID-19 | 162 | – |
| C2: Pneumonia | 4280 | 1583 |
| C3: TB (China) | 342 | 340 |
| TB (USA) | 58 | 80 |
Fig. 5Few samples: a COVID-19, b Pneumonia, c Tuberculosis, and d) Healthy CXRs. GGO and consolidations are observed in COVID-19 CXRs
Experimental datasets using Table 1
| Dataset | COVID-19 | Pneumonia | TB (China) | TB (USA) | ||||
|---|---|---|---|---|---|---|---|---|
| +ve | −ve | +ve | −ve | +ve | −ve | +ve | −ve | |
| D1 | 162 | – | – | – | – | 340 | – | – |
| D2 | 162 | – | – | – | – | – | – | 80 |
| D3 | 162 | – | – | 1583 | – | – | – | – |
| D4 | 162 | – | – | 1583 | – | 340 | – | 80 |
| D5 | 162 | – | 4280 | 1583 | – | – | – | – |
| D6 | 162 | – | 4280 | 1583 | 342 | 340 | 58 | 80 |
Index: +ve = positive cases and −ve = negative/healthy cases
Results: ACC in %, AUC, SEN, SPEC, PREC, and F1 score for each fold of 10 fold cross-validation on the D6 dataset
| Dataset-fold | ACC | AUC | SEN | SPEC | PREC | F1 Score |
|---|---|---|---|---|---|---|
| D6-1 | 100 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
| D6-2 | 99.85 | 0.99 | 0.86 | 1.0 | 1.0 | 0.92 |
| D6-3 | 99.85 | 0.99 | 0.86 | 1.0 | 1.0 | 0.92 |
| D6-4 | 100 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
| D6-5 | 100 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
| D6-6 | 100 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
| D6-7 | 100 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
| D6-8 | 99.85 | 0.99 | 0.86 | 1.0 | 1.0 | 0.92 |
| D6-9 | 99.70 | 0.99 | 0.71 | 1.0 | 1.0 | 0.83 |
| D6-10 | 100 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 |
| 99.92 | 0.99 | 0.93 | 1.0 | 1.0 | 0.96 | |
| ± 0.100 | ± 0.006 | ± 0.096 | ± 0.0 | ± 0.0 | ± 0.055 |
Fig. 6Activation maps generated by the second convolutional layer (Conv2D), the second inception module (Mixed1), and the grid-size reduction module (Mixed3) in our model. The input samples are taken from a COVID-19 positive, b Pneumonia positive, and c Tuberculosis positive CXRs
Results: average ACC in %, AUC, SEN, SPEC, PREC, and F1 score using 10 fold cross-validation with standard deviation
| Dataset | ACC | AUC | SEN | SPEC | PREC | F1 Score |
|---|---|---|---|---|---|---|
| D1 | ||||||
| D2 | ||||||
| D3 | ||||||
| D4 | ||||||
| D5 | ||||||
| D6 | ||||||
| 98.77 | 0.99 | 0.95 | 0.99 | 0.99 | 0.97 | |
| ± 0.702 | ± 0.026 | ± 0.039 | ± 0.016 | ± 0.001 | ± 0.021 |
Fig. 7The ROC curves obtained for the six different datasets D1–D6. The black dotted curve represents the ROC of a random guessing classifier
Comparison: computational time (in ms) between Inception Net V3 (full architecture) and Truncated Inception Net
| 10 Samples (randomly selected) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | CXR1 | CXR2 | CXR3 | CXR4 | CXR5 | CXR6 | CXR7 | CXR8 | CXR9 | CXR10 | Mean ( |
| Inception Net V3 | 22.10 | 28.80 | 21.30 | 20.20 | 20.60 | 19.90 | 22.50 | 20.90 | 21.40 | 21.40 | 21.90±2.40 |
| Truncated Inception Net | 8.63 | 11.00 | 9.53 | 8.02 | 8.93 | 8.63 | 8.70 | 9.64 | 9.30 | 10.30 | 9.27±0.84 |
| Ratio | 2.56 | 2.61 | 2.23 | 2.52 | 2.30 | 2.30 | 2.58 | 2.16 | 2.30 | 2.07 | 2.36±0.18 |
Comparison table
| Model | # of COVID-19 CXRs | # of non COVID-19 CXRs | ACC (in %) | AUC | SEN | SPEC | PREC | F1 score | Remarks | # of parameters (in million) |
|---|---|---|---|---|---|---|---|---|---|---|
| ResNet50 and SVM [ | 25 | 25 | 95.38 | – | 0.97 | 0.93 | – | 0.95 | – | 23.5 |
| COVID-Net [ | 68 | 2794 | 83.50 | – | 1.0 | – | – | – | – | 116.6 |
| ResNet50 [ | 50 | 50 | 98.0 | – | 0.96 | 1.0 | 1.0 | 0.98 | – | 23.5 |
| Inception Net V3 [ | 50 | 50 | 97.0 | – | 0.94 | 1.0 | 1.0 | 0.96 | – | 21.7 |
| Truncated inception net | 162 | 80 (D2) | 94.04 | 1.0 | 0.88 | 1.0 | 1.0 | 0.93 | Poor | 2.1 |
| 162 | 1583 (D3) | 100.0 | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | Best | ||
| 162 | – | 98.77 | 0.99 | 0.95 | 0.99 | 0.99 | 0.97 | Average (D1–D6) |