| Literature DB >> 34069841 |
Yash Karbhari1, Arpan Basu2, Zong-Woo Geem3, Gi-Tae Han3, Ram Sarkar2.
Abstract
COVID-19 is a disease caused by the SARS-CoV-2 virus. The COVID-19 virus spreads when a person comes into contact with an affected individual. This is mainly through drops of saliva or nasal discharge. Most of the affected people have mild symptoms while some people develop acute respiratory distress syndrome (ARDS), which damages organs like the lungs and heart. Chest X-rays (CXRs) have been widely used to identify abnormalities that help in detecting the COVID-19 virus. They have also been used as an initial screening procedure for individuals highly suspected of being infected. However, the availability of radiographic CXRs is still scarce. This can limit the performance of deep learning (DL) based approaches for COVID-19 detection. To overcome these limitations, in this work, we developed an Auxiliary Classifier Generative Adversarial Network (ACGAN), to generate CXRs. Each generated X-ray belongs to one of the two classes COVID-19 positive or normal. To ensure the goodness of the synthetic images, we performed some experimentation on the obtained images using the latest Convolutional Neural Networks (CNNs) to detect COVID-19 in the CXRs. We fine-tuned the models and achieved more than 98% accuracy. After that, we also performed feature selection using the Harmony Search (HS) algorithm, which reduces the number of features while retaining classification accuracy. We further release a GAN-generated dataset consisting of 500 COVID-19 radiographic images.Entities:
Keywords: COVID-19 detection; chest X-ray; deep learning; feature selection; generative adversarial network; harmony search; synthetic data generation
Year: 2021 PMID: 34069841 PMCID: PMC8157360 DOI: 10.3390/diagnostics11050895
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Flowchart representing the overall process.
Figure 2Schematic diagram of ACGAN.
Figure 3A schematic diagram of the skip connections in the ResNet architecture. (He et al. [22]).
Figure 4A schematic diagram representing an inception block. (Chollet [23]).
Figure 5A schematic diagram representing a simplified inception block. Compared to Figure 4, it only contains a single size of convolutions () and does not contain pooling layers. (Chollet [23]).
Figure 6A strictly equivalent reformulation of the simplified inception block of Figure 5. Chollet [23].
Figure 7A flowchart representing the HS algorithm.
A summary of the datasets that were used in this work.
| Dataset | COVID-19 | Normal |
|---|---|---|
| covid-chestxray-dataset [ | 1147 | 0 |
| COVID-chestxray-dataset Initiative a | 55 | 0 |
| Actualmed COVID-19 Chest X-ray Dataset Initiative b | 247 | 0 |
| COVID-19 Radiography Database c | 1200 | 1341 |
a https://github.com/ieee8023/covid-chestxray-dataset; b https://github.com/agchung/Actualmed-COVIDchestxray-dataset; c https://www.kaggle.com/tawsifurrahman/covid19-radiography-database; all URLs accessed on 27 March 2021.
A summary of the results obtained by the classifiers considered in this work. The results were reported after performing 20 iterations. Values in the table are in the format of mean ± std. dev.
| Dataset | Model | Accuracy (%) | Precision (%) | Recall (%) | F1-Score (%) | AUC (%) |
|---|---|---|---|---|---|---|
| Original Data + Synthetic Data | VGG16 | 99.54 ± 0.40 | 99.54 ± 0.40 | 99.54 ± 0.40 | 99.54 ± 0.40 | 99.92 ± 0.17 |
| VGG19 | 99.53 ± 0.34 | 99.53 ± 0.34 | 99.53 ± 0.34 | 99.53 ± 0.34 | 99.94 ± 0.08 | |
| ResNet50 | 99.48 ± 0.43 | 99.48 ± 0.43 | 99.48 ± 0.43 | 99.48 ± 0.43 | 99.85 ± 0.19 | |
| Xception | 96.91 ± 1.04 | 96.91 ± 1.04 | 96.91 ± 1.04 | 96.91 ± 1.04 | 99.48 ± 0.26 | |
| InceptionV3 | 99.17 ± 0.39 | 99.17 ± 0.39 | 99.17 ± 0.39 | 99.17 ± 0.39 | 99.80 ± 0.21 | |
| VGG16 + HS | 100.00 ± 0.00 | 100.00 ± 0.00 | 100.00 ± 0.00 | 100.00 ± 0.00 | 100.00 ± 0.00 | |
| Original Data | VGG16 | 99.36 ± 0.29 | 99.36 ± 0.29 | 99.36 ± 0.29 | 99.36 ± 0.29 | 99.65 ± 0.48 |
| VGG19 | 99.38 ± 0.37 | 99.38 ± 0.37 | 99.38 ± 0.37 | 99.38 ± 0.37 | 99.82 ± 0.35 | |
| ResNet50 | 99.29 ± 0.52 | 99.29 ± 0.52 | 99.29 ± 0.52 | 99.29 ± 0.52 | 99.71 ± 0.31 | |
| Xception | 95.41 ± 1.08 | 95.41 ± 1.08 | 95.41 ± 1.08 | 95.41 ± 1.08 | 99.10 ± 0.80 | |
| InceptionV3 | 97.37 ± 1.07 | 97.37 ± 1.07 | 97.37 ± 1.07 | 97.37 ± 1.07 | 99.13 ± 0.46 |
Figure 8The loss curve for a single CNN model (VGG16) for 20 epochs.
A comparison of the performances of various state-of-the-art feature selection algorithms on the mixed dataset (50% original data and 50% synthetic data). The VGG16 features of initial dimension 128 were used.
| Feature Selection Algorithm | Accuracy (%) | No. of Features | Reduction (%) |
|---|---|---|---|
| PSO [ | 97.36 | 43 | 66.4 |
| GA | 97.36 | 46 | 64.1 |
| MA [ | 97.36 | 62 | 51.6 |
| EO [ | 97.36 | 51 | 60.2 |
| GWO [ | 97.36 | 46 | 64.1 |
| GSA [ | 100.00 | 65 | 49.2 |
| SCA [ | 92.10 | 38 | 70.3 |
| HS [ | 100.00 | 48 | 62.5 |
The inception score and the Fretchet inception distance for the images that were generated by the GAN.
| Inception Score | Fretchet Inception Distance |
|---|---|
| 2.508 ± 0.125 | 50.67 ± 8.127 |
Figure 9Some synthetic images that were generated using the present approach. (a) Synthetic images belonging to the COVID-19 infected CXR class generated by the GAN; (b) synthetic images belonging to the normal CXR class generated by the GAN.
A summary of the results obtained by the classifiers on the three-class classification task. The input image was to be classified as normal, pneumonia-affected or COVID-affected.
| Model | Accuracy (%) | F1 Score (%) | AUC (%) |
|---|---|---|---|
| VGG16 | 94.77 | 93.20 | 99.39 |
| VGG19 | 93.91 | 92.70 | 99.29 |
| ResNet50 | 96.97 | 96.88 | 99.68 |
| Xception | 91.08 | 92.41 | 99.27 |
| InceptionV3 | 92.64 | 92.85 | 99.56 |
| VGG16 + HS | 100.00 | 100.00 | 100.00 |