| Literature DB >> 33363252 |
Prottoy Saha1, Muhammad Sheikh Sadi1, Md Milon Islam1.
Abstract
Recently, coronavirus disease (COVID-19) has caused a serious effect on the healthcare system and the overall global economy. Doctors, researchers, and experts are focusing on alternative ways for the rapid detection of COVID-19, such as the development of automatic COVID-19 detection systems. In this paper, an automated detection scheme named EMCNet was proposed to identify COVID-19 patients by evaluating chest X-ray images. A convolutional neural network was developed focusing on the simplicity of the model to extract deep and high-level features from X-ray images of patients infected with COVID-19. With the extracted features, binary machine learning classifiers (random forest, support vector machine, decision tree, and AdaBoost) were developed for the detection of COVID-19. Finally, these classifiers' outputs were combined to develop an ensemble of classifiers, which ensures better results for the dataset of various sizes and resolutions. In comparison with other recent deep learning-based systems, EMCNet showed better performance with 98.91% accuracy, 100% precision, 97.82% recall, and 98.89% F1-score. The system could maintain its great importance on the automatic detection of COVID-19 through instant detection and low false negative rate.Entities:
Keywords: Automatic diagnosis; COVID-19; Convolutional neural network; Ensemble of classifiers; X-ray images
Year: 2020 PMID: 33363252 PMCID: PMC7752710 DOI: 10.1016/j.imu.2020.100505
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Comparative study of recent researches related to COVID-19 detection and ensemble of ML classifiers.
| Author | Sources of Dataset | Dataset Details | Model | Ensemble | Accuracy (%) |
|---|---|---|---|---|---|
| Abbas et al. [ | COVID-19 and SARS images: [ | 196 (COVID-19 = 105, Normal = 80, SARS = 11) | ResNet | No | 95.12% |
| Loey et al. [ | COVID-19 images: [ | 306 (COVID-19 = 69, Normal = 79, Bacteria pneumonia = 79, Virus pneumonia = 79) | Googlenet | No | 80.56 |
| Rahimzadeh et al. [ | COVID-19 images: [ | 15,085 (COVID-19 = 180, Normal = 8851, Pneumonia = 6054) | Xception + ResNet50V2 | No | 91.4 |
| Oh et al. [ | COVID-19 images: [ | 15,043 (COVID-19 = 180, Normal = 8851, pneumonia = 6012) | Patch based CNN | No | 88.9 |
| Zhang et al. [ | COVID-19 and images: [ | 1531 (COVID-19 = 100, Normal = 1431) | 18 layer residual CNN | No | 72.31% |
| Apostolopoulos et al. [ | COVID-19 images: [ | 1442 (COVID-19 = 224, Normal = 504, Pneumonia = 714) | MobileNet V2 | No | 96.78 |
| Mahmud et al. [ | COVID-19 images: Sylhet Medical College, BD; Normal and Pneumonia images [ | 6161 (COVID-19 = 305, Normal = 1583, Bacteria pneumonia = 2780, Virus pneumonia = 1493) | CNN with Depthwise dilated convolution | No | 90.2 |
| Tsiknakis et al. [ | COVID-19 images: [ | 572 (COVID-19 = 122, Normal = 150, Bacteria pneumonia = 150, Virus pneumonia = 150) | Inception V3 | No | 76 |
| Sethy et al. [ | COVID-19 images: [ | 381 (COVID-19 = 127, Normal = 127, Bacteria pneumonia = 63, Virus pneumonia = 64) | Resnet50 + SVM | No | 95.33 |
| Horry et al. [ | COVID-19 images: [ | 60,838 (COVID-19 = 115, Normal = 60,361, pneumonia = 322) | VGG 19 | No | 81 |
| Hasan et al. [ | [ | 768 (Diabetic = 268, non-Diabetic = 500) | KNN, DT, RF, Naïve Bayes, AB, XB | Yes | 72.26 |
Fig. 1Architecture of EMCNet.
Fig. 2First three images of the first row are samples of COVID-19 X-ray images. The rest of the images are of normal chest X-ray images.
Partition of the dataset into training, validation and testing set.
| Dataset | COVID-19 | Normal | Total |
|---|---|---|---|
| Training | 1610 | 1610 | 3220 |
| Validation | 460 | 460 | 920 |
| Testing | 230 | 230 | 460 |
| Total | 2300 | 2300 | 4600 |
CNN layers and their detail explanation.
| Layer | Filter Size | Pool Size | Stride | Padding | Number of Filters | Dropout Threshold | Activation |
|---|---|---|---|---|---|---|---|
| Conv2D | 3 × 3 | – | 1 | Valid | 32 | – | Relu |
| Conv2D | 3 × 3 | – | 1 | Valid | 128 | – | Relu |
| MaxPooling2D | – | 2 × 2 | 2 | – | – | – | – |
| Dropout | – | – | – | – | – | .25 | – |
| Conv2D | 3 × 3 | – | 1 | Valid | 64 | – | Relu |
| MaxPooling2D | – | 2 × 2 | 2 | – | – | – | – |
| Dropout | – | – | – | – | – | .25 | – |
| Conv2D | 3 × 3 | – | 1 | Valid | 128 | – | Relu |
| MaxPooling2D | – | 2 × 2 | 2 | – | – | – | – |
| Dropout | – | – | – | – | – | .25 | – |
| Conv2D | 3 × 3 | – | 1 | Valid | 512 | – | Relu |
| MaxPooling2D | – | 2 × 2 | 2 | – | – | – | – |
| Dropout | – | – | – | – | – | .25 | – |
| Conv2D | 3 × 3 | – | 1 | Valid | 512 | – | Relu |
| MaxPooling2D | – | 2 × 2 | 2 | – | – | – | – |
| Dropout | – | – | – | – | – | .25 | – |
| Flatten | – | – | – | – | – | – | – |
| FCL | – | – | – | – | 64 | – | Relu |
| Dropout | – | – | – | – | – | .25 | – |
| FCL | – | – | – | – | 2 | – | Sigmoid |
Model summary of the proposed CNN model.
| Number of Layers | Layer (Type) | Output Shape | Parameter |
|---|---|---|---|
| 1 | conv2d_6 (Conv2D) | [222, 222, 32] | 896 |
| 2 | conv2d_7 Conv2D) | [220, 220, 128] | 36,992 |
| 3 | max_pooling2d_5 (MaxPooling2D) | [110, 110, 128] | 0 |
| 4 | dropout_6 (Dropout) | [110, 110, 128] | 0 |
| 5 | conv2d_8 (Conv2D) | [108, 108, 64] | 73,792 |
| 6 | max_pooling2d_6 (MaxPooling2D) | [54, 54, 64] | 0 |
| 7 | dropout_7 (Dropout) | [54, 54, 64] | 0 |
| 8 | conv2d_9 (Conv2D) | [52, 52, 128] | 73,856 |
| 9 | max_pooling2d_7 (MaxPooling2D) | [26, 26, 128] | 0 |
| 10 | dropout_8 (Dropout) | [26, 26, 128] | 0 |
| 11 | conv2d_10 (Conv2D) | [24, 24, 512] | 590,336 |
| 12 | max_pooling2d_8 (MaxPooling2D) | [12, 12, 512] | 0 |
| 13 | dropout_9 (Dropout) | [12, 12, 512] | 0 |
| 14 | conv2d_11 (Conv2D) | [10, 10, 512] | 2,359,808 |
| 15 | max_pooling2d_9 (MaxPooling2D) | [5, 5, 512] | 0 |
| 16 | dropout_10 (Dropout) | [5, 5, 512] | 0 |
| 17 | flatten_1 (Flatten) | [12,800] | 0 |
| 18 | dense_2 (Dense) | [64] | 819,264 |
| 19 | dropout_11 (Dropout) | [64] | 0 |
| 20 | dense_3 (Dense) | [2] | 65 |
Fig. 3Proposed CNN architecture.
Fig. 4Training process of ML classifiers and ensemble of classifiers.
Fig. 5Performance analysis of the CNN used in EMCNet. (a) Training and Validation Accuracy (b) Training and Validation Loss.
Tuned hyperparameters of ML classifiers.
| ML classifiers | Hyperparameters |
|---|---|
| Random Forest | |
| Support Vector Machine | |
| Decision Tree | |
| AdaBoost |
Fig. 6Confusion matrix representation for the classifiers used in EMCNet (a) CNN (b) RF (c) DT (d) AB (e) SVM (f) Ensemble of Classifiers.
Performance evaluation of the classifiers of EMCNet based on each class label.
| Models | Class | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|---|
| CNN | COVID | 96.52 | 100 | 96.52 | 98.23 |
| Normal | 100 | 96.64 | 100 | 98.29 | |
| DT | COVID | 95.65 | 100 | 95.65 | 97.78 |
| Normal | 100 | 95.83 | 100 | 97.87 | |
| RF | COVID | 96.09 | 100 | 96.09 | 98.00 |
| Normal | 100 | 96.23 | 100 | 98.08 | |
| SVM | COVID | 96.96 | 96.12 | 96.96 | 96.54 |
| Normal | 96.09 | 96.93 | 96.09 | 96.51 | |
| AB | COVID | 96.52 | 100 | 96.52 | 98.23 |
| Normal | 100 | 96.64 | 100 | 98.29 | |
| Ensembling | COVID | 97.83 | 100 | 97.83 | 98.90 |
| Normal | 100 | 97.87 | 100 | 98.92 |
Fig. 7Graphical representation of the performance of the models used in EMCNet for each class label.
Fig. 8ROC curve for the classifiers of EMCNet.
Performance evaluation of the classifiers used in EMCNet.
| Classifiers | Accuracy (%) | Precision (%) | Recall (%) | F1-score (%) |
|---|---|---|---|---|
| CNN | 98.26 | 100 | 96.52 | 98.22 |
| RF | 98.04 | 100 | 96.09 | 98.00 |
| DT | 97.82 | 100 | 95.65 | 97.77 |
| SVM | 96.52 | 96.12 | 96.96 | 96.54 |
| AB | 98.26 | 100 | 96.52 | 98.22 |
| Ensemble | 98.91 | 100 | 97.82 | 98.89 |
Performance analysis of EMCNet in comparison with recent works.
| Author | Sources of Dataset | Dataset Details | Model | Accuracy (%) |
|---|---|---|---|---|
| Zhang et al. [ | COVID-19 and images: [ | 1531 (COVID-19 = 100, Normal = 1431) | 18 layer residual CNN | 72.31 |
| Tsiknakis et al. [ | COVID-19 images: [ | 572 (COVID-19 = 122, Normal = 150, Bacteria pneumonia = 150, Virus pneumonia = 150) | Inception V3 | 76 |
| Loey et al. [ | COVID-19 images: [ | 306 (COVID-19 = 69, Normal = 79, Bacteria pneumonia = 79, Virus pneumonia = 79) | Googlenet | 80.56 |
| Oh et al. [ | COVID-19 images: [ | 15,043 (COVID-19 = 180, Normal = 8851, pneumonia = 6012) | Patch based CNN | 88.9 |
| Rahimzadeh et al. [ | COVID-19 images: [ | 15,085 (COVID-19 = 180, Normal = 8851, Pneumonia = 6054) | Xception + ResNet50V2 | 91.4 |
| Abbas et al. [ | COVID-19 and SARS images: [ | 196 (COVID-19 = 105, Normal = 80, SARS = 11) | ResNet | 95.12 |
| Sethy et al. [ | COVID-19 images: [ | 381 (COVID-19 = 127, Normal = 127, Bacteria pneumonia = 63, Virus pneumonia = 64) | Resnet50 + SVM | 95.33 |
| Apostolopoulos et al. [ | COVID-19 images: [ | 1442 (COVID-19 = 224, Normal = 504, Pneumonia = 714) | MobileNet V2 | 96.78 |
| EMCNet | COVID-19 images: [ | 4600 (COVID-19 = 2300, Normal = 2300) | CNN + Ensemble of ML Classifiers | 98.91 |