| Literature DB >> 33519327 |
Gonçalo Marques1, Deevyankar Agarwal1, Isabel de la Torre Díez1.
Abstract
COVID-19 infection was reported in December 2019 at Wuhan, China. This virus critically affects several countries such as the USA, Brazil, India and Italy. Numerous research units are working at their higher level of effort to develop novel methods to prevent and control this pandemic scenario. The main objective of this paper is to propose a medical decision support system using the implementation of a convolutional neural network (CNN). This CNN has been developed using EfficientNet architecture. To the best of the authors' knowledge, there is no similar study that proposes an automated method for COVID-19 diagnosis using EfficientNet. Therefore, the main contribution is to present the results of a CNN developed using EfficientNet and 10-fold stratified cross-validation. This paper presents two main experiments. First, the binary classification results using images from COVID-19 patients and normal patients are shown. Second, the multi-class results using images from COVID-19, pneumonia and normal patients are discussed. The results show average accuracy values for binary and multi-class of 99.62% and 96.70%, respectively. On the one hand, the proposed CNN model using EfficientNet presents an average recall value of 99.63% and 96.69% concerning binary and multi-class, respectively. On the other hand, 99.64% is the average precision value reported by binary classification, and 97.54% is presented in multi-class. Finally, the average F1-score for multi-class is 97.11%, and 99.62% is presented for binary classification. In conclusion, the proposed architecture can provide an automated medical diagnostics system to support healthcare specialists for enhanced decision making during this pandemic scenario.Entities:
Keywords: Automated decision support system; COVID-19; Convolutional Neural Network (CNN); Deep learning; Machine learning
Year: 2020 PMID: 33519327 PMCID: PMC7836808 DOI: 10.1016/j.asoc.2020.106691
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 6.725
Related work on COVID-19 detection systems.
| Reference | Model | Data used | Number of images | Classification | Evaluation method |
|---|---|---|---|---|---|
| DarkNet model | Chest X-ray | 1125 — total; 125 — COVID-19; 500 — Pneumonia; 500 — Normal | Binary and Multi-class | 5-fold cross-validation | |
| VGG-19 | Chest X-ray | 545 — total; 181 — COVID-19; 364 - Normal | Binary | Random sampling 80:20:20 for train, validation and testing. | |
| VGG-19 and MobileNet v2 | Chest X-ray | Binary and Multi-class | 10-fold cross-validation | ||
| nCOVnet | Chest X-ray | 337 — total; 192 — COVID-19; 142- Normal | Binary | Random sampling 70% for training and 30% for testing | |
| CapsNet | Chest X-ray | 3150 — total; 1050 — Normal; 1050 — Pneumonia; 1050 — COVID-19 | Binary and Multi-class | 10-fold cross-validation | |
| Proposed CNN | Chest X-ray | 2033 — total; 135 — COVID-19, ; 939 — Normal; 941 — Pneumonia. | Binary and Multi-class | Random sampling 70% for training and 30% for testing | |
| ResNet 18 | Chest X-ray | 746 — total 349 — COVID-19 397 — Normal. | Binary | Random sampling. 70% for training and 30% for testing. | |
| Proposed Semi-supervised model | Chest X-ray | Binary | Random sampling 70% for training and 30% for testing |
Dataset information.
| Class | Reference | Number of images for training/ testing | Number of images for validation |
|---|---|---|---|
| NORMAL | 404 | 96 | |
| PNEUMONIA | 404 | 100 | |
| COVID-19 | 404 | 100 |
Layer types and parameters used in the proposed model.
| Layer (type) | Output shape | Param # |
|---|---|---|
| EfficientNetB4 (Model) | 7 | 17,673,816 |
| global_average_pooling2d | 1792 | 0 |
| dense (Dense) | 128 | 229,504 |
| dropout (Dropout) | 128 | 0 |
| dense_1 (Dense) | 64 | 8256 |
| dropout_1 (Dropout) | 64 | 0 |
| dense_2 (Dense) | 32 | 2080 |
| dropout_2 (Dropout) | 32 | 0 |
| dense_3 (Dense) | 2/3 | 99 |
| Total Parameters: 17,913,755 | ||
| Trainable Parameters: 17,788,555 | ||
| Non-trainable Parameters: 125,200 | ||
Fig. 1Block Diagram of the proposed work.
Results of binary classification for COVID-19 class.
| Fold | Precision | Recall | F1 score |
|---|---|---|---|
| 1 | 100% | 100% | 100% |
| 2 | 97.61% | 100% | 98.79% |
| 3 | 100% | 97.56% | 98.76% |
| 4 | 100% | 100% | 100% |
| 5 | 100% | 100% | 100% |
| 6 | 100% | 100% | 100% |
| 7 | 100% | 100% | 100% |
| 8 | 97.56% | 100% | 98.76% |
| 9 | 100% | 100% | 100% |
| 10 | 100% | 100% | 100% |
| Average | 99.51% | 99.75% | 99.63% |
Results of binary classification for normal class.
| Fold | Precision | Recall | F1 score |
|---|---|---|---|
| 1 | 100% | 100% | 100% |
| 2 | 97.61% | 100% | 98.79% |
| 3 | 97.56% | 100% | 98.76% |
| 4 | 100% | 100% | 100% |
| 5 | 100% | 100% | 100% |
| 6 | 100% | 100% | 100% |
| 7 | 100% | 100% | 100% |
| 8 | 100% | 97.56% | 98.76% |
| 9 | 100% | 100% | 100% |
| 10 | 100% | 100% | 100% |
| Average | 99.51% | 99.75% | 99.63% |
Results of binary classification between classes.
| Fold | Accuracy | Precision | Recall | F-1 Score |
|---|---|---|---|---|
| 1 | 100% | 100% | 100% | 100% |
| 2 | 98.76% | 98.88% | 98.75% | 98.76% |
| 3 | 98.76% | 98.78% | 98.78% | 98.76% |
| 4 | 100% | 100% | 100% | 100% |
| 5 | 100% | 100% | 100% | 100% |
| 6 | 100% | 100% | 100% | 100% |
| 7 | 100% | 100% | 100% | 100% |
| 8 | 98.76% | 98.78% | 98.78% | 98.76% |
| 9 | 100% | 100% | 100% | 100% |
| 10 | 100% | 100% | 100% | 100% |
| Average | 99.62% | 99.64% | 99.63% | 99.62% |
Fig. 2Confusion matrix of the validation testing for the binary classifier.
Fig. 3ROC curve of the validation testing for the binary classifier.
Results of multi-class classification for COVID-19 class.
| Fold | Precision | Recall | F1 Score |
|---|---|---|---|
| 1 | 100% | 100% | 100% |
| 2 | 100% | 100% | 100% |
| 3 | 100% | 100% | 100% |
| 4 | 100% | 100% | 100% |
| 5 | 100% | 100% | 100% |
| 6 | 100% | 100% | 100% |
| 7 | 100% | 100% | 100% |
| 8 | 97.56% | 100% | 98.76% |
| 9 | 97.56% | 100% | 98.76% |
| 10 | 97.56% | 100% | 98.76% |
| Average | 99.26% | 100% | 99.62% |
Results of multi-class classification for normal class.
| Fold | Precision | Recall | F1 Score |
|---|---|---|---|
| 1 | 92.68% | 95.00% | 93.82% |
| 2 | 95.00% | 95.00% | 95.00% |
| 3 | 97.50% | 97.50% | 97.50% |
| 4 | 95.00% | 95.00% | 95.00% |
| 5 | 97.50% | 97.50% | 97.50% |
| 6 | 100% | 97.56% | 98.76% |
| 7 | 95.34% | 100% | 97.61% |
| 8 | 97.61% | 100% | 98.79% |
| 9 | 97.50% | 95.12% | 96.29% |
| 10 | 92.50% | 92.50% | 92.50% |
| Average | 96.06% | 96.51% | 96.27% |
Results of multi-class classification for pneumonia class.
| Fold | Precision | Recall | F1 Score |
|---|---|---|---|
| 1 | 95.00% | 92.68% | 93.82% |
| 2 | 95.12% | 95.12% | 95.12% |
| 3 | 97.5% | 95.12% | 96.29% |
| 4 | 95.12% | 95.12% | 95.12% |
| 5 | 97.43% | 95.00% | 96.20% |
| 6 | 100% | 100% | 100% |
| 7 | 100% | 92.50% | 96.10% |
| 8 | 100% | 92.50% | 96.10% |
| 9 | 97.29% | 90.00% | 93.50% |
| 10 | 97.29% | 87.80% | 92.30% |
| Average | 97.47% | 93.58% | 95.45% |
Results of multi-class classification between all classes.
| Fold | Accuracy | Precision | Recall | F-1 Score |
|---|---|---|---|---|
| 1 | 95.90% | 95.89% | 95.89% | 95.88% |
| 2 | 96.72% | 96.70% | 96.70% | 96.70% |
| 3 | 97.54% | 98.33% | 97.54% | 97.93% |
| 4 | 96.72% | 96.70% | 96.70% | 96.70% |
| 5 | 97.52% | 98.31% | 97.50% | 97.90% |
| 6 | 99.17% | 100% | 99.18% | 99.58% |
| 7 | 97.52% | 98.44% | 97.50% | 97.90% |
| 8 | 97.52% | 98.39% | 97.50% | 97.88% |
| 9 | 95.04% | 97.45% | 95.04% | 96.18% |
| 10 | 93.38% | 95.78% | 93.43% | 94.52% |
| Average | 96.70% | 97.59% | 96.69% | 97.11% |
Fig. 4Confusion matrix of the validation testing for the multi-class classifier.
Fig. 5ROC curve of the validation testing for the multi-class classifier.
Comparison of the state-of-art models for binary classification.
| Ref. | Architecture | Accuracy | Recall | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|---|
| DarkCovidNet | 98.08% | 95.13% | 95.3% | 98.03% | 96.51% | |
| VGG19 | 96.33% | 97.05% | 96.0% | 91.6% | 94.24% | |
| VGG19 | 98.75% | 92.85% | 98.75% | – | – | |
| MobileNet v2 | 97.40% | 99.10% | 97.09% | – | – | |
| nCOVnet | 88.10% | 82.00% | 97.06% | 97.62% | 89.13% | |
| CapsNet | 97.24% | 97.42% | 97.04% | 97.08% | 97.24% | |
| RestNet 18 | 99.4% | 100% | 98.6% | 99.00% | 99.5% | |
| Semi-supervised model | 93.1% | 83.5% | – | 89.0% | 82.6% | |
| Proposed | EfficientNet | 99.62% | 99.63% | - | 99.64% | 99.62% |
Comparison of the state-of-art models for multi-class classification.
| Ref. | Architecture | Accuracy | Recall | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|---|
| DarkCovidNet | 87.02% | 85.35% | 92.18% | 89.96% | 87.37% | |
| VGG19 | 93.48% | 92.85% | 98.75% | – | – | |
| MobileNet v2 | 92.85% | 99.10% | 97.09% | – | – | |
| CapsNet | 84.22% | 84.22% | 91.79% | 84.61% | 84.21% | |
| Proposed CNN | 97.14% | 94.61% | 98.29% | – | 95.75% | |
| Proposed | EfficientNet | 96.70% | 96.69% | - | 97.59% | 97.11% |