| Literature DB >> 35095212 |
Abstract
X-ray images are an easily accessible, fast, and inexpensive method of diagnosing COVID-19, widely used in health centers around the world. In places where there is a shortage of specialist doctors and radiologists, there is need for a system that can direct patients to advanced health centers by pre-diagnosing COVID-19 from X-ray images. Also, smart computer-aided systems that automatically detect COVID-19 positive cases will support daily clinical applications. The study aimed to classify COVID-19 via X-ray images in high precision ratios with pre-trained VGG19 deep CNN architecture and the YOLOv3 detection algorithm. For this purpose, VGG19, VGGCOV19-NET models, and the original Cascade models were created by feeding these models with the YOLOv3 algorithm. Cascade models are the original models fed with the lung zone X-ray images detected with the YOLOv3 algorithm. Model performances were evaluated using fivefold cross-validation according to recall, specificity, precision, f1-score, confusion matrix, and ROC analysis performance metrics. While the accuracy of the Cascade VGGCOV19-NET model was 99.84% for the binary class (COVID vs. no-findings) data set, it was 97.16% for the three-class (COVID vs. no-findings vs. pneumonia) data set. The Cascade VGGCOV19-NET model has a higher classification performance than VGG19, Cascade VGG19, VGGCOV19-NET and previous studies. Feeding the CNN models with the YOLOv3 detection algorithm decreases the training test time while increasing the classification performance. The results indicate that the proposed Cascade VGGCOV19-NET architecture was highly successful in detecting COVID-19. Therefore, this study contributes to the literature in terms of both YOLO-aided deep architecture and classification success.Entities:
Keywords: COVID-19; Chest X-ray images; Coronavirus; Deep CNN; Transfer learning; YOLO
Year: 2022 PMID: 35095212 PMCID: PMC8785935 DOI: 10.1007/s00521-022-06918-x
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.102
Fig. 1Sample X-ray images in the data set: a COVID-19, b Pneumonia, c No-Findings
Fig. 2Architectures of VGGCOV19-NET and Cascade VGGCOV19-NET models
Fig. 3Architecture of the FC layer of VGGCOV19-NET
Fig. 4Coordinates of the tagged chest X-ray image
Fig. 5Detection and cropping of the chest region in X-ray images
Fig. 6Training loss and accuracy curves attained at fold-2 for multi-class classification: a training loss and accuracy curves of the VGGCOV19-NET model, b training loss and accuracy curves of the Cascade VGGCOV19-NET model, c training loss and accuracy curves of the VGG19 model, d training loss and accuracy curves of the Cascade VGG19 model
Fig. 7Training loss and accuracy curves attained at fold-2 for binary classification: a training loss and accuracy curves of the VGGCOV19-NET model, b training loss and accuracy curves of the Cascade VGGCOV19-NET model, c training loss and accuracy curves of the VGG19 model, d training loss and accuracy curves of the Cascade VGG19 model
Training and testing times of the models and numbers of trainable parameters
| Classification type | Model | Training and testing time (minutes) | Number of trainable parameters ( |
|---|---|---|---|
| Binary class | VGGCOV19-NET | 14.27 | 3.37 |
| Cascade VGGCOV19-NET | 4.13 | ||
| VGG19 | 18.29 | 119.55 | |
| Cascade VGG19 | 6.60 | ||
| Multi-class | VGGCOV19-NET | 25.36 | 3.37 |
| Cascade VGGCOV19-NET | 10.38 | ||
| VGG19 | 32.57 | 119.55 | |
| Cascade VGG19 | 11.75 |
R, P, F1-score, and acc values for COVID-19, no-findings, and pneumonia classes of the VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19, and Cascade VGG19 models
| Models | Fold | Performance results (%) | |||
|---|---|---|---|---|---|
| Recall | Precision | Accuracy | |||
| VGGCOV19-NET | Fold 1 | 78.20* | 78.80* | 78.30* | 78.22 |
| Fold 2 | 91.10* | 91.10* | 91.10* | 91.11 | |
| Fold 3 | 84.40* | 84.80* | 84.50* | 84.44 | |
| Fold 4 | 95.10* | 95.20* | 95.10* | 95.11 | |
| Fold 5 | 95.60* | 95.70* | 95.60* | 95.56 | |
| Overlapped | |||||
| COVID-19 | 92.80 | 99.15 | 95.87 | ||
| No-findings | 90.20 | 86.40 | 88.26 | ||
| Pneumonia | 86.60 | 89.09 | 87.83 | ||
| Average | 89.87 | 91.55 | 90.65 | 88.89 | |
| Cascade VGGCOV19-NET | Fold 1 | 85.80* | 86.30* | 85.90* | 85.80 |
| Fold 2 | 100* | 100* | 100* | 100 | |
| Fold 3 | 100* | 100* | 100* | 100 | |
| Fold 4 | 100* | 100* | 100* | 100 | |
| Fold 5 | 100* | 100* | 100* | 100 | |
| Overlapped | |||||
| COVID-19 | 98.40 | 96.85 | 97.62 | ||
| No-findings | 96.40 | 98.17 | 97.28 | ||
| Pneumonia | 97.60 | 96.25 | 96.92 | ||
| Average | 97.47 | 97.09 | 97.27 | 97.16 | |
| VGG19 | Fold 1 | 80.40* | 80.40* | 80.20* | 80.44 |
| Fold 2 | 80* | 81.50* | 79.80* | 80 | |
| Fold 3 | 85.30* | 87* | 85.20* | 85.33 | |
| Fold 4 | 88* | 88.70* | 87.90* | 88 | |
| Fold 5 | 87.10 | 87.40* | 87* | 87.11 | |
| Overlapped | |||||
| COVID-19 | 91.20 | 95 | 93.06 | ||
| No-findings | 86.20 | 81.16 | 83.60 | ||
| Pneumonia | 80.40 | 84.81 | 82.54 | ||
| Average | 85.93 | 86.99 | 86.40 | 84.17 | |
| Cascade VGG19 | Fold 1 | 83.10* | 83.60* | 83.10* | 83.11 |
| Fold 2 | 99.10* | 99.10* | 99.10* | 99.11 | |
| Fold 3 | 93.30* | 93.50* | 93.30* | 93.33 | |
| Fold 4 | 97.30* | 97.40* | 97.30* | 97.33 | |
| Fold 5 | 98.70* | 98.70* | 98.70* | 98.67 | |
| Overlapped | |||||
| COVID-19 | 89.60 | 0.9912 | 0.9412 | ||
| No-findings | 95.80 | 0.9374 | 0.9476 | ||
| Pneumonia | 94.00 | 0.9381 | 0.9391 | ||
| Average | 93.13 | 0.9556 | 0.9426 | 94.31 | |
*Weighted average
Fig. 8Overlapped confusion matrices of models for multi-class classification a overlapped confusion matrix of the VGGCOV19-NET model, b overlapped confusion matrix of the Cascade VGGCOV19-NET model, c overlapped confusion matrix of the VGG19 model, d overlapped confusion matrix of the Cascade VGG19 model
Specificity, sensitivity, auc, F1-score, precision, and accuracy values for no-findings and COVID-19 classes of the VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19, and Cascade VGG19 models
| Models | Fold | Confusion matrix and performance results (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| TP | TN | FP | FN | Recall | Sensitivity | Precision | Accuracy | AUC | |||
| VGGCOV19-NET | Fold 1 | 24 | 98 | 0 | 3 | 89 | 100 | 98* | 98* | 97.60 | 99.12 |
| Fold 2 | 21 | 103 | 1 | 0 | 100 | 99 | 99* | 99* | 99.20 | 100 | |
| Fold 3 | 28 | 97 | 0 | 0 | 100 | 100 | 100* | 100* | 100 | 100 | |
| Fold 4 | 25 | 97 | 1 | 2 | 93 | 99 | 98* | 98* | 97.60 | 98.94 | |
| Fold 5 | 21 | 103 | 0 | 1 | 95 | 100 | 99* | 99* | 99.20 | 99.95 | |
| Overlapped | |||||||||||
| COVID-19 | 119 | 498 | 2 | 6 | 95.40 | 99.60 | 98.35 | 96.85 | |||
| No-findings | 498 | 119 | 6 | 2 | 99.60 | 95.40 | 98.81 | 99.20 | |||
| Average | 97.50 | 97.50 | 98.58 | 98.03 | 98.72 | 99.60 | |||||
| Cascade VGGCOV19-NET | Fold 1 | 28 | 96 | 0 | 1 | 0.97 | 1 | 0.99* | 0.99* | 99.20 | 98 |
| Fold 2 | 21 | 104 | 0 | 0 | 100 | 100 | 100* | 100* | 100 | 100 | |
| Fold 3 | 29 | 96 | 0 | 0 | 100 | 100 | 100* | 100* | 100 | 100 | |
| Fold 4 | 26 | 99 | 0 | 0 | 100 | 100 | 100* | 100* | 100 | 100 | |
| Fold 5 | 20 | 105 | 0 | 0 | 100 | 100 | 100* | 100* | 100 | 100 | |
| Overlapped | |||||||||||
| COVID-19 | 124 | 500 | 0 | 1 | 99.20 | 100 | 100 | 99.60 | |||
| No-findings | 500 | 124 | 1 | 0 | 100 | 99.20 | 99.80 | 99.90 | |||
| Average | 99.60 | 99.60 | 99.90 | 99.75 | 99.84 | 99.60 | |||||
| VGG19 | Fold 1 | 23 | 100 | 0 | 2 | 92 | 100 | 98* | 98* | 98.40 | 96 |
| Fold 2 | 24 | 100 | 0 | 1 | 96 | 100 | 99* | 99* | 99.20 | 98 | |
| Fold 3 | 25 | 100 | 0 | 0 | 100 | 100 | 100* | 100* | 100 | 100 | |
| Fold 4 | 21 | 100 | 0 | 4 | 84 | 100 | 97* | 97* | 96.80 | 92 | |
| Fold 5 | 23 | 100 | 0 | 2 | 92 | 100 | 98* | 98* | 98.40 | 96 | |
| Overlapped | |||||||||||
| COVID-19 | 116 | 500 | 0 | 9 | 92.80 | 100 | 100 | 96.26 | |||
| No-findings | 500 | 116 | 9 | 0 | 100 | 92.8 | 98.23 | 99.10 | |||
| Average | 96.40 | 96.40 | 99.11 | 97.68 | 98.56 | 96.40 | |||||
| Cascade VGG19 | Fold 1 | 16 | 106 | 1 | 2 | 94 | 98 | 98* | 98* | 97.60 | 98 |
| Fold 2 | 27 | 96 | 2 | 0 | 93 | 100 | 98* | 98* | 98.40 | 99.78 | |
| Fold 3 | 25 | 99 | 1 | 0 | 96 | 100 | 99* | 99* | 99.20 | 99.80 | |
| Fold 4 | 30 | 95 | 0 | 0 | 100 | 100 | 100* | 100* | 100 | 100 | |
| Fold 5 | 23 | 102 | 0 | 0 | 100 | 100 | 100* | 100* | 100 | 100 | |
| Overlapped | |||||||||||
| COVID-19 | 121 | 498 | 4 | 2 | 96.80 | 99.60 | 98.37 | 97.58 | |||
| No-findings | 498 | 121 | 2 | 4 | 99.60 | 96.80 | 99.20 | 99.40 | |||
| Average | 98.20 | 98.20 | 98.79 | 98.49 | 99.04 | 99.52 | |||||
*Weighted average
Fig. 9ROC curves of the models for binary classification a ROC curve of the VGGCOV19-NET model, b ROC curve of the Cascade VGGCOV19-NET model, c ROC curve of the VGG19 model, d ROC curve of the Cascade VGG19 model
Fig. 10Comparison of the accuracy values of the models
Comparison of the recommended VGGCOV19-NET COVID-19 diagnosis method with other CNN methods developed using radiology images
| Study | Type of images | Number of cases | Method used | Segmentation or location attention | Accuracy (%) | Remarks |
|---|---|---|---|---|---|---|
| Ozturk et al. [ | Chest X-ray | 500 No-findings | DarkCovidNet | 98.08 | Although the same data set as our study was used, the classification performance is significantly lower for three-class classification | |
| 125 COVID-19 (+) | ||||||
| 500 No-findings | 87.02 | |||||
| 500 Pneumonia | ||||||
| 125 COVID-19 (+) | ||||||
| Hemdan et al. [ | Chest X-ray | 25 COVID-19 (+) | VGG19 and DenseNet201 | – | 90 | The number of COVID-19 X-ray images in the data set is very low. Only binary classification was performed and the classification performance is considerably low |
| 25 Normal | ||||||
| Wang et al. [ | Chest X-ray | 358 COVID-19 (+) | COVID-Net | – | 93.3 | They reduced the computational complexity with the CNN architecture where they used 1 × 1 convolution blocks. The accuracy value is lower compared to our study and the sensitivity value needs improving |
| 5538 Pneumonia | ||||||
| 8066 Normal | ||||||
| Khan et al. [ | Chest X-ray | 284 COVID-19 (+) | CoroNet based on pre-trained Xception CNN | – | 95 | The classification performance is relatively lower for three-class classification |
| 657 Pneumonia | ||||||
| 310 Normal | ||||||
| 284 COVID-19 (+) | 99 | |||||
| 310 Normal | ||||||
| Medhi et al. [ | Chest X-ray | More than 150 COVID-19 (+) | CNN | – | 93 | The data set is not clear. They used the CNN model they had developed themselves. They did not apply the pre-trained models. Furthermore, the binary classification performance is low |
| Harit et al. [ | Chest X-ray | 46 COVID-19 (+) | ResNet50 | – | 94 | The data set is very small. Moreover, the classification for binary class is considerably low. Only the ResNet50 pre-trained model was applied |
| 41 normal | ||||||
| Ahammed et al. [ | Chest X-ray | 219 COVID-19 (+) | CNN, ResNet50, VGG16, InceptionV3and traditional machine learning methods | – | 94 | They attained the highest performance with the CNN developed by themselves. Low performance was attained with traditional methods. Classification accuracy is relatively low |
| 1345 Viral pneumonia | ||||||
| 1341 Normal | ||||||
| Apostolopoulos and Mpesiana [ | Chest X-ray | 224 COVID-19 (+) | VGG19 | – | 93.48 | The VGG19 pre-trained CNN architecture was used in raw form without being modified. The classification performance is significantly lower compared to our study especially for three-class classification |
| 700 Pneumonia | ||||||
| 504 Healthy | ||||||
| 224 COVID-19 (+) | 98.75 | |||||
| 504 Healthy | ||||||
| Narin et al. [ | Chest X-ray | 50 COVID-19 (+) | ResNet50 and deep CNN | – | 98 | Only binary classification was performed in the study and the number of X-ray images in the data set is very low |
| 50 COVID-19 (−) | ||||||
| Ouchicha et al. [ | Chest X-ray | 219 COVID-19 | CNN based on ResNet | 96.69 | A new CNN model based on the ResNet model was proposed. Although the classification performance for three-class classification is higher than other studies, it is lower than the performance in our study. Also other SoTA models could have been applied in the study | |
| 1341 normal | ||||||
| 1345 viral pneumonia | ||||||
| Benbrahim et al. [ | Chest X-ray | 160 COVID-19 (+) | InceptionV3 and ResNet50 | 99 | Only binary classification was made. The classification performance is good. However, the data set is considerably small | |
| 160 Normal | ||||||
| Al-antari et al. [ | Chest X-ray | 326 COVID-19 | YOLO | YOLO detection algorithm | 90.67 | Detection was made using the YOLO algorithm. Unlike our study, no cascade structure fed by YOLO was used. The classification performance is low |
| 120 Pneumonia | ||||||
| 866 other seven lung diseases | ||||||
| Nigam et al. [ | Chest X-ray | 5634 COVID-19 | VGG16, DenseNet121, Xception, NASNet and EfficientNet | YOLO detection algorithm | 93.48 | A cascade structure fed by YOLO was used as a method as in our study. However, the classification performance is lower. The impact of YOLO could not be revealed as the performances of models not fed with YOLO were not included in the study |
| 6000 Normal | ||||||
| 5000 Others | ||||||
| Aslan et al. [ | Chest X-ray | 1095 COVID-19 (+) | Hybrid CNN based on AlexNet | ANN-based automatic lung segmentation | 98.70 | ANN-based chest zone segmentation was performed, and the images were given as input to the AlexNet-based hybrid CNN. It is similar to our study as a cascade model was used. The classification performance is slightly higher than our study. However, data augmentation was carried out in the study. Data augmentation is not regarded as a good practice in medical imaging |
| 1345 Pneumonia | ||||||
| 1341 Normal | ||||||
| Hira et al.[ | Chest X-ray | 224 COVID-19 (+) | Se-ResNeXt-50 | – | 97.55 | Classification was carried out with nine pre-trained CNN models (AlexNet, GoogleNet, ResNet-50, Se-ResNet-50, DenseNet121, Inception V4, Inception ResNet V2, ResNeXt-50, and Se-ResNeXt-50). The highest classification performance was attained with Se-ResNeXt-50. The classification performance is slightly higher than our study |
| 700 Pneumonia | ||||||
| 504 Normal | ||||||
| Butt et al. [ | Chest CT | 219 COVID-19 (+) | RESNET | – | 86.7 | The data sets used in these studies consist of CT images, and the classification performances are considerably lower than our study |
| 224 Pneumonia | ||||||
| 175 Healthy people | ||||||
| Ying et al. [ | Chest CT | 777 COVID-19 (+) | DRE-NET | – | 94 | |
| 708 Healthy | ||||||
| Zheng et al. [ | Chest CT | 313 COVID-19 (+) | DECOVNET | - | 90.1 | |
| 229 COVID-19 (−) | ||||||
| Xu et al. [ | Chest CT | 219 COVID-19 (+) | ResNet-18 + location based attention | Image patch vote and Noisy-OR Bayesian function | 86.7 | |
| 224 Pneumonia | ||||||
| 175 Healthy | ||||||
| Proposed study | Chest X-ray | 500 No-Findings | Cascade VGGCOV19-NET | |||
| 125 COVID-19 (+) | ||||||
| 500 No-Findings | ||||||
| 500 Pneumonia | ||||||
| 125 COVID-19 (+) |
Bold values indicate the results obtained in this study