Literature DB >> 35095212

VGGCOV19-NET: automatic detection of COVID-19 cases from X-ray images using modified VGG19 CNN architecture and YOLO algorithm.

Abdulkadir Karacı1.   

Abstract

X-ray images are an easily accessible, fast, and inexpensive method of diagnosing COVID-19, widely used in health centers around the world. In places where there is a shortage of specialist doctors and radiologists, there is need for a system that can direct patients to advanced health centers by pre-diagnosing COVID-19 from X-ray images. Also, smart computer-aided systems that automatically detect COVID-19 positive cases will support daily clinical applications. The study aimed to classify COVID-19 via X-ray images in high precision ratios with pre-trained VGG19 deep CNN architecture and the YOLOv3 detection algorithm. For this purpose, VGG19, VGGCOV19-NET models, and the original Cascade models were created by feeding these models with the YOLOv3 algorithm. Cascade models are the original models fed with the lung zone X-ray images detected with the YOLOv3 algorithm. Model performances were evaluated using fivefold cross-validation according to recall, specificity, precision, f1-score, confusion matrix, and ROC analysis performance metrics. While the accuracy of the Cascade VGGCOV19-NET model was 99.84% for the binary class (COVID vs. no-findings) data set, it was 97.16% for the three-class (COVID vs. no-findings vs. pneumonia) data set. The Cascade VGGCOV19-NET model has a higher classification performance than VGG19, Cascade VGG19, VGGCOV19-NET and previous studies. Feeding the CNN models with the YOLOv3 detection algorithm decreases the training test time while increasing the classification performance. The results indicate that the proposed Cascade VGGCOV19-NET architecture was highly successful in detecting COVID-19. Therefore, this study contributes to the literature in terms of both YOLO-aided deep architecture and classification success.
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022.

Entities:  

Keywords:  COVID-19; Chest X-ray images; Coronavirus; Deep CNN; Transfer learning; YOLO

Year:  2022        PMID: 35095212      PMCID: PMC8785935          DOI: 10.1007/s00521-022-06918-x

Source DB:  PubMed          Journal:  Neural Comput Appl        ISSN: 0941-0643            Impact factor:   5.102


Introduction

A new pneumonia epidemic called acute respiratory syndrome coronavirus 2 (COVID-19) started in Wuhan in the Hubei province of China in December 2019 [1-3]. COVID-19 rapidly expanded from a single city to the whole country within 30 days. This sudden increase in cases caused the collapse of the health system in China [4, 5]. COVID‐19 is transmitted from person to person through direct contact and by small droplets emitted by infected people. Similar to the flu, a COVID-19 patient can develop a variety of symptoms and signs of infection, such as fever, cough, and respiratory illness. In more severe cases, the infection can cause breathing difficulties, multiple organ failure, and a rapidly progressive fatal pneumonia in 2–8% of those infected. Pneumonia can be detected from chest X-ray images [6-8]. Chest radiological imaging such as X-ray and computerized tomography (CT) plays a vital role in the early diagnosis and treatment of this disease [1, 9]. X-rays can detect several characteristic signs associated with COVID-19 in the lung [9]. Early results show that patients suggestive of COVID-19 have abnormalities on chest X-rays [10]. The real-time reverse-transcription polymerase chain reaction (RT-PCR) method is also frequently used to diagnose COVID-19. However, the sensitivity of this method is around 60–70%. Therefore, it is important to diagnose patients by examining X-ray images [1, 9]. In addition, a detection system based on X-ray images has many advantages. It is fast, can analyze multiple cases at the same time, and has more usability. In addition, X-ray equipment is available in every hospital in the modern healthcare system. This makes the radiography-based approach convenient and easily accessible [11]. CT scans and X-rays are widely used for the detection of COVID-19 in countries in which there are few test kits. Also, researchers specify that the combination of clinical image properties with laboratory results may be helpful in the early diagnosis of COVID-19 [9, 12, 13]. There are also studies reporting that some changes have been encountered in the chest X-rays and CT images of asymptomatic children and the elderly in the early periods of the disease [14]. These changes can be detected by deep convolutional neural networks (CNN). Deep CNN outperformed all other known methods in the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) 2012 [15]. Deep CNNs are among the strong deep learning architectures that are widely and instinctively applied in many practical applications such as pattern recognition and image classification [16]. The greatest advantage of CNN for image classification is the end-to-end training of the whole system from raw pixels to the final categories. This advantage decreases the need to design a manually convenient feature extractor [15]. Researchers are focusing on deep learning techniques to determine specific properties of COVID-19 patients from the chest radiography images [17, 18]. Recently, deep learning has been highly successful in various visual tasks including medical image analysis. Deep learning has revolutionized the automatic diagnosis and management of the disease by correctly analyzing, defining and classifying the patterns in medical images [11]. In addition, transfer learning is widely used in CNN architectures. Transfer learning is used to improve the learner by transferring the information in a related area. It aims to apply the data extracted from one or more tasks to the target task. Together with the growth of deep learning, transfer learning has become an inseparable part of many applications especially in medical imaging [19, 20]. There are many pre-trained deep CNN architectures (VGG16, VGG19, MobileNet, Inception, Xception, ResNet, DarkNet, AlexNet, etc.) in which the transfer learning approach can be applied. The Visual Geometry Group Network (VGG) was developed based on the convolutional neural network architecture at the Oxford Robotics Institute [16] and was introduced by Simonyan and Zisserman [21]. VGGNet has shown particularly good performance in the ImageNet data cluster. This network has been trained on more than 1 million images in 1000 classes with more than 370,000 iterations to calibrate 138 million weight parameters. In particular, VGG19 won first place in a classification and localization competition in the Large-Scale Visual Recognition Competition, a global image recognition competition, in 2014 [21, 22]. The VGG is a series of transfer learning networks that includes VGG11, VGG13, VGG16, and VGG19. The common feature of these network structures is that several convolution layer modules are connected to three full connection layers [23]. VGG19 consists of five building blocks. The first and second building blocks consist of two convolutional layers and one pooling layer. The third and fourth blocks have four convolutional layers and one pooling layer. The final block consists of four convolutional layers. 3 × 3 small filters are also used [24]. The detection of the zone that should be the area of focus and supplying it to these CNN architectures as input instead of raw images may increase the classification performance. Detection algorithms are used for this process. One of these algorithms is YOLO (You Only Look Once). The YOLO architecture consists of 27 CNN layers with 24 convolution layers, followed by two fully connected layers. YOLOv3 is a deep single-stage CNN developed from YOLO and YOLOv2 for object detection. The YOLOv3 algorithm is one of the best object detection methods and it provides perfect improvements in terms of speed. The YOLO-V3 has high detection accuracy and speed. It also performs well in detecting small targets. This algorithm reframes object detection as a single regression problem, from image pixels to bounding box coordinates and class probabilities. It works significantly faster than other detection methods. It uses a different architecture called Darknet-53 for feature extraction. DarkNet-53 uses 3 × 3 and 1 × 1 convolutional layers and has 53 layers [25-28]. This study aims to develop a new state-of-the-art VGG19 (Visual Geometry Group Network) architecture-based improved model to diagnose COVID-19 automatically from chest X-rays, to reveal a new cascade model by combining this developed model with the YOLOv3 [25] detection algorithm, and to evaluate the efficiency of these models. For this purpose, the fully connected (FC) layer of the pre-trained VGG19 CNN architecture was re-arranged and attempts were made to diagnose COVID-19 from binary (COVID-19 vs. normal) and three-class (COVID-19 vs. normal vs. pneumonia) data sets. The modified VGG19 model was also combined with the YOLOv3 algorithm to detect the lung zone and the classification process was conducted with this newly formed cascade model. The performances of these models were compared to each other and to models in previous studies. The main contributions of this study can be summarized as follows. We propose the VGG19-based improved VGGCOV19-NET (Visual Geometry Group Network COVID19-NET) and Cascade VGGCOV19-NET models to assist in the automatic diagnosis of COVID-19 patients. Via the VGGCOV19-NET model, we prove that the classification performances of the pre-trained CNN architectures can be enhanced as a result of modification. We propose a new Cascade VGGCOV19-NET model by feeding the VGGCOV19-NET model with the lung zone X-ray images detected with the YOLOv3 algorithm. The new Cascade VGGCOV19-NET model significantly decreases the training and test time while significantly increasing the classification performance. The COVID-19 classification performance of Cascade VGGCOV19-NET is generally higher than that in previous studies. COVID-19 is rapidly diagnosed without manual feature extraction and selection methods. The manuscript is organized as follows: Previous studies are summarized in Part 2. The structures, parameters, and training of Cascade VGGCOV19-NET and VGGCOV19-NET model and the data sets and metrics used in the study are explained in Part 3. Results and classification performances are presented in Part 4. The classification performances of models are compared to that of previous studies and the results are discussed in Part 5. Finally, in Part 6, the conclusion is given.

Literature review

There are some studies using radiology images and machine learning to diagnose COVID-19 in the literature. Hemdan et al. [16] tried different CNN classifiers to diagnose COVID-19 from X-ray images. They attained the highest classification performance from VGG19 and DenseNet201 classifiers with 90% accuracy. Wang et al. [29] classified the normal, pneumonia, and COVID-19 classes with 93.3% accuracy with a CNN model they called COVID-Net. Aslan et al. [30] classified the normal, pneumonia, and COVID-19 classes with 98.70% accuracy using hybrid CNN model. Hira et al. [31] classified the normal, pneumonia and COVID-19 classes with 97.55% accuracy using ResNeXt-50. Khan et al. [11] classified three classes (COVID-19, pneumonia, and normal) with 95% accuracy and binary class (COVID-19 and normal) with 99% accuracy using CNN. Medhi et al. [32] diagnosed COVID-19 with 93% accuracy with their CNN model. Ozturk et al. [9] classified three classes (COVID-19, pneumonia, and normal) with 87.02% accuracy and the binary class with 98.08% accuracy with the CNN called DarkCovidNet. Harit et al. [33] classified COVID-19 with 94% accuracy using ResNet. Ahammed et al. [34] classified three classes (COVID-19, pneumonia, and normal) with 18 different classifiers. They attained the highest accuracy value (94%) in their CNN model. Apostolopoulos and Mpesiana [35] classified three classes and binary classes using the transfer learning approach with five different state-of-the-art pre-trained CNNs. They attained the highest accuracy in the VGG19 model. These accuracy values are, respectively, 98.75% and 93.48% for the binary class and three-class classification. Similarly, Narin et al. [36] classified the binary class using the transfer learning approach with three different state-of-the-art pre-trained CNNs. They attained the highest accuracy value (98%) in the ResNet model. Benbrahim et al. [37] classified COVID-19 with 99% accuracy with the InceptionV3 model and transfer learning approach. Butt et al. [38] classified three classes with 86.7% accuracy on chest CT images with the ResNet model. Ying et al. [39] classified COVID-19 with 94% accuracy with the DRE-Net CNN model from chest CT images. Zheng et al. [40] diagnosed COVID-19 with 90.1% accuracy from chest CT images with the 3D deep CNN model. Harmon et al. [7] detected COVID-19 with 90.8% accuracy using the Densnet-121 CNN architecture on a total of 2724 CT images, 1029 of which were COVID-19 positive. Waheed et al. [10] stated that it is difficult to collect X-ray images because the epidemic is new, and they proposed a method to produce synthetic X-ray images. They suggested that when synthetic X-ray images produced by this method are added to the data set, the model performance increases.

Method and materials

In this part, the structure of the VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19, and Cascade VGG19 models, the data set used in the formation of the models, and the performance metrics used in the evaluation of the models are explained.

X-ray image data set

The data set used in this study was attained from the X-ray images used by Ozturk et al. [9] in their study. They have published this data set as an open resource at https://github.com/muhammedtalo/COVID-19. The data set consists of 125 COVID-19, 500 pneumonia, and 500 no-findings X-ray images. They used two different resources in the formation of the data set: COVID-19 X-ray images from the database attained from various open access resources by Cohen [41] and the ChestX-ray8 database provided by Wang et al. [42] for normal and pneumonia X-ray images. Forty-three of the patients in the COVID-19 data set are women and 82 are men. Metadata information for this data set are not provided for all patients. The average age of the 26 COVID-19 positive patients is approximately 55 years. It is a data set convenient for both binary and multi-class classifications. Figure 1 shows the X-ray images of some cases with COVID-19, pneumonia, and no-findings taken from the data set.
Fig. 1

Sample X-ray images in the data set: a COVID-19, b Pneumonia, c No-Findings

Sample X-ray images in the data set: a COVID-19, b Pneumonia, c No-Findings

The proposed VGGCOV19-NET and cascade VGGCOV19-NET models

Deep learning or deep structured learning represents a class of improved machine learning techniques [37]. The occurrence of deep learning technology has been a revolution in artificial intelligence. The word “deep” expresses the increase in the dimension of this network in addition to the number of layers [9]. CNN is the deep learning approach commonly used for feature extraction and data classification in various areas. CNN has many layers, the input layer, the convolutional layer, the pooling layer, the fully connected layer, and the output layer [43, 44]. The most efficient way to use deep CNNs on small data sets is transfer learning [45]. The data gained by the pre-trained model on a large data set is transferred on the model to be trained with the transfer learning method commonly used in deep learning. The greatest advantage of using the transfer learning method is that it allows the training of fewer data clusters and incurs lower calculation costs [36]. The Visual Geometry Group Network (VGG) was developed based on the convolutional neural network architecture at the Oxford Robotics Institute [16]. VGGNet has shown particularly good performance in the ImageNet data cluster. VGG19 consists of five building blocks. The first and second building blocks consist of two convolutional layers and one pooling layer. The third and fourth blocks have four convolutional layers and one pooling layer. The final block consists of four convolutional layers. 3 × 3 small filters are also used [24, 46]. In this study, the suggested VGGCOV19-NET model is based on the modified VGG19 [21] to diagnose COVID-19. The Cascade VGGCOV19-NET model is a hybrid model in which the VGGCOV19-NET model and the YOLOv3 detection algorithm are used together. The architectures of the VGGCOV19-NET and Cascade VGGCOV19-NET models are shown in Fig. 2. The first five blocks of VGG19 are out of the top layer in VGGCOV19-NET, and they have been transferred to the new model with their weights. The top layer has been made compatible to the data with 224 × 224 × 3 dimension and the FC layer has been reformed. The FC layer has one Flatten and three dense layers in the raw state of the VGG19 model. The first two dense layers consist of 4096 neurons. The final layer is the output layer that consists of 1000 neurons. The modified FC layer of the VGGCOV19-NET model has one flatten and four Dense layers. The neuron numbers in the first three dense layers are, respectively, 128, 256 and 512. The Relu activation function was used in these layers. The final dense layer is the output layer whose activation function is Softmax. The neuron number in this layer is two (COVID-19 and no-findings) for binary classification and three (COVID-19, no-findings, and pneumonia) for multi-class classification. The FC layer is given in Fig. 3 with a more detailed image. As seen in this figure, the flatten layer output at the front of the FC layer consists of 25,088 features. As a result, the input layer of the FC layer has 25,088 neurons. There are 150,528 data for each X-ray image. These data are filtered, and 25,088 features are extracted in convolution and pooling layers that are connected from the front to the flatten layer. The dense layer number, neuron number, and activation function in the FC layer were determined as a result of many trials.
Fig. 2

Architectures of VGGCOV19-NET and Cascade VGGCOV19-NET models

Fig. 3

Architecture of the FC layer of VGGCOV19-NET

Architectures of VGGCOV19-NET and Cascade VGGCOV19-NET models Architecture of the FC layer of VGGCOV19-NET The YOLOv3 detection algorithm is placed at the front of the VGGCOV19-NET and VGG19 models in the Cascade VGGCOV19-NET and Cascade VGG19 models. The YOLOv3 algorithm detects the chest zones in X-ray images and crops them so that a new X-ray data set focusing on the chest zone is attained. This data set is given to the VGGCOV19-NET and VGG19 models as inputs and the training and test processes of the Cascade models are conducted. CNNs are designed using main multiple building blocks such as convolution, pooling, and FC layers [47]. The structure of the convolutional and pooling layers of the VGCOV19-NET model can be expressed as follows: Here, CL refers to the convolutional layer and PL refers to the pooling layer using the maximum pooling method. In the VGGCOV19-NET model, the convolution layer is very important as it performs the feature extraction process. This layer uses the convolution operation (represented by *) instead of the general matrix multiplication and is the core building block of CNN. The parameters of the convolution layer are made up of a set of learnable filters, also known as kernels. The main task of the convolutional layer is to identify the features in the local regions of the input image and to create a feature map [11]. This process is based on the method of wandering a selected filter on the input image. The dimension of the filter can be 3 × 3, 5 × 5 or 7 × 7 pixels. The input of the next layer (m2 × m3) is formed with the filter applied to the image. Activation maps occur as a result of this convolution process. Activation maps have local distinctive features. Each convolution layer has a filter (m1). The output of Layer consists of feature maps in , dimension. The i. feature map shown with is calculated according to Eq. 1. Here, represents the deviation matrix, and represents the filter dimension [48-50]. Pooling layers gradually decrease the image dimension so the parameter number and calculation complexity of the model are decreased [51]. The pooling layer has two parameters, (l) the dimension of filter and step. This layer takes the data in the dimension of as input and again, provides output volume. The operation of the pooling layer is shown in Eqs. 2, 3 and 4 in short [49]. In the pooling process, a v vector is reduced to a single scaler f (v) with pooling process f. There are basically two pooling types: average pooling , and max pooling [52]. Max pooling is used in the VGGCOV19-NET model. FC layers turn the feature maps output of (1) final convolution or pooling layer into a unidimensional vector, (2) bind to one or more dense layers (3) update the weights, (4) and give the estimation of the final classification [47]. FC layer mapping is a multi-layered sensor for . When is considered as an FC layer, the process stages of the FC layer can be shown as in Eq. 5 [49]. In addition, the activation function is used at the output of each layer of the VGGCOV19-NET model. This activation function is Softmax for the output layer of the model, and rectified linear unit (Relu) for other layers. Softmax and Relu activation functions are mathematically defined as in Eqs. 6 and 7, respectively. Softmax is used to normalize the probability vector x for any input [9, 53].

Training of the YOLOv3 detection algorithm and automatic cropping of the chest region

YOLOv3 is a deep CNN developed for object detection. The chest region on the chest X-ray image should be tagged before the training of the YOLOv3 model. The labelImg (https://github.com/tzutalin/labelImg) graphical image annotation tool was used for this. A text file was formed for each image using this tool, and the coordinates of the chest region taking place in the image were saved in this file. There are (X0, Y0, X1, Y1) points giving information about the exact location of height (cH), width (cW), and chest region of the images tagged in the text file. The normalization process is conducted to make the formed data set convenient for YOLOv3 architecture. With this process, the center point coordinates (X, Y), height (H), and width (W) data of the chest region tagged in the image are attained as in Eqs. 8 and 9 [54]. These data are shown in Fig. 4.
Fig. 4

Coordinates of the tagged chest X-ray image

Coordinates of the tagged chest X-ray image Training and test processes were conducted by randomly separating the data into two as 70% training and 30% test data after the completion of the preliminary processes. Model performance was evaluated according to Intersection over Union (IoU) and mean average Precision (mAP) metrics. IoU is an important metric to reveal the model success in object detection. IoU calculates the similarity distance between the bounding box of the target and the estimated output [55]. The division of the region remaining between the estimated value and the necessary reference value to the whole region is attained as in Eq. 10 [54, 55]. mAP ensures the evaluation of the values such as certainty, sensitivity, F1-score, and IoU from a single point. It is calculated as in Eq. 11 [54]. Processes were conducted with bounding boxes above a certain IoU threshold value while calculating the mAP value to determine the model performance. When the threshold value is taken as 0.50, mAP value for the test data was calculated as 99.80%, and the mean IoU value was calculated as 85.37. These values show that the Chest region was detected with high precision. The weights attained for these performance metrics were saved. Then, all the X-ray images in the data set were given to the trained YOLOv3 algorithm as input and the chest region was automatically detected. The detected chest region was cropped with OpenCv and a new data set containing the chest region was formed. The X-ray images whose Chest region were detected and cropped are shown in Fig. 5.
Fig. 5

Detection and cropping of the chest region in X-ray images

Detection and cropping of the chest region in X-ray images

Training of VGGCOV19-NET and cascade VGGCOV19-NET models

VGGCOV19-NET and Cascade VGGCOV19-NET models were formed using Tensorflow and the Keras library in the Phyton programming language. Some preliminary processes were conducted on the data sets before the training. Firstly, the images were adjusted to the dimensions 224 × 224 × 3. Then, each input datum was divided by 255, and the normalization process was completed. One hot encoding process was applied on the output class labels. Different optimizer algorithms (Adam, Adadelta, Sgd, RMSprop, Adamax, Nadam), loss functions (binary cross-entropy, categorical cross-entropy), and learning rates were tried to train the models. The best classification performance was attained for Adam optimizer, categorical cross-entropy loss function and 0.001 learning rate. Pre-trained weights of the VGG19 model were used in convolution layers and the trainable properties of these layers were passivized so decreasing calculation time and cost. The fivefold cross-validation method was used in the training and testing of the models. Models were trained by 100 epochs for each fold, and a high level of classification performance was achieved. The formation and training of CNN models without using GPU is very hard. For this reason, the training process of models was conducted using T4 and P100 GPUs on Google Colab (a product of Google Research). In addition, the original VGG19 model was also trained under similar conditions, and classification performance was attained. Thus, the difference in the classification performance of the VGGCOV19-NET model was better determined. The pseudocode carrying out the processes of forming the model explained in Sect. 3.2 and training the model explained in this section is given below. This code clearly reveals the formation and training of VGGCOV19-NET and Cascade VGGCOV19-NET models. Two different scenarios were tested to diagnose COVID-19 using X-ray images. While the VGGCOV19-NET, VGG19, and Cascade models had initially been trained to classify three categories (COVID-19 vs. no-findings vs. pneumonia), they were trained to classify two categories (COVID-19 vs. no-findings) in the second scenario. In other words, multi-class classification was conducted in scenario-1 and binary class classification in scenario-2. Fivefold cross-validation was used in the training and test processes of the models. Training accuracy and training loss values of multi-class and binary classifier models are shown in Figs. 6 and 7 for the fold-2 step. It was observed that the Cascade VGGCOV19-NET model showed a faster training process than the other models for multi-class. When loss values were examined for the multi-class classification, the Cascade VGGCOV19-NET model decreased the loss value very rapidly and approached zero value. However, the loss value decreasing performances of VGGCOV19-NET and VGG19 models were low, inconsistent and unable to approach zero value. It was also noted that the VGG19 model had many fluctuations in both the loss and accuracy graphs. This is an indication that the VGG19 model is unstable in the learning process and does not perform good learning. In contrast to the VGG19 model, the Cascade VGG19 model conducted stable learning in the multi-class classification. All models conducted stable learning in the binary classification, and the difference between their learning speeds was a little lower and similar to the multi-class classification.
Fig. 6

Training loss and accuracy curves attained at fold-2 for multi-class classification: a training loss and accuracy curves of the VGGCOV19-NET model, b training loss and accuracy curves of the Cascade VGGCOV19-NET model, c training loss and accuracy curves of the VGG19 model, d training loss and accuracy curves of the Cascade VGG19 model

Fig. 7

Training loss and accuracy curves attained at fold-2 for binary classification: a training loss and accuracy curves of the VGGCOV19-NET model, b training loss and accuracy curves of the Cascade VGGCOV19-NET model, c training loss and accuracy curves of the VGG19 model, d training loss and accuracy curves of the Cascade VGG19 model

Training loss and accuracy curves attained at fold-2 for multi-class classification: a training loss and accuracy curves of the VGGCOV19-NET model, b training loss and accuracy curves of the Cascade VGGCOV19-NET model, c training loss and accuracy curves of the VGG19 model, d training loss and accuracy curves of the Cascade VGG19 model Training loss and accuracy curves attained at fold-2 for binary classification: a training loss and accuracy curves of the VGGCOV19-NET model, b training loss and accuracy curves of the Cascade VGGCOV19-NET model, c training loss and accuracy curves of the VGG19 model, d training loss and accuracy curves of the Cascade VGG19 model The training, testing times, and total trainable parameters of the models are shown in Table 1. The lowest time belongs to the binary Cascade VGGCOV19-NET model. The highest time belongs to the multi-class VGG19 model. In addition, the training and testing times of the Cascade models are significantly lower than the other models. While this rate was approximately 70% in the binary classification, it was approximately 60% in the multi-class classification. The reason for this decrease in training and testing time is that Cascade models focus only on the chest region and decrease the numbers of pixels to be processed. Also, the training and testing times of the VGGCOV19-NET models were lower than those of the VGG19 models. Similarly, the trainable parameters of the VGGCOV19-NET model are lower because the neuron number in the FC layer is low.
Table 1

Training and testing times of the models and numbers of trainable parameters

Classification typeModelTraining and testing time (minutes)Number of trainable parameters (M)
Binary classVGGCOV19-NET14.273.37
Cascade VGGCOV19-NET4.13
VGG1918.29119.55
Cascade VGG196.60
Multi-classVGGCOV19-NET25.363.37
Cascade VGGCOV19-NET10.38
VGG1932.57119.55
Cascade VGG1911.75
Training and testing times of the models and numbers of trainable parameters

Performance metrics

Models were evaluated according to various performance metrics as follows: precision (P), recall (R), specificity (S), F1-score (F1), area under the receiver operating characteristics (ROC) curve (AUC), and accuracy (acc). Precision This parameter measures the proportion of anticipated positives that are true positives. Therefore, it is dependent on true positive (TP) and false positive (FP) values [56]. Recall Recall is the ratio of the true positives classified correctly by the model [57]. TP and FN values are used to calculate Recall [56]. Specificity Specificity is the ratio of the true negatives (those not from pathology) classified correctly by the model [57]. TN and FP values are used to calculate specificity [36]. F1-Score F1-score is an overall measure of the model’s accuracy that combines precision and recall. It is the double of the ratio of the multiplication of F1-score precision and recall metrics to their totals [16]. AUC ROC curves are widely used to reveal the results for binary classification problems in machine learning [58]. The ROC curve verifies the classification performance by showing the false positive rate (FPR) against the true positive rate (TPR). AUC is an important parameter for assessing the discriminative ability of prediction models [16, 59].

Results

In this part, the results obtained from VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19 and Cascade VGG19 models are presented in detail. As specified above, the VGGCOV19-NET is a model developed by the reformation of the full connected layer of VGG19 CNN architecture. Cascade models are the ones formed by feeding the input of VGGCOV19-NET and VGG19 models with the output of the YOLOv3 algorithm. R, P, F1, and acc metrics whose mathematical formulations were explained in the methods part were used to evaluate the classification performance of the models. The acc metric determines at what ratio a model anticipates the values. The P metric determines the repeatability of the measurement or how many anticipations are true. R represents the number of correct results found. The F1 score uses precision and recall metrics to calculate an average result [38]. Both multi-class and binary classification performances of the models were evaluated according to these metrics for each fold. Also, an overlapped confusion matrix (CM) was formed for the purpose of conducting a general evaluation of the models and the performance metrics generally representing the model were calculated using this matrix. To create the overlapped CM, the CMs obtained from all the folds are summed [9]. For multi-class classification, the classification performances of VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19 and Cascade VGG19 models are given in Table 2, and overlapped CM is given in Fig. 8. As seen in this table, the weighted average values of R, P, and F1 metrics are given for each fold. However, the metric values presented under the line “overlapped” have been calculated separately for each category from the overlapped CM, and these values are not weighted averages. Consequently, the use of these values will provide more explanatory data in the revelation of the general performances of the models for each category.
Table 2

R, P, F1-score, and acc values for COVID-19, no-findings, and pneumonia classes of the VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19, and Cascade VGG19 models

ModelsFoldPerformance results (%)
RecallPrecisionF1Accuracy
VGGCOV19-NETFold 178.20*78.80*78.30*78.22
Fold 291.10*91.10*91.10*91.11
Fold 384.40*84.80*84.50*84.44
Fold 495.10*95.20*95.10*95.11
Fold 595.60*95.70*95.60*95.56
Overlapped
COVID-1992.8099.1595.87
No-findings90.2086.4088.26
Pneumonia86.6089.0987.83
Average89.8791.5590.6588.89
Cascade VGGCOV19-NETFold 185.80*86.30*85.90*85.80
Fold 2100*100*100*100
Fold 3100*100*100*100
Fold 4100*100*100*100
Fold 5100*100*100*100
Overlapped
COVID-1998.4096.8597.62
No-findings96.4098.1797.28
Pneumonia97.6096.2596.92
Average97.4797.0997.2797.16
VGG19Fold 180.40*80.40*80.20*80.44
Fold 280*81.50*79.80*80
Fold 385.30*87*85.20*85.33
Fold 488*88.70*87.90*88
Fold 587.1087.40*87*87.11
Overlapped
COVID-1991.209593.06
No-findings86.2081.1683.60
Pneumonia80.4084.8182.54
Average85.9386.9986.4084.17
Cascade VGG19Fold 183.10*83.60*83.10*83.11
Fold 299.10*99.10*99.10*99.11
Fold 393.30*93.50*93.30*93.33
Fold 497.30*97.40*97.30*97.33
Fold 598.70*98.70*98.70*98.67
Overlapped
COVID-1989.600.99120.9412
No-findings95.800.93740.9476
Pneumonia94.000.93810.9391
Average93.130.95560.942694.31

*Weighted average

Fig. 8

Overlapped confusion matrices of models for multi-class classification a overlapped confusion matrix of the VGGCOV19-NET model, b overlapped confusion matrix of the Cascade VGGCOV19-NET model, c overlapped confusion matrix of the VGG19 model, d overlapped confusion matrix of the Cascade VGG19 model

R, P, F1-score, and acc values for COVID-19, no-findings, and pneumonia classes of the VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19, and Cascade VGG19 models *Weighted average Overlapped confusion matrices of models for multi-class classification a overlapped confusion matrix of the VGGCOV19-NET model, b overlapped confusion matrix of the Cascade VGGCOV19-NET model, c overlapped confusion matrix of the VGG19 model, d overlapped confusion matrix of the Cascade VGG19 model When an evaluation is conducted according to the acc metric, the classification performance of the Cascade VGGCOV19-NET model is seen to be much higher than that of the VGGCOV19-NET, VGG19 and Cascade VGG19 models. The highest mean accuracy value (acc = 97.16%) belongs to the Cascade VGGCOV19-NET model. The Cascade version of VGGCOV19-NET model increased the classification performance. A similar situation also exists between the VGG19 model (acc = 84.17%) and the Cascade VGG19 model (acc = 94.31%). However, the increase in the classification performance is far higher here. Also, according to the R metric, the Cascade VGGCOV19-NET model correctly classified the COVID-19, pneumonia, and no-findings patients to an extremely high extent. The Cascade VGGCOV19-NET model correctly classified COVID-19 patients by 98.40% (R = 0.9840), pneumonia patients by 97.60% (R = 0.9760), and no-findings patients by 96.40% (R = 0.9640). These figures are eye-catching for model conducting multi-class classifications. VGGCOV19-NET model correctly classified the COVID-19 patients by 92.80% (R = 0.9280), pneumonia patients by 86.60% (R = 0.8660), and no-findings patients by 90.20% (R = 0.9020). In addition, the Cascade VGGCOV19-NET model correctly detected COVID-19 patients at a higher rate than pneumonia and no-findings patients. The VGG19 model correctly classified COVID-19 patients by 91.20% (R = 0.9120), pneumonia patients by 80.40% (R = 0.8040), and no-findings patients by 86.20% (R = 0.8620). These results are, respectively, 89.60%, 94%, and 95.80% in the Cascade VGG19 model. The increase in the correct classification ratio in the Cascade version of VGG19 model is significant for the three categories. The second scenario used to diagnose COVID-19 is the binary classification. Sensitivity, specificity, precision, F1-score, auc, accuracy, and confusion matrix results of the models for binary classification are given in Table 3. The average accuracy values of the Cascade VGGCOV19-NET, VGGCOV19-NET, Cascade VGG19, and VGG19 models are 99.84%, 98.72%, 99.04%, and 98.56%, respectively. The classification precision of the Cascade VGGCOV19-NET model is slightly higher. The Cascade versions of the VGGCOV19-NET and VGG19 models fed with YOLOv3 algorithm also increased the classification performance for the binary classification. Revelation of how correctly the models classify the true positives (TP) and true negatives (TN) according to R and S metrics is important in binary classification. While the Cascade VGGCOV19-NET model correctly classified the COVID-19 patients by 99.20%, it correctly classified no-findings patients by 100%. It is striking in binary classification for the model to correctly classify both true positives and false positives at such a high ratio. In addition, while the VGGCOV19-NET model correctly classified the COVID-19 patients by 95.40%, it correctly classified no-findings patients by 99.60%. The provision of the VGGCOV19-NET model with Cascade structure caused an increase in the correct classification ratio (R = 0.9920) of COVID-19 patients. A similar situation exists in the Cascade VGG19 model. While the Cascade VGG19 model correctly classifies COVID-19 patients by 96.80%, the VGG19 model correctly classifies them by 92.80%. It could be said that feeding the VGG19 model with the YOLOv3 algorithm provided a greater increase in the correct classification ratio of COVID-19 patients.
Table 3

Specificity, sensitivity, auc, F1-score, precision, and accuracy values for no-findings and COVID-19 classes of the VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19, and Cascade VGG19 models

ModelsFoldConfusion matrix and performance results (%)
TPTNFPFNRecallSensitivityPrecisionF1AccuracyAUC
VGGCOV19-NETFold 12498038910098*98*97.6099.12
Fold 221103101009999*99*99.20100
Fold 3289700100100100*100*100100
Fold 4259712939998*98*97.6098.94
Fold 521103019510099*99*99.2099.95
Overlapped
COVID-191194982695.4099.6098.3596.85
No-findings4981196299.6095.4098.8199.20
Average97.5097.5098.5898.0398.7299.60
Cascade VGGCOV19-NETFold 12896010.9710.99*0.99*99.2098
Fold 22110400100100100*100*100100
Fold 3299600100100100*100*100100
Fold 4269900100100100*100*100100
Fold 52010500100100100*100*100100
Overlapped
COVID-191245000199.2010010099.60
No-findings5001241010099.2099.8099.90
Average99.6099.6099.9099.7599.8499.60
VGG19Fold 123100029210098*98*98.4096
Fold 224100019610099*99*99.2098
Fold 32510000100100100*100*100100
Fold 421100048410097*97*96.8092
Fold 523100029210098*98*98.4096
Overlapped
COVID-191165000992.8010010096.26
No-findings5001169010092.898.2399.10
Average96.4096.4099.1197.6898.5696.40
Cascade VGG19Fold 11610612949898*98*97.6098
Fold 22796209310098*98*98.4099.78
Fold 32599109610099*99*99.2099.80
Fold 4309500100100100*100*100100
Fold 52310200100100100*100*100100
Overlapped
COVID-191214984296.8099.6098.3797.58
No-findings4981212499.6096.8099.2099.40
Average98.2098.2098.7998.4999.0499.52

*Weighted average

Specificity, sensitivity, auc, F1-score, precision, and accuracy values for no-findings and COVID-19 classes of the VGGCOV19-NET, Cascade VGGCOV19-NET, VGG19, and Cascade VGG19 models *Weighted average Another metric used in the verification of the classification performance of the models is the ROC curve. ROC curves in each fold step and AUC values are shown in Fig. 9 for binary classification. The closeness of the AUC value to one shows that the classification performance of the model is high. The fact that the classification performances of VGGCOV19-NET, Cascade VGGCOV19-NET, and Cascade VGG19 models are very high was also verified by the average AUC values (99.60%, 99.60%, and 99.52%). The average AUC value (96.40%) of the VGG19 model is lower compared to these models.
Fig. 9

ROC curves of the models for binary classification a ROC curve of the VGGCOV19-NET model, b ROC curve of the Cascade VGGCOV19-NET model, c ROC curve of the VGG19 model, d ROC curve of the Cascade VGG19 model

ROC curves of the models for binary classification a ROC curve of the VGGCOV19-NET model, b ROC curve of the Cascade VGGCOV19-NET model, c ROC curve of the VGG19 model, d ROC curve of the Cascade VGG19 model Two hundred and nineteen different COVID-19 X-ray images were [60] used to verify the performance of the Cascade VGGCOV19-NET model which yielded the best performance in the study. No different data sets were used for no-findings and pneumonia. The Cascade VGGCOV19-NET model displayed better performance for this new dataset in both binary and three-class classification. The accuracy value was 99.18% for three-class classification and 99.58% for binary classification. These values are 96.56% and 98.61%, respectively, for VGGCOV19-NET. These results prove that the YOLO-aided Cascade architecture increases the classification performance.

Discussion

The accuracy values of the models are shown comparatively in Fig. 10 for binary and multi-class classifications. As can be seen in this figure, the classification performance of Cascade VGGCOV19-NET is higher than that of the other models for both multi-class and binary classifications. Moreover, the Cascade VGGCOV19-NET model generally predicts COVID-19 at high ratios.
Fig. 10

Comparison of the accuracy values of the models

Comparison of the accuracy values of the models For the Cascade VGGCOV19-NET model to have high accuracy values in binary and multi-class classification is an indicator that the model has been structured well. Furthermore, the Cascade VGGCOV19-NET model classifies COVID-19 correctly by 98.40% (R = 0.9840) in the multi-class classification in which there are three categories. This ratio is lower in the VGGCOV19-NET model (R = 0.9280) than the Cascade VGGCOV19-NET model. The fact that the recall parameter is high in the Cascade VGGCOV19-NET model proves that they can correctly distinguish COVID-19 patients from pneumonia patients and no-findings patients at a high ratio. In addition, the fact that making some changes to the original VGG CNN architecture increases the model performance is also shown in this study. Similarly, it also revealed that instead of using pre-trained CNN models alone, the formation of a Cascade structure via feeding with algorithms such as the YOLO detection algorithm increases the classification performance. A comparison of the model performance attained in this study to previous studies is also important. These results are given in Table 4.
Table 4

Comparison of the recommended VGGCOV19-NET COVID-19 diagnosis method with other CNN methods developed using radiology images

StudyType of imagesNumber of casesMethod usedSegmentation or location attentionAccuracy (%)Remarks
Ozturk et al. [9]Chest X-ray500 No-findingsDarkCovidNet98.08Although the same data set as our study was used, the classification performance is significantly lower for three-class classification
125 COVID-19 (+)
500 No-findings87.02
500 Pneumonia
125 COVID-19 (+)
Hemdan et al. [16]Chest X-ray25 COVID-19 (+)VGG19 and DenseNet20190The number of COVID-19 X-ray images in the data set is very low. Only binary classification was performed and the classification performance is considerably low
25 Normal
Wang et al. [29]Chest X-ray358 COVID-19 (+)COVID-Net93.3They reduced the computational complexity with the CNN architecture where they used 1 × 1 convolution blocks. The accuracy value is lower compared to our study and the sensitivity value needs improving
5538 Pneumonia
8066 Normal
Khan et al. [11]Chest X-ray284 COVID-19 (+)CoroNet based on pre-trained Xception CNN95The classification performance is relatively lower for three-class classification
657 Pneumonia
310 Normal
284 COVID-19 (+)99
310 Normal
Medhi et al. [32]Chest X-rayMore than 150 COVID-19 (+)CNN93The data set is not clear. They used the CNN model they had developed themselves. They did not apply the pre-trained models. Furthermore, the binary classification performance is low
Harit et al. [33]Chest X-ray46 COVID-19 (+)ResNet5094The data set is very small. Moreover, the classification for binary class is considerably low. Only the ResNet50 pre-trained model was applied
41 normal
Ahammed et al. [34]Chest X-ray219 COVID-19 (+)CNN, ResNet50, VGG16, InceptionV3and traditional machine learning methods94They attained the highest performance with the CNN developed by themselves. Low performance was attained with traditional methods. Classification accuracy is relatively low
1345 Viral pneumonia
1341 Normal
Apostolopoulos and Mpesiana [35]Chest X-ray224 COVID-19 (+)VGG1993.48The VGG19 pre-trained CNN architecture was used in raw form without being modified. The classification performance is significantly lower compared to our study especially for three-class classification
700 Pneumonia
504 Healthy
224 COVID-19 (+)98.75
504 Healthy
Narin et al. [36]Chest X-ray50 COVID-19 (+)ResNet50 and deep CNN98Only binary classification was performed in the study and the number of X-ray images in the data set is very low
50 COVID-19 (−)
Ouchicha et al. [17]Chest X-ray219 COVID-19CNN based on ResNet96.69A new CNN model based on the ResNet model was proposed. Although the classification performance for three-class classification is higher than other studies, it is lower than the performance in our study. Also other SoTA models could have been applied in the study
1341 normal
1345 viral pneumonia
Benbrahim et al. [37]Chest X-ray160 COVID-19 (+)InceptionV3 and ResNet5099Only binary classification was made. The classification performance is good. However, the data set is considerably small
160 Normal
Al-antari et al. [61]Chest X-ray326 COVID-19YOLOYOLO detection algorithm90.67Detection was made using the YOLO algorithm. Unlike our study, no cascade structure fed by YOLO was used. The classification performance is low
120 Pneumonia
866 other seven lung diseases
Nigam et al. [62]Chest X-ray5634 COVID-19VGG16, DenseNet121, Xception, NASNet and EfficientNetYOLO detection algorithm93.48A cascade structure fed by YOLO was used as a method as in our study. However, the classification performance is lower. The impact of YOLO could not be revealed as the performances of models not fed with YOLO were not included in the study
6000 Normal
5000 Others
Aslan et al. [30]Chest X-ray1095 COVID-19 (+)Hybrid CNN based on AlexNetANN-based automatic lung segmentation98.70ANN-based chest zone segmentation was performed, and the images were given as input to the AlexNet-based hybrid CNN. It is similar to our study as a cascade model was used. The classification performance is slightly higher than our study. However, data augmentation was carried out in the study. Data augmentation is not regarded as a good practice in medical imaging
1345 Pneumonia
1341 Normal
Hira et al.[31]Chest X-ray224 COVID-19 (+)Se-ResNeXt-5097.55Classification was carried out with nine pre-trained CNN models (AlexNet, GoogleNet, ResNet-50, Se-ResNet-50, DenseNet121, Inception V4, Inception ResNet V2, ResNeXt-50, and Se-ResNeXt-50). The highest classification performance was attained with Se-ResNeXt-50. The classification performance is slightly higher than our study
700 Pneumonia
504 Normal
Butt et al. [38]Chest CT219 COVID-19 (+)RESNET86.7The data sets used in these studies consist of CT images, and the classification performances are considerably lower than our study
224 Pneumonia
175 Healthy people
Ying et al. [39]Chest CT777 COVID-19 (+)DRE-NET94
708 Healthy
Zheng et al. [40]Chest CT313 COVID-19 (+)DECOVNET-90.1
229 COVID-19 (−)
Xu et al. [63]Chest CT219 COVID-19 (+)ResNet-18 + location based attentionImage patch vote and Noisy-OR Bayesian function86.7
224 Pneumonia
175 Healthy
Proposed studyChest X-ray500 No-FindingsCascade VGGCOV19-NET99.84
125 COVID-19 (+)
500 No-Findings97.16
500 Pneumonia
125 COVID-19 (+)

Bold values indicate the results obtained in this study

Comparison of the recommended VGGCOV19-NET COVID-19 diagnosis method with other CNN methods developed using radiology images Bold values indicate the results obtained in this study According to this table, the classification performance of this study is higher than other studies in the literature both for binary and three-class classifications. DarkCovidNet developed by Ozturk et al. [9] correctly classified the binary class with an accuracy of 98.08% and three-class by 87.02%. DarkCovidNet is based on DarkNet architecture where fewer layers and filters are used. The data set used in that study is the same as the one used in our study. The classification performance of Cascade VGGCOV19-NET and Cascade VGG19 developed in our study is very much higher than that of DarkCovidNet. This performance difference is approximately 9.5% for the three-class classification and 1.28% in the binary classification. There are two studies [11, 37] showing the closest performance to Cascade VGGCOV19-NET and Cascade VGG19 among other studies using different data sets for binary classification. The accuracy values of both studies are 99%. While the first of these studies [11] performed the classification process with a new Xception CNN architecture-based CNN model on 284 COVID-19 positive and 310 normal X-ray images, the second [37] performed the classification process using pre-trained CNN InceptionV3 architecture and the transfer learning approach on 160 COVID-19 (+) and 160 normal X-ray images. There are also studied in the literature classifying three classes. Khan et al. [11] classified three classes with a new Xception CNN architecture-based CNN model with an accuracy ratio of 95%, Ahammed et al. [34] classified three classes with the deep CNN model they created with accuracy of 94%, Apostolopoulos and Mpesiana [35] classified three classes with the pre-trained VGG19 architecture with accuracy of 93.48%, Aslan et al. [30] classified three classes with the hybrid CNN architecture with accuracy of 98.70%, Hira et al. [31] classified three classes with the specially ResNeXt-50 with accuracy of 97.55%, and Wang et al. [29] classified three classes with the deep CNN model they called COVID-Net with an accuracy of 93.3%. The Cascade VGGCOV19-Net model suggested in our study classify three classes with accuracy higher than these studies, except for two studies [30, 31]. Furthermore, there are two studies using the YOLO architecture in the literature. Nigam et al. [62] tried to detect COVID-19 from X-ray images. They obtained the highest classification performance for three-class classification in the EfficientNet model with 93.48%. First, X-ray images are given to the YOLO algorithm in this study to determine the lung zone as in our study. Then, these images are classified with pre-trained CNN models. The data set used in the study is a large data set. However, the classification accuracy is lower than our study. Furthermore, it could not be demonstrated whether the YOLO-based model affected the performance as the study was not conducted on raw images. Al-antari et al. [61] tried to detect nine lung diseases including COVID-19 and pneumonia. Unlike our study, they only used the YOLO detection algorithm through X-ray images whose ground truth was labeled. The average detection accuracy for all classes was 90.67%. There are also studies [38–40, 63, 64] diagnosing COVID-19 using chest CT in the literature. One of these studies [64] states that mobile devices have limited storage and calculating capacity and suggests an architecture including robot, edge, and cloud layers so that deep learning applications can be run on mobile devices. They detected COVID-19 from CT images using DenseNet169 on this architecture. The accuracy values in these studies are relatively lower than those of studies conducted using X-rays. As specified by Ozturk et al. [9], CT is very costly and not easily accessible because scanners are only available in major health centers. When compared to X-rays, CT causes higher radiation amounts in the patients. Consequently, we recommend using a deep learning model with X-ray images because they are more accessible than CT with lower radiation dosages. Because X-ray radiographies are easily accessible, they are commonly used for the diagnosis of COVID-19 in health centers worldwide. The Cascade VGGCOV19-NET model can be used as auxiliary tool in the diagnosis of COVID-19 in a much shorter period than X-ray radiographies. The main advantages of this model are as follows: They classify chest X-ray images that can be very easily, rapidly, and smoothly attained from any hospital without needing extra feature extraction methods. They generally have higher classification performance than the other studies in the literature. As in some studies in the literature, the original state-of-the-art CNN architecture was not used. VGG19 CNN architecture has been modified and its classification performance increased. New Cascade VGGCOV19-NET model based on the YOLOv3 detection algorithm and the VGG19 pre-trained CNN architecture has increased the classification performance and guided future studies. They are able to correctly distinguish COVID-19 at higher ratios than pneumonia and no-findings. Cascade VGGCOV19-NET detects COVID-19 in minutes and at a high accuracy rate. For this reason, it can be used in areas where there is a shortage of specialist doctors and radiologists. In addition, even in small healthcare facilities where only an X-ray device is available, patients can be diagnosed using the model and those who are positive for COVID referred to more advanced health centers for immediate treatment. Another advantage of the model in application is that it prevents patients who are detected negatively from entering PCR tests. Thus, unnecessary clutter in health centers can be avoided. The trained Cascade VGGCOV19-NET model is saved in the computer environment. After a simple interface is designed for the model, the X-ray image can be easily loaded into the system and results obtained very quickly. Also, X-ray images can be rapidly transferred to the physician’s computer without the need for additional procedures. All the physician need do is upload the X-ray image to the system with a few mouse clicks to see the result. From this point of view, Cascade VGGCOV19-NET can easily be used for pre-diagnosis.

Conclusion

In this study, the deep CNN model modified VGG19 architecture-based VGGCOV19-NET was presented to diagnose COVID-19 cases from chest X-ray images. In addition, we created the Cascade VGGCOV19-NET model by feeding this model with the YOLOv3 algorithm. The VGGCOV19-NET model performs the binary class and multi-class classification tasks with an accuracy of 98.72% and 88.89%, respectively. These accuracy values are 99.84% and 97.16%, respectively, in the Cascade VGGCOV19-NET model. Feeding VGGCOV19-NET with the YOLOv3 algorithm increases the classification performance. Moreover, it decreases the training and test time by approximately 70% in the binary classification and by 60% in the multi-class classification. The Cascade VGGCOV19-NET is an important and original model that detects COVID-19 with high accuracy that will lead to future studies. This model automatically detects COVID-19 with high accuracy from X-ray images, which are widely used and easily accessible for the diagnosis of COVID-19 in health centers worldwide. Conventional pneumonia has great similarities with COVID-19, and there is a severe shortage of specialists. For this reason, Cascade VGGCOV19-NET, an artificial intelligence-supported automatic detection system, is especially important in detecting COVID-19. The Cascade VGGCOV19-NET model is a tool that can assist doctors by diagnosing COVID-19 rapidly and easily at low cost. In addition, the YOLO-supported COVID-19 detection method proposed in this study can be applied to various fields, such as tumor detection from brain images, skin cancer detection, and implant-type detection from implant images. One limitation of the study is the use of a limited number of COVID-19 X-ray images. As medical images for COVID are obtained from local hospitals, it is very difficult to obtain various data sets for such studies conducted with CNN. The number of data sets in the literature was not enough when we initiated this study. Furthermore, we could not use data augmentation as it is not regarded as a good practice in medical imaging. Another limitation of the study is that the proposed cascade method was used only on X-ray images. On the other hand, it is thought that the proposed cascade method could successfully classify CT images. In a future study, we will try to detect COVID-19 through different CNN models on a very large mixed data set by adding other open-source X-ray and computed tomography images in the literature to the X-ray images data set used in this study. Another limitation of the study is that the chest zone was determined only as a quadrant with the YOLO algorithm. In particular, the three-class classification performance can increase if the right and left lung zones are segmented using image processing techniques. This segmentation will be carried out as a future study, and the results will be compared with the YOLO-based model. Moreover, another limitation of the study is that only the VGG19 SoTA (state-of-the-art) model was used. Therefore, SoTA models such as EfficientNet, ResNet, MobileNet, Densenet169, and Inceptionv3 fed with YOLO can be used to improve the performance of the COVID diagnosis system.
  31 in total

1.  Coronavirus covid-19 has killed more people than SARS and MERS combined, despite lower case fatality rate.

Authors:  Elisabeth Mahase
Journal:  BMJ       Date:  2020-02-18

2.  Diagnostic tests. 1: Sensitivity and specificity.

Authors:  D G Altman; J M Bland
Journal:  BMJ       Date:  1994-06-11

3.  Automatic detection of coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks.

Authors:  Ali Narin; Ceren Kaya; Ziynet Pamuk
Journal:  Pattern Anal Appl       Date:  2021-05-09       Impact factor: 2.580

4.  Automated detection of COVID-19 cases using deep neural networks with X-ray images.

Authors:  Tulin Ozturk; Muhammed Talo; Eylul Azra Yildirim; Ulas Baran Baloglu; Ozal Yildirim; U Rajendra Acharya
Journal:  Comput Biol Med       Date:  2020-04-28       Impact factor: 4.589

5.  CNN-based transfer learning-BiLSTM network: A novel approach for COVID-19 infection detection.

Authors:  Muhammet Fatih Aslan; Muhammed Fahri Unlersen; Kadir Sabanci; Akif Durdu
Journal:  Appl Soft Comput       Date:  2020-11-18       Impact factor: 6.725

6.  CoroNet: A deep neural network for detection and diagnosis of COVID-19 from chest x-ray images.

Authors:  Asif Iqbal Khan; Junaid Latief Shah; Mohammad Mudasir Bhat
Journal:  Comput Methods Programs Biomed       Date:  2020-06-05       Impact factor: 5.428

Review 7.  Coronavirus Disease 2019 (COVID-19): A Perspective from China.

Authors:  Zi Yue Zu; Meng Di Jiang; Peng Peng Xu; Wen Chen; Qian Qian Ni; Guang Ming Lu; Long Jiang Zhang
Journal:  Radiology       Date:  2020-02-21       Impact factor: 11.105

8.  Covid-19: automatic detection from X-ray images utilizing transfer learning with convolutional neural networks.

Authors:  Ioannis D Apostolopoulos; Tzani A Mpesiana
Journal:  Phys Eng Sci Med       Date:  2020-04-03

Review 9.  Impact of COVID-19 and other viruses on reproductive health.

Authors:  Osamah Batiha; Taghleb Al-Deeb; Esra'a Al-Zoubi; Emad Alsharu
Journal:  Andrologia       Date:  2020-08-13       Impact factor: 2.775

10.  Artificial intelligence for the detection of COVID-19 pneumonia on chest CT using multinational datasets.

Authors:  Stephanie A Harmon; Thomas H Sanford; Sheng Xu; Evrim B Turkbey; Holger Roth; Ziyue Xu; Dong Yang; Andriy Myronenko; Victoria Anderson; Amel Amalou; Maxime Blain; Michael Kassin; Dilara Long; Nicole Varble; Stephanie M Walker; Ulas Bagci; Anna Maria Ierardi; Elvira Stellato; Guido Giovanni Plensich; Giuseppe Franceschelli; Cristiano Girlando; Giovanni Irmici; Dominic Labella; Dima Hammoud; Ashkan Malayeri; Elizabeth Jones; Ronald M Summers; Peter L Choyke; Daguang Xu; Mona Flores; Kaku Tamura; Hirofumi Obinata; Hitoshi Mori; Francesca Patella; Maurizio Cariati; Gianpaolo Carrafiello; Peng An; Bradford J Wood; Baris Turkbey
Journal:  Nat Commun       Date:  2020-08-14       Impact factor: 14.919

View more
  2 in total

Review 1.  Machine learning applications for COVID-19 outbreak management.

Authors:  Arash Heidari; Nima Jafari Navimipour; Mehmet Unal; Shiva Toumaj
Journal:  Neural Comput Appl       Date:  2022-06-10       Impact factor: 5.102

2.  Automatic Detection of Cases of COVID-19 Pneumonia from Chest X-ray Images and Deep Learning Approaches.

Authors:  Fahima Hajjej; Sarra Ayouni; Malek Hasan; Tanvir Abir
Journal:  Comput Intell Neurosci       Date:  2022-09-06
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.