Literature DB >> 33644298

COVIDC: An Expert System to Diagnose COVID-19 and Predict its Severity using Chest CT Scans: Application in Radiology.

Wajid Arshad Abbasi¹, Syed Ali Abbas¹, Saiqa Andleeb², Ghafoor Ul Islam², Syeda Adin Ajaz¹, Kinza Arshad¹, Sadia Khalil¹, Asma Anjam¹, Kashif Ilyas¹, Mohsib Saleem¹, Jawad Chughtai¹, Ayesha Abbas¹.

Abstract

Early diagnosis of Coronavirus disease 2019 (COVID-19) is significantly important, especially in the absence or inadequate provision of a specific vaccine, to stop the surge of this lethal infection by advising quarantine. This diagnosis is challenging as most of the patients having COVID-19 infection stay asymptomatic while others showing symptoms are hard to distinguish from patients having different respiratory infections such as severe flu and Pneumonia. Due to cost and time-consuming wet-lab diagnostic tests for COVID-19, there is an utmost requirement for some alternate, non-invasive, rapid, and discounted automatic screening system. A chest CT scan can effectively be used as an alternative modality to detect and diagnose the COVID-19 infection. In this study, we present an automatic COVID-19 diagnostic and severity prediction system called COVIDC (COVID-19 detection using CT scans) that uses deep feature maps from the chest CT scans for this purpose. Our newly proposed system not only detects COVID-19 but also predicts its severity by using a two-phase classification approach (COVID vs non-COVID, and COVID-19 severity) with deep feature maps and different shallow supervised classification algorithms such as SVMs and random forest to handle data scarcity. We performed a stringent COVIDC performance evaluation not only through 10-fold cross-validation and an external validation dataset but also in a real setting under the supervision of an experienced radiologist. In all the evaluation settings, COVIDC outperformed all the existing state-of-the-art methods designed to detect COVID-19 with an F1 score of 0.94 on the validation dataset and justified its use to diagnose COVID-19 effectively in the real setting by classifying correctly 9 out of 10 COVID-19 CT scans. We made COVIDC openly accessible through a cloud-based webserver and python code available at https://sites.google.com/view/wajidarshad/software and https://github.com/wajidarshad/covidc.

Entities: Chemical Disease Gene Species

Keywords: COVID-19; CT imaging; Early diagnosis; Expert system; Pandemic; Radiology; SARS-COV-2

Year: 2021 PMID： 33644298 PMCID： PMC7901302 DOI： 10.1016/j.imu.2021.100540

Source DB: PubMed Journal: Inform Med Unlocked ISSN： 2352-9148

Introduction

Coronavirus disease 2019 (COVID-19) is a contagious infection caused by a family of viruses called coronaviridea. Since its identification at the end of 2019 in Wuhan, a city in the Hubei Province of China, this viral disease has spread rapidly around the globe [1]. The World Health Organization (WHO) has already declared this pandemic as a global health calamity and it has infected and killed millions of people worldwide [2]. Physicians say this is a viral infection that mainly infects the human respiratory system and antibiotics do not cure this disease [3,4]. Meanwhile, currently, there is an inadequate provision of an effective vaccine or medication to prevent or cure this lethal infection. In the absence or insufficient supply of effective medication or vaccines to cure or prevent COVID-19 infection, early diagnosis of the disease is fundamental to avoid its further surge by advising or adopting quarantine or isolation. Most of the patients having COVID-19 infection stay asymptomatic while others show mild to moderate symptoms such as fever or cough, shortness of breath, and signs such as oxygen saturation or lung auscultation [5,6]. Signs and symptoms have the utmost importance to identify any infection and to perform further diagnostic tests. However, even symptomatic COVID-19 infection does not show any particular symptoms which distinguish this from other respiratory infections such as severe flu and Pneumonia [6]. In this situation, all the suspected patients require screening and have to pass through the adopted diagnostic tests. Currently, there are two types of tests to detect COVID-19: diagnostic tests and antibody tests [7]. A diagnostic test such as Reverse transcription-polymerase chain reaction (RT-PCR) can be used to detect an active coronavirus infection whereas, the antibody test looks for antibodies created in your body in the response to the disease. RT-PCR is currently the gold standard diagnostic practice to detect viral infection [8]. However, the standard confirmatory clinical RT-PCR test to detect COVID-19 is manual, complex, laborious, costly, and ineffective [9,10]. Moreover, the limited availability of test kits and domain experts further hamper the situation. Meanwhile, antibody tests cannot be used to diagnose COVID-19 as after infection, antibodies can take numerous days or weeks to develop and may stay in your blood a bit longer after recovery [7]. Keeping in view the limitation of prevailing diagnostic techniques and a rapid surge of infected patients, there is an utmost requirement for some alternate automatic screening systems that can be used by the physicians to swiftly identify and isolate the COVID-19 infected patients. X-ray and Computed Tomography (CT) imaging modalities are the most common noninvasive diagnostic techniques that help medical practitioners to diagnose and treat many diseases. In the current framework, chest X-ray (CXR) and chest Computed Tomography (CCT) can also effectively be used to diagnose COVID-19 [11,12]. However, CCT due to 3D image and contrast dyes shows a significantly improved performance in COVID-19 diagnosis in comparison to simple 2D CXR [9]. In the meanwhile, CT scans of COVID-19 infected patients show diverse features and manual interpretation of these scans with subtle variations is quite challenging [13]. Moreover, the current enormous upsurge of infected patients makes it a challenging task for the domain experts to complete a timely diagnosis [14,15]. Therefore, some Computer-Aided Diagnostic (CAD) systems are required to better manipulate and understand the CCT images. In Computer-Aided Diagnostic (CAD) system design using CT scans, machine learning in the form of deep learning has successfully been applied in diagnosing lung diseases [[16], [17], [18], [19]]. Also to diagnose COVID-19 using CCT images, several machine learning-based methods with varying sources and amount of training data have been proposed in the literature (for details please see “Related Works” in the next section). Most of these previously proposed methods in the literature are based on Convolution Neural Network (CNN) deep learning approach [20]. However, for the deep learning approaches to perform effectively, a healthy amount of data is usually required for parameter tuning which is not readily available so far. Moreover, all the published studies focused only to diagnose COVID-19 whereas, its severity prediction still requires an efficient method. Therefore, to address the issues of data scarcity in deep learning and the absence of an efficient method for COVID-19 severity prediction, in this article we have proposed a method called COVIDC (COVID-19 detection using CT scans) to diagnose and predict the severity of the COVID-19 infection with CCT images. Our proposed method is different from previously proposed methods for the diagnosis of the COVID-19 and its severity prediction in that it uses a combination of transfer and shallow learning. Using this method, we can extract efficient feature representation of CT scans by using transfer learning with off-the-shelf pre-trained models on ImageNET and still be able to avoid the curse of dimensionality by engaging shallow learning algorithms such as Support Vector Machines (SVMs) [21]. This has led to improved accuracy in comparison to methods that rely only on the deep learning paradigm.

Related Works

Since COVID-19 identification at the end of 2019, several studies have been proposed in the literature to diagnose COVID-19 with chest computed tomography (CT) by using artificial intelligence (AI) techniques. We have searched existing techniques in the online literature using different keywords such as “Machine learning and COVID-19”, “COVID-19 diagnosis with CT scans”, and “COVID-19 diagnosis with CT scans and machine learning”. While going through the online existing literature (peer-reviewed) published in reputed journals, we found a plethora of machine learning techniques to diagnose COVID-19 using chest CT scans with varying sources and amount of training data [[22], [23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33], [34], [35], [36], [37]]. All these previously published techniques can be categorized into three main classes as follows: Deep learning-based, transfer learning with fine-tuning a customized fully connected layer, shallow learning with handcrafted textured features. Previously published studies employing deep learning-based machine learning techniques to detect COVID-19 mostly used Convolution Neural Network (CNN) [20] based architectures in their proposed design [[23], [24], [25],29,[31], [32], [33], [34], [35], [36], [37]]. However, deep learning approaches to generalize well normally require an enormous amount of data which is not readily available right now [22]. Due to data scarcity issues in CNN-based deep learning, some studies have also been proposed in the literature using transfer learning [22,28,30]. Ahuja et al. used data augmentation and transfer learning with ResNet18, ResNet50, ResNet101, and SqueezeNet pretrained networks to classify COVID-19 images. Ahuja et al. concluded that ResNet18 outperformed other models by obtaining a 0.97 AUC score [22]. Ko et al. proposed a fast-track COVID-19 classification model network based on transfer learning with VGG16, ResNet-50, InceptionV3, and Xception as backbone pre-trained models to diagnose COVID-19. They considered 1194 COVID-19, 264 low-quality COVID-19 (only for testing), and 2239 pneumonia, normal, and other disease CT scans in their study. They also used rotation and zoom data augmentation procedures to maximize the number of training samples. It was concluded that FCONet based on ResNet-50 outperformed other pre-trained models and achieved 96.97% of accuracy in the external validation data set of COVID-19 pneumonia images [28,38]. Pathak et al. proposed a transfer learning-based system to detect COVID-19 in CT scans with the ResNet50 to extract the features and a 2D convolutional neural network for the classification [30]. The proposed system achieved a 93.01% of accuracy with 10-fold cross-validation on 413 COVID and 439 non-COVID scans. These previously published methods to detect COVID-19 using transfer learning still have room for improvement because in transfer learning, fine-tuning the customized fully connected layer requires a healthy amount of data due to high dimensional features space generated by the off-the-shelf pre-trained models. To solve data scarcity issues in deep learning, Kang et al. proposed a shallow learning technique for COVID-19 classification using different types of handcrafted features extracted from CT images [27]. They used 2522 CT images (1495 are from COVID-19 patients, and 1027 are from community-acquired pneumonia) for the classification purpose. The proposed method achieved an overall accuracy of 95.5%. However, even after learning multiple representations of CT scans, the method proposed by Kang et al. still has room for performance improvement because it is often hard to represent subtle variations in CT scans using handcrafted texture descriptors [39]. Moreover, almost all the above studies have been proposed only to diagnose COVID-19 but its efficient severity detection is still an open research question. Furthermore, to the best of our knowledge, all of the above-discussed studies only discussed the technical details of their proposed methodology without providing any easily accessible interface to end-users. To overcome these shortcomings and to deal with the issues of data scarcity in deep/transfer learning and handcrafted textured features, in this study we have proposed a novel COVID-19 diagnosis and its severity predictor called COVIDC (COVID-19 detection using CT scans) by employing a combination of transfer and shallow learning.

Methods

In the following sections, we give the detail of our experimental procedure to design and test the generalization performance of COVIDC (COVID-19 diagnosis using CT images).

Dataset and preprocessing

In this study, we have collected three different datasets of Chest Computed Tomography (CCT) images from publicly open online repositories [[40], [41], [42]]. For COVID-19 diagnosis, we have used two datasets: i) training set (1433 COVID and 1229 non-COVID CT images) [41], ii) external validation set (349 COVID and 397 non-COVID) [42]. For COVID-19 severity prediction, we have used a dataset of 141 CT images of severe COVID-19 patients and 135 CT images of less-severe [40]. All the images in these datasets have gone through the essential preprocessing required for feature extraction including image resizing (313 × 313 pixels), de-noising, and contrast stretching [43].

Proposed methodology

The methodology we have adopted for this study has been shown in Fig. 1 . We give the detail of our proposed methodology to design and develop CT image-based COVID-19 diagnosis and its severity predictor using machine learning as follows.

Fig. 1

A proposed methodology for the development of computer-aided COVID-19 diagnosis and its severity prediction system (COVIDC) using Machine learning and chest CT images. This system has been trained using shallow learning algorithms such as SVMs with chest CT scans by extracting feature maps involving pre-trained off-the-shelf models. COVIDC can be used to predict whether a novel test CT image has COVID-19 or not.

Features extraction and preprocessing

Deep learning, where we learn automatically an efficient feature representation of an image, is very famous and has successfully been used in different image classification and analysis tasks. However, to apply deep learning efficiently, we normally require an extensive amount of data to better learn the actual distribution of the image samples. This data hunger is a challenge so far to apply the deep learning approach efficiently in our case to diagnose and predict the severity of COVID-19 where we face data scarcity. Therefore, to overcome this challenge, we have used the transfer learning strategy for feature extraction. Using transfer learning, feature extraction from the available CT images in our datasets has been performed using different off-the-shelf CNN based pre-trained models on ImageNet. For this purpose, we have used different off-the-shelf CNN based pre-trained models such as Resnet50 [44], InceptionV3 [45], Xception [46], VGG16 [47], NASNetLarge [48], DenseNet121 [49]. We have used these models for feature extraction by simply loading a pre-trained model without its classifier part by specifying the “include_top” argument as “False”. The selection of these pre-trained CNN based models was based on their reported accuracy. Preprocessing and resizing required by these pre-trained models have been applied before extracting the features map.

Classification algorithms

Most of the machine learning studies involved in COVID-19 diagnosis have used pure deep learning strategies or transfer learning with a customized fully connected layer to be fine-tuned. However, to train deep neural networks from scratch or to fine-tune the parameters of the customized fully connected layers on the top of a pre-trained model, we need an extensive amount of data and high-performance computational resources, which is mostly impractical [50]. Therefore, we have proposed a different machine learning-based approach for the preliminary diagnosis of COVID-19 and its severity prediction using digital chest CT images. The novelty of the proposed approach is that it uses a blend of pre-trained CNN based models to extract features and shallow learning algorithms such as SVMs for the classification purpose. The proposed scheme is somehow based on the paradigm of transfer learning. In this work, our dataset consists of examples of the form where is a chest CT image and is its associated label. For COVID-19 identification, indicate whether belongs to a COVID-19 patient (+1) or not (−1) and for COVID-19 severity prediction indicate severe (+1) or mild (−1). For a given chest CT image , we extract deep feature maps denoted by . Our objective is to learn two separate functions for the diagnosis and severity prediction of COVID-19. For this purpose, we have used three different machine learning-based classification algorithms: classical Support Vector Machine (SVM), Random Forest (RF), and Gradient Boosting Machine (XGBoost) [[51], [52], [53]]. The detail of these shallow machine learning algorithms is as follows.

Support Vector classification (SVC)

We used SVM for the diagnosis of COVID-19 and its severity prediction by learning a function with as parameters to be learned from the training data The optimal value of the is obtained in SVM by solving the following optimization problem [52]. The objective function in Eq. (1) maximizes the margin while minimizing margin violations (or slacks ξ) [52]. The hyperparameter controls the tradeoff between margin maximization and margin violation. We used both linear and radial basis function (RBF) kernels and coarsely optimized the values of λ and γ using grid search with scikit-learn (version:0.23) [54].

Random Forest Classification (RFC)

Random Forest Classification (RFC) uses an ensemble learning technique based on bagging. A random forest operates by constructing several decision trees in parallel during training and outputs the mean of the classes as the prediction of all trees [51]. It usually performs better on problems having features with non-linear relationships. Each classification tree in the RF is constructed on randomly sampled subsets of input features. In this study, we have optimized RF for the number of decision trees in the forest, the maximum number of features considered for splitting a node, the maximum number of levels in each decision tree, and a minimum number of samples required to split. We have also seen this classification technique effectively in action in many other studies [[55], [56], [57], [58]].

XGBoost calssification (XGBC)

XGBoost is also an ensemble learning technique based on the boosting that combines weak learners into a strong learner in an iterative fashion [53,59]. We have used trees as default base learners in XGBoost ensembles. In this study, we have optimized XGBoost in terms of the learning rate, maximum depth, the number of boosting iterations, booster, and subsample ratio with a grid search and using a python-based package xgboost 0.7 [59].

Online access to COVIDC

We have developed and deployed a webserver of COVIDC which uses the optimal machine learning model for COVID-19 diagnosis and its severity prediction. This webserver takes a chest CT image and performs COVID-19 diagnosis and its severity prediction for it. After the successful submission of a CT image, users get COVIDC predictions in the form of COVID-19 identification and its severity prediction. This process has broken into two steps: 1) whether the uploaded image represents a COVID-19 patient or not (COVID vs non-COVID), 2) if it is a COVID-19 infection then how severe it is. The webserver is available at https://sites.google.com/view/wajidarshad/software.

Results

Experimental setup

We have used two different datasets for COVID-19 diagnosis: train-test set (1433 COVID and 1229 non-COVID CT images) [41], ii) external validation set (349 COVID and 397 non-COVID) [42], and reported performance metrics on both the datasets. For the train-test set, we have used 10-fold cross-validation (CV). In a 10-fold CV, we have shuffled images in our datasets and then split them into 10 groups. Ten models have been trained and evaluated with each group given a chance to be held out as the test set [60]. Average performance metrics across folds have been reported. For the external validation set, we trained the classification models using the whole train-test set and tested them on the validation set. We have used the area under the ROC curve (ROC), the area under the precision-recall curve (PR), and F-measure (F1: Harmonic mean of the model's precision and recall) as performance measures for model evaluation and performance assessment [60], [61], [62]. We used grid search over training data to find the optimal values of hyper-parameters of different classification models. The detailed discussion of results showing the predictive performance of our proposed method over cross-validation and also on an external validation dataset is given as follows.

Classification results of COVID versus non-COVID CT images

We have trained various shallow machine learning models for the classification of COVID versus non-COVID CT scan images with a range of deep learning-based feature maps and evaluated using both 10-fold cross-validation (CV) and on an external validation dataset. The results of our evaluation in both settings are shown in Table 1, Table 2 and Fig. 2 . Using 10-fold CV, we observed a maximum F1-score of 0.98 along with 0.99, and 0.99 as the area under the ROC curve, and the area under the PR curve, respectively with Support Vector Classifier and DenseNet121 feature map (Table 1). The F1-score of 0.98 and area under the PR curve of 0.99 show the perfect sensitivity and precision of our proposed method while classifying COVID and non-COVID CT scans efficiently. Higher values of sensitivity show that our trained model produces fewer false negatives (classifying COVID as non-COVID) which is a genuine requirement in this case.

Table 1

Predictive performance for COVID-19 diagnosis across different classification models and feature maps using 10-fold CV (COVID vs non-COVID).

Feature Map	SVC			RFC			XGBC
Feature Map	ROC	PR	F1	ROC	PR	F1	ROC	PR	F1
Resnet50	0.98 ± 0.03	0.98 ± 0.03	0.95	0.96 ± 0.03	0.96 ± 0.03	0.90	0.97 ± 0.01	0.97 ± 0.01	0.92
Xception	0.98 ± 0.04	0.98 ± 0.04	0.96	0.96 ± 0.03	0.96 ± 0.03	0.89	0.97 ± 0.01	0.97 ± 0.01	0.92
InceptionV3	0.98 ± 0.03	0.98 ± 0.03	0.96	0.96 ± 0.03	0.96 ± 0.03	0.89	0.97 ± 0.01	0.97 ± 0.01	0.92
VGG16	0.98 ± 0.03	0.98 ± 0.03	0.95	0.96 ± 0.03	0.96 ± 0.03	0.90	0.96 ± 0.01	0.96 ± 0.01	0.88
NASNetLarge	0.97 ± 0.04	0.97 ± 0.04	0.94	0.95 ± 0.03	0.95 ± 0.03	0.88	0.97 ± 0.01	0.97 ± 0.01	0.90
DenseNet121	0.99 ± 0.03	0.99 ± 0.03	0.98	0.96 ± 0.01	0.96± 0.01	0.90	0.98 ± 0.03	0.98 ± 0.03	0.94

ROC (Area under the ROC curve), PR (Area under the precision-recall curve), F1 (F1 Score), SVC (Support Vector classifier), RF (Random Forest classifier), XGBC (XGBoost classifier). Bold-faced values indicate the best performance for each model.

Table 2

Predictive performance for COVID-19 diagnosis across different classification models and feature maps on external validation dataset (COVID vs non-COVID).

Feature Map	SVC			RFC			XGBC
Feature Map	ROC	PR	F1	ROC	PR	F1	ROC	PR	F1
Resnet50	0.96	0.96	0.90	0.92	0.92	0.83	0.90	0.91	0.82
Xception	0.96	0.96	0.90	0.90	0.90	0.82	0.90	0.91	0.82
InceptionV3	0.96	0.96	0.90	0.90	0.90	0.83	0.90	0.91	0.82
VGG16	0.96	0.96	0.89	0.90	0.90	0.82	0.90	0.91	0.82
NASNetLarge	0.94	0.94	0.88	0.88	0.88	0.80	0.87	0.88	0.80
DenseNet121	0.98	0.98	0.94	0.92	0.92	0.83	0.93	0.93	0.84

Fig. 2

Receiver Operating Characteristic (ROC) and Precision-Recall (PR) curves showing predictive performance of COVIDC for the classification of chest CT images across different classifiers (SVM, RF, XGB) and DenseNet feature map on an external validation dataset. COVID vs non-COVID: ROC(A), PR(B).

Predictive performance for COVID-19 diagnosis across different classification models and feature maps using 10-fold CV (COVID vs non-COVID). ROC (Area under the ROC curve), PR (Area under the precision-recall curve), F1 (F1 Score), SVC (Support Vector classifier), RF (Random Forest classifier), XGBC (XGBoost classifier). Bold-faced values indicate the best performance for each model. Predictive performance for COVID-19 diagnosis across different classification models and feature maps on external validation dataset (COVID vs non-COVID). ROC (Area under the ROC curve), PR (Area under the precision-recall curve), F1 (F1 Score), SVC (Support Vector classifier), RF (Random Forest classifier), XGBC (XGBoost classifier). Bold-faced values indicate the best performance for each model. Receiver Operating Characteristic (ROC) and Precision-Recall (PR) curves showing predictive performance of COVIDC for the classification of chest CT images across different classifiers (SVM, RF, XGB) and DenseNet feature map on an external validation dataset. COVID vs non-COVID: ROC(A), PR(B). Meanwhile, using an external validation dataset for the evaluation of our machine learning models trained on the training set, we observed a similar trend of performance with an F1-score of 0.94 along with 0.98, and 0.98 as the area under the ROC curve, and the area under the PR curve, respectively with Support Vector Classifier and DenseNet121 features map (Table 2, Fig. 2(A) and (B)). As our external validation dataset is completely different (collected in a different country with different settings) and images have huge variations with our training set. Therefore, achieving an F1-score of 0.94 shows a significant consistent generalization performance of our proposed method in diverse CT imaging environments. Moreover, the significantly better performance of SVM in comparison to Random Forest (RF) and Extreme Boosting Machine (XGB) is attributed to its ability to deal well with high dimensional features with fewer examples as in our case. Moreover, features extracted through pre-trained deep learning models perform better than handcrafted ones as in the case of the study performed by Chandra et al., [15].

Classification results of COVID-19 severity prediction with CT images

Along with COVID-19 diagnosis using CT images, we have also tried to predict the severity of COVID-19 infection using the same image modality. For this purpose, we have trained several shallow machine learning models for the classification of severe versus mild COVID-19 infection using chest CT scans of COVID-19 infected patients with a range of deep learning-based feature maps and evaluated using 10-fold cross-validation (CV). To the best of our knowledge, this is the first study on COVID-19 severity prediction posed as a classification problem using machine learning with CT image modality. The results of our evaluation are shown in Table 3 . Using 10-fold CV, we observed a maximum F1-score of 1.0 along with 1.00, and 0.99 as the area under the ROC curve, and the area under the PR curve, respectively with Support Vector Classifier and DenseNet121 feature map (Table 3). The F score of 1.0 with an area under the PR of 0.99 shows that our proposed classification model produces almost zero false positives (classifying mild COVID-19 as severe) and false negatives (classifying severe COVID-19 as mild). These results show that we have been able to classify the severe and mild COVID-19 patients with perfect accuracy using our proposed method called COVIDC.

Table 3

Predictive performance for COVID-19 severity prediction across different classification models and feature maps using 10-fold CV.

Feature Map	SVC			RFC			XGBC
Feature Map	ROC	PR	F1	ROC	PR	F1	ROC	PR	F1
Resnet50	0.99 ± 0.01	0.99 ± 0.01	0.98	0.97 ± 0.03	0.97 ± 0.03	0.96	0.98 ± 0.01	0.98 ± 0.01	0.95
Xception	0.99 ± 0.01	0.99 ± 0.01	0.98	0.96 ± 0.03	0.96 ± 0.03	0.96	0.98 ± 0.01	0.98 ± 0.01	0.96
InceptionV3	0.99 ± 0.01	0.99 ± 0.01	0.99	0.99 ± 0.01	0.99 ± 0.01	0.98	0.98 ± 0.01	0.98 ± 0.01	0.98
VGG16	0.99 ± 0.01	0.99 ± 0.01	0.98	0.95 ± 0.03	0.94 ± 0.03	0.94	0.98 ± 0.01	0.98 ± 0.01	0.97
NASNetLarge	0.98 ± 0.01	0.98 ± 0.01	0.97	0.96 ± 0.03	0.96 ± 0.03	0.92	0.97 ± 0.01	0.97 ± 0.01	0.95
DenseNet121	1.00 ± 0.01	0.99 ± 0.01	1.00	0.98 ± 0.01	0.98 ± 0.01	0.97	0.98 ± 0.05	0.98 ± 0.05	0.97

Predictive performance for COVID-19 severity prediction across different classification models and feature maps using 10-fold CV. ROC (Area under the ROC curve), PR (Area under the precision-recall curve), F1 (F1 Score), SVC (Support Vector classifier), RF (Random Forest classifier), XGBC (XGBoost classifier). Bold-faced values indicate the best performance for each model.

Predictive performance of COVIDC in its real use

The real generalization performance of our optimal trained models (COVIDC) has been evaluated under the supervision of an experienced radiologist at Abbas Institute of Medical Sciences (AIMS) hospital located in Muzaffarabad, Azad Jammu & Kashmir, Pakistan. For this purpose, 20 CT scans for COVID-19 diagnosis (10 COVID and 10 non-COVID) and 20 CT scans for severity (10 severe and 10 milds) have been used. These scans were used as novel input CT scans to our proposed method called COVIDC in raw digital form as these were not included in the training set. Results obtained through this evaluation are shown as confusion matrices in Fig. 3 . For COVID vs non-COVID, our proposed system (COVIDC) has been able to classify correctly 9 out of 10 provided COVID CT images as COVID and 1 as non-COVID (Fig. 3(A)). Similarly, for the provided non-COVID CT images, our system classified correctly 8 out of 10 images as non-COVID, and 2 as COVID (Fig. 3(A)). This performance is reasonably good as we have less misclassification rate for COVID and this is ideally required to isolate the infected patients. These results help us to interpret that this system can reliably be used to advise isolation or quartine while suspecting COVID-19 infection.

Fig. 3

Confusion matrices: Showing COVIDC performance in real setting. A) COVID vs non-COVID; B) COVID severity prediction.

Confusion matrices: Showing COVIDC performance in real setting. A) COVID vs non-COVID; B) COVID severity prediction. Over severe vs mild COVID-19 classification task, our proposed system (COVIDC) has been able to classify correctly10 out of 10 provided COVID CT images as severe (Fig. 3(B)). Similarly, for the provided mild COVID CT images, our system classified correctly 9 out of 10 images as mild, and 1 as severe (Fig. 3(B)). This performance of our proposed system in severity prediction shows that this system can reliably be used to advise intensive care during COVID-19 infection. Overall these results justify the use of our proposed system in real settings.

Discussion

In the absence or inadequate provision of specific drugs or vaccines for the treatment or prevention of COVID-19 its quick and automatic diagnosis and severity prediction is crucial. After the successful application of CT scans with machine learning in many clinical and radiological studies, it has become an important avenue to be explored to detect and severity prediction of COVID-19. Several machine learning methods have been proposed which use digital CT scans along with deep learning or transfer learning by fine-tuning the customized fully connected layer. These deep and transfer learning-based approaches face data scarcity issues. Meanwhile, studies based on shallow learning with handcrafted features do not achieve an ideal accuracy. Furthermore, all the existing methods are proposed only to detect COVID-19 and its severity prediction was still an open research question. To knob these issues and to get the best of both worlds, we have proposed a novel approach that uses a combination of transfer and shallow learning to detect COVID-19 and predict its severity. In this approach, we have used various off-the-shelf CNN based pre-trained models on ImageNet to get feature maps and shallow learning algorithms such as SVMs to get the final trained model. We have used various feature maps and shallow machine learning algorithms along with different valuation techniques to come up with a model having improved generalization performance with an F1 score of 0.94 along with are under the PR curve of 0.98. Direct performance comparison of our proposed method with existing methods is not possible due to variations in the amount of training data and performance metrics. However, we give a brief comparison based on learning strategies. In this regard, the performance of our proposed method is comparable to a transfer learning-based method proposed by Ahuja et al. with an F1 score even without using image augmentation to increase the dataset [22]. If we compare our method with existing shallow learning-based methods, our method gives improved performance in comparison to a method proposed by Kang et al. trained using handcrafted features with an overall accuracy of 86%. Improved performance of our COVIDC over the existing state-of-the-art methods [22,27] and in a real setting suggests that the proposed method can effectively be used to diagnose COVID-19 and to predict its severity.

Conclusions

In this study, we have proposed a system called COVIDC for the preliminary diagnosis and severity prediction of COVID-19 infection using chest CT scans. The stringent performance evaluation through 10-fold CV, on an external validation dataset, and in a use under real setting shows that our proposed system can effectively be used not only to diagnose COVID-19 infection but also its severity prediction. The use of our proposed system can help to reduce the surge of COVID-19 by advising timely isolation and further diagnostic tests such as RT-PCR to the suspected patients. This system can also aid in avoiding massive casualties by deciding on intensive care in case of severe COVID-19 infection. The key findings of the study are listed as follows: Our proposed system performed significantly better in comparison to the state-of-the-art existing systems during the rigorous adopted evaluation criteria even in the presence of substantial variations in the used CT images. Our proposed system not only diagnose COVID-19 but also predict its severity. We have made our proposed system accessible through a publicly open cloud-based webserver and open-source code.

Availability of data and materials

All data generated or analyzed during this study are included in this paper or available at online repositories. A Python implementation of the proposed method together with a webserver is available at https://sites.google.com/view/wajidarshad/software and https://github.com/wajidarshad/covidc.

Authors’ contributions

WAA conceived the idea, developed the scientific workflow, performed the experiments, analyzed and interpreted the results, and was a major contributor in manuscript writing. SAA contributed to the analysis of the results and writing of the manuscript. SA, GI, SAA, KA, SK, AA, KI, MS, JC, and AA helped in results interpretation and validation, formal analysis, and manuscript writing. All authors have read and approved the final manuscript.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

20 in total

1. Protein-protein binding affinity prediction on a diverse set of structures.

Authors: Iain H Moal; Rudi Agius; Paul A Bates
Journal: Bioinformatics Date: 2011-09-07 Impact factor: 6.937

Review 2. Imaging research in fibrotic lung disease; applying deep learning to unsolved problems.

Authors: Simon L F Walsh; Stephen M Humphries; Athol U Wells; Kevin K Brown
Journal: Lancet Respir Med Date: 2020-02-25 Impact factor: 30.700

3. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China.

Authors: Chaolin Huang; Yeming Wang; Xingwang Li; Lili Ren; Jianping Zhao; Yi Hu; Li Zhang; Guohui Fan; Jiuyang Xu; Xiaoying Gu; Zhenshun Cheng; Ting Yu; Jiaan Xia; Yuan Wei; Wenjuan Wu; Xuelei Xie; Wen Yin; Hui Li; Min Liu; Yan Xiao; Hong Gao; Li Guo; Jungang Xie; Guangfa Wang; Rongmeng Jiang; Zhancheng Gao; Qi Jin; Jianwei Wang; Bin Cao
Journal: Lancet Date: 2020-01-24 Impact factor: 79.321

Review 4. Diagnostic techniques for COVID-19 and new developments.

Authors: Elham Sheikhzadeh; Shimaa Eissa; Aziah Ismail; Mohammed Zourob
Journal: Talanta Date: 2020-07-14 Impact factor: 6.057

5. An interactive web-based dashboard to track COVID-19 in real time.

Authors: Ensheng Dong; Hongru Du; Lauren Gardner
Journal: Lancet Infect Dis Date: 2020-02-19 Impact factor: 25.071

6. Classification of COVID-19 patients from chest CT images using multi-objective differential evolution-based convolutional neural networks.

Authors: Dilbag Singh; Vijay Kumar; Manjit Kaur
Journal: Eur J Clin Microbiol Infect Dis Date: 2020-04-27 Impact factor: 3.267

7. Coronavirus disease (COVID-19) detection in Chest X-Ray images using majority voting based classifier ensemble.

Authors: Tej Bahadur Chandra; Kesari Verma; Bikesh Kumar Singh; Deepak Jain; Satyabhuwan Singh Netam
Journal: Expert Syst Appl Date: 2020-08-26 Impact factor: 6.954

8. Artificial intelligence for the detection of COVID-19 pneumonia on chest CT using multinational datasets.

Authors: Stephanie A Harmon; Thomas H Sanford; Sheng Xu; Evrim B Turkbey; Holger Roth; Ziyue Xu; Dong Yang; Andriy Myronenko; Victoria Anderson; Amel Amalou; Maxime Blain; Michael Kassin; Dilara Long; Nicole Varble; Stephanie M Walker; Ulas Bagci; Anna Maria Ierardi; Elvira Stellato; Guido Giovanni Plensich; Giuseppe Franceschelli; Cristiano Girlando; Giovanni Irmici; Dominic Labella; Dima Hammoud; Ashkan Malayeri; Elizabeth Jones; Ronald M Summers; Peter L Choyke; Daguang Xu; Mona Flores; Kaku Tamura; Hirofumi Obinata; Hitoshi Mori; Francesca Patella; Maurizio Cariati; Gianpaolo Carrafiello; Peng An; Bradford J Wood; Baris Turkbey
Journal: Nat Commun Date: 2020-08-14 Impact factor: 14.919

9. Chest CT in COVID-19: What the Radiologist Needs to Know.

Authors: Thomas C Kwee; Robert M Kwee
Journal: Radiographics Date: 2020-10-23 Impact factor: 5.333

1 in total

1. Statistical analysis of COVID-19 infection severity in lung lobes from chest CT.

Authors: Mehdi Yousefzadeh; Mozhdeh Zolghadri; Masoud Hasanpour; Fatemeh Salimi; Ramezan Jafari; Mehran Vaziri Bozorg; Sara Haseli; Abolfazl Mahmoudi Aqeel Abadi; Shahrokh Naseri; Mohammadreza Ay; Mohammad-Reza Nazem-Zadeh
Journal: Inform Med Unlocked Date: 2022-04-01

1 in total