| Literature DB >> 33644298 |
Wajid Arshad Abbasi1, Syed Ali Abbas1, Saiqa Andleeb2, Ghafoor Ul Islam2, Syeda Adin Ajaz1, Kinza Arshad1, Sadia Khalil1, Asma Anjam1, Kashif Ilyas1, Mohsib Saleem1, Jawad Chughtai1, Ayesha Abbas1.
Abstract
Early diagnosis of Coronavirus disease 2019 (COVID-19) is significantly important, especially in the absence or inadequate provision of a specific vaccine, to stop the surge of this lethal infection by advising quarantine. This diagnosis is challenging as most of the patients having COVID-19 infection stay asymptomatic while others showing symptoms are hard to distinguish from patients having different respiratory infections such as severe flu and Pneumonia. Due to cost and time-consuming wet-lab diagnostic tests for COVID-19, there is an utmost requirement for some alternate, non-invasive, rapid, and discounted automatic screening system. A chest CT scan can effectively be used as an alternative modality to detect and diagnose the COVID-19 infection. In this study, we present an automatic COVID-19 diagnostic and severity prediction system called COVIDC (COVID-19 detection using CT scans) that uses deep feature maps from the chest CT scans for this purpose. Our newly proposed system not only detects COVID-19 but also predicts its severity by using a two-phase classification approach (COVID vs non-COVID, and COVID-19 severity) with deep feature maps and different shallow supervised classification algorithms such as SVMs and random forest to handle data scarcity. We performed a stringent COVIDC performance evaluation not only through 10-fold cross-validation and an external validation dataset but also in a real setting under the supervision of an experienced radiologist. In all the evaluation settings, COVIDC outperformed all the existing state-of-the-art methods designed to detect COVID-19 with an F1 score of 0.94 on the validation dataset and justified its use to diagnose COVID-19 effectively in the real setting by classifying correctly 9 out of 10 COVID-19 CT scans. We made COVIDC openly accessible through a cloud-based webserver and python code available at https://sites.google.com/view/wajidarshad/software and https://github.com/wajidarshad/covidc.Entities:
Keywords: COVID-19; CT imaging; Early diagnosis; Expert system; Pandemic; Radiology; SARS-COV-2
Year: 2021 PMID: 33644298 PMCID: PMC7901302 DOI: 10.1016/j.imu.2021.100540
Source DB: PubMed Journal: Inform Med Unlocked ISSN: 2352-9148
Fig. 1A proposed methodology for the development of computer-aided COVID-19 diagnosis and its severity prediction system (COVIDC) using Machine learning and chest CT images. This system has been trained using shallow learning algorithms such as SVMs with chest CT scans by extracting feature maps involving pre-trained off-the-shelf models. COVIDC can be used to predict whether a novel test CT image has COVID-19 or not.
Predictive performance for COVID-19 diagnosis across different classification models and feature maps using 10-fold CV (COVID vs non-COVID).
| Feature Map | SVC | RFC | XGBC | ||||||
|---|---|---|---|---|---|---|---|---|---|
| ROC | PR | F1 | ROC | PR | F1 | ROC | PR | F1 | |
| 0.98 ± 0.03 | 0.98 ± 0.03 | 0.95 | 0.96 ± 0.03 | 0.96 ± 0.03 | 0.90 | 0.97 ± 0.01 | 0.97 ± 0.01 | 0.92 | |
| 0.98 ± 0.04 | 0.98 ± 0.04 | 0.96 | 0.96 ± 0.03 | 0.96 ± 0.03 | 0.89 | 0.97 ± 0.01 | 0.97 ± 0.01 | 0.92 | |
| 0.98 ± 0.03 | 0.98 ± 0.03 | 0.96 | 0.96 ± 0.03 | 0.96 ± 0.03 | 0.89 | 0.97 ± 0.01 | 0.97 ± 0.01 | 0.92 | |
| 0.98 ± 0.03 | 0.98 ± 0.03 | 0.95 | 0.96 ± 0.03 | 0.96 ± 0.03 | 0.90 | 0.96 ± 0.01 | 0.96 ± 0.01 | 0.88 | |
| 0.97 ± 0.04 | 0.97 ± 0.04 | 0.94 | 0.95 ± 0.03 | 0.95 ± 0.03 | 0.88 | 0.97 ± 0.01 | 0.97 ± 0.01 | 0.90 | |
ROC (Area under the ROC curve), PR (Area under the precision-recall curve), F1 (F1 Score), SVC (Support Vector classifier), RF (Random Forest classifier), XGBC (XGBoost classifier). Bold-faced values indicate the best performance for each model.
Predictive performance for COVID-19 diagnosis across different classification models and feature maps on external validation dataset (COVID vs non-COVID).
| Feature Map | SVC | RFC | XGBC | ||||||
|---|---|---|---|---|---|---|---|---|---|
| ROC | PR | F1 | ROC | PR | F1 | ROC | PR | F1 | |
| 0.96 | 0.96 | 0.90 | 0.92 | 0.92 | 0.83 | 0.90 | 0.91 | 0.82 | |
| 0.96 | 0.96 | 0.90 | 0.90 | 0.90 | 0.82 | 0.90 | 0.91 | 0.82 | |
| 0.96 | 0.96 | 0.90 | 0.90 | 0.90 | 0.83 | 0.90 | 0.91 | 0.82 | |
| 0.96 | 0.96 | 0.89 | 0.90 | 0.90 | 0.82 | 0.90 | 0.91 | 0.82 | |
| 0.94 | 0.94 | 0.88 | 0.88 | 0.88 | 0.80 | 0.87 | 0.88 | 0.80 | |
ROC (Area under the ROC curve), PR (Area under the precision-recall curve), F1 (F1 Score), SVC (Support Vector classifier), RF (Random Forest classifier), XGBC (XGBoost classifier). Bold-faced values indicate the best performance for each model.
Fig. 2Receiver Operating Characteristic (ROC) and Precision-Recall (PR) curves showing predictive performance of COVIDC for the classification of chest CT images across different classifiers (SVM, RF, XGB) and DenseNet feature map on an external validation dataset. COVID vs non-COVID: ROC(A), PR(B).
Predictive performance for COVID-19 severity prediction across different classification models and feature maps using 10-fold CV.
| Feature Map | SVC | RFC | XGBC | ||||||
|---|---|---|---|---|---|---|---|---|---|
| ROC | PR | F1 | ROC | PR | F1 | ROC | PR | F1 | |
| 0.99 ± 0.01 | 0.99 ± 0.01 | 0.98 | 0.97 ± 0.03 | 0.97 ± 0.03 | 0.96 | 0.98 ± 0.01 | 0.98 ± 0.01 | 0.95 | |
| 0.99 ± 0.01 | 0.99 ± 0.01 | 0.98 | 0.96 ± 0.03 | 0.96 ± 0.03 | 0.96 | 0.98 ± 0.01 | 0.98 ± 0.01 | 0.96 | |
| 0.99 ± 0.01 | 0.99 ± 0.01 | 0.99 | |||||||
| 0.99 ± 0.01 | 0.99 ± 0.01 | 0.98 | 0.95 ± 0.03 | 0.94 ± 0.03 | 0.94 | 0.98 ± 0.01 | 0.98 ± 0.01 | 0.97 | |
| 0.98 ± 0.01 | 0.98 ± 0.01 | 0.97 | 0.96 ± 0.03 | 0.96 ± 0.03 | 0.92 | 0.97 ± 0.01 | 0.97 ± 0.01 | 0.95 | |
| 0.98 ± 0.01 | 0.98 ± 0.01 | 0.97 | 0.98 ± 0.05 | 0.98 ± 0.05 | 0.97 | ||||
ROC (Area under the ROC curve), PR (Area under the precision-recall curve), F1 (F1 Score), SVC (Support Vector classifier), RF (Random Forest classifier), XGBC (XGBoost classifier). Bold-faced values indicate the best performance for each model.
Fig. 3Confusion matrices: Showing COVIDC performance in real setting. A) COVID vs non-COVID; B) COVID severity prediction.