| Literature DB >> 35794537 |
Mohammad H Alshayeji1, Silpa ChandraBhasi Sindhu2, Sa'ed Abed3.
Abstract
BACKGROUND: Here propose a computer-aided diagnosis (CAD) system to differentiate COVID-19 (the coronavirus disease of 2019) patients from normal cases, as well as to perform infection region segmentation along with infection severity estimation using computed tomography (CT) images. The developed system facilitates timely administration of appropriate treatment by identifying the disease stage without reliance on medical professionals. So far, this developed model gives the most accurate, fully automatic COVID-19 real-time CAD framework.Entities:
Keywords: COVID-19; Classification; Computed tomography; Computer-aided diagnosis; Deep neural network; Machine learning; Semantic segmentation; Severity score
Mesh:
Year: 2022 PMID: 35794537 PMCID: PMC9261058 DOI: 10.1186/s12859-022-04818-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1Workflow diagram
CT image details of complete database for two classes
| Category | NCP | Normal |
|---|---|---|
| Total number of patients | 929 | 818 |
| Total number of scans | 1544 | 1069 |
| Total number of CT slices | 156,071 | 92,853 |
| Number of CT slices with lesions | 21,872 | NA* |
| Data size | 17.4 GB | 9.1 GB |
NA*: not available
Fig. 2Semantic segmentation using DeepLabv3+
Fig. 3DeepLabv3+ architecture
Fig. 4Proposed methodology
Fig. 5Workflow of infected region extraction using deep learning
Pre-processing results of COVID-19 and normal CT images
Class metrics of two-class semantic segmentation using DeepLabv3+ with weights initialized by different pre-trained networks
| Pre-trained network | Mini batch size | Accuracy | IoU | Mean BF score | Accuracy | IoU | Mean BF score |
|---|---|---|---|---|---|---|---|
| (Background) | (Lung field) | ||||||
| ResNet-18 | 8 | 0.9977 | 0.9966 | 0.9939 | 0.9921 | 0.9771 | 0.9828 |
| ResNet-18 | 32 | 0.9981 | 0.9952 | 0.9908 | 0.9804 | 0.9679 | 0.9738 |
| ResNet-50 | 8 | 0.9978 | 0.9962 | 0.9924 | 0.9885 | 0.9722 | 0.9789 |
| ResNet-50 | 16 | 0.9971 | 0.9946 | 0.9885 | 0.9818 | 0.9607 | 0.9656 |
| MobileNet-v2 | 8 | 0.9973 | 0.9964 | 0.9949 | 0.9937 | 0.9744 | 0.9849 |
| MobileNet-v2 | 16 | 0.9985 | 0.9960 | 0.9900 | 0.9818 | 0.9711 | 0.9712 |
| MobileNet-v2 | 32 | 0.9983 | 0.9966 | 0.9925 | 0.9878 | 0.9757 | 0.9775 |
Bold usage is preferred to enhance the experimental result which provides the high performance
SIFT, SURF, and ORB feature sizes for COVID-19 and normal CT images
| Feature extraction technique | COVID-19 feature size (3000 lung segmentation out) | Normal feature size (3000 lung segmentation out) |
|---|---|---|
| SIFT | 448,368 × 128 | 205,670 × 128 |
| SURF | 266,775 × 64 | 137,820 × 64 |
| ORB | 1,048,576 × 32 | 1,048,576 × 32 |
SIFT, SURF, and ORB parameters
| SIFT | SURF | ORB | |||
|---|---|---|---|---|---|
| Parameter | Value | Parameter | Value | Parameter | Value |
| nOctaveLayers | 3 | MetricThreshold | 1000 | scaleFactor | 1.2f |
| contrastThreshold | 0.04/0.03 | NumOctaves | 3 | nlevels | 8 |
| edgeThreshold | 10 | NumScaleLevels | 4 | edgeThreshold | 31 |
| sigma | 1.6 | ROI | [1 1 size(I,2) size(I,1)] | firstLevel | 0 |
| WTA_K | 2 | ||||
| scoreType | HARRIS_SCORE | ||||
| patchSize | 31 | ||||
| fastThreshold | 20 | ||||
ANOVA results for SIFT, SURF, and ORB features
| Method | Class name | Minimum | Mean | Quartile-1 | Median | Quartile-3 | Maximum | IQR |
|---|---|---|---|---|---|---|---|---|
| ORB | COVID-19 | 0 | 10.24 | 5 | 9 | 14 | 92 | 9 |
| Normal | 0 | 7.11 | 3 | 6 | 9 | 56 | 6 | |
| SIFT | COVID-19 | 0 | 1.49 | 0 | 1 | 2 | 21 | 2 |
| Normal | 0 | 0.69 | 0 | 0 | 1 | 26 | 1 | |
| SURF | COVID-19 | 0 | 0.89 | 0 | 0 | 1 | 29 | 1 |
| Normal | 0 | 0.46 | 0 | 0 | 1 | 23 | 1 |
Fig. 6Boxplot for ORB features from 6000 CT images
Fig. 7Histogram of visual word occurrences
Classifiers and hyperparameters with value ranges
| Classifier | Hyperparameter | Value range |
|---|---|---|
| Random Forest Classifier | Number of trees (n_estimators) | [1, 10, 50, 100] |
| criterion | [‘gini’,’entropy’] | |
| max_features | [‘sqrt’,’log2’] | |
| XGBoost | n_estimators | [100, 200, 300, 400, 500] |
| learning_rate | [0.0001, 0.001, 0.01, 0.1] | |
| KNN | n_neighbors | [3, 5, 11, 19] |
| weights | [‘uniform’, ‘distance’] | |
| metric | [‘euclidean’,’manhattan’] | |
| Decision Tree | max_leaf_nodes | list(range(2, 100)) |
| min_samples_split | [2, 3, 4] | |
| max_depth | np.arange(3, 10) | |
| criterion | ['gini', 'entropy'] | |
| PNN | std | (0, 10) |
Best classification performance measures for ORB features
| Classifier | K | Class | PPV | NPV | Sensitivity | Specificity | Accuracy | MR* |
|---|---|---|---|---|---|---|---|---|
| XGBoost | 2000 | COVID | 0.9930 | 0.9940 | 0.9940 | 0.9930 | 0.9935 | 0.0065 |
| Normal | 0.9940 | 0.9930 | 0.9930 | 0.9940 | 0.9935 | 0.0065 | ||
| RF | 1000 | COVID | 0.9883 | 0.9880 | 0.9880 | 0.9883 | 0.9882 | 0.0118 |
| Normal | 0.9880 | 0.9883 | 0.9883 | 0.9880 | 0.9882 | 0.0118 | ||
| DT | 50 | COVID | 0.9699 | 0.9684 | 0.9683 | 0.9700 | 0.9692 | 0.0308 |
| Normal | 0.9684 | 0.9699 | 0.9700 | 0.9683 | 0.9692 | 0.0308 | ||
| KNN | 100 | COVID | 0.9983 | 0.9953 | 0.9953 | 0.9983 | 0.9968 | 0.0032 |
| Normal | 0.9953 | 0.9983 | 0.9983 | 0.9953 | 0.9968 | 0.0032 | ||
| PNN | 100 | COVID | ||||||
| Normal |
Bold usage is preferred to enhance the experimental result which provides the high performance
Best classifier results for SIFT features
| Classifier | K | Class | PPV | NPV | Sensitivity | Specificity | Accuracy | MR* |
|---|---|---|---|---|---|---|---|---|
| XGBoost | 2000 | COVID | 0.9946 | 0.9777 | 0.9773 | 0.9947 | 0.9860 | 0.0140 |
| Normal | 0.9777 | 0.9946 | 0.9947 | 0.9773 | 0.9860 | 0.0140 | ||
| RF | 1000 | COVID | ||||||
| Normal | ||||||||
| DT | 50 | COVID | 0.9709 | 0.9570 | 0.9563 | 0.9713 | 0.9638 | 0.0362 |
| Normal | 0.9570 | 0.9709 | 0.9713 | 0.9563 | 0.9638 | 0.0362 | ||
| KNN | 100 | COVID | 0.9996 | 0.8881 | 0.8740 | 0.9997 | 0.9368 | 0.0632 |
| Normal | 0.8881 | 0.9996 | 0.9997 | 0.8740 | 0.9368 | 0.0632 | ||
| PNN | 100 | COVID | 0.9974 | 0.8958 | 0.8840 | 0.9977 | 0.9408 | 0.0592 |
| Normal | 0.8958 | 0.9974 | 0.9977 | 0.8840 | 0.9408 | 0.0592 |
Bold usage is preferred to enhance the experimental result which provides the high performance
Best classifier results for SURF features
| Classifier | K | Class | PPV | NPV | Sensitivity | Specificity | Accuracy | MR* |
|---|---|---|---|---|---|---|---|---|
| XGBoost | 2000 | COVID | 0.9856 | 0.9840 | 0.9840 | 0.9857 | 0.9848 | 0.0152 |
| Normal | 0.9840 | 0.9856 | 0.9857 | 0.9840 | 0.9848 | 0.0152 | ||
| RF | 1000 | COVID | ||||||
| Normal | ||||||||
| DT | 50 | COVID | 0.9505 | 0.9535 | 0.9537 | 0.9503 | 0.9520 | 0.0480 |
| Normal | 0.9535 | 0.9505 | 0.9503 | 0.9537 | 0.9520 | 0.0480 | ||
| KNN | 100 | COVID | 0.9932 | 0.9324 | 0.9280 | 0.9937 | 0.9608 | 0.0392 |
| Normal | 0.9324 | 0.9932 | 0.9937 | 0.9280 | 0.9608 | 0.0392 | ||
| PNN | 100 | COVID | 0.9961 | 0.9341 | 0.9297 | 0.9963 | 0.9630 | 0.0370 |
| Normal | 0.9341 | 0.9961 | 0.9963 | 0.9297 | 0.9630 | 0.0370 |
Bold usage is preferred to enhance the experimental result which provides the high performance
MR*: Miss classification Rate
Fig. 8Confusion matrix of PNN classifier using ORB features
Fig. 9ROC curve of PNN classifier using ORB features
Pixel distribution details of three classes
| Class name | Training pixel counts | Validation pixel counts | Test pixel counts | Total number of pixels (%) |
|---|---|---|---|---|
| Background | 1.3718e+08 | 1.7246e+07 | 1.713e+07 | 87.258 |
| Lung field | 1.8306e+07 | 2.1934e+06 | 2.2644e+06 | 11.578 |
| Infected regions | 1.79973e+06 | 221,827 | 266,203 | 1.163 |
Optimal hyperparameters used for infection segmentation
| Parameter | Hyperparameters | Optimal value |
|---|---|---|
| Maximum epochs | 20, 50, 100 | 50 |
| Batch size | 8,16,32 | 8 |
| Momentum factor | 0.9 | 0.9 |
| Learning rate | 0.001, 0.0001 | 0.0001 |
| L2 regularization | 0.0001 | 0.0001 |
| Optimizers | ‘sgdm’, ‘adam’ | adam |
Best dataset metrics from each network after grid search
| Network name | MBS* | ILR* | Global accuracy | Mean accuracy | Mean IoU | Weighted IoU | Mean BF score |
|---|---|---|---|---|---|---|---|
| ResNet-18 | 32 | 0.001 | 0.9936 | 0.9190 | 0.8798 | 0.9878 | 0.9210 |
| MobileNet-v2 | 8 | 0.001 | 0.9925 | 0.9307 | 0.8787 | 0.9859 | 0.9170 |
Bold usage is preferred to enhance the experimental result which provides the high performance
Best class metrics from each network after grid search
| Network name | MBS* | ILR* | Class name | Accuracy | IoU | Mean BF score |
|---|---|---|---|---|---|---|
| ResNet-18 | 32 | 0.001 | Background | 0.9992 | 0.9966 | 0.9949 |
| Lung field | 0.9727 | 0.9522 | 0.9535 | |||
| Infection | 0.7851 | 0.6905 | 0.7454 | |||
| Background | ||||||
| Lung field | ||||||
| Infection | ||||||
| MobileNet-v2 | 8 | 0.001 | Background | 0.9981 | 0.9965 | 0.9932 |
| Lung field | 0.9699 | 0.9390 | 0.9455 | |||
| Infection | 0.8240 | 0.7007 | 0.7636 |
Bold usage is preferred to enhance the experimental result which provides the high performance
ILR*: Initial Learning Rate, MBS*: Minimum Batch Size
Fig. 10Accuracy and loss versus iteration plots of final DL semantic segmentation network
Infected region extractions; severity score of different COVID-19 CT images
Performance comparison table
| Ref | Proposed method | Result | Limitation |
|---|---|---|---|
| [ | Classification: COVID-CT-Mask-Net model, Segmentation: MaskR-CNN | Classification accuracy: 0.9166, sensitivity: 0.9080, specificity: 0.9210, F1-score: 0.9150 | Poor model generalization capability |
| [ | Classification using DL features from EfficientNet and clinical data | AUC = 0.8274 | Not an automatic approach, collection of clinical data requires manual intervention |
| [ | Classification: COVIDNet-CT, heterogeneous composition of conventional spatial, pointwise, depthwise convolution layers | Classification accuracy: 0.973, specificity: 0.999, PPV:0.99, NPV: 0.993 | Architecture faces generalization issues, no infection extraction approach |
| [ | Classification: VGG16 deep neural network + ensemble learning | Classification accuracy: 0.9357, specificity: 0.9393, sensitivity: 0.9421, precision: 0.894, and F1-score: 0.9174 | Only experimented with VGG model, no infected lesion segmentation |
| [ | Classification: 3D ResNet-18 | Classification accuracy: 0.9924, recall: 0.9996, precision: 0.9935, F1-sorce: 0.9965 | Model is still a black box |
| Proposed | Classification: ORB + BOF + PNN, Infection extraction: Semantic segmentation using DeepLabv3+ with weights initialized by ResNet-50 | Classification accuracy: 0.997, AUC: 0.9988, sensitivity:0.999, specificity: 0.996, PPV:0.996, NPV: 0.999. Infection extraction: global accuracy:0.9947, weighted IoU: 0.9899, mean BF score: 0.9453 | Imbalanced dataset of different stages of COVID-19 for infection extraction model |