| Literature DB >> 35834442 |
Thi Kieu Khanh Ho1, Jeonghwan Gwak1,2,3,4.
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS CoV-2), also known as the coronavirus disease 2019 (COVID-19), has threatened many human beings around the world and capsized economies at unprecedented magnitudes. Therefore, the detection of this disease using chest X-ray modalities has played a pivotal role in producing fast and accurate medical diagnoses, especially in countries that are unable to afford laboratory testing kits. However, identifying and distinguishing COVID-19 from virtually similar thoracic abnormalities utilizing medical images is challenging because it is time-consuming, demanding, and susceptible to human-based errors. Therefore, artificial-intelligence-driven automated diagnoses, which excludes direct human intervention, may potentially be used to achieve consistently accurate performances. In this study, we aimed to (i) obtain a customized dataset composed of a relatively small number of images collected from publicly available datasets; (ii) present the efficient integration of the shallow handcrafted features obtained from local descriptors, radiomics features specialized for medical images, and deep features aggregated from pre-trained deep learning architectures; and (iii) distinguish COVID-19 patients from healthy controls and pneumonia patients using a collection of conventional machine learning classifiers. By conducting extensive experiments, we demonstrated that the feature-based ensemble approach provided the best classification metrics, and this approach explicitly outperformed schemes that used only either local, radiomic, or deep features. In addition, our proposed method achieved state-of-the-art multi-class classification results compared to the baseline reference for the currently available COVID-19 datasets.Entities:
Mesh:
Year: 2022 PMID: 35834442 PMCID: PMC9282557 DOI: 10.1371/journal.pone.0268430
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Overview of the proposed method for our COVID-19 study.
Fig 2Flowchart of radiomic feature extraction.
CXR data information.
| Approach | Training | Validation | Testing | Preprocessed Steps |
|---|---|---|---|---|
| Handcrafted, Radiomic and Deep Feature Integration | 3028 (80%) | – | 755 (20%) | Cropping, Histogram Equalization, Constant Threshold Contouring |
| Pre-trained ResNet18 and DenseNet121 | 2646 (70%) | 378(10%) | 764 (20%) | Cropping, histogram equalization, horizontal flipping, and batch augmentation methods in both training and validation |
Fig 3t-SNE dimensions of the class distributions using whole CXR images (left) and lung segments (right).
Best classification metrics for each feature-level approach.
| Approach | Handcrafted and Radiomic Features | Pretrained Resnet18 | Pretrained DenseNet121 | Combined Deep Features | All Features | Selected Handcrafted, Radiomic, and Deep Features |
|---|---|---|---|---|---|---|
|
| 0.892 | 0.886 | 0.917 | 0.912 | 0.925 |
|
|
| 0.895 | 0.899 | 0.932 | 0.926 | 0.926 |
|
|
| 0.892 | 0.886 | 0.917 | 0.919 | 0.925 |
|
|
| 0.892 | 0.882 | 0.914 | 0.910 | 0.922 |
|
Classification accuracy obtained using handcrafted and radiomic features.
| Features | LDA | kNN | GNB | SVM | AdaBoost | RF | Ensemble | XGBoost | NN |
|---|---|---|---|---|---|---|---|---|---|
| SIFT | 0.656 | 0.720 | 0.608 | 0.615 | 0.665 | 0.725 | 0.739 | 0.720 |
|
| GIST | 0.674 | 0.730 | 0.688 | 0.605 | 0.690 | 0.705 | 0.756 | 0.730 |
|
| LBP | 0.689 | 0.658 | 0.660 | 0.626 | 0.690 | 0.710 | 0.748 | 0.716 |
|
| HOG | 0.686 | 0.658 | 0.660 | 0.626 | 0.690 | 0.710 | 0.746 | 0.711 |
|
| GLCM | 0.699 | 0.722 | 0.679 | 0.657 | 0.724 | 0.751 | 0.769 | 0.738 |
|
| Radiomics | 0.769 | 0.838 | 0.745 | 0.727 | 0.764 | 0.830 | 0.849 | 0.841 |
|
Classification accuracy obtained using selected handcrafted, radiomic and deep features.
| Metrics | LDA | kNN | GNB | SVM | AdaBoost | RF | Ensemble | XGBoost | NN |
|---|---|---|---|---|---|---|---|---|---|
| Accuracy | 0.866 | 0.882 | 0.838 | 0.845 | 0.875 | 0.905 | 0.919 | 0.905 |
|
| Precision | 0.868 | 0.882 | 0.891 | 0.826 | 0.889 | 0.895 | 0.917 | 0.933 |
|
| Recall | 0.866 | 0.882 | 0.838 | 0.845 | 0.875 | 0.905 | 0.919 | 0.905 |
|
| F1 Score | 0.850 | 0.882 | 0.830 | 0.820 | 0.866 | 0.880 | 0.908 | 0.899 |
|
Classification accuracy obtained using all handcrafted and radiomic feature combinations.
| Metrics | LDA | kNN | GNB | SVM | AdaBoost | RF | Ensemble | XGBoost | NN |
|---|---|---|---|---|---|---|---|---|---|
| Accuracy | 0.792 | 0.832 | 0.746 | 0.773 | 0.765 | 0.853 | 0.847 | 0.844 |
|
| Precision | 0.809 | 0.835 | 0.849 | 0.725 | 0.786 | 0.828 | 0.856 | 0.861 |
|
| Recall | 0.792 | 0.832 | 0.746 | 0.773 | 0.765 | 0.853 | 0.847 | 0.844 |
|
| F1 Score | 0.774 | 0.832 | 0.737 | 0.735 | 0.750 | 0.817 | 0.829 | 0.823 |
|
Classification accuracy obtained using selected handcrafted (LBP + HOG + GLCM) and radiomic feature combinations.
| Metrics | LDA | kNN | GNB | SVM | AdaBoost | RF | Ensemble | XGBoost | NN |
|---|---|---|---|---|---|---|---|---|---|
| Accuracy | 0.814 | 0.836 | 0.775 | 0.793 | 0.782 | 0.864 | 0.857 | 0.855 |
|
| Precision | 0.835 | 0.866 | 0.821 | 0.819 | 0.829 | 0.893 | 0.891 | 0.879 |
|
| Recall | 0.814 | 0.836 | 0.775 | 0.793 | 0.782 | 0.864 | 0.857 | 0.865 |
|
| F1 Score | 0.816 | 0.815 | 0.777 | 0.779 | 0.768 | 0.855 | 0.890 | 0.854 |
|
Classification accuracy obtained using combined deep features (Pool5 of Resnet18 + Conv5 of Densenet121).
| Metrics | LDA | kNN | GNB | SVM | AdaBoost | RF | Ensemble | XGBoost | NN |
|---|---|---|---|---|---|---|---|---|---|
| Accuracy | 0.837 | 0.862 | 0.810 | 0.826 | 0.848 | 0.865 | 0.888 | 0.858 |
|
| Precision | 0.852 | 0.865 | 0.940 | 0.892 | 0.865 | 0.876 | 0.891 | 0.875 |
|
| Recall | 0.837 | 0.862 | 0.810 | 0.826 | 0.848 | 0.865 | 0.888 | 0.858 |
|
| F1 Score | 0.834 | 0.862 | 0.811 | 0.833 | 0.837 | 0.865 | 0.889 | 0.848 |
|
Classification accuracy obtained using all features.
| Metrics | LDA | kNN | GNB | SVM | AdaBoost | RF | Ensemble | XGBoost | NN |
|---|---|---|---|---|---|---|---|---|---|
| Accuracy | 0.849 | 0.877 | 0.812 | 0.842 | 0.857 | 0.879 | 0.900 | 0.875 |
|
| Precision | 0.865 | 0.880 | 0.938 | 0.901 | 0.890 | 0.895 | 0.913 | 0.889 |
|
| Recall | 0.849 | 0.877 | 0.812 | 0.842 | 0.857 | 0.879 | 0.900 | 0.875 |
|
| F1 Score | 0.846 | 0.877 | 0.810 | 0.849 | 0.853 | 0.871 | 0.896 | 0.866 |
|
Comparison of the three-class classification studies from previous CXR-based COVID-19 studies.
| Work | Number of Cases | Preprocessing | Approach | Performance (%) |
|---|---|---|---|---|
| Wang et al. [ |
266 COVID-19 8066 HC 5538 Pneumonia |
DA | COVID-Net |
Accuracy = 93.3 Sensitivity = 91 PPV = 98.9 |
| Ucar et al. [ |
76 COVID-19 1538 HC 4290 Pneumonia |
DA RGB format Normalizing | COVIDiagnosis-Net |
Accuracy = 98.3 Specificity = 99.13 F1-Score = 98.3 |
| Ozturk et al. [ |
127 COVID-19 500 HC 500 Pneumonia |
N/A | DarkCovidNet (CNN) |
Accuracy = 87.02 Specificity = 92.18 Sensitivity = 95.35 Precision = 89.96 F1-Score = 87.37 |
| Li et al. [ |
179 COVID-19 179 HC 179 Pneumonia |
Create a Noisy Snapshot Dataset | KTD framework (DenseNet121, ShuffleNetV2, MobileNetV2) |
Accuracy = 84.3 AUCROC = 94 |
| Punn et al. [ |
108 COVID-19 453 HC 515 Pneumonia |
Class Balancing Methods Binary Thresholding Adaptive Total Variation Method | NASNetLarge |
Accuracy = 98 Specificity = 95 Precision = 88 F1-Score = 89 |
| Elasnaoui et al. [ |
6087 images (2780 Bacterial Pneumonia, 1724 Coronavirus (1493 Viral Pneumonia, 231 COVID-19)) 1583 HC |
Intensity Normalization CLAHE Method DA Resizing | Inception ResNetV2 |
Accuracy = 92.18 Specificity = 96.06 Sensitivity = 92.11 Precision = 92.38 F1-Score = 92.07 |
| Khobahi et al. [ |
99 COVID-19 8851 HC 9579 Pneumonia |
DA | CoroNet (TFEN + CIN modules) |
Accuracy = 93.50 Sensitivity = 90 Precision = 93.63 F1-Score = 93.51 |
| Chowdhury et al. [ |
219 COVID-19 1341 HC 1345 Pneumonia |
DA | PDCOVIDNet (CNN) |
Accuracy = 96.54 Precision = 96.58 Recall = 96.59 F1-Score = 96.58 |
| Chowdhury et al. [ |
589 COVID-19 8851 HC 6053 Pneumonia |
DA | ECOVNet (pre-trained EfficientNet) |
Accuracy = 94.68 Precision = 94.76 Recall = 94.68 F1-Score = 94.70 |
| Perumal et al. [ |
183 COVID-19 8066 HC 5538 Pneumonia |
N/A | INASNET (Inception Nasnet) |
Accuracy = 94.3 Precision = 94.0 Recall = 94.0 F1-Score = 94.0 |
|
|
1093 COVID-19 1341 HC 1345 Pneumonia |
Cropping, DA Histogram Equalization Constant Threshold Contouring | Feature-based Ensemble |
Accuracy = 94.1 Precision = 94.5 Recall = 94.1 F1-Score = 94.0 |