| Literature DB >> 35053031 |
Si-Yuan Lu1, Zheng Zhang2,3, Yu-Dong Zhang1, Shui-Hua Wang1.
Abstract
Accurate and timely diagnosis of COVID-19 is indispensable to control its spread. This study proposes a novel explainable COVID-19 diagnosis system called CGENet based on graph embedding and an extreme learning machine for chest CT images. We put forward an optimal backbone selection algorithm to select the best backbone for the CGENet based on transfer learning. Then, we introduced graph theory into the ResNet-18 based on the k-nearest neighbors. Finally, an extreme learning machine was trained as the classifier of the CGENet. The proposed CGENet was evaluated on a large publicly-available COVID-19 dataset and produced an average accuracy of 97.78% based on 5-fold cross-validation. In addition, we utilized the Grad-CAM maps to present a visual explanation of the CGENet based on COVID-19 samples. In all, the proposed CGENet can be an effective and efficient tool to assist COVID-19 diagnosis.Entities:
Keywords: computer-aided diagnosis; convolutional neural network; extreme learning machine; feedforward neural network; graph neural network; transfer learning
Year: 2021 PMID: 35053031 PMCID: PMC8773037 DOI: 10.3390/biology11010033
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1Random CT scans in the SARS-CoV-2 dataset ((a) COVID-19 samples, (b) Non-COVID-19 samples).
Figure 2Pipeline of the proposed CGENet.
Pseudocode of the optimal backbone model selection algorithm.
| Step1: | load the COVID-19 dataset. |
| Step2: | load the pre-trained backbone models, including AlexNet, ResNet-18, ResNet-50, MobileNetV2, and EfficientNet. |
| Step3: | modify the structure of these backbones based on the labels of the COVID-19 dataset using ELM. |
| Step4: | train these models and test them based on 5-fold cross-validation. |
| Step5: | compute the average testing accuracies of the 5 backbones. |
| Step6: | output the optimal backbone model which yielded the highest average testing accuracy. |
Figure 3Flowchart of the graph embedding.
Figure 4Structure of an ELM.
Figure 5Structure of an RVFL.
Pseudocode of the optimal classifier selection algorithm.
| Step 1: | load the COVID-19 dataset. |
| Step 2: | load the ResNet-18. |
| Step 3: | modify the structure of the ResNet-18 based on the labels of the COVID-19 dataset using ELM and RVFL, respectively. |
| Step 4: | train the two models and test them based on 5-fold cross-validation. |
| Step 5: | compute the average testing accuracies of the two models. |
| Step 6: | output the CGENet with the better classifier, which yielded the highest average testing accuracy. |
Hyper-parameter settings in the CGENet.
| Method | Hyper-Parameter | Value |
|---|---|---|
| Transfer learning | Batch size | 24 |
| Max epochs | 2 | |
| Learning rate | 1 × 10−4 | |
|
| 4 | |
| ELM and RVFL |
| 512 |
Results of the CGENet based on 5-fold CV (unit: %).
| Accuracy | Sensitivity | Specificity | Precision | F1-Score | |
|---|---|---|---|---|---|
| Fold 1 | 98.18 | 97.63 | 98.76 | 98.80 | 98.21 |
| Fold 2 | 97.18 | 96.47 | 97.93 | 98.01 | 97.23 |
| Fold 3 | 96.57 | 96.41 | 96.73 | 96.80 | 96.61 |
| Fold 4 | 98.59 | 100.00 | 97.23 | 97.20 | 98.58 |
| Fold 5 | 98.39 | 99.19 | 97.60 | 97.61 | 98.39 |
| Average | 97.78 | 97.94 | 97.65 | 97.68 | 97.80 |
Results of the CGENet using different backbones based on 5-fold CV.
| Backbones | Accuracy | Sensitivity | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|
| AlexNet | 75.45 | 78.87 | 73.15 | 69.96 | 73.81 |
| ResNet-18 | 97.78 | 97.94 | 97.65 | 97.68 | 97.80 |
| ResNet-50 | 97.50 | 97.83 | 97.18 | 97.20 | 97.51 |
| MobileNetV2 | 97.02 | 97.37 | 96.73 | 96.72 | 97.03 |
| EfficientNet | 97.50 | 97.70 | 97.33 | 97.36 | 97.52 |
Results of the CGENet with different graph embeddings (unit: %).
| Value of | Accuracy | Sensitivity | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|
| 3 | 96.90 | 96.25 | 97.62 | 97.68 | 96.95 |
| 4 | 97.78 | 97.94 | 97.65 | 97.68 | 97.80 |
| 5 | 96.94 | 96.76 | 97.15 | 97.20 | 96.97 |
| 6 | 96.90 | 96.98 | 96.85 | 96.88 | 96.92 |
Results of the CGENet using different classifiers based on 5-fold CV.
| Classifiers | Accuracy | Sensitivity | Specificity | Precision | F1-Score |
|---|---|---|---|---|---|
| ELM | 97.78 | 97.94 | 97.65 | 97.68 | 97.80 |
| RVFL | 97.30 | 97.99 | 96.64 | 96.65 | 97.31 |
Figure 6Grad-CAM heat maps of COVID-19 cases.
Comparison with other existing approaches (unit: %).
| Methods | Accuracy | Sensitivity | Specificity | Precision | F1-Score | Dataset |
|---|---|---|---|---|---|---|
| xDNN [ | 97.38 | 95.53 | - | 99.16 | 97.31 | SARS-CoV-2 |
| SepNorm + Contrastive [ | 90.83 | 85.89 | - | 95.75 | 90.87 | SARS-CoV-2 |
| FGCNet [ | 97.14 | 97.71 | 96.56 | 96.61 | 97.15 | Private dataset |
| DarkCovidNet [ | 98.08 | 95.13 | 95.3 | 98.03 | 96.51 | Mixed public datasets |
| Dual-Track Learning [ | - | 86.00 | - | 89.60 | 87.80 | Private dataset |
| CGENet (ours) | 97.78 | 97.94 | 97.65 | 97.68 | 97.80 | SARS-CoV-2 |