| Literature DB >> 36212877 |
Lihong Peng1,2, Chang Wang1, Geng Tian3, Guangyi Liu1, Gan Li1, Yuankang Lu1, Jialiang Yang3, Min Chen4, Zejun Li4.
Abstract
COVID-19 has caused enormous challenges to global economy and public health. The identification of patients with the COVID-19 infection by CT scan images helps prevent its pandemic. Manual screening COVID-19-related CT images spends a lot of time and resources. Artificial intelligence techniques including deep learning can effectively aid doctors and medical workers to screen the COVID-19 patients. In this study, we developed an ensemble deep learning framework, DeepDSR, by combining DenseNet, Swin transformer, and RegNet for COVID-19 image identification. First, we integrate three available COVID-19-related CT image datasets to one larger dataset. Second, we pretrain weights of DenseNet, Swin Transformer, and RegNet on the ImageNet dataset based on transformer learning. Third, we continue to train DenseNet, Swin Transformer, and RegNet on the integrated larger image dataset. Finally, the classification results are obtained by integrating results from the above three models and the soft voting approach. The proposed DeepDSR model is compared to three state-of-the-art deep learning models (EfficientNetV2, ResNet, and Vision transformer) and three individual models (DenseNet, Swin transformer, and RegNet) for binary classification and three-classification problems. The results show that DeepDSR computes the best precision of 0.9833, recall of 0.9895, accuracy of 0.9894, F1-score of 0.9864, AUC of 0.9991 and AUPR of 0.9986 under binary classification problem, and significantly outperforms other methods. Furthermore, DeepDSR obtains the best precision of 0.9740, recall of 0.9653, accuracy of 0.9737, and F1-score of 0.9695 under three-classification problem, further suggesting its powerful image identification ability. We anticipate that the proposed DeepDSR framework contributes to the diagnosis of COVID-19.Entities:
Keywords: COVID-19 pneumonia; CT scan image; DenseNet; RegNet; Swin transformer; deep ensemble
Year: 2022 PMID: 36212877 PMCID: PMC9539545 DOI: 10.3389/fmicb.2022.995323
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
Figure 1Image examples in dataset.
Figure 2The pipeline for COVID-19-related CT image classification based on an ensemble of DenseNet, RegNet, and Swin transformer.
Figure 3(A) The DenseNet Block; (B) Shifted-Window technique; (C) The Squeeze-and-Excitation network.
The confusion matrix.
| True results | |||
|---|---|---|---|
| Positive | Negative | ||
| Predicted results | Positive | TP | FP |
| Negative | FN | TN | |
Parameter settings.
| Model | Parameter setting |
|---|---|
| Swin transformer | epochs = 100, batch_size = 8, lr = 0.0001 |
| RegNet | epochs = 100, batch_size = 16, lr = 0.001, lrf = 0.01 |
| DenseNet | epochs = 100, batch_size = 16, lr = 0.001, lrf = 0.01 |
The performance comparison of DeepDSR and other models for COVID-19 image binary classification.
| Precision | Recall | Accuracy | F1-score | AUC | AUPR | |
|---|---|---|---|---|---|---|
| EfficientNetV2 | 0.5077 | 0.9015 | 0.6231 | 0.6495 | 0.7800 | 0.6649 |
| ResNet | 0.9786 | 0.9602 | 0.9764 | 0.9693 | 0.9960 | 0.9943 |
| Vision transformer | 0.9811 | 0.9769 | 0.9838 | 0.9790 | 0.9982 | 0.9975 |
| DeepDSR |
|
|
|
|
|
|
The bold fonts represent the best performance in each column.
The performance comparison of DeepDSR and three individual models for binary classification problem.
| Precision | Recall | Accuracy | F1-score | AUC | AUPR | |
|---|---|---|---|---|---|---|
| Swin transformer | 0.9619 | 0.9539 | 0.9675 | 0.9579 | 0.9943 | 0.9924 |
| RegNet | 0.9571 | 0.9832 | 0.9764 | 0.9700 | 0.9963 | 0.9949 |
| DenseNet | 0.9770 | 0.9790 | 0.9829 | 0.9780 | 0.9981 | 0.9973 |
| DeepDSR |
|
|
|
|
|
|
The bold fonts represent the best performance in each column.
Statistical analyses of four models on 1,231 images.
| DenseNet | Swin transformer | RegNet | DeepDSR | |
|---|---|---|---|---|
| TN | 743 | 736 | 733 |
|
| FN | 10 | 22 | 8 |
|
| FP | 11 | 18 | 21 |
|
| TP | 467 | 455 | 469 |
|
The bold fonts represent the best performance in each column.
Figure 4(A) The performance comparison of DeepDSR and other models for COVID-19 image binary classification. (B,C) The AUC and AUPR values of DeepDSR and other models for COVID-19 image binary classification.
Figure 5(A) The performance comparison of DeepDSR and three individual models for COVID-19 binary classification problem; (B,C) The AUC and AUPR values of DeepDSR and three individual models for COVID-19 binary classification problem.
Figure 6Statistical analysis of four methods on 1,231 images.
The affect of transfer learning on the performance.
| Precision | Recall | Accuracy | F1-score | AUC | AUPR | |
|---|---|---|---|---|---|---|
| With pre-train |
|
|
|
|
|
|
| Without pre-train | 0.8773 | 0.914 | 0.9171 | 0.8953 | 0.9716 | 0.9455 |
| Without pre-train (200 epoch) | 0.9544 | 0.9224 | 0.9529 | 0.9382 | 0.9866 | 0.9821 |
The bold fonts represent the best performance in each column.
Figure 7The affect of transfer learning on the performance.
The performance of DeepDSR and other models for three-classification problem.
| Precision | Recall | Accuracy | F1-score | |
|---|---|---|---|---|
| EfficientNet V2 | 0.4023 | 0.4479 | 0.5132 | 0.3736 |
| ResNet | 0.9487 | 0.9397 | 0.9541 | 0.9439 |
| Vision transformer | 0.7112 | 0.6264 | 0.7373 | 0.6301 |
| Swin transformer | 0.9488 | 0.9371 | 0.9548 | 0.9424 |
| RegNet | 0.9492 | 0.9463 | 0.9568 | 0.9476 |
| DenseNet | 0.9552 | 0.953 | 0.9608 | 0.9541 |
| DeepDSR |
|
|
|
|
The bold fonts represent the best performance in each column.
Figure 8The performance of DeepDSR and other models for three-classification problem.