| Literature DB >> 33976457 |
Xia Ma1,2, Bingbing Zheng3, Yu Zhu3, Fuli Yu3, Rixin Zhang2, Budong Chen4.
Abstract
Since discovered in Hubei, China in December 2019, Corona Virus Disease 2019 named COVID-19 has lasted more than one year, and the number of new confirmed cases and confirmed deaths is still at a high level. COVID-19 is an infectious disease caused by SARS-CoV-2. Although RT-PCR is considered the gold standard for detection of COVID-19, CT plays an important role in the diagnosis and evaluation of the therapeutic effect of COVID-19. Diagnosis and localization of COVID-19 on CT images using deep learning can provide quantitative auxiliary information for doctors. This article proposes a novel network with multi-receptive field attention module to diagnose COVID-19 on CT images. This attention module includes three parts, a pyramid convolution module (PCM), a multi-receptive field spatial attention block (SAB), and a multi-receptive field channel attention block (CAB). The PCM can improve the diagnostic ability of the network for lesions of different sizes and shapes. The role of SAB and CAB is to focus the features extracted from the network on the lesion area to improve the ability of COVID-19 discrimination and localization. We verify the effectiveness of the proposed method on two datasets. The accuracy rate of 97.12%, specificity of 96.89%, and sensitivity of 97.21% are achieved by the proposed network on DTDB dataset provided by the Beijing Ditan Hospital Capital Medical University. Compared with other state-of-the-art attention modules, the proposed method achieves better result. As for the public COVID-19 SARS-CoV-2 dataset, 95.16% for accuracy, 95.6% for F1-score and 99.01% for AUC are obtained. The proposed network can effectively assist doctors in the diagnosis of COVID-19 CT images.Entities:
Keywords: Attention; Auxiliary diagnosis; COVID-19; Deep learning
Year: 2021 PMID: 33976457 PMCID: PMC8103744 DOI: 10.1016/j.ijleo.2021.167100
Source DB: PubMed Journal: Optik (Stuttg) ISSN: 0030-4026 Impact factor: 2.443
Fig. 1Architecture of proposed network. (For interpretation of the references to color in this legend, the reader is referred to the web version of this article.)
Fig. 2Architecture of PCM, SAB and CAB. (a) is the structure of PCM, (b) is the structure of SAB, (c) is the structure of CAB. As illustrated, the outputs of PCM are sent to both SAB and CAB which make the attention blocks contain multi-receptive field information.
The size and number of kernels for different input feature maps (based on VGG16).
| Input feature maps | Number of kernels | Kernel sizes | Channel |
|---|---|---|---|
| 4 | 3, 9, 15, 21 | 64 | |
| 4 | 3, 7, 11, 15 | 128 | |
| 4 | 3, 5, 7, 9 | 256 | |
| 4 | 1, 3, 5, 7 | 512 | |
| 4 | 1, 3, 5, 7 | 512 |
Experimental results of ablation experiments (best in bold).
| Description | Accuracy (%) | Specificity (%) | Sensitivity (%) |
|---|---|---|---|
| VGG16 | 93.02 | 94.32 | 91.25 |
| VGG16 + SAB | 94.76 | 96.35 | 93.32 |
| VGG16 + CAB | 96.02 | 96.83 | 95.13 |
| VGG16 + SAB + CAB |
Fig. 3The ROC curve (left) and confusion matrix of ablation studies (right).
COVID-19 diagnostic results on DTDB COVID-19 dataset.(best in bold).
| Description (based onVGG16) | Accuracy (%) | Specificity (%) | Sensitivity (%) |
|---|---|---|---|
| VGG16 | 93.02 | 94.32 | 91.25 |
| RAN | 94.76 | 94.53 | 95.10 |
| SE-Net | 94.06 | 95.35 | 92.56 |
| CBAM | 94.27 | 96.73 | 91.76 |
| Ours |
Fig. 4The ROC curve (left) and the confusion matrix of different attention modules (right).
COVID-19 diagnostic results on SARS-CoV-2 dataset. (best in bold).
| Description | Accuracy (%) | F1-score (%) | AUC (%) |
|---|---|---|---|
| VGG16 | 86.87 | 87.46 | 92.16 |
| Wang et al. | 90.83 | 90.87 | 96.24 |
| Harsh et al. | 95 | 95 | – |
| ours |
Fig. 5The visualization results of Grad-CAM. Row 1 is the visualization results and row 2 are the corresponding original CT images.
Fig. 6COVID-19 lesion localization in different periods of patients.
Fig. 7Combined diagram for variation of weighted normalized maximum area in different periods. The blue dotted tend line represents the change trend of the WNM-Area of the patients over time.