| Literature DB >> 34226836 |
Fuli Yu1, Yu Zhu1, Xiangxiang Qin1, Ying Xin2, Dawei Yang3, Tao Xu4.
Abstract
At the end of 2019, a novel coronavirus COVID-19 broke out. Due to its high contagiousness, more than 74 million people have been infected worldwide. Automatic segmentation of the COVID-19 lesion area in CT images is an effective auxiliary medical technology which can quantitatively diagnose and judge the severity of the disease. In this paper, a multi-class COVID-19 CT image segmentation network is proposed, which includes a pyramid attention module to extract multi-scale contextual attention information, and a residual convolution module to improve the discriminative ability of the network. A wavelet edge loss function is also proposed to extract edge features of the lesion area to improve the segmentation accuracy. For the experiment, a dataset of 4369 CT slices is constructed, including three symptoms: ground glass opacities, interstitial infiltrates, and lung consolidation. The dice similarity coefficients of three symptoms of the model achieve 0.7704, 0.7900, 0.8241 respectively. The performance of the proposed network on public dataset COVID-SemiSeg is also evaluated. The results demonstrate that this model outperforms other state-of-the-art methods and can be a powerful tool to assist in the diagnosis of positive infection cases, and promote the development of intelligent technology in the medical field.Entities:
Keywords: Biology and medical computing; Computer vision and image processing techniques; Image recognition; Optical, image and video signal processing; Patient diagnostic methods and instrumentation; X‐ray techniques: radiography and computed tomography (biomedical imaging/measurement); X‐rays and particle beams (medical uses)
Year: 2021 PMID: 34226836 PMCID: PMC8242907 DOI: 10.1049/ipr2.12249
Source DB: PubMed Journal: IET Image Process ISSN: 1751-9659 Impact factor: 1.773
FIGURE 1Examples of COVID‐19 infected regions in CT slices, where green, yellow, red denote the GGO, interstitial infiltrates and consolidation respectively
FIGURE 2The architecture of the proposed model
The detail parameters of the proposed model
|
|
|
|
|---|---|---|
| 4 | Res‐block4 | 32,32,1024 |
| PAM4 | 32,32,2048 | |
| RCM4 | 32,32,512 | |
| RFB | 32,32,1024 | |
| 3 | Res‐block3 | 64,64,512 |
| PAM3 | 64,64,1024 | |
| RCM3 | 64,64,256 | |
| 2 | Res‐block2 | 128,128,256 |
| PAM2 | 128,128,512 | |
| RCM2 | 128,128,64 | |
| 1 | Res‐block1 | 256,256,64 |
| PAM1 | 256,256,128 | |
| RCM1 | 256,256,32 | |
| Conv | 512,512,4 |
FIGURE 3The architecture of the pyramid attention module (PAM)
The composition of the dataset
|
|
|
|---|---|
| Ground glass opacities | 1091 |
| Interstitial infiltrates | 1012 |
| Consolidation | 1120 |
| Ground glass opacities and interstitial infiltrates | 343 |
| Ground glass opacities and consolidation | 93 |
| Interstitial infiltrates and consolidation | 32 |
| Three types | 17 |
| Normal | 661 |
| Sum | 4369 |
Results of ablation experiments
|
|
|
|
|
|
|---|---|---|---|---|
| A (Backbone) | Ground glass opacities | 0.7041 | 0.7300 | 0.9983 |
| Interstitial infiltrates | 0.7215 | 0.7590 | 0.9955 | |
| Consolidation | 0.8032 | 0.8357 | 0.9983 | |
| B (Backbone + RCM) | Ground glass opacities | 0.7255 | 0.7344 | 0.9988 |
| Interstitial Iinfiltrates | 0.7401 | 0.7520 | 0.9968 | |
| Consolidation | 0.8130 | 0.8358 | 0.9985 | |
| C (Backbone + RCM + PAM) | Ground glass Oopacities | 0.7642 | 0.7604 | 0.9988 |
| Interstitial infiltrates | 0.7733 | 0.8031 | 0.9959 | |
| Consolidation |
| 0.8304 |
| |
| D Our Nnet (Backbone + RCM + PAM + RFB) | Ground glass opacities |
|
|
|
| Interstitial infiltrates |
|
| 0.9968 | |
| Consolidation | 0.8241 |
| 0.9987 | |
| E Our net (w/o wavelet loss) | Ground glass opacities | 0.7546 | 0.7801 | 0.9988 |
| Interstitial infiltrates | 0.7750 | 0.7903 |
| |
| Consolidation | 0.8043 | 0.7972 | 0.9989 |
Segmentation results of our model and other state‐of‐the‐art methods
|
|
|
|
|
|
|---|---|---|---|---|
| UNet++ [ | Ground glass opacities | 0.6955 | 0.6591 | 0.9991 |
| Interstitial infiltrates | 0.6994 | 0.7110 | 0.9956 | |
| Consolidation | 0.7880 | 0.7696 |
| |
| Attention UNet [ | Ground glass opacities | 0.7347 | 0.7026 |
|
| Interstitial infiltrates | 0.7570 | 0.7756 | 0.9957 | |
| Consolidation | 0.8018 | 0.8126 | 0.9987 | |
| UNet‐CBAM [ | Ground glass opacities | 0.7469 | 0.7656 | 0.9988 |
| Interstitial infiltrates | 0.7440 | 0.7768 | 0.9956 | |
| Consolidation | 0.8121 | 0.8199 |
| |
| Our model | Ground glass opacities |
|
| 0.9989 |
| Interstitial infiltrates |
|
|
| |
| Consolidation |
|
| 0.9987 |
FIGURE 4Some segmentation examples of our model and advanced methods, in which green, yellow, red represent three types of symptoms: ground glass opacities, interstitial infiltrates, and lung consolidation
Results of our model and recently proposed methods for binary segmentation on COVID‐SemiSeg dataset
|
|
|
|
|
|---|---|---|---|
| Semi‐Inf‐Net [ | 0.739 | 0.725 | 0.960 |
| MiniSeg [ | 0.773 |
| 0.974 |
| CB‐PL [ | 0.730 | 0.820 | 0.920 |
| Ours |
| 0.791 |
|
Results of our model and other methods for multi‐class segmentation on COVID‐SemiSeg dataset
|
|
|
| ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| DeepLab‐v3+ [ | 0.443 | 0.713 | 0.823 | 0.238 | 0.310 | 0.708 |
| FCN8s [ | 0.471 | 0.537 | 0.905 | 0.279 | 0.268 | 0.716 |
| Semi‐Inf‐Net & FCN8s [ | 0.646 | 0.720 | 0.941 | 0.301 | 0.235 | 0.808 |
| Semi‐Inf‐Net and MC [ | 0.624 | 0.618 | 0.966 | 0.458 | 0.509 | 0.967 |
| Ours |
|
|
|
|
|
|
FIGURE 5Visualization of our model, the four activation maps are obtained from four stages, and the bottom feature map is the output of the fourth stage
FIGURE 6The COVID‐19 segmentation results with pulmonary tuberculosis. (a) The CT image with the tuberculosis lesion tag and segmentation results, (b) the detected mycobacterium tuberculosis (shown in red) by Ziehl–Neelsen stain