| Literature DB >> 35408067 |
Lingyun Li1, Hongbing Ma1,2.
Abstract
Segmenting medical images is a necessary prerequisite for disease diagnosis and treatment planning. Among various medical image segmentation tasks, U-Net-based variants have been widely used in liver tumor segmentation tasks. In view of the highly variable shape and size of tumors, in order to improve the accuracy of segmentation, this paper proposes a U-Net-based hybrid variable structure-RDCTrans U-Net for liver tumor segmentation in computed tomography (CT) examinations. We design a backbone network dominated by ResNeXt50 and supplemented by dilated convolution to increase the network depth, expand the perceptual field, and improve the efficiency of feature extraction without increasing the parameters. At the same time, Transformer is introduced in down-sampling to increase the network's overall perception and global understanding of the image and to improve the accuracy of liver tumor segmentation. The method proposed in this paper tests the segmentation performance of liver tumors on the LiTS (Liver Tumor Segmentation) dataset. It obtained 89.22% mIoU and 98.91% Acc, for liver and tumor segmentation. The proposed model also achieved 93.38% Dice and 89.87% Dice, respectively. Compared with the original U-Net and the U-Net model that introduces dense connection, attention mechanism, and Transformer, respectively, the method proposed in this paper achieves SOTA (state of art) results.Entities:
Keywords: ResNeXt50; U-Net; dilated convolution; liver tumor segmentation; transformer
Mesh:
Year: 2022 PMID: 35408067 PMCID: PMC9003011 DOI: 10.3390/s22072452
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Residual structure.
Figure 2Network structure diagram of RDCTrans U-Net.
Figure 3(a) An early version of Res-Net; (b) an aggregation block of ResNeXt50.
Figure 4(a) The structure of convolution block; (b) the original structure of Bottleneck; (c) the structure of Bottleneck replaced with dilated convolution.
Figure 5Encoder schematic of Transformer.
Figure 6Loss function of the proposed model RDCTrans U-Net.
Comparison of overall segmentation performance of each model on LiTS dataset.
| Network Structure | ||
|---|---|---|
| Original U-Net | 96.65 | 74.29 |
| Attention U-Net | 98.06 | 83.09 |
| Dense U-Net | 96.93 | 79.18 |
| Trans U-Net | 98.17 | 83.32 |
| RDCTrans U-Net | 98.91 | 89.22 |
The segmentation results of liver and tumor of each model on LiTS data set.
| Network Structure | ||||||
|---|---|---|---|---|---|---|
| Liver | Tumor | Liver | Tumor | Liver | Tumor | |
| Original U-Net | 83.99 | 78.01 | 75.44 | 68.02 | 94.73 | 91.41 |
| Attention U-Net | 91.62 | 89.47 | 87.13 | 83.79 | 98.3 | 95.35 |
| Dense U-Net | 89.24 | 78.89 | 84.64 | 67.98 | 94.36 | 93.95 |
| Trans U-Net | 89.71 | 82.62 | 83.19 | 73.58 | 97.34 | 93.82 |
| RDCTrans U-Net | 93.38 | 89.87 | 88.65 | 86.52 | 98.89 | 94.31 |
Results of the ablation study of the proposed model RDCTrans U-Net.
| Network Structure | ||
|---|---|---|
| ResNeXt U-Net | 96.79 | 80.92 |
| Dilated ResNeXt U-Net RDCTrans U-Net | 97.32 | 83.15 |
Figure 7Illustrates the liver and tumor segmentation results of different methods on the test dataset. The red area represents the liver, and the green area represents the tumor.