| Literature DB >> 34095168 |
Bo Wang1,2,3, Jingyi Yang4, Jingyang Ai3, Nana Luo5, Lihua An5, Haixia Feng5, Bo Yang6, Zheng You1,2.
Abstract
Three-dimensional (3D) liver tumor segmentation from Computed Tomography (CT) images is a prerequisite for computer-aided diagnosis, treatment planning, and monitoring of liver cancer. Despite many years of research, 3D liver tumor segmentation remains a challenging task. In this paper, we propose an effective and efficient method for tumor segmentation in liver CT images using encoder-decoder based octave convolution networks. Compared with other convolution networks utilizing standard convolution for feature extraction, the proposed method utilizes octave convolutions for learning multiple-spatial-frequency features, thus can better capture tumors with varying sizes and shapes. The proposed network takes advantage of a fully convolutional architecture which performs efficient end-to-end learning and inference. More importantly, we introduce a deep supervision mechanism during the learning process to combat potential optimization difficulties, and thus the model can acquire a much faster convergence rate and more powerful discrimination capability. Finally, we integrate octave convolutions into the encoder-decoder architecture of UNet, which can generate high resolution tumor segmentation in one single forward feeding without post-processing steps. Both architectures are trained on a subset of the LiTS (Liver Tumor Segmentation) Challenge. The proposed approach is shown to significantly outperform other networks in terms of various accuracy measures and processing speed.Entities:
Keywords: deep learning; liver; liver tumor; octave convolution; segmentation
Year: 2021 PMID: 34095168 PMCID: PMC8169966 DOI: 10.3389/fmed.2021.653913
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Computation graph of the multifrequency feature transformation of octave convolution. The operation mainly contains two processes of the inter-frequency information exchange (f and f) and intra-frequency information update (f and f).
Figure 2The octave convolution kernel. k × k octave convolution kernel W is equivalent to vanilla convolution kernel because they have exactly the same number of parameters.
Figure 3Detailed network architecture of the proposed network.
Figure 4Training data provided by LiTS-challenge.
Figure 5Example of tumor segmentation results from a testing image.
Figure 6Compared results of tumor segmentation with different methods.
Comparing different methods with the proposed dataset on the liver tumor segmentation task.
| Precision | 0.872 | 0.896 | 0.901 | 0.914 | 0.926 | 0.939 |
| Recall | 0.923 | 0.930 | 0.931 | 0.925 | 0.951 | 0.962 |
| Accuracy | 0.912 | 0.930 | 0.942 | 0.951 | 0.956 | 0.959 |
| Specificity | 0.909 | 0.917 | 0.918 | 0.957 | 0.966 | 0.967 |
| DICE | 0.923 | 0.942 | 0.945 | 0.958 | 0.961 | 0.963 |
Ablation study results.
| w/o Octave Conv. | 0.921 | 0.939 | 0.938 | 0.942 | 0.947 |
| w Octave Conv. | 0.928 | 0.946 | 0.944 | 0.950 | 0.951 |
| Add 1 Loss | 0.930 | 0.951 | 0.948 | 0.958 | 0.957 |
| Add 2 Loss | 0.936 | 0.959 | 0.952 | 0.962 | 0.960 |
| OCunet | 0.939 | 0.962 | 0.959 | 0.967 | 0.963 |