| Literature DB >> 35957432 |
Yi Liu1,2, Guanghui Han1,2,3, Xiujian Liu1,2.
Abstract
Nasopharyngeal carcinoma (NPC) is a category of tumours with a high incidence in head-and-neck. To treat nasopharyngeal cancer, doctors invariably need to perform focal segmentation. However, manual segmentation is time consuming and laborious for doctors and the existing automatic segmentation methods require large computing resources, which makes some small and medium-sized hospitals unaffordable. To enable small and medium-sized hospitals with limited computational resources to run the model smoothly and improve the accuracy of structure, we propose a new LW-UNet network. The network utilises lightweight modules to form the Compound Scaling Encoder and combines the benefits of UNet to make the model both lightweight and accurate. Our model achieves a high accuracy with a Dice coefficient value of 0.813 with 3.55 M parameters and 7.51 G of FLOPs within 0.1 s (testing time in GPU), which is the best result compared with four other state-of-the-art models.Entities:
Keywords: deep learning; lightweight; medical image segmentation; nasopharyngeal carcinoma
Mesh:
Year: 2022 PMID: 35957432 PMCID: PMC9371217 DOI: 10.3390/s22155875
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1MRI slices of head and neck and tumour location of nasopharyngeal carcinoma. The area delineated by the red line is the result of the doctor’s manual segmentation.
Figure 2Network of LW-UNET: The Compound Scaling Encoder of LW-UNet consists of seven MBConv blocks of different sizes. Decoder is composed of a series of up-convolutions, and through skip connection and up-convolution can obtain the final segmentation map of nasopharyngeal carcinoma.
The scaling coefficient of LW-UNet 0–3.
| Models | Depth Scaling Coefficient | Width Scaling Coefficient | Resolution Scaling Coefficient |
|---|---|---|---|
| LW-UNet 0 | 1.0 | 1.0 | 1.0 |
| LW-UNet 1 | 1.4 | 1.2 | 1.3 |
| LW-UNet 2 | 2.2 | 1.6 | 2.0 |
| LW-UNet 3 | 3.1 | 2.0 | 2.7 |
| LW-UNet 4 | 3.6 | 2.2 | 3.0 |
| LW-UNet 5 | 4.1 | 2.4 | 3.4 |
The ablation analysis validates the effectiveness of our model configuration.
| Models | DSC | IoU | JC | SE | PC | SP |
|---|---|---|---|---|---|---|
| UNet | 0.769 ± 0.063 | 0.618 ±0.058 | 0.632 ±0.076 | 0.858 ± 0.076 | 0.713 ± 0.089 | 0.996 ± 0.002 |
| LW-UNet-0 | 0.696 ± 0.035 | 0.516 ± 0.043 | 0.542 ± 0.044 | 0.674 ± 0.035 | 0.623 ± 0.067 | 0.986 ± 0.001 |
| LW-UNet-1 | 0.771 ± 0.041 | 0.621 ± 0.058 | 0.634 ± 0.058 | 0.801 ± 0.058 | 0.765 ± 0.059 | 0.997 ± 0.001 |
| LW-UNet-2 | 0.796 ± 0.060 | 0.685 ±0.035 | 0.685 ±0.052 | 0.815 ± 0.048 | 0.767 ±0.053 | 0.998 ± 0.001 |
| LW-UNet-3 (Our) | 0.813 ± 0.039 | 0.696± 0.055 | 0.695 ± 0.055 | 0.824 ± 0.044 | 0.787± 0.043 | 0.998 ± 0.001 |
| LW-UNet-4 | 0.815 ± 0.075 | 0.698 ± 0.043 | 0.699 ± 0.043 | 0.814 ± 0.011 | 0.787± 0.056 | 0.998 ± 0.001 |
| LW-UNet-5 | 0.806 ± 0.054 | 0.688 ± 0.027 | 0.689 ± 0.076 | 0.820 ± 0.058 | 0.774± 0.084 | 0.998 ± 0.001 |
The ablation analysis validates the effectiveness of our data augmentation.
| Models | DSC | IoU | JC | SE | PC | SP |
|---|---|---|---|---|---|---|
| LW-UNet-0 | 0.632 ± 0.081 | 0.443 ± 0.028 | 0.482 ± 0.056 | 0.561 ± 0.048 | 0.545 ± 0.059 | 0.976 ± 0.001 |
| LW-UNet-0 | 0.696 ± 0.035 | 0.516 ± 0.043 | 0.542 ± 0.044 | 0.674 ± 0.035 | 0.623 ± 0.067 | 0.986 ± 0.001 |
| LW-UNet-1 | 0.698 ± 0.071 | 0.513 ± 0.098 | 0.553 ± 0.078 | 0.668 ± 0.038 | 0.626 ± 0.079 | 0.984 ± 0.002 |
| LW-UNet-1 | 0.771 ± 0.041 | 0.621 ± 0.058 | 0.634 ± 0.058 | 0.801 ± 0.058 | 0.765 ± 0.059 | 0.997 ± 0.001 |
| LW-UNet-2 | 0.718 ± 0.053 | 0.561 ± 0.076 | 0.561 ±0.055 | 0.736 ± 0.088 | 0.647 ±0.043 | 0.988 ± 0.001 |
| LW-UNet-2 | 0.796 ± 0.060 | 0.685 ±0.035 | 0.685 ±0.052 | 0.815 ± 0.048 | 0.767 ±0.053 | 0.998 ± 0.001 |
| LW-UNet-3 | 0.735 ± 0.053 | 0.574 ± 0.068 | 0.583 ±0.096 | 0.775 ± 0.064 | 0.686 ± 0.085 | 0.995 ± 0.003 |
| LW-UNet-3 | 0.813 ± 0.039 | 0.696± 0.055 | 0.695 ± 0.055 | 0.824 ± 0.044 | 0.787± 0.043 | 0.998 ± 0.001 |
Figure 3Examples of NPC segmentation results: We select four typical MRI images of nasopharyngeal carcinoma and present the segmentation results of our model and six models used for comparison.
Figure 4Box plots of the DSC and JC values obtained from the tests on the test set and its Kruskal–Wallis results. The results show that our model achieves the highest DSC and JC values in the test of nasopharyngeal carcinoma segmentation and is significantly different from other models.
Figure 5Box plots of the SE and IoU values obtained from the tests on the test set and its Kruskal–Wallis results. The results show that our model achieves the highest SE and IoU values in the test of nasopharyngeal carcinoma segmentation and is significantly different from other models.
Comparison Parameters and FLOPs with Other Models.
| Category | Models | Parameters (M) | FLOPs (G) |
|---|---|---|---|
| Ablation Study Models | LW-UNet 5 | 7.36 | 9.01 |
| LW-UNet 4 | 5.32 | 8.38 | |
| LW-UNet 3 (Our) | 3.55 | 7.51 | |
| LW-UNet 2 | 2.22 | 4.75 | |
| LW-UNet 1 | 1.27 | 2.99 | |
| LW-UNet 0 | 0.85 | 1.85 | |
| UNet | 34.53 | 65.47 | |
| State-of-the-art Models | DeepLabV3 | 15.31 | 16.14 |
| Att-UNet | 34.88 | 66.57 | |
| FCN32 | 14.72 | 20.07 | |
| RendUNet | 45.80 | 47.58 | |
| FastTransNet | 29.85 | 20.50 | |
| TransNet | 105.28 | 24.64 |