| Literature DB >> 35681525 |
Ruifeng Bai1,2, Xinrui Liu2,3, Shan Jiang1, Haijiang Sun1.
Abstract
Automatic extraction of cerebral vessels and cranial nerves has important clinical value in the treatment of trigeminal neuralgia (TGN) and hemifacial spasm (HFS). However, because of the great similarity between different cerebral vessels and between different cranial nerves, it is challenging to segment cerebral vessels and cranial nerves in real time on the basis of true-color microvascular decompression (MVD) images. In this paper, we propose a lightweight, fast semantic segmentation Microvascular Decompression Network (MVDNet) for MVD scenarios which achieves a good trade-off between segmentation accuracy and speed. Specifically, we designed a Light Asymmetric Bottleneck (LAB) module in the encoder to encode context features. A Feature Fusion Module (FFM) was introduced into the decoder to effectively combine high-level semantic features and underlying spatial details. The proposed network has no pretrained model, fewer parameters, and a fast inference speed. Specifically, MVDNet achieved 76.59% mIoU on the MVD test set, has 0.72 M parameters, and has a 137 FPS speed using a single GTX 2080Ti card.Entities:
Keywords: encoder–decoder; microvascular decompression; real-time semantic segmentation
Mesh:
Year: 2022 PMID: 35681525 PMCID: PMC9180010 DOI: 10.3390/cells11111830
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 7.666
Figure 1Relationship observed between the facial nerve and the responsible vessel during an operation. (a) The anterior inferior cerebellar artery crosses the facial nerve, resulting in compression. (b) Teflon filaments are inserted between the facial nerve and the anterior inferior cerebellar artery to relieve compression.
Figure 2(a) ResNet bottleneck design. (b) ERFNet non-bottleneck-1D module. (c) Our LAB module. w: the number of input channels; DConv: depthwise dilated convolution.
Figure 3Feature Fusion Module.
Figure 4Architecture of the proposed MVDNet. C: concatenation; dashed lines indicate average pooling operations.
Architecture details of the proposed MVDNet.
| Layer | Operator | Mode | Channel | Output size |
|---|---|---|---|---|
| 1 | Stride 2 | 32 |
| |
| 2 | Stride 1 | 32 |
| |
| 3 | Stride 1 | 32 |
| |
| 4–5 | Dilated 2 | 32 |
| |
| 6 | Stride 2 | 64 |
| |
| 7–8 | Dilated 4 | 64 |
| |
| 9 | Stride 2 | 128 |
| |
| 10–12 | Dilated 8 | 128 |
| |
| 13 | - | 128 |
| |
| 14 | - | 64 |
| |
| 15 | - | 32 |
| |
| 16 | Stride 1 | 10 |
| |
| 17 | Bilinear interpolation |
| 10 |
|
Figure 5Colormap.
Abbreviations of medical terms.
| Abbreviation | Full Name in English |
|---|---|
| cn5 | Trigeminal nerve |
| cn7 | Facial nerve |
| cn9 | Glossopharyngeal nerve |
| cn10 | Vagus nerve |
| aica | Anterior inferior cerebellar artery |
| pica | Posterior inferior cerebellar artery |
| aica + cn7 | Anterior inferior cerebellar artery and facial nerve |
| pica + cn7 | Posterior inferior cerebellar artery and facial nerve |
| pv | Petrosal vein |
Results of the LAB encoder with different combinations of dilation rates.
| Name | Dilation Rates | mIoU (%) |
|---|---|---|
| LAB_N2M2L4 | 2,4,8 | 73.49 |
| LAB_N2M2L4 | 4,8,16 | 73.07 |
Results of the LAB encoder with different settings. , , .
| Downsampling | Concatenation | mIoU (%) |
|---|---|---|
| 73.49 | ||
| ✓ | 73.71 | |
| ✓ | 73.76 | |
| ✓ | ✓ | 73.83 |
Results of MVDNet with different depths; the number of parameters and FLOPs are estimated for a input.
|
|
|
| Params(M) | FLOPs (G) | mIoU (%) |
|---|---|---|---|---|---|
| 2 | 2 | 2 | 0.56 | 4.24 | 72.89 |
| 2 | 2 | 4 | 0.59 | 4.38 | 73.83 |
| 2 | 4 | 4 | 0.60 | 4.54 | 74.59 |
| 4 | 4 | 4 | 0.60 | 4.71 | 74.63 |
| 2 | 8 | 8 | 0.69 | 5.13 | 74.76 |
Results of the FFM module with different components. , , .
| FFM | Average Pooling | mIoU (%) |
|---|---|---|
| w/o | - | 74.59 |
| w | 77.02 | |
| w | ✓ | 77.45 |
Dilation of MVDNet effect on mIoU.
| Model | mIoU (%) | Params (M) |
|---|---|---|
| MVDNet | 77.45 | 0.72 |
| MVDNet_w/o dilation | 75.59 | 0.72 |
| 76.76 | 0.72 |
Speed and accuracy comparison of MVDNet on the MVD test set.
| Method | Params (M) | Time (ms) | Speed (fps) | mIoU (%) |
|---|---|---|---|---|
| ENet [ | 0.36 | 12.7 | 78.5 | 51.69 |
| ESPNet [ | 0.19 | 6.4 | 156.4 | 57.71 |
| FSSNet [ | 0.17 | 7.6 | 131.7 | 61.27 |
| CGNet [ | 0.49 | 11.4 | 87.4 | 71.35 |
| EDANet [ | 0.69 | 8 | 125 | 74.49 |
| ContextNet [ | 0.88 | 6.1 | 163.3 | 75.62 |
| DABNet [ | 0.75 | 7.7 | 129.1 | 76.29 |
| MVDNet (Ours) | 0.72 | 7.3 | 137.6 | 76.59 |
Figure 6Training loss curve.
Figure 7Visual comparison on MVD validation set. From top to bottom: input images, segmentation outputs from ENet [65], ESPNet [64], FSSNet [66], CGNet [67], EDANet [63], ContextNet [68], DABNet [62], our MVDNet, and ground truth.
Accuracy comparison for different age groups on the MVD test set.
| Age | mIoU (%) | cn5 | cn7 | cn9 | cn10 | aica + cn7 | pica + cn7 | pica | aica | pv |
|---|---|---|---|---|---|---|---|---|---|---|
| 40–50 | 76.87 | 82.51 | 82.52 | 74.12 | 76.93 | 77.46 | 88.43 | 74.08 | 71.43 | 64.35 |
| 50–60 | 77.11 | 82.62 | 84.41 | 79.39 | 81.37 | 73.55 | 87.89 | 73.69 | 70.4 | 60.64 |
| 60–70 | 76.25 | 84.05 | 87.29 | 75.08 | 81.29 | 79.41 | 88.12 | 70.33 | 68.43 | 52.37 |
Accuracy comparison for gender on the MVD test set.
| Gender | mIoU(%) | cn5 | cn7 | cn9 | cn10 | aica + cn7 | pica + cn7 | pica | aica | pv |
|---|---|---|---|---|---|---|---|---|---|---|
| Male | 76.4 | 83.18 | 84.23 | 77.22 | 80.12 | 71.6 | 88.71 | 73.68 | 68.62 | 60.21 |
| Female | 76.29 | 85.19 | 85.19 | 73.06 | 74.41 | 78.88 | 87.93 | 72.38 | 71.74 | 60.14 |