| Literature DB >> 31068797 |
Hancan Zhu1,2, Feng Shi3, Li Wang1, Sheng-Che Hung1, Meng-Hsiang Chen4, Shuai Wang1, Weili Lin1, Dinggang Shen1,5.
Abstract
Accurate and automatic segmentation of infant hippocampal subfields from magnetic resonance (MR) images is an important step for studying memory related infant neurological diseases. However, existing hippocampal subfield segmentation methods were generally designed based on adult subjects, and would compromise performance when applied to infant subjects due to insufficient tissue contrast and fast changing structural patterns of early hippocampal development. In this paper, we propose a new fully convolutional network (FCN) for infant hippocampal subfield segmentation by embedding the dilated dense network in the U-net, namely DUnet. The embedded dilated dense network can generate multi-scale features while keeping high spatial resolution, which is useful in fusing the low-level features in the contracting path with the high-level features in the expanding path. To further improve the performance, we group every pair of convolutional layers with one residual connection in the DUnet, and obtain the Residual DUnet (ResDUnet). Experimental results show that our proposed DUnet and ResDUnet improve the average Dice coefficient by 2.1 and 2.5% for infant hippocampal subfield segmentation, respectively, when compared with the classic 3D U-net. The results also demonstrate that our methods outperform other state-of-the-art methods.Entities:
Keywords: deep learning; dilated dense network; fully convolutional network; hippocampal subfield segmentation; infant hippocampus
Year: 2019 PMID: 31068797 PMCID: PMC6491864 DOI: 10.3389/fninf.2019.00030
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 4.081
Imaging protocol for acquiring infant T1w and T2w MR images.
| T1w | 320 × 320 | 256 × 256 | 0.8 × 0.8 × 0.8 | 8 | 2.24 | 2,400/1,060 | 208/Sag | AF = 2 | 6:38 |
| T2w | 320 × 320 | 256 × 256 | 0.8 × 0.8 × 0.8 | VAR | 564 | 3,200 | 208/Sag | AF = 2 | 5:57 |
Figure 1T1w image and manual segmentation of a representative subject from the BCP dataset (top row) and Kulaga-Yoskovitz dataset (bottom row), respectively.
Figure 2Illustration of dilated convolutional kernels: 1-dilated convolutional kernel (left); 2-dilated convolutional kernel (middle); 4-dilated convolutional kernel (right).
Figure 3The structure of the dilated dense network. The number in each operation rectangle is the number of kernels. All operations are implemented in a 3D manner, and “c” denotes the concatenation.
Figure 4The structure of our proposed DUnet. The number in each operation rectangle is the number of kernels. All operations are implemented in a 3D manner.
Figure 5The structure of our proposed ResDUnet. The number in each operation rectangle is the number of kernels. “⊕” denotes the element-wise summation, and all operations are implemented in a 3D manner.
Mean (STD) values of Dice for each subfield segmentation using different patch sizes (R×R×R) on the BCP dataset by 3D U-net.
| CA1 | 0.635 (0.066) | 0.638 (0.107) | |
| CA2/3 | 0.565 (0.071) | 0.556 (0.099) | |
| SUB | 0.717 (0.038) | 0.708 (0.123) | |
| CA4/DG | 0.709 (0.072) | 0.706 (0.057) | |
| Uncus | 0.710 (0.034) | 0.704 (0.069) | |
| Average | 0.668 | 0.662 |
Higher Dice values indicate better segmentation performance. The best results are shown in bold.
Figure 6An example of isolated tiny blocks, outside the hippocampal region, appeared in the automated segmentation.
Mean (STD) values of Dice for each subfield segmentation using different modalities on the BCP dataset.
| CA1 | 0.604 (0.142) | 0.672 (0.050) | |
| CA2/3 | 0.571 (0.069) | 0.546 (0.104) | |
| SUB | 0.644 (0.223) | 0.745 (0.051) | |
| CA4/DG | 0.723 (0.027) | 0.662 (0.157) | |
| Uncus | 0.725 (0.031) | 0.645 (0.203) | |
| Average | 0.688 | 0.620 |
Higher Dice values indicate better segmentation performance. The best results are shown in bold.
Indicates that T1w + T2w achieves significant improvement over the corresponding method, and
indicates that T1w achieves significant improvement over the corresponding method in the Wilcoxon signed rank tests with p < 0.05.
Mean (STD) values of Dice for each subfield segmentation by different networks on the BCP dataset.
| CA1 | 0.648 (0.078) | 0.670 (0.046) | 0.665 (0.061) | |
| CA2/3 | 0.567 (0.082) | 0.584 (0.038) | 0.589 (0.045) | |
| SUB | 0.719 (0.080) | 0.737 (0.045) | 0.742 (0.052) | |
| CA4/DG | 0.709 (0.072) | 0.726 (0.030) | 0.729 (0.032) | |
| Uncus | 0.712 (0.050) | 0.721 (0.035) | 0.733 (0.034) | |
| Average | 0.671 | 0.688 | 0.692 |
Higher Dice values indicate better segmentation performance. The best results are shown in bold.
Indicates that ResDUnet achieves significant improvement over the corresponding method, and
indicates that DUnet achieves significant improvement over the corresponding method in the Wilcoxon signed rank tests with p < 0.05.
Mean (STD) values of ASSD for each subfield segmentation by different networks on the BCP dataset.
| CA1 | 0.175 (0.089) | 0.158 (0.048) | 0.147 (0.034) | |
| CA2/3 | 0.211 (0.104) | 0.175 (0.020) | 0.178 (0.028) | |
| SUB | 0.153 (0.073) | 0.136 (0.040) | 0.134 (0.039) | |
| CA4/DG | 0.157 (0.080) | 0.134 (0.019) | ||
| Uncus | 0.179 (0.055) | 0.168 (0.038) | 0.170 (0.041) | |
| Average | 0.175 | 0.151 | 0.155 |
Smaller ASSD values indicate better segmentation performance. The best results are shown in bold.
Indicates that ResDUnet achieves significant improvement over the corresponding method in the Wilcoxon signed rank tests with p < 0.05.
Figure 7Hippocampal subfield segmentations of a randomly selected subject from the BCP dataset, obtained by manual segmentation, and four different networks.
Mean (STD) values of Dice for each subfield segmentation by five different methods on the KULAGA-YOSKOVITZ dataset.
| CA1-3 | 0.916 (0.015) | 0.916 (0.011) | 0.918 (0.010) | 0.919 (0.011) | |
| CA4/ DG | 0.862 (0.034) | 0.871 (0.021) | 0.870 (0.016) | 0.875 (0.020) | |
| SUB | 0.886 (0.021) | 0.883 (0.016) | 0.887 (0.018) | 0.888 (0.018) | |
| Average | 0.888 | 0.890 | 0.892 | 0.895 |
Higher Dice values indicate better segmentation performance. Best results are shown in bold.
Indicates that ResDUnet achieves significant improvement over the corresponding method, and
indicates that DUnet achieves significant improvement over the corresponding method in the Wilcoxon signed rank tests with p < 0.05.
Mean (STD) values of ASSD for each subfield segmentation by four different networks on the KULAGA-YOSKOVITZ dataset.
| CA1-3 | 0.065 (0.011) | 0.064 (0.009) | ||
| CA4/DG | 0.077 (0.014) | 0.079 (0.015) | 0.075 (0.015) | |
| SUB | 0.069 (0.013) | 0.066 (0.013) | 0.065 (0.013) | |
| Average | 0.070 | 0.070 | 0.067 |
Smaller ASSD values indicate better segmentation performance. The best results are shown in bold.
Indicates that ResDUnet achieves significant improvement over the corresponding method, and
indicates that DUnet achieves significant improvement over the corresponding method in the Wilcoxon signed rank tests with p < 0.05.
Figure 8Hippocampal subfield segmentations of a randomly selected subject from the Kulaga-Yoskovitz dataset, obtained by manual segmentation, and four different networks.