| Literature DB >> 33144873 |
Ying-Hwey Nai1, Bernice W Teo2, Nadya L Tan3, Koby Yi Wei Chua4, Chun Kit Wong1, Sophie O'Doherty1, Mary C Stephenson1, Josh Schaefferkoetter1,5,6, Yee Liang Thian7, Edmund Chiong8,9, Anthonin Reilhac1.
Abstract
Prostate segmentation in multiparametric magnetic resonance imaging (mpMRI) can help to support prostate cancer diagnosis and therapy treatment. However, manual segmentation of the prostate is subjective and time-consuming. Many deep learning monomodal networks have been developed for automatic whole prostate segmentation from T2-weighted MR images. We aimed to investigate the added value of multimodal networks in segmenting the prostate into the peripheral zone (PZ) and central gland (CG). We optimized and evaluated monomodal DenseVNet, multimodal ScaleNet, and monomodal and multimodal HighRes3DNet, which yielded dice score coefficients (DSC) of 0.875, 0.848, 0.858, and 0.890 in WG, respectively. Multimodal HighRes3DNet and ScaleNet yielded higher DSC with statistical differences in PZ and CG only compared to monomodal DenseVNet, indicating that multimodal networks added value by generating better segmentation between PZ and CG regions but did not improve the WG segmentation. No significant difference was observed in the apex and base of WG segmentation between monomodal and multimodal networks, indicating that the segmentations at the apex and base were more affected by the general network architecture. The number of training data was also varied for DenseVNet and HighRes3DNet, from 20 to 120 in steps of 20. DenseVNet was able to yield DSC of higher than 0.65 even for special cases, such as TURP or abnormal prostate, whereas HighRes3DNet's performance fluctuated with no trend despite being the best network overall. Multimodal networks did not add value in segmenting special cases but generally reduced variations in segmentation compared to the same matched monomodal network.Entities:
Mesh:
Year: 2020 PMID: 33144873 PMCID: PMC7596462 DOI: 10.1155/2020/8861035
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Optimal hyperparameter configurations for DenseVNet, ScaleNet, and HighRes3DNet.
| Hyperparameters | DenseVNet | ScaleNet | HighRes3DNet |
|---|---|---|---|
| Activation function | ReLU | Leaky ReLU | Leaky ReLU |
| Optimizers | Adam | Nesterov momentum | Adam |
| Batch size | 8 | 4 | 4 |
| Interpolation | Linear | B-spline | B-spline |
| Learning rate | 10−3 | 10−2 | 10−2 |
| Loss function | Dice | Dice no square | Dice plus cross-entropy |
| Whitening | True | True | False |
| Normalization | False | False | True |
| Regularization type | L1 | L2 | L2 |
| Sample per volume | 1 | 10 | 8 |
| Volume padding size | 16 | 24 | 16 |
| Window sampling | Resize | Resize | Resize |
| Spatial window size | (56, 56, 56) | (40, 40, 48) | (96, 96, 48) |
| No. of iterations | 2000 | 1000 | 1000 |
Figure 1DSC of all the segmentation of the WG, PZ, and CG regions of 20 subjects, generated using the optimized network of (a) DenseVNet (DVN) against HighRes3DNet (HRN) and (b) DVN against ScaleNet (SN) with different image input combinations.
The DSC, aRVD, and AHD (mean ± standard deviation [worst, best]) of the WG, PZ, and CG segmentation for the optimized networks of DenseVNet, HighRes3DNet, and ScaleNet.
| Metrics | DenseVNet | HighRes3DNet | ScaleNet | ||||||
|---|---|---|---|---|---|---|---|---|---|
| WG | PZ | CG | WG | PZ | CG | WG | PZ | CG | |
| DSC | 0.875 ± 0.039 | 0.527 ± 0.171 | 0.699 ± 0.096 | 0.890 ± 0.049 | 0.712 ± 0.109 | 0.856 ± 0.090 | 0.848 ± 0.102 | 0.623 ± 0.156 | 0.826 ± 0.082 |
| aRVD | 0.088 ± 0.055 | 1.119 ± 1.109 | 0.402 ± 0.190 | 0.077 ± 0.086 | 0.188 ± 0.211 | 0.134 ± 0.200 | 0.101 ± 0.139 | 0.263 ± 0.330 | 0.137 ± 0.132 |
| AHD | 1.901 ± 0.715 | 2.409 ± 1.073 | 2.142 ± 0.968 | 1.792 ± 0.690 | 1.709 ± 0.551 | 1.720 ± 0.622 | 2.084 ± 0.821 | 1.924 ± 0.628 | 1.940 ± 0.767 |
The DSC, aRVD, and AHD (mean ± standard deviation [worst, best]) of the apex, middle, and base of the WG segmentation for the optimized networks of DenseVNet, HighRes3DNet, and ScaleNet.
| Metrics | DenseVNet | HighRes3DNet | ScaleNet | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Apex | Middle | Base | Apex | Middle | Base | Apex | Middle | Base | |
| DSC | 0.757 ± 0.201 | 0.897 ± 0.037 | 0.568 ± 0.295 | 0.788 ± 0.232 | 0.912 ± 0.041 | 0.576 ± 0.281 | 0.694 ± 0.227 | 0.877 ± 0.097 | 0.467 ± 0.303 |
| aRVD | 0.289 ± 0.228 | 0.094 ± 0.062 | 0.412 ± 0.337 | 0.218 ± 0.270 | 0.076 ± 0.067 | 0.433 ± 0.339 | 0.337 ± 0.269 | 0.086 ± 0.124 | 0.578 ± 0.326 |
| AHD | 2.775 ± 1.011 | 3.677 ± 0.536 | 3.004 ± 0.970 | 2.521 ± 0.798 | 3.449 ± 0.607 | 2.864 ± 0.967 | 3.086 ± 0.773 | 4.022 ± 0.690 | 3.306 ± 1.157 |
p values generated between two different networks using Student's t-test for DSC, aRVD, and AHD for WG, PZ, and CG regions and the apex, middle, and base of the WG segmentations of all subjects and excluding the three special cases in (). ∗p < 0.05, ∗∗p < 0.01, and ∗∗∗p < 0.001.
| Metrics | Paired networks | Regions | Within WG regions | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| WG | PZ | CG | Apex | Middle | Base | ||||||||
| DSC | DenseVNet-HighRes3DNet | 0.292 | (∗) | ∗∗∗ | (∗∗∗) | ∗∗∗ | (∗∗∗) | 0.657 | (0.432) | 0.254 | (0.060) | 0.934 | (0.875) |
| DenseVNet-ScaleNet | 0.274 | (0.289) | 0.074 | (∗∗) | ∗∗∗ | (∗∗∗) | 0.358 | (0.680) | 0.379 | (0.573) | 0.291 | (0.458) | |
| HighRes3DNet-ScaleNet | 0.104 | (∗∗∗) | ∗ | (∗) | 0.280 | (∗) | 0.204 | (0.220) | 0.145 | (∗) | 0.245 | (0.355) | |
|
| |||||||||||||
| aRVD | DenseVNet-HighRes3DNet | 0.620 | (0.069) | ∗∗ | (∗∗) | ∗∗∗ | (∗∗∗) | 0.376 | (0.203) | 0.382 | (0.067) | 0.847 | (0.942) |
| DenseVNet-ScaleNet | 0.703 | (0.209) | ∗∗ | (∗∗) | ∗∗∗ | (∗∗∗) | 0.544 | (0.837) | 0.803 | (∗) | 0.122 | (0.237) | |
| HighRes3DNet-ScaleNet | 0.512 | (0.610) | 0.399 | (0.454) | 0.955 | (0.264) | 0.171 | (0.140) | 0.747 | (0.647) | 0.176 | (0.216) | |
|
| |||||||||||||
| AHD | DenseVNet-HighRes3DNet | 0.627 | (0.547) | ∗ | (0.217) | 0.109 | (0.075) | 0.384 | (0.182) | 0.217 | (0.103) | 0.650 | (0.709) |
| DenseVNet-ScaleNet | 0.456 | (0.481) | 0.089 | (0.085) | 0.469 | (0.477) | 0.282 | (0.457) | 0.085 | (0.076) | 0.377 | (0.421) | |
| HighRes3DNet-ScaleNet | 0.231 | (0.209) | 0.257 | (∗∗) | 0.324 | (0.232) | ∗ | (∗) | ∗∗ | (∗∗) | 0.198 | (0.265) | |
Figure 2Transverse views of the ADC and DWI images with the respective segmentations of PZ (red) and CG (blue) of ground truth, DenseVNet, HighRes3DNet, and ScaleNet for 4 subjects with (a) the highest WG's DSC across all networks, (b) TURP, (c) abnormally large prostate volume with uneven image intensities, and (d) PUC on T2-weighted images.
Figure 3DSC of WG segmentations of 20 subjects generated using the optimized network of (a) DenseVNet, (b) monomodal HighRes3DNet, and (c) multimodal HighRes3DNet with 20 to 120 training data, in steps of 20. The dotted lines show the mean, while the line within the box shows the median value. ∗Significant difference was observed between the 2 specified datasets with p < 0.05.
Comparison of previously reported DSC for prostate segmentation using DL networks trained with the stated image inputs and the number of training subject data as reported in the literature. AAM-CNN = active appearance model followed by a CNN. #With an endorectal coil. &With surface coil. $Training and test data are from the same datasets. %Averaged across 2 datasets. Slices instead of subjects. NCI-ISBI 2013 Challenge dataset consisted of PROSTATE-DIAGNOSIS and Prostate-3T datasets (refer to Supplementary Table 1).
| Networks | Input images | No. of training iterations | No. of training subjects | No. of test subjects | DSC (WG) | DSC (PZ) | Ref. |
|---|---|---|---|---|---|---|---|
| UNet | T2w | 15,000 | 173 | 59 | 0.84 ± 0.07 | — | [ |
| VNet | 0.88 ± 0.03 | — | |||||
| HighRes3DNet | 0.89 ± 0.03 | — | |||||
| HolisticNet | 0.88 ± 0.12 | — | |||||
| DenseVNet | 0.88 ± 0.03 | — | |||||
| Adapted UNet | 0.87 ± 0.03 | — | |||||
| ConvNet | T2w | 80 | 141 | 12 | 0.862 ± 0.008 | — | [ |
| Cascaded 2D UNet | DWI ( | 100 | 76 | 51 | 0.927 ± 0.042 | 0.793 ± 0.104 | [ |
| DSCNN | T2w | 77 | 4 | 0.885 | — | [ | |
| PSFCN | T2w | 80,000 | — | 20 | 0.853 ± 0.032 | — | [ |
| Volumetric | T2w | 10,000 | 50 | 30 | 0.894 | — | [ |
| ConvNet | |||||||
| SegNet | T2w | — | 19 | 4 | 0.73 | — | [ |
| AAM-CNN | T2w# | — | 100 | 20 | 0.925 | — | [ |
| 3D Multistream | T2w | — | 220 (GE) | 22 | 0.882 ± 0.058$ | 0.765 ± 0.115$ | [ |
| UNet | (axial, sagittal, coronal) | 330 (Siemens) | 33 | 0.905 ± 0.027$ | 0.799 ± 0.094$ | ||
| FCN | T2w | — | 40 (542 | 82 | 0.866 ± 0.048 | 0.727 ± 0.051 | [ |
| SegNet | T2w& | — | 11 (229 | 72 | 0.843 ± 0.042 | 0.760 ± 0.039 | |
| UNet | 0.884 ± 0.037 | 0.768 ± 0.033 | |||||
| DeepLabV3+ | 0.919 ± 0.020 | 0.789 ± 0.019 | |||||
| UNet | T2w | 36,952 | 141 | 47 | 0.907 ± 0.07 | 0.750 ± 0.10 | [ |
| Cascaded UNet | — | 0.871 ± 0.07 | 0.716 ± 0.10 | ||||
| PSPNet | — | 0.911 ± 0.03 | 0.771 ± 0.10 | ||||
| Dense-2 UNet | 35,760 | 0.921 ± 0.03 | 0.781 ± 0.09 |