| Literature DB >> 35260710 |
Sang-Heon Lim1,2, Young Jae Kim2, Yeon-Ho Park3, Doojin Kim3, Kwang Gi Kim4,5, Doo-Ho Lee6.
Abstract
Pancreas segmentation is necessary for observing lesions, analyzing anatomical structures, and predicting patient prognosis. Therefore, various studies have designed segmentation models based on convolutional neural networks for pancreas segmentation. However, the deep learning approach is limited by a lack of data, and studies conducted on a large computed tomography dataset are scarce. Therefore, this study aims to perform deep-learning-based semantic segmentation on 1006 participants and evaluate the automatic segmentation performance of the pancreas via four individual three-dimensional segmentation networks. In this study, we performed internal validation with 1,006 patients and external validation using the cancer imaging archive pancreas dataset. We obtained mean precision, recall, and dice similarity coefficients of 0.869, 0.842, and 0.842, respectively, for internal validation via a relevant approach among the four deep learning networks. Using the external dataset, the deep learning network achieved mean precision, recall, and dice similarity coefficients of 0.779, 0.749, and 0.735, respectively. We expect that generalized deep-learning-based systems can assist clinical decisions by providing accurate pancreatic segmentation and quantitative information of the pancreas for abdominal computed tomography.Entities:
Mesh:
Year: 2022 PMID: 35260710 PMCID: PMC8904764 DOI: 10.1038/s41598-022-07848-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) Raw volume data were cropped and reconstructed for training data generation. (b) All data were divided into datasets that consisted of almost identical numbers of participants for cross validation. (c) A region of the pancreas was enhanced via CLAHE. Valid, validation; TCIA, the Cancer Imaging Archive; WL, window level; WW, window width; CLAHE, contrast-limited adaptive histogram equalization.
Figure 2Architecture illustration of residual dense u-net. Conv, convolution; BN, batch normalization; ReLU, rectified linear unit.
Demographics of participants.
| Total | Men | Women | ||
|---|---|---|---|---|
| Number | 1006 (100) | 530 (52.6) | 476 (47.4) | |
| Age (years) | 55.3 ± 15.6 | 55.6 ± 15.3 | 54.9 ± 15.9 | 0.508 |
| Height (cm) | 163.5 ± 8.9 | 169.3 ± 6.8 | 157.1 ± 6.2 | < 0.001 |
| Weight (kg) | 65.3 ± 12.1 | 70.3 ± 11.6 | 59.6 ± 10.0 | < 0.001 |
| Body mass index (kg/m2) | 24.3 ± 3.5 | 24.4 ± 3.3 | 24.1 ± 3.7 | 0.165 |
| Hypertension | 364 (36.3) | 205 (38.8) | 159 (33.5) | 0.078 |
| Diabetes mellitus | 190 (18.9) | 108 (20.5) | 82 (17.3) | 0.198 |
| Smoking | 224 (22.3) | 185 (35.0) | 39 (8.2) | < 0.001 |
| Alcohol | 330 (32.9) | 246 (46.6) | 84 (17.7) | < 0.001 |
| Volume of pancreas (cm3) | 62.6 ± 19.0 | 68.8 ± 19.5 | 55.8 ± 16.0 | < 0.001 |
Values are expressed as n (%) or mean ± standard deviation, unless otherwise indicated.
Evaluation metrics for four pancreas segmentation models.
| Precision | Recall | DSC | Trainable parameter | |
|---|---|---|---|---|
| Basic U-net | 11,003,073 | |||
| Dense U-net | 35,261,601 | |||
| Residual U-net | 2,350,857 | |||
| Residual Dense U-net | 47,074,657 |
Results are indicated as mean ± standard deviation, and the best performances are indicated in bold. The results are highlighted in italics if the residual dense u-net performs significantly better than the corresponding method. We used a significance level of 0.05 and a paired t test for network comparison.
DSC, dice similarity coefficient.
Comparison of pancreas segmentation performance according to pancreatic volumes using four independent 3D networks.
| DSC | * | |
|---|---|---|
| PV | ||
| Basic U-net | 0.785 | < 0.001 |
| Dense U-net | 0.794 | 0.013 |
| Residual U-net | 0.756 | < 0.001 |
| Residual Dense U-net | – | |
| 30 cm3
| ||
| Basic U-net | 0.834 | < 0.001 |
| Dense U-net | 0.842 | < 0.001 |
| Residual U-net | 0.815 | < 0.001 |
| Residual Dense U-net | ||
| 60 cm3
| ||
| Basic U-net | 0.859 | < 0.001 |
| Dense U-net | 0.866 | < 0.001 |
| Residual U-net | 0.844 | < 0.001 |
| Residual Dense U-net | ||
| PV | ||
| Basic U-net | 0.852 | < 0.001 |
| Dense U-net | 0.857 | < 0.001 |
| Residual U-net | 0.836 | < 0.001 |
| Residual Dense U-net | ||
Results are indicated as mean ± standard deviation, and the best performances are indicated in bold.
PV, pancreatic volume; DSC, dice similarity coefficient.
*We used a paired t test to compare the residual dense u-net with the corresponding network and used a significance level of 0.05.
Figure 3(a) Representative examples of pancreas segmentation in the 2D axial plane and 3D volume of one patient. (b) DSC metric in each deep learning model according to the volume of the pancreas. GS, gold standard; DSC, dice similarity coefficient; ResDense, residual dense; PV, pancreatic volume.
Figure 4Estimation of pancreatic volume assessments using DL-prediction and manual pancreas segmentation. To validate the DL approaches, the (a) Bland–Altman plot and (b) regression plot were employed for internal validation and (c, d) external validation. SD, standard deviation.