| Literature DB >> 33490720 |
Brian M Anderson1,2, Ethan Y Lin3, Carlos E Cardenas2, Dustin A Gress1, William D Erwin1, Bruno C Odisio3, Eugene J Koay4, Kristy K Brock1,2.
Abstract
PURPOSE: The deformable nature of the liver can make focal treatment challenging and is not adequately addressed with simple rigid registration techniques. More advanced registration techniques can take deformations into account (eg, biomechanical modeling) but require segmentations of the whole liver for each scan, which is a time-intensive process. We hypothesize that fully convolutional networks can be used to rapidly and accurately autosegment the liver, removing the temporal bottleneck for biomechanical modeling. METHODS AND MATERIALS: Manual liver segmentations on computed tomography scans from 183 patients treated at our institution and 30 scans from the Medical Image Computing & Computer Assisted Intervention challenges were collected for this study. Three architectures were investigated for rapid automated segmentation of the liver (VGG-16, DeepLabv3 +, and a 3-dimensional UNet). Fifty-six cases were set aside as a final test set for quantitative model evaluation. Accuracy of the autosegmentations was assessed using Dice similarity coefficient and mean surface distance. Qualitative evaluation was also performed by 3 radiation oncologists on 50 independent cases with previously clinically treated liver contours.Entities:
Year: 2020 PMID: 33490720 PMCID: PMC7807136 DOI: 10.1016/j.adro.2020.04.023
Source DB: PubMed Journal: Adv Radiat Oncol ISSN: 2452-1094
Data distribution
| Data set | Npatients | NImages | Distribution (images) | ||
|---|---|---|---|---|---|
| Train | Validation | Test | |||
| Contrast | 62 | 108 | 72 | 19 | 17 |
| Noncontrast | 93 | 97 | 63 | 15 | 19 |
| MICCAI | 30 | 30 | 0 | 0 | 30 |
Abbreviation: MICCAI = Medical Image Computing & Computer Assisted Intervention.
All patients with multiple examinations were kept in the training set.
Figure 1Hyper-parameter searching for ideal UNet style architecture. Parameters varied were number of layers in depth (2-5), number of convolution layers (0-2) versus atrous layers, and maximum number of filters (16-32). For ease of viewing, convolution layer 2 is not shown.
Test results by group for each model
| Test data | Mean (minimum, maximum) | ||||||
|---|---|---|---|---|---|---|---|
| Dice similarity coefficient | Mean surface distance (mm) | ||||||
| Model name | Model name | ||||||
| Data set | N_Images | 3D Unet | VGG_16 | V3_Plus | 3D Unet | VGG_16 | V3_Plus |
| Contrast | 17 | 0.87 (0.72, 0.92) | 0.96 (0.93,0.97) | 0.96 (0.95, 0.98) | 4.66 (2.35, 13.88) | 1.25 (0.60, 2.95) | 1.02 (0.46, 1.89) |
| Noncontrast | 19 | 0.86 (0.74, 0.93) | 0.95 (0.91, 0.97) | 0.96 (0.91, 0.98) | 5.20 (1.94, 17.92) | 1.37 (0.69, 2.93) | 1.18 (0.41, 3.21) |
| MICCAI | 30 | 0.85 (0.74, 0.91) | 0.95 (0.90, 0.97) | 0.95 (0.90, 0.97) | 5.15 (3.08, 9.07) | 1.80 (0.65, 7.02) | 1.54 (0.90, 3.68) |
Abbreviation: MICCAI = Medical Image Computing & Computer Assisted Intervention.
Figure 2Predictions (red) overlayed on top of computed tomography scans for median and worst cases for each architecture. Red arrows indicate regions of failure. (A color version of this figure is available at https://doi.org/10.1016/j.adro.2020.04.023.)
Consensus model results for the 3 reviewing radiation oncologists
| Reviewers | Majority or one? | Preference | Clinically usable | Minor edits | Major edits | ||||
|---|---|---|---|---|---|---|---|---|---|
| Auto | Manual | Auto | Manual | Auto | Manual | Auto | Manual | ||
| 1a, 1b, 2, 3 | Majority voting | 60% (30/50) | 40% (20/50) | 81% | 89% | 33% | 45% | 19% | 11% |
| At least 1 vote | 82% (41/50) | 64% (32/50) | 96% (48/50) | 100% (50/50) | 86% (43/50) | 96% (48/50) | 58% (29/50) | 52% (26/50) | |
| 1b, 2, 3 | Majority Voting | 62% (31/50) | 38% (19/50) | 76% (38/50) | 82% (41/50) | 42% (21/50) | 50% (25/50) | 24% (12/50) | 18% (19/50) |
| At least 1 vote | 76% (38/50) | 64% (32/50) | 88% (44/50) | 96% (48/50) | 76% (38/50) | 88% (44/50) | 58% (29/50) | 52% (26/50) | |
When specifying reviewers, 1a is Reviewer 1’s initial review and 1b is their review with a 4-month time gap to reduce bias. Majority voting implies at least half of the reviewers agreed on a case-by-case basis, and ties were split. At least 1 implies that at least 1 reviewer voted in the manner listed.
Values of 0.5 were split ties/.
Figure 3(A) Presence of high-contrast biliary stent causing autosegmentation to underestimate liver, requiring edits, and (B) ascites misidentified as liver. Teal: ground truth; red: auto segmentation. (A color version of this figure is available at https://doi.org/10.1016/j.adro.2020.04.023.)
Comparison of proposed method versus recent liver segmentation methods
| Method | Source | Test size | Dice |
|---|---|---|---|
| Proposed | Contrast | 17 | 0.96 |
| Noncontrast | 19 | 0.96 | |
| MICCAI | 30 | 0.95 | |
| 2016 | Contrast | 20 | 0.94 |
| 2017 | Contrast | 127 | 0.96 |
| Noncontrast | 13 | 0.96 | |
| 2017 | Contrast | 150 | 0.95 |
| 2017 | Contrast | 7 | 0.94 |
| 2018 | Contrast | 129 | 0.95 |
| Contrast | 20 | 0.94 |
Abbreviation: MICCAI = Medical Image Computing & Computer Assisted Intervention.