| Literature DB >> 29556054 |
Harini Veeraraghavan1, Brittany Z Dashevsky2,3, Natsuko Onishi3, Meredith Sadinski3, Elizabeth Morris3, Joseph O Deasy4, Elizabeth J Sutton3.
Abstract
We present a segmentation approach that combines GrowCut (GC) with cancer-specific multi-parametric Gaussian Mixture Model (GCGMM) to produce accurate and reproducible segmentations. We evaluated GCGMM using a retrospectively collected 75 invasive ductal carcinoma with ERPR+ HER2- (n = 15), triple negative (TN) (n = 9), and ER-HER2+ (n = 57) cancers with variable presentation (mass and non-mass enhancement) and background parenchymal enhancement (mild and marked). Expert delineated manual contours were used to assess the segmentation performance using Dice coefficient (DSC), mean surface distance (mSD), Hausdorff distance, and volume ratio (VR). GCGMM segmentations were significantly more accurate than GrowCut (GC) and fuzzy c-means clustering (FCM). GCGMM's segmentations and the texture features computed from those segmentations were the most reproducible compared with manual delineations and other analyzed segmentation methods. Finally, random forest (RF) classifier trained with leave-one-out cross-validation using features extracted from GCGMM segmentation resulted in the best accuracy for ER-HER2+ vs. ERPR+/TN (GCGMM 0.95, expert 0.95, GC 0.90, FCM 0.92) and for ERPR + HER2- vs. TN (GCGMM 0.92, expert 0.91, GC 0.77, FCM 0.83).Entities:
Mesh:
Year: 2018 PMID: 29556054 PMCID: PMC5859113 DOI: 10.1038/s41598-018-22980-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Performance of segmentation methods. (a) Example segmentations produced using GrowCut(GC), GC combined with Gaussian mixture models (GCGMM), fuzzy c-means clustering method (FCM) and volumes produced using all methods overlaid with expert delineated volume and (b) overall performance of the segmentation methods for all analyzed tumours. The inter-rater segmentation concordance computed using the various metrics is shown for reference using dashed lines.
Segmentation accuracies generated using GC, GCGMM, and FCM presented using mean and standard deviation (SD).
| Analysis | FCM | GC | GCGMM | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DSC | mSD | HD95 | |VR| | DSC | mSD | HD95 | |VR| | DSC | mSD | HD95 | |VR| | |
| Overall mean | 0.66 | 1.85 | 5.55 | 0.27 | 0.69 | 2.97 | 7.38 | 0.21 | 0.81***,*** | 1.08**,*** | 4.82**,*** | 0.12***,*** |
| SD | 0.15 | 1.31 | 3.41 | 0.16 | 0.15 | 12.29 | 14.18 | 0.18 | 0.07 | 0.59 | 3.67 | 0.08 |
| Mild BPE mean | 0.65 | 1.89 | 5.43 | 0.29 | 0.70 | 1.73 | 8.58 | 0.20 | 0.80***,*** | 1.11*,*** | 5.27 | 0.13*,*** |
| SD | 0.15 | 1.19 | 3.15 | 0.16 | 0.12 | 1.41 | 20.98 | 0.15 | 0.06 | 0.62 | 4.61 | 0.08 |
| Marked BPE mean | 0.68 | 1.74 | 5.73 | 0.25 | 0.68 | 3.48 | 6.41 | 0.24 | 0.81***,*** | 1.01 | 4.44 | 0.10***,*** |
| SD | 0.15 | 1.41 | 3.88 | 0.17 | 0.15 | 14.84 | 4.53 | 0.19 | 0.07 | 0.58 | 2.82 | 0.07 |
| Mass mean | 0.66 | 1.93 | 5.63 | 0.27 | 0.70 | 2.34 | 5.73 | 0.21 | 0.82***,*** | 1.02***,*** | 4.24 | 0.12***,*** |
| SD | 0.16 | 1.39 | 3.66 | 0.17 | 0.14 | 8.49 | 4.10 | 0.16 | 0.07 | 0.45 | 2.49 | 0.08 |
| Non-mass mean | 0.68 | 1.64 | 5.32 | 0.27 | 0.66 | 4.57 | 11.64 | 0.23 | 0.78 | 1.24 | 6.31 | 0.11***, |
| SD | 0.14 | 1.06 | 2.66 | 0.15 | 0.17 | 18.84 | 25.63 | 0.21 | 0.07 | 0.84 | 5.42 | 0.08 |
| ER-HER2+ mean | 0.67 | 1.77 | 5.36 | 0.27 | 0.69 | 2.78 | 7.94 | 0.22 | 0.81***,*** | 1.03***,*** | 4.92 | 0.10***,*** |
| SD | 0.16 | 1.32 | 3.07 | 0.16 | 0.14 | 11.90 | 16.6 | 0.17 | 0.06 | 0.61 | 3.88 | 0.07 |
| TN mean | 0.65 | 2.03 | 5.15 | 0.29 | 0.73 | 5.41 | 6.78 | 0.19 | 0.82 | 1.21 | 4.63 | 0.14*,* |
| SD | 0.14 | 1.26 | 2.34 | 0.16 | 0.19 | 20.65 | 5.85 | 0.22 | 0.09 | 0.52 | 2.50 | 0.09 |
| ERPR + HER2− mean | 0.65 | 2.06 | 6.59 | 0.29 | 0.69 | 2.01 | 5.54 | 0.22 | 0.79 | 1.18 | 4.55 | 0.15 |
| SD | 0.14 | 1.27 | 4.91 | 0.17 | 0.15 | 2.06 | 3.72 | 0.19 | 0.07 | 0.52 | 3.52 | 0.09 |
FCM Fuzzy c-means clustering; GC GrowCut; GCGMM GrowCut with Gaussian Mixture Models.
DSC Dice coefficient; mSD mean surface distance; HD95 95 percentile of Hausdorff distance; |VR| absolute volume ratio.
Significant differences between GCGMM vs. FCM and GCGMM vs. GC are indicated above each metric for the corresponding analysis after adjusting for multiple comparisons using Bonferroni-Holm correction.
ns P ≥ 0.05; *P < 0.05; **P < 0.01; ***P < 0.001.
Reproducibility of segmentations generated using multiple raters and by algorithms (GC, FCM, GCGMM) using different user inputs.
| Method |
| % | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| DSC | mSD ( | HD95 ( | |VR| | Volume (cc) | DSC | mSD | HD95 | |VR| | Volume (cc) | |
| Manual | 0.084 | 0.063 | 4.6 | 0.10 | 1.08 | 11.1 | 48.3 | 48.6 | 62.6 | 29.4 |
| FCM | 0.06 | 0.91 | 2.38 | 0.06 | 2.46 | 13.6 | 31.9 | 29.7 | 33.5 | 36.1 |
| GC | 0.10 | 12.3 | 13.5 | 0.14 | 37.6 | 19.6 | 50.0 | 26.7 | 64.2 | 43.8 |
| GCGMM | 0.038 | 0.31 | 1.33 | 0.057 | 1.75 | 5.07 | 21.2 | 20.7 | 54.3 | 14.5 |
SD Root mean square of standard deviation; %CV Percentage coefficient of variation in the RMS value for a specific metric FCM Fuzzy c-means clustering; GC GrowCut; GCGMM GrowCut with Gaussian Mixture Models.
DSC Dice coefficient; mSD mean surface distance; HD95 95 percentile of Hausdorff distance; |VR| absolute volume ratio.
Figure 2Segmentation variability for the different methods. The inter-rater delineations, and the segmentations generated using three different user inputs are shown in (a). The segmentation accuracies achieved by the different methods for the three different user inputs is shown in (b) and the segmentations with significantly different accuracies using a given measure are identified, where *P < 0.05 and **P < 0.01. The p-values are reported after adjusting for multiple comparisons using Bonferroni-Holm method. The intra-class correlation coefficient (ICC) of the texture measures computed from the generated segmentations are shown in (c).
Classifier accuracies using features computed from different segmentations. TPR - true positive rate, TNR - true negative rate, FPR - false positive rate, FNR - false negative rate, AUC - area under the curve.
| Method | ER-HER2+ vs. ERPR + HER2−/TN | ERPR + HER2− vs. TN | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| TPR | TNR | FPR | FNR | AUC (95% CI) | TPR | TNR | FPR | FNR | AUC (95% CI) | |
| Expert | 0.85 | 0.91 | 0.09 | 0.15 | 0.95 (0.91–0.97) | 0.78 | 0.91 | 0.09 | 0.22 | 0.91 (0.79–0.97) |
| FCM | 0.85 | 0.85 | 0.15 | 0.15 | 0.92 (0.87–0.96) | 0.74 | 0.83 | 0.17 | 0.26 | 0.83 (0.67–0.91) |
| GC | 0.79 | 0.79 | 0.21 | 0.21 | 0.90 (0.86–0.94) | 0.70 | 0.78 | 0.22 | 0.30 | 0.77 (0.61–0.90) |
| GCGMM | 0.93 | 0.81 | 0.19 | 0.07 | 0.95 (0.92–0.98) | 0.83 | 0.96 | 0.04 | 0.17 | 0.92 (0.82–0.97) |
Figure 3Performance of classifiers trained with textures extracted from different segmentations. (a) ROC curves for classifiers trained using features extracted from various segmentations for distinguishing between ER-HER2+ vs. ERPR + HER2−/TN and ERPR + HER2− vs. TN cancers. The five most relevant features and their differences between ERPR + HER2− vs. TN cancers for expert delineated (b) and GCGMM segmented tumors (c) are also shown.
Results of Wilcoxon test to assess the difference between ER-HER2+ vs. ERPR + HER2−/TN and ERPR + HER2− vs. TN cancers using top five-most relevant (determined using Gini importance) features extracted using RF classifiers and trained using features generated from the different segmentation methods. P-values are reported after adjusting for multiple comparisons using Bonferroni-Holm method.
| Expert | p-Value | FCM | p-Value | GC | p-Value | GCGMM | p-Value |
|---|---|---|---|---|---|---|---|
|
| |||||||
| Post2 I | 0.74 | Post3 Kurt | 0.56 | Post1 Skew | 1.00 | Post1 I | 1.00 |
| Post2 Skew | 0.31 | Pre Kurt | 0.36 | Pre Kurt | 1.00 | Post3 I | 1.00 |
| Post1 I | 1.00 | Post2 Kurt | 0.56 | Pre Contrast | 1.00 | Post2 I | 1.00 |
| Post1 Corr | 1.00 | Post1 Kurt | 0.56 | Post1 Kurt | 1.00 | Pre Energy | 0.19 |
| Post1 Entropy | 1.00 | Post3 SD | 0.56 | Post2 Skew | 1.00 | Post3 Skew | 1.00 |
|
| |||||||
| Expert | p-Value | FCM | p-Value | GC | p-Value | GCGMM | p-Value |
| Post1 Contrast | 0.04 | Post3 Kurt | 0.27 | Post3 Homogeneity | 0.71 | Post3 Kurt | 0.01 |
| Post3 Contrast | 0.02 | Post3 SD | 0.65 | Post3 Skew | 0.71 | Post2 Kurt | 0.01 |
| Post2 Contrast | 0.04 | Post2 Skew | 0.65 | Post2 Skew | 0.58 | Post3 Skew | 0.01 |
| Pre Contrast | 0.08 | Post1 Kurt | 0.32 | Pre SD | 1.00 | Post1 Kurt | 0.16 |
| Post1 Corr | 0.04 | Post1 Skew | 0.65 | Post2 I | 1.00 | Post2 Skew | 0.01 |
FCM: Fuzzy c-means; GC: Grow-Cut; GCGMM: Grow-Cut with Gaussian Mixture Models
Pre: Pre constrast MRI; Post1: first post-contrast MRI; Post2: second post-contrast MRI; Post3: third post-contrast MRI
I: intensity; skew: skewness; corr: correlation; kurt: kurtosis; SD: standard deviation.
Figure 4Workflow diagram. (i) Inputs used for generating segmentations, (ii) confidence map computed from GCGMM using region of interest refined input from (i) c, and segmentations generated using two different confidence thresholds (iii,iv) for a triple negative breast cancer.