| Literature DB >> 34170972 |
Karthik V Sarma1, Alex G Raman1,2, Nikhil J Dhinagar1,3, Alan M Priester1, Stephanie Harmon4,5, Thomas Sanford4,6, Sherif Mehralivand4, Baris Turkbey4, Leonard S Marks1, Steven S Raman1, William Speier1, Corey W Arnold1.
Abstract
PURPOSE: Developing large-scale datasets with research-quality annotations is challenging due to the high cost of refining clinically generated markup into high precision annotations. We evaluated the direct use of a large dataset with only clinically generated annotations in development of high-performance segmentation models for small research-quality challenge datasets.Entities:
Mesh:
Year: 2021 PMID: 34170972 PMCID: PMC8232529 DOI: 10.1371/journal.pone.0253829
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Evaluation results for baseline models.
| Soft Dice Coefficients | Average Hausdorff Distance | ||||
|---|---|---|---|---|---|
| Dataset | Overall | Base | Midgland | Apex | |
| 0.909 ± 0.042 | 0.863 ± 0.095 | 0.941 ± 0.030 | 0.832 ± 0.094 | 0.156 ± 0.231 | |
| 0.702 ± 0.083 | 0.679 ± 0.117 | 0.849 ± 0.051 | 0.702 ± 0.093 | 0.480 ± 0.555 | |
| 0.568 ± 0.122 | 0.501 ± 0.168 | 0.762 ± 0.087 | 0.561 ± 0.168 | 2.155 ± 2.466 | |
PX2 = ProstateX-2, P12 = PROMISE12, results reported as mean ± standard deviation across all images
Evaluation results for retargeted models.
| Refining? | Dataset | Soft Dice Coefficients | Average Hausdorff Distance | |||
|---|---|---|---|---|---|---|
| Overall | Base | Midgland | Apex | |||
| PX2 | 0.465 ± 0.291 | 0.314 ± 0.314 | 0.517 ± 0.316 | 0.401 ± 0.312 | 4.824 ± 5.920 | |
| PX2 | 0.912 | 0.851 | 0.949 | 0.849 | 0.150 | |
| P12 | 0.708 ± 0.210 | 0.475 ± 0.317 | 0.779 ± 0.215 | 0.679 ± 0.221 | 1.953 ± 3.747 | |
| P12 | 0.852 | 0.744 | 0.918 | 0.777 | 0.581 | |
* denotes significantly higher than baseline model, p<0.001. PX2 = ProstateX-2, P12 = PROMISE12, results reported as mean ± standard deviation across all images
Model performance using ablated primary dataset (overall soft Dice coefficients).
| Model | 5% | 10% | 15% | 20% | 40% | 60% | 80% | 100% |
|---|---|---|---|---|---|---|---|---|
| 0.638 | 0.754 | 0.775 | 0.825 | 0.883 | 0.901 | 0.906 | 0.909 | |
| 0.740 | 0.814 | 0.829 | 0.861 | 0.899 | 0.907 | 0.909 | 0.912 | |
| 0.625 | 0.721 | 0.727 | 0.781 | 0.831 | 0.848 | 0.842 | 0.852 |
* denotes significantly higher than baseline model, p<0.001. PX2 = ProstateX-2, P12 = PROMISE12, FT = fine-tuned, results reported as mean across all images
Evaluation results for refined BraTS models.
| Soft Dice Coefficients | Average Hausdorff Distance | ||||
|---|---|---|---|---|---|
| Dataset | Overall | Base | Midgland | Apex | |
| 0.834 | 0.783 | 0.903 | 0.775 | 0.299 | |
| 0.704 | 0.614 | 0.820 | 0.644 | 1.428 | |
* denotes significantly higher than baseline model, p<0.001. PX2 = ProstateX-2, P12 = PROMISE12, results reported as mean ± standard deviation across all images