| Literature DB >> 32355285 |
Maximilian W M Wintergerst1, Shekoufeh Gorgi Zadeh2,3, Vitalis Wiens2,4, Sarah Thiele1, Steffen Schmitz-Valckenberg1, Frank G Holz1, Robert P Finger5, Thomas Schultz2,6.
Abstract
Here, we investigate the extent to which re-implementing a previously published algorithm for OCT-based drusen quantification permits replicating the reported accuracy on an independent dataset. We refined that algorithm so that its accuracy is increased. Following a systematic literature search, an algorithm was selected based on its reported excellent results. Several steps were added to improve its accuracy. The replicated and refined algorithms were evaluated on an independent dataset with the same metrics as in the original publication. Accuracy of the refined algorithm (overlap ratio 36-52%) was significantly greater than the replicated one (overlap ratio 25-39%). In particular, separation of the retinal pigment epithelium and the ellipsoid zone could be improved by the refinement. However, accuracy was still lower than reported previously on different data (overlap ratio 67-76%). This is the first replication study of an algorithm for OCT image analysis. Its results indicate that current standards for algorithm validation do not provide a reliable estimate of algorithm performance on images that differ with respect to patient selection and image quality. In order to contribute to an improved reproducibility in this field, we publish both our replication and the refinement, as well as an exemplary dataset.Entities:
Mesh:
Year: 2020 PMID: 32355285 PMCID: PMC7192932 DOI: 10.1038/s41598-020-63924-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Step-by-step illustration of the refinements.
Original results from Chen et al.[13].
| ADAD [μm] | ADAD [%] | OR ± SD | |
|---|---|---|---|
| B-scans with drusen (‘4/340 dataset’) | 10.29 ± 8.9 | 15.70 ± 15.50 | 76.33 ± 11.29 |
| B-scans with largest drusen load per volume (‘143/143 dataset’) | 19.97 ± 14.68 | 23.77 ± 13.8 | 67.18 ± 9.14 |
Image resolution of the used dataset: 512 × 1024 and 128 B-Scans per volume scan; OR = overlap ratio; SD = standard deviation.
Comparison of the algorithms to the ground truth.
| Replicated Chen | Refined algorithm | |||||
|---|---|---|---|---|---|---|
| ADAD ± SD (μm) | ADAD ± SD (%) | OR ± SD (%) | ADAD ± SD (μm) | ADAD ± SD (%) | OR ± SD (%) | |
| B-scans with drusen | 17.60 ± 36.70 | 100.59 ± 304.80 | 24.52 ± 20.56 | 13.28 ± 29.40 | 73.54 ± 217.80 | 35.88 ± 25.25 |
| B-scans with largest drusen load per volume | 19.94 ± 13.54 | 42.70 ± 22.70 | 39.24 ± 22.06 | 15.64 ± 11.05 | 36.86 ± 24.34 | 51.90 ± 23.70 |
| Volumetric Computation | 11.96 ± 12.11 | 46.37 ± 75.36 | 29.35 ± 17.32 | 8.31 ± 6.87 | 30.05 ± 29.76 | 42.20 ± 20.47 |
Image resolution of the used dataset: 512 × 496 and 145 B-Scans per volume scan. OR = overlap ratio; SD = standard deviation.
Figure 2Comparison of replicated and refined drusen segmentation.
Figure 3Algorithm performance stratified for drusen load.