| Literature DB >> 31285505 |
Alessandro A Jammal1,2, Atalie C Thompson1, Nara G Ogata1, Eduardo B Mariottoni1, Carla N Urata1, Vital P Costa2, Felipe A Medeiros3.
Abstract
In this study we developed a deep learning (DL) algorithm that detects errors in retinal never fibre layer (RNFL) segmentation on spectral-domain optical coherence tomography (SDOCT) B-scans using human grades as the reference standard. A dataset of 25,250 SDOCT B-scans reviewed for segmentation errors by human graders was randomly divided into validation plus training (50%) and test (50%) sets. The performance of the DL algorithm was evaluated in the test sample by outputting a probability of having a segmentation error for each B-scan. The ability of the algorithm to detect segmentation errors was evaluated with the area under the receiver operating characteristic (ROC) curve. Mean DL probabilities of segmentation error in the test sample were 0.90 ± 0.17 vs. 0.12 ± 0.22 (P < 0.001) for scans with and without segmentation errors, respectively. The DL algorithm had an area under the ROC curve of 0.979 (95% CI: 0.974 to 0.984) and an overall accuracy of 92.4%. For the B-scans with severe segmentation errors in the test sample, the DL algorithm was 98.9% sensitive. This algorithm can help clinicians and researchers review images for artifacts in SDOCT tests in a timely manner and avoid inaccurate diagnostic interpretations.Entities:
Mesh:
Year: 2019 PMID: 31285505 PMCID: PMC6614403 DOI: 10.1038/s41598-019-46294-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic and clinical characteristics of the eyes and subjects included in the training and test samples by randomization at the patient level.
| Normal | Suspect | Glaucoma | Overall | |
|---|---|---|---|---|
| Number of eyes | 178 | 291 | 213 | 682 |
| Number of patients | 94 | 160 | 100 | 354 |
| Number of images | 1,900 | 5,897 | 5,334 | 13,131 |
| Number of images with errors | 135 | 394 | 834 | 1,363 |
| Age (years) | 48.9 ± 16.4 | 64.9 ± 11.6 | 70.7 ± 11.1 | 62.3 ± 15.4 |
| Female gender (%) | 54.4 | 58.0 | 49.4 | 54.7 |
| Race (%) | ||||
| Caucasian | 71.3 | 68.1 | 56.0 | 65.5 |
| African-American | 28.7 | 31.9 | 44.0 | 34.5 |
| SAP MD (dB) | 0.04 ± 1.25 | −0.45 ± 1.81 | −6.16 ± 6.2 | −2.70 ± 5.06 |
| SDOCT Average RNFL Thickness (μm) | 98.3 ± 9.8 | 86.4 ± 13.4 | 71.6 ± 16.9 | 82.1 ± 17.3 |
|
| ||||
| Number of eyes | 203 | 284 | 194 | 681 |
| Number of patients | 104 | 153 | 98 | 355 |
| Number of images | 2,787 | 4,727 | 4,605 | 12,119 |
| Number of images with errors | 155 | 225 | 854 | 1,234 |
| Age (years) | 50.0 ± 16.3 | 63.4 ± 12.2 | 69.3 ± 12.1 | 61.1 ± 15.5 |
| Female gender (%) | 57.3 | 61.6 | 51.7 | 57.6 |
| Race (%) | ||||
| Caucasian | 49.0 | 75.8 | 63.3 | 64.5 |
| African-American | 51.0 | 24.2 | 36.7 | 35.5 |
| SAP MD (dB) | −0.06 ± 1.19 | −0.25 ± 1.96 | −6.48 ± 6.71 | −2.58 ± 5.32 |
| SDOCT Average RNFL Thickness (μm) | 97.9 ± 10.6 | 87.3 ± 13.9 | 71.2 ± 17.5 | 83.6 ± 18.1 |
SAP = Standard Automated Perimetry; MD = Mean Deviation; SDOCT = Spectral-Domain Optical Coherence Tomography; RNFL = Retinal Nerve Fibre Layer.
Characteristics of the eyes and subjects in the test sample according to the presence of segmentation errors, as classified by human graders.
| B-scans without segmentation errors (n = 10,885) | B-scans with segmentation errors (n = 1,234) | P-value | |
|---|---|---|---|
| Number of eyes | 629 | 52 | — |
| Number of patients | 327 | 28 | — |
| Age at time of scan (years) | 64.1 ± 13.9 | 69.5 ± 12.4 |
|
| Diagnosis (%) |
| ||
| Normal | 31.8 | 5.8 | |
| Suspect | 42.8 | 28.8 | |
| Glaucoma | 25.4 | 65.4 | |
| Female gender (%) | 58.8 | 42.9 | 0.113b |
| Race (%) | 0.682b | ||
| Caucasian | 65.1 | 60.7 | |
| African-American | 34.9 | 39.3 | |
| SAP MD (dB) | −2.08 ± 4.67 | −6.99 ± 7.98 |
|
| SDOCT Quality score | 27.0 ± 4.7 | 27.2 ± 4.2 | 0.538a |
| SDOCT Average RNFL Thickness (μm) | 85.0 ± 16.5 | 71.2 ± 25.5 |
|
| Mean DL probability of a segmentation error | 0.12 ± 0.22 | 0.90 ± 0.17 |
|
SAP = Standard Automated Perimetry; MD = Mean Deviation; SDOCT = Spectral-Domain Optical Coherence Tomography; RNFL = Retinal Nerve Fibre Layer.
Values given as mean ± standard deviation, unless otherwise noted.
aGeneralized estimating equation; bFisher’s exact test.
Figure 1Spectral-domain optical coherence tomography (SDOCT) B-scans with segmentation errors correctly detected by both human graders and the deep learning algorithm. Class activation maps (heatmaps) on the right show the regions of the B-scans that had greatest weight in the deep learning algorithm classification. Probabilities of segmentation error given by the deep learning algorithm were all above 97.5% for these scans. (a) Segmentation error in the internal limiting membrane. (b) Segmentation error in the inner plexiform layer. (c) Multiple segmentation errors. (d) Small segmentation error.
Figure 2Spectral-domain optical coherence tomography (SDOCT) B-scan with no segmentation error, as labelled by human graders and the deep learning algorithm. Class activation map (heatmap) involves the whole B-scan, rather than being concentrated on a particular area. The probability of segmentation error given by the deep learning algorithm was below 1%.
Figure 3Spectral-domain optical coherence tomography (SDOCT) B-scans with disagreement about the presence of segmentation errors between human graders and the deep learning algorithm. (a) Small segmentation error in the internal limiting membrane occurring at the borders of the image (arrows) missed by the deep learning algorithm. The probability of segmentation error given by the deep learning algorithm was 15%. (b) Small segmentation error (detail) missed by the human labelling but identified by the deep learning algorithm. The probability of segmentation error given by the deep learning algorithm was 91%.