| Literature DB >> 32879754 |
Lutfiah Al Turk1, Su Wang2, Paul Krause2, James Wawrzynski3, George M Saleh3, Hend Alsawadi4, Abdulrahman Zaid Alshamrani5, Tunde Peto6, Andrew Bastawrous7, Jingren Li8, Hongying Lilian Tang2.
Abstract
Purpose: The aim of this work is to demonstrate how a retinal image analysis system, DAPHNE, supports the optimization of diabetic retinopathy (DR) screening programs for grading color fundus photography. Method: Retinal image sets, graded by trained and certified human graders, were acquired from Saudi Arabia, China, and Kenya. Each image was subsequently analyzed by the DAPHNE automated software. The sensitivity, specificity, and positive and negative predictive values for the detection of referable DR or diabetic macular edema were evaluated, taking human grading or clinical assessment outcomes to be the gold standard. The automated software's ability to identify co-pathology and to correctly label DR lesions was also assessed.Entities:
Keywords: AI algorithm; deep learning; diabetes; diabetic retinopathy; lesion detection
Mesh:
Year: 2020 PMID: 32879754 PMCID: PMC7443119 DOI: 10.1167/tvst.9.2.44
Source DB: PubMed Journal: Transl Vis Sci Technol ISSN: 2164-2591 Impact factor: 3.283
Figure 1.Top row: images from the same eye taken on October 26, 2015 (baseline), November 3, 2015 (dot hemorrhage), and March 8, 2016 (MA, hemorrhage and preretinal hemorrhage). Bottom row: the comparison of morphological changes for DR signs between a baseline image (October 26, 2015) and a follow-up retinal image (March 8, 2016).
An Overview of the Training and Internal and External Validation Test Sets
| 0 | 1 | 2 | 3 | 4 | |
|---|---|---|---|---|---|
| ICDRS | |||||
| Training samples on DAPHNE classifier using 28,100 from Kaggle | 20647 | 1955 | 4234 | 698 | 566 |
| Internal validation (7024 from Kaggle) | 5161 | 488 | 1058 | 175 | 142 |
| External validation datasets | |||||
| Kenya | 11479 | 9463 | 3395 | 329 | 34 |
| NSC | R0 | R1 | R2 | R3 | — |
| Additional training samples on DAPHNE classifier (only its distribution in NSC is shown for simplicity) | 3659 | 346 | 750 | 224 | — |
| External validation datasets | |||||
| NSC | R0 | R1 | R2 | R3 | — |
| China | 9986 | 3279 | 1240 | 495 | — |
| Saudi Arabia | 7451 | 1854 | 582 | 139 | — |
0, no DR; 1, mild; 2, moderate; 3, severe; 4, proliferative; R0, no DR; R1; background; R2, preproliferative; R3, proliferative.
DAPHNE's Performance on External Validations: (a) Sensitivity, Specificity and Corresponding 95% CIs for Referral Level Output to Detect Referral, PDR and DME, and PDR Level Output to Detect PDR on the Kenya Dataset
| Disease Level | Daphne Predicted Results | Sensitivity | Specificity |
|---|---|---|---|
| Referral vs Non- Referral | Referral | 94.28% (93.1%–95.22%) | 92.12% (88.27%–93.33%) |
| PDR | 100% (95.5%–100%) | — | |
| DME | — | ||
| PDR vs Non-PDR | PDR | 97.35% (92.3%–99.7%) | 85.78% (83.2%–87.81%) |
DAPHNE's Performance on External Validations: Sensitivity, Specificity and Corresponding 95% CIs for Referral Level Output to Detect Referral, PDR and DME, and PDR Level Output to Detect PDR on the China Dataset
| Disease Level | Daphne Predicted Results | Sensitivity | Specificity |
|---|---|---|---|
| Referral vs. Non- Referral | Referral | 95.51% (93.1%–97.50%) | 91.11% (85.11%–92.63%) |
| PDR | 100% (95.8%–100%) | — | |
| DME | — | ||
| PDR vs. Non-PDR | PDR | 97.18% (91.2%–99.6%) | 87.77% (85.3%–88.80%) |
Figure 2.The detected results of DR progression changes over a five-year period: First and second rows: from normal images (R0) to background retinopathy (R1); last row: from preproliferative retinopathy (R2) to stable treated proliferative retinopathy (R3s). First column: baseline images; second column: follow-up fundus images.
Figure 3.The detected results of DR progression changes within one month (between images columns 1 and 2) and two to four months (between images in columns 2 and 3). Each row shows images from one patient.
DAPHNE's Performance on External Validations: Sensitivity, Specificity and Corresponding 95% CIs for Referral Level Output to Detect Referral, PDR and DME, and PDR Level Output to Detect PDR on the Messidor-2 Dataset
| Disease Level | Daphne Predicted Results | Sensitivity | Specificity |
|---|---|---|---|
| Referral vs. Nonreferral | Referral | 95.8% (94%–97.42%) | 91.32% (86.7%–93.53%) |
| PDR | 100% (96.5%–100%) | — | |
| DME | 100% (95.8%–100%) | — | |
| PDR vs. Non-PDR | PDR | 98.55% (91.3%–99.6%) | 86.78% (84.2%–88.8%) |
DAPHNE's Performance on External Validations: Sensitivity, Specificity and Corresponding 95% CIs for Referral Level Output to Detect Referral, PDR and DME, and PDR Level Output to Detect PDR on the Saudi Arabian Dataset
| Disease Level | Daphne Predicted Results | Sensitivity | Specificity |
|---|---|---|---|
| Referral vs. Non- Referral | Referral | 97.1% (95.1%–97.25%) | 90.33% (85.71%–92.17%) |
| PDR | 100% (94.5%–100%) | — | |
| DME | 100% (94.5%–100%) | — | |
| PDR vs. Non-PDR | PDR | 98.23% (93.3%–99.6%) | 83.78% (82.12%–88.87%) |
The ground truth of DME was obtained from eye clinic, to assess the detection of DME markers by the DAPHNE detector.