| Literature DB >> 35652075 |
Wen-Fei Zhang1,2, Dong-Hong Li3, Qi-Jie Wei3, Da-Yong Ding3, Li-Hui Meng1,2, Yue-Lin Wang1,2, Xin-Yu Zhao1,2, You-Xin Chen1,2.
Abstract
Purpose: To evaluate the performance of a deep learning (DL)-based artificial intelligence (AI) hierarchical diagnosis software, EyeWisdom V1 for diabetic retinopathy (DR). Materials andEntities:
Keywords: artificial intelligence; diabetic retinopathy; eye wisdom V1; sensitivity; specificity; validation
Year: 2022 PMID: 35652075 PMCID: PMC9148973 DOI: 10.3389/fmed.2022.839088
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
FIGURE 1The network architecture of EyeWisdom V1.
FIGURE 2The work flowchart of fundus image grading.
Distribution of manual grading and AI software grading for DR.
| AI grading | Manual grading | Total | ||||
| 0 | 1 | 2 | 3 | 4 | ||
| 0 | 306 | 2 | 9 | 0 | 1 | 318(29.20%) |
| 1 | 84 | 41 | 34 | 0 | 0 | 159(14.60%) |
| 2 | 21 | 9 | 310 | 40 | 9 | 389(35.70%) |
| 3 | 0 | 0 | 15 | 110 | 14 | 139(12.80%) |
| 4 | 0 | 1 | 1 | 3 | 79 | 84(7.70%) |
| Total | 411 | 53 | 369 | 153 | 103 | 1089 |
AI = artificial intelligence, DR = diabetic retinopathy
Performance of EyeWisdom V1 in comparison to manual grading.
| Category | Any DR detection(95% CI) | Referable DR detection(95% CI) | STDR detection(95% CI) |
| Sensitivity | 98.23% (96.93% ∼ 99.08%) | 92.96% (90.66% ∼ 94.84%) | 80.47% (75.07% ∼ 85.14%) |
| Specificity | 74.45% (69.95% ∼ 78.60%) | 93.32% (90.65% ∼ 95.42%) | 97.96% (96.75% ∼ 98.81%) |
| PPV | 86.38% (83.76% ∼ 88.72%) | 94.93% (92.89% ∼ 96.53%) | 92.38% (88.07% ∼ 95.50%) |
| NPV | 96.23% (93.50% ∼ 98.04%) | 90.78% (87.81% ∼ 93.22%) | 94.23% (92.46% ∼ 95.68%) |
AI = artificial intelligence, CI = confidence interval, DR = diabetic retinopathy, NPV = negative predictive values, PPV = positive predictive values, STDR = sight threatening DR.
The DAR and kappa between AI software, every grader (A,B,C) and gold standard for detecting any DR and referable DR.
| Any DR detection(95% CI) | Referable DR detection(95% CI) | |||
| DAR | Kappa | DAR | Kappa | |
| Grader A | 92.19% (90.44% ∼ 93.72%) | 0.930 (0.915 ∼ 0.944) | 97.06% (95.88% ∼ 97.98%) | 0.940 (0.927 ∼ 0.953) |
| Grader B | 88.61% (86.58% ∼ 90.44%) | 0.927 (0.912 ∼ 0.942) | 96.32% (95.03% ∼ 97.36%) | 0.929 (0.915 ∼ 0.943) |
| Grader C | 80.53% (78.05% ∼ 82.85%) | 0.804 (0.779 ∼ 0.828) | 92.47% (90.74% ∼ 93.97%) | 0.844 (0.823 ∼ 0.865) |
| AI group | 89.30% (87.30% ∼ 91.00%) | 0.761 (0.734 ∼ 0.787) | 93.11% (91.44% ∼ 94.54%) | 0.860 (0.827 ∼ 0.890) |
AI = artificial intelligence, CI = confidence interval, DAR = diagnostic accordance rate, DR = diabetic retinopathy.
FIGURE 3The AUC of three graders and EyeWisdom V1 for referable DR.
Diagnostic results of different fundus cameras.
| Category | Topcon TRC-NW6S | Cannon CR2 | KOWA Nonmyd α -DIII 8300 |
| Consistent with the gold standard | 219(73.7) | 353(79.7) | 274(78.5) |
| Inconsistent with the gold standard | 78(26.3) | 90(37.0) | 75(21.5) |
| Total | 297(100) | 443(100) | 349(100) |
Several state-of-the-art deep learning models for DR classification.
| DL Systems | Algorithm | Training data set | Test data set | AUC | Sensitivity | Specificity | Aim of detection |
| Abràmoff et al. ( | AlexNet/VGGNet | Messidor-2 | 10 primary care practice sites from the United States | NA | 87 | 91 | mtmDR |
| Gulshan et al. ( | Inception-V3 | EyePACS, Messidor-2 | Messidor-2 | 0.99 | 87 | 99 | referable DR, operating cut point with high specificity |
| 96 | 94 | referable DR, operating cut point with high sensitivity | |||||
| EyePACS-1 | 0.99 | 90 | 98 | referable DR, operating cut point with high specificity | |||
| 98 | 93 | referable DR, operating cut point with high sensitivity | |||||
| Zhang et al. ( | Inception-V3 | SAMS, SPPH | SAMS, SPPH | 0.98 | 98 | 98 | referable DR |
| Gulshan et al. ( | Inception-V4 | EyePACS, Messidor-2 | Aravind | 0.96 | 90 | 92 | referable DR |
| Sankara | 0.98 | 92 | 95 | referable DR | |||
| Bellemo et al. ( | VGGNet/ResNet | SiDRP 2010-2013 | Zambia mobile screening | 0.97 | 92 | 89 | referable DR |
| 0.98 | 92 | 95 | STDR | ||||
| Sayres et al. ( | Inception-V4 | EyePACS, 3 eye hospitals of India | EyePACS2 | 0.88 | 92 | 95 | referable DR |
DL = deep learning, mtmDR = more than mild DR (ETDRS level 35 or higher and/or DNE), NA = not available, SAMS = The Sichuan Academy of Medical Sciences, SiDRP = Singapore Integrated Diabetic Retinopathy Screening program, SPPH = Sichuan Provincial Peoples Hospital, STDR = severe nonproliferative DR or worse.