| Literature DB >> 35160065 |
Joan M Nunez do Rio1,2, Paul Nderitu1,2, Christos Bergeles3, Sobha Sivaprasad1,4, Gavin S W Tan5,6, Rajiv Raman7.
Abstract
Artificial Intelligence has showcased clear capabilities to automatically grade diabetic retinopathy (DR) on mydriatic retinal images captured by clinical experts on fixed table-top retinal cameras within hospital settings. However, in many low- and middle-income countries, screening for DR revolves around minimally trained field workers using handheld non-mydriatic cameras in community settings. This prospective study evaluated the diagnostic accuracy of a deep learning algorithm developed using mydriatic retinal images by the Singapore Eye Research Institute, commercially available as Zeiss VISUHEALTH-AI DR, on images captured by field workers on a Zeiss Visuscout® 100 non-mydriatic handheld camera from people with diabetes in a house-to-house cross-sectional study across 20 regions in India. A total of 20,489 patient eyes from 11,199 patients were used to evaluate algorithm performance in identifying referable DR, non-referable DR, and gradability. For each category, the algorithm achieved precision values of 29.60 (95% CI 27.40, 31.88), 92.56 (92.13, 92.97), and 58.58 (56.97, 60.19), recall values of 62.69 (59.17, 66.12), 85.65 (85.11, 86.18), and 65.06 (63.40, 66.69), and F-score values of 40.22 (38.25, 42.21), 88.97 (88.62, 89.31), and 61.65 (60.50, 62.80), respectively. Model performance reached 91.22 (90.79, 91.64) sensitivity and 65.06 (63.40, 66.69) specificity at detecting gradability and 72.08 (70.68, 73.46) sensitivity and 85.65 (85.11, 86.18) specificity for the detection of all referable eyes. Algorithm accuracy is dependent on the quality of acquired retinal images, and this is a major limiting step for its global implementation in community non-mydriatic DR screening using handheld cameras. This study highlights the need to develop and train deep learning-based screening tools in such conditions before implementation.Entities:
Keywords: deep learning; diabetic retinopathy; handheld non-mydriatic images
Year: 2022 PMID: 35160065 PMCID: PMC8836386 DOI: 10.3390/jcm11030614
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.964
Patient demographics for each site and patient eye grade distribution.
| Sites | Images, | Images per Patient Eye, | Patients, | Age, | Male, | Female, | Patient Eyes, | Referable, | Non-Referable, | Ungradable, |
|---|---|---|---|---|---|---|---|---|---|---|
|
| 2408 (8.1) | 1.5 (0.7) | 836 (7.5) | 54.2 (10.3) | 402 (48.1) | 434 (51.9) |
| 23 (1.5) | 1404 (89.1) | 149 (9.5) |
|
| 2268 (7.6) | 1.2 (0.5) | 994 (8.9) | 54.1 (10.0) | 537 (54.0) | 456 (45.9) |
| 108 (5.9) | 1513 (82.7) | 208 (11.4) |
|
| 730 (2.5) | 1.3 (0.5) | 436 (3.9) | 55.4 (10.7) | 166 (38.1) | 270 (61.9) |
| 12 (2.1) | 354 (61.5) | 210 (36.5) |
|
| 1308 (4.4) | 1.3 (0.5) | 549 (4.9) | 55.5 (11.1) | 304 (55.4) | 245 (44.6) |
| 14 (1.4) | 906 (87.5) | 116 (11.2) |
|
| 338 (1.1) | 1.4 (0.6) | 170 (1.5) | 54.2 (10.8) | 109 (64.1) | 59 (34.7) |
| 4 (1.7) | 193 (83.5) | 34 (14.7) |
|
| 1315 (4.4) | 1.3 (0.5) | 557 (5.0) | 55.1 (10.1) | 315 (56.6) | 242 (43.4) |
| 50 (4.9) | 880 (85.9) | 94 (9.2) |
|
| 1225 (4.1) | 1.1 (0.4) | 569 (5.1) | 56.4 (11.5) | 355 (62.4) | 214 (37.6) |
| 91 (8.5) | 845 (78.6) | 139 (12.9) |
|
| 2110 (7.1) | 1.2 (0.4) | 943 (8.4) | 61.9 (9.8) | 369 (39.1) | 571 (60.6) |
| 125 (6.9) | 1475 (81.4) | 212 (11.7) |
|
| 2952 (10.0) | 2.1 (0.9) | 752 (6.7) | 57.4 (10.1) | 260 (34.6) | 492 (65.4) |
| 22 (1.5) | 1096 (76.3) | 318 (22.1) |
|
| 3635 (12.3) | 1.9 (0.4) | 1066 (9.5) | 62.9 (10.5) | 416 (39.0) | 650 (61.0) |
| 49 (2.5) | 1141 (58.6) | 756 (38.8) |
|
| 714 (2.4) | 1.9 (1.0) | 212 (1.9) | 56.7 (9.5) | 90 (42.5) | 122 (57.5) |
| 21 (5.7) | 275 (74.7) | 72 (19.6) |
|
| 1668 (5.6) | 1.2 (0.4) | 757 (6.8) | 58.4 (11.4) | 389 (51.4) | 367 (48.5) |
| 12 (0.8) | 1145 (80.2) | 270 (18.9) |
|
| 751 (2.5) | 1.3 (0.5) | 318 (2.8) | 54.4 (10.4) | 133 (41.8) | 185 (58.2) |
| 20 (3.4) | 523 (88.5) | 48 (8.1) |
|
| 703 (2.4) | 1.2 (0.5) | 327 (2.9) | 53.7 (8.1) | 232 (70.9) | 94 (28.7) |
| 22 (3.8) | 549 (93.8) | 14 (2.4) |
|
| 662 (2.2) | 1.4 (0.6) | 286 (2.6) | 55.8 (10.4) | 170 (59.4) | 116 (40.6) |
| 10 (2.1) | 384 (80.0) | 86 (17.9) |
|
| 647 (2.2) | 1.1 (0.4) | 336 (3.0) | 56.3 (9.4) | 184 (54.8) | 151 (44.9) |
| 17 (2.9) | 557 (94.9) | 13 (2.2) |
|
| 1486 (5.0) | 2.0 (0.5) | 380 (3.4) | 57.1 (11.2) | 181 (47.6) | 198 (52.1) |
| 36 (5.0) | 612 (84.4) | 77 (10.6) |
|
| 1092 (3.7) | 1.0 (0.2) | 586 (5.2) | 66.7 (12.3) | 92 (15.7) | 494 (84.3) |
| 17 (1.6) | 685 (65.6) | 342 (32.8) |
|
| 2149 (7.2) | 2.9 (1.2) | 377 (3.4) | 54.5 (9.0) | 235 (62.3) | 142 (37.7) |
| 27 (3.7) | 693 (94.8) | 11 (1.5) |
|
| 1495 (5.0) | 1.1 (0.2) | 748 (6.7) | 59.9 (11.2) | 426 (57.0) | 322 (43.0) |
| 92 (6.5) | 1204 (85.4) | 114 (8.1) |
|
| 29,656 | 1.4 (0.7) | 11,199 | 57.7 (11.1) | 5365 (47.9) | 5824 (52.0) |
| 772 (3.8) | 16,434 (80.2) | 3283 (16.0) |
a Percentages correspond to the final dataset (total). b Mean (SD) and percentages correspond to each site. Age and gender were available for 11,191 and 11,189 patients, respectively.
Patient VISUHEALTH-AI DR performance evaluation by site.
| Precision | Recall | F-Score | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Referable | Non-Referable | Ungradable | Referable | Non-Referable | Ungradable | Referable | Non-Referable | Ungradable | |
|
| 19.30 | 99.70 | 28.32 | 47.83 | 71.01 | 98.66 | 27.50 | 82.95 | 44.01 |
|
| 54.05 | 96.09 | 52.89 | 55.56 | 86.05 | 92.31 | 54.79 | 90.79 | 67.25 |
|
| 19.05 | 87.96 | 61.92 | 33.33 | 68.08 | 82.86 | 24.24 | 76.75 | 70.88 |
|
| 23.26 | 96.84 | 79.25 | 71.43 | 94.81 | 72.41 | 35.09 | 95.82 | 75.68 |
|
| 18.18 | 96.23 | 47.54 | 50.00 | 79.27 | 85.29 | 26.67 | 86.93 | 61.05 |
|
| 32.18 | 97.44 | 45.64 | 56.00 | 82.16 | 94.68 | 40.88 | 89.15 | 61.59 |
|
| 61.90 | 92.87 | 83.33 | 71.43 | 94.08 | 68.35 | 66.33 | 93.47 | 75.10 |
|
| 40.66 | 97.25 | 41.54 | 59.20 | 76.68 | 91.51 | 48.21 | 85.75 | 57.14 |
|
| 13.48 | 90.37 | 83.26 | 86.36 | 89.05 | 56.29 | 23.31 | 89.71 | 67.17 |
|
| 22.73 | 73.65 | 86.85 | 71.43 | 89.66 | 46.3 | 34.48 | 80.87 | 60.4 |
|
| 44.19 | 96.79 | 77.63 | 90.48 | 87.64 | 81.94 | 59.38 | 91.98 | 79.73 |
|
| 9.16 | 90.55 | 93.39 | 100.0 | 92.93 | 41.85 | 16.78 | 91.72 | 57.80 |
|
| 20.69 | 94.33 | 58.97 | 60.00 | 89.10 | 47.92 | 30.77 | 91.64 | 52.87 |
|
| 28.33 | 99.17 | 25.58 | 77.27 | 87.07 | 78.57 | 41.46 | 92.73 | 38.60 |
|
| 28.57 | 88.03 | 72.55 | 80.00 | 91.93 | 43.02 | 42.11 | 89.94 | 54.01 |
|
| 19.70 | 99.16 | 25.58 | 76.47 | 85.10 | 84.62 | 31.33 | 91.59 | 39.29 |
|
| 32.63 | 92.98 | 93.75 | 86.11 | 90.85 | 38.96 | 47.33 | 91.90 | 55.05 |
|
| 20.00 | 84.59 | 83.53 | 88.24 | 88.18 | 62.28 | 32.61 | 86.35 | 71.36 |
|
| 27.17 | 99.67 | 40.00 | 92.59 | 88.31 | 90.91 | 42.02 | 93.65 | 55.56 |
|
| 32.00 | 93.62 | 40.51 | 26.09 | 85.38 | 84.21 | 28.74 | 89.31 | 54.70 |
|
| 29.6 | 92.56 | 58.58 | 62.69 | 85.65 | 65.06 | 40.22 | 88.97 | 61.65 |
VISUHEALTH-AI DR gradability and hospital-referable performance evaluation by site, age category, and visual acuity.
| Gradability | Hospital-Referable | ||||
|---|---|---|---|---|---|
| Sensitivity (95% CI) | Specificity (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) | ||
|
|
| 73.93 (71.57–76.19) | 98.66 (95.24–99.84) | 98.26 (94.99–99.64) | 71.01 (68.56–73.37) |
|
| 89.45 (87.85–90.9) | 92.31 (87.81–95.54) | 83.23 (78.64–87.18) | 86.05 (84.21–87.76) | |
|
| 70.77 (65.81–75.38) | 82.86 (77.07–87.7) | 85.14 (79.76–89.54) | 68.08 (62.95–72.91) | |
|
| 97.61 (96.4–98.5) | 72.41 (63.34–80.3) | 78.46 (70.4–85.19) | 94.81 (93.16–96.16) | |
|
| 83.76 (77.85–88.62) | 85.29 (68.94–95.05) | 84.21 (68.75–93.98) | 79.27 (72.87–84.76) | |
|
| 88.6 (86.38–90.57) | 94.68 (88.02–98.25) | 86.81 (80.16–91.87) | 82.16 (79.47–84.63) | |
|
| 97.97 (96.85–98.77) | 68.35 (59.92–75.97) | 73.48 (67.28–79.06) | 94.08 (92.27–95.58) | |
|
| 82.94 (81.0–84.75) | 91.51 (86.91–94.89) | 90.5 (86.86–93.41) | 76.68 (74.43–78.82) | |
|
| 96.78 (95.57–97.73) | 56.29 (50.64–61.82) | 69.41 (64.21–74.27) | 89.05 (87.05–90.84) | |
|
| 95.55 (94.21–96.65) | 46.3 (42.7–49.93) | 54.53 (51.02–58.01) | 89.66 (87.74–91.36) | |
|
| 94.26 (90.96–96.62) | 81.94 (71.11–90.02) | 91.4 (83.75–96.21) | 87.64 (83.15–91.28) | |
|
| 99.31 (98.64–99.7) | 41.85 (35.9–47.98) | 60.64 (54.67–66.38) | 92.93 (91.28–94.34) | |
|
| 97.05 (95.26–98.31) | 47.92 (33.29–62.81) | 58.82 (46.23–70.63) | 89.10 (86.11–91.64) | |
|
| 94.4 (92.18–96.14) | 78.57 (49.2–95.34) | 88.89 (73.94–96.89) | 87.07 (83.97–89.76) | |
|
| 96.45 (94.11–98.04) | 43.02 (32.39–54.15) | 50.00 (39.62–60.38) | 91.93 (88.74–94.45) | |
|
| 94.43 (92.22–96.16) | 84.62 (54.55–98.08) | 86.67 (69.28–96.24) | 85.1 (81.87–87.95) | |
|
| 99.69 (98.89–99.96) | 38.96 (28.05–50.75) | 62.83 (53.24–71.74) | 90.85 (88.28–93.01) | |
|
| 94.02 (92.0–95.65) | 62.28 (56.91–67.44) | 69.36 (64.31–74.09) | 88.18 (85.52–90.5) | |
|
| 97.92 (96.59–98.83) | 90.91 (58.72–99.77) | 94.74 (82.25–99.36) | 88.31 (85.68–90.61) | |
|
| 89.12 (87.3–90.76) | 84.21 (76.2–90.37) | 66.02 (59.11–72.46) | 85.38 (83.26–87.33) | |
|
|
| 91.61 (88.57–94.05) | 66.67 (34.89–90.08) | 63.64 (40.66–82.8) | 91.41 (88.3–93.91) |
|
| 93.13 (92.64–93.59) | 61.36 (58.51–64.15) | 69.57 (67.28–71.79) | 89.21 (88.6–89.79) | |
|
| 88.14 (87.11–89.12) | 65.41 (62.57–68.17) | 72.72 (70.28–75.06) | 80.09 (78.79–81.35) | |
|
| 86.35 (84.64–87.93) | 69.30 (66.26–72.23) | 75.44 (72.68–78.06) | 74.61 (72.45–76.68) | |
|
|
| 91.41 (90.93–91.87) | 62.15 (60.06–64.21) | 69.39 (67.61–71.12) | 86.73 (86.14–87.3) |
|
| 90.71 (89.59–91.74) | 67.95 (64.81–70.98) | 75.27 (72.62–77.79) | 81.56 (80.04–83.01) | |
|
| 82.46 (74.21–88.94) | 80.82 (69.92–89.1) | 87.06 (78.02–93.36) | 71.57 (61.78–80.06) | |
|
| 78.79 (70.82–85.42) | 81.43 (73.98–87.5) | 86.84 (80.41–91.77) | 65.00 (55.76–73.48) | |
|
| 91.22 (90.79–91.64) | 65.06 (63.40–66.69) | 72.08 (70.68–73.46) | 85.65 (85.11–86.18) | |
Quadratic weighted Kappa scores.
| Quadratic Weighted K (95% CI) | |||
|---|---|---|---|
| Referable DR | Gradability | Hospital Referable | |
| Primary vs. Secondary | 0.60 (0.57, 0.63) | 0.60 (0.58, 0.61) | 0.60 (0.58, 0.61) |
| Final grade (GT) vs. Primary | 0.66 (0.64, 0.69) | 0.67 (0.65, 0.68) | 0.67 (0.66, 0.69) |
| Final grade (GT) vs. Secondary | 0.84 (0.83, 0.86) | 0.84 (0.83, 0.85) | 0.84 (0.84, 0.85) |
| VISUHEALTH-AI DR vs. GT | 0.47 (0.44, 0.50) | 0.54 (0.52, 0.56) | 0.51 (0.50, 0.53) |
Figure 1Image examples of disagreement. (a) Primary grader: ungradable, secondary grader: ungradable, final grade: ungradable, VISUHEALTH-AI DR: referable DR. (b) Primary grader: non-referable DR, secondary grader: ungradable, final grade: non-referable DR, VISUHEALTH-AI DR: ungradable. (c) Primary grader: non-referable DR, secondary grader: ungradable, final grade: ungradable, VISUHEALTH-AI DR: non-referable DR.