| Literature DB >> 34349130 |
Jane Scheetz1, Dilara Koca1,2, Myra McGuinness1,2, Edith Holloway1,3,4, Zachary Tan1, Zhuoting Zhu5, Rod O'Day1, Sukhpal Sandhu1, Richard J MacIsaac6, Chris Gilfillan7, Angus Turner8, Stuart Keel1, Mingguang He9,10.
Abstract
This study investigated the diagnostic performance, feasibility, and end-user experiences of an artificial intelligence (AI)-assisted diabetic retinopathy (DR) screening model in real-world Australian healthcare settings. The study consisted of two components: (1) DR screening of patients using an AI-assisted system and (2) in-depth interviews with health professionals involved in implementing screening. Participants with type 1 or type 2 diabetes mellitus attending two endocrinology outpatient and three Aboriginal Medical Services clinics between March 2018 and May 2019 were invited to a prospective observational study. A single 45-degree (macula centred), non-stereoscopic, colour retinal image was taken of each eye from participants and were instantly screened for referable DR using a custom offline automated AI system. A total of 236 participants, including 174 from endocrinology and 62 from Aboriginal Medical Services clinics, provided informed consent and 203 (86.0%) were included in the analysis. A total of 33 consenting participants (14%) were excluded from the primary analysis due to ungradable or missing images from small pupils (n = 21, 63.6%), cataract (n = 7, 21.2%), poor fixation (n = 2, 6.1%), technical issues (n = 2, 6.1%), and corneal scarring (n = 1, 3%). The area under the curve, sensitivity, and specificity of the AI system for referable DR were 0.92, 96.9% and 87.7%, respectively. There were 51 disagreements between the reference standard and index test diagnoses, including 29 which were manually graded as ungradable, 21 false positives, and one false negative. A total of 28 participants (11.9%) were referred for follow-up based on new ocular findings, among whom, 15 (53.6%) were able to be contacted and 9 (60%) adhered to referral. Of 207 participants who completed a satisfaction questionnaire, 93.7% specified they were either satisfied or extremely satisfied, and 93.2% specified they would be likely or extremely likely to use this service again. Clinical staff involved in screening most frequently noted that the AI system was easy to use, and the real-time diagnostic report was useful. Our study indicates that AI-assisted DR screening model is accurate and well-accepted by patients and clinicians in endocrinology and indigenous healthcare settings. Future deployments of AI-assisted screening models would require consideration of downstream referral pathways.Entities:
Year: 2021 PMID: 34349130 PMCID: PMC8339059 DOI: 10.1038/s41598-021-94178-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flow chart of screening procedures and follow-up of participants and screening staff.
Figure 2Sample grading report generated by artificial intelligence-based screening system, including diabetic retinopathy status and referral recommendations. Details of the development and validation of this artificial intelligence system have been described previously[11,21,22].
Characteristics of participants included and excluded from primary analysis.
| Characteristic | Complete case set | Excluded | |||
|---|---|---|---|---|---|
| Box Hill Hospital | St Vincent’s Hospital, Melbourne | Derbarl Yerrigan | Total | ||
| Sex, n (%) | |||||
| Male | 46 (49%) | 34 (58%) | 22 (43%) | 102 (50%) | 19 (58%) |
| Female | 47 (51%) | 25 (42%) | 29 (57%) | 101 (50%) | 13 (42%) |
| Age (years)a, median (IQR) | 51 (38–67) | 46 (29–63) | 62 (56–67) | 56 (40–67) | 65 (58–78) |
| Diabetes typea, n (%) | |||||
| Type 1 diabetes | 36 (39%) | 34 (58%) | 0 (0%) | 70 (34%) | 4 (12%) |
| Type 2 diabetes | 55 (61%) | 25 (42%) | 51 (100%) | 130 (65%) | 29 (88%) |
| Diabetes duration (years), median (IQR) | 10 (4–20) | 14 (5–23) | 15 (10–20) | 13 (5–20) | 20 (7–29) |
| Last eye exama, n (%) | |||||
| ≤ 12 months | 61 (75%) | 26 (63%) | 27 (74%) | 134 (71%) | 18 (56%) |
| ≥ 12–24 months | 17 (21%) | 15 (26%) | 10 (20%) | 42 (22%) | 12 (38%) |
| > 24 months | 3 (4%) | 6 (11%) | 3 (6%) | 12 (7%) | 2 (6%) |
| Referable DR, n (%) | |||||
| No | 86 (92%) | 55 (93%) | 32 (63%) | 173 (85%) | – |
| Yes | 7 (8%) | 4 (7%) | 19 (37%) | 30 (15%) | – |
IQR = interquartile range; DR = diabetic retinopathy.
aMissing data for age (n = 1 included, n = 2 excluded), diabetes type (n = 2 included), time since last eye exam (n = 15 included, n = 1 excluded).
Diagnostic accuracy for referable diabetic retinopathy.
| Primary analysis (n = 203) | Sensitivity analysis scenario (n = 236) | ||
|---|---|---|---|
| Complete-case set (95% CI) | Best-case (95% CI) | Worst-case (95% CI) | |
| Sensitivity (%) | 96.9 [83.8–99.9] | 93.5 [78.6–99.2] | 85.9 [75.0–93.4] |
| Specificity (%) | 87.7 [81.8–92.2] | 76.1 [69.7–81.8] | 86.6 [80.6–91.3] |
| Area under the curve | 0.92 [0.88–0.96] | 0.85 [0.80–0.90] | 0.86 [0.81–0.91] |
| Predictive value | |||
| Positive (%) | 59.6 [45.1–73.0] | – | – |
| Negative (%) | 99.3 [96.4–100] | – | – |
CI, confidence interval.
Cross tabulation results between artificial intelligence system and manual grading.
| Manual grading | AI grading | Total | ||
|---|---|---|---|---|
| Non-referable DR | Referable DR | Ungradable | ||
| Non-referable DR | 150 | 21 | 0 | 171 |
| Referable DR | 1 | 31 | 0 | 32 |
| Ungradable | 7 | 22 | 4 | 33 |
| Total | 158 | 74 | 4 | 236 |
DR = diabetic retinopathy.
Sub-group analysis (n = 203).
| Subgroup | Sensitivity (%, 95% CI) | Specificity (%, 95% CI) | AUC (%, 95% CI) |
|---|---|---|---|
| Type 1 | 100.0 [39.8–100.0] | 90.9 [81.3–96.6] | 0.96 [0.92–0.99] |
| Type 2 | 96.4 [81.7–99.9] | 85.3 [76.9–91.5] | 0.91 [0.86–0.96] |
| Endocrinology | 100.0 [71.5–100.0] | 91.5 [85.6–95.5] | 0.96 [0.93–0.98] |
| Aboriginal medical service | 95.2 [76.1–99.9] | 70.0 [4.86–81.4] | 0.83 [0.73–0.92] |
| Canon | 100.0 [59.0–100.0] | 90.7 [82.5–95.9] | 0.95 [0.92–0.98] |
| DRS | 100.0 [63.1–100.0] | 91.4 [82.3–96.8] | 0.96 [0.92–0.99] |
| Topcon Maestro | 94.1 [71.3–99.9] | 53.3 [26.6–78.7] | 0.74 [0.59–0.88] |