| Literature DB >> 34027334 |
Li Dong1, Qiong Yang1, Rui Heng Zhang1, Wen Bin Wei1.
Abstract
BACKGROUND: Age-related macular degeneration (AMD) is one of the leading causes of vision loss in the elderly population. The application of artificial intelligence (AI) provides convenience for the diagnosis of AMD. This systematic review and meta-analysis aimed to quantify the performance of AI in detecting AMD in fundus photographs.Entities:
Keywords: Agerelated macular degeneration; Algorithm; Artificial intelligence; Convolutional neural networks; Deep learning
Year: 2021 PMID: 34027334 PMCID: PMC8129891 DOI: 10.1016/j.eclinm.2021.100875
Source DB: PubMed Journal: EClinicalMedicine ISSN: 2589-5370
Definition of referable AMD and non-referable AMD in this study.
| Category | Stage | Definition | Classification |
|---|---|---|---|
| 1 | No AMD | No drusen or only small drusen ≤63 μm, and no pigment abnormalities | Non-referable AMD |
| 2 | Early AMD | Medium drusen >63 μm and ≤125 μm, and no pigment abnormalities | |
| 3 | Intermediate AMD | Large drusen >125 μm or any pigment abnormalities | Referable AMD |
| 4 | Advanced AMD | Neovascular AMD or geographical atrophy |
AMD: age-related macular degeneration.
Fig. 1Flow diagram of literature selection.
Basic characteristics of the included studies.
| First author | Publication year | Country | Database | Total images | Method | Outcome | Classification | Performance |
|---|---|---|---|---|---|---|---|---|
| Keenan | 2019 | United States | AREDS | 59,812 | CNN | Dry AMD | Disease/no disease | ACC: 0.965; AUC: 0.976 |
| Zapata | 2020 | Spain | Optretina | 306,302 | CNN | AMD | Disease/no disease | ACC: 0.863; AUC: 0.936 |
| Zheng | 2012 | United Kingdom | ARIA, STARE | 258 | SVM | AMD | Disease/no disease | ACC: 0.996 |
| Kunumpol | 2017 | Thailand | STARE | 106 | ANN | AMD | Disease/no disease | ACC: 0.989 |
| Mookiah | 2014a | Singapore | Private dataset | 540 | SVM | Dry AMD | Disease/no disease | ACC: 0.937 |
| Keel | 2019 | Australia | LabelMe | 56,113 | CNN | Wet AMD | Disease/no disease | ACC: 0.965; AUC: 0.995 |
| González-Gonzalo | 2019 | The Netherlands | 1. DR…-AMD | 134,421 | CNN | Referable AMD | Disease/no disease | ACC1: 0.880; AUC1: 0.949 |
| Burlina | 2017a | United States | AREDS | 133,821 | CNN | Referable AMD | Disease/no disease | ACC: 0.916; AUC: 0.96 |
| Burlina | 2017b | United States | AREDS | 5664 | CNN | Referable AMD | 1. Disease/no disease | ACC1: 0.934 |
| Ting | 2017 | Singapore | SIDRP | 108,558 | CNN | Referable AMD | Disease/no disease | ACC: 0.888; AUC: 0.932 |
| Kankanahalli | 2013 | United States | AREDS | 2772 | CNN | Referable AMD | 1. Disease/no disease | ACC1: 0.955 |
| Burlina | 2011 | United States | Private dataset | 66 | SVM | AMD | Disease/no disease | ACC: 0.955 |
| Bhuiyan | 2020 | United States | AREDS | 116,875 | CNN | 1. Referable AMD | 1. Disease/no disease | ACC1: 0.992 |
| Phan | 2016 | Canada | Private dataset | 279 | SVM, RF | 1. AMD | Disease/no disease | AUC1: 0.877 |
| Govindaiah | 2018 | United States | AREDS | 116,875 | CNN | 1. Referable AMD | 1. Disease/no disease | ACC1: 0.953 |
| Grassmann | 2018 | German | AREDS | 120,656 | CNN | AMD | AMD severity (13 classes) | ACC: 0.633 |
| Mookiah | 2014b | Singapore | 1. Private dataset | 784 | SMV | AMD | AMD severity (4 classes) | ACC1: 0.902 |
| Peng | 2019 | United States | AREDS | 59,302 | CNN | AMD | AMD severity (6 classes) | ACC: 0.671 |
| Mookiah | 2015 | Singapore | 1. Private dataset | 784 | SMV | AMD | AMD severity (4 classes) | ACC1: 0.935 |
AREDS: Age-Related Eye Disease Study, CNN: convolutional neural networks, AMD: age-related macular disease, ACC: Accuracy, AUC: area under curve, ARIA: Automated Retinal Image Analysis, STARE: Structured Analysis of the Retina, SVM: support vector machine, ANN: artificial neural network, DR…: diabetic retinopathy, SIDRP: Singapore National Diabetic Retinopathy Screening Program, RF: random forest.
Referable AMD was defined as intermediate and advanced AMD.
Fig. 2Bias assessment of the included studies via Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool.
Fig. 3Performance of artificial intelligence for the detection of AMD. -Fig. 3A. The pooled area under the receiver operating characteristic curve (AUROC) was 0.983 (95% CI: 0.979–0.987). -Fig. 3B. The pooled sensitivity was 0.88 (95% CI: 0.88–0.88). -Fig. 3C. The pooled specificity was 0.90 (95% CI: 0.90–0.91). -Fig. 3D. The pooled diagnostic odds ratio was 275.27 (95% CI: 158.43–478.27).
Fig. 4Performance of the convolutional neural network (CNN) models for the detection of AMD in the AREDS database. -Fig. 4A. The pooled area under the receiver operating characteristic curve (AUROC) was 0.983 (95% CI: 0.978–0.988). -Fig. 4B. The pooled sensitivity was 0.88 (95% CI: 0.88–0.88). -Fig. 4C. The pooled specificity was 0.91 (95% CI: 0.91–0.91). -Fig. 4D. The pooled diagnostic odds ratio was 273.14 (95% CI: 130.79–570.43).
Subgroup analysis showing the performance of the artificial intelligence for the detection of AMD.
| Variables | No. of study | AUC (95% CI) | Sensitivity (95% CI) | Specificity (95% CI) | LR+ (95% CI) | LR- (95% CI) | DOR (95% CI) | Heterogeneity for DOR | |
|---|---|---|---|---|---|---|---|---|---|
| I2,% | |||||||||
| Methods | |||||||||
| CNN | 9 | 0.980 (0.975–0.985) | 0.88 (0.88–0.88) | 0.90 (0.90–0.91) | 16.0 (11.1–23.1) | 0.07 (0.06–0.10) | 225 (123–409) | 99.7 | <0.001 |
| SVM | 3 | 0.994 (0.988–1.000) | 0.94 (0.92–0.96) | 0.97 (0.95–0.99) | 30.9 (11.9–79.8) | 0.04 (0.01–0.16) | 917 (97–8861) | 71.4 | 0.030 |
| Images for validation | |||||||||
| <500 | 3 | 0.997 (0.995–1.000) | 0.99 (0.96–1.00) | 0.99 (0.97–1.00) | 54.6 (13.9–215.0) | 0.02 (0.01–0.07) | 2656 (286–24,635) | 42.0 | 0.178 |
| 500–5000 | 4 | 0.982 (0.970–0.994) | 0.93 (0.92–0.94) | 0.93 (0.93–0.94) | 16.3 (5.7–46.8) | 0.07 (0.03–0.13) | 252 (50–1274) | 98.1 | <0.001 |
| >5000 | 6 | 0.980 (0.974–0.985) | 0.88 (0.88–0.88) | 0.90 (0.90–0.91) | 17.0 (10.9–26.5) | 0.08 (0.06–0.11) | 216 (105–445) | 99.8 | <0.001 |
| Outcomes | |||||||||
| AMD | 4 | 0.993 (0.984–1.000) | 0.92 (0.90–0.93) | 0.85 (0.83–0.87) | 29.2 (3.4–248.4) | 0.04 (0.01–0.14) | 853 (39–18,403) | 88.2 | <0.001 |
| Referable AMD | 6 | 0.983 (0.978–0.988) | 0.88 (0.88–0.88) | 0.90 (0.90–0.90) | 15.8 (10.2–24.3) | 0.06 (0.05–0.08) | 276 (132–579) | 99.8 | <0.001 |
| Publication year | |||||||||
| Before 2015 | 4 | 0.989 (0.985–0.993) | 0.95 (0.94–0.96) | 0.96 (0.95–0.97) | 22.7 (16.9–30.3) | 0.05 (0.03–0.10) | 474 (197–1142) | 58.1 | 0.067 |
| After 2015 | 9 | 0.980 (0.975–0.985) | 0.88 (0.88–0.88) | 0.90 (0.90–0.91) | 15.9 (10.8–23.3) | 0.08 (0.06–0.10) | 224 (120–418) | 99.7 | <0.001 |
| Regions | |||||||||
| Asian countries | 3 | 0.979 (0.970–0.988) | 0.93 (0.91–0.94) | 0.89 (0.88–0.89) | 20.6 (2.6–163.5) | 0.08 (0.06–0.11) | 212 (73–613) | 77.9 | 0.011 |
| The western countries | 10 | 0.984 (0.980–0.988) | 0.88 (0.88–0.88) | 0.91 (0.91–0.91) | 19.0 (12.0–30.2) | 0.07 (0.05–0.09) | 288 (155–536) | 99.7 | <0.001 |
AUC: area under curve, CI: confidence interval, LR+: positive likelihood ratio, LR-: negative likelihood ratio, DOR: diagnostic odds ratio, CNN: convolutional neural networks, SVM: support vector machine.
Studies detecting dry-AMD only or wet-AMD only were excluded in this analysis.
Referable AMD was defined as intermediate and advanced AMD.