| Literature DB >> 35567610 |
Octavi Font1, Jordina Torrents-Barrena2, Dídac Royo1, Sandra Banderas García3,4, Javier Zarranz-Ventura5,6, Anniken Bures1,7, Cecilia Salinas1,7, Miguel Ángel Zapata1,8.
Abstract
PURPOSE: This study aims to evaluate the ability of an autonomous artificial intelligence (AI) system for detection of the most common central retinal pathologies in fundus photography.Entities:
Keywords: Age-related macular degeneration; Artificial intelligence; Diabetic retinopathy; Retinography; Screening
Mesh:
Year: 2022 PMID: 35567610 PMCID: PMC9477940 DOI: 10.1007/s00417-022-05653-2
Source DB: PubMed Journal: Graefes Arch Clin Exp Ophthalmol ISSN: 0721-832X Impact factor: 3.535
Breakdown of the study validation datasets, as well as the training datasets for each of the AI algorithms that compose the screening system. (M/F male/female, LE/RE left eye/right eye)
| Dataset | Age median (std) (range) | Abnormal (%) | AMD (%) | DR (%) | GON (%) | Nevus (%) | Other (%) | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Study validation | ||||||||||
| Original | 2839 (1786 M/1053 F) | 43 (11.52) [5–87] | 5483 (2747 LE/2736 RE) | 5918 | 321 (5.42%) | 14 (0.23%) | 0 (0%) | 107 (1.81%) | 110 (1.86%) | 90 (1.5%) |
| Original + enriched | 3337 (1999 M/1338 F) | 46 (15.22) [5–96] | 6009 (3013 LE/2996 RE) | 6452 | 855 (13.25%) | 398 (6.17%) | 150 (2.32%) | 107 (1.66%) | 110 (1.7%) | 90 (1.4%) |
| Algorithm training | ||||||||||
| AMD | – (982 M/1526 F) | 75 (11.52) [5–98] | – (1761 LE/1945 RE) | 7218 | – | 4859 (67.31%) | – | – | – | – |
| DR | 71,455 (26,691 M/19,296 F) | 66 (11.42) [11–99] | 116,501 (60,947 LE/55,554 RE) | 139,813 | – | – | 14,376 (10.28%) | – | – | – |
| Glaucoma | 1206 (619 M/587 F) | 54 (15.58) [5–96] | 1738 (794 LE/944 RE) | 2366 | – | – | – | 1168 (49.37%) | – | – |
| GON | 18,750 (8.215 M/10,535 F) | 42 (11.64) [5–96] | 29,352 (15,633 LE/13,719 RE) | 30,054 | – | – | – | – | 4470 (14.87%) | – |
| Abnormality | 31,877 (13,843 M/18,034 F) | 49 (17.33) [5–99] | 52,791 (27,846 LE/24,945 RE) | 53,194 | 17,433 (32.77%) | – | – | – | – | – |
aThe total number of participants as well as the total number of eyes for the AMD training dataset could not be obtained, since the non-pathological images used for training were not assigned to a patient. The reported numbers (M/F, LE/RE, age) are from the pathological cases, which were referenced to a patient
bThe sex split and age median only accounts for 65% of the dataset. One of the image sources for this dataset did not include sex nor age information per patient
Fig. 1Labeling flowchart. The flowchart depicts the 2-tiered approach followed by all specialists to label the dataset. The ground truth was agreed by at least 2 graders
Fig. 2Algorithm execution flowchart. The predictions are performed at the image level. 5 neural networks process independently each image and in case any algorithm is positive, the screened image is classified as “abnormal”
Summary of sensitivity, specificity, and AUC aggregated and per individual algorithm
| Algorithm | Sensitivity (95% CI) [ | Specificity (95% CI) [ |
|---|---|---|
| Combined algorithm: | ||
| Abnormality | 92.9% (91.0, 94.6) [< 0.001]a | 86.8% (85.8, 87.7) [< 0.001]b |
| Single NN algorithms: | ||
| Abnormality | 83.4% (80.6, 85.9) | 93.4% (92.7, 94.0) |
| AMD | 93.8% (91.6, 96.3) | 95.7% (95.2, 96.2) |
| DR | 81.1% (75.3, 88.1) | 94.8% (94.1, 95.4) |
| Glaucoma | 53.6% (43.1, 63.1) | 95.7% (95.2, 96.2) |
| Nevus | 86.7% (80.7, 94.0) | 90.7% (90.1, 91.5) |
ap-value for sensitivity on the combined abnormality algorithm was computed using a one-sided tailed binomial test using a sensitivity of p = 0.75 as the null hypothesis
bp-value for specificity on the combined abnormality algorithm was computed using a one-sided tailed binomial test using a sensitivity of p = 0.775 as the null hypothesis
Fig. 3Receiver operating curve for the combined and individual algorithms