| Literature DB >> 32313813 |
Gilbert Lim1,2, Valentina Bellemo2,3, Yuchen Xie2, Xin Q Lee2, Michelle Y T Yip3, Daniel S W Ting2,3,4,5.
Abstract
BACKGROUND: Effective screening is a desirable method for the early detection and successful treatment for diabetic retinopathy, and fundus photography is currently the dominant medium for retinal imaging due to its convenience and accessibility. Manual screening using fundus photographs has however involved considerable costs for patients, clinicians and national health systems, which has limited its application particularly in less-developed countries. The advent of artificial intelligence, and in particular deep learning techniques, has however raised the possibility of widespread automated screening. MAIN TEXT: In this review, we first briefly survey major published advances in retinal analysis using artificial intelligence. We take care to separately describe standard multiple-field fundus photography, and the newer modalities of ultra-wide field photography and smartphone-based photography. Finally, we consider several machine learning concepts that have been particularly relevant to the domain and illustrate their usage with extant works.Entities:
Keywords: Artificial intelligence; Deep learning; Diabetic retinopathy; Fundus photographs; Retinal imaging modalities; Survey
Year: 2020 PMID: 32313813 PMCID: PMC7155252 DOI: 10.1186/s40662-020-00182-7
Source DB: PubMed Journal: Eye Vis (Lond) ISSN: 2326-0254
Fig. 1Examples of retinal fundus images
Fig. 2Comparison of standard view and ultra-wide field retinal images with and without referable diabetic retinopathy
Summary of the major publications in retinal analysis using DL, grouped by standard multiple-field fundus photography, ultra-wide field photography and smartphone-based photography
| Authors and year of publication | Approach | Training dataset | Validation datasets | Performance |
|---|---|---|---|---|
| Standard view photography | ||||
| Gulshan et al. 2016 [ | Inception-V3 network | Public EyePACS and Messidor-2 (> 120,000 images) | Public EyePACS-1 and Messidor-2 (> 10,000 images) | EyePACS-1 |
| AUC: 0.99 | ||||
| Sensitivity: 90% | ||||
| Specificity: 98% | ||||
| Messidor-2 | ||||
| AUC: 0.99 | ||||
| Sensitivity: 87% | ||||
| Specificity: 99% | ||||
| Abràmoff et al. 2016 [ | AlexNet/VGG network | Public Messidor-2 | Public Messidor-2 (~ 2000 images) | AUC: 0.98 |
| Sensitivity: 97% | ||||
| Specificity: 87% | ||||
| Ting et al. 2017 [ | VGGNet-19 network | Proprietary SiDRP 2010–2013 (> 76,000 images) | Proprietary SiDRP 14–15 and 10 others (> 112,000 images) | SiDRP 2014–2015 |
| AUC: 0.93 | ||||
| Sensitivity: 91% | ||||
| Specificity: 92% | ||||
| Others | ||||
| AUC range: 0.89 to 0.98 | ||||
| Sensitivity range: 92 to 100% | ||||
| Specificity: 76 to 92% | ||||
| Gargeya et al. 2017 [ | Customised CNN network | Public EyePACS-1 (> 75,000 images) | Public EyePACS-1, Messidor-2,E-Ophtha (> 17,000 images) | EyePACS-1 |
| AUC: 0.97 | ||||
| Sensitivity: 94% | ||||
| Specificity: 96% | ||||
| Messidor-2 and E-Ophtha | ||||
| AUC range: 0.83 to 0.95 | ||||
| Sensitivity range: 74 to 93% | ||||
| Specificity range: 87 to 94% | ||||
| Abràmoff et al. 2018 [ | AlexNet/VGGNet network | Public Messidor-2 | Proprietary Primary care sites (~ 900 patients) | Sensitivity: 87% |
| Specificity: 91% | ||||
| Keel et al. 2018 [ | Inception-V3 network | Public LabelMe (~ 59,000) | Proprietary Endocrinology outpatient services (96 patients) | Sensitivity: 92% |
| Specificity: 94% | ||||
| Kanagasingam et al. 2018 [ | Inception-V3 network | Public and proprietary DiaRetDB1, EyePACS, Australian tele-eye care (30,000 images) | Proprietary Primary care (~ 200 patients) | Sensitivity: 92% |
| Gulshan et al. 2019 [ | Inception-v4 network | Public EyePACS and Messidor-2 (> 144,000 images) | Proprietary Two eye hospitals (~ 6000 images) | AUC range: 0.97 to 0.98 |
| Sensitivity range: 89 to 92% | ||||
| Specificity range: 92 to 95% | ||||
| Raumviboonsuk et al. 2019 [ | Inception-v4 network | Public EyePACS and Messidor-2 (> 120,000 images) | Proprietary Hospitals and health centers (~ 30,000 images) | AUC: 0.99 |
| Sensitivity: 96.9% | ||||
| Specificity: 95.3% | ||||
| Bellemo et al. 2019 [ | VGGNet/ResNet network | Proprietary SiDRP 2010–2013 (> 76,000 images) | Proprietary Mobile screening unit (> 4000 images) | AUC: 0.97 |
| Sensitivity: 92% | ||||
| Specificity: 89% | ||||
| Ultra-wide field photography | ||||
| Wang et al. 2018 [ | EyeArt software | – | Proprietary Eye clinics (~ 1500 images) | AUC: 0.85 |
| Sensitivity: 90% | ||||
| Specificity: 54% | ||||
| Nagasawa at al. 2019 [ | VGGNet-16 network | Proprietary Hospitals (< 400 images) | Proprietary Hospitals (< 400 images) | AUC: 0.97 |
| Sensitivity: 95% | ||||
| Specificity: 97% | ||||
| Smartphone-based photography | ||||
| Rajalakshmi et al. 2018 [ | EyeArt software | – | Proprietary Tertiary care diabetes hospital (~ 300 images) | Sensitivity: 96% |
| Specificity: 80% | ||||
| Natarajan et al. 2019 [ | Remidio software Inception-V3 network | Public and proprietary EyePACS and hospitals (> 52,000 images) | Proprietary Population-based screening (> 4000 images) | Sensitivity range: 96 to 100% |
| Specificity range: 79 to 88% | ||||
| Rogers et al. 2019 [ | Pegasus software | – | Public and proprietary IDRiD and research laboratory study (> 6000 images) | AUC range: 89 to 99% |
| Sensitivity range: 82 to 93% | ||||
| Specificity range: 82 to 94% | ||||
Major deep learning model architecture families and characteristics. Note that there may be multiple variants (usually with different number of layers/parameters) within each architecture family
| Architecture family | Original year | Parameters | Layers | Module organization | Example application(s) |
|---|---|---|---|---|---|
| AlexNet | 2012 | ~ 60 million | 8 | Convolutional, Max Pooling | Abràmoff et al. [ Quellec et al. [ |
| VGGNet | 2014 | ~ 180 million | 19 | Convolutional, Max Pooling | Abràmoff et al. [ |
| GoogLeNet (also Inception v1) | 2015 | ~ 7 million | 22 | Inception, Pool+Concat | Takahashi et al. [ |
| Inception (v3) | 2015 | ~ 24 million | 42 | Inception, Pool+Concat | Gulshan et al. [ |
| ResNet | 2016 | ~ 60 million | 152 | Convolutional, Skip Connections | Bellemo et al. [ |
| Inception-ResNet (v2) | 2016 | ~ 56 million | 164 | Residual Inception | – |
| SqueezeNet | 2016 | ~ 1.2 million(before pruning) | 14 | 1 × 1 Convolutional, Squeeze & Expand Layers | – |
| ResNeXt | 2017 | ~ 25 million | 50 | Convolutional (Grouped) | – |
| DenseNet | 2017 | ~ 20 million | 201 | Dense, Transition | – |
Fig. 3Basic transfer learning method. A deep learning model is first trained on some general task. This trained model is then trained on the actual target medical task, possibly with the parameters for earlier layers representing low-level features frozen
Fig. 4AI flow for diabetic retinopathy. In the diabetic retinopathy screening domain, the AI implementation allows automated diagnosis and subsequent clinical decisions. In the example presented in the figure, the AI system would recommend referring the patient to the eye clinic because of the referable diagnosis for diabetic retinopathy. To allow researchers and clinicians determine how the AI model makes the decision, the heatmap attempts to display the contribution of each image pixel or region, to the final classification. Heatmaps open the ‘black box’ highlighting the areas in which the AI system is focusing on to build trust among practitioners and patients. Abbreviations: DR; diabetic retinopathy; NPDR: non-proliferative diabetic retinopathy; PDR: proliferative diabetic retinopathy