| Literature DB >> 35083634 |
Wei Xiang Lim1, ZhiYuan Chen2, Amr Ahmed2.
Abstract
Diabetic retinopathy (DR) is a chronic eye condition that is rapidly growing due to the prevalence of diabetes. There are challenges such as the dearth of ophthalmologists, healthcare resources, and facilities that are unable to provide patients with appropriate eye screening services. As a result, deep learning (DL) has the potential to play a critical role as a powerful automated diagnostic tool in the field of ophthalmology, particularly in the early detection of DR when compared to traditional detection techniques. The DL models are known as black boxes, despite the fact that they are widely adopted. They make no attempt to explain how the model learns representations or why it makes a particular prediction. Due to the black box design architecture, DL methods make it difficult for intended end-users like ophthalmologists to grasp how the models function, preventing model acceptance for clinical usage. Recently, several studies on the interpretability of DL methods used in DR-related tasks such as DR classification and segmentation have been published. The goal of this paper is to provide a detailed overview of interpretability strategies used in DR-related tasks. This paper also includes the authors' insights and future directions in the field of DR to help the research community overcome research problems.Entities:
Keywords: Deep learning; Diabetic retinopathy; Interpretability; Review
Mesh:
Year: 2022 PMID: 35083634 PMCID: PMC8791699 DOI: 10.1007/s11517-021-02487-8
Source DB: PubMed Journal: Med Biol Eng Comput ISSN: 0140-0118 Impact factor: 3.079
Fig. 1Overview of selection criteria of literature
Fig. 2Overview of deep learning pipeline in DR-related tasks
Comparison table of recently published DR-related papers that adopt interpretability techniques
| Author (year) | Image pre-processing | CNN architecture | Interpretability techniques | |
|---|---|---|---|---|
| Geometric perturbation | Photometric perturbation | |||
| Kermany et al. [ | ✗ | ✗ | Inception-v3 | Occlusion |
| Grassmann et al. [ | ✓ | ✓ | Ensemble Network | Occlusion |
| Kumar et al. [ | ✓ | ✗ | CNN | CAM |
| Tu et al. [ | ✗ | ✗ | CNN | CAM |
| Wang and Yang [ | ✓ | ✗ | Net-4, Net-5 | CAM |
| Gargeya and Leng [ | ✓ | ✗ | CNN | CAM |
| Gondal et al. [ | ✓ | ✓ | o_O network (Antony and Brüggemann 2015) | CAM |
| Jiang et al. [ | ✓ | ✓ | RestNet50 | Grad-CAM |
| Pratt et al. [ | ✗ | ✗ | DenseNet-121 | Sensitivity analysis, CAM |
| Sayres et al. [ | ✓ | ✗ | Inception-v4 | Integrated gradient |
| Quellec et al. [ | ✓ | ✓ | o_O network (Antony and Brüggemann 2015), AlexNet | Layer-wise relevance propagation |
| de La Torre et al. [ | ✓ | ✗ | CNN | Layer-wise relevance propagation |
Fig. 3An illustration of the class activation map [29]
The limitations of interpretability techniques adopted in recently published DR-related papers
| Author (Year) | Limitation |
|---|---|
| Kermany et al. [ | ■ Diabetic macular edema and choroidal neovascularization did not highlight a clear point of interest ■ Requires additional mode training with occluded fundus image to monitor the classification score of a particular class |
| Grassmann et al. [ | ■ The accuracy of identifying the disease can be improved by including additional disease features ■ Requires additional mode training with occluded fundus image to monitor the classification score of a particular class |
| Kumar et al. [ | ■ Highlight on the neovascularization of the optic disc is absent ■ Visualization (CAM) on lesions areas may not be accurate as there is no pixel-level ground truth presented |
| Tu et al. [ | ■ Minority of the important lesion areas are highlighted as low impact after lesions regularization |
| Wang and Yang [ | ■ Fails to highlight the correct lesions that correspond to a class |
| Gargeya and Leng [ | ■ Visualization (CAM) on lesions areas may not be accurate as there is no pixel-level ground truth presented |
| Gondal et al. [ | ■ Lesion, specifically red small dots did not detect and highlight accurately |
| Jiang et al. [ | ■ Visualization (CAM) on lesions areas may not be accurate as there is no pixel-level ground truth presented |
| Pratt et al. [ | ■ Visualization (CAM) on lesions areas may not be accurate as there is no pixel-level ground truth presented |
| Sayres et al. [ | ■ Visualizing lesions in misclassified cases may cause over-diagnosis |
| Quellec et al. [ | ■ Poor quality of visualization of lesions (hemorrhages and microaneurysms) in AlexNet |
| de La Torre et al. [ | ■ Noisy visualization of lesions |
The summary of qualitative and quantitative metrics used for network’s interpretability
| Author (year) | Metrics | |
|---|---|---|
| Qualitative visual explanation (heat maps) | Quantitative evaluation | |
| Kermany et al. [ | ✓ | ✓ |
| Grassmann et al. [ | ✓ | ✓ |
| Kumar et al. [ | ✓ | ✗ |
| Tu et al. [ | ✓ | ✓ |
| Wang and Yang [ | ✓ | ✗ |
| Gargeya and Leng [ | ✓ | ✗ |
| Gondal et al. [ | ✓ | ✓ |
| Jiang et al. [ | ✓ | ✗ |
| Pratt et al. [ | ✓ | ✗ |
| Sayres et al. [ | ✓ | ✗ |
| Quellec et al. [ | ✓ | ✓ |
| de La Torre et al. [ | ✓ | ✓ |