| Literature DB >> 35528131 |
Veena Mayya1,2, Sowmya Kamath S1, Uma Kulkarni3, Divyalakshmi Kaiyoor Surya3, U Rajendra Acharya4,5,6.
Abstract
Chronic Ocular Diseases (COD) such as myopia, diabetic retinopathy, age-related macular degeneration, glaucoma, and cataract can affect the eye and may even lead to severe vision impairment or blindness. According to a recent World Health Organization (WHO) report on vision, at least 2.2 billion individuals worldwide suffer from vision impairment. Often, overt signs indicative of COD do not manifest until the disease has progressed to an advanced stage. However, if COD is detected early, vision impairment can be avoided by early intervention and cost-effective treatment. Ophthalmologists are trained to detect COD by examining certain minute changes in the retina, such as microaneurysms, macular edema, hemorrhages, and alterations in the blood vessels. The range of eye conditions is diverse, and each of these conditions requires a unique patient-specific treatment. Convolutional neural networks (CNNs) have demonstrated significant potential in multi-disciplinary fields, including the detection of a variety of eye diseases. In this study, we combined several preprocessing approaches with convolutional neural networks to accurately detect COD in eye fundus images. To the best of our knowledge, this is the first work that provides a qualitative analysis of preprocessing approaches for COD classification using CNN models. Experimental results demonstrate that CNNs trained on the region of interest segmented images outperform the models trained on the original input images by a substantial margin. Additionally, an ensemble of three preprocessing techniques outperformed other state-of-the-art approaches by 30% and 3%, in terms of Kappa and F 1 scores, respectively. The developed prototype has been extensively tested and can be evaluated on more comprehensive COD datasets for deployment in the clinical setup.Entities:
Keywords: Clinical decision support systems; Convolutional neural networks; Explainability; Fundus imaging; Healthcare informatics
Year: 2022 PMID: 35528131 PMCID: PMC9059700 DOI: 10.1007/s10489-022-03490-8
Source DB: PubMed Journal: Appl Intell (Dordr) ISSN: 0924-669X Impact factor: 5.019
Fig. 1Overall methodology employed in the proposed study for COD classification
Fig. 2Results of preprocessing on input sample images. A)Original, B)Otsu’s thresholding segmentation, C)Proposed Hough transform segmentation, D)Green channel, E)Green channel CLAHE, F)Green channel Gaussian convolution, G)RGB CLAHE, H)RGB Gaussian convolution, I)MSR, and J)MIRNET images
Fig. 3Results of segmentation and inpainting of vessel structure on sample input images. A)Original (normal, DR, glaucoma, cataract and AMD), B)Vessel segmentation, C)Vessel inpainting images
Fig. 4Sample of generated fundus images using StyleGAN2 for the given ocular conditions. A)Normal, B)DR, C)Cataract, D)AMD, E)Myopia
Details of model training parameters
| Model | Parameters |
|---|---|
| SqueezeNet [ | 0.726600 |
| MobileNetv2 [ | 2.234120 |
| Inceptionv1 [ | 5.608104 |
| DenseNet121 [ | 6.962056 |
| EfficientNet-B3 [ | 10.708528 |
| ResNeXt50 [ | 22.996296 |
| ResNet50 [ | 23.524424 |
| Inceptionv3 [ | 25.128656 |
| EfficientNet-B7 [ | 63.807448 |
| WideResNet50 [ | 66.850632 |
| VGG16 [ | 134.29332 |
Details of ODIR training data
| Type of COD (Class) | Training images |
|---|---|
| Normal | 3,098 |
| DR | 1,801 |
| Others | 1,200 |
| Glaucoma | 326 |
| Cataract | 313 |
| AMD | 280 |
| Myopia | 268 |
| Hypertension | 193 |
Sample fundoscopy images from ODIR dataset
| Final label | Right & left eye images | |
|---|---|---|
| DR |
|
|
| AMD + DR |
|
|
| DR + Myopia + Others |
|
|
| DR + Cataract + Others |
|
|
Observed performance for state-of-the-art DL models on the testset
| With original image | With cropped image | |||||
|---|---|---|---|---|---|---|
| Kappa | AUC | Kappa | AUC | |||
| DenseNet [ | 0.4659 | 0.7888 | 0.8698 | 0.5195 | 0.8210 | 0.8804 |
| EfficientNetB3 [ | 0.4691 | 0.7922 | 0.8695 | 0.5090 | 0.8199 | 0.8803 |
| EfficientNetB7 [ | 0.4677 | 0.7265 | 0.872 | 0.5260 | 0.8317 | 0.8845 |
| Inceptionv1 [ | 0.4124 | 0.7746 | 0.8553 | 0.4733 | 0.8021 | 0.8718 |
| Inceptionv3 [ | 0.3131 | 0.7201 | 0.8365 | 0.3882 | 0.7671 | 0.8535 |
| MobileNet [ | 0.4664 | 0.7984 | 0.8703 | 0.4852 | 0.8191 | 0.8740 |
| ResNet50 [ | 0.4110 | 0.7602 | 0.8575 | 0.5022 | 0.8093 | 0.8782 |
| ResNeXt50 [ | 0.4733 | 0.7921 | 0.8717 | 0.5680 | 0.8606 | 0.8953 |
| WideResNet [ | 0.4222 | 0.7634 | 0.8618 | 0.4734 | 0.8343 | 0.8743 |
| SqueezeNet [ | 0.1347 | 0.6178 | 0.7838 | 0.1594 | 0.6412 | 0.7873 |
| VGG16 [ | 0.5092 | 0.7828 | 0.8790 | 0.5268 | 0.8248 | 0.8813 |
Results of ResNeXt50 (best performing model) with proposed preprocessing pipeline
| Preprocessing method | Observed performance | ||
|---|---|---|---|
| Kappa | AUC | ||
| Original image | 0.4733 | 0.7921 | 0.8717 |
| Cropped image | 0.5680 | 0.8606 | 0.8953 |
| Green channel | 0.5336 | 0.8186 | 0.8882 |
| Green channel+CLAHE | 0.5198 | 0.8189 | 0.8840 |
| Green channel+Gaussian | 0.4898 | 0.8082 | 0.8777 |
| RGB+CLAHE | 0.5308 | 0.8368 | 0.8860 |
| RGB+Gaussian | 0.5260 | 0.8206 | 0.8840 |
| Multiscale Retinex (MSR) | 0.5330 | 0.8418 | 0.8865 |
| MIRNET | 0.4438 | 0.8403 | 0.8550 |
| Vessel segmentation | 0.3097 | 0.7212 | 0.8325 |
| Vessel inpaint | 0.4865 | 0.8500 | 0.8765 |
Observations w.r.t top three performing models, when used with RoI cropped images
| DL model | RGB | Green | MSR |
|---|---|---|---|
| ResNeXt50 [ | 0.8953 | 0.8882 | 0.8865 |
| VGG16 [ | 0.8813 | 0.8498 | 0.8465 |
| EfficientNetB7 [ | 0.8845 | 0.8668 | 0.8605 |
Comparative performance of proposed augmentation and ensemble techniques
| Method | Observed performance | ||
|---|---|---|---|
| Kappa | AUC | ||
| No augmentation | 0.5246 | 0.8323 | 0.850 |
| Batch-level | 0.5680 | 0.8606 | 0.8953 |
| Condition-level | 0.4228 | 0.8534 | 0.8710 |
| Ensemble 1 | 0.5815 | 0.8532 | 0.9008 |
| Ensemble 2 | 0.6081 | 0.8806 | 0.9070 |
Comparative performance of proposed approaches against state-of-the-art techniques
| No. | Models | Dataset | Observed performance | ||
|---|---|---|---|---|---|
| Kappa | AUC | ||||
| 1 | ResNet-101 backbone [ | 1166 patients data (ODIR train set) | 0.6370 | 0.9300 | 0.9130 |
| 2 | ResNet-101 + Textual features [ | 1166 patients data (ODIR train set) | 0.6410 | 0.9380 | 0.9130 |
| 3 | Graph convolutional network [ | 996 images of 498 patients (ODIR) | 0.5765 | 0.7816 | 0.8966 |
| 4 | EfficientNet-B3 [ | ODIR offline challenge test set | 0.5200 | 0.7400 | 0.8900 |
| 5 | Shallow CNN [ | ODIR offline challenge test set | 0.3100 | 0.8050 | – |
| 6 | Two input VGG16 [ | ODIR offline challenge test set | – | 0.6888 | 0.8557 |
| 7 | VGG-16 [ | ODIR offline challenge test set | 0.4494 | 0.8881 | 0.8730 |
| 8 | Proposed pipeline (§ | ODIR offline challenge test set | 0.5680 | 0.8606 | 0.8953 |
| 9 | Proposed DL ensemble (§ | ODIR offline challenge test set | 0.5891 | 0.8610 | 0.9025 |
| 10 | Proposed preprocessing ensemble (§ | ODIR offline challenge test set | 0.6081 | 0.8806 | 0.9070 |
Fig. 5Visualization of Grad-CAM heatmap on the original input images. Columns i-iv show the original images, where as, columns v-viii are cropped versions. The annotated labels are A) Mild DR (D), epiretinal membrane (O); B) Mild DR (D), drusen (O); and C) Mild DR (D), glaucoma (G), vitreous degeneration (O)
Summary of preprocessing techniques for COD detection using fundus images
| Method | Observed execution time (in | Observations on COD fundus images |
|---|---|---|
| Individual channels | 0.0348 | Green channel best differentiates blood vessels, exudates, and haemorrhages and is often used to identify DR. Unlike the red and blue channels, this channel is neither under- nor over-illuminated. CNN trained only on green channels needs fewer training parameters. However, the green channel has less information on the optic disc, which is necessary for diagnosing other eye illnesses such as glaucoma. The red channel is the brightest, and it can distinguish the optic disc from other portions of the fundus image. Segmentation of the optic disc is primarily used to identify eye disorders such as glaucoma. However, it is more noisy, so it is not suitable for detecting other COD. The blue channel is the darkest component and has not been extensively studied for use in detecting COD. |
| CLAHE | 2.855 | It is a sharpening filter that increases the contrast of fundus images and is commonly used to detect DR and glaucoma. Enhances the low-contrast regions, especially the contrast enhancement of microaneurysms and small blood vessels. However, if the majority of the pixels in the fundus image are dark, an excessive enhancing effect may occur, distorting the image’s overall visibility. |
| Gaussian convolution | 2.259 | The Gaussian smoothed image reduces noise, and when subtracted, the fundus image is sharpened. This increases the contrast between blood vessels and the surrounding environment and is often employed in DR detection. However, minute features are obscured in brighter regions, such as the optic disc. Additionally, border areas for the brighter photos exhibit additional artefacts. |
| MSR | 17.667 | The difference between the input value (centre) and normalized surround or neighbourhood values determines MSR output. The MSR technique enhances images captured under a variety of nonlinear lighting conditions to the degree that a person would perceive them in real time. However, several parameters in this improvement procedure are image-dependent and must be modified accordingly. Additionally, the algorithm will introduce extra artefacts into the enhanced image for the regions with significant brightness changes. |
| MIRNET | 60896.423 | Full-resolution processing recovers the original image’s high-quality content from its degraded counterpart, while the complementary set of parallel branches gives enhanced contextual features. MIRNET establishes links between features both inside and across branches of varying sizes. The method of feature fusion enables dynamic adaptation of the receptive field without jeopardizing the original feature details. However, additional artefacts are seen in images with a high number of brighter lesions. |
| Vessel segmentation | 6056.545 | Segmentation of the vascular structure is commonly utilized to detect COD such as AMD, diabetic retinopathy, and glaucoma. However, segmented vessels often have poor contrast, particularly thin and tiny vessels. Identifying minute changes in vascular structure for the purpose of detecting COD is often challenging without patient demographic information. Other retinal structures (optic disc, macula, fovea, etc.) and lesions (microaneurysms, exudates, etc.) also contribute significantly to the detection of COD. |
| Vessel inpainting | 6133.436 | Blood vessel inpainting is a technique that includes inpainting segmented vessels with a fundus backdrop. It is primarily used to diagnose glaucoma by the localization and segmentation of the optic disc. However, the anatomy of the vasculature is critical in diagnosing other COD. |
Comparative evaluation of augmentation and preprocessing techniques on DDR testset
| Method | Observed performance | |||
|---|---|---|---|---|
| Kappa | AUC | Sensitivity | ||
| No augmentation | 0.5797 | 0.7898 | 0.8669 | 0.9276 |
| Batch-level augmentation | 0.5812 | 0.7906 | 0.8793 | 0.9170 |
| Condition-level augmentation | 0.6052 | 0.8026 | 0.8769 | 0.9091 |
| Ensemble 1 | 0.6196 | 0.8098 | 0.8749 | 0.9563 |
| Ensemble 2 | 0.6302 | 0.8151 | 0.8730 | 0.9570 |
| Patch-based lesion localization model [ | – | – | 0.8480 | 0.8910 |
Operational definitions used by domain experts for the testset image labeling
| Ocular disease | Operational definition |
|---|---|
| Cataract | Fundus image is hazy, may not permit or only permits a faint view of the disc, macula and the vascular arcades. |
| Diabetic retinopathy | Fundus image shows evidence of microaneurysms with one or more of the following: dot and blot hemorrhages, intraretinal microvascular abnormalities (IRMA), hard exudates, venous beading, neovascularization. |
| Hypertensive retinopathy | Fundus image shows evidence of arteriolar narrowing and arteriovenous crossing changes with any of the following changes: flame-shaped hemorrhages, soft exudates, hard exudates, optic disc edema. |
| Glaucoma | Fundus image shows cup:disc ratio of > 0.5 with nasalization of vessels. |
| AMD | Fundus image shows evidence of soft or hard drusen with pigmentary changes in the macula. |
| Myopia | Fundus image shows evidence of a large temporal or an annular crescent with chorioretinal degenerative changes. |
| Others | Fundus image shows other fundus lesions like medullated nerve fibers, macular hole, pigmentation, or any other lesion unrelated to the above conditions. |
| Normal | Fundus image shows a normal disc and macula, without any of the above possible diagnosis. |
Results of pilot study -illustrating its benefits to the ophthalmologists in their diagnosis
| Input images | Observed performance | ||
|---|---|---|---|
| Kappa | AUC | ||
| RoI cropped images | 0.2631 | 0.6382 | 0.8078 |
| Green channel | 0.2387 | 0.6291 | 0.7958 |