| Literature DB >> 29888061 |
Carson Lam1, Darvin Yi1, Margaret Guo2, Tony Lindsey1,3.
Abstract
Diabetic retinopathy is a leading cause of blindness among working-age adults. Early detection of this condition is critical for good prognosis. In this paper, we demonstrate the use of convolutional neural networks (CNNs) on color fundus images for the recognition task of diabetic retinopathy staging. Our network models achieved test metric performance comparable to baseline literature results, with validation sensitivity of 95%. We additionally explored multinomial classification models, and demonstrate that errors primarily occur in the misclassification of mild disease as normal due to the CNNs inability to detect subtle disease features. We discovered that preprocessing with contrast limited adaptive histogram equalization and ensuring dataset fidelity by expert verification of class labels improves recognition of subtle features. Transfer learning on pretrained GoogLeNet and AlexNet models from ImageNet improved peak test set accuracies to 74.5%, 68.8%, and 57.2% on 2-ary, 3-ary, and 4-ary classification models, respectively.Entities:
Year: 2018 PMID: 29888061 PMCID: PMC5961805
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.Representative retinal images of DR at various stages of the disease, as labeled: A- normal, B- end stage, C- early stage. Arrows in B point to pathological indications. White boxes in C enclose very small lesions that the CNNs have difficulty discerning.
Retinopathy grades in Messidor dataset
| Grade | Description | Nb Images |
|---|---|---|
| R0 | ( | 546 |
| R1 | (0 < | 153 |
| R2 | (5 < | 247 |
| R3 | ( | 254 |
N: number of MAs, HEs and neovessels (NV), respectively
Figure 2.Contrast Limited Adaptive Histogram Equalization enhances contrast and the detection of subtle features. Shown are fundoscopic illustrations before and after CLAHE application.
Figure 3.Sensitivity of a 3-ary (no DR, mild, and severe classes) GoogLeNet classifier before (black) and after (red) CLAHE application on the Messidor dataset.
Figure 4.Training Curve for model on the binary classified Kaggle data set of DR fundoscope images. Sensitivity of 95% and specificity of 96% was achieved.
Confusion matrix on test set of Kaggle dataset
| - | Pred R0 | Pred R1 | Pred R2 or R3 |
|---|---|---|---|
| True R0 | 149 | 1 | 2 |
| True R1 | 21 | 2 | 7 |
| True R2 or R3 | 1 | 15 | 202 |
Figure 5.Heat map on a representative DR image. Green: Regions that do not change the probability of an abnormal binary classification (neutral areas or unfamiliar areas); Orange: Regions that increase the probability of an abnormal binary classification (suspicious areas); Clear or light blue: Regions that decrease the probability of abnormal binary classification (normal areas).
Hyperparameter optimization of the Messidor dataset trained using transfer learning on a pretrained GoogLeNet model from ImageNet. 2-ary dataset classes were group C0:R0, R1 and C1:R2, R3. 3-ary dataset classes were C0: R0, C1:R1, and C2:R2, R3. 4-ary dataset classes were C0:R0, C1:R1, C2:R2, C3:R3. C represents the label within the CNN architecture, and R represents the label from the dataset.
| GoogLeNet Rapid Prototyping Results-Raw Images | |||||
|---|---|---|---|---|---|
| Model | Solver | Learning Rate | Policy | Validation Accuracy% | Test Set Accuracy% |
| 2-ary | SGD | 1e-3 | Step Down | 83.82 | 72.75 |
| 2-ary | NAG | 1e-3 | Step Down | 82.36 | 72.75 |
| 2-ary | Adam | 1e-4 | Step Down | 86.40 | 71.75 |
| 2-ary | AdaGrad | 1e-3 | Exponential Decay | 84.55 | 64.25 |
| 2-ary | RMSProp | 1e-4 | Sigmoid Decay | 79.04 | 64.25 |
| 3-ary | RMSProp | 1e-4 | Exponential Decay | 63.97 | 66.25 |
| 3-ary | SGD | 1e-3 | Step Down | 71.69 | 64.25 |
| 3-ary | Adam | 1e-4 | Step Down | 72.40 | 61.50 |
| 3-ary | NAG | 1e-3 | Step Down | 69.85 | 58.75 |
| 3-ary | AdaGrad | 1e-3 | Exponential Decay | 72.43 | 58.25 |
| 4-ary | Adam | 1e-4 | Step Down | 67.65 | 57.25 |
| 4-ary | SGD | 1e-3 | Step Down | 65.07 | 55.25 |
| 4-ary | AdaGrad | 1e-3 | Exponential Decay | 66.54 | 53.25 |
| 4-ary | NAG | 1e-3 | Step Down | 66.18 | 52.75 |
| 4-ary | RMSProp | 1e-4 | Step Down | 62.50 | 49.75 |
Hyperparameter optimization of preprocessed Messidor dataset trained using transfer learning on a pretrained GoogLeNet model from ImageNet. 2-ary dataset classes were group C0:R0, R1 and C1:R2, R3. 3-ary dataset classes were C0: R0, C1:R1, and C2:R2, R3. 4-ary dataset classes were C0:R0, C1:R1, C2:R2, C3:R3. C represents the label within the CNN architecture, and R represents the label from the dataset.
| GoogLeNet Rapid Prototyping Results-Data Augmentation, Contrast Filtering & Regularization | ||||||
|---|---|---|---|---|---|---|
| Model | Solver | Learning Rate | Policy | Drop Out[ | Validation Accuracy% | Test Set Accuracy% |
| 2-ary | Adam | 1e-4 | Step Down | (0.8,0.7,0.4) | 88.35 | 74.50 |
| 2-ary | SGD | 1e-3 | Step Down | (0.6,0.6,0.4) | 88.07 | 71.75 |
| 2-ary | NAG | 1e-3 | Step Down | (0.7,0.8,0.4) | 87.50 | 70.00 |
| 2-ary | RMSProp | 1e-4 | Exponential Decay | (0.7,0.7,0.4) | 85.80 | 69.50 |
| 2-ary | AdaGrad | 1e-3 | Exponential Decay | (0.7,0.7,0.5) | 87.22 | 66.25 |
| 3-ary | AdaGrad | 1e-3 | Exponential Decay | (0.7,0.7,0.5) | 63.28 | 68.75 |
| 3-ary | SGD | 1e-3 | Step Down | (0.6,0.6,0.4) | 65.63 | 67.00 |
| 3-ary | NAG | 1e-3 | Step Down | (0.7,0.8,0.4) | 67.71 | 64.75 |
| 3-ary | RMSProp | 1e-4 | Exponential Decay | (0.7,0.7,0.4) | 60.68 | 64.00 |
| 3-ary | Adam | 1e-4 | Step Down | (0.8,0.7,0.4) | 64.32 | 63.50 |
| 4-ary | SGD | 1e-3 | Step Down | (0.6,0.6,0.4) | 60.00 | 51.25 |
| 4-ary | Adam | 1e-4 | Step Down | (0.8,0.7,0.4) | 53.75 | 49.50 |
| 4-ary | NAG | 1e-3 | Step Down | (0.7,0.8,0.4) | 55.00 | 47.75 |
| 4-ary | AdaGrad | 1e-3 | Exponential Decay | (0.7,0.7,0.5) | 57.50 | 47.00 |
| 4-ary | RMSProp | 1e-4 | Exponential Decay | (0.7,0.7,0.4) | 54.75 | 44.25 |
1counting only bypass procedures with two or more Procedure site – direct attributes
Figure 6.Test set accuracies for 2-ary, 3-ary, and 4-ary classifiers for transfer learning models based on AlexNet and GoogLeNet. Preprocessed images indicates the presence of real-time data augmentation and histogram equalization.