| Literature DB >> 32832207 |
Adrian Galdran1,2, Jihed Chelbi3, Riadh Kobi3, José Dolz1, Hervé Lombaert1, Ismail Ben Ayed1, Hadi Chakor3.
Abstract
Purpose: Introducing a new technique to improve deep learning (DL) models designed for automatic grading of diabetic retinopathy (DR) from retinal fundus images by enhancing predictions' consistency.Entities:
Keywords: deep learning; diabetic retinopathy grading; label smoothing; retinal image analysis
Mesh:
Year: 2020 PMID: 32832207 PMCID: PMC7414697 DOI: 10.1167/tvst.9.2.34
Source DB: PubMed Journal: Transl Vis Sci Technol ISSN: 2164-2591 Impact factor: 3.283
Data Set Summary
| No. of Images (Unique Individuals) | 46,865 (27,361) | |||
|---|---|---|---|---|
| Age (Mean ± SD) | 59.6 ± 14 | |||
| Female/Male | 17,658/9233 | |||
| Characteristic | Total, No. (%) | Training, No. | Validation, No. | Test, No. |
| No DR | 31,447 (67.1) | 23,585 | 3,145 | 4,717 |
| Mild DR | 1264 (2.7) | 948 | 126 | 190 |
| Moderate DR | 6822 (14.6) | 5117 | 682 | 1,023 |
| Severe DR | 230 (0.5) | 172 | 23 | 35 |
| Proliferative DR | 683 (1.5) | 512 | 68 | 103 |
| Ungradability | 6419 (13.7) | — | — | — |
Figure 1.Schematic representation of our proposed label representation.
Performance Comparison for the ResNet50 CNN in Terms of Mean Differences in Quadratic-Weighted κ and Other Metrics of Interest, Obtained from 1000 Bootstrap Iterations
| Quad-κ | Average AUROC | Weighted F1 | ||||||
|---|---|---|---|---|---|---|---|---|
| N-ULS |
| N-ULS |
| N-ULS |
| |||
| CE | 73.17 |
| CE | 91.36 | Δ = | CE | 85.35 |
|
| LS | 71.26 |
| LS | 90.51 |
| LS | 85.10 |
|
| Weighted Precision | Weighted Recall | MCC | ||||||
| N-ULS |
| N-ULS |
| N-ULS |
| |||
| CE | 86.48 | Δ = | CE | 84.44 |
| CE | 59.71 |
|
| LS | 85.21 |
| LS | 87.03 |
| LS | 60.71 |
|
Statistically significant improvements are marked bold.
Performance Comparison for tde ResNet101 CNN in Terms of Mean Differences in Quadratic-Weighted κ and Other Metrics of Interest, Obtained from 1000 Bootstrap Iterations
| Quad-κ | Average AUROC | Weighted F1 | ||||||
|---|---|---|---|---|---|---|---|---|
| N-ULS |
| N-ULS | 91.02 | N-ULS |
| |||
| CE | 71.98 |
| CE | 91.48 | Δ = –0.46 ( | CE | 84.76 |
|
| LS | 74.52 |
| LS | 91.26 | Δ = –0.24 ( | LS | 86.78 |
|
| Weighted Precision | Weighted Recall | MCC | ||||||
| N-ULS |
| N-ULS |
| N-ULS |
| |||
| CE | 86.69 | Δ = | CE | 83.64 |
| CE | 60.90 |
|
| LS | 86.39 |
| LS | 87.97 |
| LS | 64.37 |
|
Statistically significant improvements are marked bold.
Figure 2.Bootstrapped ROC curve resulting from training a Resnet50 (left) and a Resnet101 (right) CNN with CE/LS/N-ULS.