| Literature DB >> 34901058 |
Yueye Wang1, Danli Shi1, Zachary Tan2, Yong Niu3, Yu Jiang1, Ruilin Xiong1, Guankai Peng4, Mingguang He1,2,5,6.
Abstract
Purpose: To assess the accuracy and efficacy of a semi-automated deep learning algorithm (DLA) assisted approach to detect vision-threatening diabetic retinopathy (DR).Entities:
Keywords: artificial intelligence; cost-saving analysis; deep learning; diabetic retinopathy; screening
Year: 2021 PMID: 34901058 PMCID: PMC8656222 DOI: 10.3389/fmed.2021.740987
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Characteristics of baseline participants with fundus images.
|
|
|
|---|---|
| No. of participants | 4,900 |
| Age (mean ± sd, yrs) | 59.1 ± 8.78 |
| Male ( | 2,846 (58.5%) |
| Smoke ( | 823 (16.9%) |
| Alcohol consumption ( | 2,545 (52.2%) |
| BMI (mean ± sd, Kg/m2) | 24 ± 3.00 |
| SBP (mmHg) | 129 ± 18.0 |
| DBP (mmHg) | 75.0 ± 11.3 |
| Fast blood glucose (mmol/L) | 5.60 ± 1.17 |
| Triglyceride (mmol/L) | 1.75 ± 1.43 |
| Cholesterol (mmol/L) | 5.45 ± 0.95 |
| Diabetes ( | 591 (12.0%) |
| Diabetes duration of diabetic patients | 6.44 ± 6.01 |
BMI, body mass index; SBP, systolic blood pressure; DBP, diastolic blood pressure.
Figure 1Grading workflow of semi-automated DLA-assisted detection of referable DR. DLA, deep learning algorithm; DR, diabetic retinopathy.
Confusion matrix of DLA grading results for baseline images.
|
|
| |||
|---|---|---|---|---|
|
|
|
| ||
| DLA grading | Positive | (TP) | (FP) | 480 |
| Negative | (FN) | (TN) | 2,124 | |
| Total | 198 | 2,124 | 2,604 | |
DLA, deep learning algorithm; TP, true-positive; FP, false-positive; FN, false-negative; TN, true-negative.
Figure 2ROC curve of DLA grading in analyzing baseline images. The blue curve represents the model's trade-off, with the black dot marking the threshold point with an optimal cut-off value. This threshold point yields an optimal cut-off probability for having referable DR of 0.485, with a specificity and a sensitivity of 0.879 and 0.970, respectively. ROC, receiver operating characteristic; AUC, area under the receiver operating characteristic curve; DLA, deep learning algorithm.
Features and numbers of DLA-classified false-positive and false-negative referable DR cases.
|
|
| |
|---|---|---|
| False-positive images | ||
| Myopic retinopathy | 102 | 35.29% |
| Normal fundus with artifacts | 80 | 27.68% |
| Opaque refracting media | 32 | 11.07% |
| Age-related macular degeneration | 28 | 9.69% |
| Retinal pigment changes | 17 | 5.88% |
| Retinal atrophy | 9 | 3.11% |
| Retinal vessel occlusion | 8 | 2.77% |
| Background DR | 2 | 0.69% |
| Others | 11 | 3.81% |
| Total | 289 | 100.00% |
| False-negative images | ||
| Intraretinal microvascular abnormalities | 3 | 42.86% |
| Blurred peripheral retina exudation | 2 | 28.57% |
| Diabetic macular exudation | 1 | 14.29% |
| Questionable new vessels | 1 | 14.29% |
| Total | 7 | 100.00% |
DLA, deep learning algorithm; DR, diabetic retinopathy.
Figure 3Representative sample of typical images classified by the DLA. (A) Represents a true-positive image, (B–D) represent typical false-positive images, (E) represents a true-negative image, (F–H) represent typical false-negative images. (A) Pre-proliferative DR (R2) with microaneurysms, multiple blot hemorrhages, hard exudates; (B) myopic retinopathy; (E) normal fundus with artifacts; (D) opaque refracting media; (A,E) normal fundus; (F) intra-retinal microvascular abnormalities; (G) questionable new vessels; (H) blurred peripheral retina exudates. DLA, deep learning algorithm.
DLA grading results across the baseline and 2011-2017 follow-up cohorts.
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| No. of participants | 4,900 | 3,505 | 3,325 | 3,198 | 3,104 | 3,038 | 2.873 | 2,579 | - |
| No. of included images | 33,115 | 13,927 | 13,590 | 13,089 | 12,929 | 12,303 | 11,749 | 10,776 | 88,363 |
| Positive images ( | 480 | 186 | 185 | 169 | 182 | 199 | 196 | 150 | 1267 |
| Negative images ( | 31,898 | 13,363 | 12,992 | 12,378 | 11,834 | 11,316 | 10,998 | 10,094 | 82,975 |
| Ungradable images ( | 737 | 378 | 413 | 542 | 913 | 788 | 555 | 532 | 4121 |
| No. of images graded by human | 2,604 | 1,628 | 1,419 | 1,355 | 1,333 | 1,179 | 1,099 | 962 | 8,975 |
| Positive images by human grading | 198 | 57 | 62 | 55 | 80 | 76 | 91 | 71 | 492 |
| AUC | 0.953 | 0.870 | 0.910 | 0.919 | 0.939 | 0.921 | 0.937 | 0.899 | 0.914 |
| Sensitivity | 0.970 | 0.903 | 0.855 | 0.873 | 0.887 | 0.816 | 0.915 | 0.718 | 0.852 |
| Specificity | 0.879 | 0.741 | 0.883 | 0.835 | 0.867 | 0.900 | 0.824 | 0.918 | 0.853 |
| Accuracy of the DLA | 0.886 | 0.921 | 0.913 | 0.916 | 0.923 | 0.896 | 0.904 | 0.918 | 0.913 |
DLA, deep learning algorithm; DR, diabetic retinopathy; AUC, area under the receiver operating characteristic curve.
Comparison of time and cost for grading follow-up images between different grading procedures.
|
|
|
|
| |
|---|---|---|---|---|
|
|
| |||
| Time for grading (h) | 55 | 543 | 84.8 | 139 |
| Mean time of grading per image (min) | 0.368 | 0.368 | 0.058 | 0.094 |
| Total cost of grading ($) | 1,544 | 15,202 | – | 1,544 |
| Mean cost of grading per image ($) | 0.172 | 0.172 | – | 0.017 |
DLA, deep learning algorithm.