| Literature DB >> 35595794 |
Omkar G Kaskar1, Elaine Wells-Gray2, David Fleischman3, Landon Grace4.
Abstract
Several artificial intelligence algorithms have been proposed to help diagnose glaucoma by analyzing the functional and/or structural changes in the eye. These algorithms require carefully curated datasets with access to ocular images. In the current study, we have modeled and evaluated classifiers to predict self-reported glaucoma using a single, easily obtained ocular feature (intraocular pressure (IOP)) and non-ocular features (age, gender, race, body mass index, systolic and diastolic blood pressure, and comorbidities). The classifiers were trained on publicly available data of 3015 subjects without a glaucoma diagnosis at the time of enrollment. 337 subjects subsequently self-reported a glaucoma diagnosis in a span of 1-12 years after enrollment. The classifiers were evaluated on the ability to identify these subjects by only using their features recorded at the time of enrollment. Support vector machine, logistic regression, and adaptive boosting performed similarly on the dataset with F1 scores of 0.31, 0.30, and 0.28, respectively. Logistic regression had the highest sensitivity at 60% with a specificity of 69%. Predictive classifiers using primarily non-ocular features have the potential to be used for identifying suspected glaucoma in non-eye care settings, including primary care. Further research into finding additional features that improve the performance of predictive classifiers is warranted.Entities:
Mesh:
Year: 2022 PMID: 35595794 PMCID: PMC9122936 DOI: 10.1038/s41598-022-12270-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Quantitative description of categorical features of subjects at the time of enrollment.
| Features | Categories | Total (N = 3015) | Glaucoma Count (%) | Non-glaucoma Count (%) |
|---|---|---|---|---|
| Gender | Male | 1353 | 167 (12.3%) | 1186 (87.7%) |
| Female | 1662 | 170 (10.2%) | 1492 (89.8%) | |
| Race | White | 2913 | 315 (10.8%) | 2598 (89.2%) |
| Black | 84 | 18 (21.4%) | 66 (78.6%) | |
| Hispanic | 9 | 1 (11.1%) | 8 (88.9%) | |
| Asian | 4 | 1 (25%) | 3 (75%) | |
| Other | 5 | 2 (40%) | 3 (60%) | |
| Diabetes | Positive | 239 | 32 (13.4%) | 207 (86.6%) |
| Negative | 2776 | 305 (11%) | 2471 (89%) | |
| Arthritis | Positive | 1354 | 157 (11.6%) | 1197 (88.4%) |
| Negative | 1661 | 180 (10.8%) | 1481 (89.2%) | |
| AMD* | Category 1 | 746 | 90 (12.1%) | 656 (87.9%) |
| Category 2 | 673 | 65 (9.7%) | 608 (90.3%) | |
| Category 3 | 1054 | 119 (11.3%) | 935 (88.7%) | |
| Category 4 | 542 | 63 (11.6%) | 479 (88.4%) |
*AMD category descriptions[33].
Category 1: A few small or no drusen.
Category 2: Many small drusen or a few medium-sized drusen in one or both eyes.
Category 3: Many medium-sized drusen or one or more large drusen in one or both eyes.
Category 4: Breakdown of light-sensitive cells and supporting tissue in the central retinal.
Area or abnormal and fragile blood vessels under the retina.
Statistical summary of the numerical features of the subjects at the time of enrollment.
| Feature | Glaucoma (Self-report at end of study, N = 337) | Non-glaucoma (N = 2678) | ||||||
|---|---|---|---|---|---|---|---|---|
| Mean | Standard deviation | Maximum | Minimum | Mean | Standard deviation | Maximum | Minimum | |
| Age | 70.3 | 5 | 81.6 | 56.3 | 69.4 | 5 | 81.7 | 55.8 |
| Systolic blood pressure | 138.6 | 18 | 200 | 100 | 137 | 18 | 220 | 70 |
| Diastolic Blood pressure | 79.2 | 9.7 | 120 | 50 | 78.5 | 9.5 | 120 | 42 |
| BMI | 27.9 | 4.8 | 45.6 | 18.2 | 27.4 | 4.8 | 58.2 | 8.9 |
| IOP (right eye) | 18.2 | 3.6 | 30 | 10 | 15.8 | 3.1 | 26 | 5 |
| IOP (left eye) | 18.3 | 3.7 | 30 | 10 | 15.9 | 3 | 30 | 4 |
Performance metrics reported as mean (standard deviation) over all the executions.
| Models (N = 25) | Sensitivity/ Recall | Specificity | F1 score | Accuracy | Area under precision-recall curve |
|---|---|---|---|---|---|
| Support Vector Machine | 0.52 (0.06) | 0.77 (0.03) | 0.31 (0.04) | 0.74 (0.03) | 0.29 (0.05) |
| Logistic Regression | 0.60 (0.07) | 0.69 (0.02) | 0.30 (0.03) | 0.68 (0.02) | 0.28 (0.05) |
| AdaBoost | 0.57 (0.11) | 0.69 (0.06) | 0.28 (0.03) | 0.68 (0.04) | 0.30 (0.07) |
| IOP greater than 21 mm Hg | 0.25 | 0.93 | 0.28 | 0.86 |
Figure 1The average Precision-Recall curves for all classifiers with respect to a dummy classifier. The area under the curve (AUC) reported as mean (standard deviation): Adaptive boosting (AdaBoost) – 0.30 (0.07), support vector machine – 0.29 (0.05), and logistic regression – 0.28 (0.05).
Figure 2Permutation feature importance applied to each classifier: (a) Logistic regression, (b) Support vector machine, and (c) Adaptive boosting (AdaBoost). Mean decrease in F1 score is shown for each feature: age, systolic and diastolic blood pressure, gender (male), body mass index (BMI), intraocular pressure (IOP) in the right eye (RE) and left eye (LE), age-related macular degeneration (AMD) category, race (black, Hispanic, Asian, and other), and presence of diabetes and arthritis.
Summary of artificial intelligence-based glaucoma risk prediction models that do not use visual fields and imaging data.
| Reference | Description | Features used | Performance |
|---|---|---|---|
| Baxter et al.[ | Predicting need for surgical intervention within 6 months for patients (N = 385) with open angle glaucoma | 48 features that can be broadly categorized into vital signs, body mass index, smoking status, comorbidities, hospitalization status, medications, and lab values | Logistic regression Accuracy: 62% Sensitivity: 78% Specificity: 50% |
| Mehta et al.[ | Predicting self-report of open angle glaucoma in a population (N = 1689) without a clinical diagnosis at the time of testing | Age, gender, ethnicity, body mass index, forced vital capacity, peak expiratory flow, heart rate, diastolic and systolic blood pressure, diabetes, recent nicotine and caffeine intake, intraocular pressure, corneal hysteresis, and corneal resistance factor | Extreme gradient boosting (XGBoost) Accuracy: 75% |
| Tielsch et al.[ | Predicting glaucoma in a normal population (N = 5308) | Age, race, intraocular pressure, family history of glaucoma, and diabetes | Logistic regression Predicted probability threshold ≥ 0.025 Sensitivity: 86% Specificity: 66% |
| Current study | Predicting self-report of glaucoma in a population without a clinical diagnosis at the time of testing | Age, gender, race, BMI, systolic and diastolic blood pressures, and comorbidities | Logistic regression, support vector machine, and adaptive boosting Accuracy: 68%–74% Sensitivity: 52%–57% Specificity: 69%–77% |