| Literature DB >> 35169235 |
Hasan Ul Banna1, Ahmed Zanabli1, Brian McMillan1, Maria Lehmann1, Sumeet Gupta1, Michael Gerbo1, Joel Palko2.
Abstract
The purpose of this study was to evaluate the performance of machine learning algorithms to predict trabeculectomy surgical outcomes. Preoperative systemic, demographic and ocular data from consecutive trabeculectomy surgeries from a single academic institution between January 2014 and December 2018 were incorporated into models using random forest, support vector machine, artificial neural networks and multivariable logistic regression. Mean area under the receiver operating characteristic curve (AUC) and accuracy were used to evaluate the discrimination of each model to predict complete success of trabeculectomy surgery at 1 year. The top performing model was optimized using recursive feature selection and hyperparameter tuning. Calibration and net benefit of the final models were assessed. Among the 230 trabeculectomy surgeries performed on 184 patients, 104 (45.2%) were classified as complete success. Random forest was found to be the top performing model with an accuracy of 0.68 and AUC of 0.74 using 5-fold cross-validation to evaluate the final optimized model. These results provide evidence that machine learning models offer value in predicting trabeculectomy outcomes in patients with refractory glaucoma.Entities:
Mesh:
Year: 2022 PMID: 35169235 PMCID: PMC8847459 DOI: 10.1038/s41598-022-06438-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Pairwise pearson correlation map of features. The feature number corresponds to its number as presented in Tables 1, 2 and 3.
Univariate analysis of demographic features recorded in the electronic health record system.
| Features | Surgical failure (n = 126) | Surgical success (n = 104) | |
|---|---|---|---|
| (1) Age (Mean, SD) | 69.24 (12.19) | 68.08 (12.55) | 0.479 |
| (2) Gender | 0.678 | ||
| Male (n, %) | 76 (60.3) | 59 (56.7) | |
| Female (n, %) | 50 (39.7) | 45 (43.3) | |
| (3) Race | 0.066 | ||
| White (n, %) | 110 (87.3) | 99 (95.2) | |
| Black (n, %) | 16 (12.7) | 5 (4.8) |
n = number; SD = standard deviation.
The threshold for statistical significance was P .
Univariate analysis of systemic features recorded in the electronic health record system.
| Features | Surgical failure ( | Surgical success ( | |
|---|---|---|---|
| (4) Statins (n, %) | 67 (53.2) | 54 (51.9) | 0.955 |
| (5) Angiotensin II receptor blockers (ARBs) (n, %) | 22 (17.5) | 27 (26.0) | 0.160 |
| (6) Inhaled corticosteriods (n, %) | 10 (7.9) | 15 (14.4) | 0.174 |
| (7) Nonsteroidal immunosuppressants (n, %) | 10 (7.9) | 6 (5.8) | 0.702 |
| (8) Asthma (n, %) | 13 (10.3) | 9 (8.7) | 0.840 |
| (9) Chronic obstructive pulmonary disease (COPD) (n, %) | 8 (6.3) | 14 (13.5) | 0.110 |
| (10) Congestive heart failure (CHF) (n, %) | 9 (7.1) | 4 (3.8) | 0.429 |
| (11) Myocardial infarction (MI) (n, %) | 17 (13.5) | 5 (4.8) | 0.045 |
| (12) Cerebrovascular accident (CVA) (n, %) | 4 (3.2) | 7 (6.7) | 0.343 |
| (13) Hyperlipidemia (HLD) (n, %) | 66 (52.4) | 62 (59.6) | 0.334 |
| (14) Hypertension (HTN) (n, %) | 91 (72.2) | 76 (73.1) | 1.000 |
| (15) Obstructive sleep apnea (OSA) (n, %) | 13 (10.3) | 8 (7.7) | 0.647 |
| (16) Body Mass Index (BMI) (Mean, SD) | 29.1 (6.8) | 28.9 (6.8) | 0.858 |
| (17) Smoking history (n, %) | 52 (41.3) | 43 (41.3) | 1.000 |
| (18) Active smoker (n, %) | 13 (10.3) | 12 (11.5) | 0.934 |
n = number; SD = standard deviation.
The threshold for statistical significance was P .
Univariate analysis of ocular features recorded in the electronic health record system.
| Features | Surgical failure ( | Surgical success ( | |
|---|---|---|---|
| (19) History of angle surgery (n, %) | 11 (8.7) | 9 (8.7) | 1.000 |
| (20) History of selective laser trabeculoplasty (SLT) (n, %) | 44 (34.9) | 33 (31.7) | 0.712 |
| (21) History of previous trabeculectomy (n, %) | 5 (4.0) | 5 (4.8) | 1.000 |
| (22) Pseudophakia (n, %) | 43 (34.1) | 36 (34.6) | 1.000 |
| (23) Diabetes mellitus (DM) (n, %) | 36 (28.6) | 35 (33.7) | 0.492 |
| (24) Study eye | 1.000 | ||
| Left (n, %) | 63 (50.0) | 52 (50.0) | |
| Right (n, %) | 63 (50.0) | 52 (50.0) | |
| (25) (0, 1) (n, %) | 7 (5.6) | 10 (9.6) | 0.359 |
| (26) (2, 3) (n, %) | 82 (65.1) | 71 (68.3) | 0.712 |
| (27) (4, 5) (n, %) | 37 (29.4) | 23 (22.1) | 0.273 |
| (28) Topical beta blocker (n, %) | 92 (73.0) | 74 (71.2) | 0.868 |
| (29) Topical prostaglandin analogue (PGA) (n, %) | 103 (81.7) | 73 (70.2) | 0.057 |
| (30) Topical alpha-agonist (n, %) | 62 (49.2) | 53 (51.0) | 0.895 |
| (31) Topical carbonic anhydrase inhibitor (CAI) (n, %) | 67 (53.2) | 51 (49.0) | 0.623 |
| (32) Oral CAI (n, %) | 17 (13.5) | 10 (9.6) | 0.482 |
| (33) Angle anatomy | 0.647 | ||
| Open angle glaucoma (OAG) (n, %) | 113 (89.7) | 96 (92.3) | |
| Angle closure glaucoma (ACG) (n, %) | 13 (10.3) | 8 (7.7) | |
| (34) Pre-operative visual acuity (VA) (Mean, SD) | 0.49 (0.6) | 0.46 (0.7) | 0.720 |
| (35) < 18 mmHg (n, %) | 42 (33.3) | 30 (28.8) | 0.557 |
| (36) | 40 (31.7) | 34 (32.7) | 0.991 |
| (37) | 44 (34.9) | 40 (38.5) | 0.676 |
| (38) Surgery-trabeculectomy with cataract extraction (n, %) | 21 (16.7) | 13 (12.5) | 0.484 |
| (39) Central corneal thickness (CCT) (Mean, SD) | 554.9 (48.3) | 548.8 (45.2) | 0.326 |
.
The threshold for statistical significance was P .
Comparison of predictive models trained using the the demographic and ocular (DO) dataset.
| Predictive model | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|
| Random forest | 0.64 | 0.52 | 0.77 | 0.64 |
| Logistic regression | 0.54 | 0.55 | 0.53 | 0.50 |
| Artificial neural network | 0.56 | 0.60 | 0.51 | 0.47 |
| Support vector machine | 0.59 | 0.57 | 0.62 | 0.57 |
AUC = area under the receiver operating characteristic curve.
Comparison of predictive models trained using the systemic, demographic and ocular (SDO) dataset.
| Predictive model | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|
| Random forest | 0.65 | 0.44 | 0.86 | 0.68 |
| Logistic regression | 0.59 | 0.58 | 0.57 | 0.55 |
| Artificial neural network | 0.53 | 0.49 | 0.59 | 0.51 |
| Support vector machine | 0.62 | 0.48 | 0.76 | 0.64 |
AUC = area under the receiver operating characteristic curve.
Figure 2Average receiver operating characteristic curves for the 4 predictive models trained on (left) demographic and ocular (DO) dataset and (right) systemic, demographic and ocular (SDO) dataset.
Relative contribution of various features in the multivariate logistic regression model predicting outcomes of trabeculectomy surgical intervention.
| Features | Odds ratio (95% confidence interval) | |
|---|---|---|
| Male gender | 0.31 (0.10, 0.79) | 0.023 |
| Topical prostaglandin analogue (PGA) | 0.52 (0.28, 0.97) | 0.041 |
| Myocardial infarction (MI) | 0.32 (0.10, 0.85) | 0.032 |
| Systemic Meds: Statins | 0.74 (0.54, 0.99) | 0.045 |
| White race | 2.88 (1.08, 9.06) | 0.046 |
The threshold for statistical significance was P .
Figure 3Results of recursive feature elimination algorithm applied to the random forest model using the (left) demographic and ocular (DO) dataset and (right) systemic, demographic and ocular (SDO) dataset.
Comparison of the optimized random forest models trained on the demographic and ocular (DO) features and systemic, demographic and ocular (SDO) features.
| Case study | Accuracy | Sensitivity | Specificity | AUC | PPV | NPV |
|---|---|---|---|---|---|---|
| DO features | 0.67 | 0.49 | 0.86 | 0.68 | 0.71 | 0.64 |
| SDO features | 0.68 | 0.60 | 0.76 | 0.74 | 0.69 | 0.65 |
AUC = area under the receiver operating characteristic curve; PPV = positive predictive value; NPV = negative predictive value.
Figure 4Calibration curves for the optimized random forest models using the (left) demographic and ocular (DO) and (right) systemic, demographic and ocular (SDO) datasets. The flexible curve with pointwise confidence intervals (gray area) was based on local regression (loess). The bottom of the graph shows histograms of the predicted risks for eyes with complete success (1) and failure (0) at 1 year.
Figure 5Decision curve analysis (DCA) for the optimized random forest models. The model trained on the SDO dataset trended towards greater net benefit compared to the DO trained model. Both models provided a greater net benefit compared to the decision of performing surgery on all or no patients across a wide range of threshold probabilities.
Figure 6Importance of top features from the systemic, demographic and ocular (SDO) dataset based on (left) MDA (right) MDI.
Figure 7Flow chart of the presented method.