| Literature DB >> 36249690 |
Sophia Y Wang1, Benjamin Tseng1, Tina Hernandez-Boussard2.
Abstract
Purpose: Advances in artificial intelligence have produced a few predictive models in glaucoma, including a logistic regression model predicting glaucoma progression to surgery. However, uncertainty exists regarding how to integrate the wealth of information in free-text clinical notes. The purpose of this study was to predict glaucoma progression requiring surgery using deep learning (DL) approaches on data from electronic health records (EHRs), including features from structured clinical data and from natural language processing of clinical free-text notes. Design: Development of DL predictive model in an observational cohort. Participants: Adult patients with glaucoma at a single center treated from 2008 through 2020.Entities:
Keywords: AUC, area under the receiver operating characteristic curve; Artificial Intelligence; DL, deep learning; Deep Learning; EHR, electronic health record; Glaucoma; IOP, intraocular pressure; Informatics
Year: 2022 PMID: 36249690 PMCID: PMC9559076 DOI: 10.1016/j.xops.2022.100127
Source DB: PubMed Journal: Ophthalmol Sci ISSN: 2666-9145
Figure 1Deep learning model architecture relying on only free-text clinical notes as inputs to predict glaucoma progression to surgery. Architecture for free-text notes based on multiple 1-dimensional convolutions with different filter widths. L2 indicates the type of regularization used. EHR = electronic health record; FC = fully connected layer; ReLU = rectified linear unit activation function.
Figure 2Deep learning model architecture using both free-text clinical notes and structured electronic health records (EHRs) data as inputs to predict glaucoma progression to surgery. Architecture that combines free-text notes and structured data into 1 predictive model. L2 indicates the type of regularization used. FC = fully connected layer; ReLU = rectified linear unit activation function.
Population Characteristics
| Characteristic | Total (n = 4512) | No Surgery (n = 3764) | Progressed to Surgery (n = 748) |
|---|---|---|---|
| Age (yrs) | 65.0 ± 17.9 | 65.0 ± 18.1 | 64.8 ± 17.0 |
| IOP (mmHg) | |||
| Right eye | 18.3 ± 12.3 | 18.0 ± 6.2 | 20.1 ± 27.7 |
| Left eye | 18.8 ± 19.1 | 18.3 ± 6.5 | 21.8 ± 45.8 |
| VA (logMAR) | |||
| Right eye | 0.39 ± 0.74 | 0.39 ± 0.74 | 0.43 ± 0.76 |
| Left eye | 0.43 ± 0.78 | 0.43 ± 0.79 | 0.43 ± 0.76 |
| Female sex | 2270 (50.3) | 1920 (51.0) | 350 (46.8) |
| Race | |||
| White | 1892 (41.9) | 1616 (42.9) | 276 (36.9) |
| Asian/Pacific Islander | 1225 (27.1) | 992 (26.4) | 233 (31.1) |
| Other/Native American | 991 (22.0) | 812 (21.6) | 179 (23.9) |
| Black | 216 (4.8) | 168 (4.5) | 48 (6.4) |
| Unknown | 188 (4.2) | 176 (4.7) | 12 (1.6) |
| Ethnicity | |||
| Non-Hispanic | 3791 (84.0) | 3159 (83.9) | 632 (84.5) |
| Hispanic/Latino | 566 (12.5) | 460 (12.2) | 106 (14.2) |
| Unknown | 155 (3.4) | 145 (3.9) | 10 (1.3) |
| Common glaucoma medication use | |||
| Latanoprost | 1570 (34.8) | 1279 (34.0) | 291 (38.9) |
| Bimatoprost | 413 (9.2) | 311 (8.3) | 102 (13.6) |
| Timolol | 1709 (37.9) | 1318 (35.0) | 391 (52.3) |
| Dorzolamide | 928 (20.6) | 678 (18.0) | 250 (33.4) |
| Brimonidine | 1212 (26.9) | 891 (23.7) | 321 (42.9) |
| Acetazolamide | 253 (5.6) | 178 (4.7) | 75 (10.0) |
IOP = intraocular pressure; logMAR = logarithm of the minimum angle of resolution; VA = visual acuity.
Data are presented as mean±standard deviation or no. (%).
Numeric variables were standardized to a mean of 0 and variance of 1 before input into model, but reported here conventionally for ease of interpretation.
Categorical variables were separated into a series of Boolean dummy variables before input into model.
Figure 3Graphs showing (A) receiver operating characteristic curves and (B) precision recall curves for the 3 different types of models developed to predict glaucoma progression to surgery based on electronic health records data: models that used structured data only, text data only, or a combination of both. The performance of an ophthalmologist reviewing the health records of these patients and making a clinical prediction is also shown.
Performance Metrics for Models and Clinical Predictions
| Variable | Proportion Predicted to Progress to Surgery | F1 | Sensitivity (Recall) | Specificity | Positive Predictive Value (Precision) | Negative Predictive Value | Accuracy | Probability Threshold |
|---|---|---|---|---|---|---|---|---|
| Clinical predictions | 0.13 | 0.29 | 0.25 | 0.85 | — | |||
| Structured model | 0.51 | 0.34 | 0.69 | 0.53 | 0.23 | 0.89 | 0.56 | 0.15 |
| Text-only model | 0.28 | 0.56 | 0.77 | 0.33 | 0.90 | 0.74 | 0.20 | |
| Combined model | 0.49 | 0.40 | 0.57 | 0.27 | 0.60 | 0.15 |
– = not applicable.
Boldface indicates best value across all models.
Figure 4Graph showing the 25 most important features for the structured model predictions. The mean local interpretable model-agnostic explanation (LIME) coefficients of individual structured input features were averaged to produce an overall importance across the entire test set for each feature. Bars in red show features important for predicting that the patient would undergo surgery, whereas bars in green show features important for predicting that the patient would not undergo surgery. ICD = International Classification of Diseases; IOP = intraocular pressure.