| Literature DB >> 35353148 |
Abstract
Purpose: We evaluated the use of massive transformer-based language models to predict glaucoma progression requiring surgery using ophthalmology clinical notes from electronic health records (EHRs).Entities:
Mesh:
Year: 2022 PMID: 35353148 PMCID: PMC8976929 DOI: 10.1167/tvst.11.3.37
Source DB: PubMed Journal: Transl Vis Sci Technol ISSN: 2164-2591 Impact factor: 3.283
Figure 1.Flow diagram of patient identification and cohort creation.
Population Characteristics
| Total | No Surgery | Progressed to | |
|---|---|---|---|
| ( | ( | Surgery ( | |
| Age (y), mean ± SD | 65.0 ± 17.9 | 65.0 ± 18.1 | 64.8 ± 17.0 |
| Right IOP, mean ± SD | 18.3 ± 12.3 | 18.0 ± 6.2 | 20.1 ± 27.7 |
| Left IOP, mean ± SD | 18.8 ± 19.1 | 18.3 ± 6.5 | 21.8 ± 45.8 |
| Right visual acuity (logMAR), mean ± SD | 0.39 ± 0.74 | 0.39 ± 0.74 | 0.43 ± 0.76 |
| Left visual acuity (logMAR), mean ± SD | 0.43 ± 0.78 | 0.43 ± 0.79 | 0.43 ± 0.76 |
|
|
|
| |
| Gender (female) | 2270 (50.3) | 1920 (51.0) | 350 (46.8) |
| Race | |||
| White | 1892 (41.9) | 1616 (42.9) | 276 (36.9) |
| Asian/Pacific Islander | 1225 (27.1) | 992 (26.4) | 233 (31.1) |
| Other/Native American | 991 (22.0) | 812 (21.6) | 179 (23.9) |
| Black | 216 (4.8) | 168 (4.5) | 48 (6.4) |
| Unknown | 188 (4.2) | 176 (4.7) | 12 (1.6) |
| Ethnicity | |||
| Non-Hispanic | 3791 (84.0) | 3159 (83.9) | 632 (84.5) |
| Hispanic/Latino | 566 (12.5) | 460 (12.2) | 106 (14.2) |
| Unknown | 155 (3.4) | 145 (3.9) | 10 (1.3) |
Performance Metrics for BERT Models for Predicting Glaucoma Progression to Surgery
| Sensitivity | Positive Predictive | Negative Predictive | |||||
|---|---|---|---|---|---|---|---|
| Model | F1 (%) | (Recall) | Specificity | Value (Precision) | Value | Accuracy | Threshold |
| BERTBase | 0.45 | 0.60 | 0.79 | 0.36 | 0.91 | 0.76 | 0.50 |
| BioBERTv1.1+PubMed | 0.42 | 0.69 | 0.67 | 0.30 | 0.92 | 0.68 | 0.48 |
| RoBERTaBase | 0.45 | 0.40 | 0.92 | 0.50 | 0.88 | 0.83 | 0.83 |
| DistilBERTbase | 0.43 | 0.64 | 0.72 | 0.32 | 0.91 | 0.71 | 0.54 |
| Clinical prediction | 0.29 | 0.25 | 0.90 | 0.34 | 0.85 | 0.79 | — |
Figure 2.Receiver operating and precision recall curves for predictive models. The AUPRC (left) and AUROC (right) curves are shown for the BERT, BioBERT, RoBERTa, and DistilBERT models, as well as for the clinical prediction made by an ophthalmologist reviewing the patients’ notes.
Figure 3.Example clinical notes with the most important words to model prediction highlighted. Shown are excerpts from example patient clinical notes, including those correctly and incorrectly classified by the model. Words that are most highly attended to in the model and therefore most important to the prediction are highlighted in red.