| Literature DB >> 36004369 |
Jimmy S Chen1,2, Sally L Baxter1,2.
Abstract
Advances in technology, including novel ophthalmic imaging devices and adoption of the electronic health record (EHR), have resulted in significantly increased data available for both clinical use and research in ophthalmology. While artificial intelligence (AI) algorithms have the potential to utilize these data to transform clinical care, current applications of AI in ophthalmology have focused mostly on image-based deep learning. Unstructured free-text in the EHR represents a tremendous amount of underutilized data in big data analyses and predictive AI. Natural language processing (NLP) is a type of AI involved in processing human language that can be used to develop automated algorithms using these vast quantities of available text data. The purpose of this review was to introduce ophthalmologists to NLP by (1) reviewing current applications of NLP in ophthalmology and (2) exploring potential applications of NLP. We reviewed current literature published in Pubmed and Google Scholar for articles related to NLP and ophthalmology, and used ancestor search to expand our references. Overall, we found 19 published studies of NLP in ophthalmology. The majority of these publications (16) focused on extracting specific text such as visual acuity from free-text notes for the purposes of quantitative analysis. Other applications included: domain embedding, predictive modeling, and topic modeling. Future ophthalmic applications of NLP may also focus on developing search engines for data within free-text notes, cleaning notes, automated question-answering, and translating ophthalmology notes for other specialties or for patients, especially with a growing interest in open notes. As medicine becomes more data-oriented, NLP offers increasing opportunities to augment our ability to harness free-text data and drive innovations in healthcare delivery and treatment of ophthalmic conditions.Entities:
Keywords: artificial intelligence; big data; data science; informatics; machine learning; natural language processing; ophthalmology
Year: 2022 PMID: 36004369 PMCID: PMC9393550 DOI: 10.3389/fmed.2022.906554
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Intersection of natural language processing (NLP) with artificial intelligence (AI), machine learning (ML), and deep learning (DL). NLP is a branch of AI concerned with processing and analyzing text data. ML is a subfield of AI aimed at modeling data, and DL is a subfield of ML that uses neural networks to analyze large datasets. NLP techniques may utilize ML and DL when used for classification of words, sentences, or even paragraphs.
Figure 2Examples of natural language processing (NLP) techniques and applications. Natural language processing, or NLP, is an area of artificial intelligence (AI) that deals with processing and analyzing textual data. Several NLP techniques include: relevance ranking, named entity recognition (NER), text cleaning, word embedding, which has applications in question-answering, summarization, topic modeling, among several other use cases.
Figure 3Methodology for Review of Ophthalmic Studies Utilizing Natural Language Processing (NLP). We searched PubMed and Google Scholars, augmented by ancestor search for studies related to use of NLP in ophthalmology applications.
Summary of Current Studies using NLP in Ophthalmology.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Barrows et al. ( | 2000 | Automated extraction of demographic and clinical parameters relevant to glaucoma from EHR notes | Text data extraction (Rule-based search, MedLEE) | All parameters were extracted with a >90% accuracy | Rule-based search performed similarly to NLP methods in terms of extracting text data from the EHR |
| Smith et al. ( | 2008 | Extract visual acuity and diabetic retinopathy stage for analysis of quality of life | Text data extraction (Rule-based search) | No specific performance metrics for accuracy of text extraction was reported | NLP can identify visual acuity of patients with diabetic retinopathy for use in secondary regression analysis to predict quality of life with vision loss |
| Peissig et al. ( | 2012 | Automated identification of cataracts from free-text and image-based EHR scans of text | Text data extraction (Rule-based search, MedLEE) | Positive predictive value >95% | Multi-modal strategies are highly accurate in identifying cataracts and generalizable across institutions |
| Mbagwu et al. ( | 2016 | Extract Visual Acuity from defined structured fields and clinical notes in the EHR | Text data extraction (Rule-based search) | 99% agreement between clinician and algorithm | Best corrected visual acuity can be automatically extracted from clinical notes in the EHR with high accuracy |
| Liu et al. ( | 2017 | Extract antibiotics and intraoperative complications from cataract surgery operative notes | Text data extraction (Rule-based search) | Positive and negative predictive values for identification of antibiotic injection were >99%. For operative complications, extraction accuracy was >94% | NLP can extract data from operative notes with high accuracy |
| Baughman et al. ( | 2017 | Automated visual acuity extraction from free-text clinical notes | Text data extraction (Rule-based search) | Manually reviewed and automated extracted visual acuities had a 95% concordance, K = 0.94 | Automated visual acuity extraction from EHR free text notes is highly accurate |
| Gaskin et al. ( | 2017 | Extract features from text notes using NLP for regression analysis | Text data extraction (Text negation and extraction using a predefined ontology) | No specific performance metrics for accuracy of text extraction was reported | Automated extraction of demographic factors, systemic disease, and drug information can be used to create high-performing models of cataract surgery complications |
| Zheng et al. ( | 2018 | Extract diagnosis of herpes zoster ophthalmicus from EHR notes | Text data extraction (negating, lemmatization), part-of-speech tagging, indexing, tokenization, Classification | Sensitivity = 95.6%, specificity = 99.3% | NLP algorithms can classify herpes zoster ophthalmicus vs. not based on progress note data |
| Stein et al. ( | 2019 | Identify exfoliation syndrome in free-text EHR notes | Text data extraction (Rule-based search) | Positive predictive value = 95% and Negative predictive value = 100% | Automated extraction of exfoliation syndrome appears to be more accurate than conventional assessment of billing codes |
| Maganti et al. ( | 2019 | Extract microbial keratitis morphology measurements from free-text in the examination notes | Text data extraction (Rule-based search) | Microbial keratitis measurements were extracted with a sensitivity of 75–96% and specificity of 91–96% | Metrics of microbial keratitis can be automatically extracted from exam data in EHR notes |
| Tan et al. ( | 2019 | Use NLP on free-text referrals to develop ML models for triaging | Text data extraction (Text negation, stripping), Classification | No specific performance metrics for accuracy of text extraction were reported | NLP can facilitate the training of ML models for triage |
| Baxter et al. ( | 2020 | Identify fungal ocular involvement from EHR notes | Text data extraction (Rule-based search) | Rule-based search yield 683/26,830 notes with possible fungal ocular involvement. Manual review found 0% fungal ocular involvement. | NLP can expedite review of notes for fungal ocular disease |
| Wang et al. ( | 2020 | Extract concepts related to vision outcomes from free-text notes and medication orders from the EHR | Text data extraction (Rule-based search, MedEx) | Rule-based laterality classifier: 100% accuracy. Implant usage: 99–100% accuracy. Glaucoma medications: 90.7% inter-annotator agreement, 85% accuracy for medications extracted by MedEx | NLP can be used to accurately extract laterality, medications, and implant model usage from cataract and glaucoma surgeries |
| Hallak et al. ( | 2020 | Identify focuses of research in ophthalmology and AI related to the COVID-19 pandemic | Topic modeling | >200 manuscripts: 57.8% focused on patient care and practice management, 19.4% on transmission, 17.2% on ocular manifestations, 5.6% on treatment/diagnosis | NLP can identify recent focuses of ophthalmic research during the COVID-19 pandemic including ocular manifestations and applications of AI |
| Wang et al. ( | 2021 | Create ophthalmology-specific word embeddings using published literature and EHR notes | Word embedding, Classification | PubMed and EHR word-embeddings resulting in similar AUROCs (~0.83) and outperformed previous non-ophthalmic word embeddings in predicting low vision prognosis | Ophthalmology-specific word embeddings can be used to increase prediction accuracy on prognosis of low-vision patients from EHR notes |
| Nguyen et al. ( | 2021 | Utilize clinical data from social media and forums to understand patient attitudes toward care | Text data extraction (Rule-based search and text stripping into keywords), Sentiment analysis | Complications, body parts, and undiagnosed symptoms were associated with sadness. Joy was slightly more likely to be expressed after doctor response. | Sentiment analysis can be used to better understand patient perspectives and promote patient-centered care |
| Gui et al. ( | 2022 | Use structured and extracted unstructured data to create models to predict low vision prognosis | Text data extraction (stripping), named entity recognition, word embeddings, classification | Several NLP techniques performed comparably well for predicting low vision prognosis (F1 0.63–0.7) | Free text progress notes can be used to accurately predict low vision prognosis using various NLP techniques |
| Wang et al. ( | 2022 | Use extracted free-text data from notes and structured data to predict glaucoma progression to surgery using deep learning | Text data extraction (stripping), word embeddings, classification | Convolutional neural networks trained on structured + unstructured inputs outperformed models trained on structural features alone (F1 0.42 vs. 0.34) and both outperformed a glaucoma clinician (F1 0.30) | Structured and unstructured data from the EHR can predict glaucoma progression with accuracy |
| Lin et al. ( | 2022 | Extract medication data from free-text progress notes | Named entity recognition | Overall F1 score = 0.95 for all medication entities (drug name, route, frequency, etc.) | Medication information can be accurately extracted from free-text data |
Overall, 19 studies using NLP in an ophthalmic context were identified from the literature. The majority of these studies primarily used rule-based search as their use case for NLP, though more recent studies have begun using more sophisticated techniques such as word embedding and named entity recognition. These studies spanned publication from 2000 to 2022, with the majority written after 2019.