Literature DB >> 32704419

Applications of Artificial Intelligence to Electronic Health Record Data in Ophthalmology.

Wei-Chun Lin¹, Jimmy S Chen², Michael F Chiang^1,3, Michelle R Hribar¹.

Abstract

Widespread adoption of electronic health records (EHRs) has resulted in the collection of massive amounts of clinical data. In ophthalmology in particular, the volume range of data captured in EHR systems has been growing rapidly. Yet making effective secondary use of this EHR data for improving patient care and facilitating clinical decision-making has remained challenging due to the complexity and heterogeneity of these data. Artificial intelligence (AI) techniques present a promising way to analyze these multimodal data sets. While AI techniques have been extensively applied to imaging data, there are a limited number of studies employing AI techniques with clinical data from the EHR. The objective of this review is to provide an overview of different AI methods applied to EHR data in the field of ophthalmology. This literature review highlights that the secondary use of EHR data has focused on glaucoma, diabetic retinopathy, age-related macular degeneration, and cataracts with the use of AI techniques. These techniques have been used to improve ocular disease diagnosis, risk assessment, and progression prediction. Techniques such as supervised machine learning, deep learning, and natural language processing were most commonly used in the articles reviewed. Copyright 2020 The Authors.

Entities: Chemical Disease Gene Species

Keywords: artificial intelligence; electronic health record; machine learning; ophthalmology

Year: 2020 PMID： 32704419 PMCID： PMC7347028 DOI： 10.1167/tvst.9.2.13

Source DB: PubMed Journal: Transl Vis Sci Technol ISSN： 2164-2591 Impact factor: 3.283

Introduction

The rapid adoption of electronic health records (EHRs) in recent decades has generated large volumes of clinical data with potential to support secondary use in research.– Indeed, a recurring justification for EHR adoption has been to support the collection and analysis of “big data” to gain meaningful insights., The clinical research community has expressed growing interest in developing effective techniques to reuse clinical data from EHRs, in part because of the benefits of secondary data reuse over primary data collection., Researchers reusing EHR data may not need to recruit patients or collect new data, potentially reducing cost compared with traditional clinical research. Moreover, EHR data often contain valuable longitudinal data regarding a patient's status, medical care, and disease progression, which have been previously shown to support clinical decision support, medical concept extraction, diagnosis, and risk assessment. However, there are challenges associated with reusing EHR data, particularly because of its complexity and heterogeneity. For example, in ophthalmology, patient data contained in EHRs may include fields as diverse as demographic information, diagnoses, laboratory tests, prescriptions, eye examinations, imaging, and surgical records. Interpreting these heterogeneous data requires strategies such as information extraction, dimension reduction, and predictive modeling typical of machine learning and, more broadly, artificial intelligence (AI) techniques. Applying AI to EHR data has been productive in a variety of domains. For instance, studies in cardiology have broadly used AI techniques with EHR data for the early detection of heart failure, to predict the onset of congestive heart failure, and to improve risk assessment in patients with suspected coronary artery disease. Likewise in ophthalmology, machine learning classifiers with EHR data have been used to predict risks of cataract surgery complications, improve diagnosis of glaucoma and age-related macular degeneration (AMD), and perform risk assessment of diabetic retinopathy (DR).– Although the application of AI to EHR data related to ocular diseases has increased during the past decade, there have been no published reviews of this literature. One literature review of machine learning techniques applied in ophthalmology was published in 2017; however, the included studies mainly focused on the application of machine learning techniques to imaging data, rather than EHR data. This manuscript addresses this knowledge gap by reviewing the literature applying AI techniques to EHR data for ocular disease diagnosis and monitoring. With this review, we explore the type of AI techniques used, the performance of these techniques, and how AI has been applied to specific ocular diseases, providing future directions to clinical practice and research.

Methods

An exhaustive search was performed in the PubMed database using search terms related to “Artificial intelligence”, “Electronic health records,” and “Eye” in any field of articles. See the Appendix for the full query. The results were then examined and narrowed according to the following criteria: Duplicates were removed. Studies were eliminated for lack of relevance after review of the title and abstract; studies that used only imaging data without any EHR data were excluded. Studies without direct clinical application or not related to the topic were excluded. The review process is summarized in Figure 1. One author (WL) identified articles for inclusion through manual title, abstract, and content review. Two authors (WL and JSC) extracted data for each study: the aim, disease, algorithm, specific techniques, performance assessment, and conclusion of the articles that met the inclusion criteria, as summarized in the Table.–,–

Figure 1.

Flow diagram for the literatures selection.

Table.

Studies on Ocular Diseases Using Artificial Intelligence Techniques With EHR Data

Authors	Aim	Disease	Algorithm Type	Specific Techniques	Performance	Conclusions
Lin et al.²⁰	Disease detection	Myopia	Supervised machine learning	Random forest	95% CI for predicting onset of high myopia. 3 years onset prediction (AUC: 94%–98.5%), 5 years (85.6%–90.1%), 8 years (80.1%–83.7%)	Machine learning with EHR data can accurately predict myopia onset
Lee et al.²¹	Improve diagnostic accuracy	AMD	Deep learning	Convolutional neural networks	For each patient, AUC (97.45%), accuracy (93.54%), sensitivity (92.64%), and specificity (93.69%)	Linked OCT images to EMR data can improve the accuracy of a deep learning model when used to distinguish AMD from normal OCT images
Baxter et al.²²	Risk assessment	Open-angle glaucoma	Supervised machine learningDeep learning	Logistic regression, random forests,ANNs	AUC of logistic model (67%), random forest (65%), ANNs (65%)	Existing systemic data in the EHR can identify POAG patients at risk of progression to surgical intervention
Chaganti et al.¹⁶	Identify risk factors and improve diagnostic accuracy	Glaucoma, intrinsic optic nerve disease, optic nerve edema, orbital inflammation, and thyroid eye disease	Supervised machine learning	Random forest	AUC of classifiers: glaucoma (88%), intrinsic optic neuritis (76%), optic nerve edema (78%), orbital inflammation (77%), thyroid eye disease (85%)	EMR phenotype (from pyPheWAS) can improve the predictive performance of a random forest classifier with imaging biomarkers
Apostolova et al.²³	Patient identification	Open globe injury	Supervised machine learning & Text-mining	SVMNLP–Word embeddings	Text classification: precision (92.50%), recall (89.83%)	Free-form text with machine learning methods can used to identify open globe injury
Saleh et al.¹⁸	Risk assessment	DR	Supervised machine learning	FRF, DRSA	Performance of FRF:Accuracy (80.29%), sensitivity (80.67%), specificity (80.18%)Performance of DRSA:Accuracy (77.32 %), sensitivity (76.89 %), specificity (77.43%) of DRSA.	Ensemble classifiers (RFR and DRSA) can be applied for diabetic retinopathy risk assessment. The 2-step aggregation procedure is recommended
Rohm et al.²⁴	Predict progression	AMD	Supervised machine learning	AdaBoost, Gradient Boosting, Random Forests, Extremely Randomized trees, LASSO	Accuracy of logMAR VA prediction after VEGF injections.3 months: MAE (0.14), RMSE (0.18)12 months: MAE (0.16), RMSE (0.2)	EHR data of patients with neovascular AMD can be used to predict visual acuity by using machine learning models
Yoo and Park²⁵	Risk assessment	DR	Supervised machine learning	Ridge, elastic net, and LASSO	In external validation, LASSO predicted DR: AUC (82%), accuracy (75.2%), sensitivity (72.1%), and specificity (76.0%)	LASSO with EHR data can be used to predict DR risk among diabetic patients
Fraccaro et al.¹⁷	Improve diagnostic accuracy	AMD	Supervised machine learning	Logistic regression, decision trees, SVM, random forests, and AdaBoost	AUC of random forest, logistic regression, and AdaBoost (92%); SVM, decision trees (90%)	Machine learning algorithms using clinical EHR data can be used to improve diagnostic accuracy of AMD
Sramka et al.²⁶	Improve surgical outcome	Cataracts	Supervised machine learningDeep learning	SVM-RMMLNN-EM	Both SVM-RM and MLNN-EM achieved significantly better results than the Barrett Universal II formula in the ±0.50 D PE category	SVM-RM and MLNN-EM with EHR data can be used to improve clinical IOL calculations and improve cataract surgery refractive outcomes
Peissig et al.²⁷	Patient identification	Cataracts	Text-mining	NLP	The multimodal model shows results including sensitivity (84.6%), specificity (98.7%), PPV (95.6%), and NPV (95.1%)	A multimodal strategy incorporating optical character recognition and natural language processing can increase the number of cataracts cases identified
Gaskin et al.¹⁵	Identify and predict risks of cataract surgery complications	Cataract	Supervised machine learningText-mining	Bootstrapped LASSO, random forestNLP	Based on the LASSO model, younger age (<60 years old), prior anterior vitrectomy or refractive surgery, history of AMD, and complex cataract surgery were risk factors associated with postoperative complicationsThe random forest model shows high NPV > 95% and moderate sensitivity (67%) and AUC (65%)	Bootstrapped LASSO can be used to identify risk factors of postoperative complications of cataract surgeryRandom forest shows good reliability for predicting cataract surgery complications
Skevofilakas et al.²⁸	Risk assessment	DR	Deep learningSupervised machine learning	FNN and iHWNNCART	AUC of hybrid DSS (98%), iHWNN (97%), FNN (88%), and CART (86%).	Hybrid DSS trained on imaging and related EHR data can estimate the risk of a type 1 diabetic patient developing diabetic retinopathy

AMD, age-related macular degeneration; ANN, artificial neural network; AUC, area under the curve; CART, classification and regression tree; CI, confidence interval; DR, diabetic retinopathy; DRSA, dominance-based rough set approach; DSS, decision support system; EHR, electronic medical record; EMR, electronic medical record; FNN, feed forward neural network; FRF, fuzzy random forest; iHWNN, improved hybrid wavelet neural network; IOL, intraocular lens; LogMAR, logarithm of the minimum angle of resolution; LASSO, least absolute shrinkage and selection operator; MAE, mean absolute error; MLNN-EM, multilayer neural network ensemble model; NLP, natural language processing; NPV, negative predictive value; OCT, optical coherence tomography; POAG, primary open-angle glaucoma; RFR, random forest regression; RMSE, root mean squared error; SVM, support vector machine; SVM-RM, support vector machine regression model; VA, visual acuity; VEGF, vascular endothelial growth factor.

Flow diagram for the literatures selection. Studies on Ocular Diseases Using Artificial Intelligence Techniques With EHR Data AMD, age-related macular degeneration; ANN, artificial neural network; AUC, area under the curve; CART, classification and regression tree; CI, confidence interval; DR, diabetic retinopathy; DRSA, dominance-based rough set approach; DSS, decision support system; EHR, electronic medical record; EMR, electronic medical record; FNN, feed forward neural network; FRF, fuzzy random forest; iHWNN, improved hybrid wavelet neural network; IOL, intraocular lens; LogMAR, logarithm of the minimum angle of resolution; LASSO, least absolute shrinkage and selection operator; MAE, mean absolute error; MLNN-EM, multilayer neural network ensemble model; NLP, natural language processing; NPV, negative predictive value; OCT, optical coherence tomography; POAG, primary open-angle glaucoma; RFR, random forest regression; RMSE, root mean squared error; SVM, support vector machine; SVM-RM, support vector machine regression model; VA, visual acuity; VEGF, vascular endothelial growth factor.

Results

The PubMed query returned 164 articles published through August 2019. In total, 161 articles were reviewed after removing 3 duplicates. Then 118 articles were excluded because of lack of relevance based on the title and abstract. A total of 13 articles were considered that met inclusion criteria (Fig. 1).

AI Techniques

Three major techniques were used in these studies: 11 studies used supervised machine learning, of which 3 studies specifically used a deep learning technique; 2 studies also used natural language processing (NLP) to generate structured data suitable for analysis from unstructured text. Only 1 study used deep learning by itself, and another study used NLP independent of other techniques (Table). Figure 2 illustrates a simplified machine learning process and the relationship among these 3 techniques. In short, NLP can be used to extract useful information from text-based data and process it into a format suitable for machine learning. Supervised machine learning techniques, some of which use deep learning algorithms, can then be applied to these and other structured data sets to develop predictive models or classifiers.

Figure 2.

Schematic of the steps of machine learning application. NLP, natural language processing; SVM, support vector machine; CART, classification and regression tree; CNN, convolutional neural network; FNN, feed forward neural network.

Machine Learning

Machine learning techniques are computational methods that learn patterns or classifications within data without being explicitly programmed to do so. Machine learning can be divided into 2 methods based on the use of “ground truth” data: supervised learning and unsupervised learning. In supervised learning, a model learns from “ground truth” data in a training data set that contains labeled output data and then can predict the output for new cases. The algorithm is typically a classifier with categorical output or a regression algorithm with continuous output. In unsupervised learning, the model learns from a training data set without labeled output and identifies underlying patterns or structures within its input data. In medicine, machine learning has been widely used in several specialties such as radiology, cardiology, oncology, and ophthalmology to improve diagnostic accuracy and early disease detection. In this review, most studies used supervised machine learning techniques such as random forest, logistic regression, support vector machines (SVMs), gradient boosting, least absolute shrinkage and selection operator (LASSO), AdaBoost, and classification and regression tree (CART). As shown in Figure 3B, logistic regression is an extension of linear regression (Fig. 3A). In linear regression, the data is modeled as a linear relationship that can be used to predict a value for a given input. In logistic regression, a non-linear function, called the logistic function, converts prediction values into binary categories based on a threshold. Some methods can be used to improve the prediction accuracy of logistic regression, such as least absolute shrinkage and selection operator (LASSO). LASSO is a statistical method that selects a smaller subset of predictor variables most related to the outcome variable and shrinks regression coefficients to improve accuracy and generalizability. SVM is another popular machine learning model used for classification analysis. As shown in Figure 3C, a boundary is created to split input data into two distinct groups and can be used to classify new data into similar distinct categories.

Figure 3.

Illustrations of machine learning models. 3A. Linear regression; 3B. Logistic regression; 3C. Support vector machine; 3D. Classification and regression trees (CART); 3E. Ensemble methods; 3F. Artificial neural network (ANN).

A decision tree is an important supervised machine learning algorithm. Figure 3D illustrates a decision tree with a root node as a start followed by the branched nodes and terminal nodes. The root node is the first decision node representing the best predictor variable. Each branched node represents the output of a given input variable. As more input variables are added to subsequent branching nodes, the decision tree becomes more sophisticated in predicting the outcome variable at the terminal nodes. Ensemble methods combine multiple machine learning models and are commonly used to improve the performance of prediction models. The two most common methods: bootstrapping aggregation (bagging) and boosting were shown in Figure 3E. In a bagging method, multiple subsets of data are randomly selected from the original dataset and each subset data are used to train a separate prediction model. The final predictions will be aggregated from all prediction models. Random forest algorithms are examples of an ensemble machine learning method that combine bagging and decision trees. Boosting is another technique that combines multiple models to create a more accurate one. Adaboost and gradient boosting are widely used boosting machine learning algorithms. Illustrations of machine learning models. 3A. Linear regression; 3B. Logistic regression; 3C. Support vector machine; 3D. Classification and regression trees (CART); 3E. Ensemble methods; 3F. Artificial neural network (ANN). As shown in the Table, random forest was used by Lin et al. to predict myopia onset and by Chaganti et al. to improve the diagnostic accuracy of glaucoma. In addition, Baxter et al. used random forest and logistic regression to identify patients with open-angle glaucoma who had a risk of progression to surgical intervention. Fraccaro et al. used logistic regression, decision trees, SVMs, random forests, and AdaBoost to improve diagnostic accuracy of AMD. In addition, fuzzy random forest (FRF) and dominance-based rough set approach (DRSA) were used by Saleh et al. for DR risk assessment. And Gaskin et al. used random forest and bootstrapped LASSO to identify and predict risks of cataract surgery complications. Moreover, Yoo and Park used elastic net and LASSO to predict DR risk among diabetic patients.

Deep Learning

Deep learning is a subset of machine learning techniques based on artificial neural networks (ANNs) that mimic human brain processing. As shown in Figure 3F, multiple layers of computation are constructed in a deep learning model, and each layer is used to perform computations on data from the previous layer. The layers between the input layer and the output layer are called hidden layers. While the information may flow from the input to subsequent output layers (feedforward), information can also flow backward from hidden layers to input layers (backpropagation). The inputs and outputs of hidden layers are not reported; deep learning algorithms present only the final outcome of the output layer. Deep learning does not use structured features for input as machine learning does; therefore, deep learning is useful for raw images because they do not have to be prefiltered as they do for machine learning algorithms. After processing raw input through multiple layers within deep neural networks, the algorithms find appropriate features for classifying output. In this review, several articles used deep learning algorithms such as ANNs, convolutional neural networks (CNNs), multilayer neural network ensemble models (MLNN-EMs), and feed forward neural networks (FNNs). CNN is a subtype of deep neural network commonly used in image classification. In a CNN model, special convolution and pooling layers are used to reduce a raw image to essential features necessary for the model to classify or label the image. In other words, these techniques use machine learning to determine model input features from the raw image data, rather than a human or a separate image processing program. MLNN-EM is a learning technique that integrates several neural networks to aggregated outcome. In addition, FNN is another subtype of neural network where the information moves forward in (one direction) from root nodes; information never moves backwards. The nodes between input and out layers do not form a cycle of information. As shown in the Table, Lee et al. used CNNs to distinguish AMD from normal OCT images, Baxter et al. used ANNs to identify open-angle glaucoma patients at risk of progression to surgery. Also, Sramka et al. used models MLNN-EMs and support vector machine regression models (SVM-RM) to improve clinical intraocular lens (IOL) calculations, and Skevofilakas et al. used feed forward neural network (FNN) and improved hybrid wavelet neural networks to develop hybrid decision support system for predicting DR risk among diabetic patients.

NLP

NLP is a branch of AI in which computers attempt to interpret human language in written or spoken form. By using NLP, researchers can extract information from text; some uses in medicine include separating progress notes into sections, determining diagnoses from notes, and identifying the documentation of adverse events. As shown in the Table, Apostolova et al., Peissig et al., and Gaskin et al. describe the use of NLP in extracting cataract information from free-form text clinical notes.

Outcome Metrics for Evaluation of Performance of AI Techniques

Performance evaluation of different AI techniques depends on the chosen algorithm, the purpose of the study, and the input data set. In supervised machine learning algorithms, classifiers are evaluated based on a comparison between the known categorical output and the predicted categorical output. For outputs with 2 categories, the accuracy, sensitivity, specificity, positive predictive value, and negative predictive value can be computed. Another important evaluation metric is the AUC-ROC (area under the curve–receiving operating characteristic), which is used to evaluate the performance of classifiers based on different thresholds. ROC is a probability curve that visualizes the true positive rate (sensitivity) change with respect to false positive rate (1–specificity) for different threshold values used in the model. The AUC represents the ability of a model to distinguish between different outcome values. An AUC equal to 1 is ideal and represents the model's ability to perfectly distinguish between two outcomes. On the other hand, an AUC of approximately 0.5 is the worst case because it means that the model is not better than chance for distinguishing between two outcomes. As shown in the Table, 8 studies used AUC-ROC to evaluate the performance of classifiers.–,–,, The range of AUC-ROC was from 65% to 98.5%, and the median AUC in all included studies was 90%. In addition, precision and recall were used to evaluate the performance of text-mining algorithms. Apostolova et al. and Peissig et al. used precision and recall to evaluate the performance of text classification. For regression models, 2 evaluation metrics—mean absolute error (MAE) and root mean squared error (RMSE)—are commonly used to measure accuracy for continuous variables. They measure the average difference between actual observations and predictions. MAE shows the absolute differences with equal weight for each difference. In contrast, RMSE penalized larger errors by taking the square of the difference before averaging. In the study by Rohm et al., MAE and RMSE were used to evaluate visual acuity prediction.

Application of AI to Clinical Ophthalmology

AI techniques have been applied clinically to improving ocular disease diagnosis, predicting disease progression, and risk assessment (Table). Several diseases were studied in articles included in this review including glaucoma, cataracts, AMD, and DR. We will present the benefits of AI techniques with EHR data in these diseases.

Glaucoma

Two studies in this review focused on the field of glaucoma and used supervised machine learning techniques to improve diagnosis and predict progression., In the study by Chaganti et al., a good performance was obtained (AUC of glaucoma diagnosis 88%), and results showed that the addition of an EMR phenotype could improve the classification accuracy of a random forest classifier with imaging biomarkers. On the other hand, Baxter et al. reported a moderate performance (AUC 67%) in a study that used EHR data alone to predict risk of progression to surgical intervention in patients with open-angle glaucoma. In addition to model performance, it is important to know which factors can be used to improve disease diagnosis. The work performed by Chaganti et al. began to explore this problem by comparing the performance of classifiers using EMR phenotypes, visual disability scores, and imaging metrics.

Cataracts

Three studies applying different AI techniques to cataract diagnosis and management were reviewed. In the study by Peissig et al., NLP was used to extract cataract information from free-text documents. An EHR-based cataract phenotyping algorithm, which consisted of structured data, information from free-text notes, and optical character recognition on scanned clinical images, was developed to identify cataract subjects. The result of the study showed good performance (predictive positive value >95%). Additionally, Gaskin et al. used supervised machine learning algorithms to identify risk factors and to predict intraoperative and postoperative complications of cataract surgery. The investigators used data mining via NLP to extract cataract information from the EHR system. The predictive model showed moderate performance (AUC 65%), and the risk factors associated with surgical complications included younger patients, refractive surgery history, AMD history, and complex cataract surgery. These risk factors were associated with postoperative complications, and the predictive model showed moderate performance (AUC 65%). Supervised machine learning (SVM-RM) and deep learning (MLNN-EM) algorithms were used to improve the IOL power calculation by Sramka et al. Both SVM-RM and MLNN-EM model provided better IOL calculations than the Barrett Universal II formula.

AMD

Three studies used AI in AMD. Lee et al.1 used deep learning techniques to improve the diagnosis of AMD. Optical coherence tomography (OCT) images of each patient were linked to EMR clinical end points extracted from EPIC (Verona, WI) for each patient to predict a diagnosis of AMD. The model had high accuracy with an AUC 97% in distinguishing AMD from normal OCT images. Another study conducted by Rohm et al. used supervised regression models to accurately predict visual acuity in response to anti–vascular endothelial growth factor injections in patients with neovascular AMD. Models predicting treatment response may have implications in encouraging patients adhering to intravitreal therapy. Also, as demonstrated by Fraccaro et al., supervised machine learning techniques can be incorporated into EHR systems providing real-time support for AMD diagnosis.

DR

DR is one of the most common comorbidities of diabetes, and frequent screening examinations for diabetic patients are resource consuming. Three studies explore this problem by using AI techniques with EHR data to determine patient risk for the development of DR. Saleh et al. used 2 kinds of ensemble classifiers—FRF and DRSA—to predict DR risk using EHRs. Good performance (accuracy 80%) of the FRF model was shown in this study. Similarly, Yoo and Park proposed a comparison between the learning models—ridge, elastic net, and LASSO—using the traditional indicators of DR. They showed that the performance of LASSO (AUC 81%) was significantly better than the traditional indicators (AUC of glycated hemoglobin 69%; AUC of fasting plasma glucose 54%) in diagnosing DR. In addition, a hybrid DSS was developed by Skevofilakas et al. to estimate the risk of a patient with type 1 diabetes to develop DR. The hybrid DSS showed an excellent performance with an AUC of 98%. Overall, these studies show that integrating these techniques with an EHR system has promise in improving early detection of diabetic patients at risk of DR progression.

Discussion

This article reviews the literature applying AI techniques to EHR data to aid in ocular disease diagnosis and risk assessment. We focus the discussion on 3 areas: AI techniques used to analyze EHR data, the performance of techniques, and the ocular diseases most commonly analyzed. First, secondary use of EHR data via AI techniques can be used to improve ocular disease diagnosis, risk assessment, and disease progression. The predictive models across the 8 classifiers showed good performance with a median AUC of 90%. One study, prediction of postoperative complications of cataract surgery, reported moderate accuracy with 65%, perhaps because of insufficient predictors, such as lack of surgeon-relevant information. Also, the prevalence of various complications may affect the reliability of prediction outcomes. For example, a rare prevalence complication may not be handled well with standard classification techniques because of imbalanced data., When a dataset contains a very few number of cases of disease or complications, there is not enough data about these cases for the model to accurately learn how to predict them. On the other hand, excellent performance of classifiers trained on combined EHR and image data were reported by Skevofilakas et al. and Lee et al. For future studies, a feasible direction might be to develop a hybrid model that uses both the routine EHR data and image data sets to have a more complete picture of patient variables associated with the outcome of interest. Second, supervised machine learning was the most common technique used with EHR data to analyze ocular diseases. These studies focused on improving diagnosis, predicting progression, or assessing risk for early detection. The predictors defined were based on the risk factors of disease, demographic features found from literature review, and clinical experiences. None of the studies reviewed used unsupervised machine learning techniques where the desired output and the relationship between the outcome variable and the predictors are unknown. These methods are used to identify clusters of data that are similar and can help discover the hidden factors that are useful for improving the diagnosis. However, unsupervised learning has been successfully applied to other fields. For example, Marlin et al. demonstrated that the probabilistic clustering model for time-series data from real-world EHRs could be able to capture patterns of physiology and be used to construct mortality prediction models. For future studies, unsupervised machine learning techniques might be used to find hidden patterns from EHR data for improving clinical predictions of ocular diseases. Finally, in this review, studies that analyzed EHR data with AI techniques mainly focused on 4 diseases: glaucoma, DR, AMD, and cataracts. The focus on these diseases (glaucoma, AMD, and DR) is likely due to their prevalence as the major causes of irreversible blindness in the world. Early detection or treatment can delay or halt the progression of such diseases, reduce visual morbidity, and preserve a patient's quality of life., These studies suggest that AI techniques can be used to achieve this goal. Furthermore, cataract surgery is the most common refractive surgical procedure and is one of the most common surgeries performed in ophthalmology. Risk assessment of the postoperative complications and decreasing the risk of reoperation are crucial to patient outcomes, and AI techniques can help approach these issues. This review presents the AI techniques used in vision sciences based on EHR data. However, several problems still need to be addressed for future studies. One of the major problems is data quality. EHR data required for research are essentially different from data collected during a traditional clinical research study. EHR data collected from clinical practice may have incomplete information due to incorrect data entry, nonanswers, and recording errors. Consequently, the performance of machine learning models will be dependent on data quality and is an issue when using AI techniques with EHR data.– Additionally, except for the work reported by Lin et al., all reviewed studies were single-center studies. Thus, the results of studies may not be generalizable to other healthcare systems. Although imaging data do not suffer from the data quality issues of other clinical data, there is no well-established gold standard for many imaging techniques. For instance, Garvin et al. presented an automated 3-dimensional intraretinal layer segmentation algorithm using OCT image data. The gold standard was determined by 2 retinal experts’ recommendations. This requires more time and resources to analyze and cross-validate the outcomes. Also, different preprocessing and postprocessing algorithms, hardware configurations, and image processing steps are intended to improve image quality for easier automated diagnosis. However, these factors often make models difficult to replicate. In addition, using imaging analysis without other prior information, such as medical history information, may affect the model performance and lead to biased results. Therefore, integration of imaging data and routine EHR data allows us to obtain prior information to input to the predictive model.

Conclusion

AI techniques are rapidly being adopted in ophthalmology and have the potential to improve the quality and delivery of ophthalmic care. Moreover, secondary use of EHR data is an emerging approach for clinical research involving AI, particularly given the availability of large-scale data sets and analytic methods.– In this review, we describe applications of AI methods to ocular diseases and problems such as diagnostic accuracy, disease progression, and risk assessment and find that the number of published studies in this area has been relatively limited due to challenges with the current quality of EHR data. In the future, we expect that AI using EHR data will be applied more widely in ophthalmic care, particularly as techniques improve and EHR data quality issues are resolved.

38 in total

Review 1. Machine learning for medical diagnosis: history, state of the art and perspective.

Authors: I Kononenko
Journal: Artif Intell Med Date: 2001-08 Impact factor: 5.326

2. Importance of multi-modal approaches to effectively identify cataract cases from electronic health records.

Authors: Peggy L Peissig; Luke V Rasmussen; Richard L Berg; James G Linneman; Catherine A McCarty; Carol Waudby; Lin Chen; Joshua C Denny; Russell A Wilke; Jyotishman Pathak; David Carrell; Abel N Kho; Justin B Starren
Journal: J Am Med Inform Assoc Date: 2012 Mar-Apr Impact factor: 4.497

3. Adding value to the electronic health record through secondary use of data for quality assurance, research, and surveillance.

Authors: William R Hersh
Journal: Am J Manag Care Date: 2007-06 Impact factor: 2.229

4. A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries.

Authors: Min Jiang; Yukun Chen; Mei Liu; S Trent Rosenbloom; Subramani Mani; Joshua C Denny; Hua Xu
Journal: J Am Med Inform Assoc Date: 2011-04-20 Impact factor: 4.497

Review 5. Clinical Data Reuse or Secondary Use: Current Status and Potential Future Progress.

Authors: S M Meystre; C Lovis; T Bürkle; G Tognola; A Budrionis; C U Lehmann
Journal: Yearb Med Inform Date: 2017-09-11

6. The meaning and use of the area under a receiver operating characteristic (ROC) curve.

Authors: J A Hanley; B J McNeil
Journal: Radiology Date: 1982-04 Impact factor: 11.105

7. Medication Accuracy in Electronic Health Records for Microbial Keratitis.

Authors: Hamza A Ashfaq; Corey A Lester; Dena Ballouz; Josh Errickson; Maria A Woodward
Journal: JAMA Ophthalmol Date: 2019-08-01 Impact factor: 7.389

8. Deep learning is effective for the classification of OCT images of normal versus Age-related Macular Degeneration.

Authors: Cecilia S Lee; Doug M Baughman; Aaron Y Lee
Journal: Ophthalmol Retina Date: 2017-02-13

9. Machine learning for prediction of all-cause mortality in patients with suspected coronary artery disease: a 5-year multicentre prospective registry analysis.

Authors: Manish Motwani; Damini Dey; Daniel S Berman; Guido Germano; Stephan Achenbach; Mouaz H Al-Mallah; Daniele Andreini; Matthew J Budoff; Filippo Cademartiri; Tracy Q Callister; Hyuk-Jae Chang; Kavitha Chinnaiyan; Benjamin J W Chow; Ricardo C Cury; Augustin Delago; Millie Gomez; Heidi Gransar; Martin Hadamitzky; Joerg Hausleiter; Niree Hindoyan; Gudrun Feuchtner; Philipp A Kaufmann; Yong-Jin Kim; Jonathon Leipsic; Fay Y Lin; Erica Maffei; Hugo Marques; Gianluca Pontone; Gilbert Raff; Ronen Rubinshtein; Leslee J Shaw; Julia Stehli; Todd C Villines; Allison Dunning; James K Min; Piotr J Slomka
Journal: Eur Heart J Date: 2017-02-14 Impact factor: 29.983

10. Using recurrent neural network models for early detection of heart failure onset.

Authors: Edward Choi; Andy Schuetz; Walter F Stewart; Jimeng Sun
Journal: J Am Med Inform Assoc Date: 2017-03-01 Impact factor: 4.497

12 in total

1. Biomarkers for Progression in Diabetic Retinopathy: Expanding Personalized Medicine through Integration of AI with Electronic Health Records.

Authors: Cris Martin P Jacoba; Leo Anthony Celi; Paolo S Silva
Journal: Semin Ophthalmol Date: 2021-03-18 Impact factor: 1.975

Review 2. Applications of interpretability in deep learning models for ophthalmology.

Authors: Adam M Hanif; Sara Beqiri; Pearse A Keane; J Peter Campbell
Journal: Curr Opin Ophthalmol Date: 2021-09-01 Impact factor: 4.299

Review 3. Gaps in standards for integrating artificial intelligence technologies into ophthalmic practice.

Authors: Sally L Baxter; Aaron Y Lee
Journal: Curr Opin Ophthalmol Date: 2021-09-01 Impact factor: 4.299

Review 4. Precision health in Taiwan: A data-driven diagnostic platform for the future of disease prevention.

Authors: Wesley Wei-Wen Hsiao; Jui-Chu Lin; Chien-Te Fan; Saint Shiou-Sheng Chen
Journal: Comput Struct Biotechnol J Date: 2022-03-26 Impact factor: 6.155

5. A qualitative investigation into the impact of hemophagocytic lymphohistiocytosis on children and their caregivers.

Authors: Annabel Nixon; Elina Roddick; Karen Moore; Diane Wild
Journal: Orphanet J Rare Dis Date: 2021-05-06 Impact factor: 4.123

6. People to policy: The promise and challenges of big data for India.

Authors: Anthony Vipin Das
Journal: Indian J Ophthalmol Date: 2021-11 Impact factor: 1.848

7. Diabetic retinopathy classification for supervised machine learning algorithms.

Authors: Luis Filipe Nakayama; Lucas Zago Ribeiro; Mariana Batista Gonçalves; Daniel A Ferraz; Helen Nazareth Veloso Dos Santos; Fernando Korn Malerbi; Paulo Henrique Morales; Mauricio Maia; Caio Vinicius Saito Regatieri; Rubens Belfort Mattos
Journal: Int J Retina Vitreous Date: 2022-01-03

8. A new approach to identifying patients with elevated risk for Fabry disease using a machine learning algorithm.

Authors: John L Jefferies; Alison K Spencer; Heather A Lau; Matthew W Nelson; Joseph D Giuliano; Joseph W Zabinski; Costas Boussios; Gary Curhan; Richard E Gliklich; David G Warnock
Journal: Orphanet J Rare Dis Date: 2021-12-20 Impact factor: 4.123

9. Effect Evaluation of Artificial Intelligence-Based Electronic Health PDCA Nursing Model in the Treatment of Mycoplasma Pneumonia in Children.

Authors: Yan Zhao
Journal: J Healthc Eng Date: 2022-03-11 Impact factor: 2.682

10. Predictive modeling of proliferative vitreoretinopathy using automated machine learning by ophthalmologists without coding experience.

Authors: Fares Antaki; Ghofril Kahwati; Julia Sebag; Razek Georges Coussa; Anthony Fanous; Renaud Duval; Mikael Sebag
Journal: Sci Rep Date: 2020-11-11 Impact factor: 4.379