The successful marriage of big data and artificial intelligence (AI) has been one of the greatest steps forward in medical research this century. The term big data, defined to encompass both large quantities and diverse types of data, was coined in the 1990s, and the term AI emerged even earlier: the first academic conference on AI occurred in 1956. However, these concepts did not become revolutionary drivers of medical research until recently, when all types of health care settings began passively collecting massive amounts of medical data. Electronic health records (EHRs) data and medical imaging data are major examples of these types of routinely collected big data. At the same time, multiomics have given rise to complex datasets with multidimensional data for each patient. These dramatic changes in data collection have enabled AI to progress rapidly toward solving challenging medical research questions.Ophthalmology has been at the forefront of AI research, in particular machine learning and deep learning approaches, because of the ubiquitous availability of noninvasive, rapid, and relatively inexpensive ophthalmic imaging. These advances in ophthalmic imaging have provided the big data required for successful applications of computer vision advances., The Intelligent Research in Sight (IRIS®) Registry, the largest clinical database in the world led by the American Academy of Ophthalmology, is an additional example of why ophthalmology is uniquely positioned for big data questions., Both big data and AI research are data driven, rather than based on the traditional observation-to-hypothesis approach, allowing discoveries outside the boundaries of traditional research methodology. Examples include novel biomarkers in the midst of multiomics datasets, subtle associations that would not be detectable with smaller scale datasets,, and identification of imaging features that were undetected with human eyes, such as the hyporeflective outer retinal band associated with delayed rod-mediated dark adaptation, an established marker of early age-related macular degeneration. This process drives the generation of new hypotheses and better insights into disease mechanisms.Some of the many exciting study results include novel associations between progression of glaucoma and pulmonary function. This was an unexpected finding from multimodal machine learning models that combined retinal imaging, demographics, and systemic variables from the UK Biobank database, which achieved an area under the receiver operating characteristic curve of 0.97 in predicting progression of glaucoma. Additional examples include deep learning algorithms to detect drusen, neovascular age-related macular degeneration, and diabetic macular edema after training with 101 418 OCT images of the retina from 5761 patients, creating OCT angiography images from structural OCT images alone, forecasting future visual field deficits from a single baseline visual field, and fully automated, open-sourced quantification of oculoplastics measurements using routine facial photographs.Despite these numerous successes, several inherent and data-specific limitations must be kept in mind before interpreting study results. First, statistical significance becomes less meaningful when handling millions of data points because even minor differences will show statistical significance. To address this, clinical significance and biological plausibility should be emphasized when designing the study and analyzing their results. Second, a clear understanding of the data used to train AI algorithms is critical because the resulting algorithms may not be generalizable to the intended populations. Third, validation of both big data and AI study results in independent study cohorts are key.As this field of research continues to advance, other critical issues remain. Although incorporating data from digital imaging systems is one thing, incorporating data from clinical notes and manually entered simple data like visual acuity and IOP is another, especially because today’s clinical chart is filled with entry errors and imported text. How can data cleaning methods be applied effectively and accurately to large EHR datasets such as the IRIS® Registry? How do we assess the validity of the EHR data? What are scalable statistical methods for a dataset of similar size and extent as the IRIS® Registry? How do we define clinically significant differences, rather than simply identifying statistical significance, in the setting of large datasets? Despite the data-rich ophthalmic environment, we remain hampered by limited interoperability between diagnostic systems and EHRs. How do we incentivize manufacturers of new diagnostic methods and EHRs to share data reliably and seamlessly from the outset? While doing so, how do we protect the privacy of participants when we continue to add complex linkages between many dimensional datasets? What are the ethical implications of big data and AI research? How do we standardize the datasets and promote data sharing?, Who is responsible for data breaches or failures of clinically deployed algorithms?Ophthalmology Science is pleased to announce the Big Data & AI Special Issue with the hope of providing a home for studies in this growing field. In this special issue, we are particularly interested in research that highlights novel methodologies, such as nontraditional statistical approaches that can be applied to big data and machine learning or deep learning studies. Another goal is to initiate discussions of current challenges and potential strategies to overcome them. Some examples include standardization of the datasets, data sharing processes, data privacy, and obstacles to transferring research findings into clinical care. Understanding both the potential and the limitations of big data and AI approaches through an array of diverse studies and commentaries is the primary goal of this Ophthalmology Science special issue.As vision science researchers and clinicians, we are uniquely positioned to take advantage of big data and AI research. As editors, we are grateful for this opportunity and look forward to engaging actively with the scientific community through this remarkable special issue.
Authors: Bradley S Henriksen; Isaac H Goldstein; Adam Rule; Abigail E Huang; Haley Dusek; Austin Igelman; Michael F Chiang; Michelle R Hribar Journal: Am J Ophthalmol Date: 2019-12-05 Impact factor: 5.258
Authors: Parmita Mehta; Christine A Petersen; Joanne C Wen; Michael R Banitt; Philip P Chen; Karine D Bojikian; Catherine Egan; Su-In Lee; Magdalena Balazinska; Aaron Y Lee; Ariel Rokem Journal: Am J Ophthalmol Date: 2021-05-02 Impact factor: 5.258
Authors: Alexandra Van Brummen; Julia P Owen; Theodore Spaide; Colin Froines; Randy Lu; Megan Lacy; Marian Blazes; Emily Li; Cecilia S Lee; Aaron Y Lee; Matthew Zhang Journal: Am J Ophthalmol Date: 2021-05-16 Impact factor: 5.258
Authors: Aaron Y Lee; Ryan T Yanagihara; Cecilia S Lee; Marian Blazes; Hoon C Jung; Yewlin E Chee; Michael D Gencarella; Harry Gee; April Y Maa; Glenn C Cockerham; Mary Lynch; Edward J Boyko Journal: Diabetes Care Date: 2021-01-05 Impact factor: 19.112
Authors: Aaron Y Lee; Cecilia S Lee; Marian S Blazes; Julia P Owen; Yelena Bagdasarova; Yue Wu; Theodore Spaide; Ryan T Yanagihara; Yuka Kihara; Mark E Clark; MiYoung Kwon; Cynthia Owsley; Christine A Curcio Journal: Transl Vis Sci Technol Date: 2020-12-15 Impact factor: 3.283
Authors: Julia P Owen; Marian Blazes; Megan Lacy; Ryan T Yanagihara; Russell N Van Gelder; Aaron Y Lee; Cecilia S Lee Journal: JAMA Ophthalmol Date: 2021-08-01 Impact factor: 8.253
Authors: Elysse Tom; Pearse A Keane; Marian Blazes; Louis R Pasquale; Michael F Chiang; Aaron Y Lee; Cecilia S Lee Journal: Transl Vis Sci Technol Date: 2020-07-06 Impact factor: 3.283