Adam Hanif1, Charles Lu2, Ken Chang2, Praveer Singh2, Aaron S Coyner1, James M Brown3, Susan Ostmo1, Robison V Paul Chan4, Daniel Rubin5, Michael F Chiang6, Jayashree Kalpathy-Cramer2, John Peter Campbell7. 1. Department of Ophthalmology, Oregon Health & Science University, Portland, Oregon. 2. Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Charlestown, Massachusetts; Center for Clinical Data Science, Massachusetts General Hospital and Brigham and Women's Hospital, Boston, Massachusetts. 3. School of Computer Science, University of Lincoln, Lincoln, United Kingdom. 4. Ophthalmology and Visual Sciences, University of Illinois at Chicago, Chicago, Illinois. 5. Center for Biomedical Informatics Research, Stanford University School of Medicine, Stanford, California. 6. National Eye Institute, National Institutes of Health, Bethesda, Maryland. 7. Department of Ophthalmology, Oregon Health & Science University, Portland, Oregon. Electronic address: campbelp@ohsu.edu.
Abstract
OBJECTIVE: To utilize a deep learning (DL) model trained via federated learning (FL), a method of collaborative training without sharing patient data, to delineate institutional differences in clinician diagnostic paradigms and disease epidemiology in retinopathy of prematurity (ROP). DESIGN: Evaluation of a diagnostic test or technology. SUBJECTS AND CONTROLS: We included 5245 patients with wide-angle retinal imaging from the neonatal intensive care units of 7 institutions as part of the Imaging and Informatics in ROP study. Images were labeled with the clinical diagnoses of plus disease (plus, preplus, no plus), which were documented in the chart, and a reference standard diagnosis was determined by 3 image-based ROP graders and the clinical diagnosis. METHODS: Demographics (birth weight, gestational age) and clinical diagnoses for all eye examinations were recorded from each institution. Using an FL approach, a DL model for plus disease classification was trained using only the clinical labels. The 3 class probabilities were then converted into a vascular severity score (VSS) for each eye examination, as well as an "institutional VSS," in which the average of the VSS values assigned to patients' higher severity ("worse") eyes at each examination was calculated for each institution. MAIN OUTCOME MEASURES: We compared demographics, clinical diagnoses of plus disease, and institutional VSSs between institutions using the McNemar-Bowker test, 2-proportion Z test, and 1-way analysis of variance with post hoc analysis by the Tukey-Kramer test. Single regression analysis was performed to explore the relationship between demographics and VSSs. RESULTS: We found that the proportion of patients diagnosed with preplus disease varied significantly between institutions (P < 0.001). Using the DL-derived VSS trained on the data from all institutions using FL, we observed differences in the institutional VSS and the level of vascular severity diagnosed as no plus (P < 0.001) across institutions. A significant, inverse relationship between the institutional VSS and mean gestational age was found (P = 0.049, adjusted R2 = 0.49). CONCLUSIONS: A DL-derived ROP VSS developed without sharing data between institutions using FL identified differences in the clinical diagnoses of plus disease and overall levels of ROP severity between institutions. Federated learning may represent a method to standardize clinical diagnoses and provide objective measurements of disease for image-based diseases.
OBJECTIVE: To utilize a deep learning (DL) model trained via federated learning (FL), a method of collaborative training without sharing patient data, to delineate institutional differences in clinician diagnostic paradigms and disease epidemiology in retinopathy of prematurity (ROP). DESIGN: Evaluation of a diagnostic test or technology. SUBJECTS AND CONTROLS: We included 5245 patients with wide-angle retinal imaging from the neonatal intensive care units of 7 institutions as part of the Imaging and Informatics in ROP study. Images were labeled with the clinical diagnoses of plus disease (plus, preplus, no plus), which were documented in the chart, and a reference standard diagnosis was determined by 3 image-based ROP graders and the clinical diagnosis. METHODS: Demographics (birth weight, gestational age) and clinical diagnoses for all eye examinations were recorded from each institution. Using an FL approach, a DL model for plus disease classification was trained using only the clinical labels. The 3 class probabilities were then converted into a vascular severity score (VSS) for each eye examination, as well as an "institutional VSS," in which the average of the VSS values assigned to patients' higher severity ("worse") eyes at each examination was calculated for each institution. MAIN OUTCOME MEASURES: We compared demographics, clinical diagnoses of plus disease, and institutional VSSs between institutions using the McNemar-Bowker test, 2-proportion Z test, and 1-way analysis of variance with post hoc analysis by the Tukey-Kramer test. Single regression analysis was performed to explore the relationship between demographics and VSSs. RESULTS: We found that the proportion of patients diagnosed with preplus disease varied significantly between institutions (P < 0.001). Using the DL-derived VSS trained on the data from all institutions using FL, we observed differences in the institutional VSS and the level of vascular severity diagnosed as no plus (P < 0.001) across institutions. A significant, inverse relationship between the institutional VSS and mean gestational age was found (P = 0.049, adjusted R2 = 0.49). CONCLUSIONS: A DL-derived ROP VSS developed without sharing data between institutions using FL identified differences in the clinical diagnoses of plus disease and overall levels of ROP severity between institutions. Federated learning may represent a method to standardize clinical diagnoses and provide objective measurements of disease for image-based diseases.
Authors: Michael C Ryan; Susan Ostmo; Karyn Jonas; Audina Berrocal; Kimberly Drenser; Jason Horowitz; Thomas C Lee; Charles Simmons; Maria-Ana Martinez-Castellanos; R V Paul Chan; Michael F Chiang Journal: AMIA Annu Symp Proc Date: 2014-11-14
Authors: Rene Y Choi; James M Brown; Jayashree Kalpathy-Cramer; R V Paul Chan; Susan Ostmo; Michael F Chiang; J Peter Campbell Journal: Ophthalmol Retina Date: 2020-05-04
Authors: James M Brown; J Peter Campbell; Andrew Beers; Ken Chang; Susan Ostmo; R V Paul Chan; Jennifer Dy; Deniz Erdogmus; Stratis Ioannidis; Jayashree Kalpathy-Cramer; Michael F Chiang Journal: JAMA Ophthalmol Date: 2018-07-01 Impact factor: 7.389
Authors: J Peter Campbell; Michael C Ryan; Emily Lore; Peng Tian; Susan Ostmo; Karyn Jonas; R V Paul Chan; Michael F Chiang Journal: Ophthalmology Date: 2016-05-27 Impact factor: 12.079
Authors: James D Reynolds; Velma Dobson; Graham E Quinn; Alistair R Fielder; Earl A Palmer; Richard A Saunders; Robert J Hardy; Dale L Phelps; John D Baker; Michael T Trese; David Schaffer; Betty Tung Journal: Arch Ophthalmol Date: 2002-11
Authors: Kellyn N Bellsmith; James Brown; Sang Jin Kim; Isaac H Goldstein; Aaron Coyner; Susan Ostmo; Kishan Gupta; R V Paul Chan; Jayashree Kalpathy-Cramer; Michael F Chiang; J Peter Campbell Journal: Ophthalmology Date: 2020-02-07 Impact factor: 12.079
Authors: Kishan Gupta; J Peter Campbell; Stanford Taylor; James M Brown; Susan Ostmo; R V Paul Chan; Jennifer Dy; Deniz Erdogmus; Stratis Ioannidis; Jayashree Kalpathy-Cramer; Sang J Kim; Michael F Chiang Journal: JAMA Ophthalmol Date: 2019-07-03 Impact factor: 7.389
Authors: Stanford Taylor; James M Brown; Kishan Gupta; J Peter Campbell; Susan Ostmo; R V Paul Chan; Jennifer Dy; Deniz Erdogmus; Stratis Ioannidis; Sang J Kim; Jayashree Kalpathy-Cramer; Michael F Chiang Journal: JAMA Ophthalmol Date: 2019-07-03 Impact factor: 7.389
Authors: Michael F Chiang; Graham E Quinn; Alistair R Fielder; Susan R Ostmo; R V Paul Chan; Audina Berrocal; Gil Binenbaum; Michael Blair; J Peter Campbell; Antonio Capone; Yi Chen; Shuan Dai; Anna Ells; Brian W Fleck; William V Good; M Elizabeth Hartnett; Gerd Holmstrom; Shunji Kusaka; Andrés Kychenthal; Domenico Lepore; Birgit Lorenz; Maria Ana Martinez-Castellanos; Şengül Özdek; Dupe Ademola-Popoola; James D Reynolds; Parag K Shah; Michael Shapiro; Andreas Stahl; Cynthia Toth; Anand Vinekar; Linda Visser; David K Wallace; Wei-Chi Wu; Peiquan Zhao; Andrea Zin Journal: Ophthalmology Date: 2021-07-08 Impact factor: 12.079