Niranjan Balachandar1, Ken Chang2, Jayashree Kalpathy-Cramer2,3, Daniel L Rubin1. 1. Laboratory of Quantitative Imaging and Artificial Intelligence, Department of Radiology and Biomedical Data Science, Stanford University, Stanford, CA, USA. 2. Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Charlestown, MA, USA. 3. MGH and BWH Center for Clinical Data Science, Massachusetts General Hospital, Boston, MA, USA.
Abstract
OBJECTIVES: Sharing patient data across institutions to train generalizable deep learning models is challenging due to regulatory and technical hurdles. Distributed learning, where model weights are shared instead of patient data, presents an attractive alternative. Cyclical weight transfer (CWT) has recently been demonstrated as an effective distributed learning method for medical imaging with homogeneous data across institutions. In this study, we optimize CWT to overcome performance losses from variability in training sample sizes and label distributions across institutions. MATERIALS AND METHODS: Optimizations included proportional local training iterations, cyclical learning rate, locally weighted minibatch sampling, and cyclically weighted loss. We evaluated our optimizations on simulated distributed diabetic retinopathy detection and chest radiograph classification. RESULTS: Proportional local training iteration mitigated performance losses from sample size variability, achieving 98.6% of the accuracy attained by centrally hosting in the diabetic retinopathy dataset split with highest sample size variance across institutions. Locally weighted minibatch sampling and cyclically weighted loss both mitigated performance losses from label distribution variability, achieving 98.6% and 99.1%, respectively, of the accuracy attained by centrally hosting in the diabetic retinopathy dataset split with highest label distribution variability across institutions. DISCUSSION: Our optimizations to CWT improve its capability of handling data variability across institutions. Compared to CWT without optimizations, CWT with optimizations achieved performance significantly closer to performance from centrally hosting. CONCLUSION: Our work is the first to identify and address challenges of sample size and label distribution variability in simulated distributed deep learning for medical imaging. Future work is needed to address other sources of real-world data variability.
OBJECTIVES: Sharing patient data across institutions to train generalizable deep learning models is challenging due to regulatory and technical hurdles. Distributed learning, where model weights are shared instead of patient data, presents an attractive alternative. Cyclical weight transfer (CWT) has recently been demonstrated as an effective distributed learning method for medical imaging with homogeneous data across institutions. In this study, we optimize CWT to overcome performance losses from variability in training sample sizes and label distributions across institutions. MATERIALS AND METHODS: Optimizations included proportional local training iterations, cyclical learning rate, locally weighted minibatch sampling, and cyclically weighted loss. We evaluated our optimizations on simulated distributed diabetic retinopathy detection and chest radiograph classification. RESULTS: Proportional local training iteration mitigated performance losses from sample size variability, achieving 98.6% of the accuracy attained by centrally hosting in the diabetic retinopathy dataset split with highest sample size variance across institutions. Locally weighted minibatch sampling and cyclically weighted loss both mitigated performance losses from label distribution variability, achieving 98.6% and 99.1%, respectively, of the accuracy attained by centrally hosting in the diabetic retinopathy dataset split with highest label distribution variability across institutions. DISCUSSION: Our optimizations to CWT improve its capability of handling data variability across institutions. Compared to CWT without optimizations, CWT with optimizations achieved performance significantly closer to performance from centrally hosting. CONCLUSION: Our work is the first to identify and address challenges of sample size and label distribution variability in simulated distributed deep learning for medical imaging. Future work is needed to address other sources of real-world data variability.
Authors: Curtis P Langlotz; Bibb Allen; Bradley J Erickson; Jayashree Kalpathy-Cramer; Keith Bigelow; Tessa S Cook; Adam E Flanders; Matthew P Lungren; David S Mendelson; Jeffrey D Rudie; Ge Wang; Krishna Kandarpa Journal: Radiology Date: 2019-04-16 Impact factor: 11.105
Authors: Arnaud Arindra Adiyoso Setio; Francesco Ciompi; Geert Litjens; Paul Gerke; Colin Jacobs; Sarah J van Riel; Mathilde Marie Winkler Wille; Matiullah Naqibullah; Clara I Sanchez; Bram van Ginneken Journal: IEEE Trans Med Imaging Date: 2016-03-01 Impact factor: 10.048
Authors: James M Brown; J Peter Campbell; Andrew Beers; Ken Chang; Susan Ostmo; R V Paul Chan; Jennifer Dy; Deniz Erdogmus; Stratis Ioannidis; Jayashree Kalpathy-Cramer; Michael F Chiang Journal: JAMA Ophthalmol Date: 2018-07-01 Impact factor: 7.389
Authors: John R Zech; Marcus A Badgeley; Manway Liu; Anthony B Costa; Joseph J Titano; Eric Karl Oermann Journal: PLoS Med Date: 2018-11-06 Impact factor: 11.069
Authors: Ken Chang; Andrew L Beers; Harrison X Bai; James M Brown; K Ina Ly; Xuejun Li; Joeky T Senders; Vasileios K Kavouridis; Alessandro Boaro; Chang Su; Wenya Linda Bi; Otto Rapalino; Weihua Liao; Qin Shen; Hao Zhou; Bo Xiao; Yinyan Wang; Paul J Zhang; Marco C Pinho; Patrick Y Wen; Tracy T Batchelor; Jerrold L Boxerman; Omar Arnaout; Bruce R Rosen; Elizabeth R Gerstner; Li Yang; Raymond Y Huang; Jayashree Kalpathy-Cramer Journal: Neuro Oncol Date: 2019-11-04 Impact factor: 12.300
Authors: Ken Chang; Niranjan Balachandar; Carson Lam; Darvin Yi; James Brown; Andrew Beers; Bruce Rosen; Daniel L Rubin; Jayashree Kalpathy-Cramer Journal: J Am Med Inform Assoc Date: 2018-08-01 Impact factor: 7.942
Authors: Joshua R Astley; Alberto M Biancardi; Paul J C Hughes; Helen Marshall; Laurie J Smith; Guilhem J Collier; James A Eaden; Nicholas D Weatherley; Matthew Q Hatton; Jim M Wild; Bilal A Tahir Journal: Sci Rep Date: 2022-06-22 Impact factor: 4.996
Authors: Karthik V Sarma; Stephanie Harmon; Thomas Sanford; Holger R Roth; Ziyue Xu; Jesse Tetreault; Daguang Xu; Mona G Flores; Alex G Raman; Rushikesh Kulkarni; Bradford J Wood; Peter L Choyke; Alan M Priester; Leonard S Marks; Steven S Raman; Dieter Enzmann; Baris Turkbey; William Speier; Corey W Arnold Journal: J Am Med Inform Assoc Date: 2021-06-12 Impact factor: 4.497
Authors: Reihaneh Torkzadehmahani; Reza Nasirigerdeh; David B Blumenthal; Tim Kacprowski; Markus List; Julian Matschinske; Julian Spaeth; Nina Kerstin Wenke; Jan Baumbach Journal: Methods Inf Med Date: 2022-01-21 Impact factor: 1.800