Ana Borovac1,2, Steinn Gudmundsson1, Gardar Thorvardsson2, Saeed M Moghadam3, Paivi Nevalainen3,4, Nathan Stevenson5, Sampsa Vanhatalo3, Thomas P Runarsson1. 1. Faculty of Industrial Engineering, Mechanical Engineering and Computer ScienceUniversity of Iceland 107 Reykjavik Iceland. 2. Kvikna Medical ehf. 110 Reykjavik Iceland. 3. BABA Center, Pediatric Research CenterDepartment of PhysiologyUniversity of Helsinki 00014 Helsinki Finland. 4. HUS Diagnostic CenterEpilepsia Helsinki and Department of Clinical NeurophysiologyNew Children's Hospital, Helsinki University Hospital 00029 Helsinki Finland. 5. Brain Modelling GroupQIMR Berghofer Medical Research Institute Herston QLD 4006 Australia.
Abstract
OBJECTIVE: Sharing medical data between institutions is difficult in practice due to data protection laws and official procedures within institutions. Therefore, most existing algorithms are trained on relatively small electroencephalogram (EEG) data sets which is likely to be detrimental to prediction accuracy. In this work, we simulate a case when the data can not be shared by splitting the publicly available data set into disjoint sets representing data in individual institutions. METHODS AND PROCEDURES: We propose to train a (local) detector in each institution and aggregate their individual predictions into one final prediction. Four aggregation schemes are compared, namely, the majority vote, the mean, the weighted mean and the Dawid-Skene method. The method was validated on an independent data set using only a subset of EEG channels. RESULTS: The ensemble reaches accuracy comparable to a single detector trained on all the data when sufficient amount of data is available in each institution. CONCLUSION: The weighted mean aggregation scheme showed best performance, it was only marginally outperformed by the Dawid-Skene method when local detectors approach performance of a single detector trained on all available data. CLINICAL IMPACT: Ensemble learning allows training of reliable algorithms for neonatal EEG analysis without a need to share the potentially sensitive EEG data between institutions.
OBJECTIVE: Sharing medical data between institutions is difficult in practice due to data protection laws and official procedures within institutions. Therefore, most existing algorithms are trained on relatively small electroencephalogram (EEG) data sets which is likely to be detrimental to prediction accuracy. In this work, we simulate a case when the data can not be shared by splitting the publicly available data set into disjoint sets representing data in individual institutions. METHODS AND PROCEDURES: We propose to train a (local) detector in each institution and aggregate their individual predictions into one final prediction. Four aggregation schemes are compared, namely, the majority vote, the mean, the weighted mean and the Dawid-Skene method. The method was validated on an independent data set using only a subset of EEG channels. RESULTS: The ensemble reaches accuracy comparable to a single detector trained on all the data when sufficient amount of data is available in each institution. CONCLUSION: The weighted mean aggregation scheme showed best performance, it was only marginally outperformed by the Dawid-Skene method when local detectors approach performance of a single detector trained on all available data. CLINICAL IMPACT: Ensemble learning allows training of reliable algorithms for neonatal EEG analysis without a need to share the potentially sensitive EEG data between institutions.
Authors: A H Ansari; P J Cherian; A Dereymaeker; V Matic; K Jansen; L De Wispelaere; C Dielman; J Vervisch; R M Swarte; P Govaert; G Naulaers; M De Vos; S Van Huffel Journal: Clin Neurophysiol Date: 2016-06-27 Impact factor: 3.708
Authors: Dmitry Yu Isaev; Dmitry Tchapyjnikov; C Michael Cotten; David Tanaka; Natalia Martinez; Martin Bertran; Guillermo Sapiro; David Carlson Journal: Proc Mach Learn Res Date: 2020-08
Authors: Aileen Malone; C Anthony Ryan; Anthony Fitzgerald; Louise Burgoyne; Sean Connolly; Geraldine B Boylan Journal: Epilepsia Date: 2009-06-01 Impact factor: 5.864
Authors: Ken Chang; Niranjan Balachandar; Carson Lam; Darvin Yi; James Brown; Andrew Beers; Bruce Rosen; Daniel L Rubin; Jayashree Kalpathy-Cramer Journal: J Am Med Inform Assoc Date: 2018-08-01 Impact factor: 7.942