Federated learning of predictive models from federated Electronic Health Records.

Theodora S Brisimi, Ruidi Chen, Theofanie Mela, Alex Olshevsky, Ioannis Ch Paschalidis, Wei Shi.

Abstract

BACKGROUND: In an era of "big data," computationally efficient and privacy-aware solutions for large-scale machine learning problems are crucial, especially in healthcare, where large amounts of data are stored in different locations and owned by different entities. Past research has focused on centralized algorithms, which assume a central data repository (database) that stores and can process the data from all participants. Such an architecture, however, can be impractical when data are not centrally located, does not scale well to very large datasets, and introduces single-point-of-failure risks that could compromise the integrity and privacy of the data. Given the large volumes of data spread across hospitals and individuals, a decentralized, computationally scalable methodology is much needed.
OBJECTIVE: We aim to solve a binary supervised classification problem, predicting hospitalizations for cardiac events, using a distributed algorithm. We seek to develop a general decentralized optimization framework that enables multiple data holders to collaborate and converge to a common predictive model without explicitly exchanging raw data.
METHODS: We focus on the soft-margin l1-regularized sparse Support Vector Machine (sSVM) classifier. We develop an iterative cluster Primal Dual Splitting (cPDS) algorithm for solving the large-scale sSVM problem in a decentralized fashion. Such a distributed learning scheme is relevant for multi-institutional collaborations or peer-to-peer applications, allowing the data holders to collaborate, while keeping every participant's data private.
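The abstract does not spell out the cPDS updates, so the following is only a minimal sketch of the setting it describes: an l1-regularized soft-margin SVM objective, minimized by several data holders that share model parameters but never raw records. It uses plain subgradient steps followed by full model averaging, which is a simpler stand-in and not the authors' cPDS algorithm. All function names, the step-size schedule, and the parameters `lam`, `steps`, and `lr` are illustrative assumptions.

```python
import numpy as np

def ssvm_objective(w, b, X, y, lam):
    """Mean hinge loss plus an l1 penalty on the weights (labels in {-1, +1})."""
    margins = y * (X @ w + b)
    hinge = np.maximum(0.0, 1.0 - margins)
    return hinge.mean() + lam * np.abs(w).sum()

def local_subgradient(w, b, X, y, lam):
    """Subgradient of the objective above, computed from one holder's data only."""
    margins = y * (X @ w + b)
    active = margins < 1.0  # samples that violate the margin
    n = len(y)
    grad_w = -(X[active] * y[active, None]).sum(axis=0) / n + lam * np.sign(w)
    grad_b = -y[active].sum() / n
    return grad_w, grad_b

def decentralized_fit(datasets, lam=0.01, steps=200, lr=0.1):
    """Illustrative stand-in for a decentralized solver (NOT cPDS).

    Each holder takes a local subgradient step, then all holders average
    their parameters. Only (w, b) is exchanged; (X, y) stays local.
    """
    d = datasets[0][0].shape[1]
    params = [(np.zeros(d), 0.0) for _ in datasets]
    for t in range(steps):
        step = lr / np.sqrt(t + 1)  # diminishing step size (assumed schedule)
        updated = []
        for (w, b), (X, y) in zip(params, datasets):
            gw, gb = local_subgradient(w, b, X, y, lam)
            updated.append((w - step * gw, b - step * gb))
        # Consensus step: every holder adopts the average model.
        w_avg = np.mean([w for w, _ in updated], axis=0)
        b_avg = float(np.mean([b for _, b in updated]))
        params = [(w_avg.copy(), b_avg) for _ in datasets]
    return params[0]
```

Averaging across all holders corresponds to a fully connected communication graph. A peer-to-peer variant would average only with neighbors, at the cost of slower agreement on a common model.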
RESULTS: We test cPDS on the problem of predicting hospitalizations due to heart diseases within a calendar year, based on information in the patients' Electronic Health Records prior to that year. cPDS converges faster than centralized methods at the cost of some communication between agents. It also converges faster and with less communication overhead than an alternative distributed algorithm. In both cases, it achieves similar prediction accuracy, measured by the Area Under the Receiver Operating Characteristic Curve (AUC) of the classifier. We extract important features discovered by the algorithm that are predictive of future hospitalizations, thus providing a way to interpret the classification results and inform prevention efforts.
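As a reminder of what the AUC metric measures, here is a small sketch that computes it directly from its rank interpretation: the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one, with ties counted as one half. The function name `auc_score` is an illustrative assumption and this is not the authors' evaluation code.

```python
import numpy as np

def auc_score(y_true, scores):
    """AUC via pairwise comparison of positive and negative scores.

    y_true: labels, where 1 marks the positive class (e.g. hospitalized).
    scores: classifier outputs; higher means more likely positive.
    """
    y_true = np.asarray(y_true)
    scores = np.asarray(scores, dtype=float)
    pos = scores[y_true == 1]
    neg = scores[y_true != 1]
    if len(pos) == 0 or len(neg) == 0:
        raise ValueError("AUC needs at least one positive and one negative example")
    diff = pos[:, None] - neg[None, :]  # every positive-negative score pair
    wins = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return float(wins / (len(pos) * len(neg)))
```

The pairwise form uses memory proportional to the number of positive-negative pairs, so it suits small illustrations. A sort-based computation is preferable for datasets the size of an EHR cohort.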
Copyright © 2018 Elsevier B.V. All rights reserved.

Keywords:  Distributed learning; Electronic Health Records (EHRs); Federated databases; Heart diseases; Hospitalization; Predictive models

Year:  2018        PMID: 29500022      PMCID: PMC5836813          DOI: 10.1016/j.ijmedinf.2018.01.007

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046

