Learning from electronic health records across multiple sites: A communication-efficient and privacy-preserving distributed algorithm.

Rui Duan, Mary Regina Boland, Zixuan Liu, Yue Liu, Howard H Chang, Hua Xu, Haitao Chu, Christopher H Schmid, Christopher B Forrest, John H Holmes, Martijn J Schuemie, Jesse A Berlin, Jason H Moore, Yong Chen.

Abstract

OBJECTIVES: We propose a one-shot, privacy-preserving distributed algorithm to perform logistic regression (ODAL) across multiple clinical sites.
MATERIALS AND METHODS: ODAL uses the information from the local site (where the patient-level data are accessible) and incorporates the first-order gradients (ODAL1), or the first- and second-order gradients (ODAL2), of the likelihood function from the other sites to construct an estimator without requiring iterative communication across sites or transferring patient-level data. We evaluated ODAL via extensive simulation studies and an application to a dataset from the University of Pennsylvania Health System. We assessed estimation accuracy by comparing ODAL with the estimator based on the pooled individual participant data (ie, the gold standard).
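The abstract does not state the estimator's formula, so the following is only a minimal sketch of a first-order, one-shot estimator in the spirit of ODAL1. It assumes a surrogate-likelihood construction: the local log-likelihood is tilted by the difference between the combined gradient and the local gradient, both evaluated at an initial local estimate. All function names (`fit_logistic`, `odal1`, `simulate_sites`) and the simulated data are hypothetical and are not taken from the paper.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def avg_grad(beta, X, y):
    """Gradient of the average logistic log-likelihood (p numbers)."""
    return X.T @ (y - sigmoid(X @ beta)) / len(y)

def avg_hess(beta, X):
    """Hessian of the average logistic log-likelihood (p x p numbers)."""
    mu = sigmoid(X @ beta)
    w = mu * (1.0 - mu)
    return -(X.T * w) @ X / len(w)

def fit_logistic(X, y, shift=None, n_iter=50, tol=1e-10):
    """Newton's method maximizing avg log-likelihood + shift^T beta.

    With shift=None this is ordinary logistic regression.
    """
    p = X.shape[1]
    shift = np.zeros(p) if shift is None else shift
    beta = np.zeros(p)
    for _ in range(n_iter):
        step = np.linalg.solve(avg_hess(beta, X), avg_grad(beta, X, y) + shift)
        beta = beta - step
        if np.max(np.abs(step)) < tol:
            break
    return beta

def odal1(sites):
    """One-shot first-order estimator (assumed surrogate-likelihood form).

    sites[0] is the local site; every other site contributes only its
    gradient vector evaluated at the local initial estimate.
    """
    X1, y1 = sites[0]
    beta_bar = fit_logistic(X1, y1)                      # local initial estimate
    sizes = [len(y) for _, y in sites]
    grads = [avg_grad(beta_bar, X, y) for X, y in sites]  # p numbers per site
    combined = sum(n * g for n, g in zip(sizes, grads)) / sum(sizes)
    # Tilt the local likelihood so its gradient at beta_bar matches the combined one.
    return fit_logistic(X1, y1, shift=combined - grads[0])

def simulate_sites(n_sites=5, n_per_site=2000, seed=0):
    """Hypothetical homogeneous sites, for illustration only."""
    rng = np.random.default_rng(seed)
    beta_true = np.array([-1.0, 0.5, -0.5])
    sites = []
    for _ in range(n_sites):
        X = np.column_stack([np.ones(n_per_site), rng.normal(size=(n_per_site, 2))])
        y = rng.binomial(1, sigmoid(X @ beta_true))
        sites.append((X, y))
    return sites
```

Calling `odal1(simulate_sites())` returns a coefficient vector that can be compared with `fit_logistic` applied to the stacked data, which plays the role of the pooled gold standard. A second-order variant would additionally collect each site's Hessian at the initial estimate; that extension is not shown here.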
RESULTS: Our simulation studies revealed that the relative estimation bias of ODAL1 compared with the pooled estimates was <3%, and the ratio of standard errors was <1.25 for all scenarios. ODAL2 achieved higher accuracy (with relative bias <0.1% and ratio of standard errors <1.05). In the real data analysis, we investigated the associations of 100 medications with fetal loss during pregnancy. ODAL1 provided estimates with relative bias <10% for 85% of medications, and ODAL2 did so for 99% of medications. For communication cost, ODAL1 requires transferring p numbers from each site to the local site and ODAL2 requires transferring (p×p+p) numbers from each site to the local site, where p is the number of parameters in the regression model.
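The communication cost stated above can be worked through with a small helper. This is an illustrative sketch only: the function name `numbers_per_site` and the example value p = 10 are assumptions, not figures from the study.

```python
def numbers_per_site(p, method):
    """Numbers each non-local site sends to the local site.

    ODAL1: a gradient vector of length p.
    ODAL2: a p x p second-order matrix plus the gradient, p*p + p.
    """
    if method == "ODAL1":
        return p
    if method == "ODAL2":
        return p * p + p
    raise ValueError(f"unknown method: {method}")

# Hypothetical model with p = 10 parameters.
cost_odal1 = numbers_per_site(10, "ODAL1")  # 10
cost_odal2 = numbers_per_site(10, "ODAL2")  # 110
```

In both cases the cost depends only on p and not on the number of patients at a site, which is why a single round of communication suffices.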
CONCLUSIONS: This study demonstrates that ODAL is privacy-preserving and communication-efficient with small bias and high statistical efficiency.
© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Keywords:  distributed algorithm; electronic health record; learning health system; logistic regression

Year: 2020; PMID: 31816040; PMCID: PMC7025371; DOI: 10.1093/jamia/ocz199

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497


