Literature DB >> 30825539

Distributed learning from multiple EHR databases: Contextual embedding models for medical events.

Ziyi Li1, Kirk Roberts2, Xiaoqian Jiang3, Qi Long4.   

Abstract

Electronic health record (EHR) data provide promising opportunities to explore personalized treatment regimes and to make clinical predictions. Compared with regular clinical data, EHR data are known for their irregularity and complexity. In addition, analyzing EHR data involves privacy issues and sharing such data is often infeasible among multiple research sites due to regulatory and other hurdles. A recently published work uses contextual embedding models and successfully builds one predictive model for more than seventy common diagnoses. Despite of the high predictive power, the model cannot be generalized to other institutions without sharing data. In this work, a novel method is proposed to learn from multiple databases and build predictive models based on Distributed Noise Contrastive Estimation (Distributed NCE). We use differential privacy to safeguard the intermediary information sharing. The numerical study with a real dataset demonstrates that the proposed method not only can build predictive models in a distributed manner with privacy protection, but also preserve model structure well and achieve comparable prediction accuracy. The proposed methods have been implemented as a stand-alone Python library and the implementation is available on Github (https://github.com/ziyili20/DistributedLearningPredictor) with installation instructions and use-cases.
Copyright © 2019. Published by Elsevier Inc.

Entities:  

Keywords:  Contextual embedding models; Diagnoses prediction; Distributed computing; EHR data

Mesh:

Year:  2019        PMID: 30825539      PMCID: PMC6533615          DOI: 10.1016/j.jbi.2019.103138

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  14 in total

1.  Advancing the science for active surveillance: rationale and design for the Observational Medical Outcomes Partnership.

Authors:  Paul E Stang; Patrick B Ryan; Judith A Racoosin; J Marc Overhage; Abraham G Hartzema; Christian Reich; Emily Welebob; Thomas Scarnecchia; Janet Woodcock
Journal:  Ann Intern Med       Date:  2010-11-02       Impact factor: 25.391

2.  Prediction modeling using EHR data: challenges, strategies, and a comparison of machine learning approaches.

Authors:  Jionglin Wu; Jason Roy; Walter F Stewart
Journal:  Med Care       Date:  2010-06       Impact factor: 2.983

3.  Joint Learning of Representations of Medical Concepts and Words from EHR Data.

Authors:  Tian Bai; Ashis Kumar Chanda; Brian L Egleston; Slobodan Vucetic
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-12-18

4.  Legal issues concerning electronic health information: privacy, quality, and liability.

Authors:  J G Hodge; L O Gostin; P D Jacobson
Journal:  JAMA       Date:  1999-10-20       Impact factor: 56.272

5.  Mapping Patient Trajectories using Longitudinal Extraction and Deep Learning in the MIMIC-III Critical Care Database.

Authors:  Brett K Beaulieu-Jones; Patryk Orzechowski; Jason H Moore
Journal:  Pac Symp Biocomput       Date:  2018

6.  Privacy-Preserving Predictive Modeling: Harmonization of Contextual Embeddings From Different Sources.

Authors:  Yingxiang Huang; Junghye Lee; Shuang Wang; Jimeng Sun; Hongfang Liu; Xiaoqian Jiang
Journal:  JMIR Med Inform       Date:  2018-05-16

7.  Privacy-Preserving Patient Similarity Learning in a Federated Environment: Development and Analysis.

Authors:  Junghye Lee; Jimeng Sun; Fei Wang; Shuang Wang; Chi-Hyuck Jun; Xiaoqian Jiang
Journal:  JMIR Med Inform       Date:  2018-04-13

8.  Next-generation phenotyping of electronic health records.

Authors:  George Hripcsak; David J Albers
Journal:  J Am Med Inform Assoc       Date:  2012-09-06       Impact factor: 4.497

9.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

10.  Learning Low-Dimensional Representations of Medical Concepts.

Authors:  Youngduck Choi; Chill Yi-I Chiu; David Sontag
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2016-07-20
View more
  5 in total

Review 1.  Differential privacy in health research: A scoping review.

Authors:  Joseph Ficek; Wei Wang; Henian Chen; Getachew Dagne; Ellen Daley
Journal:  J Am Med Inform Assoc       Date:  2021-09-18       Impact factor: 7.942

2.  SMART COVID Navigator, a Clinical Decision Support Tool for COVID-19 Treatment: Design and Development Study.

Authors:  Jeremy Warner; Gil Alterovitz; Varun Suraj; Catherine Del Vecchio Fitz; Laura B Kleiman; Suresh K Bhavnani; Chinmay Jani; Surbhi Shah; Rana R McKay
Journal:  J Med Internet Res       Date:  2022-02-18       Impact factor: 5.428

3.  Improving the Performance of Outcome Prediction for Inpatients With Acute Myocardial Infarction Based on Embedding Representation Learned From Electronic Medical Records: Development and Validation Study.

Authors:  Yanqun Huang; Zhimin Zheng; Moxuan Ma; Xin Xin; Honglei Liu; Xiaolu Fei; Lan Wei; Hui Chen
Journal:  J Med Internet Res       Date:  2022-08-03       Impact factor: 7.076

4.  Federated Learning for Healthcare Informatics.

Authors:  Jie Xu; Benjamin S Glicksberg; Chang Su; Peter Walker; Jiang Bian; Fei Wang
Journal:  J Healthc Inform Res       Date:  2020-11-12

5.  Contrastive learning improves critical event prediction in COVID-19 patients.

Authors:  Tingyi Wanyan; Hossein Honarvar; Suraj K Jaladanki; Chengxi Zang; Nidhi Naik; Sulaiman Somani; Jessica K De Freitas; Ishan Paranjpe; Akhil Vaid; Jing Zhang; Riccardo Miotto; Zhangyang Wang; Girish N Nadkarni; Marinka Zitnik; Ariful Azad; Fei Wang; Ying Ding; Benjamin S Glicksberg
Journal:  Patterns (N Y)       Date:  2021-10-25
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.