Literature DB >> 32989459

Generating sequential electronic health records using dual adversarial autoencoder.

Dongha Lee1, Hwanjo Yu1, Xiaoqian Jiang2, Deevakar Rogith2, Meghana Gudala2, Mubeen Tejani2, Qiuchen Zhang3, Li Xiong3.   

Abstract

OBJECTIVE: Recent studies on electronic health records (EHRs) started to learn deep generative models and synthesize a huge amount of realistic records, in order to address significant privacy issues surrounding the EHR. However, most of them only focus on structured records about patients' independent visits, rather than on chronological clinical records. In this article, we aim to learn and synthesize realistic sequences of EHRs based on the generative autoencoder.
MATERIALS AND METHODS: We propose a dual adversarial autoencoder (DAAE), which learns set-valued sequences of medical entities, by combining a recurrent autoencoder with 2 generative adversarial networks (GANs). DAAE improves the mode coverage and quality of generated sequences by adversarially learning both the continuous latent distribution and the discrete data distribution. Using the MIMIC-III (Medical Information Mart for Intensive Care-III) and UT Physicians clinical databases, we evaluated the performances of DAAE in terms of predictive modeling, plausibility, and privacy preservation.
RESULTS: Our generated sequences of EHRs showed the comparable performances to real data for a predictive modeling task, and achieved the best score in plausibility evaluation conducted by medical experts among all baseline models. In addition, differentially private optimization of our model enables to generate synthetic sequences without increasing the privacy leakage of patients' data.
CONCLUSIONS: DAAE can effectively synthesize sequential EHRs by addressing its main challenges: the synthetic records should be realistic enough not to be distinguished from the real records, and they should cover all the training patients to reproduce the performance of specific downstream tasks.
© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  differential privacy; electornic health records (EHRs); generative adversarial networks (GANs); generative autoencoder; sequential data generation

Mesh:

Year:  2020        PMID: 32989459      PMCID: PMC7647348          DOI: 10.1093/jamia/ocaa119

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  13 in total

1.  PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals.

Authors:  A L Goldberger; L A Amaral; L Glass; J M Hausdorff; P C Ivanov; R G Mark; J E Mietus; G B Moody; C K Peng; H E Stanley
Journal:  Circulation       Date:  2000-06-13       Impact factor: 29.690

2.  Synthesizing electronic health records using improved generative adversarial networks.

Authors:  Mrinal Kanti Baowaly; Chia-Ching Lin; Chao-Lin Liu; Kuan-Ta Chen
Journal:  J Am Med Inform Assoc       Date:  2019-03-01       Impact factor: 4.497

3.  SegAN: Adversarial Network with Multi-scale L1 Loss for Medical Image Segmentation.

Authors:  Yuan Xue; Tao Xu; Han Zhang; L Rodney Long; Xiaolei Huang
Journal:  Neuroinformatics       Date:  2018-10

4.  Ensuring electronic medical record simulation through better training, modeling, and evaluation.

Authors:  Ziqi Zhang; Chao Yan; Diego A Mesa; Jimeng Sun; Bradley A Malin
Journal:  J Am Med Inform Assoc       Date:  2020-01-01       Impact factor: 4.497

5.  Doctor AI: Predicting Clinical Events via Recurrent Neural Networks.

Authors:  Edward Choi; Mohammad Taha Bahadori; Andy Schuetz; Walter F Stewart; Jimeng Sun
Journal:  JMLR Workshop Conf Proc       Date:  2016-12-10

6.  Medical Image Synthesis with Context-Aware Generative Adversarial Networks.

Authors:  Dong Nie; Roger Trullo; Jun Lian; Caroline Petitjean; Su Ruan; Qian Wang; Dinggang Shen
Journal:  Med Image Comput Comput Assist Interv       Date:  2017-09-04

7.  A systematic review of re-identification attacks on health data.

Authors:  Khaled El Emam; Elizabeth Jonker; Luk Arbuckle; Bradley Malin
Journal:  PLoS One       Date:  2011-12-02       Impact factor: 3.240

8.  Privacy-Preserving Generative Deep Neural Networks Support Clinical Data Sharing.

Authors:  Brett K Beaulieu-Jones; Zhiwei Steven Wu; Chris Williams; Ran Lee; Sanjeev P Bhavnani; James Brian Byrd; Casey S Greene
Journal:  Circ Cardiovasc Qual Outcomes       Date:  2019-07-09

9.  Evaluating the risk of patient re-identification from adverse drug event reports.

Authors:  Khaled El Emam; Fida K Dankar; Angelica Neisa; Elizabeth Jonker
Journal:  BMC Med Inform Decis Mak       Date:  2013-10-05       Impact factor: 2.796

10.  MIMIC-III, a freely accessible critical care database.

Authors:  Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark
Journal:  Sci Data       Date:  2016-05-24       Impact factor: 6.444

View more
  5 in total

1.  Informatics impact requires effective, scalable tools and standards-based infrastructure.

Authors:  Suzanne Bakken
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

2.  Membership inference attacks against synthetic health data.

Authors:  Ziqi Zhang; Chao Yan; Bradley A Malin
Journal:  J Biomed Inform       Date:  2021-12-14       Impact factor: 6.317

3.  Keeping synthetic patients on track: feedback mechanisms to mitigate performance drift in longitudinal health data simulation.

Authors:  Ziqi Zhang; Chao Yan; Bradley A Malin
Journal:  J Am Med Inform Assoc       Date:  2022-10-07       Impact factor: 7.942

Review 4.  Differential privacy in health research: A scoping review.

Authors:  Joseph Ficek; Wei Wang; Henian Chen; Getachew Dagne; Ellen Daley
Journal:  J Am Med Inform Assoc       Date:  2021-09-18       Impact factor: 7.942

5.  Pretrained transformer framework on pediatric claims data for population specific tasks.

Authors:  Xianlong Zeng; Simon L Linwood; Chang Liu
Journal:  Sci Rep       Date:  2022-03-07       Impact factor: 4.379

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.