A Distributed Ensemble Approach for Mining Healthcare Data under Privacy Constraints.

Yan Li, Changxin Bai, Chandan K Reddy.

Abstract

In recent years, electronic health records (EHRs) have been widely adopted at many healthcare facilities in an attempt to improve the quality of patient care and increase the productivity and efficiency of healthcare delivery. If utilized appropriately, these EHRs can support accurate disease diagnosis. While EHRs can potentially resolve many of the existing problems associated with disease diagnosis, one of the main obstacles to using them effectively is patient privacy and the sensitivity of the medical information they contain. Because of these concerns, even when EHRs are available for storage and retrieval, sharing patient records between healthcare facilities remains difficult, which has limited some of the benefits of using EHRs. Given this lack of data sharing, most facilities build clinical decision support systems using the limited amount of patient data in their own EHR systems to support important diagnosis-related decisions. For a newly established healthcare facility, building a robust decision-making system is often infeasible because it lacks sufficient patient records. Yet making effective decisions from clinical data requires large amounts of data to train the decision models. The objectives of preserving patient privacy and having sufficient data for modeling and decision making therefore conflict. To reconcile these goals, we develop two adaptive distributed privacy-preserving algorithms based on a distributed ensemble strategy. The basic idea of our approach is to build a model at each participating facility that accurately learns the local data distribution, and then to transfer the healthcare knowledge acquired from that data in the form of the facility's own decision model, without revealing or sharing patient-level sensitive data, thus protecting patient privacy.
We demonstrate that our approach can successfully build accurate and robust prediction models, under privacy constraints, using healthcare data collected from different geographical locations. We evaluate the method on Type-2 diabetes EHRs accumulated from multiple sources across all fifty U.S. states. The evaluation task is diagnosing diabetes when certain regions have an insufficient number of patient records, without revealing the actual patient data from other regions. Using the proposed approach, we also discovered important biomarkers, both universal and region-specific, and validated the selected biomarkers against the biomedical literature.
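
The general idea of sharing decision models instead of patient records can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a plain accuracy-weighted vote over one-feature threshold classifiers (decision stumps), and the synthetic data, site sizes, cutoffs, and all function names are hypothetical. Each data-rich site fits a stump locally and sends only that stump to a data-poor target site, which weights the received models using its own few records.

```python
import math
import random

def predict_stump(stump, x):
    """Return +1 or -1 from a (feature, threshold, sign) stump."""
    feature, threshold, sign = stump
    return sign if x[feature] > threshold else -sign

def train_stump(rows):
    """Fit the stump with the fewest training errors on local (features, label) rows."""
    best_stump, best_errors = None, None
    for feature in range(len(rows[0][0])):
        for x, _ in rows:
            for sign in (1, -1):
                stump = (feature, x[feature], sign)
                errors = sum(predict_stump(stump, xi) != yi for xi, yi in rows)
                if best_errors is None or errors < best_errors:
                    best_stump, best_errors = stump, errors
    return best_stump

def model_weight(stump, rows):
    """Weight a received model by its smoothed accuracy on the target site's records."""
    errors = sum(predict_stump(stump, x) != y for x, y in rows)
    err = (errors + 1) / (len(rows) + 2)  # smoothing keeps err strictly inside (0, 1)
    return 0.5 * math.log((1 - err) / err)

def ensemble_predict(weighted_models, x):
    """Weighted vote over the received models."""
    score = sum(w * predict_stump(m, x) for m, w in weighted_models)
    return 1 if score >= 0 else -1

def accuracy_of(predict, rows):
    return sum(predict(x) == y for x, y in rows) / len(rows)

def make_site(seed, n, cutoff):
    """Hypothetical synthetic records: label is +1 when the first feature exceeds cutoff."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x = (rng.uniform(4.0, 9.0), rng.uniform(0.0, 1.0))
        rows.append((x, 1 if x[0] > cutoff else -1))
    return rows

# Three data-rich sites train locally; only the fitted stumps leave each site.
rich_sites = [make_site(1, 200, 6.4), make_site(2, 200, 6.5), make_site(3, 200, 6.6)]
local_models = [train_stump(rows) for rows in rich_sites]

# The target site has only a handful of records of its own.
target_train = make_site(10, 8, 6.5)
target_test = make_site(11, 200, 6.5)

weighted = [(m, model_weight(m, target_train)) for m in local_models]
accuracy = accuracy_of(lambda x: ensemble_predict(weighted, x), target_test)

# Baseline for comparison: a stump fitted on the target site's few records alone.
local_only = train_stump(target_train)
local_only_accuracy = accuracy_of(lambda x: predict_stump(local_only, x), target_test)

print("ensemble accuracy:", accuracy)
print("local-only accuracy:", local_only_accuracy)
```

In this sketch the only objects that cross site boundaries are the `(feature, threshold, sign)` tuples. It does not assess what such shared parameters might themselves reveal about the underlying records, which a real deployment would have to consider.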

Keywords:  Boosting; Electronic health records; Ensemble learning; Healthcare; Machine learning; Privacy-preserving data mining

Year:  2016        PMID: 26681811      PMCID: PMC4677334          DOI: 10.1016/j.ins.2015.10.011

Source DB:  PubMed          Journal:  Inf Sci (N Y)        ISSN: 0020-0255            Impact factor:   6.795


