| Literature DB >> 30815160 |
Yichen Jiang1, Chenghong Wang1, Zhixuan Wu1,2, Xin Du1,2, Shuang Wang1.
Abstract
Sharing medical data can benefit many aspects of biomedical research studies. However, medical data usually contains sensitive patient information, which cannot be shared directly. Summary statistics, like histogram, are widely used in medical research which serves as a sanitized synopsis of the raw health dataset such as Electrical Health Records (EHR). Such synopsized representation is then be used to support advanced operations over health dataset such as counting queries and learning based tasks. While privacy becomes an increasingly important issue for generating and publishing health data based histograms. Previous solutions show promise on securely generating histogram via differential privacy, however such methods only consider a centralized solution and the accuracy is still a limitation for real world applications. In this paper, we propose a novel hybrid solution to combine two rigorous theoretical models (homomorphic encryption and differential privacy) for securely generating synthetic V-optimal histograms over distributed datasets. Our results demonstrated accuracy improvement over previous study over real medical datasets.Entities:
Mesh:
Year: 2018 PMID: 30815160 PMCID: PMC6371369
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076