Literature DB >> 25405241

Differentially Private Synthesization of Multi-Dimensional Data using Copula Functions.

Haoran Li1, Li Xiong2, Xiaoqian Jiang3.   

Abstract

Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Most of the existing techniques that generate differentially private histograms or synthetic data only work well for single dimensional or low-dimensional histograms. They become problematic for high dimensional and large domain data due to increased perturbation error and computation complexity. In this paper, we propose DPCopula, a differentially private data synthesization technique using Copula functions for multi-dimensional data. The core of our method is to compute a differentially private copula function from which we can sample synthetic data. Copula functions are used to describe the dependence between multivariate random vectors and allow us to build the multivariate joint distribution using one-dimensional marginal distributions. We present two methods for estimating the parameters of the copula functions with differential privacy: maximum likelihood estimation and Kendall's τ estimation. We present formal proofs for the privacy guarantee as well as the convergence property of our methods. Extensive experiments using both real datasets and synthetic datasets demonstrate that DPCopula generates highly accurate synthetic multi-dimensional data with significantly better utility than state-of-the-art techniques.

Entities:  

Year:  2014        PMID: 25405241      PMCID: PMC4232968          DOI: 10.5441/002/edbt.2014.43

Source DB:  PubMed          Journal:  Adv Database Technol


  7 in total

1.  Big Data Privacy in Biomedical Research.

Authors:  Shuang Wang; Luca Bonomi; Wenrui Dai; Feng Chen; Cynthia Cheung; Cinnamon S Bloss; Samuel Cheng; Xiaoqian Jiang
Journal:  IEEE Trans Big Data       Date:  2016-09-13

2.  Partitioning-based mechanisms under personalized differential privacy.

Authors:  Haoran Li; Li Xiong; Zhanglong Ji; Xiaoqian Jiang
Journal:  Adv Knowl Discov Data Min (2017)       Date:  2017-04-23

3.  Model-Protected Multi-Task Learning.

Authors:  Jian Liang; Ziqi Liu; Jiayu Zhou; Xiaoqian Jiang; Changshui Zhang; Fei Wang
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2022-01-07       Impact factor: 6.226

4.  Differentially Private Histogram Publication For Dynamic Datasets: An Adaptive Sampling Approach.

Authors:  Haoran Li; Xiaoqian Jiang; Li Xiong; Jinfei Liu
Journal:  Proc ACM Int Conf Inf Knowl Manag       Date:  2015-10

5.  Quantifying Differential Privacy under Temporal Correlations.

Authors:  Yang Cao; Masatoshi Yoshikawa; Yonghui Xiao; Li Xiong
Journal:  Proc Int Conf Data Eng       Date:  2017-05-18

6.  DPSynthesizer: Differentially Private Data Synthesizer for Privacy Preserving Data Sharing.

Authors:  Haoran Li; Li Xiong; Lifan Zhang; Xiaoqian Jiang
Journal:  Proceedings VLDB Endowment       Date:  2014-08

7.  Privacy preserving RBF kernel support vector machine.

Authors:  Haoran Li; Li Xiong; Lucila Ohno-Machado; Xiaoqian Jiang
Journal:  Biomed Res Int       Date:  2014-06-12       Impact factor: 3.411

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.