| Literature DB >> 26973795 |
Haoran Li1, Xiaoqian Jiang2, Li Xiong1, Jinfei Liu1.
Abstract
Differential privacy has recently become a de facto standard for private statistical data release. Many algorithms have been proposed to generate differentially private histograms or synthetic data. However, most of them focus on "one-time" release of a static dataset and do not adequately address the increasing need of releasing series of dynamic datasets in real time. A straightforward application of existing histogram methods on each snapshot of such dynamic datasets will incur high accumulated error due to the composibility of differential privacy and correlations or overlapping users between the snapshots. In this paper, we address the problem of releasing series of dynamic datasets in real time with differential privacy, using a novel adaptive distance-based sampling approach. Our first method, DSFT, uses a fixed distance threshold and releases a differentially private histogram only when the current snapshot is sufficiently different from the previous one, i.e., with a distance greater than a predefined threshold. Our second method, DSAT, further improves DSFT and uses a dynamic threshold adaptively adjusted by a feedback control mechanism to capture the data dynamics. Extensive experiments on real and synthetic datasets demonstrate that our approach achieves better utility than baseline methods and existing state-of-the-art methods.Entities:
Keywords: Differential privacy; adaptive sampling; dynamic dataset release
Year: 2015 PMID: 26973795 PMCID: PMC4788513 DOI: 10.1145/2806416.2806441
Source DB: PubMed Journal: Proc ACM Int Conf Inf Knowl Manag ISSN: 2155-0751