
Differentially Private Empirical Risk Minimization.

Kamalika Chaudhuri, Claire Monteleoni, Anand D Sarwate.

Abstract

Privacy-preserving machine learning algorithms are crucial for the increasingly common setting in which personal data, such as medical or financial records, are analyzed. We provide general techniques to produce privacy-preserving approximations of classifiers learned via (regularized) empirical risk minimization (ERM). These algorithms are private under the ε-differential privacy definition due to Dwork et al. (2006). First we apply the output perturbation ideas of Dwork et al. (2006) to ERM classification. Then we propose a new method, objective perturbation, for privacy-preserving machine learning algorithm design. This method entails perturbing the objective function before optimizing over classifiers. If the loss and regularizer satisfy certain convexity and differentiability criteria, we prove theoretical results showing that our algorithms preserve privacy, and provide generalization bounds for linear and nonlinear kernels. We further present a privacy-preserving technique for tuning the parameters in general machine learning algorithms, thereby providing end-to-end privacy guarantees for the training process. We apply these results to produce privacy-preserving analogues of regularized logistic regression and support vector machines. We obtain encouraging results from evaluating their performance on real demographic and benchmark data sets. Our results show that both theoretically and empirically, objective perturbation is superior to the previous state of the art, output perturbation, in managing the inherent tradeoff between privacy and learning performance.
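The two mechanisms the abstract describes can be sketched concretely. The following is a minimal, illustrative NumPy sketch (not the authors' reference code) of output perturbation and a simplified objective perturbation for regularized logistic regression. It assumes feature vectors are scaled to ‖x‖ ≤ 1 as the paper requires, trains by plain gradient descent, and omits the paper's extra-regularization branch for the case where the adjusted privacy budget becomes non-positive.

```python
import numpy as np

def output_perturbation(X, y, lam, eps, rng, iters=500, lr=0.1):
    """Train regularized logistic regression non-privately, then add noise
    b with density proportional to exp(-eps * n * lam * ||b|| / 2), which
    matches the L2 sensitivity 2/(n*lam) of the regularized ERM minimizer."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(iters):
        margins = y * (X @ w)
        # gradient of (1/n) sum log(1 + exp(-y_i w.x_i)) + (lam/2) ||w||^2
        grad = -(X * (y / (1.0 + np.exp(margins)))[:, None]).mean(axis=0) + lam * w
        w -= lr * grad
    noise_dir = rng.normal(size=d)
    noise_dir /= np.linalg.norm(noise_dir)
    noise_mag = rng.gamma(shape=d, scale=2.0 / (n * lam * eps))
    return w + noise_mag * noise_dir

def objective_perturbation(X, y, lam, eps, rng, iters=500, lr=0.1):
    """Perturb the objective with a random linear term b.w / n, then
    optimize. Simplified variant: requires the adjusted budget eps' to stay
    positive (large enough n*lam); logistic loss has |loss''| <= c = 1/4."""
    n, d = X.shape
    c = 0.25
    eps_adj = eps - np.log(1.0 + 2.0 * c / (n * lam) + (c / (n * lam)) ** 2)
    if eps_adj <= 0:
        raise ValueError("n * lam too small for this simplified variant")
    noise_dir = rng.normal(size=d)
    noise_dir /= np.linalg.norm(noise_dir)
    b = rng.gamma(shape=d, scale=2.0 / eps_adj) * noise_dir
    w = np.zeros(d)
    for _ in range(iters):
        margins = y * (X @ w)
        grad = (-(X * (y / (1.0 + np.exp(margins)))[:, None]).mean(axis=0)
                + lam * w + b / n)
        w -= lr * grad
    return w
```

Drawing the noise as a uniform direction times a Gamma(d, scale) magnitude samples from a density proportional to exp(−‖b‖/scale) on R^d, which is how both mechanisms calibrate the noise to the privacy budget ε.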


Year:  2011        PMID: 21892342      PMCID: PMC3164588     

Source DB:  PubMed          Journal:  J Mach Learn Res        ISSN: 1532-4435            Impact factor:   3.654


References:  3 in total

Review 1.  Weaving technology and policy together to maintain confidentiality.

Authors:  L Sweeney
Journal:  J Law Med Ethics       Date:  1997 Summer-Fall       Impact factor: 1.718

2.  Training a support vector machine in the primal.

Authors:  Olivier Chapelle
Journal:  Neural Comput       Date:  2007-05       Impact factor: 2.026

3.  Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays.

Authors:  Nils Homer; Szabolcs Szelinger; Margot Redman; David Duggan; Waibhav Tembe; Jill Muehling; John V Pearson; Dietrich A Stephan; Stanley F Nelson; David W Craig
Journal:  PLoS Genet       Date:  2008-08-29       Impact factor: 5.917

Cited by:  31 in total

1.  iDASH: integrating data for analysis, anonymization, and sharing.

Authors:  Lucila Ohno-Machado; Vineet Bafna; Aziz A Boxwala; Brian E Chapman; Wendy W Chapman; Kamalika Chaudhuri; Michele E Day; Claudiu Farcas; Nathaniel D Heintzman; Xiaoqian Jiang; Hyeoneui Kim; Jihoon Kim; Michael E Matheny; Frederic S Resnic; Staal A Vinterbo
Journal:  J Am Med Inform Assoc       Date:  2011-11-10       Impact factor: 4.497

2.  Privacy-Preserving Methods for Vertically Partitioned Incomplete Data.

Authors:  Yi Deng; Xiaoqian Jiang; Qi Long
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

Review 3.  Deep learning for healthcare: review, opportunities and challenges.

Authors:  Riccardo Miotto; Fei Wang; Shuang Wang; Xiaoqian Jiang; Joel T Dudley
Journal:  Brief Bioinform       Date:  2018-11-27       Impact factor: 11.622

4.  Toward practicing privacy.

Authors:  Cynthia Dwork; Rebecca Pottenger
Journal:  J Am Med Inform Assoc       Date:  2013-01-01       Impact factor: 4.497

5.  Scalable privacy-preserving data sharing methodology for genome-wide association studies.

Authors:  Fei Yu; Stephen E Fienberg; Aleksandra B Slavković; Caroline Uhler
Journal:  J Biomed Inform       Date:  2014-02-06       Impact factor: 6.317

6.  Differential privacy based on importance weighting.

Authors:  Zhanglong Ji; Charles Elkan
Journal:  Mach Learn       Date:  2013-10       Impact factor: 2.940

Review 7.  A Comprehensive Survey on Local Differential Privacy toward Data Statistics and Analysis.

Authors:  Teng Wang; Xuefeng Zhang; Jingyu Feng; Xinyu Yang
Journal:  Sensors (Basel)       Date:  2020-12-08       Impact factor: 3.576

8.  Partitioning-based mechanisms under personalized differential privacy.

Authors:  Haoran Li; Li Xiong; Zhanglong Ji; Xiaoqian Jiang
Journal:  Adv Knowl Discov Data Min (2017)       Date:  2017-04-23

9.  Differentially Private Distributed Online Learning.

Authors:  Chencheng Li; Pan Zhou; Li Xiong; Qian Wang; Ting Wang
Journal:  IEEE Trans Knowl Data Eng       Date:  2018-01-17       Impact factor: 6.977

10.  Federated Tensor Factorization for Computational Phenotyping.

Authors:  Yejin Kim; Jimeng Sun; Hwanjo Yu; Xiaoqian Jiang
Journal:  KDD       Date:  2017-08
