Literature DB >> 31564813

Differentially Private Distributed Online Learning.

Chencheng Li1, Pan Zhou1, Li Xiong2, Qian Wang3, Ting Wang4.   

Abstract

In the big data era, the generation of data presents some new characteristics, including wide distribution, high velocity, high dimensionality, and privacy concern. To address these challenges for big data analytics, we develop a privacy-preserving distributed online learning framework on the data collected from distributed data sources. Specifically, each node (i.e., data source) has the capacity of learning a model from its local dataset, and exchanges intermediate parameters with a random part of their own neighboring (logically connected) nodes. Hence, the topology of the communications in our distributed computing framework is unfixed in practice. As online learning always performs on the sensitive data, we introduce the notion of differential privacy (DP) into our distributed online learning algorithm (DOLA) to protect the data privacy during the learning, which prevents an adversary from inferring any significant sensitive information. Our model is of general value for big data analytics in the distributed setting, because it can provide rigorous and scalable privacy proof and have much less computational complexity when compared to classic schemes, e.g., secure multiparty computation (SMC). To tackle high-dimensional incoming data entries, we study a sparse version of the DOLA with novel DP techniques to save the computing resources and improve the utility. Furthermore, we present two modified private DOLAs to meet the need of practical applications. One is to convert the DOLA to distributed stochastic optimization in an offline setting, the other is to use the mini-batches approach to reduce the amount of the perturbation noise and improve the utility. We conduct experiments on real datasets in a configured distributed platform. Numerical experiment results validate the feasibility of our private DOLAs.

Entities:  

Keywords:  Differential privacy; big data; distributed optimization; mini-batch; online learning; sparse

Year:  2018        PMID: 31564813      PMCID: PMC6764830          DOI: 10.1109/TKDE.2018.2794384

Source DB:  PubMed          Journal:  IEEE Trans Knowl Data Eng        ISSN: 1041-4347            Impact factor:   6.977


  2 in total

1.  A Comprehensive Comparison of Multiparty Secure Additions with Differential Privacy.

Authors:  Slawomir Goryczka; Li Xiong
Journal:  IEEE Trans Dependable Secure Comput       Date:  2015-10-01       Impact factor: 7.329

2.  Differentially Private Empirical Risk Minimization.

Authors:  Kamalika Chaudhuri; Claire Monteleoni; Anand D Sarwate
Journal:  J Mach Learn Res       Date:  2011-03       Impact factor: 3.654

  2 in total
  1 in total

1.  A Correlated Noise-assisted Decentralized Differentially Private Estimation Protocol, and its application to fMRI Source Separation.

Authors:  Hafiz Imtiaz; Jafar Mohammadi; Rogers Silva; Bradley Baker; Sergey M Plis; Anand D Sarwate; Vince D Calhoun
Journal:  IEEE Trans Signal Process       Date:  2021-11-11       Impact factor: 4.875

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.