Literature DB >> 26705435

Learning from Data with Heterogeneous Noise using SGD.

Shuang Song1, Kamalika Chaudhuri1, Anand D Sarwate2.   

Abstract

We consider learning from data of variable quality that may be obtained from different heterogeneous sources. Addressing learning from heterogenous data in its full generality is a challenging problem. In this paper, we adopt instead a model in which data is observed through heterogeneous noise, where the noise level reflects the quality of the data source. We study how to use stochastic gradient algorithms to learn in this model. Our study is motivated by two concrete examples where this problem arises naturally: learning with local differential privacy based on data from multiple sources with different privacy requirements, and learning from data with labels of variable quality. The main contribution of this paper is to identify how heterogeneous noise impacts performance. We show that given two datasets with heterogeneous noise, the order in which to use them in standard SGD depends on the learning rate. We propose a method for changing the learning rate as a function of the heterogeneity, and prove new regret bounds for our method in two cases of interest. Experiments on real data show that our method performs better than using a single learning rate and using only the less noisy of the two datasets when the noise level is low to moderate.

Entities:  

Year:  2015        PMID: 26705435      PMCID: PMC4687916     

Source DB:  PubMed          Journal:  JMLR Workshop Conf Proc        ISSN: 1938-7288


  1 in total

1.  Differentially Private Empirical Risk Minimization.

Authors:  Kamalika Chaudhuri; Claire Monteleoni; Anand D Sarwate
Journal:  J Mach Learn Res       Date:  2011-03       Impact factor: 3.654

  1 in total
  2 in total

Review 1.  Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis.

Authors:  Davood Karimi; Haoran Dou; Simon K Warfield; Ali Gholipour
Journal:  Med Image Anal       Date:  2020-06-20       Impact factor: 8.545

2.  A Correlated Noise-assisted Decentralized Differentially Private Estimation Protocol, and its application to fMRI Source Separation.

Authors:  Hafiz Imtiaz; Jafar Mohammadi; Rogers Silva; Bradley Baker; Sergey M Plis; Anand D Sarwate; Vince D Calhoun
Journal:  IEEE Trans Signal Process       Date:  2021-11-11       Impact factor: 4.875

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.