Literature DB >> 26752802

Digression and Value Concatenation to Enable Privacy-Preserving Regression.

Xiao-Bai Li1, Sumit Sarkar2.   

Abstract

Regression techniques can be used not only for legitimate data analysis, but also to infer private information about individuals. In this paper, we demonstrate that regression trees, a popular data-analysis and data-mining technique, can be used to effectively reveal individuals' sensitive data. This problem, which we call a "regression attack," has not been addressed in the data privacy literature, and existing privacy-preserving techniques are not appropriate in coping with this problem. We propose a new approach to counter regression attacks. To protect against privacy disclosure, our approach introduces a novel measure, called digression, which assesses the sensitive value disclosure risk in the process of building a regression tree model. Specifically, we develop an algorithm that uses the measure for pruning the tree to limit disclosure of sensitive data. We also propose a dynamic value-concatenation method for anonymizing data, which better preserves data utility than a user-defined generalization scheme commonly used in existing approaches. Our approach can be used for anonymizing both numeric and categorical data. An experimental study is conducted using real-world financial, economic and healthcare data. The results of the experiments demonstrate that the proposed approach is very effective in protecting data privacy while preserving data quality for research and analysis.

Entities:  

Keywords:  Privacy; anonymization; data analytics; data mining; regression; regression trees

Year:  2014        PMID: 26752802      PMCID: PMC4703130          DOI: 10.25300/misq/2014/38.3.03

Source DB:  PubMed          Journal:  MIS Q        ISSN: 0276-7783            Impact factor:   7.198


  3 in total

1.  Protecting Privacy When Sharing and Releasing Data with Multiple Records per Person.

Authors:  Hasan B Kartal; Xiao-Bai Li
Journal:  J Assoc Inf Syst       Date:  2020       Impact factor: 5.149

2.  Unveiling consumer's privacy paradox behaviour in an economic exchange.

Authors:  Luvai F Motiwalla; Xiao-Bai Li
Journal:  Int J Bus Inf Syst       Date:  2016

3.  Preserving Patient Privacy When Sharing Same-Disease Data.

Authors:  Xiaoping Liu; Xiao-Bai Li; Luvai Motiwalla; Wenjun Li; Hua Zheng; Patricia D Franklin
Journal:  ACM J Data Inf Qual       Date:  2016-10
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.