
Class Restricted Clustering and Micro-Perturbation for Data Privacy.

Xiao-Bai Li, Sumit Sarkar.

Abstract

The extensive use of information technologies by organizations to collect and share personal data has raised strong privacy concerns. To respond to the public's demand for data privacy, a class of clustering-based data masking techniques is increasingly being used for privacy-preserving data sharing and analytics. Traditional clustering-based approaches for masking numeric attributes, while addressing re-identification risks, typically do not consider the disclosure risk of categorical confidential attributes. We propose a new approach to deal with this problem. The proposed method clusters data such that the data points within a group are similar in the non-confidential attribute values whereas the confidential attribute values within a group are well distributed. To accomplish this, the clustering method, which is based on a minimum spanning tree (MST) technique, uses two risk-utility tradeoff measures in the growing and pruning stages of the MST technique respectively. As part of our approach we also propose a novel cluster-level micro-perturbation method for masking data that overcomes a common problem of traditional clustering-based methods for data masking, which is their inability to preserve important statistical properties such as the variance of attributes and the covariance across attributes. We show that the mean vector and the covariance matrix of the masked data generated using the micro-perturbation method are unbiased estimates of the original mean vector and covariance matrix. An experimental study on several real-world datasets demonstrates the effectiveness of the proposed approach.
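The variance problem mentioned in the abstract can be illustrated with a small, univariate sketch. It is not the authors' micro-perturbation method, whose details the abstract does not give. It only shows, under stated assumptions, why replacing each record with its cluster mean shrinks the variance, and how adding zero-mean noise scaled to each cluster's own spread restores it on average. The grouping rule (sorted chunks), the Gaussian noise, and all function names are hypothetical choices made for illustration.

```python
import random
import statistics


def group_by_sorted_chunks(values, n_groups):
    """Sort the values and cut them into contiguous chunks.

    A stand-in for a real clustering step; the MST-based method
    described in the abstract is NOT implemented here.
    """
    ordered = sorted(values)
    size = len(ordered) // n_groups
    groups = [ordered[i * size:(i + 1) * size] for i in range(n_groups - 1)]
    groups.append(ordered[(n_groups - 1) * size:])  # last chunk takes the remainder
    return groups


def mask_with_centroids(groups):
    """Classical microaggregation: every record becomes its group mean."""
    masked = []
    for g in groups:
        centre = statistics.fmean(g)
        masked.extend([centre] * len(g))
    return masked


def mask_with_noise(groups, rng):
    """Group mean plus zero-mean Gaussian noise with the group's own std. dev.

    Assumption for illustration only: Gaussian noise, one attribute.
    """
    masked = []
    for g in groups:
        centre = statistics.fmean(g)
        spread = statistics.pstdev(g)
        masked.extend(centre + rng.gauss(0.0, spread) for _ in g)
    return masked


rng = random.Random(0)  # fixed seed so the run is repeatable
original = [rng.gauss(50.0, 10.0) for _ in range(3000)]
groups = group_by_sorted_chunks(original, n_groups=4)

centroid_masked = mask_with_centroids(groups)
noise_masked = mask_with_noise(groups, rng)

orig_var = statistics.pvariance(original)
between_var = statistics.pvariance(centroid_masked)  # variance left after microaggregation
within_var = sum(len(g) * statistics.pvariance(g) for g in groups) / len(original)

print("original variance:        ", round(orig_var, 2))
print("centroid-masked variance: ", round(between_var, 2))
print("within-group variance:    ", round(within_var, 2))
print("noise-masked variance:    ", round(statistics.pvariance(noise_masked), 2))
```

The centroid-masked variance equals the original variance minus the within-group variance, which is exactly the part that plain microaggregation discards. The noise-masked variance recovers that part only in expectation, so any single run will differ slightly from the original.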

Keywords:  Privacy; clustering; confidentiality; data perturbation; information theory; microaggregation; minimum spanning tree

Year: 2013; PMID: 24307745; PMCID: PMC3846357; DOI: 10.1287/mnsc.1120.1584

Source DB: PubMed; Journal: Manage Sci; ISSN: 0025-1909; Impact factor: 4.883


