Literature DB >> 22563145

Utility-preserving transaction data anonymization with low information loss.

Grigorios Loukides1, Aris Gkoulalas-Divanis.   

Abstract

Transaction data record various information about individuals, including their purchases and diagnoses, and are increasingly published to support large-scale and low-cost studies in domains such as marketing and medicine. However, the dissemination of transaction data may lead to privacy breaches, as it allows an attacker to link an individual's record to their identity. Approaches that anonymize data by eliminating certain values in an individual's record or by replacing them with more general values have been proposed recently, but they often produce data of limited usefulness. This is because these approaches adopt value transformation strategies that do not guarantee data utility in intended applications and objective measures that may lead to excessive data distortion. In this paper, we propose a novel approach for anonymizing data in a way that satisfies data publishers' utility requirements and incurs low information loss. To achieve this, we introduce an accurate information loss measure and an effective anonymization algorithm that explores a large part of the problem space. An extensive experimental study, using click-stream and medical data, demonstrates that our approach permits many times more accurate query answering than the state-of-the-art methods, while it is comparable to them in terms of efficiency.

Entities:  

Year:  2012        PMID: 22563145      PMCID: PMC3340604          DOI: 10.1016/j.eswa.2012.02.179

Source DB:  PubMed          Journal:  Expert Syst Appl        ISSN: 0957-4174            Impact factor:   6.954


  6 in total

1.  Association rule discovery with the train and test approach for heart disease prediction.

Authors:  Carlos Ordonez
Journal:  IEEE Trans Inf Technol Biomed       Date:  2006-04

2.  Development of a large-scale de-identified DNA biobank to enable personalized medicine.

Authors:  D M Roden; J M Pulley; M A Basford; G R Bernard; E W Clayton; J R Balser; D R Masys
Journal:  Clin Pharmacol Ther       Date:  2008-05-21       Impact factor: 6.875

Review 3.  A HapMap harvest of insights into the genetics of common disease.

Authors:  Teri A Manolio; Lisa D Brooks; Francis S Collins
Journal:  J Clin Invest       Date:  2008-05       Impact factor: 14.808

Review 4.  Using electronic health records to drive discovery in disease genomics.

Authors:  Isaac S Kohane
Journal:  Nat Rev Genet       Date:  2011-05-18       Impact factor: 53.242

5.  The disclosure of diagnosis codes can breach research participants' privacy.

Authors:  Grigorios Loukides; Joshua C Denny; Bradley Malin
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

6.  Anonymization of electronic medical records for validating genome-wide association studies.

Authors:  Grigorios Loukides; Aris Gkoulalas-Divanis; Bradley Malin
Journal:  Proc Natl Acad Sci U S A       Date:  2010-04-12       Impact factor: 11.205

  6 in total
  1 in total

1.  A comprehensive review on privacy preserving data mining.

Authors:  Yousra Abdul Alsahib S Aldeen; Mazleena Salleh; Mohammad Abdur Razzaque
Journal:  Springerplus       Date:  2015-11-12
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.