Protecting privacy using k-anonymity.

Khaled El Emam, Fida Kamal Dankar.

Abstract

OBJECTIVE: There is increasing pressure to share health information and even make it publicly available. However, such disclosures of personal health information raise serious privacy concerns. To alleviate such concerns, it is possible to anonymize the data before disclosure. One popular anonymization approach is k-anonymity. There have been no evaluations of the actual re-identification probability of k-anonymized data sets.
DESIGN: Through a simulation, we evaluated the re-identification risk of k-anonymization and three different improvements on three large data sets.
MEASUREMENT: Re-identification probability is measured under two different re-identification scenarios. Information loss is measured by the commonly used discernability metric.
RESULTS: For one of the re-identification scenarios, k-anonymity consistently over-anonymizes data sets, with this over-anonymization being most pronounced with small sampling fractions. Over-anonymization results in excessive distortions to the data (i.e., high information loss), making the data less useful for subsequent analysis. We found that a hypothesis testing approach provided the best control over re-identification risk and reduced the extent of information loss compared to baseline k-anonymity.
CONCLUSION: Guidelines are provided on when to use the hypothesis testing approach instead of baseline k-anonymity.
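
The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' simulation code. It assumes the usual definitions: a data set is k-anonymous when every combination of quasi-identifier values is shared by at least k records, and the discernability metric, ignoring suppressed records, is the sum of squared equivalence class sizes. The risk shown is the simple worst case 1/(smallest class size) for an attacker who knows the target is in the released data, which may not match either of the two scenarios evaluated in the paper. The records, field names, and function names are hypothetical.

```python
from collections import Counter


def equivalence_classes(records, quasi_identifiers):
    """Count how many records share each combination of quasi-identifier values."""
    return Counter(tuple(r[q] for q in quasi_identifiers) for r in records)


def is_k_anonymous(records, quasi_identifiers, k):
    """True if every equivalence class contains at least k records."""
    classes = equivalence_classes(records, quasi_identifiers)
    return all(size >= k for size in classes.values())


def discernability(records, quasi_identifiers):
    """Sum of squared class sizes (assumes no records were suppressed)."""
    classes = equivalence_classes(records, quasi_identifiers)
    return sum(size * size for size in classes.values())


def max_reidentification_risk(records, quasi_identifiers):
    """Worst-case risk 1 / (smallest class size), assuming the attacker
    knows the target is in the released data set."""
    classes = equivalence_classes(records, quasi_identifiers)
    return 1.0 / min(classes.values())


# Hypothetical, already-generalized records (age in 10-year bands).
records = [
    {"age": "30-39", "sex": "F", "diagnosis": "asthma"},
    {"age": "30-39", "sex": "F", "diagnosis": "diabetes"},
    {"age": "40-49", "sex": "M", "diagnosis": "asthma"},
    {"age": "40-49", "sex": "M", "diagnosis": "flu"},
    {"age": "40-49", "sex": "M", "diagnosis": "asthma"},
]
qi = ["age", "sex"]

print(is_k_anonymous(records, qi, 2))          # True
print(is_k_anonymous(records, qi, 3))          # False
print(discernability(records, qi))             # 13
print(max_reidentification_risk(records, qi))  # 0.5
```

Coarser generalization merges classes, which lowers the worst-case risk but raises the discernability value; this is the trade-off between risk control and information loss that the abstract describes.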

Year:  2008        PMID: 18579830      PMCID: PMC2528029          DOI: 10.1197/jamia.M2716

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  57 in total

1.  Attribute Utility Motivated k-anonymization of datasets to support the heterogeneous needs of biomedical researchers.

Authors:  Huimin Ye; Elizabeth S Chen
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

2.  Ethics and privacy issues of a practice-based surveillance system: need for a national-level institutional research ethics board and consent standards.

Authors:  Jyoti A Kotecha; Donna Manca; Anita Lambert-Lanning; Karim Keshavjee; Neil Drummond; Marshall Godwin; Michelle Greiver; Wayne Putnam; Marie-Thérèse Lussier; Richard Birtwhistle
Journal:  Can Fam Physician       Date:  2011-10       Impact factor: 3.275

3.  Never too old for anonymity: a statistical standard for demographic data sharing via the HIPAA Privacy Rule.

Authors:  Bradley Malin; Kathleen Benitez; Daniel Masys
Journal:  J Am Med Inform Assoc       Date:  2011 Jan-Feb       Impact factor: 4.497

4.  Anonymization of longitudinal electronic medical records.

Authors:  Acar Tamersoy; Grigorios Loukides; Mehmet Ercan Nergiz; Yucel Saygin; Bradley Malin
Journal:  IEEE Trans Inf Technol Biomed       Date:  2012-01-27

5.  A globally optimal k-anonymity method for the de-identification of health data.

Authors:  Khaled El Emam; Fida Kamal Dankar; Romeo Issa; Elizabeth Jonker; Daniel Amyot; Elise Cogo; Jean-Pierre Corriveau; Mark Walker; Sadrul Chowdhury; Regis Vaillancourt; Tyson Roffey; Jim Bottomley
Journal:  J Am Med Inform Assoc       Date:  2009-06-30       Impact factor: 4.497

6.  R-U policy frontiers for health data de-identification.

Authors:  Weiyi Xia; Raymond Heatherly; Xiaofeng Ding; Jiuyong Li; Bradley A Malin
Journal:  J Am Med Inform Assoc       Date:  2015-04-24       Impact factor: 4.497

7.  ARX--A Comprehensive Tool for Anonymizing Biomedical Data.

Authors:  Fabian Prasser; Florian Kohlmayer; Ronald Lautenschläger; Klaus A Kuhn
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

8.  An Open Source Tool for Game Theoretic Health Data De-Identification.

Authors:  Fabian Prasser; James Gaupp; Zhiyu Wan; Weiyi Xia; Yevgeniy Vorobeychik; Murat Kantarcioglu; Klaus Kuhn; Brad Malin
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

Review 9.  Routes for breaching and protecting genetic privacy.

Authors:  Yaniv Erlich; Arvind Narayanan
Journal:  Nat Rev Genet       Date:  2014-05-08       Impact factor: 53.242

Review 10.  Managing protected health information in distributed research network environments: automated review to facilitate collaboration.

Authors:  Christine E Bredfeldt; Amy Butani; Sandhyasree Padmanabhan; Paul Hitz; Roy Pardee
Journal:  BMC Med Inform Decis Mak       Date:  2013-03-22       Impact factor: 2.796
