Literature DB >> 25911674

R-U policy frontiers for health data de-identification.

Weiyi Xia1, Raymond Heatherly2, Xiaofeng Ding3, Jiuyong Li4, Bradley A Malin5.   

Abstract

OBJECTIVE: The Health Insurance Portability and Accountability Act Privacy Rule enables healthcare organizations to share de-identified data via two routes. They can either 1) show re-identification risk is small (e.g., via a formal model, such as k-anonymity) with respect to an anticipated recipient or 2) apply a rule-based policy (i.e., Safe Harbor) that enumerates attributes to be altered (e.g., dates to years). The latter is often invoked because it is interpretable, but it fails to tailor protections to the capabilities of the recipient. The paper shows rule-based policies can be mapped to a utility (U) and re-identification risk (R) space, which can be searched for a collection, or frontier, of policies that systematically trade off between these goals.
METHODS: We extend an algorithm to efficiently compose an R-U frontier using a lattice of policy options. Risk is proportional to the number of patients to which a record corresponds, while utility is proportional to similarity of the original and de-identified distribution. We allow our method to search 20 000 rule-based policies (out of 2(700)) and compare the resulting frontier with k-anonymous solutions and Safe Harbor using the demographics of 10 U.S. states.
RESULTS: The results demonstrate the rule-based frontier 1) consists, on average, of 5000 policies, 2% of which enable better utility with less risk than Safe Harbor and 2) the policies cover a broader spectrum of utility and risk than k-anonymity frontiers.
CONCLUSIONS: R-U frontiers of de-identification policies can be discovered efficiently, allowing healthcare organizations to tailor protections to anticipated needs and trustworthiness of recipients.
© The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  de-identification; optimization; policy; privacy; secondary use

Mesh:

Year:  2015        PMID: 25911674      PMCID: PMC4986667          DOI: 10.1093/jamia/ocv004

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  25 in total

1.  Science and government. An international framework to promote access to data.

Authors:  Peter Arzberger; Peter Schroeder; Anne Beaulieu; Geof Bowker; Kathleen Casey; Leif Laaksonen; David Moorman; Paul Uhlir; Paul Wouters
Journal:  Science       Date:  2004-03-19       Impact factor: 47.728

2.  A globally optimal k-anonymity method for the de-identification of health data.

Authors:  Khaled El Emam; Fida Kamal Dankar; Romeo Issa; Elizabeth Jonker; Daniel Amyot; Elise Cogo; Jean-Pierre Corriveau; Mark Walker; Sadrul Chowdhury; Regis Vaillancourt; Tyson Roffey; Jim Bottomley
Journal:  J Am Med Inform Assoc       Date:  2009-06-30       Impact factor: 4.497

3.  Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network.

Authors:  Katherine M Newton; Peggy L Peissig; Abel Ngo Kho; Suzette J Bielinski; Richard L Berg; Vidhu Choudhary; Melissa Basford; Christopher G Chute; Iftikhar J Kullo; Rongling Li; Jennifer A Pacheco; Luke V Rasmussen; Leslie Spangler; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-03-26       Impact factor: 4.497

4.  The inevitable application of big data to health care.

Authors:  Travis B Murdoch; Allan S Detsky
Journal:  JAMA       Date:  2013-04-03       Impact factor: 56.272

5.  Beyond Safe Harbor: Automatic Discovery of Health Information De-identification Policy Alternatives.

Authors:  Kathleen Benitez; Grigorios Loukides; Bradley Malin
Journal:  IHI       Date:  2010

6.  PARAMO: a PARAllel predictive MOdeling platform for healthcare analytic research using electronic health records.

Authors:  Kenney Ng; Amol Ghoting; Steven R Steinhubl; Walter F Stewart; Bradley Malin; Jimeng Sun
Journal:  J Biomed Inform       Date:  2013-12-25       Impact factor: 6.317

7.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data.

Authors:  Joshua C Denny; Lisa Bastarache; Marylyn D Ritchie; Robert J Carroll; Raquel Zink; Jonathan D Mosley; Julie R Field; Jill M Pulley; Andrea H Ramirez; Erica Bowton; Melissa A Basford; David S Carrell; Peggy L Peissig; Abel N Kho; Jennifer A Pacheco; Luke V Rasmussen; David R Crosslin; Paul K Crane; Jyotishman Pathak; Suzette J Bielinski; Sarah A Pendergrass; Hua Xu; Lucia A Hindorff; Rongling Li; Teri A Manolio; Christopher G Chute; Rex L Chisholm; Eric B Larson; Gail P Jarvik; Murray H Brilliant; Catherine A McCarty; Iftikhar J Kullo; Jonathan L Haines; Dana C Crawford; Daniel R Masys; Dan M Roden
Journal:  Nat Biotechnol       Date:  2013-12       Impact factor: 54.908

8.  Building public trust in uses of Health Insurance Portability and Accountability Act de-identified data.

Authors:  Deven McGraw
Journal:  J Am Med Inform Assoc       Date:  2012-06-26       Impact factor: 4.497

Review 9.  The Electronic Medical Records and Genomics (eMERGE) Network: past, present, and future.

Authors:  Omri Gottesman; Helena Kuivaniemi; Gerard Tromp; W Andrew Faucett; Rongling Li; Teri A Manolio; Saskia C Sanderson; Joseph Kannry; Randi Zinberg; Melissa A Basford; Murray Brilliant; David J Carey; Rex L Chisholm; Christopher G Chute; John J Connolly; David Crosslin; Joshua C Denny; Carlos J Gallego; Jonathan L Haines; Hakon Hakonarson; John Harley; Gail P Jarvik; Isaac Kohane; Iftikhar J Kullo; Eric B Larson; Catherine McCarty; Marylyn D Ritchie; Dan M Roden; Maureen E Smith; Erwin P Böttinger; Marc S Williams
Journal:  Genet Med       Date:  2013-06-06       Impact factor: 8.822

10.  Developing a data infrastructure for a learning health system: the PORTAL network.

Authors:  Elizabeth A McGlynn; Tracy A Lieu; Mary L Durham; Alan Bauck; Reesa Laws; Alan S Go; Jersey Chen; Heather Spencer Feigelson; Douglas A Corley; Deborah Rohm Young; Andrew F Nelson; Arthur J Davidson; Leo S Morales; Michael G Kahn
Journal:  J Am Med Inform Assoc       Date:  2014-05-12       Impact factor: 4.497

View more
  9 in total

1.  The machine giveth and the machine taketh away: a parrot attack on clinical text deidentified with hiding in plain sight.

Authors:  David S Carrell; David J Cronkite; Muqun Rachel Li; Steve Nyemba; Bradley A Malin; John S Aberdeen; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2019-12-01       Impact factor: 4.497

Review 2.  Clinical Data Reuse or Secondary Use: Current Status and Potential Future Progress.

Authors:  S M Meystre; C Lovis; T Bürkle; G Tognola; A Budrionis; C U Lehmann
Journal:  Yearb Med Inform       Date:  2017-09-11

3.  Privacy Policy and Technology in Biomedical Data Science.

Authors:  April Moreno Arellano; Wenrui Dai; Shuang Wang; Xiaoqian Jiang; Lucila Ohno-Machado
Journal:  Annu Rev Biomed Data Sci       Date:  2018-07

4.  Resilience of clinical text de-identified with "hiding in plain sight" to hostile reidentification attacks by human readers.

Authors:  David S Carrell; Bradley A Malin; David J Cronkite; John S Aberdeen; Cheryl Clark; Muqun Rachel Li; Dikshya Bastakoty; Steve Nyemba; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2020-07-01       Impact factor: 4.497

5.  Enabling realistic health data re-identification risk assessment through adversarial modeling.

Authors:  Weiyi Xia; Yongtai Liu; Zhiyu Wan; Yevgeniy Vorobeychik; Murat Kantacioglu; Steve Nyemba; Ellen Wright Clayton; Bradley A Malin
Journal:  J Am Med Inform Assoc       Date:  2021-03-18       Impact factor: 4.497

Review 6.  Use and Understanding of Anonymization and De-Identification in the Biomedical Literature: Scoping Review.

Authors:  Raphaël Chevrier; Vasiliki Foufi; Christophe Gaudet-Blavignac; Arnaud Robert; Christian Lovis
Journal:  J Med Internet Res       Date:  2019-05-31       Impact factor: 5.428

7.  Using game theory to thwart multistage privacy intrusions when sharing data.

Authors:  Zhiyu Wan; Yevgeniy Vorobeychik; Weiyi Xia; Yongtai Liu; Myrna Wooders; Jia Guo; Zhijun Yin; Ellen Wright Clayton; Murat Kantarcioglu; Bradley A Malin
Journal:  Sci Adv       Date:  2021-12-10       Impact factor: 14.136

8.  Efficient and effective pruning strategies for health data de-identification.

Authors:  Fabian Prasser; Florian Kohlmayer; Klaus A Kuhn
Journal:  BMC Med Inform Decis Mak       Date:  2016-04-30       Impact factor: 2.796

9.  A comprehensive tool for creating and evaluating privacy-preserving biomedical prediction models.

Authors:  Johanna Eicher; Raffael Bild; Helmut Spengler; Klaus A Kuhn; Fabian Prasser
Journal:  BMC Med Inform Decis Mak       Date:  2020-02-11       Impact factor: 2.796

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.