Literature DB >> 29854212

An Open Source Tool for Game Theoretic Health Data De-Identification.

Fabian Prasser1, James Gaupp2, Zhiyu Wan2, Weiyi Xia2, Yevgeniy Vorobeychik2, Murat Kantarcioglu3, Klaus Kuhn1, Brad Malin2.   

Abstract

Biomedical data continues to grow in quantity and quality, creating new opportunities for research and data-driven applications. To realize these activities at scale, data must be shared beyond its initial point of collection. To maintain privacy, healthcare organizations often de-identify data, but they assume worst-case adversaries, inducing high levels of data corruption. Recently, game theory has been proposed to account for the incentives of data publishers and recipients (who attempt to re-identify patients), but this perspective has been more hypothetical than practical. In this paper, we report on a new game theoretic data publication strategy and its integration into the open source software ARX. We evaluate our implementation with an analysis on the relationship between data transformation, utility, and efficiency for over 30,000 demographic records drawn from the U.S. Census Bureau. The results indicate that our implementation is scalable and can be combined with various data privacy risk and quality measures.

Entities:  

Mesh:

Year:  2018        PMID: 29854212      PMCID: PMC5977602     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  11 in total

1.  Protecting privacy using k-anonymity.

Authors:  Khaled El Emam; Fida Kamal Dankar
Journal:  J Am Med Inform Assoc       Date:  2008-06-25       Impact factor: 4.497

2.  Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach.

Authors:  Zhiyu Wan; Yevgeniy Vorobeychik; Weiyi Xia; Ellen Wright Clayton; Murat Kantarcioglu; Bradley Malin
Journal:  Am J Hum Genet       Date:  2017-01-05       Impact factor: 11.025

3.  Learning from big health care data.

Authors:  Sebastian Schneeweiss
Journal:  N Engl J Med       Date:  2014-06-05       Impact factor: 91.245

Review 4.  Mining electronic health records: towards better research applications and clinical care.

Authors:  Peter B Jensen; Lars J Jensen; Søren Brunak
Journal:  Nat Rev Genet       Date:  2012-05-02       Impact factor: 53.242

5.  Technical and policy approaches to balancing patient privacy and data sharing in clinical and translational research.

Authors:  Bradley Malin; David Karp; Richard H Scheuermann
Journal:  J Investig Med       Date:  2010-01       Impact factor: 2.895

6.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data.

Authors:  Joshua C Denny; Lisa Bastarache; Marylyn D Ritchie; Robert J Carroll; Raquel Zink; Jonathan D Mosley; Julie R Field; Jill M Pulley; Andrea H Ramirez; Erica Bowton; Melissa A Basford; David S Carrell; Peggy L Peissig; Abel N Kho; Jennifer A Pacheco; Luke V Rasmussen; David R Crosslin; Paul K Crane; Jyotishman Pathak; Suzette J Bielinski; Sarah A Pendergrass; Hua Xu; Lucia A Hindorff; Rongling Li; Teri A Manolio; Christopher G Chute; Rex L Chisholm; Eric B Larson; Gail P Jarvik; Murray H Brilliant; Catherine A McCarty; Iftikhar J Kullo; Jonathan L Haines; Dana C Crawford; Daniel R Masys; Dan M Roden
Journal:  Nat Biotechnol       Date:  2013-12       Impact factor: 54.908

7.  A systematic review of re-identification attacks on health data.

Authors:  Khaled El Emam; Elizabeth Jonker; Luk Arbuckle; Bradley Malin
Journal:  PLoS One       Date:  2011-12-02       Impact factor: 3.240

8.  A game theoretic framework for analyzing re-identification risk.

Authors:  Zhiyu Wan; Yevgeniy Vorobeychik; Weiyi Xia; Ellen Wright Clayton; Murat Kantarcioglu; Ranjit Ganta; Raymond Heatherly; Bradley A Malin
Journal:  PLoS One       Date:  2015-03-25       Impact factor: 3.240

9.  Translational bioinformatics in the era of real-time biomedical, health care and wellness data streams.

Authors:  Khader Shameer; Marcus A Badgeley; Riccardo Miotto; Benjamin S Glicksberg; Joseph W Morgan; Joel T Dudley
Journal:  Brief Bioinform       Date:  2016-02-14       Impact factor: 11.622

10.  Efficient and effective pruning strategies for health data de-identification.

Authors:  Fabian Prasser; Florian Kohlmayer; Klaus A Kuhn
Journal:  BMC Med Inform Decis Mak       Date:  2016-04-30       Impact factor: 2.796

View more
  3 in total

1.  A scalable software solution for anonymizing high-dimensional biomedical data.

Authors:  Thierry Meurers; Raffael Bild; Kieu-Mi Do; Fabian Prasser
Journal:  Gigascience       Date:  2021-10-04       Impact factor: 6.524

2.  Using game theory to thwart multistage privacy intrusions when sharing data.

Authors:  Zhiyu Wan; Yevgeniy Vorobeychik; Weiyi Xia; Yongtai Liu; Myrna Wooders; Jia Guo; Zhijun Yin; Ellen Wright Clayton; Murat Kantarcioglu; Bradley A Malin
Journal:  Sci Adv       Date:  2021-12-10       Impact factor: 14.136

3.  A comprehensive tool for creating and evaluating privacy-preserving biomedical prediction models.

Authors:  Johanna Eicher; Raffael Bild; Helmut Spengler; Klaus A Kuhn; Fabian Prasser
Journal:  BMC Med Inform Decis Mak       Date:  2020-02-11       Impact factor: 2.796

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.