
Expanding Access to Large-Scale Genomic Data While Promoting Privacy: A Game Theoretic Approach.

Zhiyu Wan, Yevgeniy Vorobeychik, Weiyi Xia, Ellen Wright Clayton, Murat Kantarcioglu, Bradley Malin.

Abstract

Emerging scientific endeavors are creating big data repositories of data from millions of individuals. Sharing data in a privacy-respecting manner could lead to important discoveries, but high-profile demonstrations show that links between de-identified genomic data and named persons can sometimes be reestablished. Such re-identification attacks have focused on worst-case scenarios and spurred the adoption of data-sharing practices that unnecessarily impede research. To mitigate concerns, organizations have traditionally relied upon legal deterrents, like data use agreements, and are considering suppressing or adding noise to genomic variants. In this report, we use a game theoretic lens to develop more effective, quantifiable protections for genomic data sharing. This is a fundamentally different approach because it accounts for adversarial behavior and capabilities and tailors protections to anticipated recipients with reasonable resources, not adversaries with unlimited means. We demonstrate this approach via a new public resource, the Sequence and Phenotype Integration Exchange (SPHINX), which contains genomic summary data from over 8,000 individuals, and show that risks can be balanced against utility more effectively than with traditional approaches. We further show the generalizability of this framework by applying it to other genomic data collection and sharing endeavors. Recognizing that such models are dependent on a variety of parameters, we perform extensive sensitivity analyses to show that our findings are robust to their fluctuations.
Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
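
Illustrative sketch (not from the article): the abstract describes weighing re-identification risk against data utility while modeling an adversary with limited resources. The toy leader-follower game below is one way to picture that trade-off. Everything in it is an assumption made for illustration: the function names, the independence-based success probability, the payoff forms, and the numbers. It searches all release subsets by brute force, which is a stand-in for, and not a reproduction of, the search procedure used by the authors.

```python
from itertools import combinations
from math import prod


def adversary_attacks(released, risk, gain, cost):
    """Hypothetical adversary model: attack only if expected gain exceeds cost.

    `risk[i]` is an assumed per-variant probability that variant i alone
    enables re-identification; variants are treated as independent.
    Returns (attacks, success_probability).
    """
    p_success = 1.0 - prod(1.0 - risk[i] for i in released)
    return p_success * gain > cost, p_success


def publisher_payoff(released, utility, risk, gain, cost, loss):
    """Publisher's payoff: utility of released variants minus expected loss."""
    attacks, p_success = adversary_attacks(released, risk, gain, cost)
    benefit = sum(utility[i] for i in released)
    return benefit - (p_success * loss if attacks else 0.0)


def best_release(utility, risk, gain, cost, loss):
    """Brute-force the publisher's best subset of variants to release.

    The publisher moves first and anticipates the adversary's best response.
    Exhaustive search is only feasible for a handful of variants.
    """
    n = len(utility)
    candidates = (
        frozenset(c) for k in range(n + 1) for c in combinations(range(n), k)
    )
    return max(
        candidates,
        key=lambda s: publisher_payoff(s, utility, risk, gain, cost, loss),
    )


# Made-up example: three variants of equal utility, one far riskier.
utility = [5, 5, 5]
risk = [0.1, 0.1, 0.6]
choice = best_release(utility, risk, gain=100, cost=30, loss=50)
print(sorted(choice))
```

With these made-up numbers, releasing only the two low-risk variants keeps the adversary's expected gain below the attack cost, so the publisher withholds the high-risk variant instead of suppressing everything.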

Keywords:  Electronic Medical Records and Genomics Network; Sequence and Phenotype Integration Exchange; adversarial modeling; game theory; genetic algorithm; genomic data privacy; genomic data sharing policy; re-identification risk; sensitivity analysis; summary statistics

Year: 2017; PMID: 28065469; PMCID: PMC5294764; DOI: 10.1016/j.ajhg.2016.12.002

Source DB: PubMed; Journal: Am J Hum Genet; ISSN: 0002-9297; Impact factor: 11.025


  12 in total

1.  The machine giveth and the machine taketh away: a parrot attack on clinical text deidentified with hiding in plain sight.

Authors:  David S Carrell; David J Cronkite; Muqun Rachel Li; Steve Nyemba; Bradley A Malin; John S Aberdeen; Lynette Hirschman
Journal:  J Am Med Inform Assoc       Date:  2019-12-01       Impact factor: 4.497

2.  Detecting the Presence of an Individual in Phenotypic Summary Data.

Authors:  Yongtai Liu; Zhiyu Wan; Weiyi Xia; Murat Kantarcioglu; Yevgeniy Vorobeychik; Ellen Wright Clayton; Abel Kho; David Carrell; Bradley A Malin
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

3.  An Open Source Tool for Game Theoretic Health Data De-Identification.

Authors:  Fabian Prasser; James Gaupp; Zhiyu Wan; Weiyi Xia; Yevgeniy Vorobeychik; Murat Kantarcioglu; Klaus Kuhn; Brad Malin
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

4.  Biomedical Research Cohort Membership Disclosure on Social Media.

Authors:  Yongtai Liu; Chao Yan; Zhijun Yin; Zhiyu Wan; Weiyi Xia; Murat Kantarcioglu; Yevgeniy Vorobeychik; Ellen Wright Clayton; Bradley A Malin
Journal:  AMIA Annu Symp Proc       Date:  2020-03-04

5.  A scalable software solution for anonymizing high-dimensional biomedical data.

Authors:  Thierry Meurers; Raffael Bild; Kieu-Mi Do; Fabian Prasser
Journal:  Gigascience       Date:  2021-10-04       Impact factor: 6.524

6.  Controlling the signal: Practical privacy protection of genomic data sharing through Beacon services.

Authors:  Zhiyu Wan; Yevgeniy Vorobeychik; Murat Kantarcioglu; Bradley Malin
Journal:  BMC Med Genomics       Date:  2017-07-26       Impact factor: 3.063

7.  Using game theory to thwart multistage privacy intrusions when sharing data.

Authors:  Zhiyu Wan; Yevgeniy Vorobeychik; Weiyi Xia; Yongtai Liu; Myrna Wooders; Jia Guo; Zhijun Yin; Ellen Wright Clayton; Murat Kantarcioglu; Bradley A Malin
Journal:  Sci Adv       Date:  2021-12-10       Impact factor: 14.136

Review 8.  A community effort to protect genomic data sharing, collaboration and outsourcing.

Authors:  Shuang Wang; Xiaoqian Jiang; Haixu Tang; Xiaofeng Wang; Diyue Bu; Knox Carey; Stephanie Om Dyke; Dov Fox; Chao Jiang; Kristin Lauter; Bradley Malin; Heidi Sofia; Amalio Telenti; Lei Wang; Wenhao Wang; Lucila Ohno-Machado
Journal:  NPJ Genom Med       Date:  2017-10-27       Impact factor: 8.617

9.  A systematic literature review of individuals' perspectives on privacy and genetic information in the United States.

Authors:  Ellen W Clayton; Colin M Halverson; Nila A Sathe; Bradley A Malin
Journal:  PLoS One       Date:  2018-10-31       Impact factor: 3.240

Review 10.  Lessons learned from the eMERGE Network: balancing genomics in discovery and practice.

Authors: 
Journal:  HGG Adv       Date:  2020-12-25
