Literature DB >> 24509073

Scalable privacy-preserving data sharing methodology for genome-wide association studies.

Fei Yu1, Stephen E Fienberg2, Aleksandra B Slavković3, Caroline Uhler4.   

Abstract

The protection of privacy of individual-level information in genome-wide association study (GWAS) databases has been a major concern of researchers following the publication of "an attack" on GWAS data by Homer et al. (2008). Traditional statistical methods for confidentiality and privacy protection of statistical databases do not scale well to deal with GWAS data, especially in terms of guarantees regarding protection from linkage to external information. The more recent concept of differential privacy, introduced by the cryptographic community, is an approach that provides a rigorous definition of privacy with meaningful privacy guarantees in the presence of arbitrary external information, although the guarantees may come at a serious price in terms of data utility. Building on such notions, Uhler et al. (2013) proposed new methods to release aggregate GWAS data without compromising an individual's privacy. We extend the methods developed in Uhler et al. (2013) for releasing differentially-private χ(2)-statistics by allowing for arbitrary number of cases and controls, and for releasing differentially-private allelic test statistics. We also provide a new interpretation by assuming the controls' data are known, which is a realistic assumption because some GWAS use publicly available data as controls. We assess the performance of the proposed methods through a risk-utility analysis on a real data set consisting of DNA samples collected by the Wellcome Trust Case Control Consortium and compare the methods with the differentially-private release mechanism proposed by Johnson and Shmatikov (2013).
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Allelic test; Contingency table; Differential privacy; Genome-wide association study (GWAS); Pearson -test; Single-nucleotide polymorphism (SNP)

Mesh:

Year:  2014        PMID: 24509073      PMCID: PMC4221263          DOI: 10.1016/j.jbi.2014.01.008

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  19 in total

1.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

2.  Genetic privacy. Whole-genome data not anonymous, challenging assumptions.

Authors:  Jennifer Couzin
Journal:  Science       Date:  2008-09-05       Impact factor: 47.728

3.  Protecting aggregate genomic data.

Authors:  Elias A Zerhouni; Elizabeth G Nabel
Journal:  Science       Date:  2008-09-04       Impact factor: 47.728

4.  Anonymization of electronic medical records for validating genome-wide association studies.

Authors:  Grigorios Loukides; Aris Gkoulalas-Divanis; Bradley Malin
Journal:  Proc Natl Acad Sci U S A       Date:  2010-04-12       Impact factor: 11.205

5.  On sharing quantitative trait GWAS results in an era of multiple-omics data and the limits of genomic privacy.

Authors:  Hae Kyung Im; Eric R Gamazon; Dan L Nicolae; Nancy J Cox
Journal:  Am J Hum Genet       Date:  2012-03-28       Impact factor: 11.025

6.  USING LINEAR PREDICTORS TO IMPUTE ALLELE FREQUENCIES FROM SUMMARY OR POOLED GENOTYPE DATA.

Authors:  Xiaoquan Wen; Matthew Stephens
Journal:  Ann Appl Stat       Date:  2010-09       Impact factor: 2.083

7.  Differentially Private Empirical Risk Minimization.

Authors:  Kamalika Chaudhuri; Claire Monteleoni; Anand D Sarwate
Journal:  J Mach Learn Res       Date:  2011-03       Impact factor: 3.654

8.  On inferring presence of an individual in a mixture: a Bayesian approach.

Authors:  David Clayton
Journal:  Biostatistics       Date:  2010-06-03       Impact factor: 5.899

9.  Privacy-Preserving Data Sharing for Genome-Wide Association Studies.

Authors:  Caroline Uhlerop; Aleksandra Slavković; Stephen E Fienberg
Journal:  J Priv Confid       Date:  2013

10.  The limits of individual identification from sample allele frequencies: theory and statistical analysis.

Authors:  Peter M Visscher; William G Hill
Journal:  PLoS Genet       Date:  2009-10-02       Impact factor: 5.917

View more
  17 in total

Review 1.  Privacy challenges and research opportunities for genomic data sharing.

Authors:  Luca Bonomi; Yingxiang Huang; Lucila Ohno-Machado
Journal:  Nat Genet       Date:  2020-06-29       Impact factor: 38.330

Review 2.  Privacy-preserving techniques of genomic data-a survey.

Authors:  Md Momin Al Aziz; Md Nazmus Sadat; Dima Alhadidi; Shuang Wang; Xiaoqian Jiang; Cheryl L Brown; Noman Mohammed
Journal:  Brief Bioinform       Date:  2019-05-21       Impact factor: 11.622

3.  Big Data Privacy in Biomedical Research.

Authors:  Shuang Wang; Luca Bonomi; Wenrui Dai; Feng Chen; Cynthia Cheung; Cinnamon S Bloss; Samuel Cheng; Xiaoqian Jiang
Journal:  IEEE Trans Big Data       Date:  2016-09-13

4.  Privacy Policy and Technology in Biomedical Data Science.

Authors:  April Moreno Arellano; Wenrui Dai; Shuang Wang; Xiaoqian Jiang; Lucila Ohno-Machado
Journal:  Annu Rev Biomed Data Sci       Date:  2018-07

Review 5.  Routes for breaching and protecting genetic privacy.

Authors:  Yaniv Erlich; Arvind Narayanan
Journal:  Nat Rev Genet       Date:  2014-05-08       Impact factor: 53.242

6.  Enabling Privacy-Preserving GWASs in Heterogeneous Human Populations.

Authors:  Sean Simmons; Cenk Sahinalp; Bonnie Berger
Journal:  Cell Syst       Date:  2016-07-21       Impact factor: 10.304

7.  Size matters: how population size influences genotype-phenotype association studies in anonymized data.

Authors:  Raymond Heatherly; Joshua C Denny; Jonathan L Haines; Dan M Roden; Bradley A Malin
Journal:  J Biomed Inform       Date:  2014-07-16       Impact factor: 6.317

Review 8.  Genome privacy: challenges, technical approaches to mitigate risk, and ethical considerations in the United States.

Authors:  Shuang Wang; Xiaoqian Jiang; Siddharth Singh; Rebecca Marmor; Luca Bonomi; Dov Fox; Michelle Dow; Lucila Ohno-Machado
Journal:  Ann N Y Acad Sci       Date:  2016-09-28       Impact factor: 5.691

9.  Privacy in the Genomic Era.

Authors:  Muhammad Naveed; Erman Ayday; Ellen W Clayton; Jacques Fellay; Carl A Gunter; Jean-Pierre Hubaux; Bradley A Malin; Xiaofeng Wang
Journal:  ACM Comput Surv       Date:  2015-09       Impact factor: 10.282

10.  Mechanisms to protect the privacy of families when using the transmission disequilibrium test in genome-wide association studies.

Authors:  Meng Wang; Zhanglong Ji; Shuang Wang; Jihoon Kim; Hai Yang; Xiaoqian Jiang; Lucila Ohno-Machado
Journal:  Bioinformatics       Date:  2017-12-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.