Literature DB >> 30052749

Re-identification of individuals in genomic data-sharing beacons via allele inference.

Nora von Thenen1, Erman Ayday1,2, A Ercument Cicek1,3.   

Abstract

Motivation: Genomic data-sharing beacons aim to provide a secure, easy to implement and standardized interface for data-sharing by only allowing yes/no queries on the presence of specific alleles in the dataset. Previously deemed secure against re-identification attacks, beacons were shown to be vulnerable despite their stringent policy. Recent studies have demonstrated that it is possible to determine whether the victim is in the dataset, by repeatedly querying the beacon for his/her single-nucleotide polymorphisms (SNPs). Here, we propose a novel re-identification attack and show that the privacy risk is more serious than previously thought.
Results: Using the proposed attack, even if the victim systematically hides informative SNPs, it is possible to infer the alleles at positions of interest as well as the beacon query results with very high confidence. Our method is based on the fact that alleles at different loci are not necessarily independent. We use linkage disequilibrium and a high-order Markov chain-based algorithm for inference. We show that in a simulated beacon with 65 individuals from the European population, we can infer membership of individuals with 95% confidence with only 5 queries, even when SNPs with MAF <0.05 are hidden. We need less than 0.5% of the number of queries that existing works require, to determine beacon membership under the same conditions. We show that countermeasures such as hiding certain parts of the genome or setting a query budget for the user would fail to protect the privacy of the participants. Availability and implementation: Software is available at http://ciceklab.cs.bilkent.edu.tr/beacon_attack. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2019        PMID: 30052749     DOI: 10.1093/bioinformatics/bty643

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  Privacy-preserving biomedical data dissemination via a hybrid approach.

Authors:  Yichen Jiang; Chenghong Wang; Zhixuan Wu; Xin Du; Shuang Wang
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

Review 2.  Are Public Repository Requirements Exacerbating Lack of Diversity?

Authors:  Thomas May
Journal:  Trends Genet       Date:  2020-04-14       Impact factor: 11.639

3.  Reconstructing Genotypes in Private Genomic Databases from Genetic Risk Scores.

Authors:  Brooks Paige; James Bell; Aurélien Bellet; Adrià Gascón; Daphne Ezer
Journal:  J Comput Biol       Date:  2021-01-05       Impact factor: 1.479

4.  WHY WE FEAR GENETIC INFORMANTS: USING GENETIC GENEALOGY TO CATCH SERIAL KILLERS.

Authors:  Teneille R Brown
Journal:  Columbia Sci Technol Law Rev       Date:  2019

5.  Privacy Risks of Sharing Data from Environmental Health Studies.

Authors:  Katherine E Boronow; Laura J Perovich; Latanya Sweeney; Ji Su Yoo; Ruthann A Rudel; Phil Brown; Julia Green Brody
Journal:  Environ Health Perspect       Date:  2020-01-10       Impact factor: 9.031

6.  Using game theory to thwart multistage privacy intrusions when sharing data.

Authors:  Zhiyu Wan; Yevgeniy Vorobeychik; Weiyi Xia; Yongtai Liu; Myrna Wooders; Jia Guo; Zhijun Yin; Ellen Wright Clayton; Murat Kantarcioglu; Bradley A Malin
Journal:  Sci Adv       Date:  2021-12-10       Impact factor: 14.136

Review 7.  Sociotechnical safeguards for genomic data privacy.

Authors:  Zhiyu Wan; James W Hazel; Ellen Wright Clayton; Yevgeniy Vorobeychik; Murat Kantarcioglu; Bradley A Malin
Journal:  Nat Rev Genet       Date:  2022-03-04       Impact factor: 59.581

8.  The effect of kinship in re-identification attacks against genomic data sharing beacons.

Authors:  Kerem Ayoz; Miray Aysen; Erman Ayday; A Ercument Cicek
Journal:  Bioinformatics       Date:  2020-12-30       Impact factor: 6.937

9.  Haplotype-based membership inference from summary genomic data.

Authors:  Diyue Bu; Xiaofeng Wang; Haixu Tang
Journal:  Bioinformatics       Date:  2021-07-12       Impact factor: 6.937

Review 10.  Beyond Genes: Re-Identifiability of Proteomic Data and Its Implications for Personalized Medicine.

Authors:  Kurt Boonen; Kristien Hens; Gerben Menschaert; Geert Baggerman; Dirk Valkenborg; Gokhan Ertaylan
Journal:  Genes (Basel)       Date:  2019-09-05       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.