Literature DB >> 28507140

Linkage disequilibrium matches forensic genetic records to disjoint genomic marker sets.

Michael D Edge1, Bridget F B Algee-Hewitt1, Trevor J Pemberton2, Jun Z Li3, Noah A Rosenberg4.   

Abstract

Combining genotypes across datasets is central in facilitating advances in genetics. Data aggregation efforts often face the challenge of record matching-the identification of dataset entries that represent the same individual. We show that records can be matched across genotype datasets that have no shared markers based on linkage disequilibrium between loci appearing in different datasets. Using two datasets for the same 872 people-one with 642,563 genome-wide SNPs and the other with 13 short tandem repeats (STRs) used in forensic applications-we find that 90-98% of forensic STR records can be connected to corresponding SNP records and vice versa. Accuracy increases to 99-100% when ∼30 STRs are used. Our method expands the potential of data aggregation, but it also suggests privacy risks intrinsic in maintenance of databases containing even small numbers of markers-including databases of forensic significance.

Entities:  

Keywords:  forensic DNA; genomic privacy; imputation; population genetics; record matching

Mesh:

Substances:

Year:  2017        PMID: 28507140      PMCID: PMC5465933          DOI: 10.1073/pnas.1619944114

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  32 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  CODIS STR loci data from 41 sample populations.

Authors:  B Budowle; B Shea; S Niezgoda; R Chakraborty
Journal:  J Forensic Sci       Date:  2001-05       Impact factor: 1.832

3.  Graydon et al. provide no new evidence that forensic STR loci are functional.

Authors:  Kirk E Lohmueller
Journal:  Forensic Sci Int Genet       Date:  2009-10-02       Impact factor: 4.882

4.  Inferring ethnicity using 15 autosomal STR loci--comparisons among populations of similar and distinctly different physical traits.

Authors:  Matthew Graydon; François Cholette; Lay-Keow Ng
Journal:  Forensic Sci Int Genet       Date:  2009-05-05       Impact factor: 4.882

5.  STRait Razor v2.0: the improved STR Allele Identification Tool--Razor.

Authors:  David H Warshauer; Jonathan L King; Bruce Budowle
Journal:  Forensic Sci Int Genet       Date:  2014-10-22       Impact factor: 4.882

Review 6.  Routes for breaching and protecting genetic privacy.

Authors:  Yaniv Erlich; Arvind Narayanan
Journal:  Nat Rev Genet       Date:  2014-05-08       Impact factor: 53.242

Review 7.  Genotype imputation.

Authors:  Yun Li; Cristen Willer; Serena Sanna; Gonçalo Abecasis
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

8.  Human genetic diversity: Lewontin's fallacy.

Authors:  A W F Edwards
Journal:  Bioessays       Date:  2003-08       Impact factor: 4.345

9.  Population structure in a comprehensive genomic data set on human microsatellite variation.

Authors:  Trevor J Pemberton; Michael DeGiorgio; Noah A Rosenberg
Journal:  G3 (Bethesda)       Date:  2013-05-20       Impact factor: 3.154

10.  Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays.

Authors:  Nils Homer; Szabolcs Szelinger; Margot Redman; David Duggan; Waibhav Tembe; Jill Muehling; John V Pearson; Dietrich A Stephan; Stanley F Nelson; David W Craig
Journal:  PLoS Genet       Date:  2008-08-29       Impact factor: 5.917

View more
  11 in total

1.  DNA Data Collection and Analysis in the Forensic Arena.

Authors:  Sydnie Grabell; Noam Shomron
Journal:  Methods Mol Biol       Date:  2021

2.  Statistical Detection of Relatives Typed with Disjoint Forensic and Biomedical Loci.

Authors:  Jaehee Kim; Michael D Edge; Bridget F B Algee-Hewitt; Jun Z Li; Noah A Rosenberg
Journal:  Cell       Date:  2018-10-11       Impact factor: 41.582

3.  Human-Genetic Ancestry Inference and False Positives in Forensic Familial Searching.

Authors:  Alyssa Lyn Fortier; Jaehee Kim; Noah A Rosenberg
Journal:  G3 (Bethesda)       Date:  2020-08-05       Impact factor: 3.154

4.  mixIndependR: a R package for statistical independence testing of loci in database of multi-locus genotypes.

Authors:  Bing Song; August E Woerner; John Planz
Journal:  BMC Bioinformatics       Date:  2021-01-06       Impact factor: 3.169

5.  Using game theory to thwart multistage privacy intrusions when sharing data.

Authors:  Zhiyu Wan; Yevgeniy Vorobeychik; Weiyi Xia; Yongtai Liu; Myrna Wooders; Jia Guo; Zhijun Yin; Ellen Wright Clayton; Murat Kantarcioglu; Bradley A Malin
Journal:  Sci Adv       Date:  2021-12-10       Impact factor: 14.136

Review 6.  Sociotechnical safeguards for genomic data privacy.

Authors:  Zhiyu Wan; James W Hazel; Ellen Wright Clayton; Yevgeniy Vorobeychik; Murat Kantarcioglu; Bradley A Malin
Journal:  Nat Rev Genet       Date:  2022-03-04       Impact factor: 59.581

7.  Estimation of kinship coefficient in structured and admixed populations using sparse sequencing data.

Authors:  Jinzhuang Dou; Baoluo Sun; Xueling Sim; Jason D Hughes; Dermot F Reilly; E Shyong Tai; Jianjun Liu; Chaolong Wang
Journal:  PLoS Genet       Date:  2017-09-29       Impact factor: 5.917

8.  A reference haplotype panel for genome-wide imputation of short tandem repeats.

Authors:  Shubham Saini; Ileena Mitra; Nima Mousavi; Stephanie Feupe Fotsing; Melissa Gymrek
Journal:  Nat Commun       Date:  2018-10-23       Impact factor: 14.919

9.  Forensic genealogy, bioethics and the Golden State Killer case.

Authors:  Ray A Wickenheiser
Journal:  Forensic Sci Int Synerg       Date:  2019-07-12

10.  Attacks on genetic privacy via uploads to genealogical databases.

Authors:  Michael D Edge; Graham Coop
Journal:  Elife       Date:  2020-01-07       Impact factor: 8.713

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.