Literature DB >> 22611595

Detecting sample misidentifications in genetic association studies.

Claus T Ekstrøm1, Bjarke Feenstra.   

Abstract

Genetic association studies require that the genotype data from a given person can be correctly linked to the phenotype data from the same person. However, sample misidentification errors sometimes happen, whereby the link becomes invalid for some of the subjects in a study. This can have substantial consequences in terms of power to detect truly associated variants. In family-based studies, Mendelian inconsistencies can be used to detect sample misidentification. Genome-wide association studies (GWAS), however, typically use unrelated individuals, making error detection more problematic. Here we present a method for identifying potential sample misidentifications in GWAS and other genetic association studies building on ideas from forensic sciences. A widely used ad-hoc method for error detection is to check if the sex of an individual matches its X-linked genotype. We generalize this idea to less stringent associations between known genotypes and phenotypes, and show that if several known associations are combined, the power to detect misidentifications increases substantially. Individuals with an unlikely set of phenotypes given their genotypes are flagged as potential errors. We provide analytical and simulation results comparing the odds that the genotype and phenotype are both from the same individual for different numbers of available genotype-p henotype associations and for different information content of the associations. Our method has good sensitivity and specificity with as few as ten moderately informative genotype-phenotype associations. We apply the method to GWAS data from the Danish National Birth Cohort.

Entities:  

Mesh:

Year:  2012        PMID: 22611595     DOI: 10.1515/1544-6115.1772

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  2 in total

1.  Identification and Correction of Sample Mix-Ups in Expression Genetic Data: A Case Study.

Authors:  Karl W Broman; Mark P Keller; Aimee Teo Broman; Christina Kendziorski; Brian S Yandell; Śaunak Sen; Alan D Attie
Journal:  G3 (Bethesda)       Date:  2015-08-19       Impact factor: 3.154

2.  reGenotyper: Detecting mislabeled samples in genetic data.

Authors:  Konrad Zych; Basten L Snoek; Mark Elvin; Miriam Rodriguez; K Joeri Van der Velde; Danny Arends; Harm-Jan Westra; Morris A Swertz; Gino Poulin; Jan E Kammenga; Rainer Breitling; Ritsert C Jansen; Yang Li
Journal:  PLoS One       Date:  2017-02-13       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.