Literature DB >> 20718045

Quality control and quality assurance in genotypic data for genome-wide association studies.

Cathy C Laurie1, Kimberly F Doheny, Daniel B Mirel, Elizabeth W Pugh, Laura J Bierut, Tushar Bhangale, Frederick Boehm, Neil E Caporaso, Marilyn C Cornelis, Howard J Edenberg, Stacy B Gabriel, Emily L Harris, Frank B Hu, Kevin B Jacobs, Peter Kraft, Maria Teresa Landi, Thomas Lumley, Teri A Manolio, Caitlin McHugh, Ian Painter, Justin Paschall, John P Rice, Kenneth M Rice, Xiuwen Zheng, Bruce S Weir.   

Abstract

Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies (GWAS). This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium test P-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis to SNP selection. The methods are illustrated with examples from the "Gene Environment Association Studies" (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of GWAS. (c) 2010 Wiley-Liss, Inc.

Entities:  

Mesh:

Year:  2010        PMID: 20718045      PMCID: PMC3061487          DOI: 10.1002/gepi.20516

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  27 in total

1.  Replicating genotype-phenotype associations.

Authors:  Stephen J Chanock; Teri Manolio; Michael Boehnke; Eric Boerwinkle; David J Hunter; Gilles Thomas; Joel N Hirschhorn; Goncalo Abecasis; David Altshuler; Joan E Bailey-Wilson; Lisa D Brooks; Lon R Cardon; Mark Daly; Peter Donnelly; Joseph F Fraumeni; Nelson B Freimer; Daniela S Gerhard; Chris Gunter; Alan E Guttmacher; Mark S Guyer; Emily L Harris; Josephine Hoh; Robert Hoover; C Augustine Kong; Kathleen R Merikangas; Cynthia C Morton; Lyle J Palmer; Elizabeth G Phimister; John P Rice; Jerry Roberts; Charles Rotimi; Margaret A Tucker; Kyle J Vogan; Sholom Wacholder; Ellen M Wijsman; Deborah M Winn; Francis S Collins
Journal:  Nature       Date:  2007-06-07       Impact factor: 49.962

2.  Genomewide association studies--illuminating biologic pathways.

Authors:  Joel N Hirschhorn
Journal:  N Engl J Med       Date:  2009-04-15       Impact factor: 91.245

3.  Mechanisms of mosaicism, chimerism and uniparental disomy identified by single nucleotide polymorphism array analysis.

Authors:  Laura K Conlin; Brian D Thiel; Carsten G Bonnemann; Livija Medne; Linda M Ernst; Elaine H Zackai; Matthew A Deardorff; Ian D Krantz; Hakon Hakonarson; Nancy B Spinner
Journal:  Hum Mol Genet       Date:  2010-01-06       Impact factor: 6.150

4.  Bayesian methods for examining Hardy-Weinberg equilibrium.

Authors:  Jon Wakefield
Journal:  Biometrics       Date:  2009-05-12       Impact factor: 2.571

5.  The Gene, Environment Association Studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions.

Authors:  Marilyn C Cornelis; Arpana Agrawal; John W Cole; Nadia N Hansel; Kathleen C Barnes; Terri H Beaty; Siiri N Bennett; Laura J Bierut; Eric Boerwinkle; Kimberly F Doheny; Bjarke Feenstra; Eleanor Feingold; Myriam Fornage; Christopher A Haiman; Emily L Harris; M Geoffrey Hayes; John A Heit; Frank B Hu; Jae H Kang; Cathy C Laurie; Hua Ling; Teri A Manolio; Mary L Marazita; Rasika A Mathias; Daniel B Mirel; Justin Paschall; Louis R Pasquale; Elizabeth W Pugh; John P Rice; Jenna Udren; Rob M van Dam; Xiaojing Wang; Janey L Wiggs; Kayleen Williams; Kai Yu
Journal:  Genet Epidemiol       Date:  2010-05       Impact factor: 2.135

6.  Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs.

Authors:  Joshua M Korn; Finny G Kuruvilla; Steven A McCarroll; Alec Wysoker; James Nemesh; Simon Cawley; Earl Hubbell; Jim Veitch; Patrick J Collins; Katayoon Darvishi; Charles Lee; Marcia M Nizzari; Stacey B Gabriel; Shaun Purcell; Mark J Daly; David Altshuler
Journal:  Nat Genet       Date:  2008-09-07       Impact factor: 38.330

7.  Genes mirror geography within Europe.

Authors:  John Novembre; Toby Johnson; Katarzyna Bryc; Zoltán Kutalik; Adam R Boyko; Adam Auton; Amit Indap; Karen S King; Sven Bergmann; Matthew R Nelson; Matthew Stephens; Carlos D Bustamante
Journal:  Nature       Date:  2008-08-31       Impact factor: 49.962

8.  Genome-wide SNP assay reveals structural genomic variation, extended homozygosity and cell-line induced alterations in normal individuals.

Authors:  Javier Simon-Sanchez; Sonja Scholz; Hon-Chung Fung; Mar Matarin; Dena Hernandez; J Raphael Gibbs; Angela Britton; Fabienne Wavrant de Vrieze; Elizabeth Peckham; Katrina Gwinn-Hardy; Anthony Crawley; Judith C Keen; Josefina Nash; Digamber Borgaonkar; John Hardy; Andrew Singleton
Journal:  Hum Mol Genet       Date:  2006-11-20       Impact factor: 6.150

9.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

10.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

Authors: 
Journal:  Nature       Date:  2007-06-07       Impact factor: 49.962

View more
  240 in total

1.  A likelihood-based approach to transcriptome association analysis.

Authors:  Jing Qian; Evan Ray; Regina L Brecha; Muredach P Reilly; Andrea S Foulkes
Journal:  Stat Med       Date:  2018-12-04       Impact factor: 2.373

2.  Next generation analytic tools for large scale genetic epidemiology studies of complex diseases.

Authors:  Leah E Mechanic; Huann-Sheng Chen; Christopher I Amos; Nilanjan Chatterjee; Nancy J Cox; Rao L Divi; Ruzong Fan; Emily L Harris; Kevin Jacobs; Peter Kraft; Suzanne M Leal; Kimberly McAllister; Jason H Moore; Dina N Paltoo; Michael A Province; Erin M Ramos; Marylyn D Ritchie; Kathryn Roeder; Daniel J Schaid; Matthew Stephens; Duncan C Thomas; Clarice R Weinberg; John S Witte; Shunpu Zhang; Sebastian Zöllner; Eric J Feuer; Elizabeth M Gillanders
Journal:  Genet Epidemiol       Date:  2011-12-06       Impact factor: 2.135

3.  Gene-environment interactions in genome-wide association studies: a comparative study of tests applied to empirical studies of type 2 diabetes.

Authors:  Marilyn C Cornelis; Eric J Tchetgen Tchetgen; Liming Liang; Lu Qi; Nilanjan Chatterjee; Frank B Hu; Peter Kraft
Journal:  Am J Epidemiol       Date:  2011-12-22       Impact factor: 4.897

4.  Genetic susceptibility to dental caries on pit and fissure and smooth surfaces.

Authors:  J R Shaffer; X Wang; R S Desensi; S Wendell; R J Weyant; K T Cuenco; R Crout; D W McNeil; M L Marazita
Journal:  Caries Res       Date:  2012-01-25       Impact factor: 4.056

5.  CHRNB3 is more strongly associated with Fagerström test for cigarette dependence-based nicotine dependence than cigarettes per day: phenotype definition changes genome-wide association studies results.

Authors:  John P Rice; Sarah M Hartz; Arpana Agrawal; Laura Almasy; Siiri Bennett; Naomi Breslau; Kathleen K Bucholz; Kimberly F Doheny; Howard J Edenberg; Alison M Goate; Victor Hesselbrock; William B Howells; Eric O Johnson; John Kramer; Robert F Krueger; Samuel Kuperman; Cathy Laurie; Teri A Manolio; Rosalind J Neuman; John I Nurnberger; Bernice Porjesz; Elizabeth Pugh; Erin M Ramos; Nancy Saccone; Scott Saccone; Marc Schuckit; Laura J Bierut
Journal:  Addiction       Date:  2012-06-15       Impact factor: 6.526

6.  Statistical genetic issues for genome-wide association studies.

Authors:  Bruce S Weir
Journal:  Genome       Date:  2010-11       Impact factor: 2.166

Review 7.  The role of replicates for error mitigation in next-generation sequencing.

Authors:  Kimberly Robasky; Nathan E Lewis; George M Church
Journal:  Nat Rev Genet       Date:  2013-12-10       Impact factor: 53.242

8.  Confidence intervals for heritability via Haseman-Elston regression.

Authors:  Tamar Sofer
Journal:  Stat Appl Genet Mol Biol       Date:  2017-09-26

9.  Genetic variation near IRS1 is associated with adiposity and a favorable metabolic profile in U.S. Hispanics/Latinos.

Authors:  Qibin Qi; Stephanie M Gogarten; Leslie S Emery; Tin Louie; Adrienne Stilp; Jianwen Cai; Neil Schneiderman; M Larissa Avilés-Santa; Robert C Kaplan; Kari E North; Cathy C Laurie; Ruth J F Loos; Carmen R Isasi
Journal:  Obesity (Silver Spring)       Date:  2016-09-24       Impact factor: 5.002

10.  Genome-Wide Association Study of Radiographic Knee Osteoarthritis in North American Caucasians.

Authors:  Michelle S Yau; Laura M Yerges-Armstrong; Youfang Liu; Cora E Lewis; David J Duggan; Jordan B Renner; James Torner; David T Felson; Charles E McCulloch; C Kent Kwoh; Michael C Nevitt; Marc C Hochberg; Braxton D Mitchell; Joanne M Jordan; Rebecca D Jackson
Journal:  Arthritis Rheumatol       Date:  2017-02       Impact factor: 10.995

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.