Literature DB >> 20501555

A quality control algorithm for filtering SNPs in genome-wide association studies.

Monnat Pongpanich1, Patrick F Sullivan, Jung-Ying Tzeng.   

Abstract

MOTIVATION: The quality control (QC) filtering of single nucleotide polymorphisms (SNPs) is an important step in genome-wide association studies to minimize potential false findings. SNP QC commonly uses expert-guided filters based on QC variables [e.g. Hardy-Weinberg equilibrium, missing proportion (MSP) and minor allele frequency (MAF)] to remove SNPs with insufficient genotyping quality. The rationale of the expert filters is sensible and concrete, but its implementation requires arbitrary thresholds and does not jointly consider all QC features.
RESULTS: We propose an algorithm that is based on principal component analysis and clustering analysis to identify low-quality SNPs. The method minimizes the use of arbitrary cutoff values, allows a collective consideration of the QC features and provides conditional thresholds contingent on other QC variables (e.g. different MSP thresholds for different MAFs). We apply our method to the seven studies from the Wellcome Trust Case Control Consortium and the major depressive disorder study from the Genetic Association Information Network. We measured the performance of our method compared to the expert filters based on the following criteria: (i) percentage of SNPs excluded due to low quality; (ii) inflation factor of the test statistics (lambda); (iii) number of false associations found in the filtered dataset; and (iv) number of true associations missed in the filtered dataset. The results suggest that with the same or fewer SNPs excluded, the proposed algorithm tends to give a similar or lower value of lambda, a reduced number of false associations, and retains all true associations. AVAILABILITY: The algorithm is available at http://www4.stat.ncsu.edu/jytzeng/software.php

Entities:  

Mesh:

Year:  2010        PMID: 20501555      PMCID: PMC2894516          DOI: 10.1093/bioinformatics/btq272

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  30 in total

1.  Assessment and management of single nucleotide polymorphism genotype errors in genetic association analysis.

Authors:  D Gordon; J Ott
Journal:  Pac Symp Biocomput       Date:  2001

2.  A transmission/disequilibrium test that allows for genotyping errors in the analysis of single-nucleotide polymorphism data.

Authors:  D Gordon; S C Heath; X Liu; J Ott
Journal:  Am J Hum Genet       Date:  2001-07-05       Impact factor: 11.025

3.  Allowing for genotyping error in analysis of unmatched case-control studies.

Authors:  K M Rice; P Holmans
Journal:  Ann Hum Genet       Date:  2003-03       Impact factor: 1.670

4.  Population structure, differential bias and genomic control in a large-scale, case-control association study.

Authors:  David G Clayton; Neil M Walker; Deborah J Smyth; Rebecca Pask; Jason D Cooper; Lisa M Maier; Luc J Smink; Alex C Lam; Nigel R Ovington; Helen E Stevens; Sarah Nutland; Joanna M M Howson; Malek Faham; Martin Moorhead; Hywel B Jones; Matthew Falkowski; Paul Hardenbol; Thomas D Willis; John A Todd
Journal:  Nat Genet       Date:  2005-10-09       Impact factor: 38.330

Review 5.  Genotyping errors: causes, consequences and solutions.

Authors:  François Pompanon; Aurélie Bonin; Eva Bellemain; Pierre Taberlet
Journal:  Nat Rev Genet       Date:  2005-11       Impact factor: 53.242

6.  Incorporating individual error rate into association test of unmatched case-control design.

Authors:  Ke Hao; Xiaobin Wang
Journal:  Hum Hered       Date:  2004       Impact factor: 0.444

7.  Quantification of the power of Hardy-Weinberg equilibrium testing to detect genotyping error.

Authors:  David G Cox; Peter Kraft
Journal:  Hum Hered       Date:  2006-03-01       Impact factor: 0.444

8.  Association of NOD2 leucine-rich repeat variants with susceptibility to Crohn's disease.

Authors:  J P Hugot; M Chamaillard; H Zouali; S Lesage; J P Cézard; J Belaiche; S Almer; C Tysk; C A O'Morain; M Gassull; V Binder; Y Finkel; A Cortot; R Modigliani; P Laurent-Puig; C Gower-Rousseau; J Macry; J F Colombel; M Sahbatou; G Thomas
Journal:  Nature       Date:  2001-05-31       Impact factor: 49.962

Review 9.  Progress in defining the molecular basis of type 2 diabetes mellitus through susceptibility-gene identification.

Authors:  Mark I McCarthy
Journal:  Hum Mol Genet       Date:  2004-01-13       Impact factor: 6.150

10.  On quality control measures in genome-wide association studies: a test to assess the genotyping quality of individual probands in family-based association studies and an application to the HapMap data.

Authors:  David W Fardo; Iuliana Ionita-Laza; Christoph Lange
Journal:  PLoS Genet       Date:  2009-07-24       Impact factor: 6.020

View more
  9 in total

1.  PBAP: a pipeline for file processing and quality control of pedigree data with dense genetic markers.

Authors:  Alejandro Q Nato; Nicola H Chapman; Harkirat K Sohi; Hiep D Nguyen; Zoran Brkanac; Ellen M Wijsman
Journal:  Bioinformatics       Date:  2015-07-30       Impact factor: 6.937

2.  Genetic variation predicting cisplatin cytotoxicity associated with overall survival in lung cancer patients receiving platinum-based chemotherapy.

Authors:  Xiang-Lin Tan; Ann M Moyer; Brooke L Fridley; Daniel J Schaid; Nifang Niu; Anthony J Batzler; Gregory D Jenkins; Ryan P Abo; Liang Li; Julie M Cunningham; Zhifu Sun; Ping Yang; Liewei Wang
Journal:  Clin Cancer Res       Date:  2011-07-20       Impact factor: 12.531

3.  Establishing analytical validity of BeadChip array genotype data by comparison to whole-genome sequence and standard benchmark datasets.

Authors:  Praveen F Cherukuri; Melissa M Soe; David E Condon; Shubhi Bartaria; Kaitlynn Meis; Shaopeng Gu; Frederick G Frost; Lindsay M Fricke; Krzysztof P Lubieniecki; Joanna M Lubieniecka; Robert E Pyatt; Catherine Hajek; Cornelius F Boerkoel; Lynn Carmichael
Journal:  BMC Med Genomics       Date:  2022-03-14       Impact factor: 3.063

4.  IL1RN coding variant is associated with lower risk of acute respiratory distress syndrome and increased plasma IL-1 receptor antagonist.

Authors:  Nuala J Meyer; Rui Feng; Mingyao Li; Yang Zhao; Chau-Chyun Sheu; Paula Tejera; Robert Gallop; Scarlett Bellamy; Melanie Rushefski; Paul N Lanken; Richard Aplenc; Grant E O'Keefe; Mark M Wurfel; David C Christiani; Jason D Christie
Journal:  Am J Respir Crit Care Med       Date:  2013-05-01       Impact factor: 21.405

Review 5.  Genomics models in radiotherapy: From mechanistic to machine learning.

Authors:  John Kang; James T Coates; Robert L Strawderman; Barry S Rosenstein; Sarah L Kerns
Journal:  Med Phys       Date:  2020-06       Impact factor: 4.071

6.  A genome-wide association study identifies two novel promising candidate genes affecting Escherichia coli F4ab/F4ac susceptibility in swine.

Authors:  Wei-Xuan Fu; Yang Liu; Xin Lu; Xiao-Yan Niu; Xiang-Dong Ding; Jian-Feng Liu; Qin Zhang
Journal:  PLoS One       Date:  2012-03-23       Impact factor: 3.240

7.  Genome wide association study for drought, aflatoxin resistance, and important agronomic traits of maize hybrids in the sub-tropics.

Authors:  Ivan D Barrero Farfan; Gerald N De La Fuente; Seth C Murray; Thomas Isakeit; Pei-Cheng Huang; Marilyn Warburton; Paul Williams; Gary L Windham; Mike Kolomiets
Journal:  PLoS One       Date:  2015-02-25       Impact factor: 3.240

8.  Whole Genome Multi-Locus Sequence Typing and Genomic Single Nucleotide Polymorphism Analysis for Epidemiological Typing of Pseudomonas aeruginosa From Indonesian Intensive Care Units.

Authors:  Manisha Goyal; Andreu Coello Pelegrin; Magali Jaillard; Yulia Rosa Saharman; Corné H W Klaassen; Henri A Verbrugh; Juliëtte A Severin; Alex van Belkum
Journal:  Front Microbiol       Date:  2022-07-14       Impact factor: 6.064

9.  Genome-Wide Association Studies of 11 Agronomic Traits in Cassava (Manihot esculenta Crantz).

Authors:  Shengkui Zhang; Xin Chen; Cheng Lu; Jianqiu Ye; Meiling Zou; Kundian Lu; Subin Feng; Jinli Pei; Chen Liu; Xincheng Zhou; Ping'an Ma; Zhaogui Li; Cuijuan Liu; Qi Liao; Zhiqiang Xia; Wenquan Wang
Journal:  Front Plant Sci       Date:  2018-04-19       Impact factor: 5.753

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.