MOTIVATION: The technology to genotype single nucleotide polymorphisms (SNPs) at extremely high densities provides for hypothesis-free genome-wide scans for common polymorphisms associated with complex disease. However, we find that some errors introduced by commonly employed genotyping algorithms may lead to inflation of false associations between markers and phenotype. RESULTS: We have developed a novel SNP genotype calling program, SNiPer-High Density (SNiPer-HD), for highly accurate genotype calling across hundreds of thousands of SNPs. The program employs an expectation-maximization (EM) algorithm with parameters based on a training sample set. The algorithm choice allows for highly accurate genotyping for most SNPs. Also, we introduce a quality control metric for each assayed SNP, such that poor-behaving SNPs can be filtered using a metric correlating to genotype class separation in the calling algorithm. SNiPer-HD is superior to the standard dynamic modeling algorithm and is complementary and non-redundant to other algorithms, such as BRLMM. Implementing multiple algorithms together may provide highly accurate genotyping calls, without inflation of false positives due to systematically miss-called SNPs. A reliable and accurate set of SNP genotypes for increasingly dense panels will eliminate some false association signals and false negative signals, allowing for rapid identification of disease susceptibility loci for complex traits. AVAILABILITY: SNiPer-HD is available at TGen's website: http://www.tgen.org/neurogenomics/data.
MOTIVATION: The technology to genotype single nucleotide polymorphisms (SNPs) at extremely high densities provides for hypothesis-free genome-wide scans for common polymorphisms associated with complex disease. However, we find that some errors introduced by commonly employed genotyping algorithms may lead to inflation of false associations between markers and phenotype. RESULTS: We have developed a novel SNP genotype calling program, SNiPer-High Density (SNiPer-HD), for highly accurate genotype calling across hundreds of thousands of SNPs. The program employs an expectation-maximization (EM) algorithm with parameters based on a training sample set. The algorithm choice allows for highly accurate genotyping for most SNPs. Also, we introduce a quality control metric for each assayed SNP, such that poor-behaving SNPs can be filtered using a metric correlating to genotype class separation in the calling algorithm. SNiPer-HD is superior to the standard dynamic modeling algorithm and is complementary and non-redundant to other algorithms, such as BRLMM. Implementing multiple algorithms together may provide highly accurate genotyping calls, without inflation of false positives due to systematically miss-called SNPs. A reliable and accurate set of SNP genotypes for increasingly dense panels will eliminate some false association signals and false negative signals, allowing for rapid identification of disease susceptibility loci for complex traits. AVAILABILITY: SNiPer-HD is available at TGen's website: http://www.tgen.org/neurogenomics/data.
Authors: George Zogopoulos; Kevin C H Ha; Faisal Naqib; Sara Moore; Hyeja Kim; Alexandre Montpetit; Frederick Robidoux; Philippe Laflamme; Michelle Cotterchio; Celia Greenwood; Stephen W Scherer; Brent Zanke; Thomas J Hudson; Gary D Bader; Steven Gallinger Journal: Hum Genet Date: 2007-07-19 Impact factor: 4.132
Authors: Nils Homer; Waibhav D Tembe; Szabolcs Szelinger; Margot Redman; Dietrich A Stephan; John V Pearson; Stanley F Nelson; David Craig Journal: Bioinformatics Date: 2008-07-10 Impact factor: 6.937
Authors: Matthew E Ritchie; Benilton S Carvalho; Kurt N Hetrick; Simon Tavaré; Rafael A Irizarry Journal: Bioinformatics Date: 2009-08-06 Impact factor: 6.937
Authors: Jason J Corneveaux; Winnie S Liang; Eric M Reiman; Jennifer A Webster; Amanda J Myers; Victoria L Zismann; Keta D Joshipura; John V Pearson; Diane Hu-Lince; David W Craig; Keith D Coon; Travis Dunckley; Daniel Bandy; Wendy Lee; Kewei Chen; Thomas G Beach; Diego Mastroeni; Andrew Grover; Rivka Ravid; Sigrid B Sando; Jan O Aasly; Reinhard Heun; Frank Jessen; Heike Kölsch; Joseph Rogers; Michael L Hutton; Stacey Melquist; Ron C Petersen; Gene E Alexander; Richard J Caselli; Andreas Papassotiropoulos; Dietrich A Stephan; Matthew J Huentelman Journal: Neurobiol Aging Date: 2008-09-13 Impact factor: 4.673
Authors: Jumamurat R Bayjanov; Michiel Wels; Marjo Starrenburg; Johan E T van Hylckama Vlieg; Roland J Siezen; Douwe Molenaar Journal: Bioinformatics Date: 2009-01-07 Impact factor: 6.937
Authors: Chris D Greenman; Graham Bignell; Adam Butler; Sarah Edkins; Jon Hinton; Dave Beare; Sajani Swamy; Thomas Santarius; Lina Chen; Sara Widaa; P Andy Futreal; Michael R Stratton Journal: Biostatistics Date: 2009-10-15 Impact factor: 5.899