Literature DB >> 20945829

FastANOVA: an Efficient Algorithm for Genome-Wide Association Study.

Xiang Zhang1, Fei Zou, Wei Wang.   

Abstract

Studying the association between quantitative phenotype (such as height or weight) and single nucleotide polymorphisms (SNPs) is an important problem in biology. To understand underlying mechanisms of complex phenotypes, it is often necessary to consider joint genetic effects across multiple SNPs. ANOVA (analysis of variance) test is routinely used in association study. Important findings from studying gene-gene (SNP-pair) interactions are appearing in the literature. However, the number of SNPs can be up to millions. Evaluating joint effects of SNPs is a challenging task even for SNP-pairs. Moreover, with large number of SNPs correlated, permutation procedure is preferred over simple Bonferroni correction for properly controlling family-wise error rate and retaining mapping power, which dramatically increases the computational cost of association study.In this paper, we study the problem of finding SNP-pairs that have significant associations with a given quantitative phenotype. We propose an efficient algorithm, FastANOVA, for performing ANOVA tests on SNP-pairs in a batch mode, which also supports large permutation test. We derive an upper bound of SNP-pair ANOVA test, which can be expressed as the sum of two terms. The first term is based on single-SNP ANOVA test. The second term is based on the SNPs and independent of any phenotype permutation. Furthermore, SNP-pairs can be organized into groups, each of which shares a common upper bound. This allows for maximum reuse of intermediate computation, efficient upper bound estimation, and effective SNP-pair pruning. Consequently, FastANOVA only needs to perform the ANOVA test on a small number of candidate SNP-pairs without the risk of missing any significant ones. Extensive experiments demonstrate that FastANOVA is orders of magnitude faster than the brute-force implementation of ANOVA tests on all SNP pairs.

Year:  2008        PMID: 20945829      PMCID: PMC2951741     

Source DB:  PubMed          Journal:  KDD        ISSN: 2154-817X


  21 in total

1.  Selecting SNPs in two-stage analysis of disease association data: a model-free approach.

Authors:  J Hoh; A Wille; R Zee; S Cheng; R Reynolds; K Lindpaintner; J Ott
Journal:  Ann Hum Genet       Date:  2000-09       Impact factor: 1.670

Review 2.  Mathematical multi-locus approaches to localizing complex human trait genes.

Authors:  Josephine Hoh; Jurg Ott
Journal:  Nat Rev Genet       Date:  2003-09       Impact factor: 53.242

3.  Minimal haplotype tagging.

Authors:  Paola Sebastiani; Ross Lazarus; Scott T Weiss; Louis M Kunkel; Isaac S Kohane; Marco F Ramoni
Journal:  Proc Natl Acad Sci U S A       Date:  2003-08-04       Impact factor: 11.205

Review 4.  Mapping complex disease loci in whole-genome association studies.

Authors:  Christopher S Carlson; Michael A Eberle; Leonid Kruglyak; Deborah A Nickerson
Journal:  Nature       Date:  2004-05-27       Impact factor: 49.962

5.  Modular epistasis in yeast metabolism.

Authors:  Daniel Segrè; Alexander Deluna; George M Church; Roy Kishony
Journal:  Nat Genet       Date:  2004-12-12       Impact factor: 38.330

6.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer.

Authors:  M D Ritchie; L W Hahn; N Roodi; L R Bailey; W D Dupont; F F Parl; J H Moore
Journal:  Am J Hum Genet       Date:  2001-06-11       Impact factor: 11.025

7.  Genome-wide epistatic interaction analysis reveals complex genetic determinants of circadian behavior in mice.

Authors:  K Shimomura; S S Low-Zeddies; D P King; T D Steeves; A Whiteley; J Kushla; P D Zemenides; A Lin; M H Vitaterna; G A Churchill; J S Takahashi
Journal:  Genome Res       Date:  2001-06       Impact factor: 9.043

8.  Genetic and haplotype diversity among wild-derived mouse inbred strains.

Authors:  Folami Y Ideraabdullah; Elena de la Casa-Esperón; Timothy A Bell; David A Detwiler; Terry Magnuson; Carmen Sapienza; Fernando Pardo-Manuel de Villena
Journal:  Genome Res       Date:  2004-10       Impact factor: 9.043

9.  Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels.

Authors:  Richa Saxena; Benjamin F Voight; Valeriya Lyssenko; Noël P Burtt; Paul I W de Bakker; Hong Chen; Jeffrey J Roix; Sekar Kathiresan; Joel N Hirschhorn; Mark J Daly; Thomas E Hughes; Leif Groop; David Altshuler; Peter Almgren; Jose C Florez; Joanne Meyer; Kristin Ardlie; Kristina Bengtsson Boström; Bo Isomaa; Guillaume Lettre; Ulf Lindblad; Helen N Lyon; Olle Melander; Christopher Newton-Cheh; Peter Nilsson; Marju Orho-Melander; Lennart Råstam; Elizabeth K Speliotes; Marja-Riitta Taskinen; Tiinamaija Tuomi; Candace Guiducci; Anna Berglund; Joyce Carlson; Lauren Gianniny; Rachel Hackett; Liselotte Hall; Johan Holmkvist; Esa Laurila; Marketa Sjögren; Maria Sterner; Aarti Surti; Margareta Svensson; Malin Svensson; Ryan Tewhey; Brendan Blumenstiel; Melissa Parkin; Matthew Defelice; Rachel Barry; Wendy Brodeur; Jody Camarata; Nancy Chia; Mary Fava; John Gibbons; Bob Handsaker; Claire Healy; Kieu Nguyen; Casey Gates; Carrie Sougnez; Diane Gage; Marcia Nizzari; Stacey B Gabriel; Gung-Wei Chirn; Qicheng Ma; Hemang Parikh; Delwood Richardson; Darrell Ricke; Shaun Purcell
Journal:  Science       Date:  2007-04-26       Impact factor: 47.728

10.  Genome-wide association scan shows genetic variants in the FTO gene are associated with obesity-related traits.

Authors:  Angelo Scuteri; Serena Sanna; Wei-Min Chen; Manuela Uda; Giuseppe Albai; James Strait; Samer Najjar; Ramaiah Nagaraja; Marco Orrú; Gianluca Usala; Mariano Dei; Sandra Lai; Andrea Maschio; Fabio Busonero; Antonella Mulas; Georg B Ehret; Ashley A Fink; Alan B Weder; Richard S Cooper; Pilar Galan; Aravinda Chakravarti; David Schlessinger; Antonio Cao; Edward Lakatta; Gonçalo R Abecasis
Journal:  PLoS Genet       Date:  2007-07       Impact factor: 5.917

View more
  14 in total

1.  A cautionary note on the impact of protocol changes for genome-wide association SNP × SNP interaction studies: an example on ankylosing spondylitis.

Authors:  Kyrylo Bessonov; Elena S Gusareva; Kristel Van Steen
Journal:  Hum Genet       Date:  2015-05-05       Impact factor: 4.132

2.  An Efficient Nonlinear Regression Approach for Genome-wide Detection of Marginal and Interacting Genetic Variations.

Authors:  Seunghak Lee; Aurélie Lozano; Prabhanjan Kambadur; Eric P Xing
Journal:  J Comput Biol       Date:  2016-05       Impact factor: 1.479

3.  FastChi: an efficient algorithm for analyzing gene-gene interactions.

Authors:  Xiang Zhang; Fei Zou; Wei Wang
Journal:  Pac Symp Biocomput       Date:  2009

4.  TEAM: efficient two-locus epistasis tests in human genome-wide association study.

Authors:  Xiang Zhang; Shunping Huang; Fei Zou; Wei Wang
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

5.  Ultrafast genome-wide scan for SNP-SNP interactions in common complex disease.

Authors:  Snehit Prabhu; Itsik Pe'er
Journal:  Genome Res       Date:  2012-07-05       Impact factor: 9.043

6.  Chapter 10: Mining genome-wide genetic markers.

Authors:  Xiang Zhang; Shunping Huang; Zhaojun Zhang; Wei Wang
Journal:  PLoS Comput Biol       Date:  2012-12-27       Impact factor: 4.475

7.  Epistasis detection on quantitative phenotypes by exhaustive enumeration using GPUs.

Authors:  Tony Kam-Thong; Benno Pütz; Nazanin Karbalai; Bertram Müller-Myhsok; Karsten Borgwardt
Journal:  Bioinformatics       Date:  2011-07-01       Impact factor: 6.937

8.  Tools for efficient epistasis detection in genome-wide association study.

Authors:  Xiang Zhang; Shunping Huang; Fei Zou; Wei Wang
Journal:  Source Code Biol Med       Date:  2011-01-04

9.  GWIS--model-free, fast and exhaustive search for epistatic interactions in case-control GWAS.

Authors:  Benjamin Goudey; David Rawlinson; Qiao Wang; Fan Shi; Herman Ferra; Richard M Campbell; Linda Stern; Michael T Inouye; Cheng Soon Ong; Adam Kowalczyk
Journal:  BMC Genomics       Date:  2013-05-28       Impact factor: 3.969

10.  eQTL Epistasis - Challenges and Computational Approaches.

Authors:  Yang Huang; Stefan Wuchty; Teresa M Przytycka
Journal:  Front Genet       Date:  2013-05-31       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.