MOTIVATION: In genome-wide association studies (GWAS) of complex diseases, genetic variants having real but weak associations often fail to be detected at the stringent genome-wide significance level. Pathway analysis, which tests disease association with combined association signals from a group of variants in the same pathway, has become increasingly popular. However, because of the complexities in genetic data and the large sample sizes in typical GWAS, pathway analysis remains to be challenging. We propose a new statistical model for pathway analysis of GWAS. This model includes a fixed effects component that models mean disease association for a group of genes, and a random effects component that models how each gene's association with disease varies about the gene group mean, thus belongs to the class of mixed effects models. RESULTS: The proposed model is computationally efficient and uses only summary statistics. In addition, it corrects for the presence of overlapping genes and linkage disequilibrium (LD). Via simulated and real GWAS data, we showed our model improved power over currently available pathway analysis methods while preserving type I error rate. Furthermore, using the WTCCC Type 1 Diabetes (T1D) dataset, we demonstrated mixed model analysis identified meaningful biological processes that agreed well with previous reports on T1D. Therefore, the proposed methodology provides an efficient statistical modeling framework for systems analysis of GWAS. AVAILABILITY: The software code for mixed models analysis is freely available at http://biostat.mc.vanderbilt.edu/LilyWang.
MOTIVATION: In genome-wide association studies (GWAS) of complex diseases, genetic variants having real but weak associations often fail to be detected at the stringent genome-wide significance level. Pathway analysis, which tests disease association with combined association signals from a group of variants in the same pathway, has become increasingly popular. However, because of the complexities in genetic data and the large sample sizes in typical GWAS, pathway analysis remains to be challenging. We propose a new statistical model for pathway analysis of GWAS. This model includes a fixed effects component that models mean disease association for a group of genes, and a random effects component that models how each gene's association with disease varies about the gene group mean, thus belongs to the class of mixed effects models. RESULTS: The proposed model is computationally efficient and uses only summary statistics. In addition, it corrects for the presence of overlapping genes and linkage disequilibrium (LD). Via simulated and real GWAS data, we showed our model improved power over currently available pathway analysis methods while preserving type I error rate. Furthermore, using the WTCCC Type 1 Diabetes (T1D) dataset, we demonstrated mixed model analysis identified meaningful biological processes that agreed well with previous reports on T1D. Therefore, the proposed methodology provides an efficient statistical modeling framework for systems analysis of GWAS. AVAILABILITY: The software code for mixed models analysis is freely available at http://biostat.mc.vanderbilt.edu/LilyWang.
Authors: R D Wolfinger; G Gibson; E D Wolfinger; L Bennett; H Hamadeh; P Bushel; C Afshari; R S Paules Journal: J Comput Biol Date: 2001 Impact factor: 1.479
Authors: Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin Journal: Am J Hum Genet Date: 2010-06-11 Impact factor: 11.025
Authors: Daniel F Schwarz; Oliver Hädicke; Jeanette Erdmann; Andreas Ziegler; Daniel Bayer; Steffen Möller Journal: Bioinformatics Date: 2007-11-17 Impact factor: 6.937
Authors: Teri A Manolio; Laura Lyman Rodriguez; Lisa Brooks; Gonçalo Abecasis; Dennis Ballinger; Mark Daly; Peter Donnelly; Stephen V Faraone; Kelly Frazer; Stacey Gabriel; Pablo Gejman; Alan Guttmacher; Emily L Harris; Thomas Insel; John R Kelsoe; Eric Lander; Norma McCowin; Matthew D Mailman; Elizabeth Nabel; James Ostell; Elizabeth Pugh; Stephen Sherry; Patrick F Sullivan; John F Thompson; James Warram; David Wholley; Patrice M Milos; Francis S Collins Journal: Nat Genet Date: 2007-09 Impact factor: 38.330
Authors: Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio Journal: Proc Natl Acad Sci U S A Date: 2009-05-27 Impact factor: 11.205
Authors: Peter Holmans; Elaine K Green; Jaspreet Singh Pahwa; Manuel A R Ferreira; Shaun M Purcell; Pamela Sklar; Michael J Owen; Michael C O'Donovan; Nick Craddock Journal: Am J Hum Genet Date: 2009-06-18 Impact factor: 11.025
Authors: Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov Journal: Proc Natl Acad Sci U S A Date: 2005-09-30 Impact factor: 11.205
Authors: David S Rowlands; Rachel A Page; William R Sukala; Mamta Giri; Svetlana D Ghimbovschi; Irum Hayat; Birinder S Cheema; Isabelle Lys; Murray Leikis; Phillip W Sheard; St John Wakefield; Bernhard Breier; Yetrib Hathout; Kristy Brown; Ramya Marathi; Funda E Orkunoglu-Suer; Joseph M Devaney; Benjamin Leiken; Gina Many; Jeremy Krebs; Will G Hopkins; Eric P Hoffman Journal: Physiol Genomics Date: 2014-08-19 Impact factor: 3.107
Authors: Claudia P Cabrera; Pau Navarro; Jennifer E Huffman; Alan F Wright; Caroline Hayward; Harry Campbell; James F Wilson; Igor Rudan; Nicholas D Hastie; Veronique Vitart; Chris S Haley Journal: G3 (Bethesda) Date: 2012-09-01 Impact factor: 3.154
Authors: Andrea S Foulkes; Gregory J Matthews; Ujjwal Das; Jane F Ferguson; Rongheng Lin; Muredach P Reilly Journal: PLoS One Date: 2013-02-06 Impact factor: 3.240