Literature DB >> 26457126

A FAST ALGORITHM FOR DETECTING GENE-GENE INTERACTIONS IN GENOME-WIDE ASSOCIATION STUDIES.

Jiahan Li1, Wei Zhong2, Runze Li3, Rongling Wu4.   

Abstract

With the recent advent of high-throughput genotyping techniques, genetic data for genome-wide association studies (GWAS) have become increasingly available, which entails the development of efficient and effective statistical approaches. Although many such approaches have been developed and used to identify single-nucleotide polymorphisms (SNPs) that are associated with complex traits or diseases, few are able to detect gene-gene interactions among different SNPs. Genetic interactions, also known as epistasis, have been recognized to play a pivotal role in contributing to the genetic variation of phenotypic traits. However, because of an extremely large number of SNP-SNP combinations in GWAS, the model dimensionality can quickly become so overwhelming that no prevailing variable selection methods are capable of handling this problem. In this paper, we present a statistical framework for characterizing main genetic effects and epistatic interactions in a GWAS study. Specifically, we first propose a two-stage sure independence screening (TS-SIS) procedure and generate a pool of candidate SNPs and interactions, which serve as predictors to explain and predict the phenotypes of a complex trait. We also propose a rates adjusted thresholding estimation (RATE) approach to determine the size of the reduced model selected by an independence screening. Regularization regression methods, such as LASSO or SCAD, are then applied to further identify important genetic effects. Simulation studies show that the TS-SIS procedure is computationally efficient and has an outstanding finite sample performance in selecting potential SNPs as well as gene-gene interactions. We apply the proposed framework to analyze an ultrahigh-dimensional GWAS data set from the Framingham Heart Study, and select 23 active SNPs and 24 active epistatic interactions for the body mass index variation. It shows the capability of our procedure to resolve the complexity of genetic control.

Entities:  

Keywords:  GWAS; Gene–gene interaction; high-dimensional data; sure independence screening; variable selection

Year:  2014        PMID: 26457126      PMCID: PMC4595934          DOI: 10.1214/14-aoas771

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  47 in total

1.  Variable Selection using MM Algorithms.

Authors:  David R Hunter; Runze Li
Journal:  Ann Stat       Date:  2005       Impact factor: 4.028

2.  SNPHarvester: a filtering-based approach for detecting epistatic interactions in genome-wide association studies.

Authors:  Can Yang; Zengyou He; Xiang Wan; Qiang Yang; Hong Xue; Weichuan Yu
Journal:  Bioinformatics       Date:  2008-12-19       Impact factor: 6.937

3.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

4.  Machine learning in genome-wide association studies.

Authors:  Silke Szymczak; Joanna M Biernacka; Heather J Cordell; Oscar González-Recio; Inke R König; Heping Zhang; Yan V Sun
Journal:  Genet Epidemiol       Date:  2009       Impact factor: 2.135

5.  Brain dopamine and obesity.

Authors:  G J Wang; N D Volkow; J Logan; N R Pappas; C T Wong; W Zhu; N Netusil; J S Fowler
Journal:  Lancet       Date:  2001-02-03       Impact factor: 79.321

6.  Screen and clean: a tool for identifying interactions in genome-wide association studies.

Authors:  Jing Wu; Bernie Devlin; Steven Ringquist; Massimo Trucco; Kathryn Roeder
Journal:  Genet Epidemiol       Date:  2010-04       Impact factor: 2.135

Review 7.  Genomewide association studies: history, rationale, and prospects for psychiatric disorders.

Authors:  Sven Cichon; Nick Craddock; Mark Daly; Stephen V Faraone; Pablo V Gejman; John Kelsoe; Thomas Lehner; Douglas F Levinson; Audra Moran; Pamela Sklar; Patrick F Sullivan
Journal:  Am J Psychiatry       Date:  2009-04-01       Impact factor: 18.112

8.  Statistical power of model selection strategies for genome-wide association studies.

Authors:  Zheyang Wu; Hongyu Zhao
Journal:  PLoS Genet       Date:  2009-07-31       Impact factor: 5.917

9.  Evaluation of random forests performance for genome-wide association studies in the presence of interaction effects.

Authors:  Yoonhee Kim; Robert Wojciechowski; Heejong Sung; Rasika A Mathias; Li Wang; Alison P Klein; Rhoshel K Lenroot; James Malley; Joan E Bailey-Wilson
Journal:  BMC Proc       Date:  2009-12-15

10.  Genome-wide association scan shows genetic variants in the FTO gene are associated with obesity-related traits.

Authors:  Angelo Scuteri; Serena Sanna; Wei-Min Chen; Manuela Uda; Giuseppe Albai; James Strait; Samer Najjar; Ramaiah Nagaraja; Marco Orrú; Gianluca Usala; Mariano Dei; Sandra Lai; Andrea Maschio; Fabio Busonero; Antonella Mulas; Georg B Ehret; Ashley A Fink; Alan B Weder; Richard S Cooper; Pilar Galan; Aravinda Chakravarti; David Schlessinger; Antonio Cao; Edward Lakatta; Gonçalo R Abecasis
Journal:  PLoS Genet       Date:  2007-07       Impact factor: 5.917

View more
  12 in total

Review 1.  Mapping complex traits as a dynamic system.

Authors:  Lidan Sun; Rongling Wu
Journal:  Phys Life Rev       Date:  2015-02-20       Impact factor: 11.025

2.  Sparse group variable selection for gene-environment interactions in the longitudinal study.

Authors:  Fei Zhou; Xi Lu; Jie Ren; Kun Fan; Shuangge Ma; Cen Wu
Journal:  Genet Epidemiol       Date:  2022-06-29       Impact factor: 2.344

3.  A selective overview of feature screening for ultrahigh-dimensional data.

Authors:  Liu JingYuan; Zhong Wei; L I RunZe
Journal:  Sci China Math       Date:  2015-08-22       Impact factor: 1.331

4.  Statistical inference of genetic pathway analysis in high dimensions.

Authors:  Yang Liu; Wei Sun; Alexander P Reiner; Charles Kooperberg; Qianchuan He
Journal:  Biometrika       Date:  2019-07-13       Impact factor: 2.445

5.  A block mixture model to map eQTLs for gene clustering and networking.

Authors:  Ningtao Wang; Kirk Gosik; Runze Li; Bruce Lindsay; Rongling Wu
Journal:  Sci Rep       Date:  2016-02-19       Impact factor: 4.379

6.  Combining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies.

Authors:  Bettina Mieth; Marius Kloft; Juan Antonio Rodríguez; Sören Sonnenburg; Robin Vobruba; Carlos Morcillo-Suárez; Xavier Farré; Urko M Marigorta; Ernst Fehr; Thorsten Dickhaus; Gilles Blanchard; Daniel Schunk; Arcadi Navarro; Klaus-Robert Müller
Journal:  Sci Rep       Date:  2016-11-28       Impact factor: 4.379

7.  Detection of Epistasis for Flowering Time Using Bayesian Multilocus Estimation in a Barley MAGIC Population.

Authors:  Boby Mathew; Jens Léon; Wiebke Sannemann; Mikko J Sillanpää
Journal:  Genetics       Date:  2017-12-18       Impact factor: 4.562

8.  Robustification of GWAS to explore effective SNPs addressing the challenges of hidden population stratification and polygenic effects.

Authors:  Zobaer Akond; Md Asif Ahsan; Munirul Alam; Md Nurul Haque Mollah
Journal:  Sci Rep       Date:  2021-06-22       Impact factor: 4.379

9.  Mucosal microbiome dysbiosis in gastric carcinogenesis.

Authors:  Olabisi Oluwabukola Coker; Zhenwei Dai; Yongzhan Nie; Guijun Zhao; Lei Cao; Geicho Nakatsu; William Kk Wu; Sunny Hei Wong; Zigui Chen; Joseph J Y Sung; Jun Yu
Journal:  Gut       Date:  2017-08-01       Impact factor: 23.059

10.  Overlapping group screening for detection of gene-gene interactions: application to gene expression profiles with survival trait.

Authors:  Jie-Huei Wang; Yi-Hau Chen
Journal:  BMC Bioinformatics       Date:  2018-09-21       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.