Literature DB >> 27159633

An Efficient Nonlinear Regression Approach for Genome-wide Detection of Marginal and Interacting Genetic Variations.

Seunghak Lee1, Aurélie Lozano2, Prabhanjan Kambadur3, Eric P Xing1.   

Abstract

Genome-wide association studies have revealed individual genetic variants associated with phenotypic traits such as disease risk and gene expressions. However, detecting pairwise interaction effects of genetic variants on traits still remains a challenge due to a large number of combinations of variants (∼10(11) SNP pairs in the human genome), and relatively small sample sizes (typically <10(4)). Despite recent breakthroughs in detecting interaction effects, there are still several open problems, including: (1) how to quickly process a large number of SNP pairs, (2) how to distinguish between true signals and SNPs/SNP pairs merely correlated with true signals, (3) how to detect nonlinear associations between SNP pairs and traits given small sample sizes, and (4) how to control false positives. In this article, we present a unified framework, called SPHINX, which addresses the aforementioned challenges. We first propose a piecewise linear model for interaction detection, because it is simple enough to estimate model parameters given small sample sizes but complex enough to capture nonlinear interaction effects. Then, based on the piecewise linear model, we introduce randomized group lasso under stability selection, and a screening algorithm to address the statistical and computational challenges mentioned above. In our experiments, we first demonstrate that SPHINX achieves better power than existing methods for interaction detection under false positive control. We further applied SPHINX to late-onset Alzheimer's disease dataset, and report 16 SNPs and 17 SNP pairs associated with gene traits. We also present a highly scalable implementation of our screening algorithm, which can screen ∼118 billion candidates of associations on a 60-node cluster in <5.5 hours.

Entities:  

Keywords:  SNP-SNP interaction; genome-wide association mapping; group lasso; piecewise linear model screening; stability selection

Mesh:

Year:  2016        PMID: 27159633      PMCID: PMC4876555          DOI: 10.1089/cmb.2015.0202

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  29 in total

1.  Penalized logistic regression for detecting gene interactions.

Authors:  Mee Young Park; Trevor Hastie
Journal:  Biostatistics       Date:  2007-04-11       Impact factor: 5.899

2.  The human leukocyte antigen class III haplotype approach: new insight in Alzheimer's disease inflammation hypothesis.

Authors:  Elisa Maggioli; Chiara Boiocchi; Michele Zorzetto; Elena Sinforiani; Cristina Cereda; Giovanni Ricevuti; Mariaclara Cuccia
Journal:  Curr Alzheimer Res       Date:  2013-12       Impact factor: 3.498

3.  HLA class I, II & III genes in confirmed late-onset Alzheimer's disease.

Authors:  D J Lehmann; H Wiebusch; S E Marshall; C Johnston; D R Warden; K Morgan; K Schappert; J Poirier; J Xuereb; N Kalsheker; K I Welsh; A D Smith
Journal:  Neurobiol Aging       Date:  2001 Jan-Feb       Impact factor: 4.673

4.  Progesterone increases dynorphin a concentrations in cerebrospinal fluid and preprodynorphin messenger ribonucleic Acid levels in a subset of dynorphin neurons in the sheep.

Authors:  Chad D Foradori; Robert L Goodman; Van L Adams; Miroslav Valent; Michael N Lehman
Journal:  Endocrinology       Date:  2005-01-13       Impact factor: 4.736

5.  HIGH DIMENSIONAL VARIABLE SELECTION.

Authors:  Larry Wasserman; Kathryn Roeder
Journal:  Ann Stat       Date:  2009-01-01       Impact factor: 4.028

6.  HLA-A*01 is associated with late onset of Alzheimer's disease in Italian patients.

Authors:  F R Guerini; C Tinelli; E Calabrese; C Agliardi; M Zanzottera; A De Silvestri; M Franceschi; L M E Grimaldi; R Nemni; M Clerici
Journal:  Int J Immunopathol Pharmacol       Date:  2009 Oct-Dec       Impact factor: 3.219

7.  PUMA: a unified framework for penalized multiple regression analysis of GWAS data.

Authors:  Gabriel E Hoffman; Benjamin A Logsdon; Jason G Mezey
Journal:  PLoS Comput Biol       Date:  2013-06-27       Impact factor: 4.475

8.  Replication of the association of HLA-B7 with Alzheimer's disease: a role for homozygosity?

Authors:  Donald J Lehmann; Martin C N M Barnardo; Susan Fuggle; Isabel Quiroga; Andrew Sutherland; Donald R Warden; Lin Barnetson; Roger Horton; Stephan Beck; A David Smith
Journal:  J Neuroinflammation       Date:  2006-12-18       Impact factor: 8.322

9.  Data-driven encoding for quantitative genetic trait prediction.

Authors:  Dan He; Zhanyong Wang; Laxmi Parida
Journal:  BMC Bioinformatics       Date:  2015-02-18       Impact factor: 3.169

10.  Statistical estimation of correlated genome associations to a quantitative trait network.

Authors:  Seyoung Kim; Eric P Xing
Journal:  PLoS Genet       Date:  2009-08-14       Impact factor: 5.917

View more
  2 in total

1.  Deep polygenic neural network for predicting and identifying yield-associated genes in Indonesian rice accessions.

Authors:  Nicholas Dominic; Tjeng Wawan Cenggoro; Arif Budiarto; Bens Pardamean
Journal:  Sci Rep       Date:  2022-08-15       Impact factor: 4.996

2.  pulver: an R package for parallel ultra-rapid p-value computation for linear regression interaction terms.

Authors:  Sophie Molnos; Clemens Baumbach; Simone Wahl; Martina Müller-Nurasyid; Konstantin Strauch; Rui Wang-Sattler; Melanie Waldenberger; Thomas Meitinger; Jerzy Adamski; Gabi Kastenmüller; Karsten Suhre; Annette Peters; Harald Grallert; Fabian J Theis; Christian Gieger
Journal:  BMC Bioinformatics       Date:  2017-09-29       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.