Literature DB >> 18205210

An ensemble learning approach jointly modeling main and interaction effects in genetic association studies.

Zhaogong Zhang1, Shuanglin Zhang, Man-Yu Wong, Nicholas J Wareham, Qiuying Sha.   

Abstract

Complex diseases are presumed to be the results of interactions of several genes and environmental factors, with each gene only having a small effect on the disease. Thus, the methods that can account for gene-gene interactions to search for a set of marker loci in different genes or across genome and to analyze these loci jointly are critical. In this article, we propose an ensemble learning approach (ELA) to detect a set of loci whose main and interaction effects jointly have a significant association with the trait. In the ELA, we first search for "base learners" and then combine the effects of the base learners by a linear model. Each base learner represents a main effect or an interaction effect. The result of the ELA is easy to interpret. When the ELA is applied to analyze a data set, we can get a final model, an overall P-value of the association test between the set of loci involved in the final model and the trait, and an importance measure for each base learner and each marker involved in the final model. The final model is a linear combination of some base learners. We know which base learner represents a main effect and which one represents an interaction effect. The importance measure of each base learner or marker can tell us the relative importance of the base learner or marker in the final model. We used intensive simulation studies as well as a real data set to evaluate the performance of the ELA. Our simulation studies demonstrated that the ELA is more powerful than the single-marker test in all the simulation scenarios. The ELA also outperformed the other three existing multi-locus methods in almost all cases. In an application to a large-scale case-control study for Type 2 diabetes, the ELA identified 11 single nucleotide polymorphisms that have a significant multi-locus effect (P-value=0.01), while none of the single nucleotide polymorphisms showed significant marginal effects and none of the two-locus combinations showed significant two-locus interaction effects.

Entities:  

Mesh:

Year:  2008        PMID: 18205210      PMCID: PMC3572743          DOI: 10.1002/gepi.20304

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  48 in total

1.  Genetic analysis of case/control data using estimated haplotype frequencies: application to APOE locus variation and Alzheimer's disease.

Authors:  D Fallin; A Cohen; L Essioux; I Chumakov; M Blumenfeld; D Cohen; N J Schork
Journal:  Genome Res       Date:  2001-01       Impact factor: 9.043

2.  A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation.

Authors:  M R Nelson; S L Kardia; R E Ferrell; C F Sing
Journal:  Genome Res       Date:  2001-03       Impact factor: 9.043

3.  Trimming, weighting, and grouping SNPs in human case-control association studies.

Authors:  J Hoh; A Wille; J Ott
Journal:  Genome Res       Date:  2001-12       Impact factor: 9.043

4.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

5.  Oligogenic combinations associated with breast cancer risk in women under 53 years of age.

Authors:  Christopher E Aston; David A Ralph; Dominique P Lalo; Sharmila Manjeshwar; Bobby A Gramling; Daniele C DeFreese; Amy D West; Dannielle E Branam; Linda F Thompson; Melissa A Craft; Debra S Mitchell; Craig D Shimasaki; John J Mulvihill; Eldon R Jupe
Journal:  Hum Genet       Date:  2004-12-21       Impact factor: 4.132

6.  Identifying SNPs predictive of phenotype using random forests.

Authors:  Alexandre Bureau; Josée Dupuis; Kathleen Falls; Kathryn L Lunetta; Brooke Hayward; Tim P Keith; Paul Van Eerdewegh
Journal:  Genet Epidemiol       Date:  2005-02       Impact factor: 2.135

7.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer.

Authors:  M D Ritchie; L W Hahn; N Roodi; L R Bailey; W D Dupont; F F Parl; J H Moore
Journal:  Am J Hum Genet       Date:  2001-06-11       Impact factor: 11.025

8.  Statistical modeling of interlocus interactions in a complex disease: rejection of the multiplicative model of epistasis in type 1 diabetes.

Authors:  H J Cordell; J A Todd; N J Hill; C J Lord; P A Lyons; L B Peterson; L S Wicker; D G Clayton
Journal:  Genetics       Date:  2001-05       Impact factor: 4.562

Review 9.  Searching for genetic determinants in the new millennium.

Authors:  N J Risch
Journal:  Nature       Date:  2000-06-15       Impact factor: 49.962

10.  Screening large-scale association study data: exploiting interactions using random forests.

Authors:  Kathryn L Lunetta; L Brooke Hayward; Jonathan Segal; Paul Van Eerdewegh
Journal:  BMC Genet       Date:  2004-12-10       Impact factor: 2.797

View more
  10 in total

1.  Efficient simulation of epistatic interactions in case-parent trios.

Authors:  Qing Li; Holger Schwender; Thomas A Louis; M Daniele Fallin; Ingo Ruczinski
Journal:  Hum Hered       Date:  2013-03-27       Impact factor: 0.444

2.  Identification of interacting genes in genome-wide association studies using a model-based two-stage approach.

Authors:  Zhaogong Zhang; Adan Niu; Qiuying Sha
Journal:  Ann Hum Genet       Date:  2010-07-15       Impact factor: 1.670

Review 3.  Detecting gene-gene interactions that underlie human diseases.

Authors:  Heather J Cordell
Journal:  Nat Rev Genet       Date:  2009-06       Impact factor: 53.242

4.  SNP selection in genome-wide and candidate gene studies via penalized logistic regression.

Authors:  Kristin L Ayers; Heather J Cordell
Journal:  Genet Epidemiol       Date:  2010-12       Impact factor: 2.135

5.  Analysis of North American Rheumatoid Arthritis Consortium data using a penalized logistic regression approach.

Authors:  Pascal Croiseau; Heather J Cordell
Journal:  BMC Proc       Date:  2009-12-15

6.  Detecting purely epistatic multi-locus interactions by an omnibus permutation test on ensembles of two-locus analyses.

Authors:  Waranyu Wongseree; Anunchai Assawamakin; Theera Piroonratana; Saravudh Sinsomros; Chanin Limwongse; Nachol Chaiyaratana
Journal:  BMC Bioinformatics       Date:  2009-09-17       Impact factor: 3.169

7.  Identification of multiple gene-gene interactions for ordinal phenotypes.

Authors:  Kyunga Kim; Min-Seok Kwon; Sohee Oh; Taesung Park
Journal:  BMC Med Genomics       Date:  2013-05-07       Impact factor: 3.063

8.  Ensemble survival tree models to reveal pairwise interactions of variables with time-to-events outcomes in low-dimensional setting.

Authors:  Jean-Eudes Dazard; Hemant Ishwaran; Rajeev Mehlotra; Aaron Weinberg; Peter Zimmerman
Journal:  Stat Appl Genet Mol Biol       Date:  2018-02-17

9.  An omnibus permutation test on ensembles of two-locus analyses can detect pure epistasis and genetic heterogeneity in genome-wide association studies.

Authors:  Damrongrit Setsirichok; Phuwadej Tienboon; Nattapong Jaroonruang; Somkit Kittichaijaroen; Waranyu Wongseree; Theera Piroonratana; Touchpong Usavanarong; Chanin Limwongse; Chatchawit Aporntewan; Marong Phadoongsidhi; Nachol Chaiyaratana
Journal:  Springerplus       Date:  2013-05-19

10.  Detecting gene-gene interactions for complex quantitative traits using generalized fuzzy classification.

Authors:  Xiangdong Zhou; Keith C C Chan
Journal:  BMC Bioinformatics       Date:  2018-09-18       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.