Literature DB >> 20461113

A screening methodology based on Random Forests to improve the detection of gene-gene interactions.

Lizzy De Lobel1, Pierre Geurts, Guy Baele, Francesc Castro-Giner, Manolis Kogevinas, Kristel Van Steen.   

Abstract

The search for susceptibility loci in gene-gene interactions imposes a methodological and computational challenge for statisticians because of the large dimensionality inherent to the modelling of gene-gene interactions or epistasis. In an era in which genome-wide scans have become relatively common, new powerful methods are required to handle the huge amount of feasible gene-gene interactions and to weed out false positives and negatives from these results. One solution to the dimensionality problem is to reduce data by preliminary screening of markers to select the best candidates for further analysis. Ideally, this screening step is statistically independent of the testing phase. Initially developed for small numbers of markers, the Multifactor Dimensionality Reduction (MDR) method is a nonparametric, model-free data reduction technique to associate sets of markers with optimal predictive properties to disease. In this study, we examine the power of MDR in larger data sets and compare it with other approaches that are able to identify gene-gene interactions. Under various interaction models (purely and not purely epistatic), we use a Random Forest (RF)-based prescreening method, before executing MDR, to improve its performance. We find that the power of MDR increases when noisy SNPs are first removed, by creating a collection of candidate markers with RFs. We validate our technique by extensive simulation studies and by application to asthma data from the European Committee of Respiratory Health Study II.

Mesh:

Substances:

Year:  2010        PMID: 20461113      PMCID: PMC2987456          DOI: 10.1038/ejhg.2010.48

Source DB:  PubMed          Journal:  Eur J Hum Genet        ISSN: 1018-4813            Impact factor:   4.246


  17 in total

1.  Accounting for variability in the use of permutation testing to detect quantitative trait loci.

Authors:  D Nettleton; R W Doerge
Journal:  Biometrics       Date:  2000-03       Impact factor: 2.571

2.  Positional cloning of a novel gene influencing asthma from chromosome 2q14.

Authors:  Maxine Allen; Andrea Heinzmann; Emiko Noguchi; Gonçalo Abecasis; John Broxholme; Chris P Ponting; Sumit Bhattacharyya; Jon Tinsley; Youming Zhang; Richard Holt; E Yvonne Jones; Nick Lench; Alisoun Carey; Helene Jones; Nicholas J Dickens; Claire Dimon; Rosie Nicholls; Crystal Baker; Luzheng Xue; Elizabeth Townsend; Michael Kabesch; Stephan K Weiland; David Carr; Erika von Mutius; Ian M Adcock; Peter J Barnes; G Mark Lathrop; Mark Edwards; Miriam F Moffatt; William O C M Cookson
Journal:  Nat Genet       Date:  2003-10-19       Impact factor: 38.330

3.  Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity.

Authors:  Marylyn D Ritchie; Lance W Hahn; Jason H Moore
Journal:  Genet Epidemiol       Date:  2003-02       Impact factor: 2.135

4.  The ubiquitous nature of epistasis in determining susceptibility to common human diseases.

Authors:  Jason H Moore
Journal:  Hum Hered       Date:  2003       Impact factor: 0.444

5.  Exploring the performance of Multifactor Dimensionality Reduction in large scale SNP studies and in the presence of genetic heterogeneity among epistatic disease models.

Authors:  Todd L Edwards; Kenneth Lewis; Digna R Velez; Scott Dudek; Marylyn D Ritchie
Journal:  Hum Hered       Date:  2008-12-15       Impact factor: 0.444

6.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer.

Authors:  M D Ritchie; L W Hahn; N Roodi; L R Bailey; W D Dupont; F F Parl; J H Moore
Journal:  Am J Hum Genet       Date:  2001-06-11       Impact factor: 11.025

7.  Cloning and characterization of dipeptidyl peptidase 10, a new member of an emerging subgroup of serine proteases.

Authors:  Shu Y Qi; Pierre J Riviere; Jerzy Trojnar; Jean-Louis Junien; Karen O Akinsanya
Journal:  Biochem J       Date:  2003-07-01       Impact factor: 3.857

8.  TNFA -308G>A in two international population-based cohorts and risk of asthma.

Authors:  F Castro-Giner; M Kogevinas; M Mächler; R de Cid; K Van Steen; M Imboden; C Schindler; W Berger; J R Gonzalez; K A Franklin; C Janson; D Jarvis; E Omenaas; P Burney; T Rochat; X Estivill; J M Antó; M Wjst; N M Probst-Hensch
Journal:  Eur Respir J       Date:  2008-04-02       Impact factor: 16.671

9.  Characterization of a common susceptibility locus for asthma-related traits.

Authors:  Tarja Laitinen; Anne Polvi; Pia Rydman; Johanna Vendelin; Ville Pulkkinen; Paula Salmikangas; Siru Mäkelä; Marko Rehn; Asta Pirskanen; Anna Rautanen; Marco Zucchelli; Harriet Gullstén; Marina Leino; Harri Alenius; Tuula Petäys; Tari Haahtela; Annika Laitinen; Catherine Laprise; Thomas J Hudson; Lauri A Laitinen; Juha Kere
Journal:  Science       Date:  2004-04-09       Impact factor: 47.728

10.  The protective effect of farm animal exposure on childhood allergy is modified by NPSR1 polymorphisms.

Authors:  S Bruce; F Nyberg; E Melén; A James; V Pulkkinen; C Orsmark-Pietras; A Bergström; B Dahlén; M Wickman; E von Mutius; G Doekes; R Lauener; J Riedler; W Eder; M van Hage; G Pershagen; A Scheynius; J Kere
Journal:  J Med Genet       Date:  2008-02-19       Impact factor: 6.318

View more
  24 in total

1.  TRM: a powerful two-stage machine learning approach for identifying SNP-SNP interactions.

Authors:  Hui-Yi Lin; Y Ann Chen; Ya-Yu Tsai; Xiaotao Qu; Tung-Sung Tseng; Jong Y Park
Journal:  Ann Hum Genet       Date:  2011-12-11       Impact factor: 1.670

Review 2.  Random forests for genetic association studies.

Authors:  Benjamin A Goldstein; Eric C Polley; Farren B S Briggs
Journal:  Stat Appl Genet Mol Biol       Date:  2011-07-12

3.  Identification of immune correlates of protection in Shigella infection by application of machine learning.

Authors:  Jorge M Arevalillo; Marcelo B Sztein; Karen L Kotloff; Myron M Levine; Jakub K Simon
Journal:  J Biomed Inform       Date:  2017-08-09       Impact factor: 6.317

Review 4.  Random forests for genomic data analysis.

Authors:  Xi Chen; Hemant Ishwaran
Journal:  Genomics       Date:  2012-04-21       Impact factor: 5.736

Review 5.  Integrative systems biology approaches in asthma pharmacogenomics.

Authors:  Amber Dahlin; Kelan G Tantisira
Journal:  Pharmacogenomics       Date:  2012-09       Impact factor: 2.533

6.  Trees Assembling Mann-Whitney approach for detecting genome-wide joint association among low-marginal-effect loci.

Authors:  Changshuai Wei; Daniel J Schaid; Qing Lu
Journal:  Genet Epidemiol       Date:  2012-11-07       Impact factor: 2.135

7.  Effective Analysis of Inpatient Satisfaction: The Random Forest Algorithm.

Authors:  Chengcheng Li; Conghui Liao; Xuehui Meng; Honghua Chen; Weiling Chen; Bo Wei; Pinghua Zhu
Journal:  Patient Prefer Adherence       Date:  2021-04-07       Impact factor: 2.711

8.  Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?

Authors:  Wouter G Touw; Jumamurat R Bayjanov; Lex Overmars; Lennart Backus; Jos Boekhorst; Michiel Wels; Sacha A F T van Hijum
Journal:  Brief Bioinform       Date:  2012-07-10       Impact factor: 11.622

9.  Application of a spatially-weighted Relief algorithm for ranking genetic predictors of disease.

Authors:  Matthew E Stokes; Shyam Visweswaran
Journal:  BioData Min       Date:  2012-12-03       Impact factor: 2.522

Review 10.  Detecting epistasis in human complex traits.

Authors:  Wen-Hua Wei; Gibran Hemani; Chris S Haley
Journal:  Nat Rev Genet       Date:  2014-09-09       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.