Literature DB >> 18671250

A computationally efficient hypothesis testing method for epistasis analysis using multifactor dimensionality reduction.

Kristine A Pattin1, Bill C White, Nate Barney, Jiang Gui, Heather H Nelson, Karl T Kelsey, Angeline S Andrew, Margaret R Karagas, Jason H Moore.   

Abstract

Multifactor dimensionality reduction (MDR) was developed as a nonparametric and model-free data mining method for detecting, characterizing, and interpreting epistasis in the absence of significant main effects in genetic and epidemiologic studies of complex traits such as disease susceptibility. The goal of MDR is to change the representation of the data using a constructive induction algorithm to make nonadditive interactions easier to detect using any classification method such as naïve Bayes or logistic regression. Traditionally, MDR constructed variables have been evaluated with a naïve Bayes classifier that is combined with 10-fold cross validation to obtain an estimate of predictive accuracy or generalizability of epistasis models. Traditionally, we have used permutation testing to statistically evaluate the significance of models obtained through MDR. The advantage of permutation testing is that it controls for false positives due to multiple testing. The disadvantage is that permutation testing is computationally expensive. This is an important issue that arises in the context of detecting epistasis on a genome-wide scale. The goal of the present study was to develop and evaluate several alternatives to large-scale permutation testing for assessing the statistical significance of MDR models. Using data simulated from 70 different epistasis models, we compared the power and type I error rate of MDR using a 1,000-fold permutation test with hypothesis testing using an extreme value distribution (EVD). We find that this new hypothesis testing method provides a reasonable alternative to the computationally expensive 1,000-fold permutation test and is 50 times faster. We then demonstrate this new method by applying it to a genetic epidemiology study of bladder cancer susceptibility that was previously analyzed using MDR and assessed using a 1,000-fold permutation test.

Entities:  

Mesh:

Year:  2009        PMID: 18671250      PMCID: PMC2700860          DOI: 10.1002/gepi.20360

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  21 in total

1.  A complete enumeration and classification of two-locus disease models.

Authors:  W Li; J Reich
Journal:  Hum Hered       Date:  2000 Nov-Dec       Impact factor: 0.444

2.  Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions.

Authors:  Lance W Hahn; Marylyn D Ritchie; Jason H Moore
Journal:  Bioinformatics       Date:  2003-02-12       Impact factor: 6.937

3.  Genes, environment, and cardiovascular disease.

Authors:  Charles F Sing; Jari H Stengård; Sharon L R Kardia
Journal:  Arterioscler Thromb Vasc Biol       Date:  2003-05-01       Impact factor: 8.311

4.  Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity.

Authors:  Marylyn D Ritchie; Lance W Hahn; Jason H Moore
Journal:  Genet Epidemiol       Date:  2003-02       Impact factor: 2.135

Review 5.  Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans.

Authors:  Heather J Cordell
Journal:  Hum Mol Genet       Date:  2002-10-01       Impact factor: 6.150

Review 6.  New strategies for identifying gene-gene interactions in hypertension.

Authors:  Jason H Moore; Scott M Williams
Journal:  Ann Med       Date:  2002       Impact factor: 4.709

7.  Efficient computation of significance levels for multiple associations in large studies of correlated data, including genomewide association studies.

Authors:  Frank Dudbridge; Bobby P C Koeleman
Journal:  Am J Hum Genet       Date:  2004-07-19       Impact factor: 11.025

8.  The ubiquitous nature of epistasis in determining susceptibility to common human diseases.

Authors:  Jason H Moore
Journal:  Hum Hered       Date:  2003       Impact factor: 0.444

9.  A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction.

Authors:  Digna R Velez; Bill C White; Alison A Motsinger; William S Bush; Marylyn D Ritchie; Scott M Williams; Jason H Moore
Journal:  Genet Epidemiol       Date:  2007-05       Impact factor: 2.135

10.  The future of genetic studies of complex human diseases.

Authors:  N Risch; K Merikangas
Journal:  Science       Date:  1996-09-13       Impact factor: 47.728

View more
  42 in total

1.  A screening methodology based on Random Forests to improve the detection of gene-gene interactions.

Authors:  Lizzy De Lobel; Pierre Geurts; Guy Baele; Francesc Castro-Giner; Manolis Kogevinas; Kristel Van Steen
Journal:  Eur J Hum Genet       Date:  2010-05-12       Impact factor: 4.246

2.  A simple and computationally efficient sampling approach to covariate adjustment for multifactor dimensionality reduction analysis of epistasis.

Authors:  Jiang Gui; Angeline S Andrew; Peter Andrews; Heather M Nelson; Karl T Kelsey; Margaret R Karagas; Jason H Moore
Journal:  Hum Hered       Date:  2010-10-01       Impact factor: 0.444

3.  A robust multifactor dimensionality reduction method for detecting gene-gene interactions with application to the genetic analysis of bladder cancer susceptibility.

Authors:  Jiang Gui; Angeline S Andrew; Peter Andrews; Heather M Nelson; Karl T Kelsey; Margaret R Karagas; Jason H Moore
Journal:  Ann Hum Genet       Date:  2010-11-22       Impact factor: 1.670

4.  A general framework for formal tests of interaction after exhaustive search methods with applications to MDR and MDR-PDT.

Authors:  Todd L Edwards; Stephen D Turner; Eric S Torstenson; Scott M Dudek; Eden R Martin; Marylyn D Ritchie
Journal:  PLoS One       Date:  2010-02-23       Impact factor: 3.240

5.  Analysis of gene-gene interactions.

Authors:  Diane Gilbert-Diamond; Jason H Moore
Journal:  Curr Protoc Hum Genet       Date:  2011-07

6.  Association of natriuretic peptide polymorphisms with left ventricular dysfunction in southern Han Chinese coronary artery disease patients.

Authors:  Zhijun Wu; Min Xu; Haihui Sheng; Yuqing Lou; Xiuxiu Su; Yanjia Chen; Lin Lu; Yan Liu; Wei Jin
Journal:  Int J Clin Exp Pathol       Date:  2014-09-15

7.  Methods for optimizing statistical analyses in pharmacogenomics research.

Authors:  Stephen D Turner; Dana C Crawford; Marylyn D Ritchie
Journal:  Expert Rev Clin Pharmacol       Date:  2009-09-01       Impact factor: 5.045

8.  Tag polymorphisms of solute carrier family 12 member 3 gene modify the risk of hypertension in northeastern Han Chinese.

Authors:  Y L Wang; Y Qi; J N Bai; Z M Qi; J R Li; H Y Zhao; Y F Wang; C Z Lu; Y Xiao; N Jia; B Wang; W Q Niu
Journal:  J Hum Hypertens       Date:  2014-01-16       Impact factor: 3.012

9.  Enabling personal genomics with an explicit test of epistasis.

Authors:  Casey S Greene; Daniel S Himmelstein; Heather H Nelson; Karl T Kelsey; Scott M Williams; Angeline S Andrew; Margaret R Karagas; Jason H Moore
Journal:  Pac Symp Biocomput       Date:  2010

Review 10.  Bioinformatics challenges for genome-wide association studies.

Authors:  Jason H Moore; Folkert W Asselbergs; Scott M Williams
Journal:  Bioinformatics       Date:  2010-01-06       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.