Literature DB >> 21291414

A comparison of multifactor dimensionality reduction and L1-penalized regression to identify gene-gene interactions in genetic association studies.

Stacey Winham1, Chong Wang, Alison A Motsinger-Reif.   

Abstract

Recently, the amount of high-dimensional data has exploded, creating new analytical challenges for human genetics. Furthermore, much evidence suggests that common complex diseases may be due to complex etiologies such as gene-gene interactions, which are difficult to identify in high-dimensional data using traditional statistical approaches. Data-mining approaches are gaining popularity for variable selection in association studies, and one of the most commonly used methods to evaluate potential gene-gene interactions is Multifactor Dimensionality Reduction (MDR). Additionally, a number of penalized regression techniques, such as Lasso, are gaining popularity within the statistical community and are now being applied to association studies, including extensions for interactions. In this study, we compare the performance of MDR, the traditional lasso with L1 penalty (TL1), and the group lasso for categorical data with group-wise L1 penalty (GL1) to detect gene-gene interactions through a broad range of simulations. We find that each method has both advantages and disadvantages, and relative performance is context dependent. TL1 frequently over-fits, identifying false positive as well as true positive loci. MDR has higher power for epistatic models that exhibit independent main effects; for both Lasso methods, main effects tend to dominate. For purely epistatic models, GL1 has the best performance for lower minor allele frequencies, but MDR performs best for higher frequencies. These results provide guidance of when each approach might be best suited for detecting and characterizing interactions with different mechanisms.

Entities:  

Mesh:

Year:  2011        PMID: 21291414      PMCID: PMC3045083          DOI: 10.2202/1544-6115.1613

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  23 in total

1.  A complete enumeration and classification of two-locus disease models.

Authors:  W Li; J Reich
Journal:  Hum Hered       Date:  2000 Nov-Dec       Impact factor: 0.444

2.  Two-locus models of disease.

Authors:  R J Neuman; J P Rice
Journal:  Genet Epidemiol       Date:  1992       Impact factor: 2.135

3.  A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction.

Authors:  Digna R Velez; Bill C White; Alison A Motsinger; William S Bush; Marylyn D Ritchie; Scott M Williams; Jason H Moore
Journal:  Genet Epidemiol       Date:  2007-05       Impact factor: 2.135

4.  ABCB1 and GST polymorphisms associated with TP53 status in breast cancer.

Authors:  Silje H Nordgard; Marylyn D Ritchie; Sigrid D Jensrud; Alison A Motsinger; Grethe I G Alnaes; Gordon Lemmon; Marianne Berg; Stephanie Geisler; Jason H Moore; Per Eystein Lønning; Anne-Lise Børresen-Dale; Vessela N Kristensen
Journal:  Pharmacogenet Genomics       Date:  2007-02       Impact factor: 2.089

5.  Multifactor dimensionality reduction reveals gene-gene interactions associated with multiple sclerosis susceptibility in African Americans.

Authors:  D Brassat; A A Motsinger; S J Caillier; H A Erlich; K Walker; L L Steiner; B A C Cree; L F Barcellos; M A Pericak-Vance; S Schmidt; S Gregory; S L Hauser; J L Haines; J R Oksenberg; M D Ritchie
Journal:  Genes Immun       Date:  2006-04-20       Impact factor: 2.676

6.  Penalized logistic regression for detecting gene interactions.

Authors:  Mee Young Park; Trevor Hastie
Journal:  Biostatistics       Date:  2007-04-11       Impact factor: 5.899

7.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

8.  Screen and clean: a tool for identifying interactions in genome-wide association studies.

Authors:  Jing Wu; Bernie Devlin; Steven Ringquist; Massimo Trucco; Kathryn Roeder
Journal:  Genet Epidemiol       Date:  2010-04       Impact factor: 2.135

9.  The effect of alternative permutation testing strategies on the performance of multifactor dimensionality reduction.

Authors:  Alison A Motsinger-Reif
Journal:  BMC Res Notes       Date:  2008-12-30

10.  Power of multifactor dimensionality reduction and penalized logistic regression for detecting gene-gene interaction in a case-control study.

Authors:  Hua He; William S Oetting; Marcia J Brott; Saonli Basu
Journal:  BMC Med Genet       Date:  2009-12-04       Impact factor: 2.103

View more
  4 in total

1.  Analysis of gene-gene interactions.

Authors:  Diane Gilbert-Diamond; Jason H Moore
Journal:  Curr Protoc Hum Genet       Date:  2011-07

2.  Detecting genetic epistasis by differential departure from independence.

Authors:  Ruby Sharma; Zeinab Sadeghian Tehrani; Sajal Kumar; Mingzhou Song
Journal:  Mol Genet Genomics       Date:  2022-05-23       Impact factor: 3.291

Review 3.  Detecting epistasis in human complex traits.

Authors:  Wen-Hua Wei; Gibran Hemani; Chris S Haley
Journal:  Nat Rev Genet       Date:  2014-09-09       Impact factor: 53.242

4.  Stability SCAD: a powerful approach to detect interactions in large-scale genomic study.

Authors:  Jianwei Gou; Yang Zhao; Yongyue Wei; Chen Wu; Ruyang Zhang; Yongyong Qiu; Ping Zeng; Wen Tan; Dianke Yu; Tangchun Wu; Zhibin Hu; Dongxin Lin; Hongbing Shen; Feng Chen
Journal:  BMC Bioinformatics       Date:  2014-03-01       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.