Literature DB >> 22150548

TRM: a powerful two-stage machine learning approach for identifying SNP-SNP interactions.

Hui-Yi Lin1, Y Ann Chen, Ya-Yu Tsai, Xiaotao Qu, Tung-Sung Tseng, Jong Y Park.   

Abstract

Studies have shown that interactions of single nucleotide polymorphisms (SNPs) may play an important role in understanding the causes of complex disease. We have proposed an integrated machine learning method that combines two machine-learning methods-Random Forests (RF) and Multivariate Adaptive Regression Splines (MARS)-to identify a subset of important SNPs and detect interaction patterns more effectively and efficiently. In this two-stage RF-MARS (TRM) approach, RF is first applied to detect a predictive subset of SNPs, and then MARS is used to identify the interaction patterns. We evaluated the TRM performances in four models. RF variable selection was based on out-of-bag classification error rate (OOB) and variable important spectrum (IS). Our results support that RF(OOB) had better performance than MARS and RF(IS) in detecting important variables. This study demonstrates that TRM(OOB) , which is RF(OOB) plus MARS, has combined the strengths of RF and MARS in identifying SNP-SNP interactions in a scenario of 100 candidate SNPs. TRM(OOB) had greater true positive rate and lower false positive rate compared with MARS, particularly for searching interactions with a strong association with the outcome. Therefore, the use of TRM(OOB) is favored for exploring SNP-SNP interactions in a large-scale genetic variation study.
© 2011 The Authors Annals of Human Genetics © 2011 Blackwell Publishing Ltd/University College London.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22150548      PMCID: PMC3243917          DOI: 10.1111/j.1469-1809.2011.00692.x

Source DB:  PubMed          Journal:  Ann Hum Genet        ISSN: 0003-4800            Impact factor:   1.670


  32 in total

1.  BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies.

Authors:  Xiang Wan; Can Yang; Qiang Yang; Hong Xue; Xiaodan Fan; Nelson L S Tang; Weichuan Yu
Journal:  Am J Hum Genet       Date:  2010-09-10       Impact factor: 11.025

2.  Efficiency and power in genetic association studies.

Authors:  Paul I W de Bakker; Roman Yelensky; Itsik Pe'er; Stacey B Gabriel; Mark J Daly; David Altshuler
Journal:  Nat Genet       Date:  2005-10-23       Impact factor: 38.330

3.  Logic regression for analysis of the association between genetic variation in the renin-angiotensin system and myocardial infarction or stroke.

Authors:  Charles Kooperberg; Joshua C Bis; Kristin D Marciante; Susan R Heckbert; Thomas Lumley; Bruce M Psaty
Journal:  Am J Epidemiol       Date:  2006-11-02       Impact factor: 4.897

4.  Single and multigenic analysis of the association between variants in 12 steroid hormone metabolism genes and risk of prostate cancer.

Authors:  Joke Beuten; Jonathan A L Gelfond; Jennifer L Franke; Korri S Weldon; Analisa C Crandall; Teresa L Johnson-Pais; Ian M Thompson; Robin J Leach
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2009-06       Impact factor: 4.254

5.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer.

Authors:  M D Ritchie; L W Hahn; N Roodi; L R Bailey; W D Dupont; F F Parl; J H Moore
Journal:  Am J Hum Genet       Date:  2001-06-11       Impact factor: 11.025

6.  An application of Random Forests to a genome-wide association dataset: methodological considerations & new findings.

Authors:  Benjamin A Goldstein; Alan E Hubbard; Adele Cutler; Lisa F Barcellos
Journal:  BMC Genet       Date:  2010-06-14       Impact factor: 2.797

7.  Interactions of cytokine gene polymorphisms in prostate cancer risk.

Authors:  Jovanny Zabaleta; Hui-Yi Lin; Rosa A Sierra; M Craig Hall; Peter E Clark; Oliver A Sartor; Jennifer J Hu; Augusto C Ochoa
Journal:  Carcinogenesis       Date:  2008-01-03       Impact factor: 4.944

Review 8.  Estrogen and prostate cancer: an eclipsed truth in an androgen-dominated scenario.

Authors:  Giuseppe Carruba
Journal:  J Cell Biochem       Date:  2007-11-01       Impact factor: 4.429

9.  Screening large-scale association study data: exploiting interactions using random forests.

Authors:  Kathryn L Lunetta; L Brooke Hayward; Jonathan Segal; Paul Van Eerdewegh
Journal:  BMC Genet       Date:  2004-12-10       Impact factor: 2.797

10.  Application of two machine learning algorithms to genetic association studies in the presence of covariates.

Authors:  Bareng A S Nonyane; Andrea S Foulkes
Journal:  BMC Genet       Date:  2008-11-14       Impact factor: 2.797

View more
  8 in total

1.  Coexpression and expression quantitative trait loci analyses of the angiogenesis gene-gene interaction network in prostate cancer.

Authors:  Hui-Yi Lin; Chia-Ho Cheng; Dung-Tsa Chen; Y Ann Chen; Jong Y Park
Journal:  Transl Cancer Res       Date:  2016-10       Impact factor: 1.241

2.  SNP interaction pattern identifier (SIPI): an intensive search for SNP-SNP interaction patterns.

Authors:  Hui-Yi Lin; Dung-Tsa Chen; Po-Yu Huang; Yung-Hsin Liu; Augusto Ochoa; Jovanny Zabaleta; Donald E Mercante; Zhide Fang; Thomas A Sellers; Julio M Pow-Sang; Chia-Ho Cheng; Rosalind Eeles; Doug Easton; Zsofia Kote-Jarai; Ali Amin Al Olama; Sara Benlloch; Kenneth Muir; Graham G Giles; Fredrik Wiklund; Henrik Gronberg; Christopher A Haiman; Johanna Schleutker; Børge G Nordestgaard; Ruth C Travis; Freddie Hamdy; Nora Pashayan; Kay-Tee Khaw; Janet L Stanford; William J Blot; Stephen N Thibodeau; Christiane Maier; Adam S Kibel; Cezary Cybulski; Lisa Cannon-Albright; Hermann Brenner; Radka Kaneva; Jyotsna Batra; Manuel R Teixeira; Hardev Pandha; Yong-Jie Lu; Jong Y Park
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

3.  Genome wide analysis of flowering time trait in multiple environments via high-throughput genotyping technique in Brassica napus L.

Authors:  Lun Li; Yan Long; Libin Zhang; Jessica Dalton-Morgan; Jacqueline Batley; Longjiang Yu; Jinling Meng; Maoteng Li
Journal:  PLoS One       Date:  2015-03-19       Impact factor: 3.240

Review 4.  Machine Learning and Radiogenomics: Lessons Learned and Future Directions.

Authors:  John Kang; Tiziana Rancati; Sangkyu Lee; Jung Hun Oh; Sarah L Kerns; Jacob G Scott; Russell Schwartz; Seyoung Kim; Barry S Rosenstein
Journal:  Front Oncol       Date:  2018-06-21       Impact factor: 6.244

Review 5.  Detecting epistasis in human complex traits.

Authors:  Wen-Hua Wei; Gibran Hemani; Chris S Haley
Journal:  Nat Rev Genet       Date:  2014-09-09       Impact factor: 53.242

6.  SNP-SNP interaction network in angiogenesis genes associated with prostate cancer aggressiveness.

Authors:  Hui-Yi Lin; Ernest K Amankwah; Tung-Sung Tseng; Xiaotao Qu; Dung-Tsa Chen; Jong Y Park
Journal:  PLoS One       Date:  2013-04-03       Impact factor: 3.240

7.  EPAS1 gene variants are associated with sprint/power athletic performance in two cohorts of European athletes.

Authors:  Sarah Voisin; Pawel Cieszczyk; Vladimir P Pushkarev; Dmitry A Dyatlov; Boris F Vashlyayev; Vladimir A Shumaylov; Agnieszka Maciejewska-Karlowska; Marek Sawczuk; Lidia Skuza; Zbigniew Jastrzebski; David J Bishop; Nir Eynon
Journal:  BMC Genomics       Date:  2014-05-18       Impact factor: 3.969

8.  A Bayesian model for detection of high-order interactions among genetic variants in genome-wide association studies.

Authors:  Juexin Wang; Trupti Joshi; Babu Valliyodan; Haiying Shi; Yanchun Liang; Henry T Nguyen; Jing Zhang; Dong Xu
Journal:  BMC Genomics       Date:  2015-11-25       Impact factor: 3.969

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.