Literature DB >> 22851473

Bootstrap aggregating of alternating decision trees to detect sets of SNPs that associate with disease.

Richard T Guy1, Peter Santago, Carl D Langefeld.   

Abstract

Complex genetic disorders are a result of a combination of genetic and nongenetic factors, all potentially interacting. Machine learning methods hold the potential to identify multilocus and environmental associations thought to drive complex genetic traits. Decision trees, a popular machine learning technique, offer a computationally low complexity algorithm capable of detecting associated sets of single nucleotide polymorphisms (SNPs) of arbitrary size, including modern genome-wide SNP scans. However, interpretation of the importance of an individual SNP within these trees can present challenges. We present a new decision tree algorithm denoted as Bagged Alternating Decision Trees (BADTrees) that is based on identifying common structural elements in a bootstrapped set of Alternating Decision Trees (ADTrees). The algorithm is order nk(2), where n is the number of SNPs considered and k is the number of SNPs in the tree constructed. Our simulation study suggests that BADTrees have higher power and lower type I error rates than ADTrees alone and comparable power with lower type I error rates compared to logistic regression. We illustrate the application of these data using simulated data as well as from the Lupus Large Association Study 1 (7,822 SNPs in 3,548 individuals). Our results suggest that BADTrees hold promise as a low computational order algorithm for detecting complex combinations of SNP and environmental factors associated with disease.
© 2012 Wiley Periodicals, Inc.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22851473      PMCID: PMC3769952          DOI: 10.1002/gepi.21608

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  9 in total

1.  Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity.

Authors:  Marylyn D Ritchie; Lance W Hahn; Jason H Moore
Journal:  Genet Epidemiol       Date:  2003-02       Impact factor: 2.135

Review 2.  Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans.

Authors:  Heather J Cordell
Journal:  Hum Mol Genet       Date:  2002-10-01       Impact factor: 6.150

Review 3.  Genetics, statistics and human disease: analytical retooling for complexity.

Authors:  Tricia A Thornton-Wells; Jason H Moore; Jonathan L Haines
Journal:  Trends Genet       Date:  2004-12       Impact factor: 11.639

4.  Identifying SNPs predictive of phenotype using random forests.

Authors:  Alexandre Bureau; Josée Dupuis; Kathleen Falls; Kathryn L Lunetta; Brooke Hayward; Tim P Keith; Paul Van Eerdewegh
Journal:  Genet Epidemiol       Date:  2005-02       Impact factor: 2.135

5.  Traversing the conceptual divide between biological and statistical epistasis: systems biology and a more modern synthesis.

Authors:  Jason H Moore; Scott M Williams
Journal:  Bioessays       Date:  2005-06       Impact factor: 4.345

6.  An algorithm for learning maximum entropy probability models of disease risk that efficiently searches and sparingly encodes multilocus genomic interactions.

Authors:  David J Miller; Yanxin Zhang; Guoqiang Yu; Yongmei Liu; Li Chen; Carl D Langefeld; David Herrington; Yue Wang
Journal:  Bioinformatics       Date:  2009-07-16       Impact factor: 6.937

7.  Enabling personal genomics with an explicit test of epistasis.

Authors:  Casey S Greene; Daniel S Himmelstein; Heather H Nelson; Karl T Kelsey; Scott M Williams; Angeline S Andrew; Margaret R Karagas; Jason H Moore
Journal:  Pac Symp Biocomput       Date:  2010

8.  Genome-wide association scan in women with systemic lupus erythematosus identifies susceptibility variants in ITGAM, PXK, KIAA1542 and other loci.

Authors:  John B Harley; Marta E Alarcón-Riquelme; Lindsey A Criswell; Chaim O Jacob; Robert P Kimberly; Kathy L Moser; Betty P Tsao; Timothy J Vyse; Carl D Langefeld; Swapan K Nath; Joel M Guthridge; Beth L Cobb; Daniel B Mirel; Miranda C Marion; Adrienne H Williams; Jasmin Divers; Wei Wang; Summer G Frank; Bahram Namjou; Stacey B Gabriel; Annette T Lee; Peter K Gregersen; Timothy W Behrens; Kimberly E Taylor; Michelle Fernando; Raphael Zidovetzki; Patrick M Gaffney; Jeffrey C Edberg; John D Rioux; Joshua O Ojwang; Judith A James; Joan T Merrill; Gary S Gilkeson; Michael F Seldin; Hong Yin; Emily C Baechler; Quan-Zhen Li; Edward K Wakeland; Gail R Bruner; Kenneth M Kaufman; Jennifer A Kelly
Journal:  Nat Genet       Date:  2008-01-20       Impact factor: 38.330

9.  Boosting alternating decision trees modeling of disease trait information.

Authors:  Kuang-Yu Liu; Jennifer Lin; Xiaobo Zhou; Stephen T C Wong
Journal:  BMC Genet       Date:  2005-12-30       Impact factor: 2.797

  9 in total
  6 in total

1.  Particle swarm optimization algorithm for analyzing SNP-SNP interaction of renin-angiotensin system genes against hypertension.

Authors:  Shyh-Jong Wu; Li-Yeh Chuang; Yu-Da Lin; Wen-Hsien Ho; Fu-Tien Chiang; Cheng-Hong Yang; Hsueh-Wei Chang
Journal:  Mol Biol Rep       Date:  2013-05-22       Impact factor: 2.316

Review 2.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09

3.  Elucidation of the complex metabolic profile of cerebrospinal fluid using an untargeted biochemical profiling assay.

Authors:  Adam D Kennedy; Kirk L Pappan; Taraka R Donti; Anne M Evans; Jacob E Wulff; Luke A D Miller; V Reid Sutton; Qin Sun; Marcus J Miller; Sarah H Elsea
Journal:  Mol Genet Metab       Date:  2017-04-09       Impact factor: 4.797

Review 4.  The Prediction of Radiotherapy Toxicity Using Single Nucleotide Polymorphism-Based Models: A Step Toward Prevention.

Authors:  Sarah L Kerns; Suman Kundu; Jung Hun Oh; Sandeep K Singhal; Michelle Janelsins; Lois B Travis; Joseph O Deasy; A Cecile J E Janssens; Harry Ostrer; Matthew Parliament; Nawaid Usmani; Barry S Rosenstein
Journal:  Semin Radiat Oncol       Date:  2015-05-15       Impact factor: 5.934

5.  Machine learning approaches for the genomic prediction of rheumatoid arthritis and systemic lupus erythematosus.

Authors:  Chih-Wei Chung; Tzu-Hung Hsiao; Chih-Jen Huang; Yen-Ju Chen; Hsin-Hua Chen; Ching-Heng Lin; Seng-Cho Chou; Tzer-Shyong Chen; Yu-Fang Chung; Hwai-I Yang; Yi-Ming Chen
Journal:  BioData Min       Date:  2021-12-11       Impact factor: 2.522

Review 6.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.