Literature DB >> 26109276

A random forest approach to capture genetic effects in the presence of population structure.

Johannes Stephan1,2, Oliver Stegle2, Andreas Beyer1.   

Abstract

The accurate mapping of causal variants in genome-wide association studies requires the consideration of both, confounding factors (for example, population structure) and nonlinear interactions between individual genetic variants. Here, we propose a method termed 'mixed random forest' that simultaneously accounts for population structure and captures nonlinear genetic effects. We test the model in simulation experiments and show that the mixed random forest approach improves detection power compared with established approaches. In an application to data from an outbred mouse population, we find that mixed random forest identifies associations that are more consistent with prior knowledge than competing methods. Further, our approach allows predicting phenotypes from genotypes with greater accuracy than any of the other methods that we tested. Our results show that approaches that simultaneously account for both, confounding due to population structure and epistatic interactions, are important to fully explain the heritable component of complex quantitative traits.

Entities:  

Mesh:

Year:  2015        PMID: 26109276     DOI: 10.1038/ncomms8432

Source DB:  PubMed          Journal:  Nat Commun        ISSN: 2041-1723            Impact factor:   14.919


  52 in total

1.  The mystery of missing heritability: Genetic interactions create phantom heritability.

Authors:  Or Zuk; Eliana Hechter; Shamil R Sunyaev; Eric S Lander
Journal:  Proc Natl Acad Sci U S A       Date:  2012-01-05       Impact factor: 11.205

2.  Improved linear mixed models for genome-wide association studies.

Authors:  Jennifer Listgarten; Christoph Lippert; Carl M Kadie; Robert I Davidson; Eleazar Eskin; David Heckerman
Journal:  Nat Methods       Date:  2012-05-30       Impact factor: 28.547

3.  Genome-wide genetic association of complex traits in heterogeneous stock mice.

Authors:  William Valdar; Leah C Solberg; Dominique Gauguier; Stephanie Burnett; Paul Klenerman; William O Cookson; Martin S Taylor; J Nicholas P Rawlins; Richard Mott; Jonathan Flint
Journal:  Nat Genet       Date:  2006-07-09       Impact factor: 38.330

4.  Power to detect higher-order epistatic interactions in a metabolic pathway using a new mapping strategy.

Authors:  Benjamin Stich; Jianming Yu; Albrecht E Melchinger; Hans-Peter Piepho; H Friedrich Utz; Hans P Maurer; Edward S Buckler
Journal:  Genetics       Date:  2006-12-28       Impact factor: 4.562

5.  Increased accuracy of artificial selection by using the realized relationship matrix.

Authors:  B J Hayes; P M Visscher; M E Goddard
Journal:  Genet Res (Camb)       Date:  2009-02       Impact factor: 1.588

6.  A stage-wise approach for the analysis of multi-environment trials.

Authors:  Hans-Peter Piepho; Jens Möhring; Torben Schulz-Streeck; Joseph O Ogutu
Journal:  Biom J       Date:  2012-09-25       Impact factor: 2.207

Review 7.  Using biological knowledge to uncover the mystery in the search for epistasis in genome-wide association studies.

Authors:  Marylyn D Ritchie
Journal:  Ann Hum Genet       Date:  2011-01       Impact factor: 1.670

8.  An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations.

Authors:  Vincent Segura; Bjarni J Vilhjálmsson; Alexander Platt; Arthur Korte; Ümit Seren; Quan Long; Magnus Nordborg
Journal:  Nat Genet       Date:  2012-06-17       Impact factor: 38.330

9.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines.

Authors:  Susanna Atwell; Yu S Huang; Bjarni J Vilhjálmsson; Glenda Willems; Matthew Horton; Yan Li; Dazhe Meng; Alexander Platt; Aaron M Tarone; Tina T Hu; Rong Jiang; N Wayan Muliyati; Xu Zhang; Muhammad Ali Amer; Ivan Baxter; Benjamin Brachi; Joanne Chory; Caroline Dean; Marilyne Debieu; Juliette de Meaux; Joseph R Ecker; Nathalie Faure; Joel M Kniskern; Jonathan D G Jones; Todd Michael; Adnane Nemri; Fabrice Roux; David E Salt; Chunlao Tang; Marco Todesco; M Brian Traw; Detlef Weigel; Paul Marjoram; Justin O Borevitz; Joy Bergelson; Magnus Nordborg
Journal:  Nature       Date:  2010-03-24       Impact factor: 49.962

10.  Finding the sources of missing heritability in a yeast cross.

Authors:  Joshua S Bloom; Ian M Ehrenreich; Wesley T Loo; Thúy-Lan Võ Lite; Leonid Kruglyak
Journal:  Nature       Date:  2013-02-03       Impact factor: 49.962

View more
  29 in total

1.  Imputing Gene Expression in Uncollected Tissues Within and Beyond GTEx.

Authors:  Jiebiao Wang; Eric R Gamazon; Brandon L Pierce; Barbara E Stranger; Hae Kyung Im; Robert D Gibbons; Nancy J Cox; Dan L Nicolae; Lin S Chen
Journal:  Am J Hum Genet       Date:  2016-03-31       Impact factor: 11.025

2.  An enhanced machine learning tool for cis-eQTL mapping with regularization and confounder adjustments.

Authors:  Kang K Yan; Hongyu Zhao; Joseph T Wu; Herbert Pang
Journal:  Genet Epidemiol       Date:  2020-07-22       Impact factor: 2.135

3.  Machine Learning on a Genome-wide Association Study to Predict Late Genitourinary Toxicity After Prostate Radiation Therapy.

Authors:  Sangkyu Lee; Sarah Kerns; Harry Ostrer; Barry Rosenstein; Joseph O Deasy; Jung Hun Oh
Journal:  Int J Radiat Oncol Biol Phys       Date:  2018-01-31       Impact factor: 7.038

4.  High-Resolution Genomic Comparisons within Salmonella enterica Serotypes Derived from Beef Feedlot Cattle: Parsing the Roles of Cattle Source, Pen, Animal, Sample Type, and Production Period.

Authors:  Gizem Levent; Ashlynn Schlochtermeier; Samuel E Ives; Keri N Norman; Sara D Lawhon; Guy H Loneragan; Robin C Anderson; Javier Vinasco; Henk C den Bakker; H Morgan Scott
Journal:  Appl Environ Microbiol       Date:  2021-05-26       Impact factor: 4.792

5.  Machine learning in postgenomic biology and personalized medicine.

Authors:  Animesh Ray
Journal:  Wiley Interdiscip Rev Data Min Knowl Discov       Date:  2022-01-24

6.  Uncovering the genetic signature of quantitative trait evolution with replicated time series data.

Authors:  S U Franssen; R Kofler; C Schlötterer
Journal:  Heredity (Edinb)       Date:  2016-11-16       Impact factor: 3.821

7.  Machine learning for genetic prediction of psychiatric disorders: a systematic review.

Authors:  Matthew Bracher-Smith; Karen Crawford; Valentina Escott-Price
Journal:  Mol Psychiatry       Date:  2020-06-26       Impact factor: 15.992

8.  Adolescent HIV-related behavioural prediction using machine learning: a foundation for precision HIV prevention.

Authors:  Bo Wang; Feifan Liu; Lynette Deveaux; Arlene Ash; Samiran Gosh; Xiaoming Li; Elke Rundensteiner; Lesley Cottrell; Richard Adderley; Bonita Stanton
Journal:  AIDS       Date:  2021-05-01       Impact factor: 4.177

9.  Predicting quantitative traits from genome and phenome with near perfect accuracy.

Authors:  Kaspar Märtens; Johan Hallin; Jonas Warringer; Gianni Liti; Leopold Parts
Journal:  Nat Commun       Date:  2016-05-10       Impact factor: 14.919

10.  A platform for experimental precision medicine: The extended BXD mouse family.

Authors:  David G Ashbrook; Danny Arends; Pjotr Prins; Megan K Mulligan; Suheeta Roy; Evan G Williams; Cathleen M Lutz; Alicia Valenzuela; Casey J Bohl; Jesse F Ingels; Melinda S McCarty; Arthur G Centeno; Reinmar Hager; Johan Auwerx; Lu Lu; Robert W Williams
Journal:  Cell Syst       Date:  2021-01-19       Impact factor: 10.304

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.