Literature DB >> 32227194

A parallelized strategy for epistasis analysis based on Empirical Bayesian Elastic Net models.

Jia Wen1, Colby T Ford2,3, Daniel Janies2, Xinghua Shi4.   

Abstract

MOTIVATION: Epistasis reflects the distortion on a particular trait or phenotype resulting from the combinatorial effect of two or more genes or genetic variants. Epistasis is an important genetic foundation underlying quantitative traits in many organisms as well as in complex human diseases. However, there are two major barriers in identifying epistasis using large genomic datasets. One is that epistasis analysis will induce over-fitting of an over-saturated model with the high-dimensionality of a genomic dataset. Therefore, the problem of identifying epistasis demands efficient statistical methods. The second barrier comes from the intensive computing time for epistasis analysis, even when the appropriate model and data are specified.
RESULTS: In this study, we combine statistical techniques and computational techniques to scale up epistasis analysis using Empirical Bayesian Elastic Net (EBEN) models. Specifically, we first apply a matrix manipulation strategy for pre-computing the correlation matrix and pre-filter to narrow down the search space for epistasis analysis. We then develop a parallelized approach to further accelerate the modeling process. Our experiments on synthetic and empirical genomic data demonstrate that our parallelized methods offer tens of fold speed up in comparison with the classical EBEN method which runs in a sequential manner. We applied our parallelized approach to a yeast dataset, and we were able to identify both main and epistatic effects of genetic variants associated with traits such as fitness.
AVAILABILITY AND IMPLEMENTATION: The software is available at github.com/shilab/parEBEN.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2020        PMID: 32227194      PMCID: PMC7320619          DOI: 10.1093/bioinformatics/btaa216

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  47 in total

1.  Detecting genome-wide epistases based on the clustering of relatively frequent items.

Authors:  Minzhu Xie; Jing Li; Tao Jiang
Journal:  Bioinformatics       Date:  2011-11-03       Impact factor: 6.937

Review 2.  Bayesian models for detecting epistatic interactions from genetic data.

Authors:  Yu Zhang; Bo Jiang; Jun Zhu; Jun S Liu
Journal:  Ann Hum Genet       Date:  2010-11-22       Impact factor: 1.670

3.  The cost of gene expression underlies a fitness trade-off in yeast.

Authors:  Gregory I Lang; Andrew W Murray; David Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-19       Impact factor: 11.205

4.  Epistasis in a quantitative trait captured by a molecular model of transcription factor interactions.

Authors:  Jason Gertz; Justin P Gerke; Barak A Cohen
Journal:  Theor Popul Biol       Date:  2009-10-08       Impact factor: 1.570

5.  High-throughput analysis of epistasis in genome-wide association studies with BiForce.

Authors:  Attila Gyenesei; Jonathan Moody; Colin A M Semple; Chris S Haley; Wen-Hua Wei
Journal:  Bioinformatics       Date:  2012-05-21       Impact factor: 6.937

6.  Recent developments in statistical methods for detecting genetic loci affecting phenotypic variability.

Authors:  Lars Rönnegård; William Valdar
Journal:  BMC Genet       Date:  2012-07-24       Impact factor: 2.797

7.  Two-stage two-locus models in genome-wide association.

Authors:  David M Evans; Jonathan Marchini; Andrew P Morris; Lon R Cardon
Journal:  PLoS Genet       Date:  2006-09-22       Impact factor: 5.917

8.  Accounting for genetic interactions improves modeling of individual quantitative trait phenotypes in yeast.

Authors:  Simon K G Forsberg; Joshua S Bloom; Meru J Sadhu; Leonid Kruglyak; Örjan Carlborg
Journal:  Nat Genet       Date:  2017-02-27       Impact factor: 38.330

Review 9.  How to increase our belief in discovered statistical interactions via large-scale association studies?

Authors:  K Van Steen; J H Moore
Journal:  Hum Genet       Date:  2019-03-06       Impact factor: 4.132

10.  Genetic interactions affecting human gene expression identified by variance association mapping.

Authors:  Andrew Anand Brown; Alfonso Buil; Ana Viñuela; Tuuli Lappalainen; Hou-Feng Zheng; J Brent Richards; Kerrin S Small; Timothy D Spector; Emmanouil T Dermitzakis; Richard Durbin
Journal:  Elife       Date:  2014-04-25       Impact factor: 8.140

View more
  1 in total

1.  Deep polygenic neural network for predicting and identifying yield-associated genes in Indonesian rice accessions.

Authors:  Nicholas Dominic; Tjeng Wawan Cenggoro; Arif Budiarto; Bens Pardamean
Journal:  Sci Rep       Date:  2022-08-15       Impact factor: 4.996

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.