Literature DB >> 12855432

Bayesian gene/species tree reconciliation and orthology analysis using MCMC.

Lars Arvestad1, Ann-Charlotte Berglund, Jens Lagergren, Bengt Sennblad.   

Abstract

MOTIVATION: Comparative genomics in general and orthology analysis in particular are becoming increasingly important parts of gene function prediction. Previously, orthology analysis and reconciliation has been performed only with respect to the parsimony model. This discards many plausible solutions and sometimes precludes finding the correct one. In many other areas in bioinformatics probabilistic models have proven to be both more realistic and powerful than parsimony models. For instance, they allow for assessing solution reliability and consideration of alternative solutions in a uniform way. There is also an added benefit in making model assumptions explicit and therefore making model comparisons possible. For orthology analysis, uncertainty has recently been addressed using parsimonious reconciliation combined with bootstrap techniques. However, until now no probabilistic methods have been available.
RESULTS: We introduce a probabilistic gene evolution model based on a birth-death process in which a gene tree evolves 'inside' a species tree. Based on this model, we develop a tool with the capacity to perform practical orthology analysis, based on Fitch's original definition, and more generally for reconciling pairs of gene and species trees. Our gene evolution model is biologically sound (Nei et al., 1997) and intuitively attractive. We develop a Bayesian analysis based on MCMC which facilitates approximation of an a posteriori distribution for reconciliations. That is, we can find the most probable reconciliations and estimate the probability of any reconciliation, given the observed gene tree. This also gives a way to estimate the probability that a pair of genes are orthologs. The main algorithmic contribution presented here consists of an algorithm for computing the likelihood of a given reconciliation. To the best of our knowledge, this is the first successful introduction of this type of probabilistic methods, which flourish in phylogeny analysis, into reconciliation and orthology analysis. The MCMC algorithm has been implemented and, although not yet being in its final form, tests show that it performs very well on synthetic as well as biological data. Using standard correspondences, our results carry over to allele trees as well as biogeography.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12855432     DOI: 10.1093/bioinformatics/btg1000

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  59 in total

1.  The consistent phylogenetic signal in genome trees revealed by reducing the impact of noise.

Authors:  Bas E Dutilh; Martijn A Huynen; William J Bruno; Berend Snel
Journal:  J Mol Evol       Date:  2004-05       Impact factor: 2.395

2.  Unified modeling of gene duplication, loss, and coalescence using a locus tree.

Authors:  Matthew D Rasmussen; Manolis Kellis
Journal:  Genome Res       Date:  2012-01-23       Impact factor: 9.043

3.  Inferring gene duplications, transfers and losses can be done in a discrete framework.

Authors:  Vincent Ranwez; Celine Scornavacca; Jean-Philippe Doyon; Vincent Berry
Journal:  J Math Biol       Date:  2015-09-04       Impact factor: 2.259

Review 4.  Probabilistic models of eukaryotic evolution: time for integration.

Authors:  Nicolas Lartillot
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2015-09-26       Impact factor: 6.237

5.  Protein content of minimal and ancestral ribosome.

Authors:  Arcady Mushegian
Journal:  RNA       Date:  2005-07-25       Impact factor: 4.942

6.  Optimal gene trees from sequences and species trees using a soft interpretation of parsimony.

Authors:  Ann-Charlotte Berglund-Sonnhammer; Pär Steffansson; Matthew J Betts; David A Liberles
Journal:  J Mol Evol       Date:  2006-07-07       Impact factor: 2.395

7.  Reconstructing patterns of reticulate evolution in plants.

Authors:  C Randal Linder; Loren H Rieseberg
Journal:  Am J Bot       Date:  2004-10       Impact factor: 3.844

8.  Simultaneous Bayesian gene tree reconstruction and reconciliation analysis.

Authors:  Orjan Akerborg; Bengt Sennblad; Lars Arvestad; Jens Lagergren
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-19       Impact factor: 11.205

9.  Reconciliation with non-binary species trees.

Authors:  Benjamin Vernot; Maureen Stolzer; Aiton Goldman; Dannie Durand
Journal:  J Comput Biol       Date:  2008-10       Impact factor: 1.479

10.  Genome-wide probabilistic reconciliation analysis across vertebrates.

Authors:  Owais Mahmudi; Joel Sjöstrand; Bengt Sennblad; Jens Lagergren
Journal:  BMC Bioinformatics       Date:  2013-10-15       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.