Literature DB >> 32060618

Counting and sampling gene family evolutionary histories in the duplication-loss and duplication-loss-transfer models.

Cedric Chauve1,2,3, Yann Ponty4, Michael Wallner5,6.   

Abstract

Given a set of species whose evolution is represented by a species tree, a gene family is a group of genes having evolved from a single ancestral gene. A gene family evolves along the branches of a species tree through various mechanisms, including-but not limited to-speciation ([Formula: see text]), gene duplication ([Formula: see text]), gene loss ([Formula: see text]), and horizontal gene transfer ([Formula: see text]). The reconstruction of a gene tree representing the evolution of a gene family constrained by a species tree is an important problem in phylogenomics. However, unlike in the multispecies coalescent evolutionary model that considers only speciation and incomplete lineage sorting events, very little is known about the search space for gene family histories accounting for gene duplication, gene loss and horizontal gene transfer (the [Formula: see text]-model). In this work, we introduce the notion of evolutionary histories defined as a binary ordered rooted tree describing the evolution of a gene family, constrained by a species tree in the [Formula: see text]-model. We provide formal grammars describing the set of all evolutionary histories that are compatible with a given species tree, whether it is ranked or unranked. These grammars allow us, using either analytic combinatorics or dynamic programming, to efficiently compute the number of histories of a given size, and also to generate random histories of a given size under the uniform distribution. We apply these tools to obtain exact asymptotics for the number of gene family histories for two species trees, the rooted caterpillar and complete binary tree, as well as estimates of the range of the exponential growth factor of the number of histories for random species trees of size up to 25. Our results show that including horizontal gene transfers induce a dramatic increase of the number of evolutionary histories. We also show that, within ranked species trees, the number of evolutionary histories in the [Formula: see text]-model is almost independent of the species tree topology. These results establish firm foundations for the development of ensemble methods for the prediction of reconciliations.

Entities:  

Keywords:  Asymptotics; Enumerative combinatorics; Phylogenetics; Sampling algorithms

Mesh:

Year:  2020        PMID: 32060618      PMCID: PMC7052048          DOI: 10.1007/s00285-019-01465-x

Source DB:  PubMed          Journal:  J Math Biol        ISSN: 0303-6812            Impact factor:   2.259


  41 in total

1.  Unified modeling of gene duplication, loss, and coalescence using a locus tree.

Authors:  Matthew D Rasmussen; Manolis Kellis
Journal:  Genome Res       Date:  2012-01-23       Impact factor: 9.043

2.  Gene tree distributions under the coalescent process.

Authors:  James H Degnan; Laura A Salter
Journal:  Evolution       Date:  2005-01       Impact factor: 3.694

Review 3.  Gene tree discordance, phylogenetic inference and the multispecies coalescent.

Authors:  James H Degnan; Noah A Rosenberg
Journal:  Trends Ecol Evol       Date:  2009-03-21       Impact factor: 17.712

4.  Gene tree species tree reconciliation with gene conversion.

Authors:  Damir Hasić; Eric Tannier
Journal:  J Math Biol       Date:  2019-02-15       Impact factor: 2.259

5.  The space of ultrametric phylogenetic trees.

Authors:  Alex Gavryushkin; Alexei J Drummond
Journal:  J Theor Biol       Date:  2016-05-14       Impact factor: 2.691

6.  On the Number of Non-equivalent Ancestral Configurations for Matching Gene Trees and Species Trees.

Authors:  Filippo Disanto; Noah A Rosenberg
Journal:  Bull Math Biol       Date:  2017-09-14       Impact factor: 1.758

7.  Inferring duplications, losses, transfers and incomplete lineage sorting with nonbinary species trees.

Authors:  Maureen Stolzer; Han Lai; Minli Xu; Deepa Sathaye; Benjamin Vernot; Dannie Durand
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

8.  Joint amalgamation of most parsimonious reconciled gene trees.

Authors:  Celine Scornavacca; Edwin Jacox; Gergely J Szöllősi
Journal:  Bioinformatics       Date:  2014-11-06       Impact factor: 6.937

9.  An algorithm for computing the gene tree probability under the multispecies coalescent and its application in the inference of population tree.

Authors:  Yufeng Wu
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

10.  Lateral gene transfer from the dead.

Authors:  Gergely J Szöllosi; Eric Tannier; Nicolas Lartillot; Vincent Daubin
Journal:  Syst Biol       Date:  2013-01-25       Impact factor: 15.683

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.