Literature DB >> 24000925

The duplication-loss small phylogeny problem: from cherries to trees.

Sandro Andreotti1, Knut Reinert, Stefan Canzar.   

Abstract

The reconstruction of the history of evolutionary genome-wide events among a set of related organisms is of great biological interest since it can help to reveal the genomic basis of phenotypes. The sequencing of whole genomes faciliates the study of gene families that vary in size through duplication and loss events, like transfer RNA. However, a high sequence similarity often does not allow one to distinguish between orthologs and paralogs. Previous methods have addressed this difficulty by taking into account flanking regions of members of a family independently. We go one step further by inferring the order of genes of (a set of) families for ancestral genomes by considering the order of these genes on sequenced genomes. We present a novel branch-and-cut algorithm to solve the two species small phylogeny problem in the evolutionary model of duplications and losses. On average, our implementation, DupLoCut, improves the running time of a recently proposed method in the experiments on six Vibrionaceae lineages by a factor of ∼200. Besides the mere improvement in running time, the efficiency of our approach allows us to extend our model from cherries of a species tree, that is, subtrees with two leaves, to the median of three species setting. Being able to determine the median of three species is of key importance to one of the most common approaches to ancestral reconstruction, and our experiments show that its repeated computation considerably reduces the number of duplications and losses along the tree both on simulated instances comprising 128 leaves and a set of Bacillus genomes. Furthermore, in our simulations we show that a reduction in cost goes hand in hand with an improvement of the predicted ancestral genomes. Finally, we prove that the small phylogeny problem in the duplication-loss model is NP-complete already for two species.

Mesh:

Substances:

Year:  2013        PMID: 24000925      PMCID: PMC3761417          DOI: 10.1089/cmb.2013.0057

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  7 in total

1.  The evolutionary fate and consequences of duplicate genes.

Authors:  M Lynch; J S Conery
Journal:  Science       Date:  2000-11-10       Impact factor: 47.728

2.  Breakpoint Phylogenies.

Authors: 
Journal:  Genome Inform Ser Workshop Genome Inform       Date:  1997

3.  Inferring the history of interchromosomal gene transposition in Drosophila using n-dimensional parsimony.

Authors:  Mira V Han; Matthew W Hahn
Journal:  Genetics       Date:  2011-11-17       Impact factor: 4.562

4.  Archaeology and evolution of transfer RNA genes in the Escherichia coli genome.

Authors:  Mike Withers; Lorenz Wernisch; Mario dos Reis
Journal:  RNA       Date:  2006-04-17       Impact factor: 4.942

5.  Identifying parent-daughter relationships among duplicated genes.

Authors:  Mira V Han; Matthew W Hahn
Journal:  Pac Symp Biocomput       Date:  2009

6.  The evolution of tRNA genes in Drosophila.

Authors:  Hubert H Rogers; Casey M Bergman; Sam Griffiths-Jones
Journal:  Genome Biol Evol       Date:  2010-07-12       Impact factor: 3.416

7.  Gene family evolution across 12 Drosophila genomes.

Authors:  Matthew W Hahn; Mira V Han; Sang-Gook Han
Journal:  PLoS Genet       Date:  2007-11       Impact factor: 5.917

  7 in total
  2 in total

1.  Gene order alignment on trees with multiOrthoAlign.

Authors:  Billel Benzaid; Nadia El-Mabrouk
Journal:  BMC Genomics       Date:  2014-10-17       Impact factor: 3.969

2.  Operon-based approach for the inference of rRNA and tRNA evolutionary histories in bacteria.

Authors:  Tomasz Pawliszak; Meghan Chua; Carson K Leung; Olivier Tremblay-Savard
Journal:  BMC Genomics       Date:  2020-04-16       Impact factor: 3.969

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.