Literature DB >> 22759416

Efficient error correction algorithms for gene tree reconciliation based on duplication, duplication and loss, and deep coalescence.

Ruchi Chaudhary1, J Gordon Burleigh, Oliver Eulenstein.   

Abstract

BACKGROUND: Gene tree - species tree reconciliation problems infer the patterns and processes of gene evolution within a species tree. Gene tree parsimony approaches seek the evolutionary scenario that implies the fewest gene duplications, duplications and losses, or deep coalescence (incomplete lineage sorting) events needed to reconcile a gene tree and a species tree. While a gene tree parsimony approach can be informative about genome evolution and phylogenetics, error in gene trees can profoundly bias the results.
RESULTS: We introduce efficient algorithms that rapidly search local Subtree Prune and Regraft (SPR) or Tree Bisection and Reconnection (TBR) neighborhoods of a given gene tree to identify a topology that implies the fewest duplications, duplication and losses, or deep coalescence events. These algorithms improve on the current solutions by a factor of n for searching SPR neighborhoods and n2 for searching TBR neighborhoods, where n is the number of taxa in the given gene tree. They provide a fast error correction protocol for ameliorating the effects of gene tree error by allowing small rearrangements in the topology to improve the reconciliation cost. We also demonstrate a simple protocol to use the gene rearrangement algorithm to improve gene tree parsimony phylogenetic analyses.
CONCLUSIONS: The new gene tree rearrangement algorithms provide a fast method to address gene tree error. They do not make assumptions about the underlying processes of genome evolution, and they are amenable to analyses of large-scale genomic data sets. These algorithms are also easily incorporated into gene tree parsimony phylogenetic analyses, potentially producing more credible estimates of reconciliation cost.

Entities:  

Mesh:

Year:  2012        PMID: 22759416      PMCID: PMC3382437          DOI: 10.1186/1471-2105-13-S10-S11

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  22 in total

1.  NOTUNG: a program for dating gene duplications and optimizing gene family trees.

Authors:  K Chen; D Durand; M Farach-Colton
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

2.  Genome-scale approaches to resolving incongruence in molecular phylogenies.

Authors:  Antonis Rokas; Barry L Williams; Nicole King; Sean B Carroll
Journal:  Nature       Date:  2003-10-23       Impact factor: 49.962

3.  A hybrid micro-macroevolutionary approach to gene tree reconstruction.

Authors:  Dannie Durand; Bjarni V Halldórsson; Benjamin Vernot
Journal:  J Comput Biol       Date:  2006-03       Impact factor: 1.479

4.  Optimal gene trees from sequences and species trees using a soft interpretation of parsimony.

Authors:  Ann-Charlotte Berglund-Sonnhammer; Pär Steffansson; Matthew J Betts; David A Liberles
Journal:  J Mol Evol       Date:  2006-07-07       Impact factor: 2.395

5.  Inferring phylogeny from whole genomes.

Authors:  Paweł Górecki; Jerzy Tiuryn
Journal:  Bioinformatics       Date:  2007-01-15       Impact factor: 6.937

6.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

7.  Reconstruction of ancient molecular phylogeny.

Authors:  R Guigó; I Muchnik; T F Smith
Journal:  Mol Phylogenet Evol       Date:  1996-10       Impact factor: 4.286

8.  On a Mirkin-Muchnik-Smith conjecture for comparing molecular phylogenies.

Authors:  L Zhang
Journal:  J Comput Biol       Date:  1997       Impact factor: 1.479

9.  Inferring species trees from gene trees: a phylogenetic analysis of the Elapidae (Serpentes) based on the amino acid sequences of venom proteins.

Authors:  J B Slowinski; A Knight; A P Rooney
Journal:  Mol Phylogenet Evol       Date:  1997-12       Impact factor: 4.286

10.  An ILP solution for the gene duplication problem.

Authors:  Wen-Chieh Chang; Gordon J Burleigh; David F Fernández-Baca; Oliver Eulenstein
Journal:  BMC Bioinformatics       Date:  2011-02-15       Impact factor: 3.169

View more
  8 in total

1.  DrML: probabilistic modeling of gene duplications.

Authors:  Paweł Górecki; Oliver Eulenstein
Journal:  J Comput Biol       Date:  2013-09-27       Impact factor: 1.479

2.  Gene tree correction guided by orthology.

Authors:  Manuel Lafond; Magali Semeria; Krister M Swenson; Eric Tannier; Nadia El-Mabrouk
Journal:  BMC Bioinformatics       Date:  2013-10-15       Impact factor: 3.169

3.  The link between orthology relations and gene trees: a correction perspective.

Authors:  Manuel Lafond; Riccardo Dondi; Nadia El-Mabrouk
Journal:  Algorithms Mol Biol       Date:  2016-04-16       Impact factor: 1.405

4.  Embedding gene trees into phylogenetic networks by conflict resolution algorithms.

Authors:  Marcin Wawerka; Dawid Dąbkowski; Natalia Rutecka; Agnieszka Mykowiecka; Paweł Górecki
Journal:  Algorithms Mol Biol       Date:  2022-05-19       Impact factor: 1.721

5.  Non-parametric correction of estimated gene trees using TRACTION.

Authors:  Sarah Christensen; Erin K Molloy; Pranjal Vachaspati; Ananya Yammanuru; Tandy Warnow
Journal:  Algorithms Mol Biol       Date:  2020-01-04       Impact factor: 1.405

6.  Polytomy refinement for the correction of dubious duplications in gene trees.

Authors:  Manuel Lafond; Cedric Chauve; Riccardo Dondi; Nadia El-Mabrouk
Journal:  Bioinformatics       Date:  2014-09-01       Impact factor: 6.937

Review 7.  The inference of gene trees with species trees.

Authors:  Gergely J Szöllősi; Eric Tannier; Vincent Daubin; Bastien Boussau
Journal:  Syst Biol       Date:  2014-07-28       Impact factor: 15.683

8.  Efficient Gene Tree Correction Guided by Genome Evolution.

Authors:  Emmanuel Noutahi; Magali Semeria; Manuel Lafond; Jonathan Seguin; Bastien Boussau; Laurent Guéguen; Nadia El-Mabrouk; Eric Tannier
Journal:  PLoS One       Date:  2016-08-11       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.