Literature DB >> 18808330

Reconciliation with non-binary species trees.

Benjamin Vernot1, Maureen Stolzer, Aiton Goldman, Dannie Durand.   

Abstract

Reconciliation extracts information from the topological incongruence between gene and species trees to infer duplications and losses in the history of a gene family. The inferred duplication-loss histories provide valuable information for a broad range of biological applications, including ortholog identification, estimating gene duplication times, and rooting and correcting gene trees. While reconciliation for binary trees is a tractable and well studied problem, there are no algorithms for reconciliation with non-binary species trees. Yet a striking proportion of species trees are non-binary. For example, 64% of branch points in the NCBI taxonomy have three or more children. When applied to non-binary species trees, current algorithms overestimate the number of duplications because they cannot distinguish between duplication and incomplete lineage sorting. We present the first algorithms for reconciling binary gene trees with non-binary species trees under a duplication-loss parsimony model. Our algorithms utilize an efficient mapping from gene to species trees to infer the minimum number of duplications in O(|V(G) | x (k(S) + h(S))) time, where |V(G)| is the number of nodes in the gene tree, h(S) is the height of the species tree and k(S) is the size of its largest polytomy. We present a dynamic programming algorithm which also minimizes the total number of losses. Although this algorithm is exponential in the size of the largest polytomy, it performs well in practice for polytomies with outdegree of 12 or less. We also present a heuristic which estimates the minimal number of losses in polynomial time. In empirical tests, this algorithm finds an optimal loss history 99% of the time. Our algorithms have been implemented in NOTUNG, a robust, production quality, tree-fitting program, which provides a graphical user interface for exploratory analysis and also supports automated, high-throughput analysis of large data sets.

Entities:  

Mesh:

Year:  2008        PMID: 18808330      PMCID: PMC3205801          DOI: 10.1089/cmb.2008.0092

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  46 in total

1.  Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution.

Authors:  Xun Gu; Yufeng Wang; Jianying Gu
Journal:  Nat Genet       Date:  2002-05-28       Impact factor: 38.330

2.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

3.  Bayesian gene/species tree reconciliation and orthology analysis using MCMC.

Authors:  Lars Arvestad; Ann-Charlotte Berglund; Jens Lagergren; Bengt Sennblad
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

4.  Duplication and DNA segmental loss in the rice genome: implications for diploidization.

Authors:  Xiyin Wang; Xiaoli Shi; Bailin Hao; Song Ge; Jingchu Luo
Journal:  New Phytol       Date:  2005-03       Impact factor: 10.151

5.  Evolutionary change of the numbers of homeobox genes in bilateral animals.

Authors:  Jongmin Nam; Masatoshi Nei
Journal:  Mol Biol Evol       Date:  2005-08-03       Impact factor: 16.240

6.  GeneTree: comparing gene and species phylogenies using reconciled trees.

Authors:  R D Page
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

7.  Reconstruction of ancient molecular phylogeny.

Authors:  R Guigó; I Muchnik; T F Smith
Journal:  Mol Phylogenet Evol       Date:  1996-10       Impact factor: 4.286

8.  Phylogeny of the Lake Tanganyika cichlid species flock and its relationship to the Central and East African haplochromine cichlid fish faunas.

Authors:  Walter Salzburger; Axel Meyer; Sanja Baric; Erik Verheyen; Christian Sturmbauer
Journal:  Syst Biol       Date:  2002-02       Impact factor: 15.683

9.  mtDNA diversity in rhesus monkeys reveals overestimates of divergence time and paraphyly with neighboring species.

Authors:  D J Melnick; G A Hoelzer; R Absher; M V Ashley
Journal:  Mol Biol Evol       Date:  1993-03       Impact factor: 16.240

10.  Gene family evolution across 12 Drosophila genomes.

Authors:  Matthew W Hahn; Mira V Han; Sang-Gook Han
Journal:  PLoS Genet       Date:  2007-11       Impact factor: 5.917

View more
  64 in total

1.  Dissecting plant genomes with the PLAZA comparative genomics platform.

Authors:  Michiel Van Bel; Sebastian Proost; Elisabeth Wischnitzki; Sara Movahedi; Christopher Scheerlinck; Yves Van de Peer; Klaas Vandepoele
Journal:  Plant Physiol       Date:  2011-12-23       Impact factor: 8.340

2.  Unified modeling of gene duplication, loss, and coalescence using a locus tree.

Authors:  Matthew D Rasmussen; Manolis Kellis
Journal:  Genome Res       Date:  2012-01-23       Impact factor: 9.043

3.  Inferring gene duplications, transfers and losses can be done in a discrete framework.

Authors:  Vincent Ranwez; Celine Scornavacca; Jean-Philippe Doyon; Vincent Berry
Journal:  J Math Biol       Date:  2015-09-04       Impact factor: 2.259

4.  PLAZA: a comparative genomics resource to study gene and genome evolution in plants.

Authors:  Sebastian Proost; Michiel Van Bel; Lieven Sterck; Kenny Billiau; Thomas Van Parys; Yves Van de Peer; Klaas Vandepoele
Journal:  Plant Cell       Date:  2009-12-29       Impact factor: 11.277

Review 5.  Visualization of multiple alignments, phylogenies and gene family evolution.

Authors:  James B Procter; Julie Thompson; Ivica Letunic; Chris Creevey; Fabrice Jossinet; Geoffrey J Barton
Journal:  Nat Methods       Date:  2010-03       Impact factor: 28.547

6.  Reconciling event-labeled gene trees with MUL-trees and species networks.

Authors:  Marc Hellmuth; Katharina T Huber; Vincent Moulton
Journal:  J Math Biol       Date:  2019-08-13       Impact factor: 2.259

7.  Gene tree species tree reconciliation with gene conversion.

Authors:  Damir Hasić; Eric Tannier
Journal:  J Math Biol       Date:  2019-02-15       Impact factor: 2.259

8.  Biologically feasible gene trees, reconciliation maps and informative triples.

Authors:  Marc Hellmuth
Journal:  Algorithms Mol Biol       Date:  2017-08-29       Impact factor: 1.405

9.  Untangling spider silk evolution with spidroin terminal domains.

Authors:  Jessica E Garb; Nadia A Ayoub; Cheryl Y Hayashi
Journal:  BMC Evol Biol       Date:  2010-08-09       Impact factor: 3.260

10.  Intron-loss evolution of hatching enzyme genes in Teleostei.

Authors:  Mari Kawaguchi; Junya Hiroi; Masaki Miya; Mutsumi Nishida; Ichiro Iuchi; Shigeki Yasumasu
Journal:  BMC Evol Biol       Date:  2010-08-27       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.