SDM: a fast distance-based approach for (super) tree building in phylogenomics.

Alexis Criscuolo, Vincent Berry, Emmanuel J P Douzery, Olivier Gascuel.

Abstract

Phylogenomic studies aim to build phylogenies from large sets of homologous genes. Such "genome-sized" data require fast methods, because of the typically large numbers of taxa examined. In this framework, distance-based methods are useful for exploratory studies and for building a starting tree to be refined by a more powerful maximum likelihood (ML) approach. However, estimating evolutionary distances directly from concatenated genes gives a poor topological signal, as genes evolve at different rates. We propose a novel method, named super distance matrix (SDM), which follows the same line as average consensus supertree (ACS; Lapointe and Cucumel, 1997) and combines the evolutionary distances obtained from each gene into a single distance supermatrix to be analyzed using a standard distance-based algorithm. SDM deforms the source matrices, without modifying their topological message, to bring them as close as possible to each other; these deformed matrices are then averaged to obtain the distance supermatrix. We show that this problem is equivalent to the minimization of a least-squares criterion subject to linear constraints. This problem has a unique solution, which is obtained by solving a linear system. As this system is sparse, solving it in practice requires O(n^a k^a) time, where n is the number of taxa, k the number of matrices, and a < 2, which allows the distance supermatrix to be quickly obtained. Several uses of SDM are proposed, from fast exploratory studies to more accurate approaches requiring heavier computing time. Using simulations, we show that SDM is a relevant alternative to the standard matrix representation with parsimony (MRP) method, notably when the taxa sets of the different genes have low overlap. We also show that SDM can be used to build an excellent starting tree for an ML approach, which both reduces the computing time and increases the topological accuracy. We use SDM to analyze the data set of Gatesy et al. (2002, Syst. Biol. 51: 652-664) that involves 48 genes of 75 placental mammals. The results indicate that these genes have strong rate heterogeneity and confirm the simulation conclusions.
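The "deform, then average" idea can be illustrated with a small Python sketch. It is not the authors' implementation: the abstract does not specify the deformation, so the sketch assumes the simplest possible one, a single scale factor per gene (a positive factor leaves the topological message of a matrix unchanged). The constraint that the factors sum to k, and the names `solve_linear_system`, `scale_factors` and `super_distance_matrix`, are likewise assumptions made for illustration. The factors minimize the squared differences between rescaled matrices on their shared taxon pairs, and the resulting constrained least-squares problem is solved as one linear system, here densely rather than with the sparse solver the abstract alludes to.

```python
# Simplified illustration only: one scale factor per gene (an assumption),
# chosen by least squares under the linear constraint sum(alpha) = k.

def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system: matrices may not overlap enough")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def scale_factors(matrices):
    """Return one factor per matrix; each matrix maps (taxon, taxon) -> distance."""
    k = len(matrices)
    # Stationarity equations of the criterion plus one Lagrange multiplier
    # for the constraint sum(alpha) = k.
    a = [[0.0] * (k + 1) for _ in range(k + 1)]
    for p in range(k):
        for q in range(k):
            if p == q:
                continue
            for pair in matrices[p].keys() & matrices[q].keys():
                a[p][p] += matrices[p][pair] ** 2
                a[p][q] -= matrices[p][pair] * matrices[q][pair]
        a[p][k] = 1.0
        a[k][p] = 1.0
    b = [0.0] * k + [float(k)]
    return solve_linear_system(a, b)[:k]


def super_distance_matrix(matrices):
    """Average the rescaled distances over the genes that contain each pair."""
    alphas = scale_factors(matrices)
    sums, counts = {}, {}
    for alpha, mat in zip(alphas, matrices):
        for pair, d in mat.items():
            sums[pair] = sums.get(pair, 0.0) + alpha * d
            counts[pair] = counts.get(pair, 0) + 1
    return {pair: sums[pair] / counts[pair] for pair in sums}


# Toy data (made up): gene2 evolves twice as fast as gene1 and covers taxon D.
gene1 = {("A", "B"): 0.3, ("A", "C"): 0.5, ("B", "C"): 0.6}
gene2 = {("A", "B"): 0.6, ("A", "C"): 1.0, ("B", "C"): 1.2,
         ("A", "D"): 0.8, ("B", "D"): 1.0, ("C", "D"): 0.6}
supermatrix = super_distance_matrix([gene1, gene2])
```

In this toy case the slower gene is stretched and the faster one shrunk until they agree on the shared pairs, and the pairs involving D, present in only one gene, are carried into the supermatrix at the rescaled rate. The supermatrix would then be passed to a standard distance-based tree-building algorithm.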

Year:  2006        PMID: 17060196     DOI: 10.1080/10635150600969872

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  31 in total

1.  Molecular phylogenetics: principles and practice. (Review)

Authors:  Ziheng Yang; Bruce Rannala
Journal:  Nat Rev Genet       Date:  2012-03-28       Impact factor: 53.242

2.  Phylogenetic inference with weighted codon evolutionary distances.

Authors:  Alexis Criscuolo; Christian J Michel
Journal:  J Mol Evol       Date:  2009-03-24       Impact factor: 2.395

3.  A hierarchical model for incomplete alignments in phylogenetic inference.

Authors:  Fuxia Cheng; Stefanie Hartmann; Mayetri Gupta; Joseph G Ibrahim; Todd J Vision
Journal:  Bioinformatics       Date:  2009-01-15       Impact factor: 6.937

4.  Determining the Null Model for Detecting Adaptive Convergence from Genomic Data: A Case Study using Echolocating Mammals.

Authors:  Gregg W C Thomas; Matthew W Hahn
Journal:  Mol Biol Evol       Date:  2015-01-27       Impact factor: 16.240

5.  BCD Beam Search: considering suboptimal partial solutions in Bad Clade Deletion supertrees.

Authors:  Markus Fleischauer; Sebastian Böcker
Journal:  PeerJ       Date:  2018-06-08       Impact factor: 2.984

6.  An application of supertree methods to Mammalian mitogenomic sequences.

Authors:  Véronique Campbell; François-Joseph Lapointe
Journal:  Evol Bioinform Online       Date:  2010-05-12       Impact factor: 1.625

7.  SuperTriplets: a triplet-based supertree approach to phylogenomics.

Authors:  Vincent Ranwez; Alexis Criscuolo; Emmanuel J P Douzery
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

8.  Horizontal gene transfer and homologous recombination drive the evolution of the nitrogen-fixing symbionts of Medicago species.

Authors:  Xavier Bailly; Isabelle Olivieri; Brigitte Brunel; Jean-Claude Cleyet-Marel; Gilles Béna
Journal:  J Bacteriol       Date:  2007-05-11       Impact factor: 3.490

9.  Cohort study of molecular identification and typing of Mycobacterium abscessus, Mycobacterium massiliense, and Mycobacterium bolletii.

Authors:  Adrian M Zelazny; Jeremy M Root; Yvonne R Shea; Rhonda E Colombo; Isdore C Shamputa; Frida Stock; Sean Conlan; Steven McNulty; Barbara A Brown-Elliott; Richard J Wallace; Kenneth N Olivier; Steven M Holland; Elizabeth P Sampaio
Journal:  J Clin Microbiol       Date:  2009-05-06       Impact factor: 5.948

10.  A simulation study comparing supertree and combined analysis methods using SMIDGen.

Authors:  M Shel Swenson; François Barbançon; Tandy Warnow; C Randal Linder
Journal:  Algorithms Mol Biol       Date:  2010-01-04       Impact factor: 1.405
