Literature DB >> 27377322

PhyPA: Phylogenetic method with pairwise sequence alignment outperforms likelihood methods in phylogenetics involving highly diverged sequences.

Xuhua Xia1.   

Abstract

While pairwise sequence alignment (PSA) by dynamic programming is guaranteed to generate one of the optimal alignments, multiple sequence alignment (MSA) of highly divergent sequences often results in poorly aligned sequences, plaguing all subsequent phylogenetic analysis. One way to avoid this problem is to use only PSA to reconstruct phylogenetic trees, which can only be done with distance-based methods. I compared the accuracy of this new computational approach (named PhyPA for phylogenetics by pairwise alignment) against the maximum likelihood method using MSA (the ML+MSA approach), based on nucleotide, amino acid and codon sequences simulated with different topologies and tree lengths. I present a surprising discovery that the fast PhyPA method consistently outperforms the slow ML+MSA approach for highly diverged sequences even when all optimization options were turned on for the ML+MSA approach. Only when sequences are not highly diverged (i.e., when a reliable MSA can be obtained) does the ML+MSA approach outperforms PhyPA. The true topologies are always recovered by ML with the true alignment from the simulation. However, with MSA derived from alignment programs such as MAFFT or MUSCLE, the recovered topology consistently has higher likelihood than that for the true topology. Thus, the failure to recover the true topology by the ML+MSA is not because of insufficient search of tree space, but by the distortion of phylogenetic signal by MSA methods. I have implemented in DAMBE PhyPA and two approaches making use of multi-gene data sets to derive phylogenetic support for subtrees equivalent to resampling techniques such as bootstrapping and jackknifing.
Copyright © 2016 The Author. Published by Elsevier Inc. All rights reserved.

Keywords:  Codon-based alignment; Deep phylogeny; Distance-based methods; Likelihood-based methods; Sequence alignment

Mesh:

Year:  2016        PMID: 27377322     DOI: 10.1016/j.ympev.2016.07.001

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  5 in total

Review 1.  Bioinformatics and Drug Discovery.

Authors:  Xuhua Xia
Journal:  Curr Top Med Chem       Date:  2017       Impact factor: 3.295

2.  DAMBE6: New Tools for Microbial Genomics, Phylogenetics, and Molecular Evolution.

Authors:  Xuhua Xia
Journal:  J Hered       Date:  2017-06-01       Impact factor: 2.645

3.  The Role of +4U as an Extended Translation Termination Signal in Bacteria.

Authors:  Yulong Wei; Xuhua Xia
Journal:  Genetics       Date:  2016-11-30       Impact factor: 4.562

4.  DAMBE7: New and Improved Tools for Data Analysis in Molecular Biology and Evolution.

Authors:  Xuhua Xia
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

5.  Post-Alignment Adjustment and Its Automation.

Authors:  Xuhua Xia
Journal:  Genes (Basel)       Date:  2021-11-18       Impact factor: 4.096

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.