Literature DB >> 16049194

Assessment of protein distance measures and tree-building methods for phylogenetic tree reconstruction.

Volker Hollich1, Lena Milchert, Lars Arvestad, Erik L L Sonnhammer.   

Abstract

Distance-based methods are popular for reconstructing evolutionary trees of protein sequences, mainly because of their speed and generality. A number of variants of the classical neighbor-joining (NJ) algorithm have been proposed, as well as a number of methods to estimate protein distances. We here present a large-scale assessment of performance in reconstructing the correct tree topology for the most popular algorithms. The programs BIONJ, FastME, Weighbor, and standard NJ were run using 12 distance estimators, producing 48 tree-building/distance estimation method combinations. These were evaluated on a test set based on real trees taken from 100 Pfam families. Each tree was used to generate multiple sequence alignments with the ROSE program using three evolutionary models. The accuracy of each method was analyzed as a function of both sequence divergence and location in the tree. We found that BIONJ produced the overall best results, although the average accuracy differed little between the tree-building methods (normally less than 1%). A noticeable trend was that FastME performed poorer than the rest on long branches. Weighbor was several orders of magnitude slower than the other programs. Larger differences were observed when using different distance estimators. Protein-adapted Jukes-Cantor and Kimura distance correction produced clearly poorer results than the other methods, even worse than uncorrected distances. We also assessed the recently developed Scoredist measure, which performed equally well as more complex methods.

Mesh:

Substances:

Year:  2005        PMID: 16049194     DOI: 10.1093/molbev/msi224

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  7 in total

1.  A novel method for protein-protein interaction site prediction using phylogenetic substitution models.

Authors:  David La; Daisuke Kihara
Journal:  Proteins       Date:  2011-10-12

2.  MultiSeq: unifying sequence and structure data for evolutionary analysis.

Authors:  Elijah Roberts; John Eargle; Dan Wright; Zaida Luthey-Schulten
Journal:  BMC Bioinformatics       Date:  2006-08-16       Impact factor: 3.169

3.  Whole-genome-based phylogeny of African swine fever virus.

Authors:  Levon Aslanyan; Hranush Avagyan; Zaven Karalyan
Journal:  Vet World       Date:  2020-10-10

4.  A pore-forming protein drives macropinocytosis to facilitate toad water maintaining.

Authors:  Zhong Zhao; Zhi-Hong Shi; Chen-Jun Ye; Yun Zhang
Journal:  Commun Biol       Date:  2022-07-22

5.  FastTree: computing large minimum evolution trees with profiles instead of a distance matrix.

Authors:  Morgan N Price; Paramvir S Dehal; Adam P Arkin
Journal:  Mol Biol Evol       Date:  2009-04-17       Impact factor: 16.240

6.  Performance comparison between k-tuple distance and four model-based distances in phylogenetic tree reconstruction.

Authors:  Kuan Yang; Liqing Zhang
Journal:  Nucleic Acids Res       Date:  2008-02-22       Impact factor: 16.971

7.  Genomic Diversity and Evolution of Quasispecies in Newcastle Disease Virus Infections.

Authors:  Archana Jadhav; Lele Zhao; Weiwei Liu; Chan Ding; Venugopal Nair; Sebastian E Ramos-Onsins; Luca Ferretti
Journal:  Viruses       Date:  2020-11-14       Impact factor: 5.048

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.