Barking up the wrong treelength: the impact of gap penalty on alignment and tree accuracy.

Kevin Liu, Serita Nelesen, Sindhu Raghavan, C Randal Linder, Tandy Warnow.

Abstract

Several methods have been developed for simultaneous estimation of alignment and tree, of which POY is the most popular. In a 2007 paper published in Systematic Biology, Ogden and Rosenberg reported on a simulation study in which they compared POY to estimating the alignment using ClustalW and then analyzing the resultant alignment using maximum parsimony. They found that ClustalW+MP outperformed POY with respect to alignment and phylogenetic tree accuracy, and they concluded that simultaneous estimation techniques are not competitive with two-phase techniques. Our paper presents a simulation study in which we focus on the NP-hard optimization problem that POY addresses: minimizing treelength. Our study considers the impact of the gap penalty and suggests that the poor performance observed for POY by Ogden and Rosenberg is due to the simple gap penalties they used to score alignment/tree pairs. Our study suggests that optimizing under an affine gap penalty might produce alignments that are better than ClustalW alignments, and competitive with those produced by the best current alignment methods. We also show that optimizing under this affine gap penalty produces trees whose topological accuracy is better than ClustalW+MP, and competitive with the current best two-phase methods.
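To make the distinction between a simple and an affine gap penalty concrete, the following is a minimal sketch and not the scoring code used by POY or by the authors. It assumes a simple penalty charges a fixed cost per gapped position, while an affine penalty charges an opening cost once per run of gaps plus an extension cost per gapped position. The function names (`gap_runs`, `pairwise_cost`, `treelength`) and all numeric costs are hypothetical, chosen only for illustration. The `treelength` helper scores a fixed set of already-aligned rows along given tree edges; it does not perform the NP-hard optimization itself.

```python
def gap_runs(row):
    """Return the lengths of the maximal runs of '-' in one aligned row."""
    runs, current = [], 0
    for ch in row:
        if ch == "-":
            current += 1
        elif current:
            runs.append(current)
            current = 0
    if current:
        runs.append(current)
    return runs


def pairwise_cost(a, b, mismatch, gap_open, gap_extend):
    """Cost of a given pairwise alignment (two equal-length gapped rows).

    Each gap run of length n costs gap_open + gap_extend * n.
    Setting gap_open = 0 gives a simple (per-position) gap penalty.
    """
    if len(a) != len(b):
        raise ValueError("aligned rows must have equal length")
    # Drop columns that are gapped in both rows; they carry no event.
    pairs = [(x, y) for x, y in zip(a, b) if not (x == "-" and y == "-")]
    row_a = "".join(x for x, _ in pairs)
    row_b = "".join(y for _, y in pairs)
    substitutions = sum(
        1 for x, y in pairs if x != "-" and y != "-" and x != y
    )
    runs = gap_runs(row_a) + gap_runs(row_b)
    gap_cost = sum(gap_open + gap_extend * n for n in runs)
    return mismatch * substitutions + gap_cost


def treelength(rows, edges, mismatch, gap_open, gap_extend):
    """Sum the pairwise cost over tree edges for fixed aligned rows.

    rows maps node name -> aligned row (internal nodes included);
    edges is a list of (node, node) pairs.
    """
    return sum(
        pairwise_cost(rows[u], rows[v], mismatch, gap_open, gap_extend)
        for u, v in edges
    )


# Two alignments of the same pair of sequences, each with three gapped
# positions: one contiguous gap versus three scattered gaps.
contiguous = ("AAAA---", "AAAAAAA")
scattered = ("A-A-A-A", "AAAAAAA")

simple = dict(mismatch=1, gap_open=0, gap_extend=1)
affine = dict(mismatch=1, gap_open=4, gap_extend=1)

print(pairwise_cost(*contiguous, **simple), pairwise_cost(*scattered, **simple))
print(pairwise_cost(*contiguous, **affine), pairwise_cost(*scattered, **affine))
```

Under the simple penalty in this sketch the two alignments receive the same cost, so the score cannot prefer one long indel over several scattered ones. Under the affine penalty the contiguous gap is cheaper, which is the kind of difference in scoring that the study examines.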

Year:  2009        PMID: 19179695     DOI: 10.1109/TCBB.2008.63

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  11 in total

1.  Large-scale multiple sequence alignment and tree estimation using SATé.

Authors:  Kevin Liu; Tandy Warnow
Journal:  Methods Mol Biol       Date:  2014

2.  SATCHMO-JS: a webserver for simultaneous protein multiple sequence alignment and phylogenetic tree construction.

Authors:  Raffi Hagopian; John R Davidson; Ruchira S Datta; Bushra Samad; Glen R Jarvis; Kimmen Sjölander
Journal:  Nucleic Acids Res       Date:  2010-04-29       Impact factor: 16.971

3.  The tree alignment problem.

Authors:  Andrés Varón; Ward C Wheeler
Journal:  BMC Bioinformatics       Date:  2012-11-09       Impact factor: 3.169

4.  Multiple sequence alignment: a major challenge to large-scale phylogenetics.

Authors:  Kevin Liu; C Randal Linder; Tandy Warnow
Journal:  PLoS Curr       Date:  2010-11-19

5.  Standard maximum likelihood analyses of alignments with gaps can be statistically inconsistent.

Authors:  Tandy Warnow
Journal:  PLoS Curr       Date:  2012-03-09

6.  Treelength optimization for phylogeny estimation.

Authors:  Kevin Liu; Tandy Warnow
Journal:  PLoS One       Date:  2012-03-19       Impact factor: 3.240

7.  Accurate reconstruction of insertion-deletion histories by statistical phylogenetics.

Authors:  Oscar Westesson; Gerton Lunter; Benedict Paten; Ian Holmes
Journal:  PLoS One       Date:  2012-04-20       Impact factor: 3.240

8.  Structural homology guided alignment of cysteine rich proteins.

Authors:  Thomas M A Shafee; Andrew J Robinson; Nicole van der Weerden; Marilyn A Anderson
Journal:  Springerplus       Date:  2016-01-12

9.  Local search for the generalized tree alignment problem.

Authors:  Andrés Varón; Ward C Wheeler
Journal:  BMC Bioinformatics       Date:  2013-02-26       Impact factor: 3.169

10.  Origin and higher-level diversification of acariform mites - evidence from nuclear ribosomal genes, extensive taxon sampling, and secondary structure alignment.

Authors:  A R Pepato; P B Klimov
Journal:  BMC Evol Biol       Date:  2015-09-02       Impact factor: 3.260
