
Stability of multiple alignments and phylogenetic trees: an analysis of ABC-transporter proteins family.

Holger Wagner, Burkhard Morgenstern, Andreas Dress.

Abstract

BACKGROUND: Sequence-based phylogeny reconstruction is a fundamental task in bioinformatics. Practically all methods for phylogeny reconstruction are based on multiple alignments. The quality and stability of the underlying alignments are therefore crucial for phylogenetic analysis.
RESULTS: In this short report, we investigate alignments and alignment-based phylogenies constructed for a set of 22 ABC transporters using CLUSTAL W and DIALIGN. Comparing the 22 "one-out phylogenies" obtainable for this sequence set (each built after leaving out one sequence) reveals some intrinsic phylogenetic instability, even when attention is restricted to branches with high bootstrap frequencies, the so-called safe branches. We show that this instability arises because both CLUSTAL W and DIALIGN apparently get "confused" by sequence repeats in some of the ABC transporters. To deal with such problems, two new DIALIGN options are introduced that prove helpful in our context: the "exclude-fragment" (or "xfr") option and the "self-comparison" (or "sc") option.
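The one-out comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' procedure: trees are assumed to be given already as sets of splits (bipartitions of the taxon set), the 0.9 support threshold for a "safe" branch is an arbitrary choice, and the names `canonical`, `is_trivial`, and `lost_safe_branches` are hypothetical helpers introduced here.

```python
from typing import Dict, FrozenSet, List, Set

Split = FrozenSet[str]  # one side of a bipartition of the taxa


def canonical(side: Set[str], taxa: Set[str]) -> Split:
    """Return the side of the split that does not contain the smallest taxon name,
    so that both ways of writing the same split map to the same key."""
    ref = min(taxa)
    return frozenset(taxa - side if ref in side else side)


def is_trivial(side: Set[str], taxa: Set[str]) -> bool:
    """A split is trivial if one side has fewer than two taxa."""
    return len(side) < 2 or len(taxa - side) < 2


def lost_safe_branches(full_splits: Dict[Split, float],
                       one_out_splits: Set[Split],
                       taxa: Set[str],
                       removed: str,
                       threshold: float = 0.9) -> List[Split]:
    """List safe branches of the full tree that are missing from the one-out tree.

    full_splits maps each split of the full tree to its bootstrap support;
    threshold (an assumed value) decides which branches count as safe.
    """
    remaining = taxa - {removed}
    one_out = {canonical(set(s), remaining) for s in one_out_splits}
    lost = []
    for side, support in full_splits.items():
        if support < threshold:
            continue  # not a safe branch, ignore
        restricted = set(side) - {removed}
        if is_trivial(restricted, remaining):
            continue  # the branch disappears once the taxon is removed
        if canonical(restricted, remaining) not in one_out:
            lost.append(side)
    return lost


# Toy data: six taxa, full tree ((A,B),(C,D),(E,F)), then F is left out.
taxa = {"A", "B", "C", "D", "E", "F"}
full_splits = {
    frozenset({"A", "B"}): 0.95,
    frozenset({"C", "D"}): 0.99,
    frozenset({"E", "F"}): 0.60,
}
# One-out tree without F: ((A,B),(C,E),D)
one_out_splits = {frozenset({"A", "B"}), frozenset({"C", "E"})}

lost = lost_safe_branches(full_splits, one_out_splits, taxa, removed="F")
```

In this toy case the safe branch separating C and D from the rest is reported as lost, while the branch separating A and B survives. Repeating the check for each left-out sequence gives one simple measure of how stable the safe branches are.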
CONCLUSION: "One-out strategies", known to be a useful tool for testing the stability of all sorts of data-analysis procedures, can also be used successfully to test alignment stability. If instabilities are observed, the sequences under consideration should be carefully checked for possible causes. If sequence repeats are suspected to be the cause, the new "sc" option can be used to detect such repeats, and the "xfr" option can help to resolve the resulting problems.
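To illustrate what checking a sequence for repeats can look like, here is a minimal Python sketch that compares a sequence with itself by indexing its k-mers. It is not DIALIGN's "sc" option, whose algorithm is not described in this abstract. The function `repeated_kmers`, the choice of k, and the toy sequence are assumptions made for illustration only.

```python
from collections import defaultdict
from typing import Dict, List


def repeated_kmers(seq: str, k: int = 8) -> Dict[str, List[int]]:
    """Return k-mers that occur at least twice without overlapping,
    mapped to their 0-based start positions in seq."""
    positions: Dict[str, List[int]] = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    # Keep only k-mers whose first and last occurrences do not overlap.
    return {kmer: pos for kmer, pos in positions.items()
            if len(pos) > 1 and pos[-1] - pos[0] >= k}


# Toy protein-like sequence with one segment duplicated (made-up data).
motif = "LSGGQRQR"
seq = "MKT" + motif + "VAIA" + motif + "PE"

repeats = repeated_kmers(seq, k=8)
```

Here `repeats` contains the duplicated segment with start positions 3 and 15. Exact k-mer matching only finds identical copies; diverged repeats, which are the common case in real proteins, would need a scoring-based self-alignment instead.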


Year:  2008        PMID: 18990223      PMCID: PMC2637874          DOI: 10.1186/1748-7188-3-15

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


  2 in total

1.  Identification and Characterization of microRNA319a and Its Putative Target Gene, PvPCF5, in the Bioenergy Grass Switchgrass (Panicum virgatum).

Authors:  Qi Xie; Xue Liu; Yinbing Zhang; Jinfu Tang; Dedong Yin; Bo Fan; Lihuang Zhu; Liebao Han; Guilong Song; Dayong Li
Journal:  Front Plant Sci       Date:  2017-03-30       Impact factor: 5.753

2.  DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment.

Authors:  Amarendran R Subramanian; Michael Kaufmann; Burkhard Morgenstern
Journal:  Algorithms Mol Biol       Date:  2008-05-27       Impact factor: 1.405

