Literature DB >> 15531607

Extracting multiple structural alignments from pairwise alignments: a comparison of a rigorous and a heuristic approach.

Erik Sandelin1.   

Abstract

MOTIVATION: Multiple structural alignments (MSTAs) provide position-specific information on the sequence variability allowed by protein folds. This information can be exploited to better understand the evolution of proteins and the physical chemistry of polypeptide folding. Most MSTA methods rely on a pre-computed library of pairwise alignments. This library will in general contain conflicting residue equivalences not all of which can be realized in the final MSTA. Hence to build a consistent MSTA, these methods have to select a conflict-free subset of equivalences.
RESULTS: Using a dataset with 327 families from SCOP 1.63 we compare the ability of two different methods to select an optimal conflict-free subset of equivalences. One is an implementation of Reinert et al.'s integer linear programming formulation (ILP) of the maximum weight trace problem (Reinert et al., 1997, Proc. 1st Ann. Int. Conf. Comput. Mol. Biol. (RECOMB-97), ACM Press, New York). This ILP formulation is a rigorous approach but its complexity is difficult to predict. The other method is T-Coffee (Notredame et al., 2000) which uses a heuristic enhancement of the equivalence weights which allow it to use the speed and simplicity of the progressive alignment approach while still incorporating information of all alignments in each step of building the MSTA. We find that although the ILP formulation consistently selects a more optimal set of conflict-free equivalences, the differences are small and the quality of the resulting MSTAs are essentially the same for both methods. Given its speed and predictable complexity, our results show that T-Coffee is an attractive alternative for producing high-quality MSTAs.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15531607      PMCID: PMC2692033          DOI: 10.1093/bioinformatics/bti117

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  18 in total

1.  A fully automatic evolutionary classification of protein folds: Dali Domain Dictionary version 3.

Authors:  S Dietmann; J Park; C Notredame; A Heger; M Lappe; L Holm
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

3.  MASS: multiple structural alignment by secondary structures.

Authors:  O Dror; H Benyamini; R Nussinov; H Wolfson
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

4.  3DCoffee: combining protein sequences and structures within multiple sequence alignments.

Authors:  Orla O'Sullivan; Karsten Suhre; Chantal Abergel; Desmond G Higgins; Cédric Notredame
Journal:  J Mol Biol       Date:  2004-07-02       Impact factor: 5.469

5.  Structural similarity of DNA-binding domains of bacteriophage repressors and the globin core.

Authors:  S Subbiah; D V Laurents; M Levitt
Journal:  Curr Biol       Date:  1993-03       Impact factor: 10.834

6.  Definition of general topological equivalence in protein structures. A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming.

Authors:  A Sali; T L Blundell
Journal:  J Mol Biol       Date:  1990-03-20       Impact factor: 5.469

7.  SSAP: sequential structure alignment program for protein structure comparison.

Authors:  C A Orengo; W R Taylor
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

Review 8.  Mapping the protein universe.

Authors:  L Holm; C Sander
Journal:  Science       Date:  1996-08-02       Impact factor: 47.728

9.  A unified statistical framework for sequence comparison and structure comparison.

Authors:  M Levitt; M Gerstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-05-26       Impact factor: 11.205

10.  SCOP: a structural classification of proteins database for the investigation of sequences and structures.

Authors:  A G Murzin; S E Brenner; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1995-04-07       Impact factor: 5.469

View more
  1 in total

1.  An enhanced partial order curve comparison algorithm and its application to analyzing protein folding trajectories.

Authors:  Hong Sun; Hakan Ferhatosmanoglu; Motonori Ota; Yusu Wang
Journal:  BMC Bioinformatics       Date:  2008-08-18       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.