Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Extracting multiple structural alignments from pairwise alignments: a comparison of a rigorous and a heuristic approach.

Literature DB >> 15531607

Extracting multiple structural alignments from pairwise alignments: a comparison of a rigorous and a heuristic approach.

Abstract

MOTIVATION: Multiple structural alignments (MSTAs) provide position-specific information on the sequence variability allowed by protein folds. This information can be exploited to better understand the evolution of proteins and the physical chemistry of polypeptide folding. Most MSTA methods rely on a pre-computed library of pairwise alignments. This library will in general contain conflicting residue equivalences not all of which can be realized in the final MSTA. Hence to build a consistent MSTA, these methods have to select a conflict-free subset of equivalences.
RESULTS: Using a dataset with 327 families from SCOP 1.63 we compare the ability of two different methods to select an optimal conflict-free subset of equivalences. One is an implementation of Reinert et al.'s integer linear programming formulation (ILP) of the maximum weight trace problem (Reinert et al., 1997, Proc. 1st Ann. Int. Conf. Comput. Mol. Biol. (RECOMB-97), ACM Press, New York). This ILP formulation is a rigorous approach but its complexity is difficult to predict. The other method is T-Coffee (Notredame et al., 2000) which uses a heuristic enhancement of the equivalence weights which allow it to use the speed and simplicity of the progressive alignment approach while still incorporating information of all alignments in each step of building the MSTA. We find that although the ILP formulation consistently selects a more optimal set of conflict-free equivalences, the differences are small and the quality of the resulting MSTAs are essentially the same for both methods. Given its speed and predictable complexity, our results show that T-Coffee is an attractive alternative for producing high-quality MSTAs.

Entities: Chemical

Mesh：

Substances：
Proteins

Year: 2004 PMID： 15531607 PMCID： PMC2692033 DOI： 10.1093/bioinformatics/bti117

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

18 in total

1. A fully automatic evolutionary classification of protein folds: Dali Domain Dictionary version 3.

Authors: S Dietmann; J Park; C Notredame; A Heger; M Lappe; L Holm
Journal: Nucleic Acids Res Date: 2001-01-01 Impact factor: 16.971

2. T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors: C Notredame; D G Higgins; J Heringa
Journal: J Mol Biol Date: 2000-09-08 Impact factor: 5.469

3. MASS: multiple structural alignment by secondary structures.

Authors: O Dror; H Benyamini; R Nussinov; H Wolfson
Journal: Bioinformatics Date: 2003 Impact factor: 6.937

4. 3DCoffee: combining protein sequences and structures within multiple sequence alignments.

Authors: Orla O'Sullivan; Karsten Suhre; Chantal Abergel; Desmond G Higgins; Cédric Notredame
Journal: J Mol Biol Date: 2004-07-02 Impact factor: 5.469

5. Structural similarity of DNA-binding domains of bacteriophage repressors and the globin core.

Authors: S Subbiah; D V Laurents; M Levitt
Journal: Curr Biol Date: 1993-03 Impact factor: 10.834

6. Definition of general topological equivalence in protein structures. A procedure involving comparison of properties and relationships through simulated annealing and dynamic programming.

Authors: A Sali; T L Blundell
Journal: J Mol Biol Date: 1990-03-20 Impact factor: 5.469

7. SSAP: sequential structure alignment program for protein structure comparison.

Authors: C A Orengo; W R Taylor
Journal: Methods Enzymol Date: 1996 Impact factor: 1.600

Review 8. Mapping the protein universe.

Authors: L Holm; C Sander
Journal: Science Date: 1996-08-02 Impact factor: 47.728