Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Identifying sequence-structure pairs undetected by sequence alignments.

Literature DB >> 10906342

Identifying sequence-structure pairs undetected by sequence alignments.

Abstract

We examine how effectively simple potential functions previously developed can identify compatibilities between sequences and structures of proteins for database searches. The potential function consists of pairwise contact energies, repulsive packing potentials of residues for overly dense arrangement and short-range potentials for secondary structures, all of which were estimated from statistical preferences observed in known protein structures. Each potential energy term was modified to represent compatibilities between sequences and structures for globular proteins. Pairwise contact interactions in a sequence-structure alignment are evaluated in a mean field approximation on the basis of probabilities of site pairs to be aligned. Gap penalties are assumed to be proportional to the number of contacts at each residue position, and as a result gaps will be more frequently placed on protein surfaces than in cores. In addition to minimum energy alignments, we use probability alignments made by successively aligning site pairs in order by pairwise alignment probabilities. The results show that the present energy function and alignment method can detect well both folds compatible with a given sequence and, inversely, sequences compatible with a given fold, and yield mostly similar alignments for these two types of sequence and structure pairs. Probability alignments consisting of most reliable site pairs only can yield extremely small root mean square deviations, and including less reliable pairs increases the deviations. Also, it is observed that secondary structure potentials are usefully complementary to yield improved alignments with this method. Remarkably, by this method some individual sequence-structure pairs are detected having only 5-20% sequence identity.

Mesh：

Year: 2000 PMID： 10906342 DOI： 10.1093/protein/13.7.459

Source DB: PubMed Journal: Protein Eng ISSN： 0269-2139

Keyword Cloud
Cited

8 in total

1. Identification of related proteins with weak sequence identity using secondary structure information.

Authors: C Geourjon; C Combet; C Blanchet; G Deléage
Journal: Protein Sci Date: 2001-04 Impact factor: 6.725

2. Statistical potentials for fold assessment.

Authors: Francisco Melo; Roberto Sánchez; Andrej Sali
Journal: Protein Sci Date: 2002-02 Impact factor: 6.725

3. Statistical potential for assessment and prediction of protein structures.

Authors: Min-Yi Shen; Andrej Sali
Journal: Protein Sci Date: 2006-11 Impact factor: 6.725

4. Computational immunology meets bioinformatics: the use of prediction tools for molecular binding in the simulation of the immune system.

Authors: Nicolas Rapin; Ole Lund; Massimo Bernaschi; Filippo Castiglione
Journal: PLoS One Date: 2010-04-16 Impact factor: 3.240

Identifying sequence-structure pairs undetected by sequence alignments.

1. Identification of related proteins with weak sequence identity using secondary structure information.

2. Statistical potentials for fold assessment.

3. Statistical potential for assessment and prediction of protein structures.

4. Computational immunology meets bioinformatics: the use of prediction tools for molecular binding in the simulation of the immune system.

5. From isotropic to anisotropic side chain representations: comparison of three models for residue contact estimation.

6. Predicting binding sites of hydrolase-inhibitor complexes by combining several methods.

7. BioShell-Threading: versatile Monte Carlo package for protein 3D threading.

8. Profile conditional random fields for modeling protein families with structural information.