Literature DB >> 10906342

Identifying sequence-structure pairs undetected by sequence alignments.

S Miyazawa1, R L Jernigan.   

Abstract

We examine how effectively simple potential functions previously developed can identify compatibilities between sequences and structures of proteins for database searches. The potential function consists of pairwise contact energies, repulsive packing potentials of residues for overly dense arrangement and short-range potentials for secondary structures, all of which were estimated from statistical preferences observed in known protein structures. Each potential energy term was modified to represent compatibilities between sequences and structures for globular proteins. Pairwise contact interactions in a sequence-structure alignment are evaluated in a mean field approximation on the basis of probabilities of site pairs to be aligned. Gap penalties are assumed to be proportional to the number of contacts at each residue position, and as a result gaps will be more frequently placed on protein surfaces than in cores. In addition to minimum energy alignments, we use probability alignments made by successively aligning site pairs in order by pairwise alignment probabilities. The results show that the present energy function and alignment method can detect well both folds compatible with a given sequence and, inversely, sequences compatible with a given fold, and yield mostly similar alignments for these two types of sequence and structure pairs. Probability alignments consisting of most reliable site pairs only can yield extremely small root mean square deviations, and including less reliable pairs increases the deviations. Also, it is observed that secondary structure potentials are usefully complementary to yield improved alignments with this method. Remarkably, by this method some individual sequence-structure pairs are detected having only 5-20% sequence identity.

Mesh:

Year:  2000        PMID: 10906342     DOI: 10.1093/protein/13.7.459

Source DB:  PubMed          Journal:  Protein Eng        ISSN: 0269-2139


  8 in total

1.  Identification of related proteins with weak sequence identity using secondary structure information.

Authors:  C Geourjon; C Combet; C Blanchet; G Deléage
Journal:  Protein Sci       Date:  2001-04       Impact factor: 6.725

2.  Statistical potentials for fold assessment.

Authors:  Francisco Melo; Roberto Sánchez; Andrej Sali
Journal:  Protein Sci       Date:  2002-02       Impact factor: 6.725

3.  Statistical potential for assessment and prediction of protein structures.

Authors:  Min-Yi Shen; Andrej Sali
Journal:  Protein Sci       Date:  2006-11       Impact factor: 6.725

4.  Computational immunology meets bioinformatics: the use of prediction tools for molecular binding in the simulation of the immune system.

Authors:  Nicolas Rapin; Ole Lund; Massimo Bernaschi; Filippo Castiglione
Journal:  PLoS One       Date:  2010-04-16       Impact factor: 3.240

5.  From isotropic to anisotropic side chain representations: comparison of three models for residue contact estimation.

Authors:  Weitao Sun; Jing He
Journal:  PLoS One       Date:  2011-04-28       Impact factor: 3.240

6.  Predicting binding sites of hydrolase-inhibitor complexes by combining several methods.

Authors:  Taner Z Sen; Andrzej Kloczkowski; Robert L Jernigan; Changhui Yan; Vasant Honavar; Kai-Ming Ho; Cai-Zhuang Wang; Yungok Ihm; Haibo Cao; Xun Gu; Drena Dobbs
Journal:  BMC Bioinformatics       Date:  2004-12-17       Impact factor: 3.169

7.  BioShell-Threading: versatile Monte Carlo package for protein 3D threading.

Authors:  Pawel Gniewek; Andrzej Kolinski; Andrzej Kloczkowski; Dominik Gront
Journal:  BMC Bioinformatics       Date:  2014-01-20       Impact factor: 3.169

8.  Profile conditional random fields for modeling protein families with structural information.

Authors:  Akira R Kinjo
Journal:  Biophysics (Nagoya-shi)       Date:  2009-05-30
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.