Literature DB >> 10373380

New efficient statistical sequence-dependent structure prediction of short to medium-sized protein loops based on an exhaustive loop classification.

J Wojcik1, J P Mornon, J Chomilier.   

Abstract

A bank of 13,563 loops from three to eight amino acid residues long, representing motifs between two consecutive regular secondary structures, has been derived from protein structures presenting less than 95 % sequence identity. Statistical analyses of occurrences of conformations and residues revealed length-dependent over-representations of particular amino acids (glycine, proline, asparagine, serine, and aspartate) and conformations (alphaL, epsilon, betaPregions of the Ramachandran plot). A position-dependent distribution of these occurrences was observed for N and C-terminal residues, which are correlated to the nature of the flanking regions. Loops of the same length were clustered into statistically meaningful families on the basis of their backbone structures when placed in a common reference frame, independent of the flanks. These clusters present significantly different distributions of sequence, conformations, and endpoint residue Calphadistances. On the basis of the sequence-structure correlation of this clustering, an automatic loop modeling algorithm was developed. Based on the knowledge of its sequence and of its flank backbone structures each query loop is assigned to a family and target loop supports are selected in this family. The support backbones of these target loops are then adjusted on flanking structures by partial exploration of the conformational space. Loop closure is performed by energy minimization for each support and the final model is chosen among connected supports based upon energy criteria. The quality of the prediction is evaluated by the root-mean-square deviation (rmsd) between the final model and the native loops when the whole bank is re-attributed on itself with a Jackknife test. This average rmsd ranges from 1.1 A for three-residue loops to 3.8 A for eight-residue loops. A few poorly predicted loops are inescapable, considering the high level of diversity in loops and the lack of environment data. To overcome such modeling problems, a statistical reliability score was assigned for each prediction. This score is correlated to the quality of the prediction, in terms of rmsd, and thus improves the selection accuracy of the model. The algorithm efficiency was compared to CASP3 target loop predictions. Moreover, when tested on a test loop bank, this algorithm was shown to be robust when the loops are not precisely delimited, therefore proving to be a useful tool in practice for protein modeling. Copyright 1999 Academic Press.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10373380     DOI: 10.1006/jmbi.1999.2826

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  29 in total

1.  Modeling of loops in protein structures.

Authors:  A Fiser; R K Do; A Sali
Journal:  Protein Sci       Date:  2000-09       Impact factor: 6.725

2.  Evaluating conformational free energies: the colony energy and its application to the problem of loop prediction.

Authors:  Zhexin Xiang; Cinque S Soto; Barry Honig
Journal:  Proc Natl Acad Sci U S A       Date:  2002-05-28       Impact factor: 11.205

3.  Extension of a local backbone description using a structural alphabet: a new approach to the sequence-structure relationship.

Authors:  Alexandre G de Brevern; Hélène Valadié; Serge Hazout; Catherine Etchebest
Journal:  Protein Sci       Date:  2002-12       Impact factor: 6.725

4.  Ab initio construction of all-atom loop conformations.

Authors:  Haiyan Jiang; Christian Blouin
Journal:  J Mol Model       Date:  2005-10-25       Impact factor: 1.810

Review 5.  Advances in homology protein structure modeling.

Authors:  Zhexin Xiang
Journal:  Curr Protein Pept Sci       Date:  2006-06       Impact factor: 3.272

Review 6.  Exploring conformational space using a mean field technique with MOLS sampling.

Authors:  P Arun Prasad; V Kanagasabai; J Arunachalam; N Gautham
Journal:  J Biosci       Date:  2007-08       Impact factor: 1.826

7.  Optimization of the GB/SA solvation model for predicting the structure of surface loops in proteins.

Authors:  Agnieszka Szarecka; Hagai Meirovitch
Journal:  J Phys Chem B       Date:  2006-02-16       Impact factor: 2.991

8.  Physical-chemical determinants of turn conformations in globular proteins.

Authors:  Timothy O Street; Nicholas C Fitzkee; Lauren L Perskie; George D Rose
Journal:  Protein Sci       Date:  2007-08       Impact factor: 6.725

9.  ProRegIn: a regularity index for the selection of native-like tertiary structures of proteins.

Authors:  Lipi Thukral; Sandhya R Shenoy; Kumkum Bhushan; B Jayaram
Journal:  J Biosci       Date:  2007-01       Impact factor: 1.826

10.  "Pinning strategy": a novel approach for predicting the backbone structure in terms of protein blocks from sequence.

Authors:  A G De Brevern; C Etchebest; C Benros; S Hazout
Journal:  J Biosci       Date:  2007-01       Impact factor: 1.826

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.