| Literature DB >> 22689641 |
Alexandre G de Brevern1, Aurélie Bornot, Pierrick Craveur, Catherine Etchebest, Jean-Christophe Gelly.
Abstract
Protein structures are necessary for understanding protein function at a molecular level. Dynamics and flexibility of protein structures are also key elements of protein function. So, we have proposed to look at protein flexibility using novel methods: (i) using a structural alphabet and (ii) combining classical X-ray B-factor data and molecular dynamics simulations. First, we established a library composed of structural prototypes (LSPs) to describe protein structure by a limited set of recurring local structures. We developed a prediction method that proposes structural candidates in terms of LSPs and predict protein flexibility along a given sequence. Second, we examine flexibility according to two different descriptors: X-ray B-factors considered as good indicators of flexibility and the root mean square fluctuations, based on molecular dynamics simulations. We then define three flexibility classes and propose a method based on the LSP prediction method for predicting flexibility along the sequence. This method does not resort to sophisticate learning of flexibility but predicts flexibility from average flexibility of predicted local structures. The method is implemented in PredyFlexy web server. Results are similar to those obtained with the most recent, cutting-edge methods based on direct learning of flexibility data conducted with sophisticated algorithms. PredyFlexy can be accessed at http://www.dsimb.inserm.fr/dsimb_tools/predyflexy/.Entities:
Mesh:
Year: 2012 PMID: 22689641 PMCID: PMC3394303 DOI: 10.1093/nar/gks482
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The framework of PredyFlexy and underlying methods. User must give a single sequence as input (a), a PSSM is computed using PSI-BLAST (b) and split into fragments of 11 residues. Prediction of LSPs is done using trained SVMs (c); scores are ranked and the best five are kept (d). Using these LSPs, prediction of flexibility is done in three states (rigid, intermediate and flexible) (e); predicted B-factorNorm, RMSFNorm and a confidence index are also provided (f).
Figure 2.Protein prediction example. The prediction highlights the regions from residue 100 to 150. In regions (a) to (d) are located residues predicted with a high accuracy (confidence index of 15 or better): it represents flexible (a) to rigid (b) with an intermediate to a flexible zone (c) and then coming back to rigid zone (d). Following region (e) is predicted as flexible, but the low confidence index (=3) makes the prediction not reliable.