Literature DB >> 20001907

Robust prediction of B-factor profile from sequence using two-stage SVR based on random forest feature selection.

Xiao-Yong Pan1, Hong-Bin Shen.   

Abstract

B-factor is highly correlated with protein internal motion, which is used to measure the uncertainty in the position of an atom within a crystal structure. Although the rapid progress of structural biology in recent years makes more accurate protein structures available than ever, with the avalanche of new protein sequences emerging during the post-genomic Era, the gap between the known protein sequences and the known protein structures becomes wider and wider. It is urgent to develop automated methods to predict B-factor profile from the amino acid sequences directly, so as to be able to timely utilize them for basic research. In this article, we propose a novel approach, called PredBF, to predict the real value of B-factor. We firstly extract both global and local features from the protein sequences as well as their evolution information, then the random forests feature selection is applied to rank their importance and the most important features are inputted to a two-stage support vector regression (SVR) for prediction, where the initial predicted outputs from the 1(st) SVR are further inputted to the 2nd layer SVR for final refinement. Our results have revealed that a systematic analysis of the importance of different features makes us have deep insights into the different contributions of features and is very necessary for developing effective B-factor prediction tools. The two-layer SVR prediction model designed in this study further enhanced the robustness of predicting the B-factor profile. As a web server, PredBF is freely available at: http://www.csbio.sjtu.edu.cn/bioinf/PredBF for academic use.

Mesh:

Substances:

Year:  2009        PMID: 20001907     DOI: 10.2174/092986609789839250

Source DB:  PubMed          Journal:  Protein Pept Lett        ISSN: 0929-8665            Impact factor:   1.890


  21 in total

1.  Predicting protein flexibility through the prediction of local structures.

Authors:  Aurélie Bornot; Catherine Etchebest; Alexandre G de Brevern
Journal:  Proteins       Date:  2010-12-06

2.  Sequence-specific dynamic information in proteins.

Authors:  H A Scheraga; S Rackovsky
Journal:  Proteins       Date:  2019-06-11

3.  Fluctuations of backbone torsion angles obtained from NMR-determined structures and their prediction.

Authors:  Tuo Zhang; Eshel Faraggi; Yaoqi Zhou
Journal:  Proteins       Date:  2010-12

4.  Multiscale multiphysics and multidomain models--flexibility and rigidity.

Authors:  Kelin Xia; Kristopher Opron; Guo-Wei Wei
Journal:  J Chem Phys       Date:  2013-11-21       Impact factor: 3.488

5.  In-silico prediction of disorder content using hybrid sequence representation.

Authors:  Marcin J Mizianty; Tuo Zhang; Bin Xue; Yaoqi Zhou; A Keith Dunker; Vladimir N Uversky; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2011-06-17       Impact factor: 3.169

6.  TANGLE: two-level support vector regression approach for protein backbone torsion angle prediction from primary sequences.

Authors:  Jiangning Song; Hao Tan; Mingjun Wang; Geoffrey I Webb; Tatsuya Akutsu
Journal:  PLoS One       Date:  2012-02-02       Impact factor: 3.240

7.  PredyFlexy: flexibility and local structure prediction from sequence.

Authors:  Alexandre G de Brevern; Aurélie Bornot; Pierrick Craveur; Catherine Etchebest; Jean-Christophe Gelly
Journal:  Nucleic Acids Res       Date:  2012-06-11       Impact factor: 16.971

Review 8.  Protein flexibility in the light of structural alphabets.

Authors:  Pierrick Craveur; Agnel P Joseph; Jeremy Esque; Tarun J Narwani; Floriane Noël; Nicolas Shinada; Matthieu Goguet; Sylvain Leonard; Pierre Poulain; Olivier Bertrand; Guilhem Faure; Joseph Rebehmed; Amine Ghozlane; Lakshmipuram S Swapna; Ramachandra M Bhaskara; Jonathan Barnoud; Stéphane Téletchéa; Vincent Jallu; Jiri Cerny; Bohdan Schneider; Catherine Etchebest; Narayanaswamy Srinivasan; Jean-Christophe Gelly; Alexandre G de Brevern
Journal:  Front Mol Biosci       Date:  2015-05-27

9.  DFLpred: High-throughput prediction of disordered flexible linker regions in protein sequences.

Authors:  Fanchi Meng; Lukasz Kurgan
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

10.  Many local pattern texture features: which is better for image-based multilabel human protein subcellular localization classification?

Authors:  Fan Yang; Ying-Ying Xu; Hong-Bin Shen
Journal:  ScientificWorldJournal       Date:  2014-06-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.