Literature DB >> 21082205

Determination of protein folding kinetic types using sequence and predicted secondary structure and solvent accessibility.

Hua Zhang1, Tuo Zhang, Jianzhao Gao, Jishou Ruan, Shiyi Shen, Lukasz Kurgan.   

Abstract

Proteins fold through a two-state (TS), with no visible intermediates, or a multi-state (MS), via at least one intermediate, process. We analyze sequence-derived factors that determine folding types by introducing a novel sequence-based folding type predictor called FOKIT. This method implements a logistic regression model with six input features which hybridize information concerning amino acid composition and predicted secondary structure and solvent accessibility. FOKIT provides predictions with average Matthews correlation coefficient (MCC) between 0.58 and 0.91 measured using out-of-sample tests on four benchmark datasets. These results are shown to be competitive or better than results of four modern predictors. We also show that FOKIT outperforms these methods when predicting chains that share low similarity with the chains used to build the model, which is an important advantage given the limited number of annotated chains. We demonstrate that inclusion of solvent accessibility helps in discrimination of the folding kinetic types and that three of the features constitute statistically significant markers that differentiate TS and MS folders. We found that the increased content of exposed Trp and buried Leu are indicative of the MS folding, which implies that the exposure/burial of certain hydrophobic residues may play important role in the formation of the folding intermediates. Our conclusions are supported by two case studies.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21082205     DOI: 10.1007/s00726-010-0805-y

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  5 in total

1.  Improving protein fold recognition using the amalgamation of evolutionary-based and structural based information.

Authors:  Kuldip K Paliwal; Alok Sharma; James Lyons; Abdollah Dehzangi
Journal:  BMC Bioinformatics       Date:  2014-12-08       Impact factor: 3.169

2.  A strategy to select suitable physicochemical attributes of amino acids for protein fold recognition.

Authors:  Alok Sharma; Kuldip K Paliwal; Abdollah Dehzangi; James Lyons; Seiya Imoto; Satoru Miyano
Journal:  BMC Bioinformatics       Date:  2013-07-24       Impact factor: 3.169

3.  Sequence based prediction of DNA-binding proteins based on hybrid feature selection using random forest and Gaussian naïve Bayes.

Authors:  Wangchao Lou; Xiaoqing Wang; Fan Chen; Yixiao Chen; Bo Jiang; Hua Zhang
Journal:  PLoS One       Date:  2014-01-24       Impact factor: 3.240

4.  Quad-PRE: a hybrid method to predict protein quaternary structure attributes.

Authors:  Yajun Sheng; Xingye Qiu; Chen Zhang; Jun Xu; Yanping Zhang; Wei Zheng; Ke Chen
Journal:  Comput Math Methods Med       Date:  2014-05-18       Impact factor: 2.238

5.  A Multifeatures Fusion and Discrete Firefly Optimization Method for Prediction of Protein Tyrosine Sulfation Residues.

Authors:  Song Guo; Chunhua Liu; Peng Zhou; Yanling Li
Journal:  Biomed Res Int       Date:  2016-03-10       Impact factor: 3.411

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.