Literature DB >> 20455267

Accurate prediction of protein folding rates from sequence and sequence-derived residue flexibility and solvent accessibility.

Jianzhao Gao1, Tuo Zhang, Hua Zhang, Shiyi Shen, Jishou Ruan, Lukasz Kurgan.   

Abstract

Protein folding rates vary by several orders of magnitude and they depend on the topology of the fold and the size and composition of the sequence. Although recent works show that the rates can be predicted from the sequence, allowing for high-throughput annotations, they consider only the sequence and its predicted secondary structure. We propose a novel sequence-based predictor, PFR-AF, which utilizes solvent accessibility and residue flexibility predicted from the sequence, to improve predictions and provide insights into the folding process. The predictor includes three linear regressions for proteins with two-state, multistate, and unknown (mixed-state) folding kinetics. PFR-AF on average outperforms current methods when tested on three datasets. The proposed approach provides high-quality predictions in the absence of similarity between the predicted and the training sequences. The PFR-AF's predictions are characterized by high (between 0.71 and 0.95, depending on the dataset) correlation and the lowest (between 0.75 and 0.9) mean absolute errors with respect to the experimental rates, as measured using out-of-sample tests. Our models reveal that for the two-state chains inclusion of solvent-exposed Ala may accelerate the folding, while increased content of Ile may reduce the folding speed. We also demonstrate that increased flexibility of coils facilitates faster folding and that proteins with larger content of solvent-exposed strands may fold at a slower pace. The increased flexibility of the solvent-exposed residues is shown to elongate folding, which also holds, with a lower correlation, for buried residues. Two case studies are included to support our findings.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20455267     DOI: 10.1002/prot.22727

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  7 in total

1.  Real value prediction of protein folding rate change upon point mutation.

Authors:  Liang-Tsung Huang; M Michael Gromiha
Journal:  J Comput Aided Mol Des       Date:  2012-03-18       Impact factor: 3.686

2.  GENN: a GEneral Neural Network for learning tabulated data with examples from protein structure prediction.

Authors:  Eshel Faraggi; Andrzej Kloczkowski
Journal:  Methods Mol Biol       Date:  2015

3.  Accurate single-sequence prediction of solvent accessible surface area using local and global features.

Authors:  Eshel Faraggi; Yaoqi Zhou; Andrzej Kloczkowski
Journal:  Proteins       Date:  2014-09-25

4.  In-silico prediction of disorder content using hybrid sequence representation.

Authors:  Marcin J Mizianty; Tuo Zhang; Bin Xue; Yaoqi Zhou; A Keith Dunker; Vladimir N Uversky; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2011-06-17       Impact factor: 3.169

5.  Lid opening and conformational stability of T1 Lipase is mediated by increasing chain length polar solvents.

Authors:  Jonathan Maiangwa; Thean Chor Leow; Mohd Shukuri Mohamad Ali; Abu Bakar Salleh; Raja Noor Zaliha Raja Abd Rahman; Yahaya M Normi; Fairolniza Mohd Shariff
Journal:  PeerJ       Date:  2017-05-18       Impact factor: 2.984

6.  Sequence analysis on the information of folding initiation segments in ferredoxin-like fold proteins.

Authors:  Masanari Matsuoka; Takeshi Kikuchi
Journal:  BMC Struct Biol       Date:  2014-05-23

7.  The cleverSuite approach for protein characterization: predictions of structural properties, solubility, chaperone requirements and RNA-binding abilities.

Authors:  Petr Klus; Benedetta Bolognesi; Federico Agostini; Domenica Marchese; Andreas Zanzoni; Gian Gaetano Tartaglia
Journal:  Bioinformatics       Date:  2014-02-03       Impact factor: 6.937

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.