Literature DB >> 21252072

Critical assessment of high-throughput standalone methods for secondary structure prediction.

Hua Zhang1, Tuo Zhang, Ke Chen, Kanaka Durga Kedarisetti, Marcin J Mizianty, Qingbo Bao, Wojciech Stach, Lukasz Kurgan.   

Abstract

Sequence-based prediction of protein secondary structure (SS) enjoys wide-spread and increasing use for the analysis and prediction of numerous structural and functional characteristics of proteins. The lack of a recent comprehensive and large-scale comparison of the numerous prediction methods results in an often arbitrary selection of a SS predictor. To address this void, we compare and analyze 12 popular, standalone and high-throughput predictors on a large set of 1975 proteins to provide in-depth, novel and practical insights. We show that there is no universally best predictor and thus detailed comparative studies are needed to support informed selection of SS predictors for a given application. Our study shows that the three-state accuracy (Q3) and segment overlap (SOV3) of the SS prediction currently reach 82% and 81%, respectively. We demonstrate that carefully designed consensus-based predictors improve the Q3 by additional 2% and that homology modeling-based methods are significantly better by 1.5% Q3 than ab initio approaches. Our empirical analysis reveals that solvent exposed and flexible coils are predicted with a higher quality than the buried and rigid coils, while inverse is true for the strands and helices. We also show that longer helices are easier to predict, which is in contrast to longer strands that are harder to find. The current methods confuse 1-6% of strand residues with helical residues and vice versa and they perform poorly for residues in the β- bridge and 3(10)-helix conformations. Finally, we compare predictions of the standalone implementations of four well-performing methods with their corresponding web servers.

Mesh:

Substances:

Year:  2011        PMID: 21252072     DOI: 10.1093/bib/bbq088

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  15 in total

1.  CONFOLD: Residue-residue contact-guided ab initio protein folding.

Authors:  Badri Adhikari; Debswapna Bhattacharya; Renzhi Cao; Jianlin Cheng
Journal:  Proteins       Date:  2015-06-06

Review 2.  From local structure to a global framework: recognition of protein folds.

Authors:  Agnel Praveen Joseph; Alexandre G de Brevern
Journal:  J R Soc Interface       Date:  2014-04-16       Impact factor: 4.118

3.  Mapping membrane activity in undiscovered peptide sequence space using machine learning.

Authors:  Ernest Y Lee; Benjamin M Fulan; Gerard C L Wong; Andrew L Ferguson
Journal:  Proc Natl Acad Sci U S A       Date:  2016-11-14       Impact factor: 11.205

4.  SPINE X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles.

Authors:  Eshel Faraggi; Tuo Zhang; Yuedong Yang; Lukasz Kurgan; Yaoqi Zhou
Journal:  J Comput Chem       Date:  2011-11-02       Impact factor: 3.376

5.  Comparative Analysis on Alignment-Based and Pretrained Feature Representations for the Identification of DNA-Binding Proteins.

Authors:  Die Chen; Hua Zhang; Zeqi Chen; Bo Xie; Ye Wang
Journal:  Comput Math Methods Med       Date:  2022-06-28       Impact factor: 2.809

Review 6.  Machine learning-enabled discovery and design of membrane-active peptides.

Authors:  Ernest Y Lee; Gerard C L Wong; Andrew L Ferguson
Journal:  Bioorg Med Chem       Date:  2017-07-08       Impact factor: 3.641

7.  Sequence-derived structural features driving proteolytic processing.

Authors:  Alexander A Belushkin; Dmitry V Vinogradov; Mikhail S Gelfand; Andrei L Osterman; Piotr Cieplak; Marat D Kazanov
Journal:  Proteomics       Date:  2013-12-11       Impact factor: 3.984

8.  Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features.

Authors:  Ya-Nan Zhang; Dong-Jun Yu; Shu-Sen Li; Yong-Xian Fan; Yan Huang; Hong-Bin Shen
Journal:  BMC Bioinformatics       Date:  2012-05-31       Impact factor: 3.169

9.  TANGLE: two-level support vector regression approach for protein backbone torsion angle prediction from primary sequences.

Authors:  Jiangning Song; Hao Tan; Mingjun Wang; Geoffrey I Webb; Tatsuya Akutsu
Journal:  PLoS One       Date:  2012-02-02       Impact factor: 3.240

10.  Bayesian model of protein primary sequence for secondary structure prediction.

Authors:  Qiwei Li; David B Dahl; Marina Vannucci; Jerry W Tsai
Journal:  PLoS One       Date:  2014-10-14       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.