Literature DB >> 19004001

Evaluating the absolute quality of a single protein model using structural features and support vector machines.

Zheng Wang1, Allison N Tegge, Jianlin Cheng.   

Abstract

Knowing the quality of a protein structure model is important for its appropriate usage. We developed a model evaluation method to assess the absolute quality of a single protein model using only structural features with support vector machine regression. The method assigns an absolute quantitative score (i.e. GDT-TS) to a model by comparing its secondary structure, relative solvent accessibility, contact map, and beta sheet structure with their counterparts predicted from its primary sequence. We trained and tested the method on the CASP6 dataset using cross-validation. The correlation between predicted and true scores is 0.82. On the independent CASP7 dataset, the correlation averaged over 95 protein targets is 0.76; the average correlation for template-based and ab initio targets is 0.82 and 0.50, respectively. Furthermore, the predicted absolute quality scores can be used to rank models effectively. The average difference (or loss) between the scores of the top-ranked models and the best models is 5.70 on the CASP7 targets. This method performs favorably when compared with the other methods used on the same dataset. Moreover, the predicted absolute quality scores are comparable across models for different proteins. These features make the method a valuable tool for model quality assurance and ranking.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19004001     DOI: 10.1002/prot.22275

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  47 in total

1.  Improving threading algorithms for remote homology modeling by combining fragment and template comparisons.

Authors:  Hongyi Zhou; Jeffrey Skolnick
Journal:  Proteins       Date:  2010-07

2.  An iterative self-refining and self-evaluating approach for protein model quality estimation.

Authors:  Zheng Wang; Jianlin Cheng
Journal:  Protein Sci       Date:  2011-11-23       Impact factor: 6.725

3.  GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction.

Authors:  Hongyi Zhou; Jeffrey Skolnick
Journal:  Biophys J       Date:  2011-10-19       Impact factor: 4.033

4.  Predicting Protein Model Quality from Sequence Alignments by Support Vector Machines.

Authors:  Xin Deng; Jilong Li; Jianlin Cheng
Journal:  J Proteomics Bioinform       Date:  2013-11-04

5.  An Improved Integration of Template-Based and Template-Free Protein Structure Modeling Methods and its Assessment in CASP11.

Authors:  Jilong Li; Badri Adhikari; Jianlin Cheng
Journal:  Protein Pept Lett       Date:  2015       Impact factor: 1.890

6.  QAcon: single model quality assessment using protein structural and contact information with machine learning techniques.

Authors:  Renzhi Cao; Badri Adhikari; Debswapna Bhattacharya; Miao Sun; Jie Hou; Jianlin Cheng
Journal:  Bioinformatics       Date:  2017-02-15       Impact factor: 6.937

7.  Elucidating the druggability of the human proteome with eFindSite.

Authors:  Omar Kana; Michal Brylinski
Journal:  J Comput Aided Mol Des       Date:  2019-03-19       Impact factor: 3.686

8.  Recursive protein modeling: a divide and conquer strategy for Protein Structure Prediction and its case study in CASP9.

Authors:  Jianlin Cheng; Jesse Eickholt; Zheng Wang; Xin Deng
Journal:  J Bioinform Comput Biol       Date:  2012-06       Impact factor: 1.122

9.  NNcon: improved protein contact map prediction using 2D-recursive neural networks.

Authors:  Allison N Tegge; Zheng Wang; Jesse Eickholt; Jianlin Cheng
Journal:  Nucleic Acids Res       Date:  2009-05-06       Impact factor: 16.971

10.  MULTICOM: a multi-level combination approach to protein structure prediction and its assessments in CASP8.

Authors:  Zheng Wang; Jesse Eickholt; Jianlin Cheng
Journal:  Bioinformatics       Date:  2010-02-11       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.