| Literature DB >> 24776231 |
Renzhi Cao, Zheng Wang, Yiheng Wang, Jianlin Cheng1.
Abstract
BACKGROUND: It is important to predict the quality of a protein structural model before its native structure is known. The method that can predict the absolute local quality of individual residues in a single protein model is rare, yet particularly needed for using, ranking and refining protein models.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24776231 PMCID: PMC4013430 DOI: 10.1186/1471-2105-15-120
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The evaluation results of residue-specific local quality predictions of single-model local quality QA tools (SMOQ) on CASP9 single-domain proteins. Basic (20 targets) denotes the SVM model trained using the basic feature set on 20 CASP8 single-domain targets. Basic (85 targets) denotes the SVM model trained using basic feature set on 85 CASP8 single-domain targets. Basic (20 targets, no homologue) denotes the basic model trained on 20 CASP8 single-domain targets, but tested on the CASP9 single-domain targets that are not homologues of CASP8 targets. Profile and profile+SOV denote the two SVM models using profile and profile+SOV feature set that were trained on 20 CASP8 single-domain targets and tested on CASP9 targets without homologue removal. The absolute difference errors of the predictions were plotted against the real distance deviations.
The average correlation and absolute difference between real and predicted deviation on CASP9 targets for residue-specific quality prediction
| 0.42 | 7.09 | |
| ProQ2 | 0.47 | 6.63 |
| QMEAN | 0.43 | 7.46 |
Figure 2The predicted deviation against real deviation for our basic SVM model and other two local prediction methods (ProQ2 and QMEAN) on 84 CASP9 targets.
Figure 3The absolute difference error between real and predicted deviation against real deviation for our basic SVM model and ProQ2 and QMEAN.
The performance of the global quality predictions of our three tools and the other four methods in terms of average correlation, overall correlation, average real GDT-TS score of top 1 models ranked by each method, and average loss of top 1 models ranked by each method, evaluated on 84 CASP9 single-domain targets
| 0.737 | 0.588 | 0.082 | ||
| 0.658 | 0.589 | 0.080 | ||
| 0.696 | 0.681 | 0.594 | ||
| ModelEvaluator | 0.636 | |||
| ProQ | 0.494 | 0.707 | 0.563 | 0.110 |
| ProQ2 | 0.662 | |||
| QMEAN | 0.078 |
Basic, profile, and profile + SOV are the three single-model local QA tools (SMOQ) presented in this manuscript.
The other four QA predictors are ModelEvaluator (predictor name in CASP9: MULTICOM-NOVEL), ProQ, ProQ2, and QMEAN. Top 3 QA predictors’ performances according to each metric were bolded.
The performance of the QA predictor in terms of average correlation, overall correlation, average real GDT-TS score of top 1 models ranked by each method, and average loss of top 1 models ranked by each method, evaluated on 8 FM (free modeling) CASP9 single-domain targets
| 0.427 | 0.254 | 0.091 | ||
| 0.431 | ||||
| M.-NOVEL | 0.386 | 0.235 | 0.115 | |
| ProQ | 0.478 | 0.437 | 0.266 | 0.090 |
| ProQ2 | 0.529 | |||
| QMEAN | 0.507 | 0.456 | 0.266 | 0.090 |
Top 3 QA predictors’ performances according to each metric were bolded.
Figure 4An example illustrates the real and predicted distances between a model and the native structure. The model is the first model of the MULTICOM-CLUSTER tertiary structure predictor for CASP9 target T0563. (A) The real and predicted distance between the native structure and the model at each amino acid position. (B) The superimposition between the model (green and red) and the native structure (grey). Red highlights the two regions where the model has a relatively large deviation compared with the native structure.