| Literature DB >> 32631256 |
Tong Liu1, Zheng Wang2.
Abstract
BACKGROUND: Protein model quality assessment (QA) is an essential procedure in protein structure prediction. QA methods can predict the qualities of protein models and identify good models from decoys. Clustering-based methods need a certain number of models as input. However, if a pool of models are not available, methods that only need a single model as input are indispensable.Entities:
Keywords: Protein energy potentials; Protein model quality assessment; Random forests; Single-model QA
Mesh:
Substances:
Year: 2020 PMID: 32631256 PMCID: PMC7336608 DOI: 10.1186/s12859-020-3383-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Evaluations of our method MASS with four top-performing single-model methods in stage 2 for 75 targets of CASP 11 (groups are ranked by wmPMCC and best results are highlighted in bold)
| Group ID | wmPMCC | Ave loss | Ave ΔGDT | MCC | ROC |
|---|---|---|---|---|---|
| MASS | 0.07029 | 0.00076 | 0.60 | ||
| QAcon | 0.390 | 0.07543 | |||
| Qprob | 0.368 | 0.07540 | 0.00113 | 0.52 | 0.86 |
| ProQ2-refine | 0.351 | 0.07068 | 0.00083 | 0.58 | 0.86 |
| ProQ2 | 0.349 | 0.00085 | 0.57 | 0.85 |
Fig. 1Two examples showing the real GDT-TS scores and MASS predicted scores. a The native 3D structure of T0882 in CASP12 (green) superimposed with the model of IntFOLD4_TS3 (blue). b The native structure of T0760 in CASP11 (green) superimposed with the model of Pcons-net_TS4 (blue)
Evaluations of our method MASS with seven top-ranking single-model methods in stage 2 for 72 targets of CASP 12 (groups are ranked by wmPMCC and best results are highlighted in bold)
| Group ID | wmPMCC | Ave loss | Ave ΔGDT | MCC | ROC |
|---|---|---|---|---|---|
| SVMQA | 0.00091 | 0.62 | 0.90 | ||
| ProQ3 | 0.664 | 0.06073 | 0.00064 | 0.67 | |
| MASS | 0.649 | 0.08744 | 0.00086 | 0.62 | 0.90 |
| QASproGP | 0.634 | 0.07992 | 0.00069 | 0.65 | 0.92 |
| VoroMQA | 0.619 | 0.08169 | 0.00077 | 0.16 | 0.86 |
| DeepQA | 0.616 | 0.08145 | |||
| Myprotein-me | 0.614 | 0.10350 | 0.00089 | 0.44 | 0.80 |
| QMEAN | 0.311 | 0.10546 | 0.00141 | 0.41 | 0.81 |
Evaluations of our method MASS with seven top-ranking single-model methods in stage 2 for 57 targets of CASP 13 (groups are ranked by wmPMCC and best results are highlighted in bold)
| Group ID | wmPMCC | Ave loss | Ave ΔGDT | MCC | ROC |
|---|---|---|---|---|---|
| ModFOLD7 | 0.0936 | 0.00041 | 0.72 | ||
| ModFOLD7_cor | 0.888 | 0.09313 | |||
| ModFOLD7_rank | 0.839 | 0.00091 | 0.64 | 0.93 | |
| FaeNNz | 0.78 | 0.09127 | 0.00083 | 0.58 | 0.89 |
| ProQ4 | 0.773 | 0.08708 | 0.00106 | 0.57 | 0.86 |
| MESHI-enrich-server | 0.756 | 0.08826 | 0.00087 | 0.52 | 0.88 |
| MESHI-corr-server | 0.742 | 0.08727 | 0.00088 | 0.57 | 0.88 |
| VoroMQA-A | 0.721 | 0.08322 | 0.00098 | 0.34 | 0.87 |
| MUFold_server | 0.714 | 0.08675 | 0.00095 | 0.6 | 0.89 |
| VoroMQA-B | 0.69 | 0.07854 | 0.001 | 0.33 | 0.86 |
| MASS | 0.682 | 0.09037 | 0.00106 | 0.54 | 0.85 |
| MULTICOM-NOVEL | 0.667 | 0.07839 | 0.00113 | 0.38 | 0.83 |
| MASS2 | 0.652 | 0.09748 | 0.00124 | 0.46 | 0.83 |
| Bhattacharya-SingQ | 0.638 | 0.08676 | 0.00097 | 0.46 | 0.81 |
| Bhattacharya-Server | 0.601 | 0.11021 | 0.00106 | 0.44 | 0.82 |
| PLU-AngularQA | 0.57 | 0.13504 | 0.00097 | 0.44 | 0.83 |
| PLU-TopQA | 0.026 | 0.20285 | 0.00165 | 0.21 | 0.65 |
Fig. 2The feature importance of 70 features used in MASS. Different feature classes are highlighted with different colors