| Literature DB >> 19566958 |
Kevin W DeRonne1, George Karypis.
Abstract
BACKGROUND: Methods that can automatically assess the quality of computationally predicted protein structures are important, as they enable the selection of the most accurate structure from an ensemble of predictions. Assessment methods that determine the quality of a predicted structure by comparing it against the various structures predicted by different servers have been shown to outperform approaches that rely on the intrinsic characteristics of the structure itself.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19566958 PMCID: PMC2712475 DOI: 10.1186/1472-6807-9-41
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Prediction performance of the static methods
| CD6 | CD7 | |||
| Method | CC | RMSE | CC | RMSE |
| LGA-Distance | 0.68 | 11.71 | ||
| LGA- | 0.66 | 0.74 | 7.78 | |
| LGscore-Distance | 0.66 | 11.46 | 0.77 | 7.66 |
| LGscore- | 11.19 | 0.76 | 8.18 | |
Values in this table represent the Pearson correlation coefficient (CC) and root mean squared error (RMSE) between the predicted and true per-residue distances. Boldfaced entries correspond to the best performing scheme for each dataset and performance assessment metric.
Prediction performance of the weight-learning methods
| CD6 | CD7 | |||
| Method | CC | RMSE | CC | RMSE |
| Support Vector Regression | 0.68 | 10.35 | 0.80 | 6.41 |
| Linear Perceptron | 0.69 | 9.74 | 0.80 | 6.46 |
| Standard Regression | 9.32 | 0.80 | 6.28 | |
| Constrained Regression | ||||
Values in this table represent the Pearson correlation coefficient (CC) and root mean squared error (RMSE) between the predicted and true per-residue distances. Boldfaced entries correspond to the best performing scheme for each dataset and performance assessment metric.
Figure 1Model weights vs Server quality.
Prediction performance of the weight-learning methods with filled values.
| CD6 | CD7 | |||
| Method | CC | RMSE | CC | RMSE |
| Support Vector Regression | 0.69 | 10.21 | 0.80 | 6.36 |
| Linear Perceptron | 0.68 | 11.31 | 0.79 | 7.45 |
| Standard Regression | 0.69 | 0.80 | ||
| Constrained Regression | 10.04 | 6.37 | ||
Values in this table represent the Pearson correlation coefficient (CC) and root mean squared error (RMSE) between the predicted and true per-residue distances. Boldfaced entries correspond to the best performing scheme for each dataset and performance assessment metric.
The average correlation coefficient (CC) and RMSE for the PD6 and PD7 datasets.
| PD6 | PD7 | |||
| Method | CC | RMSE | CC | RMSE |
| Custom training | ||||
| Support Vector Regression | ||||
| Linear Perceptron | 3.81 | 2.93 | ||
| Standard Regression | 0.70 | 6.02 | 0.82 | 3.80 |
| Constrained Regression | 3.46 | |||
| Global training | ||||
| Support Vector Regression | ||||
| Linear Perceptron | 0.88 | 3.64 | 0.89 | 2.90 |
| Standard Regression | 0.89 | 3.20 | 0.90 | 2.76 |
| Constrained Regression | 0.89 | 3.15 | 0.89 | 2.79 |
| Static consensus | ||||
| LGA-Distance | 4.29 | |||
| LGA- | 0.83 | 3.56 | ||
| LGscore-Distance | 0.67 | 5.56 | 0.72 | 4.48 |
| LGscore- | 0.81 | 4.57 | 0.79 | 4.33 |
Values in this table represent the Pearson correlation coefficient (CC) and root mean squared error (RMSE) between the predicted and true per-residue distances. Boldfaced entries correspond to the best performing scheme for each dataset and performance assessment metric.
Figure 2Radius of gyration distribution.
Figure 3Size distribution.