| Literature DB >> 22963006 |
Arjun Ray1, Erik Lindahl, Björn Wallner.
Abstract
BACKGROUND: Employing methods to assess the quality of modeled protein structures is now standard practice in bioinformatics. In a broad sense, the techniques can be divided into methods relying on consensus prediction on the one hand, and single-model methods on the other. Consensus methods frequently perform very well when there is a clear consensus, but this is not always the case. In particular, they frequently fail in selecting the best possible model in the hard cases (lacking consensus) or in the easy cases where models are very similar. In contrast, single-model methods do not suffer from these drawbacks and could potentially be applied on any protein of interest to assess quality or as a scoring function for sampling-based refinement.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22963006 PMCID: PMC3584948 DOI: 10.1186/1471-2105-13-224
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Pearson’s correlation coefficient for different input features
| Atom | 0.43 (±0.006) |
| Residue | 0.27 (±0.008) |
| Surface | 0.47 (±0.006) |
| Residue + Profile Weighting | 0.32 (±0.007) |
| Surface + Profile Weighting | 0.51 (±0.006) |
| Base + Global Surface Area Prediction | 0.65 (±0.005) |
| Base + Global Secondary Structure Pred. | 0.65 (±0.005) |
| Base + Profile Weighting | 0.62 (±0.005) |
| Base + Local Surface Area Prediction | 0.58 (±0.005) |
| Base + Local Secondary Structure Pred. | 0.58 (±0.005) |
| Base + Information per position (Conservation) | 0.56 (±0.006) |
| All Combined (ProQ2) | 0.71 (±0.004) |
Overall Pearson’s correlation coefficient, for different input features, benchmarked using cross-validation on the CASP7 data set. (Errors correspond to 99.9% confidence intervals).
Figure 1Optimization of linear combination of ProQ2 and Pcons to improve model selection.
Description of the methods included in the benchmark
| ProQ∗ (S) | Neural network trained on structural features to predict LGscore [ |
| QMEAN (S) | Potential of mean force, top-ranked single MQAP in CASP8 and CASP9 [ |
| MetaMQAP (S) | Neural network trained on the output from primary MQAPs [ |
| Distill_NNPIF (S) | Neural network trained on CA-CA interactions [ |
| ConQuass (S) | Correlates conservation and solvent accessibility, only global [ |
| MULTICOM-CMFR (S) | Top-ranked single MQAP in CASP8, only global [ |
| QMEANclust (C) | QMEAN-weighted GDT_TS averaging, top-ranked consensus method MQAP in CASP8 and CASP9 [ |
Description of the single-model methods and the reference consensus method included in the benchmark. The single methods (S) do not use any template or consensus information. Consensus and hybrid methods (C) are free to use any type of information. ∗This method was originally called ProQres, but for clarity it will be referred to as ProQ both for global and local quality prediction. (S) single-model method (C) consensus method.
Local model quality benchmark on the CASP8/CASP9 data sets
| ProQ2 | 0.70/0.68 | 0.58/0.54 | 0.54/0.47 |
| MetaMQAP | –/0.62 | –/0.48 | –/0.42 |
| QMEAN | 0.59/0.59 | 0.51/0.49 | 0.49/0.44 |
| ProQ | 0.52/0.49 | 0.46/0.42 | 0.45/0.40 |
| QMEANclust | 0.83/0.77 | 0.73/0.70 | 0.68/0.61 |
Benchmark of local model quality on the CASP8/CASP9 data sets measured by correlations. R is overall Pearson’s correlation, 〈R〉 is the average correlation per target, 〈R〉 is the average correlation per model. First value correspond to CASP8, second to CASP9. The standard error is <0.002.
Statistical significance test for local quality prediction
| ProQ2 | 1 | | -127.47 | -142.98 | -170.37 | -152.64 |
| MetaMQAP | 2 | n/a | | -95.06 | -152.15 | -170.69 |
| QMEAN | 3 | -146.99 | n/a | | -140.19 | -176.87 |
| ProQ | 4 | -164.75 | n/a | -121.07 | | -191.26 |
| QMEANclust | 5 | -168.81 | n/a | -188.12 | -195.66 |
Pairwise statistical significance test on the correlation coefficients for local quality prediction from CASP8 (below diagonal) and CASP9 (above diagonal). The values correspond to the logarithm of the P-value for methods being statistically indistinguishable, obtained by comparing the distribution of Fisher’s Z (Eq: 1) for the correlation coefficients, R, from Table 3.
Figure 2Local quality prediction performance as measured by the average distance deviation for different fraction of top ranking residues for CASP8 (A) and CASP9 (B).
Figure 3Accuracy for finding correct and incorrect residues for different coverage levels on CASP8 (A) and CASP9 (B).
Benchmark of global model quality
| 0.80/0.80 | 0.72/0.69 | 75.2/47.0 | 100.4/68.6 | |
| QMEAN | 0.75/0.77 | 0.71/0.66 | 73.6/44.7 | 81.1/52.1 |
| MetaMQAP | –/0.76 | –/0.59 | –/43.1 | –/40.3 |
| ConQuass | –/0.73 | –/0.66 | –/40.4 | –/20.4 |
| Distill_NNPIF | –/0.71 | –/0.64 | –/43.9 | –/43.5 |
| MULTICOM-CMFR | 0.71/– | 0.68/– | 74.0/– | 83.7/– |
| ProQ | 0.67/0.68 | 0.65/0.54 | 71.5/42.3 | 59.3/40.0 |
| QMEANclust | 0.89/0.96 | 0.94/0.91 | 75.8/48.6 | 104.1/81.41 |
| MULTICOM-CLUSTER | 0.96 | 0.91 | 48.7 | 82.3 |
| Mufold | 0.96 | 0.91 | 48.7 | 82.5 |
| 0.89/0.95 | 0.94/0.89 | 76.9/48.7 | 118.5/81.6 | |
| Pcons | 0.89/0.95 | 0.95/0.91 | 75.9/48.3 | 101.6/76.8 |
| PconsM | 0.95 | 0.90 | 47.9 | 70.2 |
| United3D | 0.95 | 0.92 | 48.8 | 81.2 |
| MUFOLD-QA | 0.95 | 0.92 | 48.3 | 79.5 |
| ModFOLDclust2 | 0.95 | 0.90 | 48.4 | 80.6 |
| MetaMQAPclust | 0.95 | 0.91 | 48.4 | 78.2 |
| IntFOLD-QA | 0.95 | 0.90 | 48.4 | 79.9 |
| MULTICOM-REFINE | 0.94 | 0.88 | 46.2 | 66.6 |
| MULTICOM | 0.94 | 0.88 | 48.7 | 84.7 |
| MQAPmulti | 0.94 | 0.91 | 48.2 | 75.4 |
| ModFOLDclustQ | 0.94 | 0.87 | 48.6 | 82.3 |
| MQAPsingle | 0.92 | 0.81 | 45.3 | 45.2 |
| MULTICOM-CONSTRUCT | 0.90 | 0.82 | 46.6 | 63.3 |
| gws | 0.90 | 0.81 | 45.3 | 44.2 |
| Splicer | 0.89 | 0.85 | 47.6 | 75.4 |
| LEE | 0.89 | 0.80 | 45.1 | 42.9 |
| Splicer_QA | 0.88 | 0.84 | 47.8 | 77.4 |
| Modcheck-J2 | 0.87 | 0.77 | 41.7 | 26.2 |
| MUFOLD-WQA | 0.86 | 0.91 | 49.0 | 83.9 |
| SMEG-CCP | 0.83 | 0.76 | 47.9 | 74.9 |
| QMEANdist | 0.80 | 0.84 | 47.8 | 77.1 |
| QMEANfamily | 0.75 | 0.68 | 44.8 | 50.3 |
| GRIER-CONSENSUS | 0.68 | 0.86 | 48.3 | 82.0 |
| Baltymus | 0.58 | 0.53 | 41.8 | 32.8 |
| 1.00/1.00 | 1.00/1.00 | 82.3/52.2 | 182.2/127.9 | |
Benchmark of global model quality on CASP8 and CASP9 data sets. R is overall correlation, R the average correlation per target, ∑GDT1 is the the sum of the first-ranked models for each target and ∑Z is the summed Z-score for the first-ranked models for each target. The first value corresponds to results on CASP8, the second to CASP9, and cells with only one value are CASP9 only.
Statistical significance test for global quality prediction
| ProQ2 | 1 | | -14.48 | -21.73 | -56.80 | -39.07 | -47.35 | n/a | -103.07 |
| QMEAN | 2 | -29.86 | | −1.91 | -43.51 | -18.50 | -30.30 | n/a | -106.70 |
| MetaMQAP | 3 | n/a | n/a | | -38.46 | -11.29 | -23.67 | n/a | -107.76 |
| ProQ | 4 | -61.20 | -39.03 | n/a | | -20.91 | -8.69 | n/a | -114.42 |
| ConQuass | 5 | n/a | n/a | n/a | n/a | | -4.79 | n/a | -110.58 |
| Distill_NNPIF | 6 | n/a | n/a | n/a | n/a | n/a | | n/a | -112.22 |
| MULTICOM-CMFR | 7 | -49.09 | -17.94 | n/a | -14.99 | n/a | n/a | | n/a |
| QMEANclust | 8 | -65.82 | -79.03 | n/a | -91.56 | n/a | n/a | -86.12 |
Pairwise statistical significance test on the correlation coefficients for global quality prediction from CASP8 (below diagonal) and CASP9 (above diagonal). The values correspond to the logarithm of the P-value for methods being statistically indistinguishable, obtained by comparing the distribution of Fisher’s Z (Eq: 1) for the correlation coefficients, R, from Table 5. Values in italics are not distinguishable at the 10−3significance level.
Figure 4Distribution of Z-score for model selection on CASP8 (A) and CASP9 (B).