| Literature DB >> 21060880 |
Abstract
BACKGROUND: An accurate potential function is essential to attack protein folding and structure prediction problems. The key to developing efficient knowledge-based potential functions is to design reference states that can appropriately counteract generic interactions. The reference states of many knowledge-based distance-dependent atomic potential functions were derived from non-interacting particles such as ideal gas, however, which ignored the inherent sequence connectivity and entropic elasticity of proteins.Entities:
Mesh:
Year: 2010 PMID: 21060880 PMCID: PMC2965178 DOI: 10.1371/journal.pone.0015386
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Performance on native structure recognition.
| Decoy sets | DFIRE | DOPE | RW | RWplus | #Targets |
| 4state_reduced | 6 (−3.44) |
| 6 (−3.45) | 6(−3.54) | 7 |
| Fisa | 3 (−4.67) | 3 (−3.91) | 3 (−4.87) | 3( | 4 |
| fisa_casp3 | 3 (−4.93) | 3 (−5.06) |
|
| 5 |
| Lmds | 7 (−0.99) | 7 (−1.34) | 7 (−1.20) | 7( | 10 |
| lattice_ssfit | 8 (−8.00) | 8 (−7.43) | 8 (−8.15) | 8( | 8 |
| Moulder | 19 (−2.79) | 19 ( | 19 (−2.79) | 19(−3.04) | 20 |
| ROSETTA | 22 (−1.67) | 21 (−1.61) | 20 (−1.62) | 20( | 58 |
| I-TASSER | 47 (−3.58) | 30 (−2.18) | 53 (−4.42) |
| 56 |
| #Total(Z-score) | 115 (−2.94) | 98 (−2.47) | 120 (−3.23) |
| 168 |
The data shows the number of targets which have the native structure ranked as the lowest energy. The values in parenthesis are the average Z-score of the corresponding potentials. The highlights are those having the highest number in each category.
Average RMSD (Å) and TM-score (in parenthesis) of models selected from I-TASSER and ROSETTA decoy sets.
| DFIRE | DOPE | RW | RWplus | ||
| I-TASSER Decoys | First model | 5.61 (0.558) | 5.31 (0.560) | 5.22 (0.569) |
|
| Top-five | 4.45 (0.612) |
| 4.30 ( | 4.29 (0.608) | |
| Top-ten | 3.95 (0.632) | 3.89 (0.631) | 3.89 ( | 3.89 (0.625) | |
| ROSETTA Decoys | First model |
| 7.43 (0.466) | 7.62 (0.460) | 7.48 (0.464) |
| Top-five | 6.08 (0.533) | 6.10 (0.536) | 6.04 ( |
| |
| Top-ten | 5.79 (0.559) | 5.85 (0.555) | 5.78 ( |
| |
The highlights are the highest value in each category.
The Pearson correlation coefficients between energy and Cα RMSD (CC-RMSD) and TM-score (CC-TMscore) for the I-TASSER and ROSETTA decoys.
| CC-RMSD | CC-TMscore | |||||||
| Decoys | DFIRE | DOPE | RW | RWplus | DFIRE | DOPE | RW | RWplus |
| I-TASSER | 0.514 | 0.319 | 0.520 |
| −0.492 | −0.317 | −0.500 |
|
| ROSETTA | 0.440 | 0.421 |
| 0.435 | −0.432 | −0.427 |
| 0.427 |
The highlights are the highest value in each category.
Figure 1Illustrative examples of the correlations between the RW potential and the RMSD to native of the I-TASSER decoys.
The number shows the Pearson correlation coefficients.
Performance of various potentials on selecting models generated in CASP5-8 experiments as collected by Rykunov and Fiser [37].
| Scoring function | models only | native included | ||
| Average | ranked 1 | Average | ranked 1 | |
| QMEAN6 | 2.87 | 85 | 1.71 | 113 |
|
|
|
|
|
|
|
|
|
|
|
|
| QMEANall_atom | 3.59 | 74 | 1.71 | 119 |
| QMEANSSE_agree | 3.74 | 62 | 3.72 | 39 |
| QMEANACC_agree | 4.04 | 40 | 3.78 | 48 |
| RF_CB_SRS_OD | 4.16 | 61 | 2.08 | 110 |
| RF_CB_OD | 4.62 | 62 | 2 | 111 |
| RF_HA_SRS | 4.65 | 49 | 1.38 | 137 |
| RF_CB_SRS | 4.72 | 56 | 2.18 | 114 |
| OPUS_CA | 4.72 | 79 | 5.13 | 55 |
| VSCOREcombined | 4.79 | 53 | 2.2 | 117 |
| QMEAN-pairwise | 4.8 | 54 | 3.15 | 85 |
| Rosetta | 5.01 | 57 | 4.09 | 68 |
| Dong-pair | 5.01 | 58 | 6.32 | 4 |
| RF_CB | 5.06 | 52 | 2.46 | 106 |
| VSCORE-pair | 5.08 | 54 | 1.85 | 128 |
| PROSAcombined | 5.11 | 57 | 3.38 | 87 |
| OPUS_PSP | 5.39 | 54 | 2.99 | 118 |
| RF_HA | 5.44 | 62 | 2.78 | 112 |
| DOPE | 5.77 | 54 | 3.27 | 95 |
| dFIRE | 6.03 | 50 | 5.69 | 33 |
| PROSA-pair | 6.03 | 56 | 3.54 | 95 |
| QMEAN-torsion | 6.71 | 45 | 3.24 | 114 |
| Shortle2006 | 6.85 | 35 | 1.79 | 129 |
| Liang_geometric | 6.88 | 44 | 2.48 | 114 |
| QMEANsolvation | 7.32 | 33 | 6.27 | 54 |
| Shortle2005 | 7.73 | 42 | 3.39 | 109 |
| Floudas-CM | 7.75 | 38 | 7.05 | 42 |
| Floudas-Ca | 7.79 | 33 | 8.36 | 10 |
| NAMD_1000 | 8.06 | 24 | 4.96 | 78 |
| Melo-ANOLEA | 9.62 | 19 | 5.19 | 86 |
| PC2CA | 9.75 | 19 | 5.06 | 85 |
| Melo-NL | 9.99 | 14 | 5.85 | 80 |
| NAMD_1 | 11.91 | 5 | 10.98 | 24 |
| Random | 9.72 | 13.9 | 10.1 | 8.3 |
The average rank of lowest energy decoy according to GDT_TS score (over 143 decoy sets) in the absence of native structures.
The number of sets when the best model was ranked as first, in the absence of native structures.
The average rank of the lowest energy decoy in GDT_TS when native structures are present.
The number of sets when the best model was ranked as first when native structures are present.
Expected random values were generated by picking a wining model fromthe decoy sets randomly. Average values over 1000 random trials are shown [37].
Figure 2Distance dependence of DFIRE and RW potentials for four representative atom pairs.
Figure 3The ratio of reference state at a distance R to that at 15 Å versus R for FIRE, DOPE and RW potentials for a protein of 100 AA.
Figure 4The illustration of random-walk ideal chain model and the relationship with real protein chain.
A protein with N residues can be mapped to a freely-jointed chain with N subunits.
Figure 5The definition of 20 vector pairs.
Figure 6The definition of relative orientation of two vector pairs A and B.
is the direction vector from A to B. is the direction vector from B to A. Ω is the torsion angle between plane A and plane ABB.
The definition of the relative direction bins for a direction vector and the probability of the reference state for each bin.
| θΦ | (0,π/6) | (π/6,π/3) | (π/3,π2/3) | (π2/3,π5/6) | (π5/6,π) |
| (π/8,π3/8) | A | B | C | B | A |
| (π3/8,π5/8) | B | C | B | ||
| (π5/8,π7/8) | B | C | B | ||
| (π7/8,π9/8) | B | C | B | ||
| (π9/8,π11/8) | B | C | B | ||
| (π11/8,π13/8) | B | C | B | ||
| (π13/8,π15/8) | B | C | B | ||
| (π15/8,π3/8) | B | C | B |
A = .
B = .
C = .