| Literature DB >> 19419562 |
Xin Gao1, Dongbo Bu, Jinbo Xu, Ming Li.
Abstract
BACKGROUND: Protein inter-residue contacts play a crucial role in the determination and prediction of protein structures. Previous studies on contact prediction indicate that although template-based consensus methods outperform sequence-based methods on targets with typical templates, such consensus methods perform poorly on new fold targets. However, we find out that even for new fold targets, the models generated by threading programs can contain many true contacts. The challenge is how to identify them.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19419562 PMCID: PMC2689239 DOI: 10.1186/1472-6807-9-28
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Average and deviation of contact accuracy and coverage of the six individual servers on the 86 CASP7 targets
| FOLDpro | 45 | 8.2 | 48 | 9.3 |
| mGenThreader | 43 | 6.6 | 45 | 8.5 |
| RAPTOR | 48 | 6.6 | 52 | 7.0 |
| FUGUE3 | 46 | 7.9 | 37 | 5.5 |
| SAM-T02 | 53 | 6.5 | 37 | 5.5 |
| SPARK3 | 48 | 7.3 | 51 | 7.6 |
| Overall | 12 | 7.2 | 80 | 2.3 |
All values are percentiles.
Pairwise correlation of the six individual servers
| Server | FOLDpro | mGenThreader | RAPTOR | FUGUE3 | SAM-T02 | SPARK3 |
| FOLDpro | 1 | 0.34 | 0.43 | 0.25 | 0.30 | 0.41 |
| mGenThreader | 0.35 | 1 | 0.42 | 0.26 | 0.30 | 0.41 |
| RAPTOR | 0.43 | 0.41 | 1 | 0.30 | 0.35 | 0.51 |
| FUGUE3 | 0.35 | 0.35 | 0.40 | 1 | 0.37 | 0.40 |
| SAM-T02 | 0.50 | 0.50 | 0.59 | 0.47 | 1 | 0.59 |
| SPARK3 | 0.40 | 0.41 | 0.50 | 0.29 | 0.34 | 1 |
Relationship among the six individual servers and the independent latent servers
| Server | H1 | H2 | H3 | H4 | H5 | H6 |
| FOLDpro | 0.37 | -0.35 | 0.66 | 0.01 | -0.08 | -0.55 |
| mGenThreader | 0.37 | -0.26 | -0.75 | -0.01 | -0.02 | -0.48 |
| RAPTOR | 0.42 | -0.23 | 0.04 | 0.27 | 0.76 | 0.36 |
| FUGUE3 | 0.37 | 0.82 | 0.04 | 0.37 | 0.01 | -0.22 |
| SAM-T02 | 0.49 | 0.20 | 0.03 | -0.81 | -0.04 | 0.23 |
| SPARK3 | 0.41 | -0.21 | -0.02 | 0.36 | -0.65 | 0.49 |
Linear combination representation of new server S* on the six individual servers and correlated mutation
| S | FOLDpro | mGenThreader | RAPTOR | FUGUE3 | SAM-T02 | SPARK3 | CM |
| 0.29 | -0.28 | 1.27 | 1.47 | 0.23 | 0.62 | 0.30 | |
| 0.301 | -0.27 | 1.35 | 1.35 | 0.22 | 0.58 | 0.37 | |
| 0.38 | -0.29 | 1.37 | 1.36 | 0.14 | 0.65 | 0.28 | |
| 0.29 | -0.44 | 1.29 | 1.39 | 0.12 | 0.56 | 0.23 |
CM: correlated mutation.
Figure 1ROC curves for our method and the six individual servers. Performance comparison using ROC plots for S* (thick solid line), FOLDpro (thick dotted line), mGenThreader (thin dashdot line), RAPTOR (thin dotted line), FUGUE3 (thick dashed line), SAM-T02 (thin solid line), and SPARK3 (thin dashed line).
Average accuracy of the top contacts predicted by S* on different test sets, and the accuracy of the majority voting method
| # Contacts | ||||||
| 69 | 60 | 57 | 65 | |||
| 75 | 67 | 63 | 72 | |||
| 80 | 73 | 67 | 74 | |||
| 79 | 74 | 69 | 76 |
The first column shows the number of top contacts being considered. The second to fifth columns show the accuracy of our method on the four test sets respectively. The sixth column shows the overall accuracy of our method on all the four test sets. The last column shows the overall accuracy of the majority voting method on all the four test sets. All values are percentiles.
Figure 2Prediction accuracy for the top .
Accuracy and deviation of top L/5 contacts of the six individual servers, the majority voting method, and our method on easy, medium, and hard target sets
| Server Name | Easy Targets | Medium Targets | Hard Targets | |||
| FOLDpro | 1.1 | 44 | 5.3 | 10 | 5.8 | |
| mGenThreader | 68 | 3.8 | 43 | 4.4 | 11 | 7.4 |
| RAPTOR | 75 | 1.3 | 50 | 3.9 | 13 | 7.1 |
| FUGUE3 | 75 | 0.7 | 47 | 6.2 | 12 | 9.1 |
| SAM-T02 | 75 | 1.3 | 5.2 | 14.7 | ||
| SPARK3 | 76 | 1.4 | 48 | 4.7 | 11 | 7.4 |
| Majority Voting | 0.7 | 8.1 | 6.9 | |||
| 0.4 | 8.5 | 28.2 | ||||
Accu.: Accuracy. Dev.: Deviation. All values are percentiles.
Performance of our method on different separation ranges of target protein classes with different difficulty levels
| Target Classes | Short-range | Medium-range | Long-range | All-range |
| Easy Targets | 91 | 90 | 93 | 94 |
| Medium Targets | 73 | 74 | 70 | 76 |
| Hard Targets | 41 | 35 | 26 | 37 |
| All Targets | 72 | 71 | 68 | 73 |
Top L/5 contacts are considered. All values are percentiles.
Accuracy of top L/5 contacts of the six individual servers, the majority voting method, and our method on the 15 new fold targets of CASP7
| FDP | MGTH | RAP | FUG | SAM | SP3 | MV | ||
| 8 | 6 | 9 | 5 | 12 | 6 | 9 | 33 | |
| 2 | 25 | 7 | 3 | 29 | 7 | 8 | 17 | |
| 16 | 4 | 6 | 10 | 3 | 6 | 8 | 18 | |
| 3 | 6 | 10 | 15 | 18 | 10 | 7 | 12 | |
| 22 | 3 | 6 | 6 | 32 | 5 | 8 | 0 | |
| 12 | 6 | 7 | 8 | 3 | 6 | 8 | 5 | |
| 15 | 18 | 14 | 24 | 31 | 16 | 21 | 88 | |
| 9 | 9 | 15 | 29 | 0 | 8 | 11 | 40 | |
| 13 | 3 | 14 | 5 | 46 | 28 | 26 | 48 | |
| 11 | 8 | 27 | 9 | 35 | 26 | 21 | 80 | |
| 17 | 23 | 24 | 29 | 26 | 18 | 12 | 22 | |
| 4 | 18 | 6 | 3 | 0 | 5 | 8 | 36 | |
| 6 | 10 | 12 | 9 | 12 | 10 | 11 | 79 | |
| 4 | 4 | 19 | 4 | 9 | 10 | 6 | 21 | |
| 12 | 15 | 23 | 18 | 5 | 11 | 25 | 56 | |
| 10 | 11 | 13 | 12 | 17 | 11 | 13 | 37 | |
FDP: FOLDpro, MGTH: mGenThreader, RAP: RAPTOR, FUG: FUGUE3, SAM: SAM-T02, SP3:
SPARK3, MV: majority voting, S*: our method. All values are percentiles.
TM-score, contact accuracy, and contact coverage of the best models by the six individual servers for T0319 and T0350
| FOLDpro | mGenThreader | RAPTOR | FUGUE3 | SAM-T02 | SPARK3 | ||
| T0319 | 0.20 | 0.18 | 0.27 | 0.26 | 0.12 | 0.22 | |
| 9% | 9% | 15% | 29% | 0 | 8% | ||
| 7% | 6% | 14% | 13% | 0 | 6% | ||
| T0350 | 0.24 | 0.23 | 0.33 | 0.26 | 0.26 | 0.27 | |
| 11% | 8% | 27% | 9% | 35% | 26% | ||
| 15% | 9% | 29% | 3% | 12% | 28% | ||
TM-score of the best model among the five submitted models for each server.
Contact accuracy of the best model for each server. All contacts contained in this model are evaluated.
Contact coverage of the best model for each server. All contacts contained in this model are evaluated.
Figure 3Performance comparison on T0319 and T0350. Accuracy of top L/5 contacts of the six threading servers, the majority voting method, and our method on T0319 and T0350. FDP: FOLDpro, MGTH: mGenThreader, RAP: RAPTOR, FUG: FUGUE3, SAM: SAM-T02, SP3: SPARK3, MV: majority voting, S*: our method.
The ranking of the best model (in terms of TM-score) for each individual server by its default ranking method and by our contact score for T0319 and T0350
| FOLDpro | mGenThreader | RAPTOR | FUGUE3 | SAM-T02 | SPARK3 | ||
| T0319 | Default ranking | 4 | 3 | 4 | 4 | 4 | 3 |
| Contact ranking | 2 | 1 | 1 | 1 | 3 | 2 | |
| T0350 | Default ranking | 1 | 3 | 4 | 4 | 1 | 4 |
| Contact ranking | 2 | 1 | 1 | 1 | 3 | 1 | |