| Literature DB >> 17478507 |
Abstract
We developed LOMETS, a local threading meta-server, for quick and automated predictions of protein tertiary structures and spatial constraints. Nine state-of-the-art threading programs are installed and run in a local computer cluster, which ensure the quick generation of initial threading alignments compared with traditional remote-server-based meta-servers. Consensus models are generated from the top predictions of the component-threading servers, which are at least 7% more accurate than the best individual servers based on TM-score at a t-test significance level of 0.1%. Moreover, side-chain and C-alpha (C(alpha)) contacts of 42 and 61% accuracy respectively, as well as long- and short-range distant maps, are automatically constructed from the threading alignments. These data can be easily used as constraints to guide the ab initio procedures such as TASSER for further protein tertiary structure modeling. The LOMETS server is freely available to the academic community at http://zhang.bioinformatics.ku.edu/LOMETS.Entities:
Mesh:
Year: 2007 PMID: 17478507 PMCID: PMC1904280 DOI: 10.1093/nar/gkm251
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.TM-score of threading alignments of nine component servers on 620 non-homologous proteins versus the Z-score, where Z-score is defined as the deviation of the inherent raw score from mean divided by the SD. The vertical line in each box indicates a Z-score cutoff to distinguish ‘bad’ and ‘good’ predictions.
Summary of component-threading programs and the meta-server selections
| Threading servers or meta-servers | TM-score of threading alignments (MODELLER models) | RMSD (Å) of aligned residues (MODELLER models) | Coverage | |||
|---|---|---|---|---|---|---|
| First model | Best in top five models | First model | Best in top five models | First model | Best in top five models | |
| PPA-I | 0.4001 (0.4117) | 0.4389 (0.4531) | 10.11 (16.66) | 9.13 (14.02) | 0.831 | 0.846 |
| SP3 | 0.3991 (0.4138) | 0.4391 (0.4551) | 10.50 (13.86) | 9.62 (12.83) | 0.858 | 0.867 |
| PPA-II | 0.3900 (0.4076) | 0.4306 (0.4512) | 10.72 (14.89) | 9.40 (13.02) | 0.837 | 0.847 |
| SPARKS2 | 0.3855 (0.3973) | 0.4283 (0.4441) | 11.62 (13.60) | 10.03 (12.23) | 0.895 | 0.893 |
| PROSPECT2 | 0.3793 (0.3914) | 0.4245 (0.4384) | 12.19 (13.01) | 10.68 (12.02) | 0.903 | 0.903 |
| FUGUE | 0.3580 (0.3721) | 0.4038 (0.4173) | 10.78 (19.26) | 10.30 (15.82) | 0.827 | 0.872 |
| HHSEARCH | 0.3635 (0.3827) | 0.4016 (0.4224) | 6.92 (22.38) | 6.44 (19.04) | 0.607 | 0.643 |
| PAINT | 0.3558 (0.3758) | 0.4045 (0.4210) | 10.35 (15.74) | 9.86 (14.21) | 0.735 | 0.786 |
| SAM-T0 2 | 0.3402 (0.3575) | 0.3798 (0.3971) | 10.19 (21.75) | 9.83 (17.53) | 0.721 | 0.777 |
| LOMETS | 0.4287 (0.4434) | 0.4481 (0.4669) | 10.18 (10.99) | 9.49 (10.61) | 0.890 | 0.882 |
| PCONS5 | 0.4117 (0.4272) | 0.4434 (0.4628) | 10.03 (15.39) | 9.14 (13.67) | 0.840 | 0.852 |
aCoverage = length of aligned residues/length of target sequence.
Figure 2.(a) Average accuracy of predicted Cα and side-chain contacts versus the relative occurrence frequency of the contacts in the LOMETS threading templates. (b) Coverage of the predicted contacts versus the relative occurrence frequency. For each frequency value (f), the data is calculated as an average within the bin of [f − 0.05, f + 0.05].
Summary of constraint predictions by LOMETS and the threading programs (with a relative occurrence frequency ⩾0.18 for contact predictions)
| Threading servers or meta-servers | ||||||||
|---|---|---|---|---|---|---|---|---|
| PPA-I | 0.249 | 1.655 | 0.431 | 0.696 | 1.178 | 600.5 | 3.732 | 1159.5 |
| SP3 | 0.239 | 1.713 | 0.405 | 0.712 | 1.220 | 612.3 | 3.817 | 1196.6 |
| PPA-II | 0.253 | 1.527 | 0.410 | 0.661 | 1.216 | 591.3 | 3.894 | 1173.0 |
| SPARKS2 | 0.223 | 1.654 | 0.375 | 0.659 | 1.356 | 629.4 | 3.804 | 1203.8 |
| PROSPECT2 | 0.236 | 1.599 | 0.411 | 0.653 | 1.219 | 631.7 | 3.591 | 1198.0 |
| FUGUE | 0.221 | 1.185 | 0.379 | 0.438 | 1.586 | 625.3 | 3.649 | 1175.7 |
| HHSEARCH | 0.359 | 0.842 | 0.528 | 0.404 | 1.024 | 357.1 | 4.111 | 743.7 |
| PAINT | 0.248 | 1.174 | 0.372 | 0.529 | 1.267 | 527.3 | 4.138 | 1103.2 |
| SAM-T02 | 0.227 | 1.164 | 0.350 | 0.534 | 1.597 | 520.0 | 3.923 | 1019.4 |
| LOMETS | 0.421 | 0.910 | 0.607 | 0.405 | 1.186 | 632.7 | 3.455 | 1193.0 |
aACCsc: Average accuracy for side-chain center of mass contact predictions.
bCovsc: Average coverage for side-chain center of mass contact predictions.
cAccCα: Average accuracy for Cα atom contact predictions.
dCovCα: Average coverage for Cα atom contact predictions.
eDifshort: Average difference (Å) between native and predicted short-range Cα-distances.
fNoshort: Average number of predicted short-range Cα-distances.
gDiflong: Average difference (Å) between native and the best predicted long-range Cα-distances.
hNolong: Average number of the best predicted long-range Cα-distances.
Figure 3.The average result of spatial constraint predictions for ‘Easy’, ‘Medium’ and ‘Hard’ targets on 620 non-homologous proteins. (a) Accuracy of Cα and side-chain center contact predictions. (b) Coverage of Cα and side-chain center contact predictions. (c) Prediction error of short-range and the best long-range distance map.