| Literature DB >> 17488521 |
Sitao Wu1, Jeffrey Skolnick, Yang Zhang.
Abstract
BACKGROUND: Predicting 3-dimensional protein structures from amino-acid sequences is an important unsolved problem in computational structural biology. The problem becomes relatively easier if close homologous proteins have been solved, as high-resolution models can be built by aligning target sequences to the solved homologous structures. However, for sequences without similar folds in the Protein Data Bank (PDB) library, the models have to be predicted from scratch. Progress in the ab initio structure modeling is slow. The aim of this study was to extend the TASSER (threading/assembly/refinement) method for the ab initio modeling and examine systemically its ability to fold small single-domain proteins.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17488521 PMCID: PMC1878469 DOI: 10.1186/1741-7007-5-17
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Summary of I-TASSER modeling on benchmark I in comparison with atomic ROSETTA [13]
| Cα-RMSD (Å) of ROSETTA models | Cα-RMSD (Å) (TM-score) of I-TASSER models | ||||||
| Protein name | Length (residues) | Secondary structure | Round 1 | Round 2 | Best in top five clusters | First cluster | Best in top five clusters |
| 1b72A | 49 | α | 0.8 | 1.1 | 1.0 | 3.3 (0.64) | 3.1 (0.64) |
| 1shfA | 59 | β | 11.1 | 10.8 | 10.9 | 1.7 (0.75) | 1.7 (0.75) |
| 1tif_ | 59 | αβ | 5.3 | 4.1 | 3.8 | 7.0 (0.33) | 7.0 (0.36) |
| 2reb_2 | 60 | αβ | 1.2 | 2.1 | 1.3 | 5.6 (0.37) | 4.7 (0.57) |
| 1r69_ | 61 | α | 2.1 | 1.2 | 1.7 | 1.9 (0.75) | 1.9 (0.75) |
| 1csp_ | 67 | β | 5.1 | 4.7 | 5.1 | 2.1 (0.76) | 2.1 (0.76) |
| 1di2A_ | 69 | αβ | 2.6 | 2.6 | 1.9 | 2.3 (0.78) | 2.3 (0.78) |
| 1n0uA4 | 69 | αβ | 9.9 | 10.2 | 2.7 | 4.4 (0.48) | 4.4 (0.48) |
| 1mla_2 | 70 | αβ | 8.4 | 8.7 | 7.2 | 2.8 (0.66) | 2.7 (0.66) |
| 1af7__ | 72 | α | 10.1 | 10.4 | 1.7 | 4.2 (0.49) | 4.2 (0.49) |
| 1ogwA_ | 72 | αβ | 2.7 | 1.0 | 2.6 | 1.1 (0.88) | 1.1 (0.88) |
| 1dcjA_ | 73 | αβ | 3.2 | 2.5 | 2.0 | 10.5 (0.39) | 10.0 (0.39) |
| 1dtjA_ | 74 | αβ | 1.0 | 1.2 | 1.8 | 1.9 (0.80) | 1.7 (0.82) |
| 1o2fB_ | 77 | αβ | 10.1 | N/A | 10.3 | 7.1 (0.41) | 5.2 (0.43) |
| 1mkyA3 | 81 | αβ | 3.2 | 6.3 | 3.7 | 5.2 (0.40) | 4.5 (0.50) |
| 1tig_ | 88 | αβ | 4.1 | 3.5 | 2.4 | 7.7 (0.50) | 4.4 (0.54) |
| Average | 69 | 5.1 | 4.7 | 3.8 | 4.3 (0.59) | 3.8 (0.61) | |
Figure 2Examples of I-TASSER models from three independent benchmark sets. The green color is for I-TASSER models and blue for the native structures. (A–C) are from benchmark I (Bradley et al [13]); (D–F) are from benchmark II (Zhang et al [12]); and (G–I) are from benchmark III, selected directly from the PDB library. Column 1 contains the high-resolution models with a Cα-RMSD ≤ 1.5Å; column 2 contains the medium-resolution models with a Cα-RMSD of 1.5–5Å; column 3 contains the low-resolution models with a Cα-RMSD > 5Å. The Cα-RMSD value for the examples are: (A) 1ogwA_ (1.1Å), (B) 1di2A_ (2.3Å), (C) 1dcjA_(10.0Å), (D) 1cy5A (1.5Å), (E) 1pgx (3.1Å), (F) 1gnuA (8.2Å), (G) 1cqkA (1.5Å), (H) 1gyvA (3.3Å), (I) 1no5A(10.5Å). The pictures were generated using PyMOL software [45].
Figure 3Comparison of I-TASSER models with the PPA threading alignment results. (A) Cα-RMSD to native of the I-TASSER models versus Cα-RMSD to native of the best threading alignment over the same aligned regions. (B) TM-score of the I-TASSER models versus TM-score of the best threading alignments.
Summary of I-TASSER modeling on benchmark II in comparison with TOUCHSTONE-II [12]
| Cα-RMSD (Å) of TOUCHSTONE-II models | Cα-RMSD (Å) (TM-score) of I-TASSER models | ||||
| Protein name | Length (residues) | Secondary structure | Best in top five clusters | First cluster | Best in top five clusters |
| 1gpt_ | 47 | αβ | 4.0 | 5.2 (0.54) | 3.8 (0.56) |
| 1tfi_ | 47 | β | 6.2 | 4.6 (0.56) | 4.0 (0.57) |
| 1bq9A | 53 | β | 4.8 | 7.3 (0.41) | 5.0 (0.46) |
| 1pgx_ | 59 | αβ | 6.0 | 3.1 (0.55) | 3.1 (0.55) |
| 1ah9_ | 63 | β | 5.1 | 4.3 (0.56) | 2.8 (0.67) |
| 1aoy_ | 65 | α | 4.7 | 4.5 (0.70) | 2.7 (0.70) |
| 1sro_ | 71 | β | 4.3 | 3.4 (0.66) | 3.0 (0.68) |
| 1kjs_ | 74 | α | 8.2 | 8.5 (0.38) | 5.7 (0.50) |
| 1vcc_ | 76 | αβ | 7.3 | 5.7 (0.44) | 5.7 (0.44) |
| 1npsA | 88 | αβ | 3.4 | 2.1 (0.79) | 2.1 (0.79) |
| 1hbkA | 89 | α | 8.5 | 3.5 (0.69) | 3.5 (0.69) |
| 1cy5A | 92 | α | 1.8 | 1.5 (0.89) | 1.5 (0.89) |
| 1bm8_ | 99 | αβ | 9.0 | 6.3 (0.42) | 6.3 (0.42) |
| 2pcy_ | 99 | β | 4.3 | 4.6 (0.66) | 4.6 (0.66) |
| 256bA | 106 | α | 3.2 | 3.4 (0.77) | 3.4 (0.77) |
| 1cewI | 108 | αβ | 6.3 | 3.6 (0.73) | 3.6 (0.73) |
| 1thx_ | 108 | αβ | 2.3 | 2.1 (0.83) | 2.1 (0.83) |
| 1sfp_ | 111 | β | 6.0 | 5.1 (0.75) | 5.1 (0.75) |
| 1gnuA | 117 | αβ | 9.3 | 8.2 (0.58) | 6.5 (0.60) |
| 2a0b_ | 118 | α | 12.8 | 2.5 (0.81) | 2.5 (0.81) |
| Average | 85 | 5.9 | 4.5 (0.64) | 3.9 (0.65) | |
Summary of I-TASSER modeling on the Benchmark III
| Cα-RMSD (Å) (TM-score) of I-TASSER models | ||||
| Protein name | Length (residues) | Secondary structure | First cluster | Best in top five clusters |
| 1ne3A | 56 | β | 4.6 (0.45) | 4.6 (0.48) |
| 2cr7A | 60 | α | 4.5 (0.48) | 2.6 (0.66) |
| 2f3nA | 65 | α | 1.8 (0.74) | 1.8 (0.74) |
| 1itpA | 68 | αβ | 10.9 (0.33) | 4.5 (0.40) |
| 1kviA | 68 | αβ | 2.0 (0.72) | 2.0 (0.72) |
| 1b4bA | 71 | αβ | 6.4 (0.48) | 5.6 (0.54) |
| 1gjxA | 77 | β | 6.9 (0.44) | 5.6 (0.47) |
| 1of9A | 77 | α | 3.6 (0.53) | 3.6 (0.53) |
| 1mn8A | 84 | α | 7.0 (0.35) | 7.0 (0.35) |
| 1fo5A | 85 | αβ | 3.8 (0.54) | 3.8 (0.54) |
| 1ten_ | 87 | β | 1.6 (0.85) | 1.6 (0.85) |
| 1fadA | 92 | α | 3.6 (0.59) | 3.6 (0.59) |
| 1no5A | 93 | αβ | 10.6 (0.43) | 10.5 (0.45) |
| 1g1cA | 98 | β | 2.5 (0.79) | 2.5 (0.79) |
| 1cqkA | 101 | β | 1.5 (0.88) | 1.5 (0.88) |
| 1abv_ | 103 | α | 13.0 (0.28) | 6.8 (0.40) |
| 1jnuA | 104 | αβ | 2.7 (0.75) | 2.7 (0.75) |
| 1egxA | 115 | αβ | 2.3 (0.80) | 2.3 (0.83) |
| 1gyvA | 117 | β | 3.3 (0.78) | 3.3 (0.78) |
| 1orgA | 118 | α | 2.4(0.78) | 2.4(0.78) |
| Average | 87 | 4.8 (0.60) | 3.9 (0.63) | |
Figure 1Flowchart of I-TASSER method for protein structure prediction.