| Literature DB >> 19420062 |
Allison N Tegge1, Zheng Wang, Jesse Eickholt, Jianlin Cheng.
Abstract
Protein contact map prediction is useful for protein folding rate prediction, model selection and 3D structure prediction. Here we describe NNcon, a fast and reliable contact map prediction server and software. NNcon was ranked among the most accurate residue contact predictors in the Eighth Critical Assessment of Techniques for Protein Structure Prediction (CASP8), 2008. Both NNcon server and software are available at http://casp.rnet.missouri.edu/nncon.html.Entities:
Mesh:
Year: 2009 PMID: 19420062 PMCID: PMC2703959 DOI: 10.1093/nar/gkp305
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The 2D-RNN architecture for contact prediction. For a protein sequence with length n, the input to a 2D-RNN is an n × n input matrix and the output is an n × n probability matrix residue contacts.
Results of NNcon and SVMcon on 116 CASP8 targets
| Method | Acc6 | Cov6 | Acc12 | Cov12 | Acc24 | Cov24 |
|---|---|---|---|---|---|---|
| NNcon ( | 0.58 | 0.07 | 0.51 | 0.06 | 0.31 | 0.05 |
| SVMcon ( | 0.5 | 0.06 | 0.42 | 0.06 | 0.27 | 0.05 |
Acc6, Acc12, Acc24 denote prediction accuracy (specificity) at sequence separation ≥6, 12, 24 residues, respectively. Cov6, Cov12, Cov24 denote prediction coverage (sensitivity) at sequence separation ≥6, 12, 24 residues, respectively.
Multiple contact map predictors evaluated on 11 CASP8 ab initio domains
| Method | Acc6 | Cov6 | Acc12 | Cov12 | Acc24 | Cov24 |
|---|---|---|---|---|---|---|
| NNcon | 0.68 | 0.11 | 0.51 | 0.09 | 0.18 | 0.05 |
| SVMcon | 0.68 | 0.09 | 0.39 | 0.09 | 0.18 | 0.05 |
| SAM08_2stage | 0.28 | 0.05 | 0.26 | 0.06 | 0.17 | 0.05 |
| SAM06 | 0.26 | 0.04 | 0.24 | 0.05 | 0.16 | 0.06 |
| Fang | 0.44 | 0.07 | 0.31 | 0.06 | 0.16 | 0.05 |
| MUprot | 0.59 | 0.09 | 0.37 | 0.08 | 0.15 | 0.05 |
| Distill | 0.32 | 0.05 | 0.16 | 0.03 | 0.14 | 0.05 |
| 3Dpro | 0.05 | 0.01 | 0.33 | 0.07 | 0.14 | 0.05 |
| SAM08_server | 0.24 | 0.04 | 0.21 | 0.05 | 0.13 | 0.05 |
| SVMSEQ | 0.56 | 0.09 | 0.34 | 0.07 | 0.13 | 0.05 |
| Hamilton | 0.08 | 0.01 | 0.12 | 0.02 | 0.12 | 0.02 |
| Spine | 0.09 | 0.01 | 0.09 | 0.02 | 0.07 | 0.02 |
| Lee | 0.1 | 0.01 | 0.09 | 0.02 | 0.07 | 0.02 |
| Pairings | 0.36 | 0.05 | 0.35 | 0.06 | 0.05 | 0.01 |
For each domain, select top L/5 predicted contacts ranked by contact probabilities. Acc6, Acc12, Acc24 denote prediction accuracy (specificity) at sequence separation ≥6, 12, 24 residues, respectively. Cov6, Cov12, Cov24 denote prediction coverage (sensitivity) at sequence separation ≥6, 12, 24 residues, respectively.
Figure 2.(a) The 3D structure of CASP8 target T0507. The protein has five strands (S1–S5) that forms a parallel beta sheet. (b) The true contact map (upper triangle, blue) and predicted contact map (lower triangle, red). Each dot denotes a contact. It shows that some key contacts in four strand pairs (S1–S2, S2–S3, S1–S4, S4–S5) are correctly predicted. (c) Selective visualization of four residue–residue contacts correctly predicted (2–30, 3–31, 4–90, 7–34).