| Literature DB >> 17553833 |
Abstract
Protein domain prediction is important for protein structure prediction, structure determination, function annotation, mutagenesis analysis and protein engineering. Here we describe an accurate protein domain prediction server (DOMAC) combining both template-based and ab initio methods. The preliminary version of the server was ranked among the top domain prediction servers in the seventh edition of Critical Assessment of Techniques for Protein Structure Prediction (CASP7), 2006. DOMAC server and datasets are available at: http://www.bioinfotool.org/domac.html.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17553833 PMCID: PMC1933197 DOI: 10.1093/nar/gkm390
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The performance of 13 domain prediction servers in CASP7
| Method | Target Num | Domain Num Acc. (%) | CASP7 Score |
|---|---|---|---|
| FOLDpro (DOMAC) | 95 | 93.7 | 0.963 |
| Baker-RosettaDom ( | 94 | 86.2 | 0.940 |
| Ma-OPUS-DOM | 94 | 87.2 | 0.933 |
| ROBETTA-GINZU ( | 94 | 84.0 | 0.932 |
| DomSSEA ( | 94 | 78.7 | 0.910 |
| HHpred3 ( | 95 | 75.8 | 0.910 |
| Meta-DP ( | 95 | 74.7 | 0.907 |
| HHpred1 ( | 93 | 75.3 | 0.902 |
| DomFOLD | 95 | 75.8 | 0.898 |
| DPS( | 93 | 75.3 | 0.889 |
| Chop ( | 83 | 56.6 | 0.827 |
| Distill ( | 95 | 70.5 | 0.819 |
| NN_PUT-Lab | 92 | 58.7 | 0.795 |
The second column (target num) lists the number of targets for which a predictor made predictions.
The specificity and sensitivity of domain number prediction on the Holland's dataset using the template-based and ab initio methods
| Method | Acc. (%) | 1-dom | 2-dom | 3-dom | 4-dom | 5-dom | 6-dom |
|---|---|---|---|---|---|---|---|
| Template | Sens. | 96.1 | 66.7 | 56.0 | 75.0 | 66.7 | – |
| Spec. | 74.2 | 88.0 | 70.0 | 42.9 | 33.3 | – | |
| Sens. | 88.5 | 31.3 | 12.0 | – | – | – | |
| Spec. | 46.5 | 48.8 | 30.0 | – | – | – |
Figure 1.Domain prediction result of CASP7 target T0324. The protein is predicted to have two domains. Domain 1 has two non-continuous segments, spanning from residues 1 to 16 and residues 82 to 208, respectively. Domain 2 spans from residues 17 to 81. The templates used to make the domain prediction are identified by PDB code + chain id. The chain in a single-chain protein is always assigned chain id ‘A’ instead of ‘-’.