| Literature DB >> 18366803 |
Katia S Guimarães1, Teresa M Przytycka.
Abstract
BACKGROUND: The identification and characterization of interacting domain pairs is an important step towards understanding protein interactions. In the last few years, several methods to predict domain interactions have been proposed. Understanding the power and the limitations of these methods is key to the development of improved approaches and better understanding of the nature of these interactions.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18366803 PMCID: PMC2358894 DOI: 10.1186/1471-2105-9-171
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Analysis of co-occurring domain pairs in the dataset and predictions
| 29,364 | 1,343 | 4.6% | ||
| 26,113 | 1,146 | 4.4% | ||
| 783 | 486 | 62.1% | ||
| 691 | 427 | 61.8% | ||
| 1,852 | 284 | 15.3% | ||
| 1,399 | 321 | 22.9% | ||
| 230 | 182 | 79.1% | ||
| 239 | 173 | 72.4% | ||
Analysis of homodimers in the dataset, in GPEbenchmark set, and also among pairs predicted by GPE.
| 26,113 | 656 | 2.5% | |
| 1,399 | 203 | 14.5% | |
| 691 | 383 | 55.4% | |
| 239 | 145 | 60.7% | |
The seventy top-scoring domain pairs with pw-score at most 0.01.
| 7tm_1 | IL8 | 1 | 0 | 0 | 80 |
| AAA | AAA | 1 | 0 | 1 | 36 |
| AAA | PCI | 1 | 0 | 0 | 18 |
| Ank | RHD | 1 | 0 | 1 | 0 |
| Cpn60_TCP1 | WD40 | 1 | 0 | 0 | 50 |
| Cyclin_N | Pkinase | 1 | 0 | 1 | 24 |
| LSM | LSM | 1 | 0 | 1 | 76 |
| Pkinase | zf-C2H2 | 1 | 0 | 0 | 22 |
| Pkinase | Pkinase | 1 | 0 | 1 | 60 |
| Pkinase | WD40 | 1 | 0 | 0 | 30 |
| Proteasome | Proteasome | 1 | 0 | 1 | 36 |
| RNase_PH | RNase_PH | 1 | 0 | 1 | 0 |
| RNase_PH | RNase_PH_C | 1 | 0 | 1 | 0 |
| RRM_1 | RRM_1 | 1 | 0 | 1 | 70 |
| RRM_1 | WD40 | 1 | 0 | 0 | 21 |
| RRM_1 | zf-C2H2 | 1 | 0 | 0 | 23 |
| WD40 | WD40 | 1 | 0 | 1 | 38 |
| zf-C2H2 | zf-C2H2 | 1 | 0 | 1 | 106 |
| UQ_con | zf-C3HC4 | 1 | 0.00001 | 1 | 17 |
| Pkinase | RRM_1 | 1 | 0.00002 | 0 | 16 |
| TNF | TNFR_c6 | 1 | 0.00002 | 1 | 16 |
| GTP_CDC | GTP_CDC | 1 | 0.00003 | 0 | 15 |
| Homeobox | Pkinase | 1 | 0.00003 | 0 | 15 |
| GDI | Ras | 1 | 0.00006 | 1 | 14 |
| PCI | PCI | 1 | 0.00006 | 0 | 14 |
| TPR | zf-C2H2 | 1 | 0.00006 | 0 | 14 |
| LSM | WD40 | 1 | 0.00012 | 0 | 13 |
| Metallophos | Proteasome | 1 | 0.00012 | 0 | 13 |
| AAA | WD40 | 1 | 0.00024 | 0 | 12 |
| HLH | HLH | 1 | 0.00024 | 1 | 12 |
| Ras | Yip1 | 1 | 0.00024 | 0 | 12 |
| WD40 | zf-C2H2 | 1 | 0.00024 | 0 | 12 |
| Hrf1 | Ras | 1 | 0.00049 | 0 | 11 |
| Metallophos | efhand | 1 | 0.00049 | 1 | 11 |
| TPR | WD40 | 1 | 0.00049 | 0 | 11 |
| Brix | WD40 | 1 | 0.00098 | 0 | 10 |
| Homeobox | Homeobox | 1 | 0.00098 | 1 | 10 |
| Homeobox | zf-C2H2 | 1 | 0.00098 | 0 | 10 |
| HSP70 | Pkinase | 1 | 0.00098 | 0 | 10 |
| KH_1 | WD40 | 1 | 0.00098 | 0 | 10 |
| LSM | RRM_1 | 1 | 0.00098 | 0 | 10 |
| LSM | Nop | 1 | 0.00098 | 0 | 10 |
| Metallophos | Pkinase | 1 | 0.00098 | 0 | 10 |
| PCI | Proteasome | 1 | 0.00098 | 0 | 10 |
| AAA | TPR | 1 | 0.00195 | 0 | 9 |
| Ank | Pkinase | 1 | 0.00195 | 1 | 9 |
| Chitin_bind_4 | Chitin_bind_4 | 1 | 0.00195 | 0 | 9 |
| E2F_TDP | SUPRDOM8 | 1 | 0.00195 | 0 | 9 |
| HSP70 | WD40 | 1 | 0.00195 | 0 | 9 |
| KH_1 | RRM_1 | 1 | 0.00195 | 0 | 9 |
| LSM | zf-C2H2 | 1 | 0.00195 | 0 | 9 |
| PH | Pkinase | 1 | 0.00195 | 0 | 9 |
| Pkinase | Rad50_zn_hook | 1 | 0.00195 | 0 | 9 |
| Pkinase | efhand | 1 | 0.00195 | 0 | 9 |
| Pkinase | TPR | 1 | 0.00195 | 0 | 9 |
| Histone | WD40 | 1 | 0.00391 | 0 | 8 |
| Pkinase | Ras | 1 | 0.00391 | 0 | 8 |
| zf-C3HC4 | zf-C3HC4 | 1 | 0.00391 | 0 | 8 |
| HATPase_c | HATPase_c | 1 | 0.005 | 1 | 1 |
| SNARE | SNARE | 1 | 0.005 | 1 | 1 |
| efhand | efhand | 1 | 0.00781 | 1 | 7 |
| Homeobox | WD40 | 1 | 0.00781 | 0 | 7 |
| PP2C | Pkinase | 1 | 0.00781 | 0 | 7 |
| RRM_1 | TPR | 1 | 0.00781 | 0 | 7 |
| Zn_clus | Zn_clus | 1 | 0.00781 | 1 | 7 |
| HSP90 | Pkinase | 0.999 | 0 | 0 | 0 |
| RHD | RHD | 0.999 | 0 | 1 | 0 |
| RHD | TIG | 0.999 | 0 | 1 | 0 |
| TIG | TIG | 0.999 | 0 | 1 | 0 |
| KE2 | Prefoldin | 0.999 | 0.00098 | 1 | 10 |
Figure 1Benchmark pairs among top-scoring predictions.
Figure 2ROC Curves of GPE and PE.
Figure 3Comparison of the Positive Predictive Values for several methods relative to the corresponding random performance. The methods are grouped according to the domain definition. Note that performance of Random varies between the groups. GPE* denotes results obtained by projecting supra-domain from the GPE method is back into Pfam domains where the "children" domains inherit the scores from the supra-domain. A more formal comparison method from different groups and relies on counting how often each of them over/under-performed the corresponding random selection and is described in the text. The performance of GPE and PE was identical while their desistance to the next closed method was statistically significant.