| Literature DB >> 22595003 |
Zhi-Ping Liu1, Jiguang Wang, Yu-Qing Qiu, Ross K K Leung, Xiang-Sun Zhang, Stephen K W Tsui, Luonan Chen.
Abstract
BACKGROUND: Mycobacterium tuberculosis is an infectious bacterium posing serious threats to human health. Due to the difficulty in performing molecular biology experiments to detect protein interactions, reconstruction of a protein interaction map of M. tuberculosis by computational methods will provide crucial information to understand the biological processes in the pathogenic microorganism, as well as provide the framework upon which new therapeutic approaches can be developed.Entities:
Mesh:
Year: 2012 PMID: 22595003 PMCID: PMC3348046 DOI: 10.1186/1471-2105-13-S7-S6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1ROC curves of the five-fold cross validation predictions in E. coli.
Prediction performances of the codon-based SVM predictor in E. coli
| Feature | ACC | SN | SP | PRE | AUC |
|---|---|---|---|---|---|
| Codon | 0.9003 | 0.7576 | 0.9486 | 0.8327 | 0.9507 |
| Codon-mer | 0.9595 | 0.8986 | 0.9801 | 0.9386 | 0.9835 |
Details of predicted protein interactions in M. tuberculosis
| Species | Database | Original PPI | Predicted PPI | Percentage (%) |
|---|---|---|---|---|
| By machine learning | ||||
| E. coli | ECID | 14,058(positive)+ | 12,899 | 27.97 |
| By interolog | ||||
| Escherichia coli | IntAct | 14,158 | 16,468 | 35.71 |
| Campylobacter jejuni | IntAct | 11,870 | 7,674 | 16.64 |
| Treponema pallidum | IntAct | 3,744 | 324 | 0.70 |
| Synechocystis | IntAct | 2,625 | 2,481 | 5.38 |
| Myxococcus xanthus | IntAct | 384 | 253 | 0.55 |
| Synechocystis sp. | IntAct | 219 | 220 | 0.48 |
| Rickettsia sibirica | IntAct | 282 | 24 | 0.05 |
| Streptococcus pneumoniae | IntAct | 193 | 47 | 0.10 |
| Drosophila melanogaster | DIP | 22,650 | 1,558 | 3.38 |
| Saccharomyces cerevisiae | DIP | 21,769 | 2,701 | 5.86 |
| Caenorhabditis elegans | DIP | 3,979 | 229 | 0.50 |
| Homo sapiens | DIP | 1,485 | 84 | 0.18 |
| Mus musculus | DIP | 287 | 36 | 0.06 |
| Rattus norvegicus | DIP | 69 | 2 | 0.15 |
| Total: 46,119 interactions in 3,465 proteins (with 530 known PPIs) | ||||
Figure 2Inferred protein interaction map in M. tuberculosis.
Figure 3Boxplot of coexpression (a), coevoluation (b) and cofunction values (c)-(e) of the predicted interactions and that of the same-size random selected protein pairs.
Topological parameters of protein-interaction map in M. tuberculosis
| Threshold | Node | Edge | Diameter | Characteristic path length | Clustering coefficient | Power law fitting |
|---|---|---|---|---|---|---|
| None | 3465 | 46119 | 8 | 3.341 | 0.093 | 1.237 |
| 0.5 (PCC) | 3409 | 35582 | 8 | 3.419 | 0.074 | 1.310 |
| 0.5 (COG) | 3397 | 43425 | 8 | 3.445 | 0.099 | 1.220 |
| 0.5 (GOCC) | 3461 | 45193 | 8 | 3.355 | 0.091 | 1.245 |
| 0.5 (GOMF) | 3400 | 31045 | 8 | 3.632 | 0.067 | 1.431 |
| 0.5 (GOBP) | 3373 | 26200 | 9 | 3.713 | 0.076 | 1.440 |
| 0.5 + 0.5 + 0.3 | 3383 | 26827 | 9 | 3.730 | 0.062 | 1.522 |
| (GOCC + GOMF + GOBP) | ||||||
| 0.5 + 0.7 | 3324 | 33134 | 9 | 3.616 | 0.080 | 1.289 |
| (PCC + COG) | ||||||
| 0.5 + 0.9 + 0.8 | 3304 | 29891 | 9 | 3.691 | 0.072 | 1.336 |
| (PCC + COG + GOCC) | ||||||
| 0.5 + 0.7 + 0.4 | 3212 | 13244 | 9 | 4.388 | 0.044 | 1.803 |
| (PCC + GOMF + GOBP) | ||||||
| 0.5 + 0.7 | 3281 | 21240 | 11 | 3.920 | 0.081 | 1.414 |
| (COG + GOBP) |
Figure 4ROC curves of predicting performance in the known interactions in M. tuberculosis.
Prediction performances of the SVM predictor in these known protein interactions of M. tuberculosis
| Feature | ACC | SN | SP | PRE | AUC |
|---|---|---|---|---|---|
| Codon | 0.8728 | 0.8932 | 0.8524 | 0.8582 | 0.9454 |
| Codon-mer | 0.8738 | 0.8971 | 0.8505 | 0.8571 | 0.9507 |
Figure 5Framework of inferring protein interaction map in M. tuberculosis.
Figure 6Identification of evolutionary relationship between two interacting proteins. The maximum of COG value between two groups in which the interacting proteins are located are used as the value of their evolutionary relationship.