| Literature DB >> 23046795 |
Chen Chen1, Jun-Fei Zhao, Qiang Huang, Rui-Sheng Wang, Xiang-Sun Zhang.
Abstract
BACKGROUND: As protein domains are functional and structural units of proteins, a large proportion of protein-protein interactions (PPIs) are achieved by domain-domain interactions (DDIs), many computational efforts have been made to identify DDIs from experimental PPIs since high throughput technologies have produced a large number of PPIs for different species. These methods can be separated into two categories: deterministic and probabilistic. In deterministic methods, parsimony assumption has been utilized. Parsimony principle has been widely used in computational biology as the evolution of the nature is considered as a continuous optimization process. In the context of identifying DDIs, parsimony methods try to find a minimal set of DDIs that can explain the observed PPIs. This category of methods are promising since they can be formulated and solved easily. Besides, researches have shown that they can detect specific DDIs, which is often hard for many probabilistic methods. We notice that existing methods just view PPI networks as simply assembled by single interactions, but there is now ample evidence that PPI networks should be considered in a global (systematic) point of view for it exhibits general properties of complex networks, such as 'scale-free' and 'small-world'.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23046795 PMCID: PMC3403472 DOI: 10.1186/1752-0509-6-S1-S7
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1PPIs and protein domain compositions are parsimoniously organized in nature. Under each null model, 200 data sets are simulated. The distribution of T is shown as 'violin plot' and p-value is computed based on Gaussian distribution.
Figure 2S.cerevisiae's PPI network shows a relatively larger clustering coefficient. To make the observed clustering coefficient of the PPI network (0.0970) comparable, two network generation procedures are employed as null models. The clustering coefficients of the null models are shown as boxplots.
Figure 3WILP outperforms ILP in terms of the number of the predicted DDIs confirmed by the golden data set. (A) Sensitivities of WILP and ILP are compared as sd varies from 0.05 to 1. In WILP, K is set to 50. (B) Performances of WILP are shown in different K settings. There is a broad interval in which WILP outperforms ILP robustly.
Performance comparison between WILP and ILP
| sd | Total Predictions | True Positives | Sensitivity(%) | Fold Change |
|---|---|---|---|---|
| 1 | 12663 (12663) | 382 (375) | 50.53 (49.60) | 1.21 (1.19) |
| 0.9 | 10592 (10592) | 361 (351) | 47.75 (46.43) | 1.37 (1.33) |
| 0.8 | 8521 (8521) | 341 (342) | 45.11 (45.24) | 1.61 (1.61) |
| 0.7 | 6450 (7102) | 306 (306) | 40.48 (40.48) | 1.91 (1.73) |
| 0.6 | 4379 (5162) | 276 (223) | 36.51 (29.50) | 2.53 (1.74) |
| 0.5 | 2648 (3091) | 190 (176) | 25.13 (23.28) | 2.88 (2.29) |
| 0.4 | 1613 (1620) | 145 (143) | 19.18 (18.92) | 3.61 (3.55) |
| 0.3 | 875 (779) | 104 (89) | 13.76 (11.77) | 4.78 (4.59) |
| 0.2 | 430 (279) | 69 (37) | 9.13 (4.89) | 6.45 (5.33) |
| 0.1 | 131 (63) | 29 (16) | 3.84 (2.12) | 8.90 (10.21) |
Comparison of WILP and ILP in terms of the number of the predicted DDIs confirmed by the golden data set. Predicted DDIs verified according to the golden data set are denoted as true positives. 'Sensitivity' and 'Fold Change' are defined in the main text. Numbers marked in red means that WILP outperforms ILP
Figure 4Statistical significance of the weights. Random weights are given to WILP and the distributions of 'TP' are shown as 'violin plots'.
Figure 5Similarity analysis of the predicted DDIs. Comparison of functional similarities of the predicted DDIs obtained by ILP and WILP (sd varies from 0.5 to 1).