| Literature DB >> 27100396 |
Xianjun Shen1, Li Yi1, Xingpeng Jiang1, Tingting He1, Xiaohua Hu1,2, Jincai Yang1.
Abstract
The identification of temporal protein complexes would make great contribution to our knowledge of the dynamic organization characteristics in protein interaction networks (PINs). Recent studies have focused on integrating gene expression data into static PIN to construct dynamic PIN which reveals the dynamic evolutionary procedure of protein interactions, but they fail in practice for recognizing the active time points of proteins with low or high expression levels. We construct a Time-Evolving PIN (TEPIN) with a novel method called Deviation Degree, which is designed to identify the active time points of proteins based on the deviation degree of their own expression values. Owing to the differences between protein interactions, moreover, we weight TEPIN with connected affinity and gene co-expression to quantify the degree of these interactions. To validate the efficiencies of our methods, ClusterONE, CAMSE and MCL algorithms are applied on the TEPIN, DPIN (a dynamic PIN constructed with state-of-the-art three-sigma method) and SPIN (the original static PIN) to detect temporal protein complexes. Each algorithm on our TEPIN outperforms that on other networks in terms of match degree, sensitivity, specificity, F-measure and function enrichment etc. In conclusion, our Deviation Degree method successfully eliminates the disadvantages which exist in the previous state-of-the-art dynamic PIN construction methods. Moreover, the biological nature of protein interactions can be well described in our weighted network. Weighted TEPIN is a useful approach for detecting temporal protein complexes and revealing the dynamic protein assembly process for cellular organization.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27100396 PMCID: PMC4839750 DOI: 10.1371/journal.pone.0153967
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The properties of TEPIN, DPIN and SPIN.
| Network | Average nodes | Average edges | Average density |
|---|---|---|---|
| 447 | 839 | 0.008800675 | |
| 609 | 917 | 0.004794229 | |
| 5093 | 24743 | 0.001908184 |
Fig 1Distribution of the number of proteins with varying amount of active time points in TEPIN.
Fig 2Percentage comparison of known protein complexes matched by the predicted protein complexes detected from various kinds of networks.
Fig 3The protein complexes labeled as 550.1.213 predicted from WTEPIN and WDPIN.
(A) shows the real complex labeled as 550.1.213 in the known protein complex set. (B) and (C) are the protein complexes with the identical label predicted from WTEPIN and WDPIN by CAMSE algorithm respectively. For each predicted protein complex, the proteins shown in red are involved in the real complex, while those shown in blue are not.
Fig 4Percentage comparison of known protein complexes matched by the predicted protein complexes detected from unweighted and weighted networks.
Performance comparison of SPIN, DPIN and TEPIN.
| Algorithms | Networks | #AS | #PC | #MPC | MKC | #Perfect | |||
|---|---|---|---|---|---|---|---|---|---|
| WTEPIN | 9.0 | 2906 | 1599 | 647 | 0.794 | 0.550 | 0.650 | 28 | |
| WDPIN | 9.8 | 2893 | 1389 | 449 | 0.693 | 0.480 | 0.567 | 21 | |
| WSPIN | 8.7 | 1150 | 734 | 487 | 0.560 | 0.638 | 0.597 | 11 | |
| TEPIN | 4.9 | 2401 | 925 | 526 | 0.633 | 0.385 | 0.479 | 16 | |
| DPIN | 4.3 | 2433 | 967 | 468 | 0.607 | 0.397 | 0.480 | 19 | |
| SPIN | 8.7 | 1274 | 586 | 414 | 0.474 | 0.460 | 0.467 | 0 | |
| WTEPIN | 5.6 | 1630 | 672 | 568 | 0.576 | 0.412 | 0.481 | 32 | |
| WDPIN | 5.7 | 1317 | 576 | 426 | 0.475 | 0.437 | 0.455 | 24 | |
| WSPIN | 3.9 | 741 | 269 | 438 | 0.301 | 0.363 | 0.329 | 54 | |
| TEPIN | 5.8 | 2276 | 608 | 505 | 0.521 | 0.267 | 0.353 | 18 | |
| DPIN | 6.9 | 2202 | 538 | 394 | 0.446 | 0.244 | 0.316 | 12 | |
| SPIN | 5.1 | 957 | 226 | 340 | 0.238 | 0.236 | 0.237 | 20 | |
| TEPIN | 3.5 | 1744 | 583 | 518 | 0.517 | 0.334 | 0.406 | 18 | |
| DPIN | 3.8 | 1689 | 598 | 400 | 0.474 | 0.354 | 0.405 | 11 | |
| SPIN | 3.7 | 613 | 180 | 257 | 0.183 | 0.294 | 0.225 | 15 |
#AS: the average size of predicted protein complexes;
#PC: the total number of predicted protein complexes;
#MPC: the number of predicted protein complexes matched by known protein complexes;
#MKC: the number of known complexes matched by predicted protein complexes;
#Perfect: the number of known complexes perfectly matched by predicted protein complexes.
Function enrichment analysis of predicted protein complexes detected from WTEPIN and WDPIN.
| Network | #PC | <E-15 | [E-15, E-10) | [E-10, E-5) | [E-5, 0.01) | > = 0.01 |
|---|---|---|---|---|---|---|
| 301 | 34.9% (105) | 27.6% (83) | 27.2% (82) | 9.0% (27) | 1.3% (4) | |
| 329 | 22.8% (75) | 25.2% (83) | 35.0% (115) | 15.2% (50) | 1.8% (6) |
#PC: the total number of predicted protein complexes.
Some examples of the predicted protein complexes with small p-values detected from WTEPIN.
| No. | p-value | Predicted protein complex | GO term | |
|---|---|---|---|---|
| 1 | 4.37e-44 | 0.16 | ||
| 2 | 1.57e-38 | 0.50 | ||
| 3 | 1.16e-37 | 0.45 | ||
| 4 | 3.15e-37 | 0.54 | ||
| 5 | 7.02e-36 | 0.51 | ||
| 6 | 7.24e-11 | histone acetyltransferase activity | AmiGO | 0.16 | |
| 7 | 7.17e-08 | RNA polymerase II activity | AmiGO | 0.13 | |
| 8 | 3.54e-07 | 0.24 | ||
| 9 | 2.11E-07 | transcription factor activity, transcription factor binding | AmiGO | 0.26 | |
| 10 | 2.50E-07 | transcription factor activity, transcription factor binding | AmiGO | 0.13 |
Fig 5Dynamic evolutionary procedure of a predicted temporal protein complex.
The red proteins are unchanged in this procedure; the blue ones shown in (A) are absent in (B), and then reappear in (C); and the green protein shown in (A) is absent in both (B) and (C).