| Literature DB >> 23368964 |
Abstract
Functional module identification in biological networks may provide new insights into the complex interactions among biomolecules for a better understanding of cellular functional organization. Most of existing functional module identification methods are based on the optimization of network modularity and cluster networks into groups of nodes within which there are a higher-than-expectation number of edges. However, module identification simply based on this topological criterion may not discover certain kinds of biologically meaningful modules within which nodes are sparsely connected but have similar interaction patterns with the rest of the network. In order to unearth more biologically meaningful functional modules, we propose a novel efficient convex programming algorithm based on the subgradient method with heuristic path generation to solve the problem in a recently proposed framework of blockmodel module identification. We have implemented our algorithm for large-scale protein-protein interaction (PPI) networks, including Saccharomyces cerevisia and Homo sapien PPI networks collected from the Database of Interaction Proteins (DIP) and Human Protein Reference Database (HPRD). Our experimental results have shown that our algorithm achieves comparable network clustering performance in comparison to the more time-consuming simulated annealing (SA) optimization. Furthermore, preliminary results for identifying fine-grained functional modules in both biological networks and the comparison with the commonly adopted Markov Clustering (MCL) algorithm have demonstrated the potential of our algorithm to discover new types of modules, within which proteins are sparsely connected but with significantly enriched biological functionalities.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23368964 PMCID: PMC3549836 DOI: 10.1186/1471-2105-14-S2-S23
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1An example of path generation: A. Network structure; B. Path generation procedure.
Parameter settings in SA and SGPG
| SA | 0.99 | 40 | 0.001 | 100 | 20 | - | - |
| SGPG | - | - | - | - | - | 10 | 5 |
Comparison of SA and SGPG on Homo sapien and Saccharomyces cerevisia PPI networks
| PPI | Method | Time(h) | Time(h) | Time(h) | |||
|---|---|---|---|---|---|---|---|
| SA | 0.5393 | 1.73 | 0.6530 | 45.07 | 0.7180 | 180.26 | |
| SGPR | 0.5346 | 0.5 | 0.6452 | 1.95 | 0.6898 | 6.35 | |
| SA | 0.5692 | 1.35 | 0.6834 | 25.02 | 0.7544 | 102.65 | |
| SGPR | 0.5690 | 0.3 | 0.6752 | 1.15 | 0.7292 | 3.34 | |
Figure 2Comparison between SA and SGPG for the number of identified modules of .
Figure 3Percentage of different categories of modules detected by SGPG and MCL (annotated by KOG). A. KOG percentage of Saccharomyces cerevisia PPI network. B. KOG percentage of Homo sapien PPI network.
Topological analysis of different KOG categories in Saccharomyces cerevisia PPI network
| KOG ID | Method | proteins | sparse modules/modules | Avg. density | |
|---|---|---|---|---|---|
| U | SGPG | 353 | 15/26 | 2.98% | 0.0814 |
| MCL | 256 | 0/21 | 27.38% | 0.2402 | |
| K | SGPG | 359 | 6/24 | 6.68% | 0.1352 |
| MCL | 361 | 0/19 | 26.35%0 | 0.1834 | |
| J | SGPG | 579 | 9/24 | 9.16% | 0.0678 |
| MCL | 358 | 0/25 | 37.90% | 0.1429 | |
| T | SGPG | 169 | 13/21 | 3.47% | 0.0755 |
| MCL | 94 | 0/12 | 31.31% | 0.0912 | |
Figure 4A subnetwork with sparsely connected modules detected by SGPG. Module A is enriched in hexokinase activity with p-value 1.71e-5. Module B is enriched in response to endogenous stimulus with p-value 4.77e-5. Module C is enriched in nucleoside phosphate metabolism with p-value 3.43e-6. Patterns I and II are two specific interaction patterns in the subnetwork.
Sparse modules in U and T KOG categories for Saccharomyces cerevisia PPI network
| KOG ID | Sparse module example | Enriched genes | Enriched GO Term | GO Level | |
|---|---|---|---|---|---|
| U | YDR179C, YNL287W, YDL216C YCR099C, YIL004C,YAL026C YLR268W, YLR093C, YPR163C YPR148C, YOL064C, YOL117W YGL084C, YLR031W, YIL076W YPL179W, YKL191W, YPL010W | YOL117W, YDR179C, YDL216C | protein deneddylation | [+8, 0] | 2.01e-5 |
| T | YJL092W, YDR490C, YOR231W YJL005W, YPL074W, YPL083C YNL323W,YOL100W | YDR490C, YOL100W, YNL323W, YJL005W, YOR231W | signal transduction | [+3, -1] | 6.09e-5 |
| T | YDR076W, YDL059C, YJL173C YPL164C, YER171W, YPL026C YCR079W, YPL150W, YHR169W YJR062C | YDL059C, YPL026C, YER171W, YPL164C, YJL173C, YDR076W | response to endogenous stimulus | [+2, -1] | 4.77e-5 |
Topological analysis of different KOG categories in Homo sapien PPI network
| KOG ID | Method | proteins | sparse modules/modules | Avg. density | |
|---|---|---|---|---|---|
| T | SGPG | 1970 | 59/126 | 4.91% | 0.0822 |
| MCL | 2481 | 0/66 | 26.32% | 0.1696 | |
| K | SGPG | 878 | 27/59 | 3.15% | 0.0779 |
| MCL | 916 | 0/37 | 30.41%0 | 0.1928 | |
| U | SGPG | 592 | 3/24 | 4.95% | 0.0448 |
| MCL | 517 | 0/33 | 31.42% | 0.1359 | |
Figure 5A subnetwork with sparsely connected modules detected by SGPG. Module A is enriched in sequence-specific DNA binding with p-value 9.91e-7. Module B is enriched in cellular response to calcium ion with p-value 4.04e-7. Module D is enriched in MAP kinase activity with p-value 8.60e-5. Patterns I and II are two specific interaction patterns in the subnetwork.
Sparse modules in T and K KOG categories for Homo sapien PPI network
| KOG ID | Sparse module example | Enriched genes | Enriched GO Term | GO Level | |
|---|---|---|---|---|---|
| T | NTRK1, NTRK3, NTRK2 VAV1, VAV3 | NTRK1, NTRK2, NTRK3 | neurotrophin receptor activity | [+3, -1] | 2.95e-9 |
| T | PIK3R3, PIK3R2, PIK3R1 | PIK3R3, PIK3R2, PIK3R1 | phosphatidylinositol 3-kinase complex | [+5, -1] | 4.77e-9 |
| K | JUN, JUNB, JUND | JUN, JUNB, | cellular response to | [+6, -1] | 4.04e-7 |