| Literature DB >> 22759424 |
Jieyue He1, Chaojun Li, Baoliu Ye, Wei Zhong.
Abstract
BACKGROUND: Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain Core/attachment structures.Entities:
Mesh:
Year: 2012 PMID: 22759424 PMCID: PMC3314584 DOI: 10.1186/1471-2105-13-S10-S19
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
| Algorithm GSM-CA |
|---|
| Input: a graph |
| Algorithm GSM-FC |
|---|
| Input: a graph |
Results of various algorithms compared with MIPS complexes using DIP data
| Algorithms | MCODE | CFinder | DPClus | COACH | GSM-CA |
|---|---|---|---|---|---|
| #predicted | 59 | 245 | 1143 | 745 | |
| complexes | |||||
| |TP| | 18 | 52 | 133 | 155 | |
| |TB| | 19 | 61 | 144 | 106 | |
| s-measure | 0.132 | 0.231 | 0.198 | 0.307 |
Comparison of the results before and after adding attachments
| Average Size | f-measure of BP | -log(p-value) | |
|---|---|---|---|
| Before | 5.29 | 0.356 | 7.2 |
Statistical significance of functional modules predicted by various methods
| Algorithms | No. of Modules size>=3 | No. of Significant Modules | Average Size | Maximum | f-measure of BP | -log(p-value) | Parameters |
|---|---|---|---|---|---|---|---|
| MCODE | 59 | 54 | 83.8 | 549 | 0.296 | 10.87 | fluff = 0.1; VWP = 0.2 |
| CFinder | 245 | 157 | 10.2 | 1409 | 0.246 | 4.49 | K = 3 |
| DPClus | 217 | 187 | 5.23 | 25 | 0.335 | 6.78 | Density = 0.7;CPin=0.5 |
| COACH | 746 | 608 | 8.54 | 44 | 0.272 | 6.96 | Null |
Figure 1Comparison of f-measure based on three types of GO of GSM-CA and other algorithms.
List of top ten scoring modules identified by GSM-CA and their most enriched GO terms for Biological Process
| ID | Size of module | Number of proteins enriched the same GO Term | Size of GO Term | Name of GO Term | p-value |
|---|---|---|---|---|---|
| 1 | 35 | 33 | 221 | rRNA processing | 1.70e-41 |
| 2 | 42 | 38 | 323 | ribosome biogenesis | 1.61e-39 |
| 3 | 13 | 13 | 15 | tRNA transcription | 1.92e-35 |
| 4 | 19 | 14 | 19 | mRNA polyadenylation | 1.96e-31 |
| 5 | 11 | 11 | 12 | cyclin catabolic process | 8.92e-31 |
| 6 | 14 | 12 | 14 | polyadenylation-dependent snoRNA 3'-end processing | 1.91e-30 |
| 7 | 19 | 18 | 93 | mitochondrial translation | 8.08e-30 |
| 8 | 16 | 14 | 29 | energy coupled proton transport, down electrochemical gradient | 1.33e-29 |
| 9 | 22 | 14 | 20 | RNA polymerase II transcriptional preinitiation complex assembly | 2.70e-29 |
| 10 | 19 | 18 | 101 | nuclear mRNA splicing, via spliceosome | 8.91e-29 |
Figure 2An example of modules identified by the GSM-CA method.
Figure 3The effects of cn. (a) The size of the biggest cluster (b) The total number of the clusters whose size is greater than 2 (c) The average size of the clusters whose size is greater than 2 (d) The average f-measure.
Comparison of the running time of the GSM-FC algorithm and other algorithms
| Algorithms | The running time |
|---|---|
| MCODE | 71.5 s |
| COACH | 6.8 s |
| CFinder | 24.4 s |
| DPClus | 926.0 s |
| GSM-AC | 82.6 s |