| Literature DB >> 22759576 |
Jian Wang1, Dong Xie1, Hongfei Lin1, Zhihao Yang1, Yijia Zhang1.
Abstract
BACKGROUND: Many biological processes recognize in particular the importance of protein complexes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, high false-positive rate of PPIs leads to challenging identification.Entities:
Year: 2012 PMID: 22759576 PMCID: PMC3380758 DOI: 10.1186/1477-5956-10-S1-S18
Source DB: PubMed Journal: Proteome Sci ISSN: 1477-5956 Impact factor: 2.480
Figure 1Flow of our algorithm.
Details of interaction datasets
| Datasets | Number of proteins | Number of interactions |
|---|---|---|
| Gavin | 1430 | 6531 |
| Krogan | 3581 | 14076 |
| DIP | 4928 | 17201 |
Figure 2The effect of filter_thres.
Figure 3The effect of merge_thres.
Figure 4Performance comparison of various approaches on Gavin-Combined.
Figure 5Performance comparison of various approaches on Krogan-Combined.
Figure 6Performance comparison of various approaches on DIP-Combined.
Performance comparison of various approaches on Gavin-CYC2008
| Method | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|
| MCODE | 0.739 | 0.154 | 0.255 | 0.283 | 0.519 | 0.384 |
| CFinder | 0.663 | 0.191 | 0.297 | 0.513 | 0.343 | 0.419 |
| CMC | 0.608 | 0.218 | 0.321 | 0.371 | 0.606 | 0.474 |
| RRW | 0.704 | 0.238 | 0.355 | 0.294 | 0.657 | 0.439 |
| COACH | 0.525 | 0.331 | 0.406 | 0.44 | 0.547 | 0.49 |
| CORE | 0.469 | 0.38 | 0.42 | 0.446 | 0.585 | 0.511 |
| Ours | 0.678 | 0.404 | 0.507 | 0.405 | 0.663 | 0.518 |
Performance comparison of various approaches on Krogan-CYC2008
| Method | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|
| MCODE | 0.612 | 0.081 | 0.142 | 0.273 | 0.345 | 0.307 |
| CFinder | 0.451 | 0.15 | 0.225 | 0.56 | 0.22 | 0.351 |
| CMC | 0.224 | 0.377 | 0.281 | 0.472 | 0.58 | 0.523 |
| RRW | 0.581 | 0.277 | 0.375 | 0.32 | 0.605 | 0.44 |
| COACH | 0.472 | 0.38 | 0.421 | 0.477 | 0.498 | 0.487 |
| CORE | 0.275 | 0.691 | 0.394 | 0.566 | 0.537 | 0.551 |
| Ours | 0.626 | 0.559 | 0.59 | 0.509 | 0.663 | 0.581 |
Performance comparison of various approaches on DIP-CYC2008
| Method | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|
| MCODE | 0.576 | 0.096 | 0.164 | 0.282 | 0.328 | 0.304 |
| CFinder | 0.396 | 0.257 | 0.312 | 0.612 | 0.297 | 0.437 |
| CMC | 0.283 | 0.505 | 0.363 | 0.523 | 0.526 | 0.525 |
| RRW | 0.541 | 0.365 | 0.436 | 0.378 | 0.557 | 0.459 |
| COACH | 0.418 | 0.529 | 0.467 | 0.545 | 0.481 | 0.512 |
| CORE | 0.16 | 0.662 | 0.258 | 0.569 | 0.567 | 0.568 |
| Ours | 0.537 | 0.699 | 0.607 | 0.583 | 0.578 | 0.581 |
Figure 7The effect of core-attachment clustering strategy on filtered networks.
Examples of predicted complexes
| ID | predicted complex | NA | GO biological processes | GO molecular functions | GO cellular components | |||
|---|---|---|---|---|---|---|---|---|
| annotation | p-value | annotation | p-value | annotation | p-value | |||
| 1 | YGR095C YDL111C YGR158C | 1 | GO:0071051 | 1.10e-33 | GO:0000175 | 2.77e-19 | GO:0000176 | 1.25e-33 |
| 2 | YPL243W YML105C YKL122C | 1 | GO:0006617 | 3.10e-18 | GO:0008312 | 1.37e-15 | GO:0005786 | 5.59e-19 |
| 3 | YBR060C YPR162C YNL261W | 1 | GO:0006267 | 3.07e-15 | GO:0003688 | 3.54e-14 | GO:0000808 | 1.65e-19 |
| 4 | YHL025W YJL176C YNR023W | 0.917 | GO:0042766 | 5.97e-30 | GO:0008094 | 1.09e-3 | GO:0016514 | 4.90e-33 |
| 5 | YLR071C YGR104C YOR174W | 0.76 | GO:0006366 | 1.80e-22 | GO:0001104 | 8.82e-54 | GO:0016592 | 4.61e-51 |
| 6 | YLR357W YFR037C YPR034W | - | GO:0006338 | 7.87e-24 | GO:0016887 | 9.77e-17 | GO:0016585 | 2.54e-19 |
| 7 | YDR416W YAL032C YMR288W | - | GO:0000398 | 2.35e-19 | GO:0000384 | 2.04e-14 | GO:0005681 | 5.52e-21 |
| 8 | YNL252C YML025C YDR116C | - | GO:0032543 | 2.74e-7 | GO:0003735 | 2.16e-9 | GO:0000315 | 1.05e-12 |
Figure 8Topology of complex number 6, 7 and 8 in Table 5.