| Literature DB >> 19770263 |
Abstract
MOTIVATION: Clustering of protein-protein interaction networks is one of the most common approaches for predicting functional modules, protein complexes and protein functions. But, how well does clustering perform at these tasks?Entities:
Mesh:
Substances:
Year: 2009 PMID: 19770263 PMCID: PMC3167697 DOI: 10.1093/bioinformatics/btp551
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Performance as judged via three measures (Jaccard, PR and sDensity) of six clustering algorithms and OneCluster in how well they recapitulate (A) MIPS complexes, (B) BP modules and (C) CC modules from the HTP (top) and Y2H (bottom) S.cerevisiae networks.
PR AUC for BP and CC predictions of six clustering algorithms and Neighborhood in the HTP S.cerevisiae network
| MCL | |||||||
|---|---|---|---|---|---|---|---|
| (A) Function prediction | |||||||
| 0.0411 | 0.1557 | 0.0975 | 0.1213 | 0.1276 | 0.1082 | 0.1784 | |
| 0.1337 | 0.3309 | 0.1890 | 0.3003 | 0.2789 | 0.2506 | 0.3743 | |
| (B) Function prediction when factoring out singleton clusters | |||||||
| 0.0411 | 0.1593 | 0.1625 | 0.1251 | 0.1498 | 0.1432 | 0.1784 | |
| 0.1337 | 0.3467 | 0.3512 | 0.3084 | 0.3140 | 0.2985 | 0.3743 | |
| (C) Function prediction when factoring out singleton clusters and large clusters | |||||||
| 0.1320 | 0.1593 | 0.1625 | 0.1251 | 0.1491 | 0.1432 | 0.1784 | |
| 0.3189 | 0.3467 | 0.3512 | 0.3084 | 0.3253 | 0.2985 | 0.3743 | |
| (D) Function prediction when factoring out singleton clusters and poorly annotated clusters | |||||||
| 0.1715 | 0.1755 | 0.1677 | 0.1654 | 0.1787 | 0.1444 | 0.1784 | |
| 0.3331 | 0.3604 | 0.3577 | 0.3420 | 0.3577 | 0.3054 | 0.3743 | |
| (E) Function prediction when local topology is considered for clustering algorithms | |||||||
| 0.1716 | 0.1679 | 0.1710 | 0.1546 | 0.1827 | 0.1815 | 0.1784 | |
| 0.3535 | 0.3521 | 0.3646 | 0.3496 | 0.3772 | 0.3676 | 0.3743 | |
See text for details.
Fig. 2.Function prediction performance as protein annotations are removed. As BP (A) or CC (B) annotations are removed for 10%, 30%, 50%, 70% and 90% of the proteins in the HTP interaction network, the PR AUC of Neighborhood deteriorates more rapidly than that of any of the six clustering algorithms. The average PR AUC over 10 networks is plotted, with each error bar showing ±1SD from the average.