| Literature DB >> 25405206 |
Qiguo Dai1, Maozu Guo1, Yingjie Guo1, Xiaoyan Liu1, Yang Liu1, Zhixia Teng2.
Abstract
Protein complex formed by a group of physical interacting proteins plays a crucial role in cell activities. Great effort has been made to computationally identify protein complexes from protein-protein interaction (PPI) network. However, the accuracy of the prediction is still far from being satisfactory, because the topological structures of protein complexes in the PPI network are too complicated. This paper proposes a novel optimization framework to detect complexes from PPI network, named PLSMC. The method is on the basis of the fact that if two proteins are in a common complex, they are likely to be interacting. PLSMC employs this relation to determine complexes by a penalized least squares method. PLSMC is applied to several public yeast PPI networks, and compared with several state-of-the-art methods. The results indicate that PLSMC outperforms other methods. In particular, complexes predicted by PLSMC can match known complexes with a higher accuracy than other methods. Furthermore, the predicted complexes have high functional homogeneity.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25405206 PMCID: PMC4227386 DOI: 10.1155/2014/720960
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Algorithm 1PLSMC (G, λ, τ, N ).
Figure 1Comparison of PLSMC with different parameter setting. (a) and (b) are the comparison of composite score and execution time of PLSMC with different value of N (max size of subnetwork) applied to the four networks. (c) is the composite scores of PLSMC and PLSMC without the penalty term (denoted by LSMC).
Figure 2Comparison on composite score of the algorithms applied to four networks. Various shades of the same color denote f-measure, Acc, and MMR submetrics. The total height of each bar is the value of composite score.
Figure 3The number of matched known complexes of the algorithms.
Figure 4The COMPASS complex as detected by the six algorithms. Hexagon nodes represent the proteins involved in the COMPASS complex. Shaded areas are the clusters detected by the algorithms, which have the max overlapping scores (os) with COMPASS complex.
Comparison on biological relevance of complexes predicted by the algorithms.
| Network | Method | MF | BP | CC |
|---|---|---|---|---|
| Krogan |
|
|
|
|
| SLCP2 | 0.394 | 0.114 | 0.094 | |
| ClusterONE | 0.311 | 0.291 | 0.357 | |
| RSGNM | 0.392 | 0.270 | 0.270 | |
| OCG | 0.199 | 0.185 | 0.331 | |
| MCL | 0.265 | 0.057 | 0.033 | |
| CFinder | 0.296 | 0.287 | 0.330 | |
|
| ||||
| Collins |
|
|
|
|
| SLCP2 | 0.405 | 0.353 | 0.410 | |
| ClusterONE | 0.401 | 0.377 | 0.419 | |
| RSGNM | 0.376 | 0.371 | 0.418 | |
| OCG | 0.519 | 0.439 | 0.612 | |
| MCL | 0.380 | 0.240 | 0.331 | |
| CFinder | 0.439 | 0.351 | 0.439 | |
|
| ||||
| Gavin |
|
|
|
|
| SLCP2 | 0.374 | 0.153 | 0.189 | |
| ClusterONE | 0.374 | 0.308 | 0.360 | |
| RSGNM | 0.382 | 0.333 | 0.389 | |
| OCG | 0.381 | 0.310 | 0.405 | |
| MCL | 0.308 | 0.112 | 0.210 | |
| CFinder | 0.387 | 0.350 | 0.401 | |
|
| ||||
| BioGRID |
|
|
|
|
| SLCP2 | 0.443 | 0.184 | 0.117 | |
| ClusterONE | 0.439 | 0.447 | 0.439 | |
| RSGNM | 0.363 | 0.277 | 0.267 | |
| OCG | 0.262 | 0.321 | 0.343 | |
| MCL | 0.400 | 0.176 | 0.140 | |
| CFinder | — | — | — | |
|
| ||||
| CYC2008 | 0.458 | 0.424 | 0.525 | |
MF, molecular function; BP, biological process; CC, cellular compartment.