| Literature DB >> 34970305 |
Rongquan Wang1, Huimin Ma1, Caixia Wang2.
Abstract
Identifying the protein complexes in protein-protein interaction (PPI) networks is essential for understanding cellular organization and biological processes. To address the high false positive/negative rates of PPI networks and detect protein complexes with multiple topological structures, we developed a novel improved memetic algorithm (IMA). IMA first combines the topological and biological properties to obtain a weighted PPI network with reduced noise. Next, it integrates various clustering results to construct the initial populations. Furthermore, a fitness function is designed based on the five topological properties of the protein complexes. Finally, we describe the rest of our IMA method, which primarily consists of four steps: selection operator, recombination operator, local optimization strategy, and updating the population operator. In particular, IMA is a combination of genetic algorithm and a local optimization strategy, which has a strong global search ability, and searches for local optimal solutions effectively. The experimental results demonstrate that IMA performs much better than the base methods and existing state-of-the-art techniques. The source code and datasets of the IMA can be found at https://github.com/RongquanWang/IMA.Entities:
Keywords: fitness function; graph clustering methods; memetic algorithm; protein complexes; protein-protein interaction networks
Year: 2021 PMID: 34970305 PMCID: PMC8712950 DOI: 10.3389/fgene.2021.794354
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Statistics of used four PPI networks in the study.
| Dataset | Number of node | Number of edge | Density |
|---|---|---|---|
| Krogan | 2674 | 7075 | 0.0019796849348 |
| DIP | 4930 | 17201 | 0.0014157219124 |
| combined6 | 3869 | 17327 | 0.0023156247135 |
| WI-PHI | 5955 | 49604 | 0.0027980540426 |
Statistics of used standard protein complexes.
| Datasets | Number | Protein coverage | Avg size |
|---|---|---|---|
| Standard protein complexes 1 | 812 | 2773 | 8.92 |
| Standard protein complexes 2 | 1045 | 2778 | 8.97 |
| CYC2008 protein complexes | 193 | 1371 | 8.33 |
| MIPS protein complexes | 212 | 1202 | 15.61 |
FIGURE 1The flowchart of IMA framework.
FIGURE 2The effect of Pop. It shows how the variation of parameter Pop affect the Total score of IMA method in standard protein complexes 1 and standard protein complexes 2.
FIGURE 3The effect of MaxIter. It shows how the variation of parameter MaxIter affect the Total score of IMA method in standard protein complexes 1 and standard protein complexes 2.
Parameters of each method used in the study.
| ID | Algorithms | Parameter |
|---|---|---|
| 1 | MCODE | (default setting) |
| 2 | MCL | inflation = 2 (author suggestions) |
| 3 | IPCA | S = 3, |
| 4 | COACH | w = 0.225 (default setting) |
| 5 | ClusterONE | Density = auto, Overlap threshold = 0.8 (author suggestions) |
| 6 | PEWCC | Overlap = 0.8,-r = 0.1,Re-join = 0.3 (author suggestions) |
| 7 | ProRank+ | AdjstCD threshold = 0.45 (author suggestions) |
| 8 | WPNCA | lambda = 0.3,size = 3 (author suggestions) |
| 9 | WEC | Balance factor ( |
| 10 | EWCA | Structural similarity (ss = 0.4) (author suggestions) |
| 11 | SE-DMTG | Size = 3 (author suggestions) |
| 12 | MPC-C | Overlap threshold = 0.8 (author suggestions) |
| 13 | IMA | Generations ( |
Performance of different algorithms with respect to standard protein complexes 1.
| Algorithms | Num | F-measure | CR | ACC | MMR | Jaccard | Total score |
|---|---|---|---|---|---|---|---|
| Krogan | |||||||
| MCODE | 39 | 0.3414 | 0.2140 | 0.1994 | 0.0351 | 0.2359 | 1.0258 |
| MCL | 370 | 0.4004 | 0.3895 |
| 0.1123 | 0.2902 | 1.5117 |
| IPCA | 582 | 0.5573 | 0.3389 | 0.2653 | 0.2368 | 0.3713 | 1.7696 |
| COACH | 345 | 0.5254 | 0.3473 | 0.2667 | 0.1824 | 0.3556 | 1.6775 |
| ClusterONE | 240 | 0.4694 | 0.3085 | 0.2829 | 0.1262 | 0.3324 | 1.5194 |
| PEWCC | 383 | 0.5289 | 0.3231 | 0.2554 | 0.2194 | 0.3786 | 1.7053 |
| ProRank+ | 357 | 0.5448 | 0.3660 | 0.2718 | 0.2018 | 0.3544 | 1.7388 |
| WPNCA | 369 | 0.5446 | 0.3897 | 0.2758 | 0.1663 | 0.3415 | 1.7179 |
| WEC | 516 | 0.5440 | 0.3442 | 0.2637 | 0.2573 | 0.4005 | 1.8089 |
| EWCA | 676 | 0.5883 | 0.3782 | 0.2769 | 0.3019 | 0.4073 | 1.9525 |
| SE-DMTG | 372 | 0.5878 | 0.3504 | 0.2820 | 0.2284 | 0.4058 | 1.8781 |
| MPC-C | 458 | 0.6010 | 0.3760 | 0.2814 | 0.2344 | 0.3882 | 1.8810 |
| IMA-unweighted | 767 | 0.5917 | 0.3827 | 0.2843 | 0.3221 | 0.4174 | 1.9982 |
| IMA | 773 |
|
| 0.2859 |
|
|
|
| DIP | |||||||
| MCODE | 26 | 0.1300 | 0.2193 | 0.1337 | 0.0103 | 0.1292 | 0.6224 |
| MCL | 628 | 0.3106 | 0.3578 | 0.2684 | 0.0752 | 0.2155 | 1.2275 |
| IPCA | 1242 | 0.5741 | 0.3519 | 0.2404 | 0.2096 | 0.3004 | 1.6764 |
| COACH | 329 | 0.5850 | 0.3697 | 0.2462 | 0.1254 | 0.3305 | 1.6568 |
| ClusterONE | 904 | 0.5118 |
|
| 0.1467 | 0.3297 | 1.8214 |
| PEWCC | 648 | 0.6004 | 0.3783 | 0.2438 | 0.1938 | 0.3514 | 1.7677 |
| ProRank+ | 167 | 0.3123 | 0.2115 | 0.1870 | 0.0452 | 0.2007 | 0.9567 |
| WPNCA | 623 | 0.5888 | 0.4307 | 0.2594 | 0.1807 | 0.3360 | 1.7955 |
| WEC | 253 | 0.4185 | 0.3104 | 0.2309 | 0.0953 | 0.3078 | 1.3628 |
| EWCA | 964 | 0.6428 | 0.4374 | 0.2691 | 0.2534 | 0.3723 | 1.9750 |
| SE-DMTG | 869 | 0.6309 | 0.3822 | 0.2674 | 0.2264 | 0.3573 | 1.8482 |
| MPC-C | 1477 | 0.6632 | 0.4413 | 0.2729 | 0.2716 | 0.3537 | 2.0027 |
| IMA-unweighted | 1569 | 0.6861 | 0.4488 | 0.2731 | 0.2957 | 0.3959 | 2.0995 |
| IMA | 1338 |
| 0.4528 | 0.2820 |
|
|
|
| combined6 | |||||||
| MCODE | 63 | 0.2483 | 0.3441 | 0.1762 | 0.0313 | 0.1832 | 0.9833 |
| MCL | 508 | 0.3606 | 0.4628 | 0.3098 | 0.0909 | 0.2871 | 1.5114 |
| IPCA | 2160 | 0.7218 |
| 0.2783 | 0.3093 | 0.4396 | 2.2597 |
| COACH | 682 | 0.5623 | 0.4839 | 0.2653 | 0.2035 | 0.3703 | 1.8855 |
| ClusterONE | 648 | 0.4165 | 0.5098 |
| 0.1235 | 0.3173 | 1.6979 |
| PEWCC | 737 | 0.6586 | 0.4713 | 0.2594 | 0.2661 | 0.4370 | 2.0924 |
| ProRank+ | 472 | 0.5837 | 0.3898 | 0.2394 | 0.2162 | 0.4363 | 1.8657 |
| WPNCA | 898 | 0.5912 | 0.5725 | 0.2720 | 0.1872 | 0.3306 | 1.9537 |
| WEC | 544 | 0.5614 | 0.4367 | 0.2504 | 0.2001 | 0.4120 | 1.8609 |
| EWCA | 935 | 0.6860 | 0.5058 | 0.2771 | 0.3097 | 0.4534 | 2.2321 |
| SE-DMTG | 490 | 0.6854 | 0.4347 | 0.2767 | 0.2326 | 0.4517 | 2.2321 |
| MPC-C | 1008 | 0.7001 | 0.4871 | 0.2769 | 0.2820 | 0.4438 | 2.1899 |
| IMA-unweighted | 1183 | 0.7097 | 0.4746 | 0.2771 | 0.3341 | 0.4755 | 2.271 |
| IMA | 1054 |
| 0.4829 | 0.2734 |
|
|
|
| WI-PHI | |||||||
| MCODE | 124 | 0.1095 | 0.4282 | 0.1720 | 0.0142 | 0.1095 | 0.8333 |
| MCL | 772 | 0.2597 | 0.4323 | 0.2960 | 0.0647 | 0.2246 | 1.2773 |
| IPCA | 2181 | 0.5361 | 0.5789 | 0.2819 | 0.2604 | 0.3585 | 2.0156 |
| COACH | 1353 | 0.4689 | 0.6095 | 0.2752 | 0.1803 | 0.3144 | 1.8483 |
| ClusterONE | 1313 | 0.1813 | 0.4908 | 0.3103 | 0.0689 | 0.2065 | 1.2577 |
| PEWCC | 1813 | 0.5440 | 0.5943 | 0.2757 | 0.2535 | 0.3516 | 2.0192 |
| ProRank+ | 255 | 0.1814 | 0.2038 | 0.1801 | 0.0259 | 0.1719 | 0.7630 |
| WPNCA | 1813 | 0.5385 | 0.6198 | 0.2834 | 0.2257 | 0.3432 | 2.0106 |
| WEC | 729 | 0.3700 | 0.4830 | 0.2353 | 0.0969 | 0.2987 | 1.4839 |
| EWCA | 964 | 0.6428 | 0.4374 | 0.2691 | 0.2534 | 0.3723 | 1.9750 |
| SE-DMTG | 774 | 0.4945 | 0.5198 |
| 0.1816 | 0.3827 | 1.8894 |
| MPC-C | 2560 | 0.6068 | 0.5054 | 0.2793 | 0.2013 | 0.3668 | 1.9597 |
| IMA-unweighted | 3316 | 0.6769 | 0.5924 | 0.2983 | 0.3841 | 0.4423 | 2.3941 |
| IMA | 2561 |
|
| 0.3060 |
|
|
|
The bold values are the highest value of each metric of each PPI network.
Performance of different algorithms with respect to standard protein complexes 2.
| Algorithms | Num | F-measure | CR | ACC | MMR | Jaccard | Total score |
|---|---|---|---|---|---|---|---|
| Krogan | |||||||
| MCODE | 39 | 0.2317 | 0.1863 | 0.1861 | 0.0238 | 0.1982 | 0.8260 |
| MCL | 370 | 0.3214 | 0.3534 |
| 0.0792 | 0.2559 | 1.3187 |
| IPCA | 582 | 0.4606 | 0.3097 | 0.2405 | 0.1545 | 0.3271 | 1.4924 |
| COACH | 345 | 0.4369 | 0.3166 | 0.2441 | 0.1228 | 0.3136 | 1.4340 |
| ClusterONE | 240 | 0.3913 | 0.2729 | 0.2756 | 0.0887 | 0.2826 | 1.3110 |
| PEWCC | 383 | 0.4228 | 0.2913 | 0.2343 | 0.1418 | 0.3247 | 1.4150 |
| ProRank+ | 357 | 0.4435 | 0.3282 | 0.2621 | 0.1306 | 0.3067 | 1.4711 |
| WPNCA | 369 | 0.4361 |
| 0.2614 | 0.1083 | 0.2960 | 1.4590 |
| WEC | 516 | 0.4344 | 0.3022 | 0.2465 | 0.1559 | 0.3351 | 1.4741 |
| EWCA | 676 | 0.5112 | 0.3483 | 0.2663 | 0.1986 | 0.3607 | 1.6851 |
| SE-DMTG | 372 | 0.5060 | 0.3092 | 0.2684 | 0.1445 | 0.3471 | 1.5852 |
| MPC-C | 458 | 0.5252 | 0.3354 | 0.2706 | 0.1583 | 0.3338 | 1.6233 |
| IMA-unweighted | 767 | 0.5091 | 0.3488 | 0.2709 | 0.2078 | 0.3665 | 1.7031 |
| IMA | 773 |
| 0.3526 | 0.2733 |
|
|
|
| DIP | |||||||
| MCODE | 26 | 0.1061 | 0.1982 | 0.1205 | 0.0071 | 0.1114 | 0.5433 |
| MCL | 628 | 0.2409 | 0.3025 | 0.2504 | 0.0482 | 0.1921 | 1.0341 |
| IPCA | 1242 | 0.4516 | 0.3196 | 0.2304 | 0.1298 | 0.2674 | 1.3989 |
| COACH | 329 | 0.4703 | 0.3184 | 0.2307 | 0.0800 | 0.2829 | 1.3823 |
| ClusterONE | 904 | 0.4232 |
| 0.2937 | 0.0972 | 0.2874 | 1.5373 |
| PEWCC | 648 | 0.4812 | 0.3336 | 0.2329 | 0.1125 | 0.2986 | 1.4588 |
| ProRank+ | 167 | 0.2506 | 0.1895 | 0.1802 | 0.0323 | 0.1784 | 0.8310 |
| WPNCA | 623 | 0.4603 | 0.3709 | 0.2472 | 0.1065 | 0.2866 | 1.4715 |
| WEC | 253 | 0.2921 | 0.2588 | 0.2422 | 0.0526 | 0.2497 | 1.0954 |
| EWCA | 964 | 0.5334 | 0.3812 | 0.2536 | 0.1522 | 0.3226 | 1.6429 |
| SE-DMTG | 869 | 0.5305 | 0.3403 | 0.2562 | 0.1382 | 0.3108 | 1.5697 |
| MPC-C | 1477 | 0.5692 | 0.3799 | 0.2538 | 0.1706 | 0.3050 | 1.6785 |
| IMA-unweighted | 1569 | 0.5763 | 0.3873 | 0.2583 | 0.1855 | 0.336 | 1.7435 |
| IMA | 1338 |
| 0.3828 |
|
|
|
|
| combined6 | |||||||
| MCODE | 63 | 0.1771 | 0.2943 | 0.1642 | 0.0213 | 0.1543 | 0.8113 |
| MCL | 508 | 0.2902 | 0.4078 | 0.2966 | 0.0629 | 0.2605 | 1.3181 |
| IPCA | 2160 | 0.5641 | 0.4521 | 0.2824 | 0.1854 | 0.3725 | 1.8567 |
| COACH | 682 | 0.4454 | 0.4171 | 0.2626 | 0.1229 | 0.3184 | 1.5665 |
| ClusterONE | 648 | 0.3454 | 0.4385 |
| 0.0885 | 0.2881 | 1.4752 |
| PEWCC | 737 | 0.5223 | 0.4064 | 0.2588 | 0.1531 | 0.3739 | 1.7145 |
| ProRank+ | 472 | 0.4697 | 0.3305 | 0.2322 | 0.1237 | 0.3631 | 1.5194 |
| WPNCA | 898 | 0.4968 |
| 0.2822 | 0.1182 | 0.3140 | 1.7231 |
| WEC | 544 | 0.4324 | 0.3842 | 0.2577 | 0.1172 | 0.3562 | 1.5478 |
| EWCA | 935 | 0.5657 | 0.4523 | 0.2810 | 0.1846 | 0.3971 | 1.8807 |
| SE-DMTG | 490 | 0.5568 | 0.3747 | 0.2782 | 0.1414 | 0.3793 | 1.7306 |
| MPC-C | 1008 | 0.5964 | 0.4077 | 0.2677 | 0.1752 | 0.3756 | 1.8225 |
| IMA-unweighted | 1183 | 0.602 | 0.3999 | 0.2695 | 0.1993 | 0.3988 | 1.8694 |
| IMA | 1054 |
| 0.4138 | 0.2828 |
|
|
|
| WI-PHI | |||||||
| MCODE | 124 | 0.0766 | 0.3701 | 0.1606 | 0.0086 | 0.0938 | 0.7096 |
| MCL | 772 | 0.2116 | 0.3563 | 0.2776 | 0.0445 | 0.2042 | 1.0941 |
| IPCA | 2181 | 0.4655 | 0.4970 | 0.2830 | 0.1634 | 0.3250 | 1.7340 |
| COACH | 1353 | 0.3577 | 0.5228 | 0.2559 | 0.1115 | 0.2821 | 1.5300 |
| ClusterONE | 1313 | 0.1571 | 0.4179 | 0.2920 | 0.0481 | 0.1956 | 1.1108 |
| PEWCC | 1813 | 0.4464 | 0.5215 | 0.2615 | 0.1495 | 0.3189 | 1.6979 |
| ProRank+ | 255 | 0.1397 | 0.1732 | 0.1714 | 0.0190 | 0.1502 | 0.6536 |
| WPNCA | 1813 | 0.4285 |
| 0.2776 | 0.1346 | 0.3093 | 1.6945 |
| WEC | 729 | 0.2914 | 0.4329 | 0.2523 | 0.0590 | 0.2767 | 1.3122 |
| EWCA | 2347 | 0.4346 | 0.5382 | 0.2840 | 0.1679 | 0.3295 | 1.7542 |
| SE-DMTG | 774 | 0.4252 | 0.4397 | 0.2894 | 0.1183 | 0.3307 | 1.6033 |
| MPC-C | 2560 | 0.6068 | 0.5054 | 0.2793 | 0.2013 | 0.3668 | 1.9597 |
| IMA-unweighted | 3316 | 0.5822 | 0.5041 | 0.2902 | 0.2427 | 0.3838 | 2.0029 |
| IMA | 2561 |
| 0.5342 |
|
|
|
|
The bold values are the highest value of each metric of each PPI network.
Functional enrichment of the protein complexes identified using different algorithms.
| Algorithms | Num |
|
|
|
| Significant |
|---|---|---|---|---|---|---|
| Krogan | ||||||
| MCODE | 39 | 6 (15.38%) | 8 (20.51%) | 14 (35.89%) | 24 (61.53%) | 33 (84.61%) |
| MCL | 370 | 43 (11.62%) | 72 (19.46%) | 125 (33.78%) | 246 (66.48%) | 275 (74.32%) |
| IPCA | 582 | 108 (18.56%) | 162 (27.84%) | 244 (41.93%) | 445 (76.47%) | 485 (83.34%) |
| COACH | 345 | 66 (19.13%) | 107 (31.01%) | 165 (47.82%) | 272 (78.83%) | 295 (85.5%) |
| ClusterONE | 240 | 62 (25.83%) | 92 (38.33%) | 130 (54.16%) | 199 (82.91%) | 212 (88.33%) |
| PEWCC | 383 | 144 (37.6%) | 196 (51.18%) | 275 (71.81%) | 357 (93.22%) | 374 (97.66%) |
| ProRank+ | 357 | 88 (24.65%) | 119 (33.33%) | 184 (51.54%) | 283 (79.27%) | 311 (87.11%) |
| WPNCA | 369 | 62 (16.8%) | 100 (27.1%) | 167 (45.26%) | 290 (78.59%) | 311 (84.28%) |
| WEC | 516 | 133 (25.78%) | 186 (36.05%) | 262 (50.78%) | 421 (81.59%) | 447 (86.63%) |
| EWCA | 676 | 149 (22.04%) | 216 (31.95%) | 323 (47.78%) | 529 (78.25%) | 564 (83.43%) |
| SE-DMTG | 372 | 80 (21.51%) | 110 (29.57%) | 161 (43.28%) | 282 (75.81%) | 301 (80.92%) |
| MPC-C | 458 | 130 (28.38%) | 199 (43.45%) | 293 (63.97%) | 442 (96.5%) |
|
| IMA-unweighted | 767 | 219 (28.55%) | 301 (39.24%) | 457 (59.58%) | 681 (88.78%) | 712 (92.82%) |
| IMA | 773 | 226 (29.24%) | 319 (41.27%) | 501 (64.81%) | 719 (93.01%) | 735 (95.08%) |
| DIP | ||||||
| MCODE | 26 | 8 (30.77%) | 12 (46.15%) | 14 (53.84%) | 19 (73.07%) | 19 (73.07%) |
| MCL | 628 | 118 (18.79%) | 184 (29.3%) | 279 (44.43%) | 443 (70.54%) | 485 (77.23%) |
| IPCA | 1242 | 147 (11.84%) | 315 (25.37%) | 556 (44.77%) | 972 (78.26%) | 1039 (83.65%) |
| COACH | 329 | 75 (22.8%) | 122 (37.09%) | 177 (53.81%) | 290 (88.16%) | 305 (92.72%) |
| ClusterONE | 904 | 137 (15.15%) | 201 (22.23%) | 337 (37.27%) | 690 (76.32%) | 772 (85.39%) |
| PEWCC | 648 | 153 (23.61%) | 247 (38.12%) | 376 (58.03%) | 572 (88.28%) | 597 (92.14%) |
| ProRank+ | 167 | 23 (13.77%) | 38 (22.75%) | 63 (37.72%) | 129 (77.24%) | 138 (82.63%) |
| WPNCA | 623 | 156 (25.04%) | 242 (38.84%) | 370 (59.39%) | 562 (90.21%) | 590 (94.7%) |
| WEC | 253 | 97 (38.34%) | 121 (47.83%) | 149 (58.9%) | 195 (77.08%) | 209 (82.61%) |
| EWCA | 964 | 172 (17.84%) | 284 (29.46%) | 477 (49.48%) | 823 (85.37%) | 866 (89.83%) |
| SE-DMTG | 869 | 142 (16.34%) | 213 (24.51%) | 358 (41.2%) | 708 (81.48%) | 770 (88.61%) |
| MPC-C | 1477 | 323 (21.87%) | 538 (36.43%) | 906 (61.35%) | 1398 (94.66%) | 1445 (97.84%) |
| IMA-unweighted | 1569 | 327 (20.84%) | 495 (31.55%) | 810 (51.63%) | 1430 (91.15%) | 1492 (95.1%) |
| IMA | 1338 | 382 (28.55%) | 577 (43.12%) | 897 (67.04%) | 1305 (97.53%) |
|
| combined6 | ||||||
| MCODE | 63 | 26 (41.27%) | 31 (49.21%) | 42 (66.67%) | 57 (90.48%) | 60 (95.24%) |
| MCL | 508 | 129 (25.39%) | 162 (31.89%) | 209 (41.14%) | 323 (63.58%) | 349 (68.7%) |
| IPCA | 2160 | 579 (26.81%) | 784 (36.3%) | 1145 (53.01%) | 1923 (89.03%) | 2027 (93.84%) |
| COACH | 682 | 156 (22.87%) | 196 (28.74%) | 290 (42.52%) | 520 (76.24%) | 575 (84.3%) |
| ClusterONE | 648 | 148 (22.84%) | 208 (32.1%) | 258 (39.82%) | 420 (64.82%) | 461 (71.15%) |
| PEWCC | 737 | 285 (38.67%) | 375 (50.88%) | 505 (68.52%) | 688 (93.35%) | 707 (95.93%) |
| ProRank+ | 472 | 255 (54.03%) | 324 (68.65%) | 395 (83.69%) | 443 (93.86%) | 452 (95.77%) |
| WPNCA | 898 | 375 (41.76%) | 493 (54.9%) | 609 (67.82%) | 797 (88.76%) | 829 (92.32%) |
| WEC | 544 | 235 (43.2%) | 273 (50.19%) | 310 (56.99%) | 400 (73.53%) | 423 (77.76%) |
| EWCA | 935 | 274 (29.3%) | 337 (36.04%) | 437 (46.74%) | 721 (77.11%) | 770 (82.35%) |
| SE-DMTG | 490 | 147 (30.0%) | 199 (40.61%) | 248 (50.61%) | 431 (87.96%) | 455 (92.86%) |
| MPC-C | 1008 | 311 (30.85%) | 437 (43.35%) | 651 (64.58%) | 969 (96.13%) | 993 (98.51%) |
| IMA-unweighted | 1183 | 370 (31.28%) | 547 (46.24%) | 798 (67.46%) | 1117 (94.43%) | 1152 (97.39%) |
| IMA | 1054 | 387 (36.72%) | 557 (52.85%) | 771 (73.15%) | 1032 (97.91%) |
|
| WI-PHI | ||||||
| MCODE | 124 | 24 (19.35%) | 29 (23.38%) | 40 (32.25%) | 58 (46.77%) | 64 (51.61%) |
| MCL | 772 | 25 (3.24%) | 35 (4.54%) | 74 (9.59%) | 234 (30.32%) | 287 (37.19%) |
| IPCA | 2181 | 411 (18.84%) | 550 (25.21%) | 807 (36.99%) | 1259 (57.71%) | 1345 (61.65%) |
| COACH | 1353 | 303 (22.39%) | 422 (31.19%) | 591 (43.68%) | 921 (68.07%) | 989 (73.1%) |
| ClusterONE | 1313 | 198 (15.08%) | 256 (19.5%) | 342 (26.05%) | 555 (42.27%) | 635 (48.36%) |
| PEWCC | 1813 | 435 (23.99%) | 627 (34.58%) | 906 (49.97%) | 1297 (71.54%) | 1363 (75.18%) |
| ProRank+ | 255 | 53 (20.78%) | 60 (23.53%) | 83 (32.55%) | 145 (56.86%) | 156 (61.17%) |
| WPNCA | 1813 | 429 (23.66%) | 594 (32.76%) | 834 (46.0%) | 1253 (69.11%) | 1336 (73.69%) |
| WEC | 729 | 215 (29.49%) | 264 (36.21%) | 337 (46.22%) | 478 (65.56%) | 501 (68.72%) |
| EWCA | 2347 | 474 (20.2%) | 675 (28.76%) | 950 (40.48%) | 1428 (60.85%) | 1532 (65.28%) |
| SE-DMTG | 774 | 87 (11.24%) | 146 (18.86%) | 255 (32.94%) | 496 (64.08%) | 540 (69.76%) |
| MPC-C | 2560 | 452 (17.66%) | 766 (29.93%) | 1382 (53.99%) | 2163 (84.5%) | 2239 (87.47%) |
| IMA-unweighted | 3316 | 715 (21.56%) | 1062 (32.02%) | 1656 (49.93%) | 2646 (79.79%) | 2815 (84.89%) |
| IMA | 2561 | 847 (33.07%) | 1243 (48.53%) | 1732 (67.62%) | 2326 (90.81%) |
|
The bold values are the highest value of each metric of each PPI network.
The co-localization scores of protein complexes detected by different methods in four PPI networks.
| Algorithms | Num | Krogan | Num | DIP | Num | combined6 | Num | WI-PHI |
|---|---|---|---|---|---|---|---|---|
| Co-localization score | ||||||||
| MCODE | 39 |
| 26 | 0.6156 | 63 | 0.6481 | 124 | 0.5586 |
| MCL | 370 | 0.5820 | 628 | 0.5598 | 508 | 0.5182 | 772 | 0.5510 |
| IPCA | 582 | 0.6656 | 1242 | 0.6113 | 2160 | 0.5883 | 2181 | 0.5585 |
| COACH | 345 | 0.6587 | 329 | 0.6527 | 682 | 0.5833 | 1353 | 0.4960 |
| ClusterONE | 240 | 0.6716 | 904 | 0.5778 | 648 | 0.5277 | 1313 | 0.5207 |
| PEWCC | 383 | 0.7107 | 648 | 0.6271 | 737 | 0.6493 | 1813 | 0.5110 |
| ProRank+ | 357 | 0.6779 | 167 |
| 472 | 0.7377 | 255 | 0.5570 |
| WPNCA | 369 | 0.6245 | 623 | 0.6063 | 898 | 0.5270 | 1813 | 0.4997 |
| WEC | 516 | 0.7182 | 253 | 0.6520 | 544 | 0.6803 | 729 | 0.5039 |
| EWCA | 676 | 0.6960 | 964 | 0.6430 | 935 | 0.6707 | 964 | 0.5303 |
| SE-DMTG | 372 | 0.7164 | 869 | 0.6725 | 490 | 0.7247 | 774 | 0.6419 |
| MPC-C | 458 | 0.7315 | 1477 | 0.6503 | 1008 | 0.7178 | 2560 | 0.6111 |
| IMA-unweighted | 767 | 0.7052 | 1569 | 0.6556 | 1183 | 0.7090 | 3316 | 0.6199 |
| IMA | 773 | 0.7351 | 1338 | 0.6728 | 1054 |
| 2561 |
|
The bold values are the highest value of each metric of each PPI network.
FIGURE 4Some examples of detected protein complexes with different topological structures by IMA. Note that these protein complexes contain the number of proteins between 3 and 14. Proteins that have binary interactions are connected by dashed lines.
FIGURE 5The 130 protein complex in standard protein complexes 2 detected by different methods based on WI-PHI PPI network. Note that the red nodes represent the correctly identified proteins, and the blue nodes represent the wrongly identified proteins by method.