| Literature DB >> 35281831 |
Rongquan Wang1, Huimin Ma1, Caixia Wang2.
Abstract
Detecting protein complexes is one of the keys to understanding cellular organization and processes principles. With high-throughput experiments and computing science development, it has become possible to detect protein complexes by computational methods. However, most computational methods are based on either unsupervised learning or supervised learning. Unsupervised learning-based methods do not need training datasets, but they can only detect one or several topological protein complexes. Supervised learning-based methods can detect protein complexes with different topological structures. However, they are usually based on a type of training model, and the generalization of a single model is poor. Therefore, we propose an Ensemble Learning Framework for Detecting Protein Complexes (ELF-DPC) within protein-protein interaction (PPI) networks to address these challenges. The ELF-DPC first constructs the weighted PPI network by combining topological and biological information. Second, it mines protein complex cores using the protein complex core mining strategy we designed. Third, it obtains an ensemble learning model by integrating structural modularity and a trained voting regressor model. Finally, it extends the protein complex cores and forms protein complexes by a graph heuristic search strategy. The experimental results demonstrate that ELF-DPC performs better than the twelve state-of-the-art approaches. Moreover, functional enrichment analysis illustrated that ELF-DPC could detect biologically meaningful protein complexes. The code/dataset is available for free download from https://github.com/RongquanWang/ELF-DPC.Entities:
Keywords: biological information; ensemble learning; graph clustering algorithms; network embedding; protein complexes; protein-protein interaction networks
Year: 2022 PMID: 35281831 PMCID: PMC8908451 DOI: 10.3389/fgene.2022.839949
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The detailed properties of the protein-protein interaction datasets.
| Dataset | Number of node | Number of edge | Density |
|---|---|---|---|
| Gavin | 1855 | 7,669 | 0.004 459 796 985 |
| Krogan core | 2,674 | 7,075 | 0.001 979 684 934 |
| DIP | 4,930 | 17 201 | 0.001 415 721 912 41 |
| MIPS | 4,553 | 12 318 | 0.001 188 694 605 27 |
The properties of the standard protein complexes.
| Datasets | Number | Protein coverage | Avg size |
|---|---|---|---|
| standard protein complexes 1 | 812 | 2,773 | 8.92 |
| standard protein complexes 2 | 1,045 | 2,778 | 8.97 |
FIGURE 1The ensemble framework of proposed protein complex detection.
FIGURE 2A procession of training a regression model.
The topological features are used for representing protein complexes.
| Num | Feature name | Num | Feature name |
|---|---|---|---|
| 1 | Graph entropy | 2 | Graph weight entropy |
| 3 | Node size | 4 | Edge size |
| 5 | Graph clustering coefficient | 6 | Maximum degree |
| 7 | Minimum degree | 8 | Mean degree |
| 9 | Median degree | 10 | Variance degree |
| 11 | standard deviation degree | 12 | Maximum weight degree |
| 13 | Minimum weight degree | 14 | Average weight degree |
| 15 | Median weight degree | 16 | standard weight degree |
| 17 | Graph density | 18 | Graph weight density |
| 19 | Edge mean weight | 20 | Edge median weight |
| 21 | Edge variance weight | 22 | Edge standard weight |
| 23 | Average shortest path length | 24 | Graph diameter |
| 25 | Maximum Clustering Coefficient | 26 | Minimum Clustering Coefficient |
| 27 | Mean Clustering Coefficient | 28 | Median Clustering Coefficient |
| 29 | Variance Clustering Coefficient | 30 | Graph conductance |
| 31 | Graph weight conductance | 32 | Modularity score |
| 33 | Weight modularity score | 34 | Average boundary edge weight |
| 35 | Average edge modularity | 36 | Average common neighbor |
| 37 | Standard common neighbor | 38 | Variance common neighbor |
| 39 | Minimum common neighbor | 40 | Median common neighbor |
| 41 | Maximum common neighbor | 42 | Mean topological features |
| 43 | Median topological feature | 44 | Variance topological feature |
| 45 | Maximum topological feature | 46 | Minimum topological feature |
| 47 | Standard topological feature | 48 | Mean Degree correlation |
| 49 | Minimum Degree correlation | 50 | Variance Degree correlation |
| 51 | Maximum Degree correlation | 52 | Median Degree correlation |
| 53 | Community model | 54 | Weight community model |
| 55 | Topological Change 1 | 56 | Topological Change 2 |
| 57 | Topological Change 3 | 58 | Topological Change 4 |
| 59 | Topological Change 5 | 60 | Topological Change 6 |
| 61 | Topological Change 7 | 62 | Topological Change 8 |
| 63 | First Eigenvalues 1 | 64 | First Eigenvalues 2 |
| 65 | First Eigenvalues 3 |
FIGURE 3Value of parameters ratio for ELF-DPC based on standard protein complexes 1.
FIGURE 4Value of parameters ratio for ELF-DPC based on standard protein complexes 2.
Parameters of each method used in the study.
| ID | Year | Algorithms | Parameter |
|---|---|---|---|
| 1 | 2003 | MCL | inflation = 2 (default setting) |
| 2 | 2006 | DPClus |
|
| 3 | 2009 | CMC | min _ |
| 4 | 2012 | ClusterONE | Density = auto, Overlap threshold = 0.8(author suggestions) |
| 5 | 2013 | PEWCC | Overlap = 0.8,-r = 0.1, Re-join = 0.3(author suggestions) |
| 6 | 2015 | WPNCA | lambda = 0.3, size = 3 (author suggestions) |
| 7 | 2016 | CPredictor2.0 |
|
| 8 | 2016 | Zhang |
|
| 9 | 2017 | ClusterEPs | NEPs of Complexes (minimum support threshold = 0.4, maximum support threshold = 0.05); NEPs of non-complexes (maximum support threshold = 0.05, minimum support threshold = 0.4); maximum overlap = 0.9, Maximum size of clusters = 100 (author suggestions) |
| 10 | 2018 | ClusterSS | numEpochs = 500, learnRate = 0.2, thresholdIn = 1.0, thresholdOut = 1.02, negativeTime = 20, minimum cluster size = 3 (author suggestions) |
| 11 | 2019 | ICJointLE | -L = 1,-r = 999,-d = 0.3,-c = 0.7,-f = 0.75,-p = 0.3,-m = 0.08, -u = 0.01,-e = 0.9, size = 3 (author suggestions) |
| 12 | 2021 | PC2P | minimum cluster size = 3 |
| 13 | 2022 | ELF-DPC |
|
Experimental results by the different methods using standard protein complexes 1.
| Name | Num | F-measure | CR | ACC | MMR | Jaccard | Total score |
|---|---|---|---|---|---|---|---|
|
| |||||||
| MCL | 220 | 0.535 8 | 0.489 1 |
| 0.149 4 | 0.361 0 | 1.901 0 |
| DPClus | 285 | 0.597 2 | 0.438 2 | 0.346 6 | 0.173 6 | 0.402 5 | 1.958 1 |
| CMC | 294 | 0.584 4 | 0.450 1 | 0.348 7 | 0.222 9 | 0.417 9 | 2.023 9 |
| ClusterONE | 258 | 0.597 6 | 0.451 4 | 0.345 8 | 0.192 1 | 0.397 4 | 1.984 4 |
| PEWCC |
| 0.657 6 | 0.431 6 | 0.314 6 |
| 0.396 9 | 2.154 6 |
| WPNCA | 484 | 0.642 8 |
| 0.311 4 | 0.255 7 | 0.355 4 | 2.060 2 |
| CPredictor2.0 | 266 | 0.628 6 | 0.375 0 | 0.306 2 | 0.214 4 | 0.412 4 | 1.936 5 |
| Zhang | 438 | 0.647 5 | 0.397 6 | 0.315 6 | 0.318 2 | 0.408 4 | 2.087 2 |
| ClusterEPs | 271 | 0.601 4 | 0.365 6 | 0.284 1 | 0.216 6 | 0.409 0 | 1.876 6 |
| ClusterSS | 482 | 0.560 0 | 0.394 1 | 0.321 8 | 0.253 5 | 0.368 5 | 1.897 9 |
| ICJointLE | 243 | 0.632 9 | 0.355 7 | 0.298 9 | 0.261 9 | 0.402 1 | 1.951 5 |
| PC2P | 219 | 0.576 9 | 0.443 9 | 0.355 1 | 0.182 5 | 0.392 2 | 1.950 5 |
| ELF-DPC | 286 |
| 0.479 2 | 0.339 1 | 0.251 6 |
|
|
|
| |||||||
| MCL | 370 | 0.400 4 | 0.389 5 |
| 0.136 1 | 0.290 2 | 1.535 4 |
| DPClus | 497 | 0.413 8 | 0.367 2 | 0.307 1 | 0.174 5 | 0.323 5 | 1.586 1 |
| CMC | 264 | 0.481 9 | 0.365 6 | 0.297 8 | 0.158 4 | 0.368 8 | 1.672 4 |
| ClusterONE | 240 | 0.469 4 | 0.308 5 | 0.282 9 | 0.152 3 | 0.332 4 | 1.545 4 |
| PEWCC | 383 | 0.528 9 | 0.323 1 | 0.230 9 | 0.147 1 | 0.378 6 | 1.608 5 |
| WPNCA | 369 | 0.544 6 | 0.389 7 | 0.275 8 | 0.191 2 | 0.341 5 | 1.742 8 |
| CPredictor2.0 | 236 | 0.589 5 | 0.303 7 | 0.272 5 | 0.195 4 | 0.368 8 | 1.729 8 |
| Zhang | 326 | 0.556 3 | 0.288 4 | 0.254 9 | 0.218 2 | 0.340 8 | 1.658 5 |
| ClusterEPs | 410 | 0.583 6 | 0.335 2 | 0.262 1 | 0.220 9 | 0.344 8 | 1.746 7 |
| ClusterSS |
| 0.437 7 | 0.375 8 | 0.307 2 | 0.240 2 | 0.335 7 | 1.696 6 |
| ICJointLE | 216 | 0.538 9 | 0.220 6 | 0.228 4 | 0.193 6 | 0.304 2 | 1.485 7 |
| PC2P | 249 | 0.435 6 | 0.345 8 | 0.297 0 | 0.133 7 | 0.319 0 | 1.531 0 |
| ELF-DPC | 304 |
|
| 0.298 4 |
|
|
|
|
| |||||||
| MCL | 628 | 0.310 6 | 0.357 8 | 0.268 4 | 0.093 2 | 0.215 5 | 1.245 5 |
| DPClus | 909 | 0.308 5 | 0.379 2 | 0.272 0 | 0.123 7 | 0.264 5 | 1.348 0 |
| CMC | 1,192 | 0.361 1 | 0.355 2 | 0.248 8 | 0.197 3 | 0.296 0 | 1.458 4 |
| ClusterONE | 904 | 0.511 8 |
|
| 0.175 2 | 0.329 7 | 1.849 9 |
| PEWCC | 648 | 0.600 4 | 0.378 3 | 0.226 2 | 0.157 3 |
| 1.713 6 |
| WPNCA | 623 | 0.588 8 | 0.430 7 | 0.259 4 | 0.207 0 | 0.336 0 | 1.821 9 |
| CPredictor2.0 | 293 | 0.500 8 | 0.230 2 | 0.228 7 | 0.111 0 | 0.282 5 | 1.353 3 |
| Zhang | 502 | 0.562 2 | 0.325 7 | 0.242 6 | 0.181 1 | 0.322 3 | 1.633 9 |
| ClusterEPs | 804 | 0.573 0 | 0.295 4 | 0.214 7 | 0.215 4 | 0.308 7 | 1.607 3 |
| ClusterSS |
| 0.323 0 | 0.333 5 | 0.257 7 |
| 0.257 3 | 1.404 7 |
| ICJointLE | 286 | 0.573 3 | 0.232 9 | 0.204 6 | 0.150 7 | 0.303 9 | 1.465 5 |
| PC2P | 441 | 0.341 9 | 0.340 1 | 0.254 2 | 0.085 4 | 0.232 4 | 1.254 0 |
| ELF-DPC | 564 |
| 0.492 2 | 0.276 8 | 0.227 3 | 0.345 4 |
|
|
| |||||||
| MCL | 594 | 0.068 1 | 0.168 6 | 0.157 7 | 0.021 4 | 0.106 4 | 0.522 1 |
| DPClus | 207 | 0.378 4 | 0.203 1 | 0.213 3 | 0.082 0 | 0.226 4 | 1.103 1 |
| CMC | 408 | 0.334 4 | 0.233 4 | 0.212 6 | 0.099 7 | 0.225 8 | 1.105 9 |
| ClusterONE | 690 | 0.292 5 | 0.271 9 |
| 0.098 9 | 0.204 4 | 1.116 7 |
| PEWCC | 382 | 0.280 2 | 0.190 0 | 0.138 9 | 0.056 6 | 0.167 9 | 0.833 5 |
| WPNCA | 527 | 0.330 1 | 0.260 3 | 0.182 4 | 0.101 7 | 0.179 8 | 1.054 3 |
| CPredictor2.0 | 265 | 0.434 4 | 0.221 2 | 0.228 8 | 0.114 0 | 0.254 5 | 1.252 9 |
| Zhang | 406 | 0.370 2 | 0.205 1 | 0.202 5 | 0.107 7 | 0.217 6 | 1.103 1 |
| ClusterEPs | 645 | 0.461 0 | 0.242 6 | 0.194 3 | 0.158 0 | 0.254 3 | 1.310 2 |
| ClusterSS |
| 0.230 9 | 0.240 0 | 0.232 0 | 0.124 2 | 0.194 2 | 1.021 3 |
| ICJointLE | 121 | 0.364 9 | 0.134 3 | 0.172 3 | 0.084 5 | 0.206 6 | 0.962 6 |
| PC2P | 374 | 0.234 7 | 0.237 1 | 0.213 7 | 0.065 2 | 0.166 2 | 0.917 0 |
| ELF-DPC | 483 |
|
| 0.223 7 |
|
|
|
The bold values are the highest value of each metric of each PPI network.
Experimental results by the different methods using standard protein complexes 2.
| Name | Num | F-measure | CR | ACC | MMR | Jaccard | Total score |
|---|---|---|---|---|---|---|---|
|
| |||||||
| MCL | 220 | 0.375 6 | 0.409 1 |
| 0.115 3 | 0.312 6 | 1.571 3 |
| DPClus | 285 | 0.385 4 | 0.348 3 | 0.329 3 | 0.140 5 | 0.314 7 | 1.518 2 |
| CMC | 294 | 0.380 3 | 0.357 5 | 0.330 1 | 0.145 9 | 0.325 7 | 1.539 5 |
| ClusterONE | 258 | 0.409 0 | 0.363 3 | 0.335 9 | 0.141 9 | 0.320 0 | 1.570 3 |
| PEWCC |
| 0.418 5 | 0.348 3 | 0.313 7 |
| 0.299 9 | 1.595 5 |
| WPNCA | 484 | 0.421 7 |
| 0.330 5 | 0.167 0 | 0.296 2 | 1.627 0 |
| CPredictor2.0 | 266 |
| 0.307 6 | 0.281 6 | 0.156 4 | 0.330 9 | 1.558 4 |
| Zhang | 438 | 0.436 5 | 0.320 9 | 0.294 2 | 0.205 7 | 0.318 6 | 1.575 8 |
| ClusterEPs | 271 | 0.433 1 | 0.290 6 | 0.271 5 | 0.167 0 | 0.317 3 | 1.479 5 |
| ClusterSS | 487 | 0.372 9 | 0.327 9 | 0.317 0 | 0.171 6 | 0.292 4 | 1.481 9 |
| ICJointLE | 243 | 0.486 1 | 0.292 0 | 0.283 4 | 0.191 2 | 0.325 7 | 1.578 5 |
| PC2P | 219 | 0.402 5 | 0.361 0 | 0.341 3 | 0.129 5 | 0.320 4 | 1.554 7 |
| ELF-DPC | 265 | 0.454 6 | 0.383 8 | 0.325 9 |
|
|
|
|
| |||||||
| MCL | 370 | 0.321 4 | 0.353 4 |
| 0.094 4 | 0.255 9 | 1.333 9 |
| DPClus |
| 0.357 7 | 0.333 5 | 0.289 9 | 0.120 0 | 0.289 3 | 1.390 4 |
| CMC | 264 | 0.399 9 | 0.319 2 | 0.273 2 | 0.110 1 | 0.314 9 | 1.417 3 |
| ClusterONE | 240 | 0.391 3 | 0.272 9 | 0.275 6 | 0.105 8 | 0.282 6 | 1.328 2 |
| PEWCC | 383 | 0.422 8 | 0.291 3 | 0.212 5 | 0.098 7 | 0.324 7 | 1.350 0 |
| WPNCA | 369 | 0.436 1 | 0.357 2 | 0.261 4 | 0.125 0 | 0.296 0 | 1.475 7 |
| CPredictor2.0 | 236 | 0.493 2 | 0.278 7 | 0.242 1 | 0.125 8 | 0.321 6 | 1.461 4 |
| Zhang | 326 | 0.463 7 | 0.263 4 | 0.237 3 | 0.145 6 | 0.295 7 | 1.405 7 |
| ClusterEPs | 410 | 0.465 8 | 0.302 1 | 0.239 0 | 0.144 4 | 0.297 5 | 1.448 8 |
| ClusterSS | 342 | 0.430 4 | 0.320 1 | 0.270 5 | 0.131 8 | 0.314 0 | 1.466 9 |
| ICJointLE | 216 | 0.451 6 | 0.208 3 | 0.214 7 | 0.123 0 | 0.272 6 | 1.270 2 |
| PC2P | 249 | 0.363 6 | 0.314 1 | 0.288 4 | 0.095 1 | 0.281 8 | 1.342 9 |
| ELF-DPC | 281 |
|
| 0.282 7 | 0.175 0 |
|
|
|
| |||||||
| MCL | 628 | 0.240 9 | 0.302 5 | 0.250 4 | 0.061 3 | 0.192 1 | 1.047 3 |
| DPClus | 909 | 0.278 4 | 0.342 4 | 0.249 3 | 0.089 8 | 0.244 5 | 1.204 4 |
| CMC | 1,192 | 0.313 0 | 0.321 3 | 0.219 3 | 0.132 9 | 0.266 4 | 1.253 0 |
| ClusterONE | 904 | 0.423 2 |
|
| 0.118 4 | 0.287 4 | 1.558 5 |
| PEWCC | 648 | 0.481 2 | 0.333 6 | 0.218 2 | 0.095 0 | 0.298 6 | 1.426 6 |
| WPNCA | 623 | 0.460 3 | 0.370 9 | 0.247 2 | 0.122 6 | 0.286 6 | 1.487 6 |
| CPredictor2.0 | 293 | 0.465 3 | 0.226 5 | 0.207 7 | 0.073 6 | 0.263 5 | 1.236 7 |
| Zhang | 502 | 0.492 9 | 0.292 8 | 0.221 5 | 0.122 3 | 0.281 8 | 1.411 3 |
| ClusterEPs | 804 | 0.461 1 | 0.264 6 | 0.192 9 | 0.132 3 | 0.265 2 | 1.316 2 |
| ClusterSS |
| 0.367 6 | 0.316 8 | 0.236 0 | 0.158 8 | 0.234 0 | 1.313 2 |
| ICJointLE | 286 | 0.473 4 | 0.216 8 | 0.202 7 | 0.096 1 | 0.266 8 | 1.255 8 |
| PC2P | 441 | 0.266 2 | 0.296 7 | 0.233 7 | 0.058 8 | 0.208 3 | 1.063 6 |
| ELF-DPC | 545 |
| 0.399 8 | 0.260 7 |
|
|
|
|
| |||||||
| MCL | 594 | 0.055 1 | 0.164 0 | 0.147 5 | 0.012 5 | 0.103 1 | 0.482 2 |
| DPClus | 207 | 0.330 7 | 0.193 4 | 0.194 8 | 0.054 7 | 0.204 9 | 0.978 5 |
| CMC | 408 | 0.298 1 | 0.212 5 | 0.187 3 | 0.064 2 | 0.199 9 | 0.962 0 |
| ClusterONE | 690 | 0.247 3 | 0.238 4 |
| 0.063 0 | 0.180 1 | 0.943 5 |
| PEWCC | 382 | 0.230 9 | 0.170 0 | 0.116 6 | 0.029 6 | 0.130 1 | 0.677 3 |
| WPNCA | 527 | 0.264 0 | 0.238 3 | 0.154 9 | 0.062 1 | 0.152 2 | 0.871 6 |
| CPredictor2.0 | 265 | 0.384 3 | 0.208 6 | 0.196 6 | 0.067 2 | 0.226 4 | 1.083 1 |
| Zhang | 406 | 0.341 3 | 0.194 4 | 0.185 7 | 0.071 0 | 0.200 2 | 0.992 5 |
| ClusterEPs | 645 | 0.358 2 | 0.211 5 | 0.172 0 | 0.088 4 | 0.212 0 | 1.042 1 |
| ClusterSS |
| 0.253 9 | 0.256 6 | 0.207 4 | 0.089 4 | 0.186 7 | 0.994 0 |
| ICJointLE | 121 | 0.295 9 | 0.122 4 | 0.159 3 | 0.053 8 | 0.178 7 | 0.810 1 |
| PC2P | 374 | 0.207 8 | 0.213 6 | 0.194 1 | 0.043 2 | 0.152 4 | 0.811 2 |
| ELF-DPC | 469 |
|
| 0.193 7 |
|
|
|
The bold values are the highest value of each metric of each PPI network.
Results of function enrichment test with different thresholds of p-value on Gavin and Krogan core.
| Algorithms | Num | As |
|
|
|
| Significant |
|---|---|---|---|---|---|---|---|
|
| |||||||
| MCL | 220 | 7.56 | 39(17.73%) | 48(21.82%) | 83(37.73%) | 183(83.18%) | 194(88.18%) |
| DPClus | 285 | 6.09 | 30(10.53%) | 49(17.2%) | 88(30.88%) | 182(63.86%) | 208(72.98%) |
| CMC | 294 | 5.83 | 43(14.63%) | 57(19.39%) | 82(27.89%) | 171(58.16%) | 206(70.06%) |
| ClusterONE | 258 | 7.24 | 39(15.12%) | 53(20.55%) | 101(39.15%) | 187(72.48%) | 205(79.46%) |
| PEWCC |
| 8.14 | 61(9.19%) | 117(17.62%) | 238(35.84%) | 480(72.29%) | 546(82.23%) |
| CPredictor2.0 | 266 | 6.04 | 29(10.9%) | 51(19.17%) | 122(45.86%) | 231(86.84%) | 244(91.73%) |
| WPNCA | 484 | 16.62 |
|
|
| 423(87.4%) | 449(92.77%) |
| Zhang | 438 | 6.30 | 44(10.05%) | 83(18.95%) | 164(37.44%) | 318(72.6%) | 354(80.82%) |
| ClusterEPs | 271 | 6.25 | 53(19.56%) | 86(31.74%) | 143(52.77%) |
|
|
| ClusterSS | 482 | 5.62 | 63(13.07%) | 95(19.71%) | 167(34.65%) | 336(69.71%) | 368(76.35%) |
| 487 | 5.36 | 50(10.27%) | 83(17.05%) | 147(30.19%) | 324(66.53%) | 368(75.56%) | |
| ICJointLE | 243 | 5.73 | 25(10.29%) | 27(11.11%) | 83(34.16%) | 196(80.66%) | 207(85.19%) |
| PC2P | 219 | 6.91 | 17(7.76%) | 11(5.02%) | 40(18.26%) | 106(48.4%) | 119(54.34%) |
| ELF-DPC | 286 | 8.81 | 59(20.63%) | 104(36.36%) | 154(53.84%) | 244(85.31%) | 262(91.6%) |
| 265 | 8.66 | 65(24.53%) | 89(33.59%) | 140(52.84%) | 231(87.18%) | 244(92.09%) | |
|
| |||||||
| MCL | 370 | 5.91 | 82(22.16%) | 119(32.16%) | 173(46.75%) | 275(74.32%) | 293(79.18%) |
| DPClus | 497 | 4.23 | 20(4.02%) | 43(8.65%) | 75(15.09%) | 253(50.9%) | 303(60.96%) |
| CMC | 264 | 5.05 | 20(7.58%) | 29(10.99%) | 44(16.67%) | 60(22.73%) | 63(23.87%) |
| ClusterONE | 240 | 5.27 | 44(18.33%) | 75(31.25%) | 121(50.42%) | 202(84.17%) | 216(90.0%) |
| PEWCC | 383 | 10.16 |
|
|
|
|
|
| CPredictor2.0 | 236 | 5.19 | 24(10.17%) | 46(19.49%) | 93(39.41%) | 213(90.26%) | 219(92.8%) |
| WPNCA | 369 | 12.59 | 43(11.65%) | 81(21.95%) | 172(46.61%) | 321(86.99%) | 339(91.87%) |
| Zhang | 326 | 5.41 | 37(11.35%) | 65(19.94%) | 118(36.2%) | 259(79.45%) | 279(85.58%) |
| ClusterEPs | 410 | 6.18 | 59(14.39%) | 95(23.17%) | 168(40.97%) | 341(83.17%) | 365(89.02%) |
| ClusterSS |
| 4.86 | 47(6.51%) | 95(13.16%) | 160(22.16%) | 371(51.38%) | 454(62.88%) |
| 342 | 7.01 | 48(14.04%) | 88(25.74%) | 155(45.33%) | 280(81.88%) | 304(88.9%) | |
| ICJointLE | 216 | 4.41 | 16(7.41%) | 21(9.72%) | 68(31.48%) | 184(85.18%) | 192(88.88%) |
| PC2P | 249 | 5.81 | 16(6.43%) | 23(9.24%) | 46(18.48%) | 136(54.62%) | 159(63.86%) |
| ELF-DPC | 304 | 9.55 | 80(26.32%) | 115(37.83%) | 163(53.62%) | 277(91.12%) | 292(96.05%) |
| 281 | 9.13 | 81(28.83%) | 111(39.51%) | 155(55.17%) | 262(93.25%) | 269(95.74%) | |
The bold values are the highest value of each metric of each PPI network.
Results of function enrichment test with different thresholds of p-value on DIP and MIPS.
| Algorithms | Num | As |
|
|
|
| Significant |
|---|---|---|---|---|---|---|---|
|
| |||||||
| MCL | 628 | 6.31 | 74(11.78%) | 125(19.9%) | 209(33.28%) | 414(65.92%) | 471(75.0%) |
| DPClus | 909 | 4.28 | 45(4.95%) | 64(7.04%) | 112(12.32%) | 364(40.04%) | 470(51.7%) |
| CMC | 1,192 | 3.81 | 90(7.55%) | 150(12.58%) | 304(25.5%) | 692(58.05%) | 829(69.54%) |
| ClusterONE | 904 | 6.40 | 54(5.97%) | 110(12.16%) | 259(28.64%) | 606(67.02%) | 705(77.97%) |
| PEWCC | 648 | 10.10 | 156(24.07%) |
|
| 584(90.12%) | 605(93.36%) |
| CPredictor2.0 | 293 | 4.54 | 18(6.14%) | 49(16.72%) | 124(42.32%) | 274(93.51%) |
|
| WPNCA | 623 | 12.41 | 81(13.0%) | 137(21.99%) | 228(36.6%) | 431(69.18%) | 481(77.21%) |
| Zhang | 502 | 5.18 | 44(8.76%) | 99(19.72%) | 200(39.84%) | 424(84.46%) | 448(89.24%) |
| ClusterEPs | 804 | 4.26 | 91(11.32%) | 145(18.04%) | 268(33.34%) | 625(77.74%) | 683(84.95%) |
| ClusterSS |
| 3.57 | 156(6.57%) | 253(10.65%) | 437(18.4%) | 1,047(44.08%) | 1,289(54.27%) |
| 2,179 | 5.74 | 110(5.05%) | 230(10.56%) | 501(23.0%) | 1,332(61.14%) | 1,574(72.25%) | |
| ICJointLE | 286 | 3.84 | 29(10.14%) | 27(9.44%) | 103(36.01%) | 248(86.71%) | 253(88.46%) |
| PC2P | 441 | 6.25 | 25(5.67%) | 14(3.17%) | 45(10.2%) | 185(41.95%) | 230(52.15%) |
| ELF-DPC | 564 | 14.43 | 140(24.82%) | 186(32.98%) | 289(51.24%) |
| 542(96.1%) |
| 545 | 12.77 |
| 203(37.25%) | 307(56.33%) | 493(90.46%) | 517(94.86%) | |
|
| |||||||
| MCL | 594 | 6.16 | 17(2.86%) | 29(4.88%) | 80(13.47%) | 165(27.78%) | 230(38.72%) |
| DPClus | 207 | 4.94 | 17(8.21%) | 27(13.04%) | 85(41.06%) | 169(81.64%) | 184(88.89%) |
| CMC | 408 | 4.87 | 30(7.35%) | 49(12.01%) | 101(24.76%) | 234(57.36%) | 278(68.14%) |
| ClusterONE | 690 | 6.03 | 22(3.19%) | 47(6.81%) | 137(19.85%) | 327(47.39%) | 483(70.0%) |
| PEWCC | 382 | 24.70 | 67(17.54%) | 94(24.61%) | 172(45.03%) | 308(80.63%) | 325(85.08%) |
| CPredictor2.0 | 265 | 4.60 | 19(7.17%) | 40(15.09%) | 118(44.52%) |
| 258(97.35%) |
| WPNCA | 527 | 18.27 | 60(11.39%) | 103(19.55%) | 234(44.41%) | 436(82.74%) | 471(89.38%) |
| Zhang | 406 | 5.14 | 16(3.94%) | 37(9.11%) | 111(27.34%) | 319(78.57%) | 355(87.44%) |
| ClusterEPs | 645 | 4.78 | 22(3.41%) | 45(6.98%) | 150(23.26%) | 443(68.69%) | 500(77.53%) |
| ClusterSS | 1,266 | 4.22 | 33(2.61%) | 70(5.53%) | 176(13.9%) | 607(47.94%) | 752(59.39%) |
|
| 5.81 | 25(1.58%) | 67(4.24%) | 237(14.99%) | 845(53.45%) | 1,069(67.62%) | |
| ICJointLE | 121 | 3.70 | 14(11.57%) | 16(13.22%) | 42(34.71%) | 102(84.3%) | 103(85.13%) |
| PC2P | 374 | 6.29 | 7(1.87%) | 4(1.07%) | 41(10.96%) | 171(45.72%) | 202(54.01%) |
| ELF-DPC | 483 | 9.33 |
|
| 246(50.93%) | 441(91.3%) | 463(95.85%) |
| 469 | 8.86 | 105(22.39%) | 155(33.05%) |
| 437(93.18%) |
| |
The bold values are the highest value of each metric of each PPI network.
FIGURE 5An example protein complex identified by different methods on the Krogan core PPI network. For example, (b) ELF-DPC-1.0–10, which means that the neighborhood affinity (Eq. 15) of ELF-DPC is 1.0, and it contains 10 proteins. Here, the red nodes are proteins that are correctly identified by this method, the yellow nodes are proteins that are missed by this method, and the blue nodes are the proteins that are incorrectly identified by this method.
The identified protein complexes with small p-values.
| Num |
| GOID | Gene ontology term |
|---|---|---|---|
|
| |||
| 1 | 9.72 641e-59 | GO:0000 502 | proteasome complex |
| 2 | 4.53 112e-61 | GO:0005 762 | mitochondrial large ribosomal subunit |
| 3 | 9.18 655e-68 | GO:0030 686 | 90S preribosome |
| 4 | 2.61 255e-65 | GO:0030 532 | small nuclear ribonucleoprotein complex |
|
| |||
| 1 | 2.50 943e-71 | GO:0000 375 | RNA splicing, |
| 2 | 1.21 735e-66 | GO:0005 681 | spliceosomal complex |
| 3 | 7.46 423e-67 | GO:0000 377 | RNA splicing, |
| 4 | 5.5 331e-62 | GO:0003 899 | DNA-directed 5′-3′ RNA polymerase activity |
|
| |||
| 1 | 2.14 679e-64 | GO:0042 254 | ribosome biogenesis |
| 2 | 5.5 228e-53 | GO:0042 274 | ribosomal small subunit biogenesis |
| 3 | 5.18 295e-62 | GO:0016 592 | mediator complex |
| 4 | 6.85 479e-66 | GO:0097 525 | spliceosomal snRNP complex |
|
| |||
| 1 | 1.22 375e-47 | GO:0050 657 | nucleic acid transport |
| 2 | 1.27 336e-44 | GO:0030 687 | preribosome, large subunit precursor |
| 3 | 1.58 322e-42 | GO:0022 624 | proteasome accessory complex |
| 4 | 9.71 714e-32 | GO:0000 124 | SAGA complex |
The framework of ELF-DPC algorithm.
|
|