| Literature DB >> 27117946 |
Yijia Zhang1, Hongfei Lin2, Zhihao Yang2, Jian Wang2.
Abstract
BACKGROUND: Recently, high-throughput experimental techniques have generated a large amount of protein-protein interaction (PPI) data which can construct large complex PPI networks for numerous organisms. System biology attempts to understand cellular organization and function by analyzing these PPI networks. However, most studies still focus on static PPI networks which neglect the dynamic information of PPI.Entities:
Keywords: Dynamic networks; Gene expression data; Protein complex identification; Protein-protein interaction networks
Mesh:
Substances:
Year: 2016 PMID: 27117946 PMCID: PMC4847341 DOI: 10.1186/s12859-016-1054-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Illustration of DPPN construction. a a static PPI network based on high-throughput PPI data. b the gene expression value of protein v1. c active time attributes and active probability of protein vertices calculated based on gene expression data. d a DPPN constructed based on a and c
Fig. 2Examples of active correlated cliques
The statistics of high-throughput PPI datasets in experiments
| PPI datasets | Proteins | Interactions |
|---|---|---|
| Krogan dataset | 2675 | 7080 |
| DIP dataset | 4928 | 17208 |
| MIPS dataset | 3950 | 11119 |
Fig. 3The distribution of the number of active proteins. a, b and c are the distribution of the number of active proteins in DPPN I, DPPN II and DPPN III, respectively
The effect of “Core_thresh” on DPPN I
|
| P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|
| 0 | 0.357 |
| 0.44 |
| 0.75 | 0.544 |
| 0.02 | 0.357 |
| 0.44 |
|
|
|
| 0.04 | 0.364 | 0.564 | 0.443 | 0.393 | 0.748 | 0.542 |
| 0.06 | 0.38 | 0.547 | 0.448 | 0.388 | 0.746 | 0.538 |
| 0.07 | 0.41 | 0.517 | 0.458 | 0.378 | 0.741 | 0.529 |
| 0.08 | 0.424 | 0.5 | 0.459 | 0.374 | 0.735 | 0.524 |
| 0.09 | 0.468 | 0.475 |
| 0.364 | 0.729 | 0.515 |
| 0.1 | 0.562 | 0.338 | 0.422 | 0.307 | 0.713 | 0.468 |
| 0.2 | 0.621 | 0.301 | 0.406 | 0.306 | 0.706 | 0.464 |
| 0.5 |
| 0.297 | 0.42 | 0.313 | 0.702 | 0.469 |
| 1.0 |
| 0.297 | 0.42 | 0.313 | 0.702 | 0.469 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
The effect of “Extend_thresh” on DPPN I
|
| P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|
| 0 | 0.428 | 0.439 | 0.433 |
| 0.629 | 0.515 |
| 0.02 | 0.463 |
| 0.469 | 0.39 | 0.702 |
|
| 0.04 |
| 0.468 | 0.468 | 0.372 | 0.724 | 0.519 |
| 0.05 |
|
|
| 0.364 | 0.729 | 0.515 |
| 0.06 | 0.465 |
| 0.47 | 0.36 | 0.734 | 0.514 |
| 0.08 | 0.46 | 0.471 | 0.465 | 0.351 | 0.739 | 0.509 |
| 0.1 | 0.458 | 0.468 | 0.463 | 0.346 | 0.741 | 0.507 |
| 0.2 | 0.455 | 0.468 | 0.461 | 0.336 |
| 0.5 |
| 0.5 | 0.45 | 0.468 | 0.459 | 0.334 | 0.743 | 0.498 |
| 1.0 | 0.45 | 0.468 | 0.459 | 0.334 | 0.743 | 0.498 |
| 0 | 0.428 | 0.439 | 0.433 |
| 0.629 | 0.515 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with other methods on Krogan dataset using CYC2008 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.468 |
|
| 0.364 |
| 0.515 |
| ClusterONE | 0.375 | 0.431 | 0.401 | 0.523 | 0.655 |
|
| COAN | 0.709 | 0.331 | 0.451 | 0.388 | 0.646 | 0.501 |
| COACH | 0.617 | 0.343 | 0.441 | 0.432 | 0.544 | 0.485 |
| CMC | 0.748 | 0.235 | 0.358 | 0.381 | 0.589 | 0.474 |
| HUNTER |
| 0.199 | 0.323 | 0.374 | 0.569 | 0.462 |
| MCL | 0.291 | 0.245 | 0.266 |
| 0.396 | 0.475 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with other methods on DIP dataset using CYC2008 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.483 |
|
| 0.373 |
|
|
| ClusterONE | 0.428 | 0.331 | 0.373 | 0.364 | 0.665 | 0.493 |
| COAN | 0.486 | 0.438 | 0.461 | 0.435 | 0.555 | 0.491 |
| COACH | 0.364 | 0.468 | 0.41 | 0.544 | 0.38 | 0.455 |
| CMC | 0.595 | 0.287 | 0.387 | 0.399 | 0.566 | 0.475 |
| HUNTER |
| 0.199 | 0.308 | 0.496 | 0.467 | 0.482 |
| MCL | 0.21 | 0.232 | 0.221 |
| 0.331 | 0.429 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with other methods on MIPS dataset using CYC2008 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.467 |
|
| 0.245 | 0.662 |
|
| ClusterONE | 0.359 | 0.23 | 0.281 | 0.243 |
|
|
| COAN | 0.453 | 0.282 | 0.348 | 0.271 | 0.55 | 0.386 |
| COACH | 0.301 | 0.289 | 0.295 | 0.336 | 0.311 | 0.323 |
| CMC | 0.429 | 0.211 | 0.283 | 0.389 | 0.318 | 0.352 |
| HUNTER |
| 0.11 | 0.189 | 0.296 | 0.286 | 0.291 |
| MCL | 0.164 | 0.154 | 0.159 |
| 0.212 | 0.307 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with other methods on Krogan dataset using MIPS2006 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.22 |
| 0.285 | 0.293 |
| 0.461 |
| ClusterONE | 0.317 | 0.327 | 0.322 | 0.328 | 0.667 | 0.467 |
| COAN | 0.46 | 0.35 |
| 0.352 | 0.696 |
|
| COACH | 0.357 | 0.341 | 0.349 | 0.357 | 0.673 | 0.49 |
| CMC | 0.309 | 0.304 | 0.306 | 0.401 | 0.569 | 0.478 |
| HUNTER |
| 0.207 | 0.288 | 0.317 | 0.602 | 0.437 |
| MCL | 0.149 | 0.23 | 0.181 |
| 0.444 | 0.464 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with other methods on DIP dataset using MIPS2006 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.292 | 0.535 | 0.378 | 0.325 |
| 0.483 |
| ClusterONE | 0.246 | 0.392 | 0.302 | 0.321 | 0.623 | 0.447 |
| COAN | 0.326 | 0.548 |
| 0.397 | 0.642 |
|
| COACH | 0.289 | 0.488 | 0.363 | 0.452 | 0.506 | 0.478 |
| CMC | 0.172 |
| 0.265 | 0.367 | 0.656 | 0.49 |
| HUNTER |
| 0.097 | 0.168 | 0.147 | 0.555 | 0.286 |
| MCL | 0.121 | 0.217 | 0.155 |
| 0.382 | 0.451 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with other methods on MIPS dataset using MIPS2006 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.336 |
|
| 0.24 | 0.683 | 0.405 |
| ClusterONE | 0.281 | 0.327 | 0.302 | 0.262 |
|
|
| COAN | 0.343 | 0.366 | 0.358 | 0.303 | 0.515 | 0.395 |
| COACH | 0.286 | 0.373 | 0.286 | 0.333 | 0.359 | 0.346 |
| CMC | 0.299 | 0.318 | 0.308 | 0.381 | 0.473 | 0.424 |
| HUNTER |
| 0.138 | 0.213 | 0.298 | 0.341 | 0.319 |
| MCL | 0.108 | 0.194 | 0.139 |
| 0.266 | 0.347 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with DyCluster method on Krogan dataset and gene expression data using CYC2008 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.468 |
|
| 0.364 |
| 0.515 |
| DyCluster + ClusterONE | 0.307 | 0.348 | 0.326 |
| 0.682 |
|
| DyCluster + COAN | 0.565 | 0.23 | 0.327 | 0.27 | 0.677 | 0.428 |
| DyCluster + COACH | 0.48 | 0.243 | 0.322 | 0.321 | 0.617 | 0.445 |
| DyCluster + CMC | 0.531 | 0.201 | 0.292 | 0.258 | 0.691 | 0.423 |
| DyCluster + HUNTER |
| 0.169 | 0.261 | 0.268 | 0.493 | 0.364 |
| DyCluster + MCL | 0.29 | 0.173 | 0.214 | 0.371 | 0.376 | 0.373 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with DyCluster method on DIP dataset and gene expression data using CYC2008 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method |
|
|
| 0.373 |
|
|
| DyCluster + ClusterONE | 0.153 | 0.373 | 0.217 | 0.399 | 0.63 | 0.501 |
| DyCluster + COAN | 0.349 | 0.311 | 0.329 | 0.339 | 0.596 | 0.449 |
| DyCluster + COACH | 0.319 | 0.375 | 0.344 | 0.409 | 0.54 | 0.47 |
| DyCluster + CMC | 0.316 | 0.294 | 0.305 | 0.328 | 0.565 | 0.43 |
| DyCluster + HUNTER | 0.472 | 0.147 | 0.224 | 0.226 | 0.618 | 0.374 |
| DyCluster + MCL | 0.243 | 0.228 | 0.237 |
| 0.343 | 0.413 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with DyCluster method on MIPS dataset and gene expression data using CYC2008 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method |
|
|
| 0.245 |
| 0.403 |
| DyCluster + ClusterONE | 0.157 | 0.27 | 0.198 |
| 0.597 |
|
| DyCluster + COAN | 0.39 | 0.216 | 0.278 | 0.223 | 0.601 | 0.366 |
| DyCluster + COACH | 0.304 | 0.216 | 0.252 | 0.24 | 0.522 | 0.354 |
| DyCluster + CMC | 0.363 | 0.174 | 0.235 | 0.199 | 0.572 | 0.337 |
| DyCluster + HUNTER | 0.421 | 0.123 | 0.19 | 0.195 | 0.527 | 0.321 |
| DyCluster + MCL | 0.156 | 0.11 | 0.129 | 0.232 | 0.275 | 0.253 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with DyCluster method on Krogan dataset and gene expression data using MIPS2006 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.22 |
|
| 0.293 | 0.726 | 0.461 |
| DyCluster + ClusterONE | 0.149 | 0.341 | 0.208 | 0.332 |
|
|
| DyCluster + COAN |
| 0.244 | 0.271 | 0.249 | 0.699 | 0.417 |
| DyCluster + COACH | 0.267 | 0.272 | 0.27 | 0.285 | 0.663 | 0.435 |
| DyCluster + CMC | 0.269 | 0.212 | 0.237 | 0.221 | 0.706 | 0.395 |
| DyCluster + HUNTER | 0.294 | 0.161 | 0.208 | 0.218 | 0.501 | 0.331 |
| DyCluster + MCL | 0.149 | 0.23 | 0.181 |
| 0.43 | 0.448 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with Dycluster method on DIP dataset and gene expression data using MIPS2006 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method | 0.292 |
|
| 0.325 |
| 0.483 |
| DyCluster + ClusterONE | 0.094 | 0.424 | 0.154 | 0.358 | 0.683 |
|
| DyCluster + COAN | 0.245 | 0.406 | 0.306 | 0.317 | 0.669 | 0.461 |
| DyCluster + COACH | 0.206 | 0.461 | 0.284 | 0.373 | 0.624 | 0.483 |
| DyCluster + CMC | 0.214 | 0.369 | 0.271 | 0.298 | 0.631 | 0.434 |
| DyCluster + HUNTER |
| 0.184 | 0.235 | 0.207 | 0.664 | 0.371 |
| DyCluster + MCL | 0.147 | 0.207 | 0.172 |
| 0.412 | 0.428 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with DyCluster method on MIPS dataset and gene expression data using MIPS2006 as benchmark
| P | R | F | Sn | PPV | Acc | |
|---|---|---|---|---|---|---|
| Our method |
|
|
| 0.24 |
| 0.405 |
| DyCluster + ClusterONE | 0.118 | 0.369 | 0.178 |
| 0.659 |
|
| DyCluster + COAN | 0.264 | 0.276 | 0.27 | 0.234 | 0.611 | 0.378 |
| DyCluster + COACH | 0.196 | 0.267 | 0.226 | 0.247 | 0.586 | 0.38 |
| DyCluster + CMC | 0.235 | 0.23 | 0.233 | 0.213 | 0.602 | 0.358 |
| DyCluster + HUNTER | 0.29 | 0.171 | 0.215 | 0.197 | 0.554 | 0.33 |
| DyCluster + MCL | 0.102 | 0.143 | 0.119 | 0.229 | 0.304 | 0.264 |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison of our method on gene expression data shuffled randomly using CYC2008 as benchmark
| PPI data | Gene expression data | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|---|
| Krogan | GSE3431 | 0.468 | 0.475 | 0.471 | 0.364 | 0.729 | 0.515 |
| Shuffled randomly | 0.435 | 0.406 | 0.421 | 0.28 | 0.735 | 0.453 | |
| DIP | GSE3431 | 0.483 | 0.471 | 0.477 | 0.373 | 0.694 | 0.509 |
| Shuffled randomly | 0.438 | 0.417 | 0.427 | 0.301 | 0.672 | 0.449 | |
| MIPS | GSE3431 | 0.467 | 0.324 | 0.382 | 0.245 | 0.662 | 0.403 |
| Shuffled randomly | 0.427 | 0.294 | 0.348 | 0.201 | 0.671 | 0.367 |
F: F-score, P: precision, R: recall
Performance comparison of our method on gene expression data shuffled randomly using MIPS2006 as benchmark
| PPI data | Gene expression data | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|---|
| Krogan | GSE3431 | 0.22 | 0.424 | 0.285 | 0.293 | 0.726 | 0.461 |
| Shuffled randomly | 0.207 | 0.355 | 0.262 | 0.212 | 0.73 | 0.393 | |
| DIP | GSE3431 | 0.292 | 0.535 | 0.378 | 0.325 | 0.718 | 0.483 |
| Shuffled randomly | 0.253 | 0.461 | 0.326 | 0.253 | 0.693 | 0.419 | |
| MIPS | GSE3431 | 0.336 | 0.401 | 0.372 | 0.24 | 0.683 | 0.405 |
| Shuffled randomly | 0.273 | 0.355 | 0.309 | 0.197 | 0.689 | 0.369 |
F: F-score, P: precision, R: recall
Fig. 4Golgi Transport Complex identified on DPPN I