| Literature DB >> 27454775 |
Yijia Zhang1, Hongfei Lin2, Zhihao Yang2, Jian Wang2, Yiwei Liu2, Shengtian Sang2.
Abstract
BACKGROUND: Accurate determination of protein complexes has become a key task of system biology for revealing cellular organization and function. Up to now, the protein complex prediction methods are mostly focused on static protein protein interaction (PPI) networks. However, cellular systems are highly dynamic and responsive to cues from the environment. The shift from static PPI networks to dynamic PPI networks is essential to accurately predict protein complex.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27454775 PMCID: PMC4965712 DOI: 10.1186/s12859-016-1101-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1An illustration example of dynamic PPI networks construction. a construction of static PPI networks based on high-throughput PPI data. b calculation of dynamic information based on gene expression data. ATP, AP and PCC denote active time points, active probability and Pearson correlation coefficient, respectively. c construction of dynamic PPI networks
The statistics of PPI datasets in experiments
| High-throughput PPI data | Proteins | Interactions |
|---|---|---|
| Gavin dataset | 1430 | 6531 |
| Krogan dataset | 2675 | 7080 |
| MIPS dataset | 3950 | 11119 |
| STRING dataset | 5970 | 99786 |
The effect of Complex_thresh on protein complex prediction performance on DPN_Gavin
|
| #Complexes | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|---|
| 0 | 623 | 0.549 |
| 0.505 |
| 0.619 |
|
| 0.1 | 447 | 0.662 | 0.434 |
| 0.413 | 0.617 | 0.505 |
| 0.2 | 325 | 0.695 | 0.385 | 0.495 | 0.379 | 0.624 | 0.486 |
| 0.3 | 238 | 0.752 | 0.304 | 0.433 | 0.31 | 0.638 | 0.445 |
| 0.4 | 181 | 0.74 | 0.25 | 0.374 | 0.24 | 0.653 | 0.395 |
| 0.5 | 130 | 0.708 | 0.174 | 0.279 | 0.178 | 0.687 | 0.349 |
| 0.6 | 87 | 0.724 | 0.12 | 0.206 | 0.118 | 0.72 | 0.291 |
| 0.7 | 51 |
| 0.088 | 0.159 | 0.074 | 0.71 | 0.23 |
| 0.8 | 30 | 0.733 | 0.059 | 0.109 | 0.049 | 0.67 | 0.181 |
| 0.9 | 13 | 0.769 | 0.025 | 0.048 | 0.016 |
| 0.119 |
| 1 | 0 | - | - | - | - |
| - |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
The effect of Complex_thresh on protein complex prediction performance on DPN_Krogan
|
| #Complexes | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|---|
| 0 | 1246 | 0.388 |
| 0.497 |
| 0.673 |
|
| 0.1 | 816 | 0.464 | 0.591 |
| 0.448 | 0.677 | 0.551 |
| 0.2 | 546 | 0.526 | 0.512 | 0.519 | 0.401 | 0.685 | 0.524 |
| 0.3 | 353 | 0.598 | 0.363 | 0.451 | 0.316 | 0.685 | 0.465 |
| 0.4 | 223 | 0.619 | 0.255 | 0.361 | 0.217 | 0.705 | 0.391 |
| 0.5 | 144 | 0.632 | 0.181 | 0.282 | 0.149 | 0.721 | 0.328 |
| 0.6 | 97 | 0.608 | 0.115 | 0.194 | 0.101 | 0.777 | 0.279 |
| 0.7 | 52 |
| 0.071 | 0.129 | 0.061 | 0.78 | 0.219 |
| 0.8 | 37 | 0.595 | 0.059 | 0.107 | 0.04 | 0.781 | 0.177 |
| 0.9 | 15 | 0.533 | 0.025 | 0.047 | 0.015 |
| 0.112 |
| 1 | 0 | - | - | - | - |
| - |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
The effect of Complex_thresh on protein complex prediction performance on DPN_MIPS
|
| #Complexes | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|---|
| 0 | 1895 | 0.239 |
| 0.353 |
| 0.608 |
|
| 0.1 | 1145 | 0.274 | 0.576 |
| 0.382 | 0.61 | 0.483 |
| 0.2 | 611 | 0.327 | 0.404 | 0.361 | 0.313 | 0.634 | 0.446 |
| 0.3 | 321 | 0.364 | 0.26 | 0.303 | 0.224 | 0.644 | 0.38 |
| 0.4 | 192 | 0.396 | 0.174 | 0.242 | 0.141 | 0.633 | 0.299 |
| 0.5 | 101 | 0.426 | 0.11 | 0.175 | 0.089 | 0.642 | 0.239 |
| 0.6 | 57 |
| 0.061 | 0.108 | 0.046 | 0.726 | 0.182 |
| 0.7 | 23 | 0.348 | 0.02 | 0.037 | 0.016 | 0.73 | 0.109 |
| 0.8 | 13 | 0.231 | 0.005 | 0.01 | 0.003 |
| 0.05 |
| 0.9 | 11 | 0.182 | 0.005 | 0.01 | 0.003 |
| 0.06 |
| 1 | 0 | - | - | - | - |
| - |
F: F-score, P: precision, R: recall. The highest score of each row is shown in bold
Performance comparison with other protein complex prediction methods
| PPI data | Methods | #Complexes | P | R | F | Sn | PPV | Acc |
|---|---|---|---|---|---|---|---|---|
| Gavin data | Our method | 447 | 0.662 |
|
| 0.413 |
| 0.505 |
| CSO | 174 | 0.645 | 0.302 | 0.411 |
| 0.534 | 0.503 | |
| Cluster ONE | 243 | 0.502 | 0.324 | 0.393 | 0.46 | 0.597 |
| |
| COACH | 326 | 0.525 | 0.331 | 0.406 | 0.44 | 0.547 | 0.49 | |
| CMC | 120 | 0.608 | 0.218 | 0.321 | 0.371 | 0.606 | 0.474 | |
| HUNTER | 69 |
| 0.206 | 0.333 | 0.386 | 0.508 | 0.443 | |
| MCODE | 66 | 0.727 | 0.142 | 0.238 | 0.277 | 0.513 | 0.377 | |
| Krogan data | Our method | 816 | 0.464 |
|
|
| 0.677 |
|
| CSO | 190 | 0.726 | 0.331 | 0.455 | 0.411 | 0.642 | 0.514 | |
| Cluster ONE | 240 | 0.579 | 0.328 | 0.419 | 0.398 |
| 0.521 | |
| COACH | 345 | 0.617 | 0.343 | 0.441 | 0.432 | 0.544 | 0.485 | |
| CMC | 111 | 0.748 | 0.235 | 0.358 | 0.381 | 0.589 | 0.474 | |
| HUNTER | 74 |
| 0.199 | 0.323 | 0.374 | 0.569 | 0.462 | |
| MCODE | 76 | 0.724 | 0.157 | 0.258 | 0.255 | 0.583 | 0.385 | |
| MIPS data | Our method | 1145 | 0.274 |
|
| 0.382 | 0.61 |
|
| CSO | 192 | 0.495 | 0.289 | 0.365 | 0.286 | 0.568 | 0.403 | |
| Cluster ONE | 256 | 0.359 | 0.23 | 0.281 | 0.243 |
| 0.403 | |
| COACH | 448 | 0.301 | 0.289 | 0.295 | 0.336 | 0.311 | 0.323 | |
| CMC | 168 | 0.429 | 0.211 | 0.283 |
| 0.318 | 0.352 | |
| HUNTER | 52 |
| 0.11 | 0.189 | 0.296 | 0.286 | 0.291 | |
| MCODE | 85 | 0.447 | 0.115 | 0.183 | 0.19 | 0.503 | 0.309 | |
| STRING data | Our method | 1240 | 0.324 |
|
| 0.836 | 0.404 | 0.581 |
| Cluster ONE | 893 | 0.151 | 0.245 | 0.187 | 0.846 |
|
| |
| COACH | 1645 | 0.186 | 0.292 | 0.227 |
| 0.12 | 0.338 | |
| HUNTER | 5 |
| 0.01 | 0.019 | 0.104 | 0.298 | 0.176 | |
| MCODE | 393 | 0.092 | 0.09 | 0.091 | 0.675 | 0.242 | 0.405 |
#Complexes refers to the number of predicted complexes. F: F-score, P: precision, R: recall. The highest score of each approach is shown in bold
Performance comparison in computational time
| Methods | Gavin data | Krogan data | MIPS data | STRING data |
|---|---|---|---|---|
| Our method | 1,624 ms | 2,150 ms | 3,487 ms | 68,719 ms |
| CSO | 173,562 ms | 40,954 ms | 215,476 ms | >12 h |
| Cluster ONE | 2,166 ms | 3,154 ms | 4,317 ms | 183,634 ms |
| COACH | 1,783 ms | 1,207 ms | 3,772 ms | 3,351,694 ms |
| CMC | 339 ms | 1,397 ms | 1,450 ms | >12 h |
| HUNTER | 172 ms | 3,322 ms | 5,451 ms | 1,222,027 ms |
| MCODE | 1,879 ms | 1,985 ms | 3,732 ms | 785,616 ms |
Fig. 2RNA polymerase I complex predicted by our method on STRING dataset
Fig. 3Examples of protein complexes predicted by our method