| Literature DB >> 26329886 |
Feng Yu, Zhi Yang, Xiao Hu, Yuan Sun, Hong Lin, Jian Wang.
Abstract
BACKGROUND: Revealing protein complexes are important for understanding principles of cellular organization and function. High-throughput experimental techniques have produced a large amount of protein interactions, which makes it possible to predict protein complexes from protein-protein interaction (PPI) networks. However, the small amount of known physical interactions may limit protein complex detection.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26329886 PMCID: PMC4705505 DOI: 10.1186/1471-2105-16-S12-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1The architecture of PPIExtractor.
Properties of three yeast PPI datasets.
| Datasets | DIP | Krogan | Gavin |
|---|---|---|---|
| Number of proteins | 4928 | 3581 | 1430 |
| Number of interactions | 17201 | 14076 | 6531 |
Properties of denoising PPI networks with different denoising thresholds.
| den_thred | DIP | Krogan | Gavin | |||
|---|---|---|---|---|---|---|
| 0.5 | 3669 | 11617 | 2400 | 9507 | 1318 | 5971 |
| 0.6 | 3536 | 11316 | 2331 | 9367 | 1315 | 5963 |
| 0.7 | 3536 | 11316 | 2331 | 9367 | 1315 | 5963 |
| 0.8 | 3490 | 11190 | 2309 | 9313 | 1312 | 5958 |
| 0.9 | 3450 | 11084 | 2273 | 9223 | 1304 | 5942 |
| 1.0 | 3402 | 10933 | 2235 | 9143 | 1301 | 5928 |
| 1.1 | 3205 | 9486 | 2103 | 7736 | 1267 | 5492 |
| 1.2 | 3164 | 9381 | 2078 | 7676 | 1261 | 5486 |
| 1.3 | 3095 | 9219 | 2043 | 7572 | 1255 | 5469 |
| 1.4 | 3011 | 9019 | 1998 | 7451 | 1252 | 5449 |
#den_thred. denotes the different denoising thresholds; #Pro. denotes the number of proteins. #Int. denotes the number of interactions.
The amounts of the PPIs added into the original PPI networks with different integrating thresholds.
| int_thred | DIP | Krogan | Gavin |
|---|---|---|---|
| 0 | 1206 | 857 | 205 |
| -0.1 | 1661 | 1166 | 288 |
| -0.2 | 2197 | 1525 | 371 |
| -0.3 | 2789 | 1953 | 455 |
| -0.4 | 3534 | 2470 | 568 |
| -0.5 | 4470 | 3079 | 684 |
| -0.6 | 5713 | 3907 | 866 |
| -0.7 | 7257 | 4879 | 1096 |
| -0.8 | 9153 | 6091 | 1447 |
| -0.9 | 11314 | 7482 | 1821 |
| -1.0 | 13257 | 8669 | 2125 |
| -1.1 | 14241 | 9242 | 2270 |
| -1.2 | 14580 | 9433 | 2315 |
#int_thred. denotes the different integrating thresholds.
The amounts of the PPIs added into the denoising PPI networks with different denoising thresholds.
| den_thred | DIP | Krogan | Gavin |
|---|---|---|---|
| 0.5 | 4214 | 2149 | 685 |
| 0.6 | 4156 | 2110 | 684 |
| 0.7 | 4156 | 2110 | 684 |
| 0.8 | 4132 | 2103 | 683 |
| 0.9 | 4088 | 2069 | 679 |
| 1.0 | 4062 | 2032 | 678 |
| 1.1 | 3411 | 1612 | 590 |
| 1.2 | 3357 | 1595 | 585 |
| 1.3 | 3277 | 1546 | 581 |
| 1.4 | 3204 | 1524 | 570 |
#den_thred. denotes the different denoising thresholds.
Protein complex detection algorithm.
The details of the gold standard protein complexes of original PPI networks.
| DIP | Krogan | Gavin | |
|---|---|---|---|
| Number of complexes | 732 | 623 | 584 |
| The average size of complexes | 7.18 | 6.95 | 6.27 |
The details of the gold standard protein complexes of denoising PPI networks with different denoising thresholds.
| den_thred | DIP | Krogan | Gavin | |||
|---|---|---|---|---|---|---|
| #complex | #size | #complex | #size | #complex | #size | |
| 0.5 | 679 | 7.01 | 565 | 7.04 | 542 | 6.49 |
| 0.6 | 673 | 7.03 | 563 | 7.03 | 542 | 6.49 |
| 0.7 | 673 | 7.03 | 563 | 7.03 | 542 | 6.49 |
| 0.8 | 673 | 7.01 | 563 | 7.02 | 542 | 6.49 |
| 0.9 | 668 | 7.03 | 557 | 7.03 | 534 | 6.55 |
| 1.0 | 667 | 7.03 | 552 | 7.07 | 533 | 6.56 |
| 1.1 | 660 | 7.03 | 541 | 7.1 | 518 | 6.66 |
| 1.2 | 658 | 7.04 | 539 | 7.12 | 517 | 6.67 |
| 1.3 | 653 | 7.06 | 538 | 7.11 | 517 | 6.67 |
| 1.4 | 649 | 7.05 | 533 | 7.12 | 515 | 6.68 |
#den_thred. denotes the different denoising thresholds; #Den. denotes the denoising PPI network; # Den.Ext. denotes the denoising extended PPI network (denoising PPI network added new PPI data); #complex denotes the number of complexes; #size denotes the average size of complexes.
The F-score performances of SLPC on original PPI networks with different integrating thresholds.
| int_thred | DIP | Krogan | Gavin | Avg.Δ |
|---|---|---|---|---|
| Origin | 0.5531 | 0.5029 | 0.6389 | |
| 0 | 0.5543 | 0.5298 | 0.6518 | 2.53% |
| -0.1 | 0.5463 | 0.5348 | 0.6665 | 3.14% |
| -0.2 | 0.5481 | 0.5382 | 0.6658 | 3.44% |
| -0.3 | 0.5621 | 0.5515 | 0.6623 | 4.98% |
| -0.4 | 0.5527 | 0.5485 | 0.6642 | 4.32% |
| -0.5 | 0.5577 | 0.544 | 0.6638 | 4.30% |
| -0.7 | 0.5418 | 0.5355 | 0.6638 | 2.78% |
| -0.8 | 0.5409 | 0.5329 | 0.6685 | 2.80% |
| -0.9 | 0.5335 | 0.5471 | 0.6694 | 3.34% |
| -1.0 | 0.5224 | 0.5445 | 0.6511 | 1.54% |
| -1.1 | 0.5138 | 0.5403 | 0.6501 | 0.69% |
| -1.2 | 0.5166 | 0.5368 | 0.6487 | 0.56% |
#int_thred. denotes the different integrating thresholds; Avg.Δ denotes the average F-score improvement over that on the original PPI networks.
The Accuracy performances of SLPC on original PPI networks with different integrating thresholds.
| int_thred | DIP | Krogan | Gavin | Avg.Δ |
|---|---|---|---|---|
| Origin | 0.3212 | 0.2984 | 0.3238 | |
| 0 | 0.323 | 0.3112 | 0.3285 | 2.10% |
| -0.1 | 0.3249 | 0.3108 | 0.3309 | 2.50% |
| -0.2 | 0.3241 | 0.3185 | 0.3331 | 3.50% |
| -0.3 | 0.3275 | 0.3233 | 0.3327 | 4.35% |
| -0.4 | 0.3284 | 0.3249 | 0.3336 | 4.72% |
| -0.5 | 0.3339 | 0.3301 | 0.3336 | 5.87% |
| -0.7 | 0.3401 | 0.337 | 0.3369 | 7.62% |
| -0.8 | 0.3424 | 0.3411 | 0.3383 | 8.46% |
| -0.9 | 0.3397 | 0.3409 | 0.3367 | 8.00% |
| -1.0 | 0.3453 | 0.3428 | 0.3376 | 8.88% |
| -1.1 | 0.3423 | 0.3425 | 0.3378 | 8.56% |
| -1.2 | 0.3427 | 0.342 | 0.3386 | 8.63% |
The MMR performances of SLPC on original PPI networks with different integrating thresholds.
| int_thred | DIP | Krogan | Gavin | Avg.Δ |
|---|---|---|---|---|
| Origin | 0.306 | 0.2933 | 0.3562 | |
| 0 | 0.3156 | 0.3135 | 0.3646 | 4.13% |
| -0.1 | 0.3224 | 0.3180 | 0.3722 | 6.09% |
| -0.2 | 0.3269 | 0.3244 | 0.3802 | 8.06% |
| -0.3 | 0.3364 | 0.3328 | 0.3811 | 10.13% |
| -0.4 | 0.3385 | 0.3420 | 0.3880 | 12.05% |
| -0.5 | 0.3468 | 0.3529 | 0.3898 | 14.36% |
| -0.7 | 0.3478 | 0.3603 | 0.3984 | 16.12% |
| -0.8 | 0.3603 | 0.3684 | 0.4000 | 18.55% |
| -0.9 | 0.3669 | 0.3767 | 0.4084 | 21.00% |
| -1.0 | 0.3626 | 0.3796 | 0.4064 | 20.67% |
| -1.1 | 0.3632 | 0.3767 | 0.4062 | 20.39% |
| -1.2 | 0.3633 | 0.3766 | 0.4087 | 20.62% |
The details of predicted complexes of SLPC on original PPI networks with different integrating thresholds.
| int_ | DIP | Krogan | Gavin | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| #gl_sz | #cluster | #size | #matched | #gl_sz | #cluster | #size | #matched | #gl_sz | #cluster | #size | #matched | |
| Origin | 7.18 | 844 | 9.49 | 543 | 6.95 | 710 | 14.97 | 419 | 6.27 | 337 | 9.01 | 273 |
| 0 | 7.18 | 981 | 9.14 | 606 | 6.95 | 787 | 12.30 | 486 | 6.27 | 351 | 8.99 | 289 |
| -0.1 | 7.18 | 1033 | 9.92 | 616 | 6.95 | 816 | 12.71 | 497 | 6.27 | 360 | 8.98 | 299 |
| -0.2 | 7.18 | 1110 | 8.66 | 652 | 6.95 | 853 | 11.42 | 516 | 6.27 | 369 | 8.85 | 303 |
| -0.3 | 7.18 | 1195 | 8.5 | 717 | 6.95 | 891 | 10.28 | 549 | 6.27 | 380 | 8.81 | 308 |
| -0.4 | 7.18 | 1271 | 8.39 | 736 | 6.95 | 968 | 10.45 | 578 | 6.27 | 396 | 8.86 | 319 |
| -0.5 | 7.18 | 1396 | 8.26 | 797 | 6.95 | 1040 | 10.03 | 595 | 6.27 | 397 | 8.94 | 318 |
| -0.6 | 7.18 | 1580 | 8.42 | 889 | 6.95 | 1149 | 9.55 | 653 | 6.27 | 426 | 8.97 | 337 |
| -0.7 | 7.18 | 1713 | 8.47 | 918 | 6.95 | 1243 | 9.26 | 663 | 6.27 | 446 | 9.07 | 347 |
| -0.8 | 7.18 | 1928 | 8.72 | 1006 | 6.95 | 1402 | 9.24 | 718 | 6.27 | 475 | 8.89 | 367 |
| -0.9 | 7.18 | 2147 | 8.77 | 1084 | 6.95 | 1555 | 9.10 | 816 | 6.27 | 521 | 9.01 | 395 |
| -1.0 | 7.18 | 2171 | 9.12 | 1043 | 6.95 | 1612 | 8.83 | 834 | 6.27 | 540 | 8.88 | 393 |
| -1.1 | 7.18 | 2139 | 9.40 | 995 | 6.95 | 1618 | 9.08 | 825 | 6.27 | 555 | 8.81 | 404 |
| -1.2 | 7.18 | 2171 | 9.46 | 1016 | 6.95 | 1636 | 9.26 | 830 | 6.27 | 555 | 8.93 | 402 |
#gl_size denotes the average size of the gold standard protein complexes on original PPI networks; #size denotes the average size of predicted complexes. #cluster denotes the number of predicted complexes on extended PPI networks; #matched denotes the matching number between the predicted complexes and the gold standard protein complexes;
The F-score performances of SLPC on denoising PPI networks with different denoising thresholds.
| Threshold | DIP | Krogan | Gavin | Avg.Δ | |||
|---|---|---|---|---|---|---|---|
| #Den.Ext | |||||||
| 0.5 | 0.5815 | 0.5889 | 0.5393 | 0.5761 | 0.6789 | 0.7006 | 3.76% |
| 0.6 | 0.5849 | 0.5912 | 0.543 | 0.5854 | 0.6789 | 0.7021 | 4.10% |
| 0.7 | 0.586 | 0.5905 | 0.5418 | 0.5778 | 0.6789 | 0.7012 | 3.57% |
| 0.8 | 0.5834 | 0.5939 | 0.5414 | 0.5778 | 0.6767 | 0.7001 | 3.99% |
| 0.9 | 0.5852 | 0.5962 | 0.5456 | 0.5819 | 0.6839 | 0.7057 | 3.91% |
| 1.0 | 0.5881 | 0.596 | 0.5503 | 0.5864 | 0.6855 | 0.7072 | 3.69% |
| 1.1 | 0.5538 | 0.5785 | 0.5624 | 0.5993 | 0.6627 | 0.7006 | 5.58% |
| 1.2 | 0.5568 | 0.5776 | 0.5645 | 0.5972 | 0.6634 | 0.7015 | 5.09% |
| 1.3 | 0.5572 | 0.582 | 0.5691 | 0.5984 | 0.6634 | 0.7011 | 5.09% |
| 1.4 | 0.5537 | 0.5845 | 0.565 | 0.5989 | 0.6672 | 0.7065 | 5.82% |
| ClusterONE(0.9) | 0.4412 | 0.4241 | 0.4834 | 0.4847 | 0.6418 | 0.6710 | 0.31% |
| Δ(0.9) | 40.58% | 20.05% | 17.42% | 26.02% | |||
#Den. denotes the denoising PPI network. #Den.Ext. denotes the denoising extended PPI network. Avg.Δ denotes the average F-score improvement with the different denoising threshold over that on the corresponding denoising networks. Δ(0.9) denotes the improvement of SLPC over ClusterONE with the denoising threshold 0.9.
The Accuracy performances of SLPC on denoising PPI networks with different denoising thresholds.
| den_thred | DIP | Krogan | Gavin | Avg.Δ | |||
|---|---|---|---|---|---|---|---|
| #Den.Ext | |||||||
| 0.5 | 0.331 | 0.3515 | 0.3215 | 0.3375 | 0.3309 | 0.3415 | 4.79% |
| 0.6 | 0.3312 | 0.3508 | 0.322 | 0.3374 | 0.3312 | 0.3415 | 4.60% |
| 0.7 | 0.3313 | 0.351 | 0.3216 | 0.3373 | 0.3312 | 0.3413 | 4.63% |
| 0.8 | 0.3307 | 0.3513 | 0.3227 | 0.338 | 0.3308 | 0.3425 | 4.84% |
| 0.9 | 0.3315 | 0.3516 | 0.3242 | 0.3393 | 0.3316 | 0.3419 | 4.61% |
| 1.0 | 0.3314 | 0.3528 | 0.3256 | 0.3401 | 0.3324 | 0.3419 | 4.59% |
| 1.1 | 0.3229 | 0.3482 | 0.3215 | 0.3391 | 0.3268 | 0.3412 | 5.91% |
| 1.2 | 0.324 | 0.3478 | 0.3218 | 0.3401 | 0.327 | 0.3408 | 5.75% |
| 1.3 | 0.3232 | 0.3477 | 0.3229 | 0.3405 | 0.327 | 0.3413 | 5.80% |
| 1.4 | 0.3227 | 0.3461 | 0.3231 | 0.3397 | 0.327 | 0.34 | 5.45% |
| Clusterone(0.9) | 0.4284 | 0.4267 | 0.3937 | 0.3985 | 0.4108 | 0.4124 | 0.40% |
| Δ(0.9) | -17.60% | -14.86% | -17.10% | -16.52% | |||
The MMR performances of SLPC on denoising PPI networks with different denoising thresholds.
| den_thred | DIP | Krogan | Gavin | Avg.Δ | |||
|---|---|---|---|---|---|---|---|
| #Den. | #Den.Ext. | #Den. | #Den. | #Den.Ext | |||
| 0.5 | 0.3294 | 0.3678 | 0.327 | 0.3817 | 0.3873 | 0.4225 | 12.49% |
| 0.6 | 0.3319 | 0.3702 | 0.3281 | 0.3822 | 0.3879 | 0.4231 | 12.37% |
| 0.7 | 0.3319 | 0.3703 | 0.3279 | 0.3816 | 0.3879 | 0.423 | 12.33% |
| 0.8 | 0.3321 | 0.3709 | 0.3278 | 0.3813 | 0.3873 | 0.4224 | 12.36% |
| 0.9 | 0.3342 | 0.3727 | 0.3317 | 0.3848 | 0.3935 | 0.428 | 12.10% |
| 1.0 | 0.3347 | 0.3729 | 0.3351 | 0.3871 | 0.3943 | 0.4291 | 11.92% |
| 1.1 | 0.3148 | 0.3565 | 0.3215 | 0.3789 | 0.3693 | 0.4098 | 14.02% |
| 1.2 | 0.3151 | 0.3573 | 0.3203 | 0.3786 | 0.3703 | 0.4107 | 14.17% |
| 1.3 | 0.3171 | 0.3585 | 0.3231 | 0.3797 | 0.3703 | 0.4098 | 13.75% |
| 1.4 | 0.3154 | 0.3614 | 0.3247 | 0.382 | 0.3705 | 0.4125 | 14.52% |
| Clusterone(0.9) | 0.2913 | 0.2829 | 0.3050 | 0.3188 | 0.3649 | 0.3730 | 1.29% |
| Δ(0.9) | 31.74% | 20.70% | 14.75% | 22.40% | |||