| Literature DB >> 24565259 |
Bingjing Cai, Haiying Wang, Huiru Zheng, Hui Wang.
Abstract
BACKGROUND: Detecting protein complexes in protein-protein interaction (PPI) networks plays an important role in improving our understanding of the dynamic of cellular organisation. However, protein interaction data generated by high-throughput experiments such as yeast-two-hybrid (Y2H) and tandem affinity-purification/mass-spectrometry (TAP-MS) are characterised by the presence of a significant number of false positives and false negatives. In recent years there has been a growing trend to incorporate diverse domain knowledge to support large-scale analysis of PPI networks.Entities:
Year: 2013 PMID: 24565259 PMCID: PMC3907791 DOI: 10.1186/1477-5956-11-S1-S2
Source DB: PubMed Journal: Proteome Sci ISSN: 1477-5956 Impact factor: 2.480
General statistics of two gold-standard datasets before and after pre-processing.
| Gold-standard dataset | CYC2008 | MIPS |
|---|---|---|
| No. of proteins | 1627 | 1095 |
| No. of complexes (size ≥ 2) | 408 | 220 |
| Ave. size of complexes | 4.7 | 7.1 |
| No. of proteins | 1389 | 1041 |
| No. of complexes (size ≥ 2) | 360 | 205 |
| Ave. size of complexes | 5.5 | 8.1 |
| No. of proteins | 1592 | 1088 |
| No. of complexes (size ≥ 2) | 406 | 218 |
| Ave. size of complexes | 4.8 | 7.3 |
*Note the set of statistics under the "Original" header show the statistics of the original two gold-standard datasets before pre-processed; The set of statistics under the "On Gavin_2006 network" or " On Krogan_2006 network" header show the statistics of the two gold-standard datasets after being pre-processed on Gavin_2006 network or Krogan_2006 network, respectively.
Evaluation of performance on Gavin_2006 and Krogan_2006 networks using MIPS gold-standard.
| Network | Gavin_2006 | Krogan_2006 | ||
|---|---|---|---|---|
| 0.357 | 0.257 | |||
| 0.601 | 0.455 | |||
| 0.463 | 0.342 | |||
| 0.324 | 0.156 | |||
| 0.692 | 0.642 | |||
| 0.473 | 0.317 | |||
| 0.530 | 0.246 | |||
| 0.357 | 0.257 | |||
| 0.435 | 0.251 | |||
| 0.387 | 0.159 | |||
| 0.267 | 0.149 | |||
| 0.316 | 0.154 | |||
| 0.698 | 0.491 | |||
| 0.354 | 0.122 | |||
| 0.470 | 0.195 | |||
Better measuring values are in bold.
Evaluation of performance on Gavin_2006 and Krogan_2006 networks using CYC2008 gold-standard.
| Network | Gavin_2006 | Krogan_2006 | ||
|---|---|---|---|---|
| 0.461 | 0.300 | |||
| 0.709 | 0.550 | |||
| 0.573 | 0.406 | |||
| 0.307 | 0.134 | |||
| 0.819 | 0.745 | |||
| 0.502 | 0.316 | |||
| 0.670 | 0.371 | |||
| 0.461 | 0.300 | |||
| 0.556 | 0.334 | |||
| 0.536 | 0.255 | |||
| 0.361 | 0.204 | |||
| 0.432 | 0.227 | |||
| 0.867 | 0.740 | |||
| 0.340 | 0.136 | |||
| 0.489 | 0.229 | |||
Better measuring values are in bold.
Number and average size of generated clusters from different methods on Gavin_2006 network and Krogan_2006 network.
| MCL | MCODE | CFinder | RRW | COACH | CODEC-w0 | CODEC-w1 | BGCA+BP | |
|---|---|---|---|---|---|---|---|---|
| Gavin_2006 | ||||||||
| No. of clusters | 782 | 100 | 65 | 474 | 612 | 1082 | 1005 | 542 |
| Ave. size | 5.4 | 12.1 | 16.4 | 2.1 | 78.1 | 17.3 | 13.8 | 5.0 |
| Krogan_2006 | ||||||||
| No. of clusters | 1548 | 73 | 73 | 690 | 1927 | 8348 | 2973 | 511 |
| Ave. size | 5.5 | 25.2 | 15.1 | 2.1 | 181.8 | 16.1 | 16.2 | 5.3 |
Evaluation results on Gavin_2006 network using MIPS gold-standard.
| Quality measures | BGCA+BP | MCL | MCODE | CFinder | RRW | COACH | CODEC-w0 | CODEC-w1 |
|---|---|---|---|---|---|---|---|---|
| 0.425 | 0.413 | 0.271 | 0.334 | 0.107 | 0.451 | 0.450 | ||
| 0.492 | 0.332 | 0.330 | 0.500 | 0.140 | 0.486 | 0.556 | ||
| 0.451 | 0.300 | 0.332 | 0.232 | 0.261 | 0.468 | 0.500 | ||
| 0.279 | 0.138 | 0.109 | 0.104 | 0.061 | 0.250 | 0.273 | ||
| 0.812 | 0.656 | 0.601 | 0.532 | 0.024 | 0.060 | 0.074 | ||
| 0.428 | 0.288 | 0.241 | 0.311 | 0.038 | 0.122 | 0.142 | ||
| 0.614 | 0.334 | 0.212 | 0.251 | 0.058 | 0.296 | 0.400 | ||
| 0.425 | 0.413 | 0.271 | 0.334 | 0.107 | 0.451 | 0.450 | ||
| 0.372 | 0.239 | 0.289 | 0.302 | 0.167 | 0.365 | 0.424 | ||
| 0.458 | 0.245 | 0.146 | 0.185 | 0.042 | 0.140 | 0.178 | ||
| 0.242 | 0.115 | 0.128 | 0.097 | 0.231 | 0.289 | 0.295 | ||
| 0.244 | 0.129 | 0.152 | 0.165 | 0.071 | 0.188 | 0.222 | ||
| 0.655 | 0.404 | 0.619 | 0.739 | 0.141 | 0.213 | 0.273 | ||
| 0.425 | 0.302 | 0.094 | 0.130 | 0.087 | 0.338 | 0.610 | ||
| 0.413 | 0.152 | 0.215 | 0.156 | 0.199 | 0.316 | 0.386 | ||
Better measuring values are in bold.
Evaluation results on Gavin_2006 network using CYC2008 gold-standard.
| Quality measures | BGCA+BP | MCL | MCODE | CFinder | RRW | COACH | CODEC-w0 | CODEC-w1 |
|---|---|---|---|---|---|---|---|---|
| 0.480 | 0.538 | 0.338 | 0.390 | 0.089 | 0 | 0.584 | 0.582 | |
| 0.709 | 0.571 | 0.342 | 0.365 | 0.120 | 0.511 | 0.546 | ||
| 0.555 | 0.340 | 0.377 | 0.261 | 0.268 | 0.546 | 0.564 | ||
| 0.295 | 0.123 | 0.087 | 0.088 | 0.048 | 0.234 | 0.272 | ||
| 0.896 | 0.816 | 0.748 | 0.613 | 0.030 | 0.086 | 0.107 | ||
| 0.490 | 0.303 | 0.231 | 0.295 | 0.038 | 0.141 | 0.171 | ||
| 0.714 | 0.419 | 0.268 | 0.324 | 0.066 | 0.311 | 0.426 | ||
| 0.480 | 0.538 | 0.338 | 0.390 | 0.089 | 0.584 | 0.582 | ||
| 0.475 | 0.301 | 0.356 | 0.281 | 0.198 | 0.426 | 0.498 | ||
| 0.583 | 0.335 | 0.208 | 0.260 | 0.053 | 0.169 | 0.230 | ||
| 0.380 | 0.319 | 0.146 | 0.139 | 0.082 | 0.272 | 0.362 | ||
| 0.326 | 0.172 | 0.181 | 0.145 | 0.088 | 0.230 | 0.287 | ||
| 0.815 | 0.610 | 0.804 | 0.875 | 0.252 | 0.305 | 0.459 | ||
| 0.352 | 0.315 | 0.103 | 0.115 | 0.079 | 0.360 | 0.578 | ||
| 0.506 | 0.454 | 0.176 | 0.201 | 0.144 | 0.297 | 0.399 | ||
Better measuring values are in bold.
Evaluation results on Krogan_2006 network using MIPS gold-standard.
| Quality measures | BGCA+BP | MCL | MCODE | CFinder | RRW | COACH | CODEC-w0 | CODEC-w1 |
|---|---|---|---|---|---|---|---|---|
| 0.438 | 0.183 | 0.219 | 0.290 | 0.028 | 0.420 | 0.404 | ||
| 0.565 | 0.152 | 0.342 | 0.558 | 0.094 | 0.386 | 0.362 | ||
| 0.322 | 0.182 | 0.315 | 0.124 | 0.231 | 0.403 | 0.382 | ||
| 0.202 | 0.049 | 0.078 | 0.037 | 0.019 | 0.226 | 0.215 | ||
| 0.828 | 0.587 | 0.409 | 0.656 | 0.002 | 0.015 | 0.032 | ||
| 0.344 | 0.141 | 0.226 | 0.192 | 0.007 | 0.059 | 0.083 | ||
| 0.449 | 0.355 | 0.069 | 0.237 | 0.040 | 0.180 | 0.175 | ||
| 0.438 | 0.183 | 0.219 | 0.290 | 0.028 | 0.420 | 0.404 | ||
| 0.255 | 0.123 | 0.262 | 0.144 | 0.150 | 0.275 | 0.266 | ||
| 0.328 | 0.203 | 0.055 | 0.206 | 0.034 | 0.105 | 0.115 | ||
| 0.125 | 0.049 | 0.103 | 0.026 | 0.162 | 0.250 | 0.241 | ||
| 0.154 | 0.051 | 0.137 | 0.049 | 0.056 | 0.148 | 0.155 | ||
| 0.824 | 0.280 | 0.154 | 0.808 | 0.024 | 0.212 | 0.337 | ||
| 0.361 | 0.098 | 0.018 | 0.100 | 0.032 | 0.190 | 0.781 | ||
| 0.145 | 0.033 | 0.179 | 0.062 | 0.043 | 0.338 | 0.470 | ||
Better measuring values are in bold.
Evaluation results on Krogan_2006 network using CYC2008 gold-standard.
| Quality measures | BGCA+BP | MCL | MCODE | CFinder | RRW | COACH | CODEC-w0 | CODEC-w1 |
|---|---|---|---|---|---|---|---|---|
| 0.419 | 0.269 | 0.275 | 0.346 | 0.036 | 0.595 | 0.562 | ||
| 0.595 | 0.653 | 0.135 | 0.389 | 0.076 | 0.399 | 0.422 | ||
| 0.419 | 0.193 | 0.366 | 0.163 | 0.224 | 0.487 | 0.487 | ||
| 0.221 | 0.036 | 0.063 | 0.042 | 0.015 | 0.232 | 0.218 | ||
| 0.846 | 0.706 | 0.474 | 0.566 | 0.003 | 0.024 | 0.048 | ||
| 0.413 | 0.131 | 0.189 | 0.205 | 0.007 | 0.075 | 0.102 | ||
| 0.428 | 0.406 | 0.080 | 0.352 | 0.040 | 0.206 | 0.238 | ||
| 0.419 | 0.269 | 0.275 | 0.346 | 0.036 | 0.595 | 0.562 | ||
| 0.331 | 0.148 | 0.349 | 0.169 | 0.162 | 0.350 | 0.366 | ||
| 0.321 | 0.259 | 0.065 | 0.304 | 0.035 | 0.138 | 0.170 | ||
| 0.293 | 0.171 | 0.054 | 0.142 | 0.029 | 0.194 | 0.323 | ||
| 0.206 | 0.059 | 0.194 | 0.055 | 0.059 | 0.197 | 0.223 | ||
| 0.840 | 0.345 | 0.226 | 0.800 | 0.047 | 0.324 | 0.502 | ||
| 0.232 | 0.119 | 0.017 | 0.090 | 0.040 | 0.196 | 0.790 | ||
| 0.363 | 0.177 | 0.032 | 0.162 | 0.076 | 0.075 | 0.470 | ||
Better measuring values are in bold.
Expected values of evaluation results of randomised clustering of BGCA+BP on Gavin_2006 and Krogan_2006 networks using CYC2008 gold-standard.
| Gavin_2006 | Krogan_2006 | |||||
|---|---|---|---|---|---|---|
| Original | Random average | Original | Random average | |||
| 0.583 | 0.095 | 0.000 | 0.499 | 0.095 | 0.000 | |
| 0.540 | 0.123 | 0.000 | 0.432 | 0.093 | 0.000 | |
| 0.585 | 0.072 | 0.000 | 0.423 | 0.064 | 0.000 | |
| 0.460 | 0.023 | 0.000 | 0.306 | 0.017 | 0.000 | |
| 0.506 | 0.001 | 0.000 | 0.363 | 0.001 | 0.000 | |
*p-value is corrected.
Figure 1Robustness of BGCA and BGCA+BP: impact of edge addition on geometric accuracy.
Figure 2Robustness of BGCA and BGCA+BP: impact of edge addition on BH-FMeasure.
Figure 3Robustness of BGCA and BGCA+BP: impact of edge deletion on geometric accuracy.
Figure 4Robustness of BGCA and BGCA+BP: impact of edge deletion on BH-FMeasure.