| Literature DB >> 21115439 |
Abstract
MOTIVATION: Advanced technologies are producing large-scale protein-protein interaction data at an ever increasing pace. A fundamental challenge in analyzing these data is the inference of protein machineries. Previous methods for detecting protein complexes have been mainly based on analyzing binary protein-protein interaction data, ignoring the more involved co-complex relations obtained from co-immunoprecipitation experiments.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21115439 PMCID: PMC3008648 DOI: 10.1093/bioinformatics/btq652
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.An example data set. (a) An input bait–prey graph. Baits are colored in blue and preys are colored in red. (b) Two possible protein complexes and their corresponding subgraphs.
Fig. 2.A comparison of protein complex identification approaches on the data of Gavin ). For each method shown is the sensitivity of the output solution as a function of one minus its specificity. For CODEC shown are two receiver operating characteristic (ROC) curves, corresponding to different weighting strategies (w0 and w1). The evaluation is based on a comparison to known protein complexes from the MIPS database (Mewes, 2002). The CODEC plots were smoothed using a cubic spline.
Fig. 3.A comparison of protein complex identification approaches on the data of Krogan ). See legend of Figure 2 for details.
Comparison to MCODE, MCL and MRF
| Number of | Specificity | Sensitivity | Number of | Specificity | Sensitivity | |||
|---|---|---|---|---|---|---|---|---|
| Complexes | (%) | (%) | (%) | Complexes | (%) | (%) | (%) | |
| CODEC using | 1082 | 77.5 | 77 | 77 | 8348 | 30 | 43 | |
| CODEC using | 1005 | 78.5 | 2973 | 72 | ||||
| MCODE | 73 | 73.5 | 32 | 44.5 | 130 | 25 | 14 | 18 |
| MCL | 411 | 49.5 | 44.5 | 47 | 818 | 19.5 | 46 | 27.5 |
| MRF | 698 | 46.7 | 59 | – | – | – | – | |
A comparison of CODEC, MCODE, MCL and MRF on the datasets Gavin ) and Krogan ). The best result in each column appears in bold.
Comparison to Collins et al. and Gavin
| Number of | Specificity | Sensitivity | ||
|---|---|---|---|---|
| Complexes | (%) | (%) | (%) | |
| CODEC using | 1082 | 77.5 | 77 | 77 |
| CODEC using | 1005 | |||
| 480 | 51.5 | 70.5 | 59.5 | |
| Collins | 258 | 70 | 69.5 | 69.5 |
A comparison of CODEC and the methods of Collins et al. and Gavin et al. on the dataset of Gavin ). The best result in each column appears in bold.
Comparison to Local Modeling
| Number of | Specificity | Sensitivity | ||
|---|---|---|---|---|
| Complexes | (%) | (%) | (%) | |
| CODEC using | 185 | |||
| CODEC using | 180 | 79.5 | 81 | 80 |
| Local Modeling | 272 | 73 | 67 | 70 |
A comparison of CODEC to the Local Modeling approach on the dataset of Gavin (2002). The best result in each column appears in bold.
Fig. 4.A comparison of protein complex identification approaches on the data of Gavin (2002). See legend of Figure 2 for details.