| Literature DB >> 20158874 |
Xiaoli Li1, Min Wu, Chee-Keong Kwoh, See-Kiong Ng.
Abstract
BACKGROUND: Most proteins form macromolecular complexes to perform their biological functions. However, experimentally determined protein complex data, especially of those involving more than two protein partners, are relatively limited in the current state-of-the-art high-throughput experimental techniques. Nevertheless, many techniques (such as yeast-two-hybrid) have enabled systematic screening of pairwise protein-protein interactions en masse. Thus computational approaches for detecting protein complexes from protein interaction data are useful complements to the limited experimental methods. They can be used together with the experimental methods for mapping the interactions of proteins to understand how different proteins are organized into higher-level substructures to perform various cellular functions.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20158874 PMCID: PMC2822531 DOI: 10.1186/1471-2164-11-S1-S3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1An example of how MCODE detects protein complexes from a small sample graph of protein-protein interactions.
Figure 2CODEC to detect protein complexes from TAP data which are modeled as a bipartite graph.
Figure 3COACH method detects core-attachment complexes with overlapping core structures.
Results of various approaches using DIP data
| Algorithms | MCODE | MCL | RNSC | COACH | CORE | DECAFF | CFinder | DPClus |
|---|---|---|---|---|---|---|---|---|
| # complexes | 50 | 1246 | 2435 | 746 | 1722 | 2190 | 245 | 1143 |
| # covered proteins | 844 | 4930 | 4930 | 1838 | 3777 | 1832 | 2008 | 2987 |
| 21 | 212 | 234 | 285 | 221 | 605 | 84 | 193 | |
| 44 | 256 | 289 | 249 | 256 | 243 | 111 | 274 |
Results of various approaches using Krogan et al.'s data
| Algorithms | MCODE | MCL | RNSC | COACH | CORE | DECAFF | CFinder | DPClus |
|---|---|---|---|---|---|---|---|---|
| # complexes | 52 | 834 | 1890 | 570 | 1232 | 2143 | 122 | 689 |
| # covered proteins | 651 | 3581 | 3581 | 1428 | 2665 | 1478 | 1578 | 1996 |
| 29 | 147 | 245 | 244 | 201 | 759 | 45 | 167 | |
| 45 | 197 | 283 | 193 | 229 | 192 | 63 | 241 |
Figure 4Comparative performance of existing methods in terms of various evaluation metrics for DIP data. The methods are ordered chronologically in the years in which they were published.
Figure 5Comparative performance of existing methods in terms of various evaluation metrics for the Krogan data. The methods are ordered chronologically in the years in which they were published.
Figure 6Proportion of statistically significant complexes predicted by various methods in terms of P-values.
Key milestones of computational protein complex detection
| Methods | Main contributions |
|---|---|
| MCODE [ | MCODE pioneered the computational detection of protein complexes from PPI networks. |
| MCL [ | MCL is a widely used method [ |
| Krogan et al. [ | They provided and exploited a comprehensive TAP dataset for computational protein complex detection. |
| Gavin et al. [ | In addition to providing a TAP dataset widely used for protein complex detection, they described the inherent organization of protein complexes with core-attachment structures. |
| Sharan et al. [ | They systematically conducted cross-species analysis of PPI networks to derive evolutionary information for protein complex detection. |