| Literature DB >> 21143777 |
Jianxin Wang1, Min Li, Youping Deng, Yi Pan.
Abstract
The increasing availability of large-scale protein-protein interaction data has made it possible to understand the basic components and organization of cell machinery from the network level. The arising challenge is how to analyze such complex interacting data to reveal the principles of cellular organization, processes and functions. Many studies have shown that clustering protein interaction network is an effective approach for identifying protein complexes or functional modules, which has become a major research topic in systems biology. In this review, recent advances in clustering methods for protein interaction networks will be presented in detail. The predictions of protein functions and interactions based on modules will be covered. Finally, the performance of different clustering methods will be compared and the directions for future research will be discussed.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21143777 PMCID: PMC2999340 DOI: 10.1186/1471-2164-11-S3-S10
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Two typical graphs of the same size and density[20]
Different Definitions of module in protein interaction network[29-31,39,40]
| Module Definitions | References | ||
|---|---|---|---|
| Module Names | Computational Formula | Descriptions | |
| Strong Module | In a strong module each vertex has more connections within the module than with the rest of the graph. | [ | |
| Weak Module | In a weak module the sum of all degrees within subgraph | [ | |
| Chen | A combination of weak module and a new less stringent condition, which is that, collectively, the in-degrees of the vertices in the subgraph are significantly greater than the out-degrees. | [ | |
| Luo | A subgraph | [ | |
| λ-module is a general version of weak module. When | [ | ||
| [ | |||
In Table 1, different criterions are shown that the given subgraph H ⊂ G is a module.
denotes the “in-degree” of vertex i (i.e. the number of edges connecting vertex i to other vertices belonging to H) and denotes the “out-degree” of vertex i (ie. the number of edges connecting vertex i and other vertices in the rest of the graph G). Let k be the degree of vertex i. Then, . and are the weighted “in-degree” and “out-degree” of vertex i, respectively.
Figure 2Overview of the ensemble framework[108]
Main features of 20 typical clustering algorithms for extracting clusters from protein interaction networks.
| Authors | Methods | Weighted graphs supported | Overlapping clusters supported | Objective | Web-Tool Available |
|---|---|---|---|---|---|
| Girvan and Newman 2002 (G-N) | Hierarchical clustering based on betweenness | Functional module | Upon request | ||
| Van Dongen S 2000, Enright | Flow simulation | √ | Protein family detection | ||
| Spirin and Mirny 2003 (SPC) | Hierarchical | √ | Protein complex | ||
| Bader and Hogue 2003 (MCODE) | Local neighbourhood density search | √ | Protein complex | ||
| King | Local search cost based | Protein complex | upon request | ||
| Radicchi | Hierarchical, module definition | Strong module or weak module | upon request | ||
| Pržulj | Minimum cut (HCS) | Protein complex | upon request | ||
| Palla | Clique Percolation | √ | Protein complex; functional module | ||
| Li | Local clique merging | √ | Protein complex | upon request | |
| Altaf-UI-Amin | Local density and periphery search | √ | Protein complex | ||
| Hwang | signal transduction | √ | Functional module | upon request | |
| Zotenko | Complex Overlap Decomposition | √ | Protein complex | upon request | |
| Luo | Hierarchical, module definition | √ | Functional module | upon request | |
| Cho | flow-based clustering and Semantic integration | √ | √ | Functional module | upon request |
| Ulitsky and Shamir 2007 (MATISSE) | Module Analysis via Topology of Interactions and Similarity | √ | √ | Functional module | |
| Gregory 2007 (CONGA) | split betweenness | √ | Functional module | upon request | |
| Li | Local density and distance-based search | √ | Protein complex | ||
| Mete | structural clustering based on common neighbors | √ | Functional module | upon request | |
| Turanalp and Can 2008 (PPISpan) | gSpan | √ | √ | Frequent patterns | |
| Li | Hierarchical clustering based on local metric | √ | Functional module | ||
Figure 3Predicting false negatives and purifying false positives are done on the identified clusters