| Literature DB >> 26437714 |
Mohieddin Jafari1,2, Mehdi Mirzaie3, Mehdi Sadeghi4.
Abstract
BACKGROUND: In the field of network science, exploring principal and crucial modules or communities is critical in the deduction of relationships and organization of complex networks. This approach expands an arena, and thus allows further study of biological functions in the field of network biology. As the clustering algorithms that are currently employed in finding modules have innate uncertainties, external and internal validations are necessary.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26437714 PMCID: PMC4595048 DOI: 10.1186/s12859-015-0755-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Function prediction procedures in network biology. Network-based prediction methods of protein function are described schematically. There are two approaches to explore function from network in biology, direct and module-assisted methods. In the direct method which is not our subject in this study, the annotation of gene/protein neighbors are used to predict function. But in the module-assisted methods, node’s community/neighborhood is principal for function prediction. These methods are also divided into two categories i.e. graph and distance-based clustering methods. The validation is the main step after finding modules in the PPIN or using direct methods. Any assignment of function based on annotation of neighbors or neighborhood should be evaluated by the different validating methods. We introduce the IPN-based validation for this purpose
IPN expression data
| Network name | STRING derived networks (Nodes and Edges) | Co-expression networks (Nodes and Edges) | Matched edges | Ratio of matched edges |
|---|---|---|---|---|
| Interlog protein network (IPN) | 31, 63 | 20, 22 | 13 | ~60 % |
| Protein-protein interaction network (PPIN) | 563, 2431 | 230, 1205 | 150 | 12 % |
Using the expression proteomic data, a correlation network was constructed and compared to database derived networks i.e. IPN and PPIN. This evaluation was performed for rat mitochondrial proteins and related proteins and their links were considered. The number of nodes and edges in STRING derived (col 2) and co-expression (col 3) networks are presented in this table along with the number of matched or same edges in two corresponding networks (col 4). The edge matching ratios of these networks are represented (col 5)
Fig. 2External clustering indices. The average of the calculated indices with SD (Standard Deviation) bars are shown graphically in the five clustering algorithms. As shown, the MCL outperformed in all the indices including even the Rand index with argued imperfection. Note that the range of the Minkowski index is [0, +∞) and the values (here is MCL) near to zero indicate the more similarity. But the other indices range is [0, 1] and the values near to zero specify the less similarity
Main features of the graph clustering methods presented in this study
| Markov clustering (MCL) | Restricted Neighborhood Search Clustering (RNSC) | Laplacian dynamics (LD) | Cartographic Representation (CR) | Genetic Algorithm to find communities in Protein-Protein Interaction networks (GAPPI) | |
|---|---|---|---|---|---|
| Type | Flow simulation & Pagerank centrality | Cost-based local search | Multiscale modular structure | Inter- and intramodule connection | Search inspired by natural evolution |
| Allow multiple assignations | No | No | No | No | No |
| Allow unassigned nodes | No | No | No | No | No |
| Edge-weighted graphs supported | Yes | No | Yes | No | No |
| First application | Protein family detection | Protein complex prediction | High modularity partitions of large (more than million) networks finding | Metabolic network | Protein-protein interaction networks |
| Availability |
|
| Upon Gephi program | Upon request |
|
| Developer (Year) | Enright A.J. et al. (2002) [ | King A.D. et al. (2004) [ | (1) Lambiotte R. et al. (2007) [ | Guimera R. & Amaral LAN (2005) [ | Pizzuti C. & Rombo S. E. (2014) [ |
| (2) Blondel V.D. et al. (2008) [ |