| Literature DB >> 24039752 |
Mahdieh Ghasemi1, Maseud Rahgozar, Gholamreza Bidkhori, Ali Masoudi-Nejad.
Abstract
Graph clustering algorithms are widely used in the analysis of biological networks. Extracting functional modules in protein-protein interaction (PPI) networks is one such use. Most clustering algorithms whose focuses are on finding functional modules try either to find a clique like sub networks or to grow clusters starting from vertices with high degrees as seeds. These algorithms do not make any difference between a biological network and any other networks. In the current research, we present a new procedure to find functional modules in PPI networks. Our main idea is to model a biological concept and to use this concept for finding good functional modules in PPI networks. In order to evaluate the quality of the obtained clusters, we compared the results of our algorithm with those of some other widely used clustering algorithms on three high throughput PPI networks from Sacchromyces Cerevisiae, Homo sapiens and Caenorhabditis elegans as well as on some tissue specific networks. Gene Ontology (GO) analyses were used to compare the results of different algorithms. Each algorithm's result was then compared with GO-term derived functional modules. We also analyzed the effect of using tissue specific networks on the quality of the obtained clusters. The experimental results indicate that the new algorithm outperforms most of the others, and this improvement is more significant when tissue specific networks are used.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24039752 PMCID: PMC3764100 DOI: 10.1371/journal.pone.0072366
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1An overview of the proposed algorithm; the algorithm consists of two main parts.
Figure 2Two red vertices A and A' are creative elements and green vertices are hubs. The original network can be found in [27].
Figure 3Zachary's karate club network; two main clusters indicates by different colors.
The Centrality Measures that used to find Creative elements.
| Measure | Equation/Definition | Reference |
| Degree centrality |
| Koschützk et al. |
| Leverage centrality |
| Joyce et al. |
| Local leader |
| Blondel et al. |
| Strict leader |
| Blondel et al. |
| Closeness |
| Sabidussi |
| Eccentricity |
| Hage et al. |
| Radiality |
| Valente el al. |
| Shortest-path betweenness |
| Freeman |
| PageRank |
| Page et al. |
| Eigenvector |
| Bonacich |
| Power |
| Bonacich |
| Cluatering coefficient |
| Watts et al. |
| K-step markov |
| White et al. |
The top-ranked vertices in two networks based on different centrality measures.
| Measure | The top-ranked vertices in the network from | The top-ranked vertices in karate club |
|
| b1, b3, b4 | 34, 1, 33, 3, 2, 32, 4, 14, 24, 9 |
|
| b2, b3, 9 | 7, 30, 33, 32, 3, 1, 28, 6 |
|
| b1, b2, b3, b4 | 34, 1 |
|
| b1, b2, b3, b4 | 34, 1 |
|
| b1, b3, r1 | 1, 3, 34, 32, 33, 14, 9, 20 |
|
| b1, b3, r1 | 1, 2, 3, 4, 9, 14, 20, 39 |
|
| b1, b3, r1 | 1, 3, 34, 32, 9, 14, 33, 20 |
|
| b1, b3, r1 | 1, 34, 33, 3, 32, 9, 2, 14 |
|
| b1, b3, b4 | 34, 1, 3, 33, 2, 9, 14, 4 |
|
| b1, r1, 5 | 34, 1, 3, 33, 2, 9, 14, 4 |
|
| b1, b4, b2 | 34, 1, 25, 26, 17, 33, 2, 12 |
|
| 15, 16, 22, 23 | 19, 17, 18, 15, 16, 13, 21, 22, 23, 27, 8 |
|
| b1, r1, 14 | 1, 34, 33, 3, 2, 32, 4, 14, 9 |
Figure 4Green vertices are the most creative elements realized by equation (1) in a network which is illustrated Figure 2 (A) and in Zachary's karate club network (B).
Figure 5The pseudocode of the first part of the proposed algorithm.
Figure 6The pseudo code of the second part of the proposed algorithm.
Figure 7The results of Jaccard measure analyses on the results of different algorithms on different network datasets when GO CC derived functional modules were considered.
Figure 8Jaccard measure analyses results on the different network datasets and different algorithms when GO BP derived functional modules were considered.
Figure 9The results of Precision–Recall measure analyses on the results of different algorithms on different network datasets when GO CC derived functional modules were considered.
Figure 10The Precision–Recall measure analyses results on the results of different algorithms on the six different network datasets when GO CC derived functional modules were considered.
Figure 11The results of semantic density measure analyses on the results of different algorithms on the six network datasets when GO CC derived functional modules were considered.
Figure 12Semantic density measure analyses results on the six different networks and different algorithm considering GO BP derived functional modules.