| Literature DB >> 22206604 |
Tejaswini Narayanan1, Merril Gersten, Shankar Subramaniam, Ananth Grama.
Abstract
BACKGROUND: Many recent studies have investigated modularity in biological networks, and its role in functional and structural characterization of constituent biomolecules. A technique that has shown considerable promise in the domain of modularity detection is the Newman and Girvan (NG) algorithm, which relies on the number of shortest-paths across pairs of vertices in the network traversing a given edge, referred to as the betweenness of that edge. The edge with the highest betweenness is iteratively eliminated from the network, with the betweenness of the remaining edges recalculated in every iteration. This generates a complete dendrogram, from which modules are extracted by applying a quality metric called modularity denoted by Q. This exhaustive computation can be prohibitively expensive for large networks such as Protein-Protein Interaction Networks. In this paper, we present a novel optimization to the modularity detection algorithm, in terms of an efficient termination criterion based on a target edge betweenness value, using which the process of iterative edge removal may be terminated.Entities:
Year: 2011 PMID: 22206604 PMCID: PMC3292542 DOI: 10.1186/1756-0500-4-569
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Summary of Networks that were used to validate our approach
| Network | Source | #Vertices | #Edges | |
|---|---|---|---|---|
| Original Network | Network Considered | |||
| C.Elegans | [ | 453 | 4596 | 2025 |
| Yeast* | [ | 3654 | 15316 | 9946 |
| Drosophila | [ | 7666 | 25649 | 25433 |
* The entire Yeast network contains 160,566 interactions. We restrict the dataset to interactions determined by Co-purification or Yeast Two-hybrid experiments. This yields a network of 15,316 interactions
Figure 1Comparison of Runtimes for NG and Gmean algorithms for C.Elegans, Yeast and Drosophila networks.
Figure 2Log scale comparison of total number of modules identified by NG and Gmean algorithms for C.Elegans, Yeast and Drosophila networks.
Figure 3Log scale comparison of number of modules with at least 15 vertices identified by NG and Gmean algorithms for C.Elegans, Yeast and Drosophila networks.
Figure 4Comparison of Modularity (Q) values from NG and Gmean algorithms for C.Elegans, Yeast and Drosophila networks.
Summary of % similarity for biological networks considered
| C.Elegans | Yeast | Drosophila | |
|---|---|---|---|
| 4.5472 | 47.973 | 40.5089 | |
| 5 | 48 | 46 | |
| λ | 90.94% | 99.94% | 88.06% |
Figure 5Flow diagram illustrating the general framework of the proposed Gmean algorithm.