| Literature DB >> 21569426 |
Abstract
BACKGROUND: With the exploding volume of data generated by continuously evolving high-throughput technologies, biological network analysis problems are growing larger in scale and craving for more computational power. General Purpose computation on Graphics Processing Units (GPGPU) provides a cost-effective technology for the study of large-scale biological networks. Designing algorithms that maximize data parallelism is the key in leveraging the power of GPUs.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21569426 PMCID: PMC3115853 DOI: 10.1186/1471-2105-12-149
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Shortest path-based centrality metrics
| Centrality | Equation | Description |
|---|---|---|
| Betweenness (BC) | fraction of shortest paths between all other nodes that run through node | |
| Closeness (CC) | reciprocal of average shortest path distance | |
| Eccentricity (EC) | reciprocal of maximum shortest path distance | |
| Stress (SC) | total number of shortest paths between all other nodes that run through |
Figure 1Pseudo-code for computing betweenness centrality on GPU. Lines 1-25 implement the main function that is executed on CPU. Code between lines 26-39 is the kernel function that carries out the All-Pairs Shortest Path algorithm. Lines 40-55 implement the back propagation where the final values of betweenness centrality for each node are set.
Figure 2Performance improvement over the most recent GPU-based betweenness centrality algorithm. We benchmarked our betweenness centrality algorithm against the one described in [18]. Results are based on 25 randomly generated scale-free networks with n varied from 10, 000 to 50, 000 and β varied from 10 and 50. n represents the number of nodes in the network and β controls the preferential attachment process for generating the scale-free networks.
Figure 3Speedup of centrality computation with GPU as compared to CPU implementations. (a) Speedup as a function of β when network size is fixed. (b) Speedup as a function of network size n when β is fixed. (c) Speedup of four centrality metrics for a human protein-protein interaction network. (d) Speedup of four centrality metrics for a breast cancer gene co-expression network. n represents the number of nodes in the network and β controls the preferential attachment process for generating the scale-free networks. BC: betweenness centrality; CC: closeness centrality; EC: eccentricity centrality; SC: stress centrality.
Running times on GPU vs. CPU for centrality computations in a randomly generated scale-free network (n = 30, 000, β = 50)
| Centrality | CPU time ( | GPU time ( | Speedup |
|---|---|---|---|
| Betweenness (BC) | 17777.0 | 365.5 | 48.64 |
| Closeness (CC) | 3914.7 | 92.6 | 42.29 |
| Eccentricity (EC) | 3954.1 | 91.4 | 43.24 |
| Stress (SC) | 16950.1 | 338.2 | 50.12 |