| Literature DB >> 23762819 |
Chun-Hung Richard Lin1, Chun-Hao Wen, Ying-Chih Lin, Kuang-Yuan Tung, Rung-Wei Lin, Chun-Yuan Lin.
Abstract
Bioinformatics is advanced from in-house computing infrastructure to cloud computing for tackling the vast quantity of biological data. This advance enables large number of collaborative researches to share their works around the world. In view of that, retrieving biological data over the internet becomes more and more difficult because of the explosive growth and frequent changes. Various efforts have been made to address the problems of data discovery and delivery in the cloud framework, but most of them suffer the hindrance by a MapReduce master server to track all available data. In this paper, we propose an alternative approach, called PRKad, which exploits a Peer-to-Peer (P2P) model to achieve efficient data discovery and delivery. PRKad is a Kademlia-based implementation with Round-Trip-Time (RTT) as the associated key, and it locates data according to Distributed Hash Table (DHT) and XOR metric. The simulation results exhibit that our PRKad has the low link latency to retrieve data. As an interdisciplinary application of P2P computing for bioinformatics, PRKad also provides good scalability for servicing a greater number of users in dynamic cloud environments.Entities:
Year: 2013 PMID: 23762819 PMCID: PMC3665235 DOI: 10.1155/2013/361327
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
k-bucket.
| Distance | List with |
|---|---|
| 20 | ■→null |
| 21 | ■→■→null |
| 22 | ■→■→■→■→null |
| ⋮ | ⋮ |
| 2159 | ■→■ →■→■→■→■→■→■→■→■→null |
Figure 1PRID representation.
Figure 2PR tree.
Figure 3Example of eight nodes with their RTT values.
Figure 4A construction example of PR tree.
Figure 5Simulation flow.
Figure 6Illustration of a successful search.
Figure 7Rate of successful searches.
Figure 8Average of hop counts for successful searches.
Figure 9Average of RTT values for nodes with successful searches.
Figure 10Average of message counts for nodes with successful searches.
Figure 11Amount of messages used in simulations.
Figure 12Average of RTT values for nodes with successful searches.