| Literature DB >> 22537007 |
Yuhong Zhang1, Sanchit Misra, Ankit Agrawal, Md Mostofa Ali Patwary, Wei-Keng Liao, Zhiguang Qin, Alok Choudhary.
Abstract
BACKGROUND: Pairwise statistical significance has been recognized to be able to accurately identify related sequences, which is a very important cornerstone procedure in numerous bioinformatics applications. However, it is both computationally and data intensive, which poses a big challenge in terms of performance and scalability.Entities:
Mesh:
Year: 2012 PMID: 22537007 PMCID: PMC3318904 DOI: 10.1186/1471-2105-13-S5-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Two GPU memory layout strategies. (a) The intuitive layout that one sequence is appended after another. (b) The reorganized layout such that the uchar4 at the same indices in different sequences stay at neighboring positions.
Figure 2The customized substitution matrix. (a) Local query profile: the same symbol (e.g., 'A') at different position in query has same scores. (b) Position specific query profile: the same symbol (e.g., 'A') at different position in query has different scores.
Figure 3Adaptive tile-based strategy. The optimal tile size T can be calculated according to the hardware configuration of GPU. After calculated, the T subject sequences together are transferred to GPU global memory. Permutations and alignments are done in parallel in GPU. Then T × 1000 alignment scores are moved back to CPU for T fittings.
Figure 4Intra-task and inter-task parallelism for single-pair PSSE. We choose four pairs of query and subject sequences of length 200, 400, 800, and 1600 from CATH database.
Figure 5Performance of three strategies for multi-pair PSSE. All experiments are run using 2771 subject sequences and 86 query sequences from the CATH 2.3 database.
The maximum occupancy for three strategies
| Threads/blocks | 64 | 128 | 256 | 512 |
|---|---|---|---|---|
| Intuitive occupancy | 12.5% | 6.25% | 6.25% | 6.25% |
| Data-reuse occupancy | 12.5% | 6.25% | 6.25% | 6.25% |
| Tile-based occupancy | 50% | 100% | 100% | 100% |