| Literature DB >> 23864730 |
Abstract
SUMMARY: We propose SW#, a new CUDA graphical processor unit-enabled and memory-efficient implementation of dynamic programming algorithm, for local alignment. It can be used as either a stand-alone application or a library. Although there are other graphical processor unit implementations of the Smith-Waterman algorithm, SW# is the only one publicly available that can produce sequence alignments on genome-wide scale. For long sequences, it is at least a few hundred times faster than a CPU version of the same algorithm. AVAILABILITY: Source code and installation instructions freely available for download at http://complex.zesoi.fer.hr/SW.html.Entities:
Mesh:
Year: 2013 PMID: 23864730 PMCID: PMC3777108 DOI: 10.1093/bioinformatics/btt410
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.SW# done with two CUDA cards. (a–c) show solving, finding alignment startpoint and reconstruction phases, respectively. Gray cells represent executable area, dashed lines represent prunable area, and black arrows show direction of the execution. The best local alignment could be located in the upper part, lower part or both parts of the matrix. In the last case, the total score is the maximum sum of scores of the neighboring cells at the middle of matrix (darker gray cells). The positions of maximum scores in each phase are marked by darkest cells
SW# and CUDAlign run time comparison for different NVIDIA GPU cards and sequence lengths
| Sequences size | CPU | GTX 560 CUDAlign | GTX 570 SW# | GTX 690 |
|---|---|---|---|---|
| 2 × 172 Kb | 660 s | 2.1 s | 1.5 s | 2.6 s |
| 0.5 × 0.5 Mb | 9090 s | 11.8 s | 9.4 s | 5.8 s |
| 3.1 × 3.3 Mb | — | 367 s | 296 s | 119 s |
| 59 × 24 Mb | — | 47123 s | 40359 s | 16263 s |
| 33 × 47 Mb | — | 30369 s | 59228 s | 23614 s |
Note: CPU results are presented only for shorter sequences.
aCUDAlign results are taken from the original article (Sandes and Melo, 2013).
bDual GPU card.