| Literature DB >> 22490084 |
Sylvain Robert Rivard1, Jean-Gabriel Mailloux, Rachid Beguenane, Hung Tien Bui.
Abstract
BACKGROUND: This paper proposes a method of implementing parallel gene prediction algorithms in MATLAB. The proposed designs are based on either Goertzel's algorithm or on FFTs and have been implemented using varying amounts of parallelism on a central processing unit (CPU) and on a graphics processing unit (GPU).Entities:
Mesh:
Substances:
Year: 2012 PMID: 22490084 PMCID: PMC3444342 DOI: 10.1186/1756-0500-5-183
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1Block diagram of the system.
Figure 2DNA sequence converted into its binary counterparts.
Figure 3Illustration of the sliding window approach.
Figure 4Procedure for breaking down large DNA sequences for processing on GPU.
Figure 5Processing time for a sequence of 15 million bps with varying DNA block size and DNA fragment size.
data on HFE2 gene on chromosome 1
| Transcript variant a | 1,2,3b,4 | 2234 |
| Transcript variant b | 1,3b,4 | 2048 |
| Transcript variant c | 1, 3a,4 | 1525 |
| Transcript variant d | 1,4 | 1488 |
HFE2 exon positions on chromosome 1
| Start | 145,413,191 | 145,414,693 | 145,415,278 | 145,415,278 | 145,416,313 |
| End | 145,413,427 | 145,414,879 | 145,415,315 | 145,415,838 | 145,417,545 |
| Length | 236 | 186 | 37 | 560 | 1232 |
Lengths are in bps.
Figure 6Frequency component at f/3 for different areas in the HFE2 gene using a sliding FFT and Goertzel algorithm.
Runtime for varying sequence lengths
| | | | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 goertzel.m | PARFOR | CPU 8 T | 1.06 | 8.29 | 32.91 | 82.16 | 161.89 | 805.44 | TLTC |
| 2 goertzelMEX | FOR | CPU | 0.18 | 1.78 | 7.11 | 17.84 | 35.65 | 178.30 | 535.21 |
| 3 goertzelMEX | PARFOR | CPU 2 T | 0.19 | 0.99 | 3.86 | 9.58 | 19.20 | 100.39 | 287.35 |
| 4 goertzelMEX | PARFOR | CPU 4 T | 0.18 | 0.60 | 2.36 | 5.81 | 11.41 | 56.27 | 164.84 |
| 5 goertzelMEX | PARFOR | CPU 8 T | 0.25 | 0.53 | 1.95 | 4.75 | 9.52 | 47.49 | 164.57 |
| 7 JACKET’s FFT (full sequences) | GFOR | GPU | 0.03 | 0.22 | 0.78 | 1.90 | 3.78 | 18.82 | 57.68 |
| 9 Matlab’s FFT | PARFOR | CPU 8 T | 0.29 | 0.42 | 1.46 | 3.51 | 6.95 | 34.12 | 109.15 |
| 10 Custom Goerztel on GPU | GFOR | GPU | 0.22 | 0.79 | 2.82 | 7.15 | 14.09 | 71.01 | 213.31 |