| Literature DB >> 25937944 |
Abstract
This paper focuses on the latest research and critical reviews on modern computing architectures, software and hardware accelerated algorithms for bioinformatics data analysis with an emphasis on one of the most important sequence analysis applications-hidden Markov models (HMM). We show the detailed performance comparison of sequence analysis tools on various computing platforms recently developed in the bioinformatics society. The characteristics of the sequence analysis, such as data and compute-intensive natures, make it very attractive to optimize and parallelize by using both traditional software approach and innovated hardware acceleration technologies.Entities:
Year: 2013 PMID: 25937944 PMCID: PMC4393056 DOI: 10.1155/2013/252183
Source DB: PubMed Journal: ISRN Bioinform ISSN: 2090-7338
Figure 1A hidden Markov model.
Figure 2An example of a possible transmission sequence and symbol sequence.
Figure 3Plan 7 HMM model architecture.
Figure 4A computer cluster with n nodes.
Performance comparison among different software approaches.
| Acceleration strategies | Supporting software/package | Accelerated programs | Datasets | Hardware environment | Achieved speedup |
|---|---|---|---|---|---|
| Instruction-level parallelism | SSE2 Instructions |
|
| 2.66 GHz Intel Xeon processor with 2.5 GB of memory | 1.2x~1.3x [ |
|
| |||||
| Shared memory parallelism | OpenMP |
| 600 HMM profiles and 250 sequences | 16 x86 3.0 GHz processors, 32 MB L4 cache shared among 4 CPUs, 4 MB L3 cache, 8 GB of memory | 14x [ |
| EARTH |
| 50 HMM profiles and 38192 sequences | A cluster that consists of 128 nodes, each with two 500 MHz Pentium III processors | 222.8x [ | |
|
| |||||
| Distributed memory parallelism | PVM |
| 1 HMM profile and 100 MB of | A cluster with 4 nodes, each node consists of two 2.66 GHz Intel Xeon processors with 2.5 GB memory per node | 4.56x [ |
| MPI |
| 5.90x [ | |||
| MPI + I/O optimizations |
| One 236-state HMM profile and | A cluster that consists of 1056 nodes, each equipped with two 3.2 GHz Intel Xeon processors, 2 GB RAM | 221x [ | |
|
| 1.6 GB of | 328x [ | |||
|
| |||||
| Heterogeneous approach | MPI + SSE2 |
| 1 HMM profile and 100 MB of | A cluster with 4 nodes, each node consists of two 2.66 GHz Intel Xeon processors with 2.5 GB memory per node | 7.71x [ |
Performance comparison among different hardware approaches.
| Acceleration hardware type | Hardware accelerator | Accelerated programs | Datasets | Host or base hardware environment | Reported max. speedup |
|---|---|---|---|---|---|
| Network Processor | Intel IXP 2850 network processor |
| Pfam_ls database (7459 models) | 2.6 GHz Intel Pentium 4 CPU with 768 MB of SDRAM and 32 MB of QDR SRAM | 1.82x [ |
|
| |||||
| GPGPU | 8800 GTX Ultra |
| 3 GB nr Database (5.5 million sequences) 3 models (77, 209, 456, 789, and 1431 states) | — | 38.6x [ |
| 4 Tesla C1060s |
| 5.4 GB Database (10.54 million sequences) 3 models (128, 256 and 507 states) | 2.33 GHz AMD Opteron | 100+x [ | |
|
| |||||
| Heterogeneous multi-core chip | CELL BE |
| 100 HMM states and characters | Dual-core 2.4 GHz Opteron with 8 GB RAM | ~3.5x [ |
|
| |||||
| FPGA | Spartan-3 XC3S1500 |
| 244 HMM states and a database consisting of 643,552 sequences | AMD Athlon 64 3500+ | 31x [ |
| Spartan-3 XC3S1500 |
| A database consisting of 1,544 HMMs and 1000 protein sequences | AMD Athlon 64 3500+ | 39x [ | |
| Virtex-5 110T |
| 122,564 query inputs | 2.33 GHz Intel Core2 Duo with 4 GB RAM | 56.8x [ | |
|
| |||||
| Heterogeneous approach (MPI + FPGA) | 2 Spartan-3 XC3S1500 |
| 2 HMM models (77 and 236 states) and 2 databases (217,875 and 2,521,679 sequences) | A cluster consists of 10 worker nodes, each with a dual core AMD Opteron 175 processor with 2 GB memory per node | 30x [ |