| Literature DB >> 28883909 |
Zekun Yin1, Haidong Lan1, Guangming Tan2, Mian Lu3, Athanasios V Vasilakos4, Weiguo Liu1.
Abstract
The last decade has witnessed an explosion in the amount of available biological sequence data, due to the rapid progress of high-throughput sequencing projects. However, the biological data amount is becoming so great that traditional data analysis platforms and methods can no longer meet the need to rapidly perform data analysis tasks in life sciences. As a result, both biologists and computer scientists are facing the challenge of gaining a profound insight into the deepest biological functions from big biological data. This in turn requires massive computational resources. Therefore, high performance computing (HPC) platforms are highly needed as well as efficient and scalable algorithms that can take advantage of these platforms. In this paper, we survey the state-of-the-art HPC platforms for big biological data analytics. We first list the characteristics of big biological data and popular computing platforms. Then we provide a taxonomy of different biological data analysis applications and a survey of the way they have been mapped onto various computing platforms. After that, we present a case study to compare the efficiency of different computing platforms for handling the classical biological sequence alignment problem. At last we discuss the open issues in big biological data analytics.Entities:
Keywords: Big biological data; Computational biology applications; Computing platforms; GPU; Intel MIC; NGS
Year: 2017 PMID: 28883909 PMCID: PMC5581845 DOI: 10.1016/j.csbj.2017.07.004
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Classic bioinformatics applications. C&C is short for cluster and cloud. In this table, at first we give a short description about specifications of each application. Then three major characteristics are listed: memory access pattern, computation density and I/O density. At last we list platforms the applications have been implemented on.
| Application | Specification | Memory access pattern | Computation | I/O | Platform | |||
|---|---|---|---|---|---|---|---|---|
| Multi-core | GPU | MIC | C&C | |||||
| ClustalW | Classic but old | Regular and irregular | High | Low | ||||
| Clustal Omega | Fast and scalable | Regular and irregular | High | Moderate | NA | NA | NA | |
| Smith-Waterman | Small database, optimal results | Regular | High | Moderate | ||||
| Blast | Large DB, heuristic algorithm | Irregular and regular | High | Low | Yes | NA | ||
| BLAT | In memory, fast than Blast | Regular and irregular | High | Moderate | NA | NA | NA | NA |
| Bowtie2 | Typical short read alignment tool | Irregular | Moderate | Moderate | NA | NA | NA | |
| BWA | Typical short read alignment tool | Irregular suffix array | Low | High | NA | |||
| mrFast | Short read, all mapper | Regular filter strategy | High | Low | NA | NA | NA | |
| SPADES | Fast assembler, single and multi cell | Irregular | Low | High | NA | NA | NA | |
| BFCounter | Error correction | Regular | High | Low | NA | NA | ||
| Fiona | Error correction | Irregular | Low | High | NA | NA | NA | |
Parallel algorithm design.
| Application | Description | Data organization | Coarse-grained parallel | Fine-grained parallel |
|---|---|---|---|---|
| SWIPE | Multi-core Smith-Waterman database search | Sequence profile | Multi-thread | SIMD |
| XSW | Smith-Waterman database search on Xeon Phi | Pre-processing | Multi-thread | SIMD |
| CUDASW++ | Smith-Waterman database search on GPUs | Texture filter | Data | SIMT |
| LSDBS | Large-scale database search on Xeon Phi | Pre-processing | Multi-thread | SIMD |
| CUDA-BLASTP | Accelerating BLASTP utilizing CUDA | DFA reorganization | Data | SIMT |
| MSA-CUDA | ClustalW accelerated using CUDA | Sorting | Data | SIMT |
| FHAST | FPGA-based acceleration of BOWTIE in hardware | Index | Data | – |
| BWA | A typical best mapper algorithm | BWT & FM-index | Multi-thread | – |
| BitMapper | A typical all mapper algorithm | Hash index | Multi-thread | SIMD |
| DecGPU | GPU based error correction algorithm | bloom filter | Data | SIMT |
Application optimization.
| Application | Data transfer | Memory access | Cache | Load balance | Heterogeneous computing |
|---|---|---|---|---|---|
| SWIPE | Synchronized | Score profile | – | Dynamic | CPU |
| XSW | Asynchronized | Score profile | – | Dynamic | Xeon Phi native |
| CUDASW++ | Synchronized | Query profile | Texture | Static | CPU + GPU |
| LSDBS | Asynchronized | Score profile | Multi-pass | Dynamic | CPU + Xeon Phi |
| CUDA-BLASTP | Synchronized | Memory coalescing | DFA index table | Static | GPU |
| MSA-CUDA | Synchronized | Memory coalescing | – | – | GPU |
| FHAST | Synchronized | – | – | – | FPGA |
| BWA | – | Index | – | – | – |
| BitMapper | – | – | – | Dynamic | CPU |
| DecGPU | Asynchronized | Memory coalescing | – | Dynamic | GPU |
The major studies for accelerating Smith-Waterman algorithm on Xeon Phi.
| Study | Work mode | Perf (1 Phi) | CPU | DB size restrict |
|---|---|---|---|---|
| XSW | Native | 70 GCUPS | N/A | Phi memory |
| SWAPHI | Offload | 62 GCUPS | N/A | System memory |
| SWIMM | Offload | 45 GCPUS | Yes | System memory |
| LSDBS | Offload | 72 GCUPS | Yes | Hard disk |