Literature DB >> 29747076

Hardware acceleration of BWA-MEM genomic short read mapping for longer read lengths.

Ernst Joachim Houtgast1, Vlad-Mihai Sima2, Koen Bertels3, Zaid Al-Ars3.   

Abstract

We present our work on hardware accelerated genomics pipelines, using either FPGAs or GPUs to accelerate execution of BWA-MEM, a widely-used algorithm for genomic short read mapping. The mapping stage can take up to 40% of overall processing time for genomics pipelines. Our implementation offloads the Seed Extension function, one of the main BWA-MEM computational functions, onto an accelerator. Sequencers typically output reads with a length of 150 base pairs. However, read length is expected to increase in the near future. Here, we investigate the influence of read length on BWA-MEM performance using data sets with read length up to 400 base pairs, and introduce methods to ameliorate the impact of longer read length. For the industry-standard 150 base pair read length, our implementation achieves an up to two-fold increase in overall application-level performance for systems with at most twenty-two logical CPU cores. Longer read length requires commensurately bigger data structures, which directly impacts accelerator efficiency. The two-fold performance increase is sustained for read length of at most 250 base pairs. To improve performance, we perform a classification of the inefficiency of the underlying systolic array architecture. By eliminating idle regions as much as possible, efficiency is improved by up to +95%. Moreover, adaptive load balancing intelligently distributes work between host and accelerator to ensure use of an accelerator always results in performance improvement, which in GPU-constrained scenarios provides up to +45% more performance.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Keywords:  Acceleration; BWA-MEM; FPGA; GPU; Short read mapping; Systolic array

Mesh:

Year:  2018        PMID: 29747076     DOI: 10.1016/j.compbiolchem.2018.03.024

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  26 in total

1.  Targeted panel sequencing establishes the implication of planar cell polarity pathway and involves new candidate genes in neural tube defect disorders.

Authors:  Marie Beaumont; Linda Akloul; Wilfrid Carré; Chloé Quélin; Hubert Journel; Laurent Pasquier; Mélanie Fradin; Sylvie Odent; Houda Hamdi-Rozé; Erwan Watrin; Valérie Dupé; Christèle Dubourg; Véronique David
Journal:  Hum Genet       Date:  2019-03-05       Impact factor: 4.132

2.  Proposal of Smith-Waterman algorithm on FPGA to accelerate the forward and backtracking steps.

Authors:  Fabio F de Oliveira; Leonardo A Dias; Marcelo A C Fernandes
Journal:  PLoS One       Date:  2022-06-30       Impact factor: 3.752

3.  The Chromatin Accessibility Landscape of Peripheral Blood Mononuclear Cells in Patients With Systemic Lupus Erythematosus at Single-Cell Resolution.

Authors:  Haiyan Yu; Xiaoping Hong; Hongwei Wu; Fengping Zheng; Zhipeng Zeng; Weier Dai; Lianghong Yin; Dongzhou Liu; Donge Tang; Yong Dai
Journal:  Front Immunol       Date:  2021-05-18       Impact factor: 7.561

4.  Eleven High-Quality Reference Genome Sequences and 360 Draft Assemblies of Shiga Toxin-Producing Escherichia coli Isolates from Human, Food, Animal, and Environmental Sources in Canada.

Authors:  Shari Tyson; Christy-Lynn Peterson; Adam Olson; Shaun Tyler; Natalie Knox; Emma Griffiths; Damion Dooley; William Hsiao; Jennifer Cabral; Roger P Johnson; Chad Laing; Victor Gannon; Tarah Lynch; Gary Van Domselaar; Fiona Brinkman; Morag Graham
Journal:  Microbiol Resour Announc       Date:  2019-10-10

5.  Differential Expression of Circular RNAs in Polytocous and Monotocous Uterus during the Reproductive Cycle of Sheep.

Authors:  Yongfu La; Jishun Tang; Ran Di; Xiangyu Wang; Qiuyue Liu; Liping Zhang; Xiaosheng Zhang; Jinlong Zhang; Wenping Hu; Mingxing Chu
Journal:  Animals (Basel)       Date:  2019-10-14       Impact factor: 2.752

6.  Differential Expression and Functional Analysis of CircRNA in the Ovaries of Low and High Fecundity Hanper Sheep.

Authors:  Aiju Liu; Xiaoyong Chen; Menghe Liu; Limeng Zhang; Xiaofei Ma; Shujun Tian
Journal:  Animals (Basel)       Date:  2021-06-23       Impact factor: 2.752

7.  Genome re-sequencing and reannotation of the Escherichia coli ER2566 strain and transcriptome sequencing under overexpression conditions.

Authors:  Lizhi Zhou; Hai Yu; Kaihang Wang; Tingting Chen; Yue Ma; Yang Huang; Jiajia Li; Liqin Liu; Yuqian Li; Zhibo Kong; Qingbing Zheng; Yingbin Wang; Ying Gu; Ningshao Xia; Shaowei Li
Journal:  BMC Genomics       Date:  2020-06-16       Impact factor: 3.969

8.  Comprehensive Analysis of Differentially Expressed Profiles of mRNA, lncRNA, and circRNA in the Uterus of Seasonal Reproduction Sheep.

Authors:  Yongfu La; Xiaoyun He; Liping Zhang; Ran Di; Xiangyu Wang; Shangquan Gan; Xiaosheng Zhang; Jinlong Zhang; Wenping Hu; Mingxing Chu
Journal:  Genes (Basel)       Date:  2020-03-12       Impact factor: 4.096

9.  Gene-regulatory network analysis of ankylosing spondylitis with a single-cell chromatin accessible assay.

Authors:  Haiyan Yu; Hongwei Wu; Fengping Zheng; Chengxin Zhu; Lianghong Yin; Weier Dai; Dongzhou Liu; Donge Tang; Xiaoping Hong; Yong Dai
Journal:  Sci Rep       Date:  2020-11-10       Impact factor: 4.379

10.  Optimizing performance of GATK workflows using Apache Arrow In-Memory data framework.

Authors:  Tanveer Ahmad; Nauman Ahmed; Zaid Al-Ars; H Peter Hofstee
Journal:  BMC Genomics       Date:  2020-11-18       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.