Literature DB >> 28575161

GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping.

Mohammed Alser1, Hasan Hassan2,3, Hongyi Xin4, Oguz Ergin2, Onur Mutlu3, Can Alkan1.   

Abstract

MOTIVATION: High throughput DNA sequencing (HTS) technologies generate an excessive number of small DNA segments -called short reads- that cause significant computational burden. To analyze the entire genome, each of the billions of short reads must be mapped to a reference genome based on the similarity between a read and 'candidate' locations in that reference genome. The similarity measurement, called alignment, formulated as an approximate string matching problem, is the computational bottleneck because: (i) it is implemented using quadratic-time dynamic programming algorithms and (ii) the majority of candidate locations in the reference genome do not align with a given read due to high dissimilarity. Calculating the alignment of such incorrect candidate locations consumes an overwhelming majority of a modern read mapper's execution time. Therefore, it is crucial to develop a fast and effective filter that can detect incorrect candidate locations and eliminate them before invoking computationally costly alignment algorithms.
RESULTS: We propose GateKeeper, a new hardware accelerator that functions as a pre-alignment step that quickly filters out most incorrect candidate locations. GateKeeper is the first design to accelerate pre-alignment using Field-Programmable Gate Arrays (FPGAs), which can perform pre-alignment much faster than software. When implemented on a single FPGA chip, GateKeeper maintains high accuracy (on average >96%) while providing, on average, 90-fold and 130-fold speedup over the state-of-the-art software pre-alignment techniques, Adjacency Filter and Shifted Hamming Distance (SHD), respectively. The addition of GateKeeper as a pre-alignment step can reduce the verification time of the mrFAST mapper by a factor of 10.
AVAILABILITY AND IMPLEMENTATION: https://github.com/BilkentCompGen/GateKeeper. CONTACT: mohammedalser@bilkent.edu.tr or onur.mutlu@inf.ethz.ch or calkan@cs.bilkent.edu.tr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 28575161      PMCID: PMC5860160          DOI: 10.1093/bioinformatics/btx342

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  Efficient q-gram filters for finding all epsilon-matches over a given length.

Authors:  Kim R Rasmussen; Jens Stoye; Eugene W Myers
Journal:  J Comput Biol       Date:  2006-03       Impact factor: 1.479

2.  RazerS--fast read mapping with sensitivity control.

Authors:  David Weese; Anne-Katrin Emde; Tobias Rausch; Andreas Döring; Knut Reinert
Journal:  Genome Res       Date:  2009-07-10       Impact factor: 9.043

3.  Shifted Hamming distance: a fast and accurate SIMD-friendly filter to accelerate alignment verification in read mapping.

Authors:  Hongyi Xin; John Greth; John Emmons; Gennady Pekhimenko; Carl Kingsford; Can Alkan; Onur Mutlu
Journal:  Bioinformatics       Date:  2015-01-10       Impact factor: 6.937

4.  Short Read Mapping: An Algorithmic Tour.

Authors:  Stefan Canzar; Steven L Salzberg
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2015-09-07       Impact factor: 10.961

5.  RazerS 3: faster, fully sensitive read mapping.

Authors:  David Weese; Manuel Holtgrewe; Knut Reinert
Journal:  Bioinformatics       Date:  2012-08-24       Impact factor: 6.937

6.  The GEM mapper: fast, accurate and versatile alignment by filtration.

Authors:  Santiago Marco-Sola; Michael Sammeth; Roderic Guigó; Paolo Ribeca
Journal:  Nat Methods       Date:  2012-10-28       Impact factor: 28.547

7.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

8.  SOAP3-dp: fast, accurate and sensitive GPU-based short read aligner.

Authors:  Ruibang Luo; Thomas Wong; Jianqiao Zhu; Chi-Man Liu; Xiaoqian Zhu; Edward Wu; Lap-Kei Lee; Haoxiang Lin; Wenjuan Zhu; David W Cheung; Hing-Fung Ting; Siu-Ming Yiu; Shaoliang Peng; Chang Yu; Yingrui Li; Ruiqiang Li; Tak-Wah Lam
Journal:  PLoS One       Date:  2013-05-31       Impact factor: 3.240

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications.

Authors:  Faraz Hach; Iman Sarrafi; Farhad Hormozdiari; Can Alkan; Evan E Eichler; S Cenk Sahinalp
Journal:  Nucleic Acids Res       Date:  2014-05-08       Impact factor: 16.971

View more
  7 in total

Review 1.  Nanopore sequencing technology and tools for genome assembly: computational analysis of the current state, bottlenecks and future directions.

Authors:  Damla Senol Cali; Jeremie S Kim; Saugata Ghose; Can Alkan; Onur Mutlu
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

2.  Shouji: a fast and efficient pre-alignment filter for sequence alignment.

Authors:  Mohammed Alser; Hasan Hassan; Akash Kumar; Onur Mutlu; Can Alkan
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

3.  Hardware Acceleration of Genomics Data Analysis: Challenges and Opportunities.

Authors:  Tony Robinson; Jim Harkin; Priyank Shukla
Journal:  Bioinformatics       Date:  2021-05-25       Impact factor: 6.937

Review 4.  Technology dictates algorithms: recent developments in read alignment.

Authors:  Mohammed Alser; Jeremy Rotman; Onur Mutlu; Serghei Mangul; Dhrithi Deshpande; Kodi Taraszka; Huwenbo Shi; Pelin Icer Baykal; Harry Taegyun Yang; Victor Xue; Sergey Knyazev; Benjamin D Singer; Brunilda Balliu; David Koslicki; Pavel Skums; Alex Zelikovsky; Can Alkan
Journal:  Genome Biol       Date:  2021-08-26       Impact factor: 13.583

Review 5.  From molecules to genomic variations: Accelerating genome analysis via intelligent algorithms and architectures.

Authors:  Mohammed Alser; Joel Lindegger; Can Firtina; Nour Almadhoun; Haiyu Mao; Gagandeep Singh; Juan Gomez-Luna; Onur Mutlu
Journal:  Comput Struct Biotechnol J       Date:  2022-08-18       Impact factor: 6.155

6.  Phylogenomic assessment of drug-resistant Mycobacterium tuberculosis strains from Beira, Mozambique.

Authors:  Evangelina Inacio Namburete; Anzaan Dippenaar; Emilyn Costa Conceição; Cinara Feliciano; Margarida Maria Passeri do Nascimento; Kamila Chagas Peronni; Wilson Araújo Silva; Josefo João Ferro; Lee H Harrison; Robin Mark Warren; Valdes Roberto Bollela
Journal:  Tuberculosis (Edinb)       Date:  2020-01-29       Impact factor: 2.973

7.  GRIM-Filter: Fast seed location filtering in DNA read mapping using processing-in-memory technologies.

Authors:  Jeremie S Kim; Damla Senol Cali; Hongyi Xin; Donghyuk Lee; Saugata Ghose; Mohammed Alser; Hasan Hassan; Oguz Ergin; Can Alkan; Onur Mutlu
Journal:  BMC Genomics       Date:  2018-05-09       Impact factor: 3.969

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.