Literature DB >> 21088030

Anatomy of a hash-based long read sequence mapping algorithm for next generation DNA sequencing.

Sanchit Misra1, Ankit Agrawal, Wei-keng Liao, Alok Choudhary.   

Abstract

MOTIVATION: Recently, a number of programs have been proposed for mapping short reads to a reference genome. Many of them are heavily optimized for short-read mapping and hence are very efficient for shorter queries, but that makes them inefficient or not applicable for reads longer than 200 bp. However, many sequencers are already generating longer reads and more are expected to follow. For long read sequence mapping, there are limited options; BLAT, SSAHA2, FANGS and BWA-SW are among the popular ones. However, resequencing and personalized medicine need much faster software to map these long sequencing reads to a reference genome to identify SNPs or rare transcripts.
RESULTS: We present AGILE (AliGnIng Long rEads), a hash table based high-throughput sequence mapping algorithm for longer 454 reads that uses diagonal multiple seed-match criteria, customized q-gram filtering and a dynamic incremental search approach among other heuristics to optimize every step of the mapping process. In our experiments, we observe that AGILE is more accurate than BLAT, and comparable to BWA-SW and SSAHA2. For practical error rates (< 5%) and read lengths (200-1000 bp), AGILE is significantly faster than BLAT, SSAHA2 and BWA-SW. Even for the other cases, AGILE is comparable to BWA-SW and several times faster than BLAT and SSAHA2. AVAILABILITY: http://www.ece.northwestern.edu/~smi539/agile.html.

Mesh:

Year:  2010        PMID: 21088030     DOI: 10.1093/bioinformatics/btq648

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

Review 1.  Whole genome sequencing as a means to assess pathogenic mutations in medical genetics and cancer.

Authors:  Beryl Royer-Bertrand; Carlo Rivolta
Journal:  Cell Mol Life Sci       Date:  2014-12-30       Impact factor: 9.261

2.  Short Read Mapping: An Algorithmic Tour.

Authors:  Stefan Canzar; Steven L Salzberg
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2015-09-07       Impact factor: 10.961

3.  YAHA: fast and flexible long-read alignment with optimal breakpoint detection.

Authors:  Gregory G Faust; Ira M Hall
Journal:  Bioinformatics       Date:  2012-07-24       Impact factor: 6.937

Review 4.  Technology dictates algorithms: recent developments in read alignment.

Authors:  Mohammed Alser; Jeremy Rotman; Onur Mutlu; Serghei Mangul; Dhrithi Deshpande; Kodi Taraszka; Huwenbo Shi; Pelin Icer Baykal; Harry Taegyun Yang; Victor Xue; Sergey Knyazev; Benjamin D Singer; Brunilda Balliu; David Koslicki; Pavel Skums; Alex Zelikovsky; Can Alkan
Journal:  Genome Biol       Date:  2021-08-26       Impact factor: 13.583

5.  The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote.

Authors:  Yang Liao; Gordon K Smyth; Wei Shi
Journal:  Nucleic Acids Res       Date:  2013-04-04       Impact factor: 16.971

6.  Experience of targeted Usher exome sequencing as a clinical test.

Authors:  Thomas Besnard; Gema García-García; David Baux; Christel Vaché; Valérie Faugère; Lise Larrieu; Susana Léonard; Jose M Millan; Sue Malcolm; Mireille Claustres; Anne-Françoise Roux
Journal:  Mol Genet Genomic Med       Date:  2013-07-10       Impact factor: 2.183

7.  HIA: a genome mapper using hybrid index-based sequence alignment.

Authors:  Jongpill Choi; Kiejung Park; Seong Beom Cho; Myungguen Chung
Journal:  Algorithms Mol Biol       Date:  2015-12-23       Impact factor: 1.405

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.