Literature DB >> 32915952

Fast gap-affine pairwise alignment using the wavefront algorithm.

Santiago Marco-Sola1,2, Juan Carlos Moure2, Miquel Moreto1,3, Antonio Espinosa2.   

Abstract

MOTIVATION: Pairwise alignment of sequences is a fundamental method in modern molecular biology, implemented within multiple bioinformatics tools and libraries. Current advances in sequencing technologies press for the development of faster pairwise alignment algorithms that can scale with increasing read lengths and production yields.
RESULTS: In this article, we present the wavefront alignment algorithm (WFA), an exact gap-affine algorithm that takes advantage of homologous regions between the sequences to accelerate the alignment process. As opposed to traditional dynamic programming algorithms that run in quadratic time, the WFA runs in time O(ns), proportional to the read length n and the alignment score s, using O(s2) memory. Furthermore, our algorithm exhibits simple data dependencies that can be easily vectorized, even by the automatic features of modern compilers, for different architectures, without the need to adapt the code. We evaluate the performance of our algorithm, together with other state-of-the-art implementations. As a result, we demonstrate that the WFA runs 20-300× faster than other methods aligning short Illumina-like sequences, and 10-100× faster using long noisy reads like those produced by Oxford Nanopore Technologies.
AVAILABILITY AND IMPLEMENTATION: The WFA algorithm is implemented within the wavefront-aligner library, and it is publicly available at https://github.com/smarco/WFA.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Year:  2021        PMID: 32915952     DOI: 10.1093/bioinformatics/btaa777

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  Long-read mapping to repetitive reference sequences using Winnowmap2.

Authors:  Chirag Jain; Arang Rhie; Nancy F Hansen; Sergey Koren; Adam M Phillippy
Journal:  Nat Methods       Date:  2022-04-01       Impact factor: 28.547

2.  AncestralClust: Clustering of Divergent Nucleotide Sequences by Ancestral Sequence Reconstruction using Phylogenetic Trees.

Authors:  Lenore Pipes; Rasmus Nielsen
Journal:  Bioinformatics       Date:  2021-10-20       Impact factor: 6.931

3.  Pair consensus decoding improves accuracy of neural network basecallers for nanopore sequencing.

Authors:  Jordi Silvestre-Ryan; Ian Holmes
Journal:  Genome Biol       Date:  2021-01-19       Impact factor: 13.583

4.  AnchorWave: Sensitive alignment of genomes with high sequence diversity, extensive structural polymorphism, and whole-genome duplication.

Authors:  Baoxing Song; Santiago Marco-Sola; Miquel Moreto; Lynn Johnson; Edward S Buckler; Michelle C Stitzer
Journal:  Proc Natl Acad Sci U S A       Date:  2022-01-04       Impact factor: 12.779

5.  Lightweight Pattern Matching Method for DNA Sequencing in Internet of Medical Things.

Authors:  J A M Rexie; Kumudha Raimond; Mythily Murugaaboopathy; D Brindha; Henock Mulugeta
Journal:  Comput Intell Neurosci       Date:  2022-09-08

Review 6.  From molecules to genomic variations: Accelerating genome analysis via intelligent algorithms and architectures.

Authors:  Mohammed Alser; Joel Lindegger; Can Firtina; Nour Almadhoun; Haiyu Mao; Gagandeep Singh; Juan Gomez-Luna; Onur Mutlu
Journal:  Comput Struct Biotechnol J       Date:  2022-08-18       Impact factor: 6.155

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.