Literature DB >> 33051648

QAlign: aligning nanopore reads accurately using current-level modeling.

Dhaivat Joshi1, Shunfu Mao2, Sreeram Kannan2, Suhas Diggavi1.   

Abstract

MOTIVATION: Efficient and accurate alignment of DNA/RNA sequence reads to each other or to a reference genome/transcriptome is an important problem in genomic analysis. Nanopore sequencing has emerged as a major sequencing technology and many long-read aligners have been designed for aligning nanopore reads. However, the high error rate makes accurate and efficient alignment difficult. Utilizing the noise and error characteristics inherent in the sequencing process properly can play a vital role in constructing a robust aligner. In this article, we design QAlign, a pre-processor that can be used with any long-read aligner for aligning long reads to a genome/transcriptome or to other long reads. The key idea in QAlign is to convert the nucleotide reads into discretized current levels that capture the error modes of the nanopore sequencer before running it through a sequence aligner.
RESULTS: We show that QAlign is able to improve alignment rates from around 80% up to 90% with nanopore reads when aligning to the genome. We also show that QAlign improves the average overlap quality by 9.2, 2.5 and 10.8% in three real datasets for read-to-read alignment. Read-to-transcriptome alignment rates are improved from 51.6% to 75.4% and 82.6% to 90% in two real datasets.
AVAILABILITY AND IMPLEMENTATION: https://github.com/joshidhaivat/QAlign.git. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2021        PMID: 33051648      PMCID: PMC8097683          DOI: 10.1093/bioinformatics/btaa875

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

2.  De novo peptide sequencing by deep learning.

Authors:  Ngoc Hieu Tran; Xianglilan Zhang; Lei Xin; Baozhen Shan; Ming Li
Journal:  Proc Natl Acad Sci U S A       Date:  2017-07-18       Impact factor: 11.205

3.  mRNA-Seq whole-transcriptome analysis of a single cell.

Authors:  Fuchou Tang; Catalin Barbacioru; Yangzhou Wang; Ellen Nordman; Clarence Lee; Nanlan Xu; Xiaohui Wang; John Bodeau; Brian B Tuch; Asim Siddiqui; Kaiqin Lao; M Azim Surani
Journal:  Nat Methods       Date:  2009-04-06       Impact factor: 28.547

4.  Resolving multicopy duplications de novo using polyploid phasing.

Authors:  Mark J Chaisson; Sudipto Mukherjee; Sreeram Kannan; Evan E Eichler
Journal:  Res Comput Mol Biol       Date:  2017-04-12

5.  A complete bacterial genome assembled de novo using only nanopore sequencing data.

Authors:  Nicholas J Loman; Joshua Quick; Jared T Simpson
Journal:  Nat Methods       Date:  2015-06-15       Impact factor: 28.547

6.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

7.  Completing bacterial genome assemblies with multiplex MinION sequencing.

Authors:  Ryan R Wick; Louise M Judd; Claire L Gorrie; Kathryn E Holt
Journal:  Microb Genom       Date:  2017-09-14

8.  DeepSimulator: a deep simulator for Nanopore sequencing.

Authors:  Yu Li; Renmin Han; Chongwei Bi; Mo Li; Sheng Wang; Xin Gao
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

9.  The UCSC Genome Browser database: 2019 update.

Authors:  Maximilian Haeussler; Ann S Zweig; Cath Tyner; Matthew L Speir; Kate R Rosenbloom; Brian J Raney; Christopher M Lee; Brian T Lee; Angie S Hinrichs; Jairo Navarro Gonzalez; David Gibson; Mark Diekhans; Hiram Clawson; Jonathan Casper; Galt P Barber; David Haussler; Robert M Kuhn; W James Kent
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  Structural variants identified by Oxford Nanopore PromethION sequencing of the human genome.

Authors:  Wouter De Coster; Peter De Rijk; Arne De Roeck; Tim De Pooter; Svenn D'Hert; Mojca Strazisar; Kristel Sleegers; Christine Van Broeckhoven
Journal:  Genome Res       Date:  2019-06-11       Impact factor: 9.043

View more
  3 in total

Review 1.  Nanopore sequencing technology, bioinformatics and applications.

Authors:  Yunhao Wang; Yue Zhao; Audrey Bollas; Yuru Wang; Kin Fai Au
Journal:  Nat Biotechnol       Date:  2021-11-08       Impact factor: 54.908

2.  LazyB: fast and cheap genome assembly.

Authors:  Thomas Gatter; Sarah von Löhneysen; Jörg Fallmann; Polina Drozdova; Tom Hartmann; Peter F Stadler
Journal:  Algorithms Mol Biol       Date:  2021-06-01       Impact factor: 1.405

Review 3.  Metagenomics: a path to understanding the gut microbiome.

Authors:  Sandi Yen; Jethro S Johnson
Journal:  Mamm Genome       Date:  2021-07-14       Impact factor: 2.957

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.