QAlign: aligning nanopore reads accurately using current-level modeling.

Dhaivat Joshi¹, Shunfu Mao², Sreeram Kannan², Suhas Diggavi¹.

Abstract

MOTIVATION: Efficient and accurate alignment of DNA/RNA sequence reads to each other or to a reference genome/transcriptome is an important problem in genomic analysis. Nanopore sequencing has emerged as a major sequencing technology, and many long-read aligners have been designed for aligning nanopore reads. However, the high error rate makes accurate and efficient alignment difficult. Properly exploiting the noise and error characteristics inherent in the sequencing process can play a vital role in constructing a robust aligner. In this article, we design QAlign, a pre-processor that can be used with any long-read aligner for aligning long reads to a genome/transcriptome or to other long reads. The key idea in QAlign is to convert the nucleotide reads into discretized current levels that capture the error modes of the nanopore sequencer before running them through a sequence aligner.
RESULTS: We show that QAlign is able to improve alignment rates from around 80% up to 90% with nanopore reads when aligning to the genome. We also show that QAlign improves the average overlap quality by 9.2%, 2.5% and 10.8% in three real datasets for read-to-read alignment. Read-to-transcriptome alignment rates are improved from 51.6% to 75.4% and from 82.6% to 90% in two real datasets.
AVAILABILITY AND IMPLEMENTATION: https://github.com/joshidhaivat/QAlign.git.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
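
The key idea stated in the abstract, converting nucleotide sequences into discretized current levels before alignment, can be illustrated with a minimal sketch. This is not the QAlign implementation: the k-mer length `K`, the `toy_pore_model` table (whose current values are invented for illustration), the thresholds, and the function names are all assumptions made here. A real pipeline would presumably take its k-mer current table from a published pore model and choose the quantization thresholds accordingly.

```python
from itertools import product

K = 3  # toy k-mer length (assumption; chosen only to keep the example small)

# Hypothetical per-base contributions; the numbers are invented, not measured.
BASE_WEIGHT = {"A": 0.0, "C": 1.0, "G": 2.0, "T": 3.0}


def toy_pore_model(k=K):
    """Build a made-up table mapping every k-mer to a mean current level (pA)."""
    model = {}
    for kmer in map("".join, product("ACGT", repeat=k)):
        model[kmer] = 80.0 + 5.0 * sum(BASE_WEIGHT[b] for b in kmer)
    return model


def quantize(seq, model, thresholds, k=K):
    """Slide a k-mer window over seq and emit one discrete level per k-mer.

    The level is the number of thresholds that the k-mer's current meets or
    exceeds, so len(thresholds) + 1 distinct symbols are possible.
    """
    levels = []
    for i in range(len(seq) - k + 1):
        current = model[seq[i:i + k]]
        level = sum(current >= t for t in thresholds)
        levels.append(str(level))
    return "".join(levels)


if __name__ == "__main__":
    model = toy_pore_model()
    # Two thresholds give a three-symbol alphabet: 0, 1, 2.
    print(quantize("ACGTAC", model, thresholds=[97.5, 107.5]))
```

In this sketch both the reads and the reference would be passed through the same `quantize` step, and the resulting level strings would then be handed to an existing long-read aligner, matching the pre-processor role described in the abstract.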

Year: 2021 PMID: 33051648 PMCID: PMC8097683 DOI: 10.1093/bioinformatics/btaa875

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937