Literature DB >> 28868335

Fundamental Bounds for Sequence Reconstruction from Nanopore Sequencers.

Abram Magner1, Jarosław Duda2, Wojciech Szpankowski3, Ananth Grama4.   

Abstract

Nanopore sequencers are emerging as promising new platforms for high-throughput sequencing. As with other technologies, sequencer errors pose a major challenge for their effective use. In this paper, we present a novel information theoretic analysis of the impact of insertion-deletion (indel) errors in nanopore sequencers. In particular, we consider the following problems: (i) for given indel error characteristics and rate, what is the probability of accurate reconstruction as a function of sequence length; (ii) using replicated extrusion (the process of passing a DNA strand through the nanopore), what is the number of replicas needed to accurately reconstruct the true sequence with high probability? Our results provide a number of important insights: (i) the probability of accurate reconstruction of a sequence from a single sample in the presence of indel errors tends quickly (i.e., exponentially) to zero as the length of the sequence increases; and (ii) replicated extrusion is an effective technique for accurate reconstruction. We show that for typical distributions of indel errors, the required number of replicas is a slow function (polylogarithmic) of sequence length - implying that through replicated extrusion, we can sequence large reads using nanopore sequencers. Moreover, we show that in certain cases, the required number of replicas can be related to information-theoretic parameters of the indel error distributions.

Entities:  

Year:  2016        PMID: 28868335      PMCID: PMC5575792          DOI: 10.1109/TMBMC.2016.2630056

Source DB:  PubMed          Journal:  IEEE Trans Mol Biol Multiscale Commun        ISSN: 2332-7804


  13 in total

Review 1.  Characterization of nucleic acids by nanopore analysis.

Authors:  David W Deamer; Daniel Branton
Journal:  Acc Chem Res       Date:  2002-10       Impact factor: 22.384

2.  The accuracy of DNA sequences: estimating sequence quality.

Authors:  G A Churchill; M S Waterman
Journal:  Genomics       Date:  1992-09       Impact factor: 5.736

3.  A first look at the Oxford Nanopore MinION sequencer.

Authors:  Alexander S Mikheyev; Mandy M Y Tin
Journal:  Mol Ecol Resour       Date:  2014-09-24       Impact factor: 7.090

4.  Error analysis of idealized nanopore sequencing.

Authors:  Christopher R O'Donnell; Hongyun Wang; William B Dunbar
Journal:  Electrophoresis       Date:  2013-08       Impact factor: 3.535

5.  PBSIM: PacBio reads simulator--toward accurate genome assembly.

Authors:  Yukiteru Ono; Kiyoshi Asai; Michiaki Hamada
Journal:  Bioinformatics       Date:  2012-11-04       Impact factor: 6.937

6.  Error rates for nanopore discrimination among cytosine, methylcytosine, and hydroxymethylcytosine along individual DNA strands.

Authors:  Jacob Schreiber; Zachary L Wescoe; Robin Abu-Shumays; John T Vivian; Baldandorj Baatar; Kevin Karplus; Mark Akeson
Journal:  Proc Natl Acad Sci U S A       Date:  2013-10-28       Impact factor: 11.205

7.  A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers.

Authors:  Michael A Quail; Miriam Smith; Paul Coupland; Thomas D Otto; Simon R Harris; Thomas R Connor; Anna Bertoni; Harold P Swerdlow; Yong Gu
Journal:  BMC Genomics       Date:  2012-07-24       Impact factor: 3.969

8.  Assessing the performance of the Oxford Nanopore Technologies MinION.

Authors:  T Laver; J Harrison; P A O'Neill; K Moore; A Farbos; K Paszkiewicz; D J Studholme
Journal:  Biomol Detect Quantif       Date:  2015-03

9.  Hybrid error correction and de novo assembly of single-molecule sequencing reads.

Authors:  Sergey Koren; Michael C Schatz; Brian P Walenz; Jeffrey Martin; Jason T Howard; Ganeshkumar Ganapathy; Zhong Wang; David A Rasko; W Richard McCombie; Erich D Jarvis
Journal:  Nat Biotechnol       Date:  2012-07-01       Impact factor: 54.908

10.  Pacific biosciences sequencing technology for genotyping and variation discovery in human data.

Authors:  Mauricio O Carneiro; Carsten Russ; Michael G Ross; Stacey B Gabriel; Chad Nusbaum; Mark A DePristo
Journal:  BMC Genomics       Date:  2012-08-05       Impact factor: 3.969

View more
  1 in total

1.  Trace Reconstruction Problems in Computational Biology.

Authors:  Vinnu Bhardwaj; Pavel A Pevzner; Cyrus Rashtchian; Yana Safonova
Journal:  IEEE Trans Inf Theory       Date:  2020-10-13       Impact factor: 2.978

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.