Literature DB >> 33505833

Low-complexity and highly robust barcodes for error-rich single molecular sequencing.

Weigang Chen1,2, Panpan Wang1, Lixia Wang1, Dalu Zhang3, Mingzhe Han4, Mingyong Han5,6, Lifu Song2,4.   

Abstract

DNA barcodes are frequently corrupted due to insertion, deletion, and substitution errors during DNA synthesis, amplification and sequencing, resulting in index hopping. In this paper, we propose a new DNA barcode construction scheme that combines a cyclic block code with a predetermined pseudo-random sequence bit by bit to form bit pairs, and then converts the bit pairs to bases, i.e., the DNA barcodes. Then, we present a barcode identification scheme for noisy sequencing reads, which uses a combination of cyclic shifting and traditional dynamic programming to mark the insertion and deletion positions, and then performs erasure-and-error-correction decoding on the corrupted codewords. Furthermore, we verify the identification error rate of barcodes for multiple errors and evaluate the reliability of the barcodes in DNA context. This method can be easily generalized for constructing long barcodes, which may be used in scenarios with serious errors. Simulation results show that the bit error rate after identifying insertions/deletions is greatly reduced using the combination of cyclic shift and dynamic programming compared to using dynamic programming only. It indicates that the proposed method can effectively improve the accuracy for estimating insertion/deletion errors. And the overall identification error rate of the proposed method is lower than 10 - 5 when the probability of each base mutation is less than 0.1, which is the typical scenario in third-generation sequencing. © King Abdulaziz City for Science and Technology 2021.

Entities:  

Keywords:  Barcode construction; Barcode identification; DNA sequencing barcode deletions/insertions

Year:  2021        PMID: 33505833      PMCID: PMC7811498          DOI: 10.1007/s13205-020-02607-5

Source DB:  PubMed          Journal:  3 Biotech        ISSN: 2190-5738            Impact factor:   2.406


  19 in total

1.  Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplex.

Authors:  Micah Hamady; Jeffrey J Walker; J Kirk Harris; Nicholas J Gold; Rob Knight
Journal:  Nat Methods       Date:  2008-02-10       Impact factor: 28.547

2.  Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition.

Authors:  Andrew Adey; Hilary G Morrison; Xu Xun; Jacob O Kitzman; Emily H Turner; Bethany Stackhouse; Alexandra P MacKenzie; Nicholas C Caruccio; Xiuqing Zhang; Jay Shendure
Journal:  Genome Biol       Date:  2010-12-08       Impact factor: 13.583

3.  Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and genome analyzer systems.

Authors:  André E Minoche; Juliane C Dohm; Heinz Himmelbauer
Journal:  Genome Biol       Date:  2011-11-08       Impact factor: 13.583

4.  Insertion and deletion correcting DNA barcodes based on watermarks.

Authors:  David Kracht; Steffen Schober
Journal:  BMC Bioinformatics       Date:  2015-02-18       Impact factor: 3.169

5.  Nanopore sequencing and assembly of a human genome with ultra-long reads.

Authors:  Miten Jain; Sergey Koren; Karen H Miga; Josh Quick; Arthur C Rand; Thomas A Sasani; John R Tyson; Andrew D Beggs; Alexander T Dilthey; Ian T Fiddes; Sunir Malla; Hannah Marriott; Tom Nieto; Justin O'Grady; Hugh E Olsen; Brent S Pedersen; Arang Rhie; Hollian Richardson; Aaron R Quinlan; Terrance P Snutch; Louise Tee; Benedict Paten; Adam M Phillippy; Jared T Simpson; Nicholas J Loman; Matthew Loose
Journal:  Nat Biotechnol       Date:  2018-01-29       Impact factor: 54.908

6.  Sample-Index Misassignment Impacts Tumour Exome Sequencing.

Authors:  Daniel Vodák; Susanne Lorenz; Sigve Nakken; Lars Birger Aasheim; Harald Holte; Baoyan Bai; Ola Myklebost; Leonardo A Meza-Zepeda; Eivind Hovig
Journal:  Sci Rep       Date:  2018-03-28       Impact factor: 4.379

7.  BARCOSEL: a tool for selecting an optimal barcode set for high-throughput sequencing.

Authors:  Panu Somervuo; Patrik Koskinen; Peng Mei; Liisa Holm; Petri Auvinen; Lars Paulin
Journal:  BMC Bioinformatics       Date:  2018-07-05       Impact factor: 3.169

8.  Large-scale DNA Barcode Library Generation for Biomolecule Identification in High-throughput Screens.

Authors:  Eli Lyons; Paul Sheridan; Georg Tremmel; Satoru Miyano; Sumio Sugano
Journal:  Sci Rep       Date:  2017-10-24       Impact factor: 4.379

9.  Indel-correcting DNA barcodes for high-throughput sequencing.

Authors:  John A Hawkins; Stephen K Jones; Ilya J Finkelstein; William H Press
Journal:  Proc Natl Acad Sci U S A       Date:  2018-06-20       Impact factor: 11.205

10.  Barcode identification for single cell genomics.

Authors:  Akshay Tambe; Lior Pachter
Journal:  BMC Bioinformatics       Date:  2019-01-17       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.