A Benchmark Study on Error Assessment and Quality Control of CCS Reads Derived from the PacBio RS.

Xiaoli Jiao, Xin Zheng, Liang Ma, Geetha Kutty, Emile Gogineni, Qiang Sun, Brad T Sherman, Xiaojun Hu, Kristine Jones, Castle Raley, Bao Tran, David J Munroe, Robert Stephens, Dun Liang, Tomozumi Imamichi, Joseph A Kovacs, Richard A Lempicki, Da Wei Huang.

Abstract

PacBio RS, a newly emerging third-generation DNA sequencing platform, is based on a real-time, single-molecule, nano-nitch sequencing technology that can generate very long reads (up to 20 kb), in contrast to the shorter reads produced by first- and second-generation sequencing technologies. Because the platform is new, it is important to assess its sequencing error rate as well as the quality control (QC) parameters associated with PacBio sequence data. In this study, a mixture of 10 previously known, closely related DNA amplicons was sequenced on the PacBio RS. After aligning the resulting Circular Consensus Sequence (CCS) reads to the known reference sequences, we found that the median error rate was 2.5% without read QC and improved to 1.3% with an SVM-based multi-parameter QC method. In addition, de novo assembly was used as a downstream application to evaluate the effects of different QC approaches. This benchmark study indicates that, even though CCS reads are already error-corrected, appropriate QC of CCS reads is still necessary for successful downstream bioinformatics analyses.
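As a rough illustration of the kind of measurement the abstract describes, the sketch below computes a per-read error rate from alignment counts and takes the median across reads, with and without a QC filter. It is not the authors' method: the abstract does not give the error-rate formula, so defining it as (mismatches + insertions + deletions) / aligned length is an assumption, and a simple pass-number threshold stands in for the SVM-based multi-parameter QC. The field names and the toy reads are hypothetical.

```python
from statistics import median


def read_error_rate(mismatches, insertions, deletions, aligned_length):
    """Per-read error rate: assumed here to be all alignment errors
    divided by the aligned length (the abstract gives no formula)."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return (mismatches + insertions + deletions) / aligned_length


def median_error_rate(reads, min_passes=0):
    """Median error rate over reads with at least `min_passes` passes.

    The pass-number cutoff is a simple stand-in for QC, not the
    SVM-based method used in the study.
    """
    rates = [
        read_error_rate(
            r["mismatches"], r["insertions"], r["deletions"], r["aligned_length"]
        )
        for r in reads
        if r["passes"] >= min_passes
    ]
    if not rates:
        raise ValueError("no reads left after filtering")
    return median(rates)


# Hypothetical toy data: one dict per aligned CCS read.
toy_reads = [
    {"mismatches": 1, "insertions": 1, "deletions": 0, "aligned_length": 100, "passes": 5},
    {"mismatches": 1, "insertions": 0, "deletions": 0, "aligned_length": 100, "passes": 8},
    {"mismatches": 2, "insertions": 2, "deletions": 1, "aligned_length": 100, "passes": 2},
]

unfiltered = median_error_rate(toy_reads)
filtered = median_error_rate(toy_reads, min_passes=3)
```

On the toy data, dropping the low-pass read lowers the median error rate, which mirrors the direction of the improvement reported in the abstract, though the numbers themselves are invented.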

Keywords:  CCS read; PacBio; SVM regression; assembly; pass number; quality control (QC); quality value (QV)

Year:  2013        PMID: 24179701      PMCID: PMC3811116          DOI: 10.4172/2153-0602.1000136

Source DB:  PubMed          Journal:  J Data Mining Genomics Proteomics


  16 in total

1.  Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing.

Authors:  Joke Reumers; Peter De Rijk; Hui Zhao; Anthony Liekens; Dominiek Smeets; John Cleary; Peter Van Loo; Maarten Van Den Bossche; Kirsten Catthoor; Bernard Sabbe; Evelyn Despierre; Ignace Vergote; Brian Hilbush; Diether Lambrechts; Jurgen Del-Favero
Journal:  Nat Biotechnol       Date:  2011-12-18       Impact factor: 54.908

2.  Real-time sequencing.

Authors:  Thomas D Otto
Journal:  Nat Rev Microbiol       Date:  2011-08-12       Impact factor: 60.633

3.  Field guide to next-generation DNA sequencers.

Authors:  Travis C Glenn
Journal:  Mol Ecol Resour       Date:  2011-05-19       Impact factor: 7.090

Review 4.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

5.  A hybrid approach for the automated finishing of bacterial genomes.

Authors:  Ali Bashir; Aaron Klammer; William P Robins; Chen-Shan Chin; Dale Webster; Ellen Paxinos; David Hsu; Meredith Ashby; Susana Wang; Paul Peluso; Robert Sebra; Jon Sorenson; James Bullard; Jackie Yen; Marie Valdovino; Emilia Mollova; Khai Luong; Steven Lin; Brianna LaMay; Amruta Joshi; Lori Rowe; Michael Frace; Cheryl L Tarr; Maryann Turnsek; Brigid M Davis; Andrew Kasarskis; John J Mekalanos; Matthew K Waldor; Eric E Schadt
Journal:  Nat Biotechnol       Date:  2012-07-01       Impact factor: 54.908

6.  Variation in the major surface glycoprotein genes in Pneumocystis jirovecii.

Authors:  Geetha Kutty; Frank Maldarelli; Guillaume Achaz; Joseph A Kovacs
Journal:  J Infect Dis       Date:  2008-09-01       Impact factor: 5.226

7.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

8.  A tale of three next generation sequencing platforms: comparison of Ion Torrent, Pacific Biosciences and Illumina MiSeq sequencers.

Authors:  Michael A Quail; Miriam Smith; Paul Coupland; Thomas D Otto; Simon R Harris; Thomas R Connor; Anna Bertoni; Harold P Swerdlow; Yong Gu
Journal:  BMC Genomics       Date:  2012-07-24       Impact factor: 3.969

9.  Hybrid error correction and de novo assembly of single-molecule sequencing reads.

Authors:  Sergey Koren; Michael C Schatz; Brian P Walenz; Jeffrey Martin; Jason T Howard; Ganeshkumar Ganapathy; Zhong Wang; David A Rasko; W Richard McCombie; Erich D Jarvis
Journal:  Nat Biotechnol       Date:  2012-07-01       Impact factor: 54.908

10.  Pacific biosciences sequencing technology for genotyping and variation discovery in human data.

Authors:  Mauricio O Carneiro; Carsten Russ; Michael G Ross; Stacey B Gabriel; Chad Nusbaum; Mark A DePristo
Journal:  BMC Genomics       Date:  2012-08-05       Impact factor: 3.969

  34 in total

1.  Next generation multilocus sequence typing (NGMLST) and the analytical software program MLSTEZ enable efficient, cost-effective, high-throughput, multilocus sequencing typing.

Authors:  Yuan Chen; Aubrey E Frazzitta; Anastasia P Litvintseva; Charles Fang; Thomas G Mitchell; Deborah J Springer; Yun Ding; George Yuan; John R Perfect
Journal:  Fungal Genet Biol       Date:  2015-01-24       Impact factor: 3.495

2.  MICADo - Looking for Mutations in Targeted PacBio Cancer Data: An Alignment-Free Method.

Authors:  Justine Rudewicz; Hayssam Soueidan; Raluca Uricaru; Hervé Bonnefoi; Richard Iggo; Jonas Bergh; Macha Nikolski
Journal:  Front Genet       Date:  2016-12-08       Impact factor: 4.599

3.  Kinetics of Genetic Variation of the Mycoplasma genitalium MG192 Gene in Experimentally Infected Chimpanzees.

Authors:  Liang Ma; Jørgen S Jensen; Miriam Mancuso; Leann Myers; David H Martin
Journal:  Infect Immun       Date:  2015-12-28       Impact factor: 3.441

4.  Extensive variation and rapid shift of the MG192 sequence in Mycoplasma genitalium strains from patients with chronic infection.

Authors:  Liang Ma; Miriam Mancuso; James A Williams; Barbara Van Der Pol; J Dennis Fortenberry; Qiuyao Jia; Leann Myers; David H Martin
Journal:  Infect Immun       Date:  2014-01-06       Impact factor: 3.441

5.  Conformation-dependent epitopes recognized by prion protein antibodies probed using mutational scanning and deep sequencing.

Authors:  Kyle M Doolan; David W Colby
Journal:  J Mol Biol       Date:  2014-11-07       Impact factor: 5.469

6.  Bioengineered AAV Capsids with Combined High Human Liver Transduction In Vivo and Unique Humoral Seroreactivity.

Authors:  Nicole K Paulk; Katja Pekrun; Erhua Zhu; Sean Nygaard; Bin Li; Jianpeng Xu; Kirk Chu; Christian Leborgne; Allison P Dane; Annelise Haft; Yue Zhang; Feijie Zhang; Chris Morton; Marcus B Valentine; Andrew M Davidoff; Amit C Nathwani; Federico Mingozzi; Markus Grompe; Ian E Alexander; Leszek Lisowski; Mark A Kay
Journal:  Mol Ther       Date:  2017-09-25       Impact factor: 11.454

7.  Accelerated cloning of a potato late blight-resistance gene using RenSeq and SMRT sequencing.

Authors:  Kamil Witek; Florian Jupe; Agnieszka I Witek; David Baker; Matthew D Clark; Jonathan D G Jones
Journal:  Nat Biotechnol       Date:  2016-04-25       Impact factor: 54.908

Review 8.  A Molecular Window into the Biology and Epidemiology of Pneumocystis spp.

Authors:  Liang Ma; Ousmane H Cissé; Joseph A Kovacs
Journal:  Clin Microbiol Rev       Date:  2018-06-13       Impact factor: 26.132

9.  Scaffolding of a bacterial genome using MinION nanopore sequencing.

Authors:  E Karlsson; A Lärkeryd; A Sjödin; M Forsman; P Stenberg
Journal:  Sci Rep       Date:  2015-07-07       Impact factor: 4.379

10.  SSPACE-LongRead: scaffolding bacterial draft genomes using long read sequence information.

Authors:  Marten Boetzer; Walter Pirovano
Journal:  BMC Bioinformatics       Date:  2014-06-20       Impact factor: 3.169
