Literature DB >> 21592414

Accuracy and quality assessment of 454 GS-FLX Titanium pyrosequencing.

André Gilles1, Emese Meglécz, Nicolas Pech, Stéphanie Ferreira, Thibaut Malausa, Jean-François Martin.   

Abstract

BACKGROUND: The rapid evolution of 454 GS-FLX sequencing technology has not been accompanied by a reassessment of the quality and accuracy of the sequences obtained. Current strategies for decision-making and error-correction are based on an initial analysis by Huse et al. in 2007, for the older GS20 system based on experimental sequences. We analyze here the quality of 454 sequencing data and identify factors playing a role in sequencing error, through the use of an extensive dataset for Roche control DNA fragments.
RESULTS: We obtained a mean error rate for 454 sequences of 1.07%. More importantly, the error rate is not randomly distributed; it occasionally rose to more than 50% in certain positions, and its distribution was linked to several experimental variables. The main factors related to error are the presence of homopolymers, position in the sequence, size of the sequence and spatial localization in PT plates for insertion and deletion errors. These factors can be described by considering seven variables. No single variable can account for the error rate distribution, but most of the variation is explained by the combination of all seven variables.
CONCLUSIONS: The pattern identified here calls for the use of internal controls and error-correcting base callers, to correct for errors, when available (e.g. when sequencing amplicons). For shotgun libraries, the use of both sequencing primers and deep coverage, combined with the use of random sequencing primer sites should partly compensate for even high error rates, although it may prove more difficult than previous thought to distinguish between low-frequency alleles and errors.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21592414      PMCID: PMC3116506          DOI: 10.1186/1471-2164-12-245

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  23 in total

1.  Pyrobayes: an improved base caller for SNP discovery in pyrosequences.

Authors:  Aaron R Quinlan; Donald A Stewart; Michael P Strömberg; Gábor T Marth
Journal:  Nat Methods       Date:  2008-01-13       Impact factor: 28.547

2.  Estimation of allele frequencies from high-coverage genome-sequencing projects.

Authors:  Michael Lynch
Journal:  Genetics       Date:  2009-03-16       Impact factor: 4.562

3.  Massive parallel MHC genotyping: titanium that shines.

Authors:  K Mathias Wegner
Journal:  Mol Ecol       Date:  2009-03-20       Impact factor: 6.185

4.  Rapidly developing functional genomics in ecological model systems via 454 transcriptome sequencing.

Authors:  Christopher W Wheat
Journal:  Genetica       Date:  2008-10-18       Impact factor: 1.082

Review 5.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

6.  Increased throughput by parallelization of library preparation for massive sequencing.

Authors:  Sverker Lundin; Henrik Stranneheim; Erik Pettersson; Daniel Klevebring; Joakim Lundeberg
Journal:  PLoS One       Date:  2010-04-06       Impact factor: 3.240

7.  Ironing out the wrinkles in the rare biosphere through improved OTU clustering.

Authors:  Susan M Huse; David Mark Welch; Hilary G Morrison; Mitchell L Sogin
Journal:  Environ Microbiol       Date:  2010-03-11       Impact factor: 5.491

8.  High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies.

Authors:  Jean-Marc Aury; Corinne Cruaud; Valérie Barbe; Odile Rogier; Sophie Mangenot; Gaelle Samson; Julie Poulain; Véronique Anthouard; Claude Scarpelli; François Artiguenave; Patrick Wincker
Journal:  BMC Genomics       Date:  2008-12-16       Impact factor: 3.969

9.  Accuracy and quality of massively parallel DNA pyrosequencing.

Authors:  Susan M Huse; Julie A Huber; Hilary G Morrison; Mitchell L Sogin; David Mark Welch
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

10.  Substantial biases in ultra-short read data sets from high-throughput DNA sequencing.

Authors:  Juliane C Dohm; Claudio Lottaz; Tatiana Borodina; Heinz Himmelbauer
Journal:  Nucleic Acids Res       Date:  2008-07-26       Impact factor: 16.971

View more
  177 in total

1.  QuRe: software for viral quasispecies reconstruction from next-generation sequencing data.

Authors:  Mattia C F Prosperi; Marco Salemi
Journal:  Bioinformatics       Date:  2011-11-15       Impact factor: 6.937

2.  Niche specialization of terrestrial archaeal ammonia oxidizers.

Authors:  Cécile Gubry-Rangin; Brigitte Hai; Christopher Quince; Marion Engel; Bruce C Thomson; Phillip James; Michael Schloter; Robert I Griffiths; James I Prosser; Graeme W Nicol
Journal:  Proc Natl Acad Sci U S A       Date:  2011-12-08       Impact factor: 11.205

3.  PAPNC, a novel method to calculate nucleotide diversity from large scale next generation sequencing data.

Authors:  Wei Shao; Mary F Kearney; Valerie F Boltz; Jonathan E Spindler; John W Mellors; Frank Maldarelli; John M Coffin
Journal:  J Virol Methods       Date:  2014-03-26       Impact factor: 2.014

Review 4.  New virologic tools for management of chronic hepatitis B and C.

Authors:  Stéphane Chevaliez; Christophe Rodriguez; Jean-Michel Pawlotsky
Journal:  Gastroenterology       Date:  2012-05       Impact factor: 22.682

5.  Multiple ITS haplotypes in the genome of the lichenized basidiomycete Cora inversa (Hygrophoraceae): fact or artifact?

Authors:  Robert Lücking; James D Lawrey; Patrick M Gillevet; Masoumeh Sikaroodi; Manuela Dal-Forno; Simon A Berger
Journal:  J Mol Evol       Date:  2013-12-17       Impact factor: 2.395

6.  Indel and Carryforward Correction (ICC): a new analysis approach for processing 454 pyrosequencing data.

Authors:  Wenjie Deng; Brandon S Maust; Dylan H Westfall; Lennie Chen; Hong Zhao; Brendan B Larsen; Shyamala Iyer; Yi Liu; James I Mullins
Journal:  Bioinformatics       Date:  2013-07-29       Impact factor: 6.937

Review 7.  Analytical tools and databases for metagenomics in the next-generation sequencing era.

Authors:  Mincheol Kim; Ki-Hyun Lee; Seok-Whan Yoon; Bong-Soo Kim; Jongsik Chun; Hana Yi
Journal:  Genomics Inform       Date:  2013-09-30

8.  Transcriptomic messiness in the venom duct of Conus miles contributes to conotoxin diversity.

Authors:  Ai-hua Jin; Sébastien Dutertre; Quentin Kaas; Vincent Lavergne; Petra Kubala; Richard J Lewis; Paul F Alewood
Journal:  Mol Cell Proteomics       Date:  2013-09-16       Impact factor: 5.911

9.  Absence/presence calling in microarray-based CGH experiments with non-model organisms.

Authors:  Martijs J Jonker; Wim C de Leeuw; Marino Marinković; Floyd R A Wittink; Han Rauwerda; Oskar Bruning; Wim A Ensink; Ad C Fluit; C H Boel; Mark de Jong; Timo M Breit
Journal:  Nucleic Acids Res       Date:  2014-04-25       Impact factor: 16.971

10.  Deep sequencing reveals mutagenic effects of ribavirin during monotherapy of hepatitis C virus genotype 1-infected patients.

Authors:  Julia Dietz; Sven-Eric Schelhorn; Daniel Fitting; Ulrike Mihm; Simone Susser; Martin-Walter Welker; Caterina Füller; Martin Däumer; Gerlinde Teuber; Heiner Wedemeyer; Thomas Berg; Thomas Lengauer; Stefan Zeuzem; Eva Herrmann; Christoph Sarrazin
Journal:  J Virol       Date:  2013-03-27       Impact factor: 5.103

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.