Literature DB >> 18212088

Quality scores and SNP detection in sequencing-by-synthesis systems.

William Brockman1, Pablo Alvarez, Sarah Young, Manuel Garber, Georgia Giannoukos, William L Lee, Carsten Russ, Eric S Lander, Chad Nusbaum, David B Jaffe.   

Abstract

Promising new sequencing technologies, based on sequencing-by-synthesis (SBS), are starting to deliver large amounts of DNA sequence at very low cost. Polymorphism detection is a key application. We describe general methods for improved quality scores and accurate automated polymorphism detection, and apply them to data from the Roche (454) Genome Sequencer 20. We assess our methods using known-truth data sets, which is critical to the validity of the assessments. We developed informative, base-by-base error predictors for this sequencer and used a variant of the phred binning algorithm to combine them into a single empirically derived quality score. These quality scores are more useful than those produced by the system software: They both better predict actual error rates and identify many more high-quality bases. We developed a SNP detection method, with variants for low coverage, high coverage, and PCR amplicon applications, and evaluated it on known-truth data sets. We demonstrate good specificity in single reads, and excellent specificity (no false positives in 215 kb of genome) in high-coverage data.

Mesh:

Year:  2008        PMID: 18212088      PMCID: PMC2336812          DOI: 10.1101/gr.070227.107

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  9 in total

1.  An SNP map of the human genome generated by reduced representation shotgun sequencing.

Authors:  D Altshuler; V J Pollara; C R Cowles; W J Van Etten; J Baldwin; L Linton; E S Lander
Journal:  Nature       Date:  2000-09-28       Impact factor: 49.962

2.  Basecalling with LifeTrace.

Authors:  D Walther; G Bartha; M Morris
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

3.  The International HapMap Project.

Authors: 
Journal:  Nature       Date:  2003-12-18       Impact factor: 49.962

4.  Genome sequencing in microfabricated high-density picolitre reactors.

Authors:  Marcel Margulies; Michael Egholm; William E Altman; Said Attiya; Joel S Bader; Lisa A Bemben; Jan Berka; Michael S Braverman; Yi-Ju Chen; Zhoutao Chen; Scott B Dewell; Lei Du; Joseph M Fierro; Xavier V Gomes; Brian C Godwin; Wen He; Scott Helgesen; Chun Heen Ho; Chun He Ho; Gerard P Irzyk; Szilveszter C Jando; Maria L I Alenquer; Thomas P Jarvie; Kshama B Jirage; Jong-Bum Kim; James R Knight; Janna R Lanza; John H Leamon; Steven M Lefkowitz; Ming Lei; Jing Li; Kenton L Lohman; Hong Lu; Vinod B Makhijani; Keith E McDade; Michael P McKenna; Eugene W Myers; Elizabeth Nickerson; John R Nobile; Ramona Plant; Bernard P Puc; Michael T Ronan; George T Roth; Gary J Sarkis; Jan Fredrik Simons; John W Simpson; Maithreyan Srinivasan; Karrie R Tartaro; Alexander Tomasz; Kari A Vogt; Greg A Volkmer; Shally H Wang; Yong Wang; Michael P Weiner; Pengguang Yu; Richard F Begley; Jonathan M Rothberg
Journal:  Nature       Date:  2005-07-31       Impact factor: 49.962

5.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment.

Authors:  B Ewing; L Hillier; M C Wendl; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

6.  Base-calling of automated sequencer traces using phred. II. Error probabilities.

Authors:  B Ewing; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

7.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms.

Authors:  R Sachidanandam; D Weissman; S C Schmidt; J M Kakol; L D Stein; G Marth; S Sherry; J C Mullikin; B J Mortimore; D L Willey; S E Hunt; C G Cole; P C Coggill; C M Rice; Z Ning; J Rogers; D R Bentley; P Y Kwok; E R Mardis; R T Yeh; B Schultz; L Cook; R Davenport; M Dante; L Fulton; L Hillier; R H Waterston; J D McPherson; B Gilman; S Schaffner; W J Van Etten; D Reich; J Higgins; M J Daly; B Blumenstiel; J Baldwin; N Stange-Thomann; M C Zody; L Linton; E S Lander; D Altshuler
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

8.  Whole-genome sequence assembly for mammalian genomes: Arachne 2.

Authors:  David B Jaffe; Jonathan Butler; Sante Gnerre; Evan Mauceli; Kerstin Lindblad-Toh; Jill P Mesirov; Michael C Zody; Eric S Lander
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

9.  The UCSC genome browser database: update 2007.

Authors:  R M Kuhn; D Karolchik; A S Zweig; H Trumbower; D J Thomas; A Thakkapallayil; C W Sugnet; M Stanke; K E Smith; A Siepel; K R Rosenbloom; B Rhead; B J Raney; A Pohl; J S Pedersen; F Hsu; A S Hinrichs; R A Harte; M Diekhans; H Clawson; G Bejerano; G P Barber; R Baertsch; D Haussler; W J Kent
Journal:  Nucleic Acids Res       Date:  2006-11-16       Impact factor: 16.971

  9 in total
  136 in total

Review 1.  Call for a quality standard for sequence-based assays in clinical microbiology: necessity for quality assessment of sequences used in microbial identification and typing.

Authors:  Anthony Underwood; Jonathan Green
Journal:  J Clin Microbiol       Date:  2010-11-10       Impact factor: 5.948

2.  Genome-wide patterns of intrahuman dengue virus diversity reveal associations with viral phylogenetic clade and interhost diversity.

Authors:  Poornima Parameswaran; Patrick Charlebois; Yolanda Tellez; Andrea Nunez; Elizabeth M Ryan; Christine M Malboeuf; Joshua Z Levin; Niall J Lennon; Angel Balmaseda; Eva Harris; Matthew R Henn
Journal:  J Virol       Date:  2012-05-30       Impact factor: 5.103

3.  Metagenomic reconstructions of bacterial CRISPR loci constrain population histories.

Authors:  Christine L Sun; Brian C Thomas; Rodolphe Barrangou; Jillian F Banfield
Journal:  ISME J       Date:  2015-09-22       Impact factor: 10.302

4.  mlRho - a program for estimating the population mutation and recombination rates from shotgun-sequenced diploid genomes.

Authors:  Bernhard Haubold; Peter Pfaffelhuber; Michael Lynch
Journal:  Mol Ecol       Date:  2010-03       Impact factor: 6.185

5.  SNP detection for massively parallel whole-genome resequencing.

Authors:  Ruiqiang Li; Yingrui Li; Xiaodong Fang; Huanming Yang; Jian Wang; Karsten Kristiansen; Jun Wang
Journal:  Genome Res       Date:  2009-05-06       Impact factor: 9.043

6.  Efficient frequency-based de novo short-read clustering for error trimming in next-generation sequencing.

Authors:  Wei Qu; Shin-Ichi Hashimoto; Shinichi Morishita
Journal:  Genome Res       Date:  2009-05-13       Impact factor: 9.043

7.  Synergy of two reference genomes for the grass family.

Authors:  Joachim Messing
Journal:  Plant Physiol       Date:  2009-01       Impact factor: 8.340

8.  Rapidly developing functional genomics in ecological model systems via 454 transcriptome sequencing.

Authors:  Christopher W Wheat
Journal:  Genetica       Date:  2008-10-18       Impact factor: 1.082

9.  Population genetic inference from resequencing data.

Authors:  Rong Jiang; Simon Tavaré; Paul Marjoram
Journal:  Genetics       Date:  2008-11-03       Impact factor: 4.562

10.  PhredEM: a phred-score-informed genotype-calling approach for next-generation sequencing studies.

Authors:  Peizhou Liao; Glen A Satten; Yi-Juan Hu
Journal:  Genet Epidemiol       Date:  2017-05-31       Impact factor: 2.135

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.