Literature DB >> 14744981

Automated correction of genome sequence errors.

Pawel Gajer1, Michael Schatz, Steven L Salzberg.   

Abstract

By using information from an assembly of a genome, a new program called AutoEditor significantly improves base calling accuracy over that achieved by previous algorithms. This in turn improves the overall accuracy of genome sequences and facilitates the use of these sequences for polymorphism discovery. We describe the algorithm and its application in a large set of recent genome sequencing projects. The number of erroneous base calls in these projects was reduced by 80%. In an analysis of over one million corrections, we found that AutoEditor made just one error per 8828 corrections. By substantially increasing the accuracy of base calling, AutoEditor can dramatically accelerate the process of finishing genomes, which involves closing all gaps and ensuring minimum quality standards for the final sequence. It also greatly improves our ability to discover single nucleotide polymorphisms (SNPs) between closely related strains and isolates of the same species.

Mesh:

Year:  2004        PMID: 14744981      PMCID: PMC373340          DOI: 10.1093/nar/gkh216

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  10 in total

1.  Variation is the spice of life.

Authors:  L Kruglyak; D A Nickerson
Journal:  Nat Genet       Date:  2001-03       Impact factor: 38.330

2.  Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

Authors:  N Patil; A J Berno; D A Hinds; W A Barrett; J M Doshi; C R Hacker; C R Kautzer; D H Lee; C Marjoribanks; D P McDonough; B T Nguyen; M C Norris; J B Sheehan; N Shen; D Stern; R P Stokowski; D J Thomas; M O Trulson; K R Vyas; K A Frazer; S P Fodor; D R Cox
Journal:  Science       Date:  2001-11-23       Impact factor: 47.728

3.  An Eulerian path approach to DNA fragment assembly.

Authors:  P A Pevzner; H Tang; M S Waterman
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-14       Impact factor: 11.205

4.  Correcting errors in shotgun sequences.

Authors:  Martti T Tammi; Erik Arner; Ellen Kindlund; Björn Andersson
Journal:  Nucleic Acids Res       Date:  2003-08-01       Impact factor: 16.971

5.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment.

Authors:  B Ewing; L Hillier; M C Wendl; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

6.  Base-calling of automated sequencer traces using phred. II. Error probabilities.

Authors:  B Ewing; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

7.  A whole-genome assembly of Drosophila.

Authors:  E W Myers; G G Sutton; A L Delcher; I M Dew; D P Fasulo; M J Flanigan; S A Kravitz; C M Mobarry; K H Reinert; K A Remington; E L Anson; R A Bolanos; H H Chou; C M Jordan; A L Halpern; S Lonardi; E M Beasley; R C Brandon; L Chen; P J Dunn; Z Lai; Y Liang; D R Nusskern; M Zhan; Q Zhang; X Zheng; G M Rubin; M D Adams; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

8.  Comparative genome sequencing for discovery of novel polymorphisms in Bacillus anthracis.

Authors:  Timothy D Read; Steven L Salzberg; Mihai Pop; Martin Shumway; Lowell Umayam; Lingxia Jiang; Erik Holtzapple; Joseph D Busch; Kimothy L Smith; James M Schupp; Daniel Solomon; Paul Keim; Claire M Fraser
Journal:  Science       Date:  2002-05-09       Impact factor: 47.728

9.  ARACHNE: a whole-genome shotgun assembler.

Authors:  Serafim Batzoglou; David B Jaffe; Ken Stanley; Jonathan Butler; Sante Gnerre; Evan Mauceli; Bonnie Berger; Jill P Mesirov; Eric S Lander
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

10.  Single nucleotide polymorphisms in Mycobacterium tuberculosis structural genes.

Authors:  J M Musser
Journal:  Emerg Infect Dis       Date:  2001 May-Jun       Impact factor: 6.883

  10 in total
  21 in total

1.  Comparative genomic analysis of eutherian tumor necrosis factor ligand genes.

Authors:  Marko Premzl
Journal:  Immunogenetics       Date:  2015-12-09       Impact factor: 2.846

2.  Inference of population genetic parameters in metagenomics: a clean look at messy data.

Authors:  Philip L F Johnson; Montgomery Slatkin
Journal:  Genome Res       Date:  2006-09-05       Impact factor: 9.043

3.  ECHO: a reference-free short-read error correction algorithm.

Authors:  Wei-Chun Kao; Andrew H Chan; Yun S Song
Journal:  Genome Res       Date:  2011-04-11       Impact factor: 9.043

4.  Molecular characterization of a new species in the genus Alphacoronavirus associated with mink epizootic catarrhal gastroenteritis.

Authors:  Anastasia N Vlasova; Rebecca Halpin; Shiliang Wang; Elodie Ghedin; David J Spiro; Linda J Saif
Journal:  J Gen Virol       Date:  2011-02-23       Impact factor: 3.891

5.  A post-assembly genome-improvement toolkit (PAGIT) to obtain annotated genomes from contigs.

Authors:  Martin T Swain; Isheng J Tsai; Samual A Assefa; Chris Newbold; Matthew Berriman; Thomas D Otto
Journal:  Nat Protoc       Date:  2012-06-07       Impact factor: 13.491

6.  Iterative Correction of Reference Nucleotides (iCORN) using second generation sequencing technology.

Authors:  Thomas D Otto; Mandy Sanders; Matthew Berriman; Chris Newbold
Journal:  Bioinformatics       Date:  2010-06-18       Impact factor: 6.937

7.  Human population differentiation is strongly correlated with local recombination rate.

Authors:  Alon Keinan; David Reich
Journal:  PLoS Genet       Date:  2010-03-26       Impact factor: 5.917

8.  Error and error mitigation in low-coverage genome assemblies.

Authors:  Melissa J Hubisz; Michael F Lin; Manolis Kellis; Adam Siepel
Journal:  PLoS One       Date:  2011-02-14       Impact factor: 3.240

9.  Quake: quality-aware detection and correction of sequencing errors.

Authors:  David R Kelley; Michael C Schatz; Steven L Salzberg
Journal:  Genome Biol       Date:  2010-11-29       Impact factor: 13.583

10.  Base-calling algorithm with vocabulary (BCV) method for analyzing population sequencing chromatograms.

Authors:  Yuri S Fantin; Alexey D Neverov; Alexander V Favorov; Maria V Alvarez-Figueroa; Svetlana I Braslavskaya; Maria A Gordukova; Inga V Karandashova; Konstantin V Kuleshov; Anna I Myznikova; Maya S Polishchuk; Denis A Reshetov; Yana A Voiciehovskaya; Andrei A Mironov; Vladimir P Chulanov
Journal:  PLoS One       Date:  2013-01-28       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.