Literature DB >> 10568751

Detecting and analyzing DNA sequencing errors: toward a higher quality of the Bacillus subtilis genome sequence.

C Médigue1, M Rose, A Viari, A Danchin.   

Abstract

During the determination of a DNA sequence, the introduction of artifactual frameshifts and/or in-frame stop codons in putative genes can lead to misprediction of gene products. Detection of such errors with a method based on protein similarity matching is only possible when related sequences are available in databases. Here, we present a method to detect frameshift errors in DNA sequences that is based on the intrinsic properties of the coding sequences. It combines the results of two analyses, the search for translational initiation/termination sites and the prediction of coding regions. This method was used to screen the complete Bacillus subtilis genome sequence and the regions flanking putative errors were resequenced for verification. This procedure allowed us to correct the sequence and to analyze in detail the nature of the errors. Interestingly, in several cases in-frame termination codons or frameshifts were not sequencing errors but confirmed to be present in the chromosome, indicating that the genes are either nonfunctional (pseudogenes) or subject to regulatory processes such as programmed translational frameshifts. The method can be used for checking the quality of the sequences produced by any prokaryotic genome sequencing project.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10568751      PMCID: PMC310837          DOI: 10.1101/gr.9.11.1116

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  37 in total

1.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

2.  Finding errors in DNA sequences.

Authors:  J Posfai; R J Roberts
Journal:  Proc Natl Acad Sci U S A       Date:  1992-05-15       Impact factor: 11.205

3.  How to interpret an anonymous bacterial genome: machine learning approach to gene identification.

Authors:  W S Hayes; M Borodovsky
Journal:  Genome Res       Date:  1998-11       Impact factor: 9.043

4.  Estimation of errors in "raw" DNA sequences: a validation study.

Authors:  P Richterich
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

5.  Alignments of DNA and protein sequences containing frameshift errors.

Authors:  X Guan; E C Uberbacher
Journal:  Comput Appl Biosci       Date:  1996-02

Review 6.  Translational frameshifting in the control of transposition in bacteria.

Authors:  M Chandler; O Fayet
Journal:  Mol Microbiol       Date:  1993-02       Impact factor: 3.501

7.  Detecting frame shifts by amino acid sequence comparison.

Authors:  J M Claverie
Journal:  J Mol Biol       Date:  1993-12-20       Impact factor: 5.469

8.  Microbial gene identification using interpolated Markov models.

Authors:  S L Salzberg; A L Delcher; S Kasif; O White
Journal:  Nucleic Acids Res       Date:  1998-01-15       Impact factor: 16.971

9.  Mannitol-specific enzyme II of the bacterial phosphotransferase system. III. The nucleotide sequence of the permease gene.

Authors:  C A Lee; M H Saier
Journal:  J Biol Chem       Date:  1983-09-10       Impact factor: 5.157

Review 10.  Phosphoenolpyruvate:carbohydrate phosphotransferase systems of bacteria.

Authors:  P W Postma; J W Lengeler; G R Jacobson
Journal:  Microbiol Rev       Date:  1993-09
View more
  19 in total

1.  AMIGene: Annotation of MIcrobial Genes.

Authors:  Stéphanie Bocs; Stéphane Cruveiller; David Vallenet; Grégory Nuel; Claudine Médigue
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  SubtiList: the reference database for the Bacillus subtilis genome.

Authors:  Ivan Moszer; Louis M Jones; Sandrine Moreira; Cécilia Fabry; Antoine Danchin
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  Comparative proteogenomics: combining mass spectrometry and comparative genomics to analyze multiple genomes.

Authors:  Nitin Gupta; Jamal Benhamida; Vipul Bhargava; Daniel Goodman; Elisabeth Kain; Ian Kerman; Ngan Nguyen; Noah Ollikainen; Jesse Rodriguez; Jian Wang; Mary S Lipton; Margaret Romine; Vineet Bafna; Richard D Smith; Pavel A Pevzner
Journal:  Genome Res       Date:  2008-04-21       Impact factor: 9.043

4.  GeneTack database: genes with frameshifts in prokaryotic genomes and eukaryotic mRNA sequences.

Authors:  Ivan Antonov; Pavel Baranov; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2012-11-17       Impact factor: 16.971

5.  Comparative geno-plasticity analysis of Mycoplasma bovis HB0801 (Chinese isolate).

Authors:  Jingjing Qi; Aizhen Guo; Peng Cui; Yingyu Chen; Riaz Mustafa; Xiaoliang Ba; Changmin Hu; Zhidi Bai; Xi Chen; Lei Shi; Huanchun Chen
Journal:  PLoS One       Date:  2012-05-31       Impact factor: 3.240

6.  ICDS database: interrupted CoDing sequences in prokaryotic genomes.

Authors:  Emmanuel Perrodou; Caroline Deshayes; Jean Muller; Christine Schaeffer; Alain Van Dorsselaer; Raymond Ripp; Olivier Poch; Jean-Marc Reyrat; Odile Lecompte
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  MICheck: a web tool for fast checking of syntactic annotations of bacterial genomes.

Authors:  Stéphane Cruveiller; Jérôme Le Saux; David Vallenet; Aurélie Lajus; Stéphanie Bocs; Claudine Médigue
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

8.  The complete genome and proteome of Laribacter hongkongensis reveal potential mechanisms for adaptations to different temperatures and habitats.

Authors:  Patrick C Y Woo; Susanna K P Lau; Herman Tse; Jade L L Teng; Shirly O T Curreem; Alan K L Tsang; Rachel Y Y Fan; Gilman K M Wong; Yi Huang; Nicholas J Loman; Lori A S Snyder; James J Cai; Jian-Dong Huang; William Mak; Mark J Pallen; Si Lok; Kwok-Yung Yuen
Journal:  PLoS Genet       Date:  2009-03-13       Impact factor: 5.917

9.  From a consortium sequence to a unified sequence: the Bacillus subtilis 168 reference genome a decade later.

Authors:  Valérie Barbe; Stéphane Cruveiller; Frank Kunst; Patricia Lenoble; Guillaume Meurice; Agnieszka Sekowska; David Vallenet; Tingzhang Wang; Ivan Moszer; Claudine Médigue; Antoine Danchin
Journal:  Microbiology (Reading)       Date:  2009-04-21       Impact factor: 2.777

10.  Optical mapping of the Mycobacterium avium subspecies paratuberculosis genome.

Authors:  Chia-wei Wu; Timothy M Schramm; Shiguo Zhou; David C Schwartz; Adel M Talaat
Journal:  BMC Genomics       Date:  2009-01-15       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.