Literature DB >> 9421513

Microbial gene identification using interpolated Markov models.

S L Salzberg1, A L Delcher, S Kasif, O White.   

Abstract

This paper describes a new system, GLIMMER, for finding genes in microbial genomes. In a series of tests on Haemophilus influenzae , Helicobacter pylori and other complete microbial genomes, this system has proven to be very accurate at locating virtually all the genes in these sequences, outperforming previous methods. A conservative estimate based on experiments on H.pylori and H. influenzae is that the system finds >97% of all genes. GLIMMER uses interpolated Markov models (IMMs) as a framework for capturing dependencies between nearby nucleotides in a DNA sequence. An IMM-based method makes predictions based on a variable context; i.e., a variable-length oligomer in a DNA sequence. The context used by GLIMMER changes depending on the local composition of the sequence. As a result, GLIMMER is more flexible and more powerful than fixed-order Markov methods, which have previously been the primary content-based technique for finding genes in microbial DNA.

Entities:  

Mesh:

Substances:

Year:  1998        PMID: 9421513      PMCID: PMC147303          DOI: 10.1093/nar/26.2.544

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  396 in total

1.  Detecting and analyzing DNA sequencing errors: toward a higher quality of the Bacillus subtilis genome sequence.

Authors:  C Médigue; M Rose; A Viari; A Danchin
Journal:  Genome Res       Date:  1999-11       Impact factor: 9.043

2.  The genome of turkey herpesvirus.

Authors:  C L Afonso; E R Tulman; Z Lu; L Zsak; D L Rock; G F Kutish
Journal:  J Virol       Date:  2001-01       Impact factor: 5.103

3.  Transcription profiling-based identification of Staphylococcus aureus genes regulated by the agr and/or sarA loci.

Authors:  P M Dunman; E Murphy; S Haney; D Palacios; G Tucker-Kellogg; S Wu; E L Brown; R J Zagursky; D Shlaes; S J Projan
Journal:  J Bacteriol       Date:  2001-12       Impact factor: 3.490

4.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

5.  The secE gene of Helicobacter pylori.

Authors:  Claudine Médigue; Benjamin Chun-Yu Wong; Marie Chia-Mi Lin; Stéphanie Bocs; Antoine Danchin
Journal:  J Bacteriol       Date:  2002-05       Impact factor: 3.490

Review 6.  Computational gene finding in plants.

Authors:  Mihaela Pertea; Steven L Salzberg
Journal:  Plant Mol Biol       Date:  2002-01       Impact factor: 4.076

7.  Genome sequence of a baculovirus pathogenic for Culex nigripalpus.

Authors:  C L Afonso; E R Tulman; Z Lu; C A Balinsky; B A Moser; J J Becnel; D L Rock; G F Kutish
Journal:  J Virol       Date:  2001-11       Impact factor: 5.103

8.  ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes.

Authors:  Feng-Biao Guo; Hong-Yu Ou; Chun-Ting Zhang
Journal:  Nucleic Acids Res       Date:  2003-03-15       Impact factor: 16.971

9.  The complete genome sequence of the murine respiratory pathogen Mycoplasma pulmonis.

Authors:  I Chambaud; R Heilig; S Ferris; V Barbe; D Samson; F Galisson; I Moszer; K Dybvig; H Wróblewski; A Viari; E P Rocha; A Blanchard
Journal:  Nucleic Acids Res       Date:  2001-05-15       Impact factor: 16.971

10.  Analysis of a donor gene region for a variant surface glycoprotein and its expression site in African trypanosomes.

Authors:  D J LaCount; N M El-Sayed; S Kaul; D Wanless; C M Turner; J E Donelson
Journal:  Nucleic Acids Res       Date:  2001-05-15       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.