Literature DB >> 7567469

Detection of new genes in a bacterial genome using Markov models for three gene classes.

M Borodovsky1, J D McIninch, E V Koonin, K E Rudd, C Médigue, A Danchin.   

Abstract

We further investigated the statistical features of the three classes of Escherichia coli genes that have been previously delineated by factorial correspondence analysis and dynamic clustering methods. A phased Markov model for a nucleotide sequence of each gene class was developed and employed for gene prediction using the GeneMark program. The protein-coding region prediction accuracy was determined for class-specific Markov models of different orders when the programs implementing these models were applied to gene sequences from the same or other classes. It is shown that at least two training sets and two program versions derived for different classes of E. coli genes are necessary in order to achieve a high accuracy of coding region prediction for uncharacterized sequences. Some annotated E. coli genes from Class I and Class III are shown to be spurious, whereas many open reading frames (ORFs) that have not been annotated in GenBank as genes are predicted to encode proteins. The amino acid sequences of the putative products of these ORFs initially did not show similarity to already known proteins. However, conserved regions have been identified in several of them by screening the latest entries in protein sequence databases and applying methods for motif search, while some other of these new genes have been identified in independent experiments.

Entities:  

Mesh:

Substances:

Year:  1995        PMID: 7567469      PMCID: PMC307237          DOI: 10.1093/nar/23.17.3554

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  22 in total

1.  Analysis of the Escherichia coli genome: DNA sequence of the region from 84.5 to 86.5 minutes.

Authors:  D L Daniels; G Plunkett; V Burland; F R Blattner
Journal:  Science       Date:  1992-08-07       Impact factor: 47.728

2.  Evidence for horizontal gene transfer in Escherichia coli speciation.

Authors:  C Médigue; T Rouxel; P Vigier; A Hénaut; A Danchin
Journal:  J Mol Biol       Date:  1991-12-20       Impact factor: 5.469

3.  Codon preference and primary sequence structure in protein-coding regions.

Authors:  S Tavaré; B Song
Journal:  Bull Math Biol       Date:  1989       Impact factor: 1.758

4.  Merging of distance matrices and classification by dynamic clustering.

Authors:  M O Delorme; A Hénaut
Journal:  Comput Appl Biosci       Date:  1988-11

5.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications.

Authors:  P M Sharp; W H Li
Journal:  Nucleic Acids Res       Date:  1987-02-11       Impact factor: 16.971

6.  A simple tool to search for sequence motifs that are conserved in BLAST outputs.

Authors:  R L Tatusov; E V Koonin
Journal:  Comput Appl Biosci       Date:  1994-07

7.  Measurements of the effects that coding for a protein has on a DNA sequence and their use for finding genes.

Authors:  R Staden
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

8.  Codon catalog usage is a genome strategy modulated for gene expressivity.

Authors:  R Grantham; C Gautier; M Gouy; M Jacobzone; R Mercier
Journal:  Nucleic Acids Res       Date:  1981-01-10       Impact factor: 16.971

9.  Mutational analysis of the Escherichia coli serB promoter region reveals transcriptional linkage to a downstream gene.

Authors:  A F Neuwald; D E Berg; G V Stauffer
Journal:  Gene       Date:  1992-10-12       Impact factor: 3.688

Review 10.  Codon usage and tRNA content in unicellular and multicellular organisms.

Authors:  T Ikemura
Journal:  Mol Biol Evol       Date:  1985-01       Impact factor: 16.240

View more
  25 in total

1.  Detecting and analyzing DNA sequencing errors: toward a higher quality of the Bacillus subtilis genome sequence.

Authors:  C Médigue; M Rose; A Viari; A Danchin
Journal:  Genome Res       Date:  1999-11       Impact factor: 9.043

2.  Mining Bacillus subtilis chromosome heterogeneities using hidden Markov models.

Authors:  Pierre Nicolas; Laurent Bize; Florence Muri; Mark Hoebeke; François Rodolphe; S Dusko Ehrlich; Bernard Prum; Philippe Bessières
Journal:  Nucleic Acids Res       Date:  2002-03-15       Impact factor: 16.971

3.  AMIGene: Annotation of MIcrobial Genes.

Authors:  Stéphanie Bocs; Stéphane Cruveiller; David Vallenet; Grégory Nuel; Claudine Médigue
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

Review 4.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

5.  AmpC and AmpH, proteins related to the class C beta-lactamases, bind penicillin and contribute to the normal morphology of Escherichia coli.

Authors:  T A Henderson; K D Young; S A Denome; P K Elf
Journal:  J Bacteriol       Date:  1997-10       Impact factor: 3.490

Review 6.  Evolution of T4-related phages.

Authors:  E Kutter; K Gachechiladze; A Poglazov; E Marusich; M Shneider; P Aronsson; A Napuli; D Porter; V Mesyanzhinov
Journal:  Virus Genes       Date:  1995       Impact factor: 2.332

7.  Self-identification of protein-coding regions in microbial genomes.

Authors:  S Audic; J M Claverie
Journal:  Proc Natl Acad Sci U S A       Date:  1998-08-18       Impact factor: 11.205

8.  GeneMark.hmm: new solutions for gene finding.

Authors:  A V Lukashin; M Borodovsky
Journal:  Nucleic Acids Res       Date:  1998-02-15       Impact factor: 16.971

9.  Sequence of a 189-kb segment of the chromosome of Rhodobacter capsulatus SB1003.

Authors:  C Vlcek; V Paces; N Maltsev; J Paces; R Haselkorn; M Fonstein
Journal:  Proc Natl Acad Sci U S A       Date:  1997-08-19       Impact factor: 11.205

10.  Natural selection retains overrepresented out-of-frame stop codons against frameshift peptides in prokaryotes.

Authors:  Herman Tse; James J Cai; Hoi-Wah Tsoi; Esther Pt Lam; Kwok-Yung Yuen
Journal:  BMC Genomics       Date:  2010-09-09       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.