Literature DB >> 9707594

Self-identification of protein-coding regions in microbial genomes.

S Audic1, J M Claverie.   

Abstract

A new method for predicting protein-coding regions in microbial genomic DNA sequences is presented. It uses an ab initio iterative Markov modeling procedure to automatically perform the partition of genomic sequences into three subsets shown to correspond to coding, coding on the opposite strand, and noncoding segments. In contrast to current methods, such as GENEMARK [Borodovsky, M. & McIninch, J. D. (1993) Comput. Chem. 17, 123-133], no training set or prior knowledge of the statistical properties of the studied genome are required. This new method tolerates error rates of 1-2% and can process unassembled sequences. It is thus ideal for the analysis of genome survey and/or fragmented sequence data from uncharacterized microorganisms. The method was validated on 10 complete bacterial genomes (from four major phylogenetic lineages). The results show that protein-coding regions can be identified with an accuracy of up to 90% with a totally automated and objective procedure.

Mesh:

Substances:

Year:  1998        PMID: 9707594      PMCID: PMC21455          DOI: 10.1073/pnas.95.17.10026

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  29 in total

Review 1.  Assessment of protein coding measures.

Authors:  J W Fickett; C S Tung
Journal:  Nucleic Acids Res       Date:  1992-12-25       Impact factor: 16.971

2.  Statistical analysis of GeneMark performance by cross-validation.

Authors:  J Kleffe; K Hermann; M Borodovsky
Journal:  Comput Chem       Date:  1996-03

3.  Evidence for horizontal gene transfer in Escherichia coli speciation.

Authors:  C Médigue; T Rouxel; P Vigier; A Hénaut; A Danchin
Journal:  J Mol Biol       Date:  1991-12-20       Impact factor: 5.469

4.  The minimal gene complement of Mycoplasma genitalium.

Authors:  C M Fraser; J D Gocayne; O White; M D Adams; R A Clayton; R D Fleischmann; C J Bult; A R Kerlavage; G Sutton; J M Kelley; R D Fritchman; J F Weidman; K V Small; M Sandusky; J Fuhrmann; D Nguyen; T R Utterback; D M Saudek; C A Phillips; J M Merrick; J F Tomb; B A Dougherty; K F Bott; P C Hu; T S Lucier; S N Peterson; H O Smith; C A Hutchison; J C Venter
Journal:  Science       Date:  1995-10-20       Impact factor: 47.728

5.  Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment.

Authors:  C E Lawrence; S F Altschul; M S Boguski; J S Liu; A F Neuwald; J C Wootton
Journal:  Science       Date:  1993-10-08       Impact factor: 47.728

6.  A streamlined random sequencing strategy for finding coding exons.

Authors:  J M Claverie
Journal:  Genomics       Date:  1994-10       Impact factor: 5.736

7.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd.

Authors:  R D Fleischmann; M D Adams; O White; R A Clayton; E F Kirkness; A R Kerlavage; C J Bult; J F Tomb; B A Dougherty; J M Merrick
Journal:  Science       Date:  1995-07-28       Impact factor: 47.728

8.  [Statistical characteristics in primary structures of functional regions of Escherichia coli genome. II. Non-stationary Markov chains].

Authors:  M Iu Borodovskiĭ; Iu A Sprizhitskiĭ; E I Golovanov; A A Aleksandrov
Journal:  Mol Biol (Mosk)       Date:  1986 Jul-Aug

9.  Atypical regions in large genomic DNA sequences.

Authors:  S Scherer; M S McPeek; T P Speed
Journal:  Proc Natl Acad Sci U S A       Date:  1994-07-19       Impact factor: 11.205

10.  A Markov analysis of DNA sequences.

Authors:  H Almagor
Journal:  J Theor Biol       Date:  1983-10-21       Impact factor: 2.691

View more
  21 in total

1.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

2.  Ab initio gene identification: prokaryote genome annotation with GeneScan and GLIMMER.

Authors:  Gautam Aggarwal; Ramakrishna Ramaswamy
Journal:  J Biosci       Date:  2002-02       Impact factor: 1.826

3.  Dictionary-driven prokaryotic gene finding.

Authors:  Tetsuo Shibuya; Isidore Rigoutsos
Journal:  Nucleic Acids Res       Date:  2002-06-15       Impact factor: 16.971

Review 4.  Current methods of gene prediction, their strengths and weaknesses.

Authors:  Catherine Mathé; Marie-France Sagot; Thomas Schiex; Pierre Rouzé
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

5.  DNA-energetics-based analyses suggest additional genes in prokaryotes.

Authors:  Garima Khandelwal; Jalaj Gupta; B Jayaram
Journal:  J Biosci       Date:  2012-07       Impact factor: 1.826

6.  Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training.

Authors:  Vardges Ter-Hovhannisyan; Alexandre Lomsadze; Yury O Chernoff; Mark Borodovsky
Journal:  Genome Res       Date:  2008-08-29       Impact factor: 9.043

7.  Reverse transcriptase-polymerase chain reaction validation of 25 "orphan" genes from Escherichia coli K-12 MG1655.

Authors:  J P Alimi; O Poirot; F Lopez; J M Claverie
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

8.  Classifying coding DNA with nucleotide statistics.

Authors:  Nicolas Carels; Diego Frías
Journal:  Bioinform Biol Insights       Date:  2009-10-28

Review 9.  Using DNA microarrays to study host-microbe interactions.

Authors:  C A Cummings; D A Relman
Journal:  Emerg Infect Dis       Date:  2000 Sep-Oct       Impact factor: 6.883

10.  MetaGeneAnnotator: detecting species-specific patterns of ribosomal binding site for precise gene prediction in anonymous prokaryotic and phage genomes.

Authors:  Hideki Noguchi; Takeaki Taniguchi; Takehiko Itoh
Journal:  DNA Res       Date:  2008-10-21       Impact factor: 4.458

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.