Literature DB >> 10446249

Bacterial start site prediction.

S S Hannenhalli1, W S Hayes, A G Hatzigeorgiou, J W Fickett.   

Abstract

With the growing number of completely sequenced bacterial genes, accurate gene prediction in bacterial genomes remains an important problem. Although the existing tools predict genes in bacterial genomes with high overall accuracy, their ability to pinpoint the translation start site remains unsatisfactory. In this paper, we present a novel approach to bacterial start site prediction that takes into account multiple features of a potential start site, viz., ribosome binding site (RBS) binding energy, distance of the RBS from the start codon, distance from the beginning of the maximal ORF to the start codon, the start codon itself and the coding/non-coding potential around the start site. Mixed integer programing was used to optimize the discriminatory system. The accuracy of this approach is up to 90%, compared to 70%, using the most common tools in fully automated mode (that is, without expert human post-processing of results). The approach is evaluated using Bacillus subtilis, Escherichia coli and Pyrococcus furiosus. These three genomes cover a broad spectrum of bacterial genomes, since B.subtilis is a Gram-positive bacterium, E.coli is a Gram-negative bacterium and P. furiosus is an archaebacterium. A significant problem is generating a set of 'true' start sites for algorithm training, in the absence of experimental work. We found that sequence conservation between P. furiosus and the related Pyrococcus horikoshii clearly delimited the gene start in many cases, providing a sufficient training set.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10446249      PMCID: PMC148603          DOI: 10.1093/nar/27.17.3577

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  14 in total

1.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

2.  ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes.

Authors:  Feng-Biao Guo; Hong-Yu Ou; Chun-Ting Zhang
Journal:  Nucleic Acids Res       Date:  2003-03-15       Impact factor: 16.971

3.  Dictionary-driven prokaryotic gene finding.

Authors:  Tetsuo Shibuya; Isidore Rigoutsos
Journal:  Nucleic Acids Res       Date:  2002-06-15       Impact factor: 16.971

4.  A comparative genomic method for computational identification of prokaryotic translation initiation sites.

Authors:  Megon Walker; Vladimir Pavlovic; Simon Kasif
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

5.  Identification of 113 conserved essential genes using a high-throughput gene disruption system in Streptococcus pneumoniae.

Authors:  Jane A Thanassi; Sandra L Hartman-Neumann; Thomas J Dougherty; Brian A Dougherty; Michael J Pucci
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

6.  pAM401-based shuttle vectors that enable overexpression of promoterless genes and one-step purification of tag fusion proteins directly from Enterococcus faecalis.

Authors:  S Fujimoto; Y Ike
Journal:  Appl Environ Microbiol       Date:  2001-03       Impact factor: 4.792

7.  Identification of fur and fldA homologs and a Pasteurella multocida tbpA homolog in Histophilus ovis and effects of iron availability on their transcription.

Authors:  Andrew Ekins; Donald F Niven
Journal:  J Bacteriol       Date:  2002-05       Impact factor: 3.490

8.  Hon-yaku: a biology-driven Bayesian methodology for identifying translation initiation sites in prokaryotes.

Authors:  Yuko Makita; Michiel J L de Hoon; Antoine Danchin
Journal:  BMC Bioinformatics       Date:  2007-02-08       Impact factor: 3.169

9.  Cloning, extracellular expression and characterization of a predominant beta-CGTase from Bacillus sp. G1 in E. coli.

Authors:  Rui Min Ong; Kian Mau Goh; Nor Muhammad Mahadi; Osman Hassan; Raja Noor Zaliha Raja Abdul Rahman; Rosli Md Illias
Journal:  J Ind Microbiol Biotechnol       Date:  2008-08-26       Impact factor: 3.346

10.  Prokaryotic gene finding based on physicochemical characteristics of codons calculated from molecular dynamics simulations.

Authors:  Poonam Singhal; B Jayaram; Surjit B Dixit; David L Beveridge
Journal:  Biophys J       Date:  2008-03-07       Impact factor: 4.033

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.