Literature DB >> 12626720

ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes.

Feng-Biao Guo1, Hong-Yu Ou, Chun-Ting Zhang.   

Abstract

A new system, ZCURVE 1.0, for finding protein- coding genes in bacterial and archaeal genomes has been proposed. The current algorithm, which is based on the Z curve representation of the DNA sequences, lays stress on the global statistical features of protein-coding genes by taking the frequencies of bases at three codon positions into account. In ZCURVE 1.0, since only 33 parameters are used to characterize the coding sequences, it gives better consideration to both typical and atypical cases, whereas in Markov-model-based methods, e.g. Glimmer 2.02, thousands of parameters are trained, which may result in less adaptability. To compare the performance of the new system with that of Glimmer 2.02, both systems were run, respectively, for 18 genomes not annotated by the Glimmer system. Comparisons were also performed for predicting some function-known genes by both systems. Consequently, the average accuracy of both systems is well matched; however, ZCURVE 1.0 has more accurate gene start prediction, lower additional prediction rate and higher accuracy for the prediction of horizontally transferred genes. It is shown that the joint applications of both systems greatly improve gene-finding results. For a typical genome, e.g. Escherichia coli, the system ZCURVE 1.0 takes approximately 2 min on a Pentium III 866 PC without any human intervention. The system ZCURVE 1.0 is freely available at: http://tubic. tju.edu.cn/Zcurve_B/.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12626720      PMCID: PMC152858          DOI: 10.1093/nar/gkg254

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  21 in total

1.  EcoGene: a genome sequence database for Escherichia coli K-12.

Authors:  K E Rudd
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Recognition of protein coding genes in the yeast genome at better than 95% accuracy based on the Z curve.

Authors:  C T Zhang; J Wang
Journal:  Nucleic Acids Res       Date:  2000-07-15       Impact factor: 16.971

3.  A novel bacterial gene-finding system with improved accuracy in locating start codons.

Authors:  T Yada; Y Totoki; T Takagi; K Nakai
Journal:  DNA Res       Date:  2001-06-30       Impact factor: 4.458

4.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15       Impact factor: 16.971

5.  A probabilistic method for identifying start codons in bacterial genomes.

Authors:  B E Suzek; M D Ermolaeva; M Schreiber; S L Salzberg
Journal:  Bioinformatics       Date:  2001-12       Impact factor: 6.937

6.  HGT-DB: a database of putative horizontally transferred genes in prokaryotic complete genomes.

Authors:  S Garcia-Vallve; E Guzman; M A Montero; A Romeu
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

7.  Improved microbial gene identification with GLIMMER.

Authors:  A L Delcher; D Harmon; S Kasif; O White; S L Salzberg
Journal:  Nucleic Acids Res       Date:  1999-12-01       Impact factor: 16.971

8.  Bacterial start site prediction.

Authors:  S S Hannenhalli; W S Hayes; A G Hatzigeorgiou; J W Fickett
Journal:  Nucleic Acids Res       Date:  1999-09-01       Impact factor: 16.971

9.  Complete genome sequence of Pseudomonas aeruginosa PAO1, an opportunistic pathogen.

Authors:  C K Stover; X Q Pham; A L Erwin; S D Mizoguchi; P Warrener; M J Hickey; F S Brinkman; W O Hufnagle; D J Kowalik; M Lagrou; R L Garber; L Goltry; E Tolentino; S Westbrock-Wadman; Y Yuan; L L Brody; S N Coulter; K R Folger; A Kas; K Larbig; R Lim; K Smith; D Spencer; G K Wong; Z Wu; I T Paulsen; J Reizer; M H Saier; R E Hancock; S Lory; M V Olson
Journal:  Nature       Date:  2000-08-31       Impact factor: 49.962

10.  Identification of protein-coding genes in the genome of Vibrio cholerae with more than 98% accuracy using occurrence frequencies of single nucleotides.

Authors:  J Wang; C T Zhang
Journal:  Eur J Biochem       Date:  2001-08
View more
  73 in total

1.  Biogeography and phylogenetic diversity of a cluster of exclusively marine myxobacteria.

Authors:  Thorsten Brinkhoff; Doreen Fischer; John Vollmers; Sonja Voget; Christine Beardsley; Sebastian Thole; Marc Mussmann; Brigitte Kunze; Irene Wagner-Döbler; Rolf Daniel; Meinhard Simon
Journal:  ISME J       Date:  2011-12-22       Impact factor: 10.302

2.  Complete genome sequence of the industrial strain Ketogulonicigenium vulgare WSH-001.

Authors:  Liming Liu; Yang Li; Jing Zhang; Zhemin Zhou; Jie Liu; Xiaomin Li; Jingwen Zhou; Guocheng Du; Lei Wang; Jian Chen
Journal:  J Bacteriol       Date:  2011-11       Impact factor: 3.490

3.  Complete genome sequence of virulence-enhancing Siphophage VHS1 from Vibrio harveyi.

Authors:  Krit Khemayan; Anuphap Prachumwat; Burachai Sonthayanon; Aungkul Intaraprasong; Siriporn Sriurairatana; Timothy W Flegel
Journal:  Appl Environ Microbiol       Date:  2012-02-03       Impact factor: 4.792

4.  Complete genome sequence of the metabolically versatile plant growth-promoting endophyte Variovorax paradoxus S110.

Authors:  Jong-In Han; Hong-Kyu Choi; Seung-Won Lee; Paul M Orwin; Jina Kim; Sarah L Laroe; Tae-Gyu Kim; Jennifer O'Neil; Jared R Leadbetter; Sang Yup Lee; Cheol-Goo Hur; Jim C Spain; Galina Ovchinnikova; Lynne Goodwin; Cliff Han
Journal:  J Bacteriol       Date:  2010-12-23       Impact factor: 3.490

Review 5.  Identification of replication origins in archaeal genomes based on the Z-curve method.

Authors:  Ren Zhang; Chun-Ting Zhang
Journal:  Archaea       Date:  2005-05       Impact factor: 3.273

6.  Defining genes in the genome of the hyperthermophilic archaeon Pyrococcus furiosus: implications for all microbial genomes.

Authors:  Farris L Poole; Brian A Gerwe; Robert C Hopkins; Gerrit J Schut; Michael V Weinberg; Francis E Jenney; Michael W W Adams
Journal:  J Bacteriol       Date:  2005-11       Impact factor: 3.490

7.  Identifying bacterial genes and endosymbiont DNA with Glimmer.

Authors:  Arthur L Delcher; Kirsten A Bratke; Edwin C Powers; Steven L Salzberg
Journal:  Bioinformatics       Date:  2007-01-19       Impact factor: 6.937

8.  Complete genome sequence of the metabolically versatile halophilic archaeon Haloferax mediterranei, a poly(3-hydroxybutyrate-co-3-hydroxyvalerate) producer.

Authors:  Jing Han; Fan Zhang; Jing Hou; Xiaoqing Liu; Ming Li; Hailong Liu; Lei Cai; Bing Zhang; Yaping Chen; Jian Zhou; Songnian Hu; Hua Xiang
Journal:  J Bacteriol       Date:  2012-08       Impact factor: 3.490

9.  Improvement of natamycin production by engineering of phosphopantetheinyl transferases in Streptomyces chattanoogensis L10.

Authors:  Hui Jiang; Yue-Yue Wang; Xin-Xin Ran; Wei-Ming Fan; Xin-Hang Jiang; Wen-Jun Guan; Yong-Quan Li
Journal:  Appl Environ Microbiol       Date:  2013-03-22       Impact factor: 4.792

10.  DIGA--a database of improved gene annotation for phytopathogens.

Authors:  Na Gao; Ling-Ling Chen; Hong-Fang Ji; Wei Wang; Ji-Wei Chang; Bei Gao; Lin Zhang; Shi-Cui Zhang; Hong-Yu Zhang
Journal:  BMC Genomics       Date:  2010-01-21       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.