Literature DB >> 19014535

Extension of the COG and arCOG databases by amino acid and nucleotide sequences.

Florian Meereis1, Michael Kaufmann.   

Abstract

BACKGROUND: The current versions of the COG and arCOG databases, both excellent frameworks for studies in comparative and functional genomics, do not contain the nucleotide sequences corresponding to their protein or protein domain entries.
RESULTS: Using sequence information obtained from GenBank flat files covering the completely sequenced genomes of the COG and arCOG databases, we constructed NUCOCOG (nucleotide sequences containing COG databases) as an extended version including all nucleotide sequences and in addition the amino acid sequences originally utilized to construct the current COG and arCOG databases. We make available three comprehensive single XML files containing the complete databases including all sequence information. In addition, we provide a web interface as a utility suitable to browse the NUCOCOG database for sequence retrieval. The database is accessible at http://www.uni-wh.de/nucocog.
CONCLUSION: NUCOCOG offers the possibility to analyze any sequence related property in the context of the COG and arCOG framework simply by using script languages such as PERL applied to a large but single XML document.

Entities:  

Mesh:

Year:  2008        PMID: 19014535      PMCID: PMC2588464          DOI: 10.1186/1471-2105-9-479

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  14 in total

1.  Artemis: sequence visualization and annotation.

Authors:  K Rutherford; J Parkhill; J Crook; T Horsnell; P Rice; M A Rajandream; B Barrell
Journal:  Bioinformatics       Date:  2000-10       Impact factor: 6.937

2.  Structural and genomic correlates of hyperthermostability.

Authors:  C Cambillau; J M Claverie
Journal:  J Biol Chem       Date:  2000-10-20       Impact factor: 5.157

3.  Genomic correlates of hyperthermostability, an update.

Authors:  Karsten Suhre; Jean-Michel Claverie
Journal:  J Biol Chem       Date:  2003-02-24       Impact factor: 5.157

4.  Thermophilic prokaryotes have characteristic patterns of codon usage, amino acid composition and nucleotide content.

Authors:  Gregory A C Singer; Donal A Hickey
Journal:  Gene       Date:  2003-10-23       Impact factor: 3.688

5.  The COG database: a tool for genome-scale analysis of protein functions and evolution.

Authors:  R L Tatusov; M Y Galperin; D A Natale; E V Koonin
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 6.  A genomic perspective on protein families.

Authors:  R L Tatusov; E V Koonin; D J Lipman
Journal:  Science       Date:  1997-10-24       Impact factor: 47.728

7.  The COG database: new developments in phylogenetic classification of proteins from complete genomes.

Authors:  R L Tatusov; D A Natale; I V Garkavtsev; T A Tatusova; U T Shankavaram; B S Rao; B Kiryutin; M Y Galperin; N D Fedorova; E V Koonin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

8.  PCOGR: phylogenetic COG ranking as an online tool to judge the specificity of COGs with respect to freely definable groups of organisms.

Authors:  Florian Meereis; Michael Kaufmann
Journal:  BMC Bioinformatics       Date:  2004-10-15       Impact factor: 3.169

9.  Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.

Authors:  Kira S Makarova; Alexander V Sorokin; Pavel S Novichkov; Yuri I Wolf; Eugene V Koonin
Journal:  Biol Direct       Date:  2007-11-27       Impact factor: 4.540

10.  The COG database: an updated version includes eukaryotes.

Authors:  Roman L Tatusov; Natalie D Fedorova; John D Jackson; Aviva R Jacobs; Boris Kiryutin; Eugene V Koonin; Dmitri M Krylov; Raja Mazumder; Sergei L Mekhedov; Anastasia N Nikolskaya; B Sridhar Rao; Sergei Smirnov; Alexander V Sverdlov; Sona Vasudevan; Yuri I Wolf; Jodie J Yin; Darren A Natale
Journal:  BMC Bioinformatics       Date:  2003-09-11       Impact factor: 3.169

View more
  5 in total

1.  Application of Subspace Clustering in DNA Sequence Analysis.

Authors:  Tim Wallace; Ali Sekmen; Xiaofei Wang
Journal:  J Comput Biol       Date:  2015-07-10       Impact factor: 1.479

2.  Insights into the evolution of Archaea and eukaryotic protein modifier systems revealed by the genome of a novel archaeal group.

Authors:  Takuro Nunoura; Yoshihiro Takaki; Jungo Kakuta; Shinro Nishi; Junichi Sugahara; Hiromi Kazama; Gab-Joo Chee; Masahira Hattori; Akio Kanai; Haruyuki Atomi; Ken Takai; Hideto Takami
Journal:  Nucleic Acids Res       Date:  2010-12-15       Impact factor: 16.971

3.  ComSin: database of protein structures in bound (complex) and unbound (single) states in relation to their intrinsic disorder.

Authors:  Michail Yu Lobanov; Benjamin A Shoemaker; Sergiy O Garbuzynskiy; Jessica H Fong; Anna R Panchenko; Oxana V Galzitskaya
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

4.  ANCAC: amino acid, nucleotide, and codon analysis of COGs--a tool for sequence bias analysis in microbial orthologs.

Authors:  Arno Meiler; Claudia Klinger; Michael Kaufmann
Journal:  BMC Bioinformatics       Date:  2012-09-08       Impact factor: 3.169

5.  Mining for hemicellulases in the fungus-growing termite Pseudacanthotermes militaris using functional metagenomics.

Authors:  Géraldine Bastien; Grégory Arnal; Sophie Bozonnet; Sandrine Laguerre; Fernando Ferreira; Régis Fauré; Bernard Henrissat; Fabrice Lefèvre; Patrick Robe; Olivier Bouchez; Céline Noirot; Claire Dumon; Michael O'Donohue
Journal:  Biotechnol Biofuels       Date:  2013-05-14       Impact factor: 6.040

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.