Literature DB >> 10592175

The COG database: a tool for genome-scale analysis of protein functions and evolution.

R L Tatusov1, M Y Galperin, D A Natale, E V Koonin.   

Abstract

Rational classification of proteins encoded in sequenced genomes is critical for making the genome sequences maximally useful for functional and evolutionary studies. The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes (http://www. ncbi.nlm. nih.gov/COG). The COGs were constructed by applying the criterion of consistency of genome-specific best hits to the results of an exhaustive comparison of all protein sequences from these genomes. The database comprises 2091 COGs that include 56-83% of the gene products from each of the complete bacterial and archaeal genomes and approximately 35% of those from the yeast Saccharomyces cerevisiae genome. The COG database is accompanied by the COGNITOR program that is used to fit new proteins into the COGs and can be applied to functional and phylogenetic annotation of newly sequenced genomes.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10592175      PMCID: PMC102395          DOI: 10.1093/nar/28.1.33

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  12 in total

Review 1.  Phylogenetic classification and the universal tree.

Authors:  W F Doolittle
Journal:  Science       Date:  1999-06-25       Impact factor: 47.728

2.  Uses for evolutionary trees.

Authors:  W M Fitch
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  1995-07-29       Impact factor: 6.237

3.  Differential genome display.

Authors:  M A Huynen; Y Diaz-Lazcoz; P Bork
Journal:  Trends Genet       Date:  1997-10       Impact factor: 11.639

Review 4.  Gene families: the taxonomy of protein paralogs and chimeras.

Authors:  S Henikoff; E A Greene; S Pietrokovski; P Bork; T K Attwood; L Hood
Journal:  Science       Date:  1997-10-24       Impact factor: 47.728

Review 5.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

6.  Distinguishing homologous from analogous proteins.

Authors:  W M Fitch
Journal:  Syst Zool       Date:  1970-06

7.  Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea.

Authors:  E V Koonin; A R Mushegian; M Y Galperin; D R Walker
Journal:  Mol Microbiol       Date:  1997-08       Impact factor: 3.501

8.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

Review 9.  Genome sequences: genome sequence of a model prokaryote.

Authors:  E V Koonin
Journal:  Curr Biol       Date:  1997-10-01       Impact factor: 10.834

Review 10.  Functions of the gene products of Escherichia coli.

Authors:  M Riley
Journal:  Microbiol Rev       Date:  1993-12
View more
  1641 in total

1.  The Comprehensive Microbial Resource.

Authors:  J D Peterson; L A Umayam; T Dickinson; E K Hickey; O White
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  TIGRFAMs: a protein family resource for the functional identification of proteins.

Authors:  D H Haft; B J Loftus; D L Richardson; F Yang; J A Eisen; I T Paulsen; O White
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

3.  Proteome Analysis Database: online application of InterPro and CluSTr for the functional classification of proteins in whole genomes.

Authors:  R Apweiler; M Biswas; W Fleischmann; A Kanapin; Y Karavidopoulou; P Kersey; E V Kriventseva; V Mittard; N Mulder; I Phan; E Zdobnov
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

4.  Protein Information Resource: a community resource for expert annotation of protein data.

Authors:  W C Barker; J S Garavelli; Z Hou; H Huang; R S Ledley; P B McGarvey; H W Mewes; B C Orcutt; F Pfeiffer; A Tsugita; C R Vinayaka; C Xiao; L S Yeh; C Wu
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

5.  iProClass: an integrated, comprehensive and annotated protein classification database.

Authors:  C H Wu; C Xiao; Z Hou; H Huang; W C Barker
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

6.  ARED: human AU-rich element-containing mRNA database reveals an unexpectedly diverse functional repertoire of encoded proteins.

Authors:  T Bakheet; M Frevel; B R Williams; W Greer; K S Khabar
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

7.  PartsList: a web-based system for dynamically ranking protein folds based on disparate attributes, including whole-genome expression and interaction information.

Authors:  J Qian; B Stenger; C A Wilson; J Lin; R Jansen; S A Teichmann; J Park; W G Krebs; H Yu; V Alexandrov; N Echols; M Gerstein
Journal:  Nucleic Acids Res       Date:  2001-04-15       Impact factor: 16.971

8.  Lineage-specific gene expansions in bacterial and archaeal genomes.

Authors:  I K Jordan; K S Makarova; J L Spouge; Y I Wolf; E V Koonin
Journal:  Genome Res       Date:  2001-04       Impact factor: 9.043

9.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

10.  Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags.

Authors:  S J de Souza; A A Camargo; M R Briones; F F Costa; M A Nagai; S Verjovski-Almeida; M A Zago; L E Andrade; H Carrer; H F El-Dorry; E M Espreafico; A Habr-Gama; D Giannella-Neto; G H Goldman; A Gruber; C Hackel; E T Kimura; R M Maciel; S K Marie; E A Martins; M P Nobrega; M L Paco-Larson; M I Pardini; G G Pereira; J B Pesquero; V Rodrigues; S R Rogatto; I D da Silva; M C Sogayar; M de Fátima Sonati; E H Tajara; S R Valentini; M Acencio; F L Alberto; M E Amaral; I Aneas; M H Bengtson; D M Carraro; A F Carvalho; L H Carvalho; J M Cerutti; M L Corrêa; M C Costa; C Curcio; T Gushiken; P L Ho; E Kimura; L C Leite; G Maia; P Majumder; M Marins; A Matsukuma; A S Melo; C A Mestriner; E C Miracca; D C Miranda; A N Nascimento; F G Nóbrega; E P Ojopi; J R Pandolfi; L G Pessoa; P Rahal; C A Rainho; N da Rós; R G de Sá; M M Sales; N P da Silva; T C Silva; W da Silva; D F Simão; J F Sousa; D Stecconi; F Tsukumo; V Valente; H Zalcbeg; R R Brentani; F L Reis; E Dias-Neto; A J Simpson
Journal:  Proc Natl Acad Sci U S A       Date:  2000-11-07       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.