Literature DB >> 15488147

PCOGR: phylogenetic COG ranking as an online tool to judge the specificity of COGs with respect to freely definable groups of organisms.

Florian Meereis1, Michael Kaufmann.   

Abstract

BACKGROUND: The rapidly increasing number of completely sequenced genomes led to the establishment of the COG-database which, based on sequence homologies, assigns similar proteins from different organisms to clusters of orthologous groups (COGs). There are several bioinformatic studies that made use of this database to determine (hyper)thermophile-specific proteins by searching for COGs containing (almost) exclusively proteins from (hyper)thermophilic genomes. However, public software to perform individually definable group-specific searches is not available.
RESULTS: The tool described here exactly fills this gap. The software is accessible at http://www.uni-wh.de/pcogr and is linked to the COG-database. The user can freely define two groups of organisms by selecting for each of the (current) 66 organisms to belong either to groupA, to the reference groupB or to be ignored by the algorithm. Then, for all COGs a specificity index is calculated with respect to the specificity to groupA, i. e. high scoring COGs contain proteins from the most of groupA organisms while proteins from the most organisms assigned to groupB are absent. In addition to ranking all COGs according to the user defined specificity criteria, a graphical visualization shows the distribution of all COGs by displaying their abundance as a function of their specificity indexes.
CONCLUSIONS: This software allows detecting COGs specific to a predefined group of organisms. All COGs are ranked in the order of their specificity and a graphical visualization allows recognizing (i) the presence and abundance of such COGs and (ii) the phylogenetic relationship between groupA- and groupB-organisms. The software also allows detecting putative protein-protein interactions, novel enzymes involved in only partially known biochemical pathways, and alternate enzymes originated by convergent evolution.

Entities:  

Mesh:

Year:  2004        PMID: 15488147      PMCID: PMC526202          DOI: 10.1186/1471-2105-5-150

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  10 in total

1.  Who's your neighbor? New computational approaches for functional genomics.

Authors:  M Y Galperin; E V Koonin
Journal:  Nat Biotechnol       Date:  2000-06       Impact factor: 54.908

2.  A hot story from comparative genomics: reverse gyrase is the only hyperthermophile-specific protein.

Authors:  Patrick Forterre
Journal:  Trends Genet       Date:  2002-05       Impact factor: 11.639

3.  Potential genomic determinants of hyperthermophily.

Authors:  Kira S Makarova; Yuri I Wolf; Eugene V Koonin
Journal:  Trends Genet       Date:  2003-04       Impact factor: 11.639

4.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles.

Authors:  M Pellegrini; E M Marcotte; M J Thompson; D Eisenberg; T O Yeates
Journal:  Proc Natl Acad Sci U S A       Date:  1999-04-13       Impact factor: 11.205

5.  EPPS: mining the COG database by an extended phylogenetic patterns search.

Authors:  Klaus Reichard; Michael Kaufmann
Journal:  Bioinformatics       Date:  2003-04-12       Impact factor: 6.937

Review 6.  A genomic perspective on protein families.

Authors:  R L Tatusov; E V Koonin; D J Lipman
Journal:  Science       Date:  1997-10-24       Impact factor: 47.728

7.  The COG database: new developments in phylogenetic classification of proteins from complete genomes.

Authors:  R L Tatusov; D A Natale; I V Garkavtsev; T A Tatusova; U T Shankavaram; B S Rao; B Kiryutin; M Y Galperin; N D Fedorova; E V Koonin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

8.  X-ray scattering studies of Methylophilus methylotrophus (sp. W3A1) electron-transferring flavoprotein. Evidence for multiple conformational states and an induced fit mechanism for assembly with trimethylamine dehydrogenase.

Authors:  M Jones; J Basran; M J Sutcliffe; J Günter Grossmann; N S Scrutton
Journal:  J Biol Chem       Date:  2000-07-14       Impact factor: 5.157

9.  Thermophile-specific proteins: the gene product of aq_1292 from Aquifex aeolicus is an NTPase.

Authors:  Claudia Klinger; Michael Rossbach; Rebecca Howe; Michael Kaufmann
Journal:  BMC Biochem       Date:  2003-09-23       Impact factor: 4.059

10.  The COG database: an updated version includes eukaryotes.

Authors:  Roman L Tatusov; Natalie D Fedorova; John D Jackson; Aviva R Jacobs; Boris Kiryutin; Eugene V Koonin; Dmitri M Krylov; Raja Mazumder; Sergei L Mekhedov; Anastasia N Nikolskaya; B Sridhar Rao; Sergei Smirnov; Alexander V Sverdlov; Sona Vasudevan; Yuri I Wolf; Jodie J Yin; Darren A Natale
Journal:  BMC Bioinformatics       Date:  2003-09-11       Impact factor: 3.169

  10 in total
  4 in total

1.  Extension of the COG and arCOG databases by amino acid and nucleotide sequences.

Authors:  Florian Meereis; Michael Kaufmann
Journal:  BMC Bioinformatics       Date:  2008-11-13       Impact factor: 3.169

2.  PHOG: a database of supergenomes built from proteome complements.

Authors:  Igor V Merkeev; Pavel S Novichkov; Andrey A Mironov
Journal:  BMC Evol Biol       Date:  2006-06-22       Impact factor: 3.260

3.  Crystal structure of THEP1 from the hyperthermophile Aquifex aeolicus: a variation of the RecA fold.

Authors:  Michael Rossbach; Oliver Daumke; Claudia Klinger; Alfred Wittinghofer; Michael Kaufmann
Journal:  BMC Struct Biol       Date:  2005-03-20

4.  On the cytotoxicity of HCR-NTPase in the neuroblastoma cell line SH-SY5Y.

Authors:  Markus Pasdziernik; Barbara Kaltschmidt; Christian Kaltschmidt; Claudia Klinger; Michael Kaufmann
Journal:  BMC Res Notes       Date:  2009-06-11
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.