
Identification and functional analysis of 'hypothetical' genes expressed in Haemophilus influenzae.

Eugene Kolker, Kira S Makarova, Svetlana Shabalina, Alex F Picone, Samuel Purvine, Ted Holzman, Tim Cherny, David Armbruster, Robert S Munson, Grigory Kolesov, Dmitrij Frishman, Michael Y Galperin.

Abstract

Progress in genome sequencing has led to a rapid accumulation of uncharacterized 'hypothetical' genes in GenBank submissions. These genes, which have not been experimentally characterized and whose functions cannot be deduced from simple sequence comparisons alone, now comprise a significant fraction of the public databases. Expression analyses of Haemophilus influenzae cells using a combination of transcriptomic and proteomic approaches resulted in confident identification of 54 'hypothetical' genes that were expressed in cells under normal growth conditions. In an attempt to understand the functions of the proteins they encode, we used a variety of publicly available analysis tools. Close homologs in other species were detected for each of the 54 'hypothetical' genes. For 16 of them, exact functional assignments could be found in one or more public databases. Additionally, we were able to suggest a general functional characterization for 27 more genes, so that 43 of the 54 genes (approximately 80%) received at least a general functional assignment. Findings from this analysis include the identification of a pyruvate-formate lyase-like operon, likely to be expressed not only in H. influenzae but also in several other bacteria. We also observed three genes that are likely to participate in the transport and/or metabolism of sialic acid, an important component of the H. influenzae lipo-oligosaccharide. Accurate functional annotation of uncharacterized genes calls for an integrative approach, combining expression studies with extensive computational analysis and curation, followed by eventual experimental verification of the computational predictions.

Year:  2004        PMID: 15121896      PMCID: PMC419445          DOI: 10.1093/nar/gkh555

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


