Genomic functional annotation using co-evolution profiles of gene clusters.

Yu Zheng, Richard J Roberts, Simon Kasif.

Abstract

BACKGROUND: Sequencing already outpaces annotation, creating a potential bottleneck, and a large proportion of the genes in microbial genomes remain uncharacterized. Here we propose a new method for functional annotation that uses the conservation patterns of gene clusters. If several gene clusters show the same coevolution pattern across different genomes, it is reasonable to infer that they are functionally related. The gene cluster phylogenetic profile integrates chromosomal proximity information with phylogenetic profile information, allowing us to infer functional dependences between gene clusters even when they lie far apart on the chromosome.
RESULTS: As a proof of concept, we applied our method to the genome of Escherichia coli strain K12. The method establishes functional relationships among 176 gene clusters comprising 738 E. coli genes. Gene-pair phylogenetic profiles were compared with single-gene phylogenetic profiles and shown to be more accurate. As a result, we are able to suggest functional roles for several E. coli genes and genomic regions of previously unknown function. We also examined the robustness of coevolution signals across a larger set of genomes and suggest a possible upper limit on the accuracy of phylogenetic profile methods.
CONCLUSIONS: Higher-order phylogenetic profiles, such as gene-pair phylogenetic profiles, can detect functional dependences that are missed when conventional single-gene phylogenetic profiles or the chromosomal proximity method are used alone. We show that gene-pair phylogenetic profiles are more accurate than single-gene phylogenetic profiles.
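
The idea of a gene cluster phylogenetic profile can be illustrated with a minimal Python sketch. It is not the authors' implementation: the gene names, genome count, presence/absence values, and the helper functions `cluster_profile`, `hamming`, and `linked_clusters` are all hypothetical. It also assumes a simplified rule in which a cluster counts as present in a genome whenever all of its member genes are present there; the abstract does not state the exact criterion (for example, whether the genes must also remain adjacent in the other genome).

```python
from itertools import combinations

# Hypothetical presence/absence data: one bit per genome (1 = homolog found).
PRESENCE = {
    "geneA": (1, 1, 0, 1, 0, 1),
    "geneB": (1, 1, 0, 1, 0, 0),
    "geneC": (1, 0, 1, 1, 0, 1),
    "geneD": (1, 1, 0, 1, 1, 0),
    "geneE": (1, 1, 0, 1, 1, 0),
    "geneF": (1, 1, 1, 1, 0, 0),
}

# Hypothetical clusters of chromosomally adjacent genes in the query genome.
CLUSTERS = {
    "AB": ("geneA", "geneB"),
    "CD": ("geneC", "geneD"),
    "EF": ("geneE", "geneF"),
}


def cluster_profile(genes, presence):
    """Assumed rule: a cluster is present in a genome only if every member gene is."""
    return tuple(int(all(bits)) for bits in zip(*(presence[g] for g in genes)))


def hamming(p, q):
    """Number of genomes in which two profiles disagree."""
    return sum(a != b for a, b in zip(p, q))


def linked_clusters(profiles, max_distance=0):
    """Pairs of clusters whose profiles differ in at most max_distance genomes."""
    return [
        (a, b)
        for a, b in combinations(sorted(profiles), 2)
        if hamming(profiles[a], profiles[b]) <= max_distance
    ]


PROFILES = {name: cluster_profile(genes, PRESENCE) for name, genes in CLUSTERS.items()}
print(linked_clusters(PROFILES))
```

With this toy data, clusters AB and EF have identical cluster profiles even though no single gene in AB shares a profile with a single gene in EF, which is the kind of dependence a higher-order profile could reveal and a single-gene profile could miss. The exact-match threshold (`max_distance=0`) is likewise an illustrative choice.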

Year:  2002        PMID: 12429059      PMCID: PMC133444          DOI: 10.1186/gb-2002-3-11-research0060

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


  26 in total

1.  Whole-genome annotation by using evidence integration in functional-linkage networks.

Authors:  Ulas Karaoz; T M Murali; Stan Letovsky; Yu Zheng; Chunming Ding; Charles R Cantor; Simon Kasif
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-23       Impact factor: 11.205

2.  Asymmetrical evolution of cytochrome bd subunits.

Authors:  Weilong Hao; G Brian Golding
Journal:  J Mol Evol       Date:  2006-02-10       Impact factor: 2.395

3.  YibK is the 2'-O-methyltransferase TrmL that modifies the wobble nucleotide in Escherichia coli tRNA(Leu) isoacceptors.

Authors:  Alfonso Benítez-Páez; Magda Villarroya; Stephen Douthwaite; Toni Gabaldón; M-Eugenia Armengod
Journal:  RNA       Date:  2010-09-20       Impact factor: 4.942

4.  Rapid pair-wise synteny analysis of large bacterial genomes using web-based GeneOrder4.0.

Authors:  Padmanabhan Mahadevan; Donald Seto
Journal:  BMC Res Notes       Date:  2010-02-23

5.  Phydbac (phylogenomic display of bacterial genes): An interactive resource for the annotation of bacterial genomes.

Authors:  François Enault; Karsten Suhre; Olivier Poirot; Chantal Abergel; Jean-Michel Claverie
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

6.  CoPAP: Coevolution of presence-absence patterns.

Authors:  Ofir Cohen; Haim Ashkenazy; Eli Levy Karin; David Burstein; Tal Pupko
Journal:  Nucleic Acids Res       Date:  2013-06-08       Impact factor: 16.971

7.  The evolutionary dynamics of functional modules and the extraordinary plasticity of regulons: the Escherichia coli perspective.

Authors:  Gabriel Moreno-Hagelsieb; Petar Jokic
Journal:  Nucleic Acids Res       Date:  2012-05-22       Impact factor: 16.971

8.  Gene Cluster Profile Vectors: a method to infer functionally related gene sets by grouping proximity-based gene clusters.

Authors:  Vikas Rao Pejaver; Sun Kim
Journal:  BMC Genomics       Date:  2011-07-27       Impact factor: 3.969

9.  Effect of reference genome selection on the performance of computational methods for genome-wide protein-protein interaction prediction.

Authors:  Vijaykumar Yogesh Muley; Akash Ranjan
Journal:  PLoS One       Date:  2012-07-26       Impact factor: 3.240

10.  Uncovering the co-evolutionary network among prokaryotic genes.

Authors:  Ofir Cohen; Haim Ashkenazy; David Burstein; Tal Pupko
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937
