| Literature DB >> 19956731 |
Konstantinos Mavromatis1, Ken Chu, Natalia Ivanova, Sean D Hooper, Victor M Markowitz, Nikos C Kyrpides.
Abstract
Computational methods for determining the function of genes in newly sequenced genomes have been traditionally based on sequence similarity to genes whose function has been identified experimentally. Function prediction methods can be extended using gene context analysis approaches such as examining the conservation of chromosomal gene clusters, gene fusion events and co-occurrence profiles across genomes. Context analysis is based on the observation that functionally related genes are often having similar gene context and relies on the identification of such events across phylogenetically diverse collection of genomes. We have used the data management system of the Integrated Microbial Genomes (IMG) as the framework to implement and explore the power of gene context analysis methods because it provides one of the largest available genome integrations. Visualization and search tools to facilitate gene context analysis have been developed and applied across all publicly available archaeal and bacterial genomes in IMG. These computations are now maintained as part of IMG's regular genome content update cycle. IMG is available at: http://img.jgi.doe.gov.Entities:
Mesh:
Year: 2009 PMID: 19956731 PMCID: PMC2776528 DOI: 10.1371/journal.pone.0007979
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistics of conserved chromosomal cassettes in IMG 2.8.
| No of conserved cassettes | No of genes | % of total protein coding genes | |
| COGs | 8,653,081 | 3,114,028 | 68.71% |
| Pfam | 29,858,323 | 3,222,379 | 71.10% |
| IMG ortholog | 2,760,407 | 3,871,625 | 85.43% |
Figure 1Gene specific chromosomal cassette details and viewers.
Figure 2Percentage of protein families found in the same KEGG pathway at different correlation score levels.
Figure 3Phylogenetic Profiler for Gene Cassettes.
Figure 4Separation of function of paralogous genes based on information of their chromosomal context.