Jayavel Sridhar, Ziauddin Ahamed Rafi.
Abstract
One of the key challenges in computational genomics is annotating coding genes and identifying regulatory RNAs in complete genomes. This study uses the locations of regulatory RNAs and their conserved flanking genes, identified within the genomic backbone of a template genome, to search for similar RNA locations in query genomes. The search is based on the recently reported coexistence of small RNAs (sRNAs) and their conserved flanking genes in related genomes. Based on our study, 54 additional sRNA locations and the functions of 96 uncharacterized genes are predicted in two draft genomes, viz., Serratia marcescens Db1 and Yersinia enterocolitica 8081. Although most of the identified additional sRNA regions and their corresponding flanking genes are homologous, the proposed anchoring technique also successfully identified four non-homologous sRNA regions in the Y. enterocolitica genome. KEGG Orthology (KO) based automated functional predictions confirm the predicted functions of 65 flanking genes having defined KO numbers, out of the total 96 predictions made by this method. This coexistence-based method shows more sensitivity than controlled vocabularies in locating orthologous gene pairs, even in the absence of defined orthology numbers. All functional predictions made by this study in Y. enterocolitica 8081 were confirmed by the recently published complete genome sequence and annotations. This study also reports possible regions of gene rearrangement in these two genomes; further characterization of such RNA regions could shed more light on their possible role in genome evolution.
Keywords: Bio-ontology; KO; KOBAS; flanking genes; functional annotation; sRNA
Year: 2008 PMID: 18478081 PMCID: PMC2374372 DOI: 10.6026/97320630002284
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1. A multiple sequence alignment of eco, sma and ye genomes using Mauve. (A) The tke1 sRNA and its conserved flanking genes (b2556 and b2557) in eco are observed in block ‘A’. (B) The identified ASL and its corresponding flanking genes (sma3041 and sma3042) are observed to be retained in the common genomic backbone marked ‘B’ in the sma genome. (C) The identified ASL and its corresponding flanking genes (ye1032 and ye1035) are also observed to be retained in the common genomic backbone marked ‘C’ in the ye genome.
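The anchoring idea described in the abstract (using conserved flanking genes of a known sRNA in a template genome to locate a candidate sRNA region in a query genome) can be illustrated with a minimal sketch. This is not the authors' implementation. The `Gene` class, the `predict_srna_region` function, the `max_gap` threshold, the ortholog mapping and all coordinates are hypothetical and chosen only for illustration; the gene identifiers are borrowed from Figure 1.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Gene:
    """A gene with 1-based inclusive coordinates (hypothetical structure)."""
    gene_id: str
    start: int
    end: int


def predict_srna_region(
    left_flank: str,
    right_flank: str,
    orthologs: Dict[str, str],
    query_genes: Dict[str, Gene],
    max_gap: int = 2000,  # assumed cutoff, not taken from the paper
) -> Optional[Tuple[int, int]]:
    """Return the intergenic interval between the query orthologs of the
    two template flanking genes, or None if no candidate can be anchored."""
    left = query_genes.get(orthologs.get(left_flank, ""))
    right = query_genes.get(orthologs.get(right_flank, ""))
    if left is None or right is None:
        return None  # one flanking gene has no ortholog in the query

    # Order the two genes by position so strand or inversion does not matter.
    first, second = sorted((left, right), key=lambda g: g.start)
    gap_start, gap_end = first.end + 1, second.start - 1
    if gap_end < gap_start:
        return None  # genes overlap or abut: no intergenic space
    if gap_end - gap_start + 1 > max_gap:
        return None  # flanks too far apart: possible rearrangement
    return (gap_start, gap_end)


# Illustrative inputs only: coordinates are invented, and the pairing of
# eco flanking genes with sma genes is assumed from Figure 1.
orthologs = {"b2556": "sma3041", "b2557": "sma3042"}
query_genes = {
    "sma3041": Gene("sma3041", 1000, 1900),
    "sma3042": Gene("sma3042", 2200, 3100),
}
candidate = predict_srna_region("b2556", "b2557", orthologs, query_genes)
print(candidate)
```

In this sketch a flanking pair that is missing an ortholog, or whose orthologs lie far apart in the query genome, yields no prediction; such cases would correspond to regions worth inspecting for gene rearrangement rather than to predicted sRNA locations.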