| Literature DB >> 20691398 |
Jayavel Sridhar1, Govindaraj Sowmiya, Kanagaraj Sekar, Ziauddin Ahamed Rafi.
Abstract
Small RNAs (sRNAs) are non-coding transcripts exerting their functions in the cells directly. Identification of sRNAs is a difficult task due to the lack of clear sequence and structural biases. Most sRNAs are identified within genus specific intergenic regions in related genomes. However, several of these regions remain un-annotated due to lack of sequence homology and/or potent statistical identification tools. A computational engine has been built to search within the intergenic regions to identify and roughly annotate new putative sRNA regions in Enterobacteriaceae genomes. It utilizes experimentally known sRNA data and their flanking genes/KEGG Orthology (KO) numbers as templates to identify similar sRNA regions in related query genomes. The search engine not only has the capability to locate putative intergenic regions for specific sRNAs, but also has the potency to locate conserved, shuffled or deleted gene clusters in query genomes. Because it uses the KO terms for locating functionally important regions such as sRNAs, any further KO number assignment to additional genes will increase the sensitivity. The PsRNA server is used for the identification of putative sRNA regions through the information retrieved from the sRNA of interest. The computing engine is available online at http://bioserver1.physics.iisc.ernet.in/psrna/ and http://bicmku.in:8081/psrna/. Copyright 2010 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20691398 PMCID: PMC5054453 DOI: 10.1016/S1672-0229(10)60014-9
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Figure 1Flow chart of the methodology used in the identification of putative sRNA locations within the intergenic regions of the query genomes.
Reference and query genomes used
| No. | Organism | Genome code | GenBank ID | Gene ID code |
|---|---|---|---|---|
| 1 | NC_000913 | Bxxxx | ||
| 2 | NC_008253 | ECP_xxxx | ||
| 3 | NC_008563 | APECO1_xxxx | ||
| 4 | NC_007946 | UTI89_Cxxxx | ||
| 5 | NC_006905 | SCxxxx | ||
| 6 | NC_007613 | SBO_xxxx | ||
| 7 | NC_007606 | SDY_xxxx | ||
| 8 | NC_007384 | SSON_xxxx | ||
| 9 | NC_008258 | SFV_xxxx | ||
| 10 | NC_007712 | SGxxxx | ||
| 11 | NC_008150 | YPA_xxxx | ||
| 12 | NC_008149 | YPN_xxxx | ||
| 13 | NC_008800 | YExxxx | ||
| 14 | NC_009381 | YPDSF_xxxx | ||
| 15 | NC_009708 | YPSIP31758_xxxx | ||
| 16 | NC_007292 | BPEN_xxxx | ||
| 17 | NC_005061 | BFLxxx | ||
| 18 | NC_008513 | BCC_xxx | ||
| 19 | NC_009648 | KPN_xxxxx | ||
| 20 | NC_009436 | ENT638_xxxx | ||
| 21 | NC_000091 | JWxxxx |
Gene ID codes are as per KEGG database (.
Reference genome used in this study (gray shade).
Figure 2A snap shot of the results page from PsRNA server for spf sRNA with eco reference and ecp as the query genome. The putative sRNA region is identified between flanking genes ECP_4074 and ECP_4075. The KO pair obtained based on the reference genome eco is also displayed.