| Literature DB >> 17254314 |
Sarita Ranjan1, Ranjit Kumar Gundu, Akash Ranjan.
Abstract
BACKGROUND: A key post genomics challenge is to identify how genes in an organism come together and perform physiological functions. An important first step in this direction is to identify transcriptional units, operons and regulons in a genome. Here we implement and report a strategy to computationally identify transcriptional units and operons of mycobacteria and construct a database-MycoperonDB. DESCRIPTION: We have predicted transcriptional units and operons in mycobacteria and organized these predictions in the form of relational database called MycoperonDB. MycoperonDB database at present consists of 18,053 genes organized as 8256 predicted operons and transcriptional units from five closely related species of mycobacteria. The database further provides literature links for experimentally characterized operons. All known promoters and related information is collected, analysed and stored. It provides a user friendly interface to allow a web based navigation of transcription units and operons. The web interface provides search tools to locate transcription factor binding DNA motif upstream to various genes. The reliability of operon prediction has been assessed by comparing the predicted operons with a set of known operons.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17254314 PMCID: PMC1764487 DOI: 10.1186/1471-2105-7-S5-S9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Search form of MycoperonDB. SearchDB is a html form which takes input from the user. The query, which can be the gene name, ORF number, COG value or motif sequence, as given in the example, should be provided in the text box. In accordance with the query, user should select the nature of the query and the species of interest by clicking appropriate radio buttons before clicking the search button.
Figure 2Output page of MycoperonDB. A typical output html page which shows the result of the user's query. The query in this case was ORF number 2243 and the species selected was M.tuberculosis H37Rv. The output of a search has two parts: a table and a drawing. The table shows that the query ORF is part of an operon that consists of 5 genes. The last but one column of the table shows that this is a known operon and it is hyperlinked to the relevant pubmed ID which in this case is 12464486. The last column of the table provides a quick hyperlink to gene/protein sequences of the listed operon. Each gene that is part of selected operon is drawn as maroon colored rectangle with its ORF number written on it. The drawing has a grey arrow head which depicts the forward or reverse orientation of the operonic genes on the genome.