| Literature DB >> 21672960 |
Florian Eggenhofer1, Hakim Tafer, Peter F Stadler, Ivo L Hofacker.
Abstract
Bacterial genomes encode a plethora of small RNAs (sRNAs), which are heterogeneous in size, structure and function. Most sRNAs act as post-transcriptional regulators by means of specific base pairing interactions with the 5'-untranslated region of mRNA transcripts, thereby modifying the stability of the target transcript and/or its ability to be translated. Here, we present RNApredator, a web server for the prediction of sRNA targets. The user can choose from a set of over 2155 genomes and plasmids from 1183 bacterial species. RNApredator then uses a dynamic programming approach, RNAplex, to compute putative targets. Compared to web servers with a similar task, RNApredator takes the accessibility of the target during the target search into account, improving the specificity of the predictions. Furthermore, enrichment in Gene Ontology terms, cellular pathways as well as changes in accessibilities along the target sequence can be done in fully automated post-processing steps. The predictive performance of the underlying dynamic programming approach RNAplex is similar to that of more complex methods, but needs at least three orders of magnitude less time to complete. RNApredator is available at http://rna.tbi.univie.ac.at/RNApredator.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21672960 PMCID: PMC3125805 DOI: 10.1093/nar/gkr467
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Results page. After the target search is completed, RNApredator presents the list of the 100 best interactions. Each line contains the rank, total energy of interaction, corresponding Z-score, duplex structure in dot-bracket format, interaction coordinates on the sRNA and mRNA, gene annotation, locus tag, strand, genomic coordinates of the target, NCBI accession number as well as the type of replicon the target is located on. Results can be filtered based on the coordinates of the target locations (for up to 500 interactions). Moreover, it is possible to limit the displayed interactions to the 25, 50, 75, 100, 500 best interactions. Finally, the complete results table can be downloaded in .csv or raw RNAplex format.
Figure 2.Post-processing page: detailed information about interactions selected on the results page. (a) The upper part of the three GO term class tables, i.e. biological process, molecular function or cellular component. The 20 most significant GO terms are shown in a separate table. Each line of the table contains the GO term ID, human readable term, total number of annotated genes linked to the GO term, number of targets selected linked to the GO term, expected number of targets linked to the GO term and P-value of the GO term enrichment. (b)The selected interactions are shown in detail; Relevant duplex characteristics are recapitulated and a graphical representation of the duplex structure is shown. mRNA and sRNA sequences can be downloaded in .fasta format. The Calculate-link enables the user to get a plot of the opening energy for all stretches of 4 nt for the region around the start codon before (red line) and after (green line) the sRNA binding. The accessibility difference is shown in (black line). [(c) rpoS-RprA, (d) rpoS-DsrA]. The 5′-end of the start codon is represented with a cyan line and the interaction site with two blue lines. Recalculation of the duplex is possible by using the Calculate-link to RNAup web server.
Summary of TargetRNA and RNApredator ranking of 30 experimentally confirmed interactions
| Genome | Species | sRNA | mRNA | Gene | ||
|---|---|---|---|---|---|---|
| NC_000964 | B.s. | FsrA | sdhC | BSU28450 | NF(NF) | 153(83) |
| NC_011601 | E.c.O | OmrA | ompR | b3405 | NF(NF) | 436(49) |
| NC_011601 | E.c.O | OmrA | ompT | b0565 | NF(NF) | 712(93) |
| NC_011601 | E.c.O | OmrB | ompR | b3405 | NF(31) | 312(39) |
| NC_011601 | E.c.O | OmrB | ompT | b0565 | NF(NF) | 210(13) |
| NC_000913 | E.c.K. | CyaR | ompX | b0814 | NF(NF) | 495(86) |
| NC_000913 | E.c.K. | CyaR | yqaE | b2666 | NF(NF) | 541(97) |
| NC_000913 | E.c.K. | DsrA | hns | b1237 | 52(6) | 8(4) |
| NC_000913 | E.c.K. | FnrS | metE | b3829 | 5(8) | 120(37) |
| NC_000913 | E.c.K. | FnrS | sodB | b1656 | 24(21) | 615(192) |
| NC_000913 | E.c.K. | GcvB | cycA | b4208 | 37(5) | 41(10) |
| NC_000913 | E.c.K. | IstR | tisB | b4405 | 2(NF) | NF(NF) |
| NC_000913 | E.c.K. | MicA | phoP | b1130 | 80(23) | 57(10) |
| NC_000913 | E.c.K. | MicC | ompC | b2215 | 2(5) | 2(2) |
| NC_000913 | E.c.K. | MicF | ompF | b0929 | 43(5) | 2(2) |
| NC_000913 | E.c.K. | OmrA | gntP | b4321 | NF(NF) | 79(17) |
| NC_000913 | E.c.K. | OmrB | csgD | b1040 | 50(NF) | 2(NF) |
| NC_000913 | E.c.K. | RseX | ompC | b2215 | 98(NF) | 504(238) |
| NC_000913 | E.c.K. | RyhB | iscS | b2530 | NF(NF) | 123(30) |
| NC_000913 | E.c.K. | RyhB | sodB | b1656 | 24(21) | 184(52) |
| NC_000913 | E.c.K. | SgrS | ptsG | b1101 | NF(NF) | 5(1) |
| NC_003210 | L.m. | LhrA | lmo085 | lmo0850 | NF(NF) | 31(NF) |
| NC_002505 | V.c. | MicX | vca0620 | vca0620 | NF(34) | 48(7) |
| NC_002505 | V.c. | Qrr1 | luxO | vca1021 | NF(NF) | 196(44) |
| NC_002505 | V.c. | Qrr1 | vca0939 | vca0939 | NF(NF) | 5(NF) |
| NC_002505 | V.c. | Qrr2 | luxO | vca0620 | NF(NF) | 12(NF) |
| NC_002505 | V.c. | Qrr2 | vca0939 | vca0939 | NF(NF) | 3(NF) |
| NC_002505 | V.c. | Qrr3 | vca0939 | vca0939 | NF(NF) | 4(NF) |
| NC_002505 | V.c. | Qrr4 | vca0939 | vca0939 | NF(NF) | 4(NF) |
| NC_002505 | V.c. | VrrA | tcpA | vca0838 | 35(NF) | 246(71) |
The first column contains the NCBI accession ID of the species, the species name is indicated in the second column. The third and fourth columns contain the sRNA and mRNA gene tag, the fifth column shows the locus tag and the sixth and seventh columns contain the rank of the interaction for TargetRNA and RNApredator. In the last two columns, the number in parenthesis corresponds to the rank when the target search is constrained to a region located 30 nt upstream and 20 nt downstream of the start codon, while the other numbers correspond to the rank for the region spanning 150 nt upstream and 100 nt downstream of the start codon. NF stands for not found (TargetRNA does not return targets with a rank >100, and RNApredator hits also contain suboptimal interactions) B.s. is Bacillus subtilis subsp. subtilis str. 168, E.c.O is Escherichia coli O127:H6 str. E2348/69, E.c.K. is Escherichia coli str. K-12 substr. MG1655, L.m. is Listeria monocytogenes EGD-e and V.c. is Vibrio cholerae O1 biovar El Tor str. N16961.