Literature DB >> 15980564

RibEx: a web server for locating riboswitches and other conserved bacterial regulatory elements.

Cei Abreu-Goodger1, Enrique Merino.   

Abstract

We present RibEx (riboswitch explorer), a web server capable of searching any sequence for known riboswitches as well as other predicted, but highly conserved, bacterial regulatory elements. It allows the visual inspection of the identified motifs in relation to attenuators and open reading frames (ORFs). Any of the ORF's or regulatory elements' sequence can be obtained with a click and submitted to NCBI's BLAST. Alternatively, the genome context of all other genes regulated by the same element can be explored with our genome context tool (GeConT). RibEx is available at http://www.ibt.unam.mx/biocomputo/ribex.html.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15980564      PMCID: PMC1160206          DOI: 10.1093/nar/gki445

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Ribonucleic acids have become fashionable lately. Apart from their fundamental participation in transcription and translation, RNAs are clearly some of the most functionally diverse molecules in the cell. Recently, non-translated regions of several mRNAs have been found to be capable of regulating their own expression by binding specific metabolites with high affinity in complete absence of proteins [(1), reviewed in (2)]. These regulatory elements, termed riboswitches, appear to be highly conserved, the extreme case being that of the thiamine pyrophosphate (TPP) riboswitch, which has been found in all three kingdoms of life (3). Riboswitches comprise two parts, a sensing element or aptamer, which forms a complex structure capable of binding the metabolite, and an effector element, or expression platform capable of transforming the signal into a biological response. The aptamer is the most conserved, having been selected to bind an unchanging molecule such as a vitamin or an amino acid. Upon binding, a shift between two mutually exclusive RNA secondary structures in the effector element occurs. These pairs of structures of the expression platform can represent a transcriptional terminator/anti-terminator, a Shine-Dalgarno sequester/anti-sequester or even an active/inactive ribozyme (2,4). It is not uncommon for different organisms to use the same sensing element, yet different effector elements.

FINDING RIBOSWITCHES

Although the usual method to define a riboswitch involves locating a conserved secondary structure in the RNA molecule, the highly restricted nature of the sensing element argues that sequence alone should be enough to locate riboswitches correctly. We have previously developed a computer algorithm capable of finding bacterial regulatory motifs, based exclusively on sequence conservation in the regulatory regions of orthologous groups of genes (5). The main restrictions of our method are that a regulatory element must be closely associated with at least one COG (cluster of orthologous groups of proteins) (6) and it must be present in at least five non-redundant genomes. On the other hand, the advantage is that it is an automatic process, requiring no previous regulatory information to produce relevant results, and as such, can be easily run every time that new genomes or annotations are available. We updated our previous results (5), taking into account 223 complete genomes. From these, a reduced set of 145 non-redundant organisms was obtained using CVtree (7). We were able to recover 10 out of the 11 currently reported riboswitches. Additionally, our results included many regulatory elements that are also known to depend on structured RNA for recognition, such as the Gram-positive T-box and the PyrR protein binding site. We thus call our set of regulatory elements: riboswitch-like elements (RLEs), given the fact that almost all the identified conserved signals were RNA-dependant regulatory elements. RibEx is a web server that allows any user to easily find any RLE in the sequence of his/her interest. Since most known riboswitches are associated with attenuators, we have included the option of searching for transcriptional and translational attenuators, which can help in selecting the most likely candidates, as has been shown by Barrick et al. (4). Additionally, our web server displays representative drawings of the open reading frames (ORFs) and their corresponding regulatory elements, any of which can be selected, in order to acquire its sequence for submission to NCBI's BLAST server (8). Every RLE is linked to a list of genes that are predicted to be subject to its regulation. The genome context of these genes, analyzed with our local GeConT web server (9), in addition to the scores of the pre-computed RLEs, can be of great assistance when evaluating the likelihood of a new prediction. A great resource when working with RNA families is the Rfam database (10). We have used their models to annotate our RLEs. As of version 7.0, Rfam contains a total of 503 families, 125 of them are non-coding, and 11 of these are annotated as riboswitches. We were able to recover automatically all but one of these riboswitches, missing the ykoK element. Our matrices for the most abundant riboswitches perform very well when compared with the co-variance models used by Rfam (∼90% coverage when analyzing bacterial sequences). Less common riboswitches (e.g. lysine and purine) are more difficult to model with sequence-based weight-matrices. Our method thus tends to recover between 70 and 80% of these Rfam members. Our data set also contains six more RLEs that coincide with an Rfam cis-regulating member and 341 RLEs that do not have a match and thus remain as predicted elements. We have calculated a P-value, assuming a hyper-geometrical distribution, for each RLE to be over-represented in a given COG or KEGG pathway (11). Thus, we provide every RLE with a tentative functional assignation. As far as we know there are only two servers, beside ours, that can be used to locate riboswitches in a given sequence: riboswitch finder (12) which, in its current implementation, only searches for the purine-sensing riboswitch, and Rfam, that has an option to locate riboswitches in any sequence, but as co-variance searches have high computational requirements, the sequence length is limited to 2 kb. RibEx, in addition to performing searches on larger sequences, allows the user a greater view of the regulatory potential of his sequence, by showing the ORFs and predicted attenuators. The 341 predicted RLEs also make RibEx a great complement to the curated families contained in Rfam.

THE WEB SERVER

The server is divided into modules, which are written in, and tied together with Perl. A brief description of each module follows: Riboswitch-like elements. The program takes the sequence provided and splits it into overlapping windows of 500 nt. Each of these smaller sequences are searched for the selected RLEs with MAST (13), using matrices obtained as detailed in our previous work (5). Our method defines each RLE as several non-overlapping motifs, so we restrict the search to 500 nt to avoid false positives where the individual motifs are too far apart. When an RLE passes the selected E-value cutoff, the positions, size of each motif and final score of the regulatory element are recorded. Open reading frames. ORFs are predicted, as is commonly done for bacterial genomes. The default options are for a resulting protein of at least 80 amino acids beginning with a start codon (ATG, GTG or TTG) and ending with a stop codon (TAA, TAG or TGA). By default, fully overlapped ORFs are not shown. Attenuators. These are predicted according to an algorithm developed in our group and described elsewhere (14). The predicted secondary structure of each attenuator and its free energy is recorded. Upon clicking on the image of the attenuator, an additional window will be opened showing this information. To avoid false positives, attenuators are only searched for in the region preceding each predicted ORF. Web output. The web page is generated ‘on the fly’ by a Perl script that controls all the other modules. The images are generated using the GD graphics library, and the interactivity between windows and frames is provided with Javascript.

AN EXAMPLE

Figure 1 shows a typical RibEx output. The input sequence was a region of 4000 nt from around the thiC gene of Bacillus cereus ATCC14579. Immediately upstream from one of the ORFs (drawn as blue arrows) the three motifs that comprise the TPP riboswitch (red boxes) can be seen, as well as a transcriptional attenuator (black lollipop). A separate window acts as a figure legend indicating the score for each regulatory element found (in this case, only the TPP riboswitch). A typical scenario might include clicking on the second ORF, and sending the sequence to the BLAST web server, showing it to be identical to several ThiC proteins. Clicking on the TPP riboswitch motif in the figure legend box opens a window with the genes that are predicted to be regulated by this riboswitch, where the user can see how the motifs are distributed in different genomes. Taken together, and strengthened by the presence of a transcriptional attenuator, the user would have no trouble at all concluding that his sequence contains a bona fide riboswitch.
Figure 1

RibEx locates a thiamine riboswitch.

  14 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Metabolite-binding RNA domains are present in the genes of eukaryotes.

Authors:  Narasimhan Sudarsan; Jeffrey E Barrick; Ronald R Breaker
Journal:  RNA       Date:  2003-06       Impact factor: 4.942

3.  Whole proteome prokaryote phylogeny without sequence alignment: a K-string composition approach.

Authors:  Ji Qi; Bin Wang; Bai-Iin Hao
Journal:  J Mol Evol       Date:  2004-01       Impact factor: 2.395

Review 4.  The riboswitch control of bacterial metabolism.

Authors:  Evgeny Nudler; Alexander S Mironov
Journal:  Trends Biochem Sci       Date:  2004-01       Impact factor: 13.807

5.  Rfam: an RNA family database.

Authors:  Sam Griffiths-Jones; Alex Bateman; Mhairi Marshall; Ajay Khanna; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

6.  Riboswitch finder--a tool for identification of riboswitch RNAs.

Authors:  Peter Bengert; Thomas Dandekar
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

7.  Conserved regulatory motifs in bacteria: riboswitches and beyond.

Authors:  Cei Abreu-Goodger; Nancy Ontiveros-Palacios; Ricardo Ciria; Enrique Merino
Journal:  Trends Genet       Date:  2004-10       Impact factor: 11.639

8.  GeConT: gene context analysis.

Authors:  R Ciria; C Abreu-Goodger; E Morett; E Merino
Journal:  Bioinformatics       Date:  2004-04-08       Impact factor: 6.937

Review 9.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

10.  Thiamine derivatives bind messenger RNAs directly to regulate bacterial gene expression.

Authors:  Wade Winkler; Ali Nahvi; Ronald R Breaker
Journal:  Nature       Date:  2002-10-16       Impact factor: 49.962

View more
  72 in total

Review 1.  Comparative genomic reconstruction of transcriptional regulatory networks in bacteria.

Authors:  Dmitry A Rodionov
Journal:  Chem Rev       Date:  2007-07-18       Impact factor: 60.622

2.  Metatranscriptomics reveals unique microbial small RNAs in the ocean's water column.

Authors:  Yanmei Shi; Gene W Tyson; Edward F DeLong
Journal:  Nature       Date:  2009-05-14       Impact factor: 49.962

3.  Computational identification of riboswitches based on RNA conserved functional sequences and conformations.

Authors:  Tzu-Hao Chang; Hsien-Da Huang; Li-Ching Wu; Chi-Ta Yeh; Baw-Jhiune Liu; Jorng-Tzong Horng
Journal:  RNA       Date:  2009-05-21       Impact factor: 4.942

Review 4.  Biochemical features and functional implications of the RNA-based T-box regulatory mechanism.

Authors:  Ana Gutiérrez-Preciado; Tina M Henkin; Frank J Grundy; Charles Yanofsky; Enrique Merino
Journal:  Microbiol Mol Biol Rev       Date:  2009-03       Impact factor: 11.056

5.  Expression, purification and preliminary X-ray diffraction studies of the transcriptional factor PyrR from Bacillus halodurans.

Authors:  Rodrigo Arreola; Anita Vega-Miranda; Armando Gómez-Puyou; Ruy Pérez-Montfort; Enrique Merino-Pérez; Alfredo Torres-Larios
Journal:  Acta Crystallogr Sect F Struct Biol Cryst Commun       Date:  2008-07-05

Review 6.  Riboswitch RNAs: using RNA to sense cellular metabolism.

Authors:  Tina M Henkin
Journal:  Genes Dev       Date:  2008-12-15       Impact factor: 11.361

7.  YlxM is a newly identified accessory protein that influences the function of signal recognition particle pathway components in Streptococcus mutans.

Authors:  Matthew L Williams; Paula J Crowley; Adnan Hasona; L Jeannine Brady
Journal:  J Bacteriol       Date:  2014-03-21       Impact factor: 3.490

Review 8.  Computational analysis of riboswitch-based regulation.

Authors:  Eric I Sun; Dmitry A Rodionov
Journal:  Biochim Biophys Acta       Date:  2014-02-28

9.  Riboswitch control of gene expression in plants by splicing and alternative 3' end processing of mRNAs.

Authors:  Andreas Wachter; Meral Tunc-Ozdemir; Beth C Grove; Pamela J Green; David K Shintani; Ronald R Breaker
Journal:  Plant Cell       Date:  2007-11-09       Impact factor: 11.277

10.  The MarR-Type Regulator Rdh2R Regulates rdh Gene Transcription in Dehalococcoides mccartyi Strain CBDB1.

Authors:  Lydia Krasper; Hauke Lilie; Anja Kublik; Lorenz Adrian; Ralph Golbik; Ute Lechner
Journal:  J Bacteriol       Date:  2016-11-04       Impact factor: 3.490

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.