RNA structures regulating ribosomal protein biosynthesis in bacilli.

Kaila Deiorio-Haggar, Jon Anthony, Michelle M Meyer.

Abstract

In Bacilli, there are three experimentally validated ribosomal-protein autogenous regulatory RNAs that are not shared with E. coli. Each of these RNAs forms a unique secondary structure that interacts with a ribosomal protein encoded by a downstream gene, namely S4, S15, or L20. Only one of these RNAs, the one that interacts with L20, is currently found in the RNA Families Database (Rfam). We created structural alignments, or modified existing ones, for these three RNAs and used them to perform homology searches. We determined that each structure exhibits a narrow phylogenetic distribution, mostly restricted to the Firmicute class Bacilli. This work, in conjunction with other similar work, demonstrates that there are most likely many non-homologous RNA regulatory elements regulating ribosomal protein biosynthesis that still await discovery and characterization in other bacterial species.

Keywords:  Bacillus subtilis; Geobacillus stearothermophilus; Infernal; Rfam; gram-positive; ribosomal leader sequence; ribosomal protein

Year:  2013        PMID: 23611891      PMCID: PMC3849166          DOI: 10.4161/rna.24151

Source DB:  PubMed          Journal:  RNA Biol        ISSN: 1547-6286            Impact factor:   4.652


Introduction

There is a wide variety of bacterial regulatory RNAs, ranging from riboswitches that require complex secondary- and tertiary-structure motifs for function to sRNAs that act predominantly through base-pairing interactions. Some of the first regulatory RNAs to be described are those that autogenously regulate ribosomal protein biosynthesis in Escherichia coli. These regulatory RNAs typically occur within 5′-untranslated or intergenic regions of transcripts encoding ribosomal proteins. When transcribed, these RNA sequences form secondary structures that interact with a specific ribosomal protein binding partner to regulate an entire ribosomal protein operon. The mechanism of gene regulation can be either transcriptional or translational, thus allowing these mRNA structures to act as a means of feedback inhibition. To date, over 10 such RNAs regulating more than half of ribosomal protein genes have been described in E. coli. However, recent work has shown that most of these RNAs are narrowly distributed to Gammaproteobacteria.

Progress toward understanding the regulation of ribosomal protein biosynthesis in gram-positive bacteria, including model organisms such as Bacillus subtilis, has been much more limited. Ribosomal proteins are typically universally distributed and well conserved across bacterial phyla. In addition, the more than 50 genes encoding ribosomal proteins occur in long multi-gene operons whose structure is largely, but not completely, conserved. Despite the universal nature of these proteins, their regulation does not appear to be conserved. Of the E. coli ribosomal protein regulatory RNAs, only two (interacting with ribosomal proteins L1 and L10) are widely present in Firmicutes (> 85% of all sequenced Firmicutes). A third RNA structure (interacting with ribosomal protein S2) has been identified in > 50% of sequenced Firmicutes. In addition to these widely distributed RNAs, RNA structures interacting with ribosomal proteins S4, S15 and L20 have been identified and experimentally validated in the Firmicute class Bacilli. While each of these ribosomal proteins is also a regulator in E. coli, the RNAs show little or no homology to the E. coli RNAs with the same function. Currently, an Rfam alignment is available for only one of these RNAs (L20, RF00558). While several additional putative RNA structures associated with ribosomal proteins have been identified in comparative genomic studies, none have been experimentally validated.

We utilized the RNA homology search program Infernal, coupled with our high-capacity genomic context visualization tool, to identify homologs of the three experimentally validated ribosomal-protein autogenous regulatory RNAs found in Bacilli. We use the alignments produced in this work to assess the phylogenetic distribution of these RNA structures and to integrate experimental data with comparative genomics, gaining new insight into the structure and function of these RNAs.

Results

L20-interacting RNA

The RNA structure interacting with ribosomal protein L20 to regulate the infC-L35-L20 operon was discovered independently by two studies at approximately the same time. One study experimentally analyzed the RNA, while the other identified the RNA structure through a comparative genomics approach. In contrast to the L20-interacting RNA present in Gammaproteobacteria, the L20-interacting RNA in B. subtilis regulates transcription rather than translation, and does not directly precede the gene encoding its binding partner, rplT. Instead, the protein-binding site is located at the start of the operon, preceding and regulating the genes infC, rpmI and rplT. The current Rfam alignment for this RNA (Rfam: RF00558), generated by comparative genomics in Yao et al. and representing the L20-bound form, was used as the starting alignment for this study. The L20-interacting RNA structure consists of three pairing elements (Fig. 1), including a terminator stem that was experimentally confirmed using in vitro transcription termination assays. A fourth pairing element directly preceding the terminator was noted in previous studies. This fourth stem is present in > 75% of species in the alignment, but shows no conservation of individual nucleotides. In addition, nuclease footprinting assays indicate it is unlikely to be involved in L20 binding (Fig. 1).

Figure 1. Consensus sequence and secondary structures of Bacilli ribosomal regulatory elements. Start codons (AUG) are depicted inside a black box when occurring within the RNA structure. Gray boxes indicate areas of high conservation and possible binding. Dotted boxes surround areas proposed to be important for binding. Co-varying base pairs are shaded red or green only when Watson-Crick base pairing is in > 95% of the aligned sequences. Helix numbering is consistent with previously published data for each RNA.

Toe-printing and RNase probing assays have determined that L20 binds specifically at the junction of Helices 1 and 2, stabilizing Helix 2 in the process. This region of the RNA bears striking resemblance to the L20 binding site on the 23S rRNA, and our studies show that this region is highly conserved, with few or no mutations. The terminator stem does not appear to be involved in L20 binding, and there is co-variation throughout the stem, indicating that the secondary structure, rather than the sequence, is important for function. There is also a rigorously conserved pair of adenosines before the start of the first helix, but their effect on binding is unknown. Of the RNAs examined here, the L20-binding RNA has the greatest penetration in sequenced Firmicutes (Fig. 2). It is found in most Bacilli and in many Clostridia species. A few homologs of the L20-interacting RNA are also identified in Thermotogae and Actinobacteria. While Thermotogae are relatively closely related to Firmicutes, the presence in Actinobacteria species suggests possible horizontal transfer events.

Figure 2. Phylogenetic distribution of Bacilli autogenous ribosomal regulators. (A) Distribution of autogenous regulators of ribosomal protein synthesis in eubacterial phyla. (B) Distribution of autogenous regulators of ribosomal protein synthesis for classes within the phylum Firmicutes.

S15-interacting RNA

The mRNA structure interacting with ribosomal protein S15 regulates only rpsO. Like its E. coli counterpart, the RNA structure identified in Bacilli overlaps with the beginning of the coding region for rpsO. The RNA was first identified in B. stearothermophilus (subsequently reclassified as Geobacillus stearothermophilus), and the RNA-protein interaction was initially validated experimentally using in vitro approaches. Regulatory activity of the RNA was subsequently demonstrated using E. coli as a surrogate organism. There is no Rfam alignment for this RNA, nor has it been identified in previous comparative genomic studies. For this study, we manually constructed a starting alignment consisting of the 3-helix junction necessary for binding and regulation, using BLAST to identify the initial homologs. The starting alignment contained sequences from several Geobacillus species, as well as hand-aligned portions of the genomic regions preceding rpsO in B. subtilis and Caldicellulosiruptor bescii DSM 6725 (a member of Clostridia).

Utilizing this initial alignment, we were able to identify the S15-binding RNA in most Bacilli species and in both sequenced Negativicutes species (Fig. 2). However, its incidence in Clostridia is considerably lower, resulting in a lower overall frequency in Firmicutes. We also identified sequences in Deinococci and Fusobacteria (two sequences in each phylum), suggesting potential horizontal transfer. However, it is difficult to draw a definitive conclusion on this matter because of the small number of putative homologs and the lack of any experimental data to verify them.

Although alternative structures may exist, our final secondary structure is presumably stabilized by interactions with S15 (Fig. 1). Consistent with deletion studies suggesting that the length of Helix 1 may vary, the sequence of Helix 1 varies in naturally occurring examples, and its length ranges from nine to 17 base pairs. The putative “AUG” start codon within the loop of that helix (Fig. 1, black box) is highly conserved, appearing in > 97% of all sequences. Like Helix 1, Helix 2 shows nucleotide sequence variability, but base pairing throughout the stem is largely maintained. While deletion studies have shown that the H2 helix may be reduced to 29 nucleotides and still retain functionality, our alignment shows that the full-length stem is maintained in > 90% of all sequences. A consecutive set of “G-C” and “G·U” pairs in H2, appearing as “G-C” and “R·Y” base pairs in Figure 1 due to sequence variability, were both expected to be highly conserved, as both base pairs are reported to be important for binding. However, only the “G-C” pair showed > 90% conservation, while the “G·U” pair (“R·Y” in Fig. 1) exhibited much greater sequence variability, though base pairing was maintained. Helix 3 includes a conserved “GGAGG” that, based on its location relative to the conserved “AUG,” is likely part of the Shine-Dalgarno sequence.
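The distinction drawn above between conservation of nucleotide identity and maintenance of base pairing can be illustrated with a small sketch. The Python below is not the code used in this study: the function name `pairing_stats`, the toy sequences, and the column indices are hypothetical, and a real analysis would take the paired columns from the structure annotation of the alignment.

```python
from typing import Dict, Sequence, Set, Tuple

# Canonical Watson-Crick pairs and G.U wobble pairs, as (5' base, 3' base).
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE = {("G", "U"), ("U", "G")}


def pairing_stats(seqs: Sequence[str], i: int, j: int) -> Dict[str, object]:
    """Summarize pairing between alignment columns i and j (0-based).

    Assumes all sequences are aligned RNA strings of equal length.
    Gap characters or any other non-pairing combination count as unpaired.
    """
    if not seqs:
        raise ValueError("alignment is empty")
    wc = wobble = 0
    pair_types: Set[Tuple[str, str]] = set()
    for s in seqs:
        pair = (s[i].upper(), s[j].upper())
        if pair in WATSON_CRICK:
            wc += 1
            pair_types.add(pair)
        elif pair in WOBBLE:
            wobble += 1
            pair_types.add(pair)
    n = len(seqs)
    return {
        # Fraction of sequences with a Watson-Crick pair at (i, j).
        "wc_fraction": wc / n,
        # Fraction with any pair, counting G.U wobble as paired.
        "paired_fraction": (wc + wobble) / n,
        # More than one distinct pair type means identity varies while pairing holds.
        "covarying": len(pair_types) > 1,
    }


# Hypothetical four-sequence alignment; columns 1 and 4 form the pair of interest.
toy_alignment = ["GGAUCC", "GCAUGC", "GUAUAC", "GGAUUC"]
print(pairing_stats(toy_alignment, 1, 4))
```

In this toy case every sequence pairs at the chosen columns, but only three of four do so with a Watson-Crick pair, which mirrors the situation described for the “R·Y” position: variable sequence, maintained pairing.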

S4-interacting RNA

The B. subtilis S4-interacting RNA regulates only the gene encoding S4, rpsD. In E. coli, the operon containing S4 also contains four additional ribosomal genes, and the S4 protein regulates the synthesis of all of them. In B. subtilis, this gene cluster does not include rpsD. Rather, rpsD is at a different location in the genome and is likely to be the only gene regulated by this RNA. B. subtilis S4 represses its own synthesis post-transcription initiation, but the mechanism of action remains unknown. The S4 protein is known to interact with 16S rRNA, and parts of the B. subtilis 5′-UTR (5′-untranslated region) have sequence and structural similarity to 16S rRNA. The S4-interacting RNA is found only in Firmicutes, Tenericutes and Thermotogae, although within Firmicutes, the RNA does not appear in Clostridia or in the two Negativicutes genomes analyzed (Fig. 2).

The starting alignment used here originated from the supplementary material of a comparative genomic screen of Firmicutes. However, the structure derived from comparative genomics did not match the experimental structure proposed by Grundy et al. In particular, the structure derived from comparative genomics incorporates sequence demonstrated to have no regulatory activity. For the work presented here, the structure was manually edited to match the structural prediction based on experimental data. This structure contains two hairpins with variable bulges and stems (Fig. 1). We found the hairpin branching from the first helix to be especially variable, both in sequence and in presence. The “GUAA” bulge (Fig. 1, gray box) remains conserved and is proposed to interact with the S4 protein, as it has sequence identity to a similar bulge on the 16S rRNA. Also, in most sequences aligned, the Shine-Dalgarno sequence follows relatively closely downstream of the second variable helix of the RNA (Fig. 1). We evaluated the presence of two pseudoknots proposed in the original description of this RNA structure. Mutations to these pseudoknots did not affect regulation, and our alignment shows no support for them.

We also performed homology searches with the original alignment derived from comparative genomics by Yao and coworkers, as this structure has the potential to be an alternative non-interacting conformation for the RNA. The final alignments of the two possible structures share significant taxonomic overlap (172 out of 177 species), indicating that both structures are possible in most species. The majority of the alternative structure is quite similar to the S4-binding structure. H1 is largely intact in the alternative structure, with a conserved “GUAA” bulge near the top of the stem, as well as similarly conserved loops. There are some minor changes to the base pairing of H1, causing the protein binding site (“GUAA”) and the top loop to be smaller than their counterparts in the S4-binding structure. More dramatically, the alternative structure lacks the H2 stem, and instead has an elongated H1 stem that partially overlaps the existing H2. In order to elongate the H1 stem, the alternative structure includes a 5′ extension of 25–30 nucleotides. Although this 5′ extension corresponds exactly with the transcriptional start in B. subtilis, deletion of this region was shown to have no impact on S4 protein binding. An alignment and secondary structure diagram for this alternative structure are included in the supplementary data.

Discussion

Scientists have been aware of autogenous regulators of ribosomal protein synthesis since 1980. While there is a good understanding of the RNA structures that regulate ribosomal protein biosynthesis in E. coli, our knowledge of these elements outside of E. coli is sorely lacking. While three of the E. coli RNA regulators are widely distributed, the majority are not. In Bacilli, experimentally validated RNA structures interacting with S4, S15 and L20 are known to regulate ribosomal proteins. These structures show no homology to RNAs interacting with homologous proteins from E. coli. This study created alignments for two of the three RNAs unique to Bacilli and assessed the homologs in the context of previous experimental results. We have found that the three regulatory elements examined (interacting with S4, S15 and L20) have a narrow evolutionary distribution, so narrow in the case of S4 as to exclude the Firmicute class Clostridia. Based on various experimental analyses, it is apparent that there are multiple evolutionarily distinct regulatory RNAs responding to the same ribosomal protein in different bacterial phyla. Furthermore, most of the characterized RNA structures responsible for regulating ribosomal protein biosynthesis appear to be narrowly distributed, and comparative genomic studies have discovered a number of putative RNA structures associated with ribosomal proteins in Firmicutes and other bacterial species that have yet to be verified. The combination of these observations strongly suggests that there are many distinct RNA structures responsible for ribosomal protein regulation that have yet to be identified and experimentally characterized. In the future, identifying and validating non-homologous regulatory RNAs that likely have the same function in non-model organisms will lead to a more comprehensive understanding of each RNA-protein interaction and to elucidation of the evolutionary trajectories for these regulatory RNAs.

Materials and Methods

Initial multiple sequence alignments were obtained as described below for each RNA. The seed alignment for the L20-binding RNA was downloaded from the Rfam database (Rfam family RF00558), the alignment for the S4-binding RNA was obtained from the supplementary material of Yao et al. (2007), and the alignment for the S15-binding RNA was created manually from BLAST matches (completed genomes only) to the G. stearothermophilus sequence of interest and hand alignments of a few selected sequences, as noted in the main text. The alignments were all manually edited to remove sequences not compatible with published experimental data and to adjust base pairing. Any changes to base pairing are discussed in the main text. Covariance models for each RNA were constructed and calibrated using Infernal 1.0 (cmbuild, cmcalibrate), and homologs were identified for each RNA (cmsearch). Cmsearch was performed against a custom sequence database, as described in Fu et al., using a lenient E-value cut-off. Potential homologs were assessed on the basis of genomic context, using a custom visualization tool (GenomeChart), and for fit to the existing alignment. Alignments were manually adjusted as necessary when sequences with variable-length helices and/or loops were added. The search process was repeated three to four times per multiple sequence alignment to expand sequence diversity. The counts for Figure 2 were calculated from the number of completed genomes within refseq46, based on the final alignments, using queries to our custom database. Consensus secondary structure diagrams were created from the alignments using GSC-weighting in R2R.
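One round of the build, calibrate and search cycle described above might be scripted roughly as follows. This is only a sketch under stated assumptions, not the pipeline used in this study: the file names and the E-value of 10 are hypothetical (the text specifies only a "lenient" cut-off), the `-E` option should be checked against the user guide of the installed Infernal version, and the manual curation and genomic-context review between rounds are not represented.

```python
import shutil
import subprocess
from pathlib import Path
from typing import List


def infernal_commands(alignment: Path, model: Path, database: Path,
                      evalue: float) -> List[List[str]]:
    """Return the three command lines for one search round.

    Argument order follows the general `program [options] cmfile otherfile`
    pattern of the Infernal tools; the -E threshold is an assumed value.
    """
    return [
        ["cmbuild", str(model), str(alignment)],        # build covariance model
        ["cmcalibrate", str(model)],                    # calibrate E-value statistics
        ["cmsearch", "-E", str(evalue), str(model), str(database)],  # find homologs
    ]


def run_round(alignment: Path, model: Path, database: Path,
              evalue: float = 10.0) -> str:
    """Run one round and return the raw cmsearch output for manual review."""
    if shutil.which("cmbuild") is None:
        raise RuntimeError("Infernal does not appear to be installed on PATH")
    output = ""
    for cmd in infernal_commands(alignment, model, database, evalue):
        result = subprocess.run(cmd, check=True, capture_output=True, text=True)
        output = result.stdout  # the last command run is cmsearch
    return output


# Print the commands for a hypothetical S15 alignment without executing them.
for cmd in infernal_commands(Path("s15.sto"), Path("s15.cm"),
                             Path("genomes.fa"), 10.0):
    print(" ".join(cmd))
```

In practice, each round would be followed by inspecting the hits, adding acceptable sequences to the alignment by hand, and rebuilding the model, which is why the sketch returns the search output rather than updating the alignment automatically.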
References (first 10 of 34 listed)

1.  Interaction of the Bacillus stearothermophilus ribosomal protein S15 with its 5'-translational operator mRNA.

Authors:  L G Scott; J R Williamson
Journal:  J Mol Biol       Date:  2001-11-30       Impact factor: 5.469

2.  Translational feedback regulation of the gene for L35 in Escherichia coli requires binding of ribosomal protein L20 to two sites in its leader mRNA: a possible case of ribosomal RNA-messenger RNA molecular mimicry.

Authors:  Maude Guillier; Frédéric Allemand; Sophie Raibaud; Frédéric Dardel; Mathias Springer; Claude Chiaruttini
Journal:  RNA       Date:  2002-07       Impact factor: 4.942

3.  Phylogeny of Firmicutes with special reference to Mycoplasma (Mollicutes) as inferred from phosphoglycerate kinase amino acid sequence data.

Authors:  Matthias Wolf; Tobias Müller; Thomas Dandekar; J Dennis Pollack
Journal:  Int J Syst Evol Microbiol       Date:  2004-05       Impact factor: 2.747

4.  S4-16 S ribosomal RNA complex. Binding constant measurements and specific recognition of a 460-nucleotide region.

Authors:  J V Vartikar; D E Draper
Journal:  J Mol Biol       Date:  1989-09-20       Impact factor: 5.469

5.  Taxonomic study of aerobic thermophilic bacilli: descriptions of Geobacillus subterraneus gen. nov., sp. nov. and Geobacillus uzenensis sp. nov. from petroleum reservoirs and transfer of Bacillus stearothermophilus, Bacillus thermocatenulatus, Bacillus thermoleovorans, Bacillus kaustophilus, Bacillus thermodenitrificans to Geobacillus as the new combinations G. stearothermophilus, G. th.

Authors:  T N Nazina; T P Tourova; A B Poltaraus; E V Novikova; A A Grigoryan; A E Ivanova; A M Lysenko; V V Petrunyaka; G A Osipov; S S Belyaev; M V Ivanov
Journal:  Int J Syst Evol Microbiol       Date:  2001-03       Impact factor: 2.747

6.  Ribosomal protein S15 represses its own translation via adaptation of an rRNA-like fold within its mRNA.

Authors:  Alexander Serganov; Ann Polonskaia; Bernard Ehresmann; Chantal Ehresmann; Dinshaw J Patel
Journal:  EMBO J       Date:  2003-04-15       Impact factor: 11.598

7.  Localization of the binding site for protein S4 on 16 S ribosomal RNA by chemical and enzymatic probing and primer extension.

Authors:  S Stern; R C Wilson; H F Noller
Journal:  J Mol Biol       Date:  1986-11-05       Impact factor: 5.469

8.  Ribosomal protein S4 acts in trans as a translational repressor to regulate expression of the alpha operon in Escherichia coli.

Authors:  S Jinks-Robertson; M Nomura
Journal:  J Bacteriol       Date:  1982-07       Impact factor: 3.490

9.  Feedback regulation of ribosomal protein gene expression in Escherichia coli: structural homology of ribosomal RNA and ribosomal protein MRNA.

Authors:  M Nomura; J L Yates; D Dean; L E Post
Journal:  Proc Natl Acad Sci U S A       Date:  1980-12       Impact factor: 11.205

10.  Most RNAs regulating ribosomal protein biosynthesis in Escherichia coli are narrowly distributed to Gammaproteobacteria.

Authors:  Yang Fu; Kaila Deiorio-Haggar; Jon Anthony; Michelle M Meyer
Journal:  Nucleic Acids Res       Date:  2013-02-08       Impact factor: 16.971
