Literature DB >> 24310371

Bacterial RNA motif in the 5' UTR of rpsF interacts with an S6:S18 complex.

Yang Fu, Kaila Deiorio-Haggar, Mark W Soo, Michelle M Meyer.   

Abstract

Approximately half the transcripts encoding ribosomal proteins in Escherichia coli include a structured RNA motif that interacts with a specific ribosomal protein to inhibit gene expression, thus allowing stoichiometric production of ribosome components. However, many of these RNA structures are not widely distributed across bacterial phyla. It is increasingly common for RNA motifs associated with ribosomal protein genes to be identified using comparative genomic methods, yet these are rarely experimentally validated. In this work, we characterize one such motif that precedes operons containing rpsF and rpsR, which encode ribosomal proteins S6 and S18. This RNA structure is widely distributed across many phyla of bacteria despite differences within the downstream operon, and examples are present in both E. coli and Bacillus subtilis. We demonstrate a direct interaction between an example of the RNA from B. subtilis and an S6:S18 complex using in vitro binding assays, verify our predicted secondary structure, and identify a putative protein-binding site. The proposed binding site bears a strong resemblance to the S18 binding site within the 16S rRNA, suggesting molecular mimicry. This interaction is a valuable addition to the canon of ribosomal protein mRNA interactions. This work shows how experimental verification translates computational results into concrete knowledge of biological systems.

Entities:  

Keywords:  autogenous regulation; comparative genomics; rRNA mimic; ribosomal protein; ribosome assembly

Mesh:

Substances:

Year:  2013        PMID: 24310371      PMCID: PMC3895269          DOI: 10.1261/rna.041285.113

Source DB:  PubMed          Journal:  RNA        ISSN: 1355-8382            Impact factor:   4.942


INTRODUCTION

In the past few years, increasingly sophisticated comparative genomic approaches have predicted many putative RNA structures within bacterial genomes (Barrick et al. 2004; Weinberg et al. 2007, 2010; Yao et al. 2007; Meyer et al. 2009; Xu et al. 2009). Many of these RNAs have been experimentally characterized as riboswitches (for review, see Serganov and Nudler 2013), or other functional RNAs (Irnov and Winkler 2010). However, most have no experimental validation, and in many cases determining a biological function can be challenging (Meyer et al. 2011). As the volume of sequence data available continues to grow, experimental follow-up is increasingly necessary to translate findings from computational studies into biological knowledge. Several conserved RNA motifs are associated with ribosomal protein (r-protein) operons (Weinberg et al. 2007; Yao et al. 2007; Meyer et al. 2009; Xu et al. 2009). In Escherichia coli, three decades of research have identified a series of mRNA structures that collectively regulate approximately half the transcriptional units encoding r-proteins (Zengel and Lindahl 1994). These mRNA structures allow autogenous regulation, wherein an r-protein interacts with a structured portion of the transcript to negatively regulate transcription or translation (Dean et al. 1981; Zengel and Lindahl 1992). Many of the r-proteins responsible for this regulatory activity are primary ribosomal RNA (rRNA) binding proteins that interact with the rRNA early during ribosome assembly (e.g., S8, S4, S15, L1, L10, L20) (Dean and Nomura 1980). However, autogenous regulatory activity has been reported for tertiary rRNA binding proteins (e.g., S1, S2) (Skouv et al. 1990; Aseev et al. 2008). Several of the mRNA structures form secondary or tertiary structures that appear to mimic the rRNA binding sites of their protein interaction partners. For example, the mRNA structures interacting with L1, L10(L12)4 complex, S8, and L20 each bear striking resemblance to the rRNA binding sites for these proteins (Merianos et al. 2004; Guillier et al. 2005; Nevskaya et al. 2005; Iben and Draper 2008). In other cases there is no clear resemblance between the mRNA and rRNA binding sites (Tang and Draper 1989; Mathy et al. 2004). These regulatory RNAs maintain stoichiometric levels of r-proteins (Sykes et al. 2010) and overexpression of their respective binding-proteins can result in severe growth defects (Dean et al. 1981; Wower et al. 1992). Despite their important role in regulating an essential process, only three of more than ten RNA structures known in E. coli are widely distributed to many bacterial phyla. The majority are narrowly distributed to only a few orders of γ-proteobacteria (Fu et al. 2013). There are a number of cases in which multiple RNA structures have arisen in different bacterial phyla to perform similar functions (Nomura et al. 1980; Grundy and Henkin 1992; Scott and Williamson 2001; Guillier et al. 2002; Serganov et al. 2003; Choonee et al. 2007). In this work, we demonstrate an RNA motif preceding rpsF and rpsR, encoding r-proteins S6 and S18 interacts with an S6:S18 dimer. Both S6 and S18 are secondary rRNA binding proteins that form a heterodimer prior to assembly with the S15-rRNA complex (Held et al. 1974; Recht and Williamson 2001). Although this RNA structure was originally identified in Firmicutes (Yao et al. 2007), our subsequent analysis indicates that it is widely distributed to many bacterial phyla. We verify the secondary structure predicted from comparative genomics using SHAPE (Selective 2′-Hydroxyl Acylation Analyzed by Primer Extension) (Wilkinson et al. 2006) and identify a putative binding site that resembles the S18 rRNA binding site using nuclease protection assays (Ehresmann et al. 1987).

RESULTS

An RNA structure preceding rpsF is present in many bacterial species

A computational search for structured RNAs in Firmicute genomes identified an RNA motif preceding rpsF (Yao et al. 2007), and additional putative RNA structures preceding this gene have been identified by similar searches in other bacterial phyla (Weinberg et al. 2007, 2010). Examination of the motifs obtained from these searches revealed a common structural element that includes a stem–loop with a bulge containing a pair of conserved cytosines. We combined the shared regions of these motifs into a common alignment and subsequently used this to identify additional homologs in completed microbial genomes. We identified more than 1300 sequences matching this RNA motif across many bacterial phyla, including Proteobacteria, Actinobacteria, Cyanobacteria, Spirochaetes, and Firmicutes (alignment in Supplemental File 1; secondary structures from selected examples drawn in Supplemental Fig. 1). A consensus representation of this alignment is in Figure 1A. The double cytosine in the bulge is strikingly conserved, and the H1 stem sequesters a potential Shine-Dalgarno sequence in many examples, suggesting possible regulatory activity. However, the mere proximity of the conserved motif to the translational start site is potentially sufficient to allow regulation of gene expression upon protein binding through steric inhibition of ribosome binding. H2 is variable in length (13–107 nucleotides, typically paired), but the region proximal to the bulge is fairly conserved. Apart from this shared motif, the organization of the operon is not well conserved. The RNA invariably precedes rpsF, but the subsequent genomic context can vary, although it contains both rpsF and rpsR the vast majority of the time (Supplemental Table 1). In most Deltaproteobacteria and some γ-proteobacteria rpsF and rpsR occur together with no intervening gene. However, in β-proteobactera and the remaining γ-proteobacteria they are separated by priB, and in Firmicutes and Actinobacteria they are separated by ssbA. In E. coli the operon contains rplI (encoding L9) (Isono and Kitakawa 1978), which occurs in isolation at a different locus in Firmicute genomes (Akanuma et al. 2012). The only bacteria where we find the RNA without both rpsF and rpsR are some species of Cyanobacteria, where rpsF occurs in isolation in the genome and rpsR is found near an rpmG (encoding L33) homolog.
FIGURE 1.

(A) Consensus secondary structure of the RNA motif preceding rpsF derived from more than 1300 sequences. This RNA motif is widely distributed across many bacterial phyla. (B) Example of the RNA motif from B. subtilis 168. The transcript start site, an alternative pairing element, and potential regulatory features are indicated. (C) Consensus secondary structure for the more than 500 RNA sequences primarily identified from Firmicutes and Actinobacteria that contain a potential conserved H3 helix. (D) rRNA binding site for the S6:S18 complex. Sequence is derived from B. subtilis, black nucleotides are conserved >90%, and red nucleotides conserved >98% across 4214 bacterial 16S rRNA sequences (Cannone et al. 2002). Bases in contact with S18 in T. thermophilus structure are boxed (Agalarov et al. 2000).

(A) Consensus secondary structure of the RNA motif preceding rpsF derived from more than 1300 sequences. This RNA motif is widely distributed across many bacterial phyla. (B) Example of the RNA motif from B. subtilis 168. The transcript start site, an alternative pairing element, and potential regulatory features are indicated. (C) Consensus secondary structure for the more than 500 RNA sequences primarily identified from Firmicutes and Actinobacteria that contain a potential conserved H3 helix. (D) rRNA binding site for the S6:S18 complex. Sequence is derived from B. subtilis, black nucleotides are conserved >90%, and red nucleotides conserved >98% across 4214 bacterial 16S rRNA sequences (Cannone et al. 2002). Bases in contact with S18 in T. thermophilus structure are boxed (Agalarov et al. 2000).

rpsF, ssbA, and rpsR are cotranscribed from S6 operon in Bacillus subtilis

Because the conserved RNA motif was originally identified in Firmicutes, we chose to study an example of the RNA occurring in the model gram-positive bacterium Bacillus subtilis. To ensure that the RNA motif is cotranscribed with both the rpsF and rpsR genes, we performed 5′-RLM RACE using primers in the S6 coding region. From this analysis, we identified a transcription start site that completely contains the predicted RNA structure (Fig. 1B) and is preceded in B. subtilis by a canonical promoter sequence (Supplemental Fig. 2). We also performed 3′-RACE to confirm the 3′ end of the transcript, which lies 22 nucleotides after the stop codon for rpsR. Identification of the transcription start site reveals a potential alternative structure (H3) in the B. subtilis transcript that may disrupt the Shine-Dalgarno sequestration in H1 (Fig. 1B). Although this alternative pairing is not universally conserved throughout our alignment, we identified it in many Firmicute and Actinobacteria species (Fig. 1C).
FIGURE 2.

(A) Gel-shift of rpsF_5-69 with increasing concentration of S18 (0 nM, 19–600 nM). (B) Gel-shift of rpsF_5-69 with increasing concentration of S6 (0 nM, 9.7–600 nM). (C) Gel-shift of rpsF_5-69 with increasing S6 and S18 (mixed 1:1, 0 nM, 9.7–600 nM). Arrows indicate quantified bands. (D) Quantification of the fraction of rpsF_5-69 bound by the protein with S6, S18, and S6:S18. Error bars represent standard deviation on three to six independent experiments. Some error bars fall within the boundaries of the marker. Curve represents a standard binding curve with a KD of 155 nM, and a Hill coefficient of 4.

(A) Gel-shift of rpsF_5-69 with increasing concentration of S18 (0 nM, 19–600 nM). (B) Gel-shift of rpsF_5-69 with increasing concentration of S6 (0 nM, 9.7–600 nM). (C) Gel-shift of rpsF_5-69 with increasing S6 and S18 (mixed 1:1, 0 nM, 9.7–600 nM). Arrows indicate quantified bands. (D) Quantification of the fraction of rpsF_5-69 bound by the protein with S6, S18, and S6:S18. Error bars represent standard deviation on three to six independent experiments. Some error bars fall within the boundaries of the marker. Curve represents a standard binding curve with a KD of 155 nM, and a Hill coefficient of 4.

Predicted RNA structure interacts with S6:S18 protein dimer in vitro

Based on the genomic organization of the operon, we hypothesized that either r-protein S6 or S18 was likely to interact with the RNA structure. In addition, previous studies of ribosome assembly indicate that S18 is insoluble and unstructured prior to interaction with S6 (Held et al. 1974) and that S6 and S18 dimerize prior to association with the 16S rRNA-S15 complex (Recht and Williamson 2001), leading us to include the S6:S18 dimer as a potential binding partner. To determine the binding partner of the B. subtilis RNA structure, we conducted electrophoretic mobility shift assays (EMSA) using S6, S18, and the S6:S18 dimer (S6 and S18 mixed at a 1:1 ratio). Due to difficulties overexpressing B. subtilis S18 in E. coli, we utilized S6 and S18 proteins from a close thermophilic relative, Geobacillus kaustophilus. The RNA motifs from G. kaustophilus and B. subtilis are identical with the exception of minor differences in the sequence and length of the variable stem H2 (see Supplemental Fig. 1 for the G. kaustophilus RNA sequence and predicted secondary structure). In addition, the proteins share 75% (S6) and 89% (S18) sequence identity, and the vast majority of residues making contacts to the rRNA are conserved between B. subtilis and G. kaustophilus (see Supplemental Fig. 3). Although species-specific interactions between mRNA regulatory RNAs and ribosomal proteins are reported (Scott and Williamson 2005), these typically occur when the regulatory RNAs are nonhomologous. In this case the RNAs are homologous; thus, this substitution is unlikely to negatively impact potential binding interactions. The RNA construct derived from B. subtilis used for most of our in vitro assays (rpsF_5-69) was truncated at the 5′ end. However, similar results were observed for a construct beginning at the native transcription start site (rpsF_1-69). Both of these constructs encompass the entire conserved RNA structure and continue through the translational start site and are thus likely to include any relevant features required for protein-binding.
FIGURE 3.

(A) Gel-shift of rpsF_5-69 with 500 nM S18 and increasing concentration of S6 (no protein, 0 nM S6, 4.6–500 nM S6). (B) Gel-shift of rpsF_5-69 with 600 nM S6 and increasing concentration of S18 (0 nM, 18.5–600 nM S18, no protein). Arrows on A and B indicate quantified bands. (C) Quantification of the fraction of rpsF_5-69 bound by the S6:S18 complex. Solid thick line corresponds to the titration of S6:S18 shown in Figure 2D (KD of 155 nM, Hill coefficient of 4, maximum fraction bound 100%). Dashed line is the curve fit of the S6 titration (KD of 37 nM, Hill coefficient 1.45, maximum fraction bound 93.5%). Thin line is the curve fit for the S18 titration (KD of 60 nM, Hill coefficient 1.5, maximum fraction bound of 100%).

(A) Gel-shift of rpsF_5-69 with 500 nM S18 and increasing concentration of S6 (no protein, 0 nM S6, 4.6–500 nM S6). (B) Gel-shift of rpsF_5-69 with 600 nM S6 and increasing concentration of S18 (0 nM, 18.5–600 nM S18, no protein). Arrows on A and B indicate quantified bands. (C) Quantification of the fraction of rpsF_5-69 bound by the S6:S18 complex. Solid thick line corresponds to the titration of S6:S18 shown in Figure 2D (KD of 155 nM, Hill coefficient of 4, maximum fraction bound 100%). Dashed line is the curve fit of the S6 titration (KD of 37 nM, Hill coefficient 1.45, maximum fraction bound 93.5%). Thin line is the curve fit for the S18 titration (KD of 60 nM, Hill coefficient 1.5, maximum fraction bound of 100%). We observed a specific RNA-protein interaction between the rpsF_5-69 RNA and the S6:S18 dimer. S18 alone binds to rpsF_5-69 RNA at high protein concentrations (KD > 500 nM), and S6 does not bind at all (Fig. 2A–C). The predicted RNA structure has nM affinity for the protein dimer (155 ± 17 nM) (Fig. 2D), and we observe a high Hill coefficient (4.1 ± 0.5) for this interaction. Several different experimental artifacts can potentially cause the exceedingly steep binding curve observed. First, many r-proteins are “sticky” and can be absorbed onto surfaces of tubes, resulting in no observed binding until the surfaces are saturated. Second, due to the insolubility of S18, aggregation of S18 or the S6:S18 dimer could be occurring. Finally, the proportion of correctly folded heterodimer in solution could be limiting the extent to which RNA binding occurs. Our binding buffer contains 0.8% BSA, which should largely alleviate protein absorption onto tube surfaces. To examine the potential for aggregation, we repeated our experiments in the presence of detergent (0.02% IGEPAL CA630) (Recht and Williamson 2001) and found no significant change in either the KD or the Hill coefficient. To further assess the causes of our steep binding curve, we also individually titrated S6 and S18, where the other component of the heterodimer was held constant at high concentration (Fig. 3A–C). Under these conditions, the majority of the titrated protein should form the heterodimer allowing for more accurate assessment of heterodimer:RNA interactions. These experiments show that RNA binding can occur at approximately two- to threefold lower concentrations (KD for S6 titration [S18 at 500 nM] is 37 ± 7 nM, KD for S18 titration [S6 at 600 nM] is 60 ± 3 nM). As anticipated, the steep transition previously observed is largely alleviated under these conditions (Hill coefficients are 1.45 ± 0.40 and 1.5 ± 0.23 for S6 and S18 titrations, respectively). These experiments suggest that the steep transition observed for the S6:S18 titration is most likely the result of inadequate dimer formation when S6 and S18 are both titrated rather than aggregation or absorption of the proteins onto tube surfaces. To ensure the specificity of the observed interaction, we constructed several mutated RNAs. A series of mutations that disrupted various portions of the secondary structure (M1–M3) and the conserved pair of cytosines (M4) were analyzed with the S6:S18 protein dimer using EMSAs (Fig. 4A). The RNA–protein interactions were either abolished (M2) or significantly impaired by these mutations (M1, M3, M4) (KD> 500 nM). To further confirm our secondary structure predictions, we introduced compensatory mutations (M5 and M6). The M6 compensatory mutation to the H1 stem fully restores protein binding (Fig. 4B) (KD = 125, nM ± 13), but the M5 mutation within helix H2 does not (KD> 500 nM). This indicates that either the primary sequence changes have altered the global structure in some unanticipated way, or the primary sequence in this area is important for a productive interaction.
FIGURE 4.

(A) The RNA construct rpsF_5-69 used for in vitro binding assays with mutations indicated. (B) Interactions of S6:S18 with mutants displayed in A. Solid black curve represents the wild-type interaction from Figure 2D. A binding curve is plotted for M6 (KD = 115 nM, Hill coefficient 3). For the remaining mutants RNA binding was not saturated, indicating a KD > 500 nM and no curves are drawn.

(A) The RNA construct rpsF_5-69 used for in vitro binding assays with mutations indicated. (B) Interactions of S6:S18 with mutants displayed in A. Solid black curve represents the wild-type interaction from Figure 2D. A binding curve is plotted for M6 (KD = 115 nM, Hill coefficient 3). For the remaining mutants RNA binding was not saturated, indicating a KD > 500 nM and no curves are drawn.

Structural probing confirms predicted secondary structure

To confirm our secondary structure predicted from comparative genomics, we conducted SHAPE (Wilkinson et al. 2006) in the presence and absence of the S6:S18 dimer. For this experiment, an additional hairpin was appended to the 3′ end of the rpsF_1-69 RNA to allow for primer binding. Limitations in resolving the 5′ end of RNA on the SHAPE gel (Supplemental Fig. 4; quantified nucleotide reactivity in Supplemental Table 2) prevented quantification of reverse transcription for nucleotides 1–6, and a strong pause site at the base of the H2 stem prevents quantitative analysis of nucleotides 35–39. From analysis of the SHAPE data we find that the 3′ end of the RNA is largely flexible in both the protein-bound and protein-unbound states. The protein-bound and -unbound states show moderately similar, but not identical, patterns suggesting that there are changes in flexibility; but there do not appear to be any large whole-scale changes to the secondary structure between the two states (Fig. 5A,B). Reactive SHAPE nucleotides are predominantly located in the predicted loop regions of H1 and H2, within the conserved bulge, and at the 3′ termini of the RNA that is not predicted to be structured. Somewhat surprisingly, we also consistently observe significant flexibility in the 3′ portion of the predicted H1 stem. However, this may partially be due to over estimation of cleavage at the 3′ end of the molecule due to the signal decay that occurs during primer extension.
FIGURE 5.

(A) Highly reactive (black circles) and moderately reactive (gray circles) nucleotides determined from SHAPE analysis of RNA in isolation mapped to the rpsF_1-69 structure. Positions not resolved are in gray. (B) Reactive nucleotides in SHAPE analysis of RNA bound to the S6:S18 complex. (C) Nuclease probing data mapped to the secondary structure of the rpsF_5-69 RNA structure. Starred bases are cleaved by V1 nuclease and arrows indicate bases cleaved by RNase A. Bases protected from cleavage by the addition of protein are circled. Bases not resolved on the gel are in gray. Numbering starts from the transcription start site (A). (D,E) Nuclease probing gels of rpsF_5-69 (D) and rpsF_5-69 M4 (E) using RNase A (left), which cleaves single-stranded uridine and cytosine, and RNase V1 (right), which cleaves double-stranded RNA. RNA was incubated with increasing concentrations of S6 and S18 mixed at a 1:1 ratio. (F) V1 nuclease probing gel of rpsF_5-69 in the presence of S18 alone. OH− and T1 lanes indicate partial alkaline hydrolysis and RNase T1 digest under denaturing conditions used to map the RNA sequence. On the right, bases cleaved by RNase V1 are indicated. On the left, nucleotides cleaved by RNase T1 (G) and RNase A (C or U, bold) are indicated.

(A) Highly reactive (black circles) and moderately reactive (gray circles) nucleotides determined from SHAPE analysis of RNA in isolation mapped to the rpsF_1-69 structure. Positions not resolved are in gray. (B) Reactive nucleotides in SHAPE analysis of RNA bound to the S6:S18 complex. (C) Nuclease probing data mapped to the secondary structure of the rpsF_5-69 RNA structure. Starred bases are cleaved by V1 nuclease and arrows indicate bases cleaved by RNase A. Bases protected from cleavage by the addition of protein are circled. Bases not resolved on the gel are in gray. Numbering starts from the transcription start site (A). (D,E) Nuclease probing gels of rpsF_5-69 (D) and rpsF_5-69 M4 (E) using RNase A (left), which cleaves single-stranded uridine and cytosine, and RNase V1 (right), which cleaves double-stranded RNA. RNA was incubated with increasing concentrations of S6 and S18 mixed at a 1:1 ratio. (F) V1 nuclease probing gel of rpsF_5-69 in the presence of S18 alone. OH− and T1 lanes indicate partial alkaline hydrolysis and RNase T1 digest under denaturing conditions used to map the RNA sequence. On the right, bases cleaved by RNase V1 are indicated. On the left, nucleotides cleaved by RNase T1 (G) and RNase A (C or U, bold) are indicated. To further assess the secondary structure, we also conducted nuclease probing with RNase V1 and RNase A (Fig. 5C). RNase V1 nonspecifically cleaves double-stranded RNA, and significant cleavage is observed for bases predicted to be paired in H2 (28–29 and 35–38) and for the bases resolved on the gel that are predicted to be paired in H1 (52–56, 20–25) (Fig. 5C,E). RNase A cleaves single-stranded C and U, and we can see that U32, C39, U43, and C48 are cleaved, in accordance with the SHAPE data, suggesting that these regions are flexible (Fig. 5A,B). Although we predict that the conserved bulge is not double-stranded, the RNase A cleavage patterns suggest that portions of this bulge have potential tertiary structure. In particular, we do not observe cleavage at several positions (e.g., C41, U46 and C47) predicted to be single-stranded within this bulge.

Nuclease protection analysis suggests a protein-binding region

To assess which portions of the RNA are necessary for interaction with the S6:S18 dimer, we performed nuclease protection assays using RNase V1 and RNase A with increasing concentrations of the S6:S18 dimer (Fig. 5C–E). Although such experiments can only approximate sites of protein binding, they do give insight into general regions of the RNA that are impacted by protein binding. In the presence of the S6:S18 dimer, several regions of the rpsF_5-69 RNA are protected from RNase cleavage, indicating that the protein dimer most likely has some contact with the bulge and portions of H1 and H2 proximal to the bulge (Fig. 5C,D). Upon increasing protein concentrations, significant RNAse V1 protection is seen for bases 28–29 and 35–38 in the H2 stem. This direct interaction between the protein complex and this portion of H2 suggests that our mutations to H2 (M3 and M5) likely disrupt not only secondary structure, but also potential sequence specific contacts. RNase V1 protection was also observed for H1 bases 52–53, which includes a putative Shine-Dalgarno sequence. From the RNase A protection, it is apparent that U43 and C48 within the bulge are significantly protected in the presence of the S6:S18 dimer. This protection within the bulge further supports our hypothesis that this region is important for protein binding. The loop region of H2 is unaffected by the presence of protein. This finding is in accordance with the multiple sequence alignment, which indicates that H2 can vary significantly in length. Thus beyond some minimum length, H2 is not likely to be important for binding. Despite an overall very similar cleavage pattern in the absence of the S6:S18 dimer, no protection was observed with the M4 mutant RNA. This indicates that changes in nuclease cleavage are attributable to specific protein–RNA interactions (Fig. 5D). We also conducted nuclease protection assays with just S18 rather than the S6:S18 complex to further examine the potential S18-mRNA interaction observed at high S18 concentrations during gel-shift assays. In these experiments, we did not observe any specific changes to protection upon increasing protein concentration up to 600 nM. This suggests that the results from the gel-shift assays, indicating that S18 alone can interact with the RNA, may reflect nonspecific or weak interactions that cannot prevent nuclease cleavage.

DISCUSSION

In this work, we show that a putative RNA structure preceding the B. subtilis operon containing rpsF and rpsR specifically interacts with an S6:S18 dimer. Although weak interactions with S18 are detected, it is likely that the biologically active molecule is the S6:S18 dimer. S18 is insoluble and only partially folded in the absence of S6 (Held et al. 1974), and the previously reported dissociation constant for this interaction is significantly below that we observe for S6:S18 interaction with the RNA structure. Furthermore, S18 interactions alone are not sufficient to prevent nuclease cleavage, suggesting that the potential interaction observed is either not specific, or very weak. Like most nonribosomal RNA binding sites for r-proteins (Zengel and Lindahl 1994), the dissociation constant for the rpsF_5-69 and S6:S18 dimer complex is significantly higher than that reported for the S6:S18 dimer with the 16S rRNA-S15 complex (A. aeolicus ∼6 nM under equilibrium conditions) (Recht and Williamson 2001). We do observe an exceedingly steep transition between the unbound and protein-bound states in our gel-shift assays. We conclude that this steep transition is most likely a result of inadequate or inefficient heterodimer formation. Previous studies of Aquifex aeolicus S6:S18 dimer formation and interactions with the assembling ribosome (S15-rRNA complex) report a dissociation constant for the S6:S18 heterodimer of 8 nM and a dissociation constant between this complex and the S15-RNA complex of 6 nM. The proximity of these two values results in a similar steep transition curve measured under equilibrium conditions due to difficulties maintaining appropriate concentrations of the heterodimer at low protein concentrations. Our protein concentrations are significantly higher, but the overall effect is similar, suggesting that either the dissociation constant between G. kaustophilus S6 and S18 is somewhat higher than that reported for A. aeolicus, or more likely that we have poor reconstitution of the dimer protein in our assays due to inadequate refolding of S18. One characteristic feature of the S6:S18 interacting RNA structure is the conserved pair of cytosines present in the bulge (C47 and C48). Our mutagenesis results (M4) indicate that the cytosines are important for binding, and although coarse-grained, our nuclease probing assays indicate that C48 is significantly protected from cleavage by its interaction with the S6:S18 dimer, suggesting that this region is proximal to the protein binding region. S18 makes extensive contacts with a region of the 16S rRNA that also contains a pair of conserved cytosines (C719 and C720) located in the three-helix junction between rRNA helices 22, 23a, and 23b (Fig. 1D; Agalarov et al. 2000). Furthermore, these cytosines are also conserved in >90% of bacterial 16S rRNA sequences (Fig. 1D). Based on our observed interaction between S18 and the rpsF_5-69 structure and the similarities between the rRNA and mRNA primary sequences, we speculate the mRNA structure we observe is mimicking a portion of the rRNA binding site for S18. The interaction of S6:S18 with its mRNA and rRNA binding sites is reminiscent of the interaction of the L10(L12)4 complex with its own mRNA and rRNA binding sites. In both cases, the protein making the majority of contacts to the RNA is largely insoluble (S18 or L10) (Iben and Draper 2008), but may bind the mRNA independently of its protein partner (S6 or L12) (Brot et al. 1980). However, the protein complex, S6:S18 or L10(L12)4, has an improved interaction with the mRNA due to increased solubility and folding of the dominant protein partner (S18 or L10). The RNA we have characterized from B. subtilis joins the collection of RNA structures that interact with r-proteins. However, its wide distribution across many bacterial phyla is the exception rather than the rule for such RNAs. Of the r-protein interacting RNA structures known in E. coli, only those interacting with L1, L10, and S2 show a similar broad distribution to many bacterial phyla (Fu et al. 2013). Like the example here, the RNAs interacting with both L1 and L10 are believed to act through mimicry of the rRNA binding sites (Nevskaya et al. 2005; Iben and Draper 2008). Although there are narrowly distributed r-protein mRNA binding sites that are obvious mimics of the rRNA (e.g., that interacting S8) (Merianos et al. 2004), most r-protein binding motifs showing obvious mimicry are broadly distributed.

MATERIALS AND METHODS

Homology searches

Covariance models for each RNA were constructed and calibrated using Infernal 1.1 (cmbuild, cmcalibrate), and homologs identified for each RNA (cmsearch) (Nawrocki and Eddy 2013). Cmsearch was performed against a custom sequence database as described previously (Deiorio-Haggar et al. 2013) using a lenient E-value cutoff. Potential homologs were assessed on the basis of genomic context, using a custom visualization tool (GenomeChart) (Miller et al. 2013) and for fit to the existing alignment. Alignments were manually adjusted as necessary when sequences with variable-length helices and/or loops were added. The search process was repeated approximately six to seven times, to expand sequence diversity. Phylogenetic diversity was determined from the number of completed genomes within refseq58 based on the final alignments utilizing queries to our custom database. Consensus secondary structure diagrams were created from the alignments using GSC-weighting in R2R (Weinberg and Breaker 2011).

RACE

Bacillus subtilis total RNA was extracted from a log phase culture and 5′ RACE performed using the Invitrogen GeneRacer kit. First strands were synthesized with a gene-specific primer (rpsF, 5′-ACCCCAATCCTTTGTTCCAGTG; rpsR, 5′-GATCTACATCTTTGTAGTCGATGTGC). PCRs were conducted using the first strand cDNA as the template and a second gene-specific primer (rpsF, 5′-GACTCTTCGTCAATGTTTGGGCGG; rpsR, 5′-CGCACGACCGCCTCTGCGTCCTC). 3′ RACE began with adenylation of extracted total RNA followed by reverse transcription with a dT primer (5′-GCGGTCACGCTTACTTAGCCCTCACTGAAATTTTTTTTTTTTTTTTT) to generate the cDNA. PCR was conducted using this cDNA as the template, a gene-specific primer (5′-GATCAAACGCGCACGTCAAATGG) and an Anchor primer (5′-GCGGTCACGCTTACTTAGCCCTCACTGAA). All PCR products were cloned using Invitrogen TOPO-cloning kit and sequenced to identify the transcription initiation site and termination site of the S6 operon and the cotranscription of related genes.

Preparation of RNA transcripts

The 65-nucleotide DNA fragment preceding rpsF was PCR amplified from Bacillus subtilis genomic DNA with a T7 promoter and was transcribed into mRNA in vitro with T7 RNA polymerase (Milligan et al. 1987). Purified mRNA was 5′-labeled with γ32P-ATP and purified again on a denaturing acrylamide gel. Transcripts carrying mutations were produced in the same way from a DNA template carrying mutations that had been introduced through QuickChange mutagenesis.

Preparation of Geobacillus kaustophilus S6 and S18 proteins

The genes coding for Geobacillus kaustophilus S6 and S18 were cloned into pET-HT (Block et al. 2011). Clones were transformed into BL21(DE3) cells and proteins were over-expressed from the T7 promoter. After over-expression, proteins were extracted and purified similarly to that previously described (Culver and Noller 1999). Cells were lysed in Buffer E (20 mM HEPES [pH 7.6], 20 mM KCl, and 6 mM BME). S6 was soluble and was purified with Buffer C (20 mM NaOAc [pH 5.0], 20 mM KCl, and 6 mM BME) at 4°C by nondenaturing FPLC cation exchange chromatography with a linear gradient (20 mM–1 M KCl). S18 was recovered from inclusion bodies with Buffer B (20 mM Tris-HCl [pH 7.6], 20 mM KCl, 6 M urea, and 6 mM BME) and was purified at 4°C by denaturing FPLC cation exchange chromatography with a linear gradient (20 mM–1 M KCl). Protein-containing fractions were examined by SDS-PAGE, collected, and dialyzed against Buffer D (S18 with 4 M urea). After dialysis proteins were examined by SDS-PAGE again, the concentration was determined by Bradford assays. Proteins were stored in buffer D (S18 with 4 M urea) at 4°C.

Electrophoretic mobility shift assays

Protein–RNA interaction was examined by electrophoretic mobility shift assays (EMSA). A fixed amount of radio-labeled RNA (1000 cpm, ∼1 nM) was incubated with protein in serial dilutions (0–600 nM) in a total volume of 10 μL for 30 min at 25°C (2× binding buffer: 100 mM Tris-HCl pH7.6, 40 mM MgCl2, 300 mM KCl, 2 mM BME, 0.08% BSA). Five microliters 50% glycerol was added to each reaction before loading onto a native 10% acrylamide gel which had been prerun for 60 min at 160V under 25°C. The loaded gel was run for 4 h at 350V at 4°C and subsequently dried and imaged using GE Healthcare STORM 820 phosphorimager and bands quantified using ImageQuant. Quantified bands were fit to a standard binding curve of the form (Fmax * y)/(x+KD) where n is the Hill coefficient, Fmax is the maximum fraction bound, x is the concentration of protein in the reaction, and y is the measured fraction bound, using the Solver function in Microsoft Excel to minimize the summed residuals subject to the constraint Fmax ≤ 1.0. KD and Hill coefficient values are reported ± the standard deviation of three or more data sets. For individual component titrations, 600 nM S6 was incubated with a S18 serial dilution series (0 nM, 18.75–600 nM, twofold dilutions) in 1× binding buffer for 10 min before a fixed amount of radio-labeled RNA (rpsF_5-69, 1000 cpm, ∼1 nM) was added. The reactions were incubated at 25°C for an additional 20 min. RNA–protein interaction was examined by EMSA as described above. 500 nM S18 was incubated with a dilution series of S6 (0 nM, 4.6–300 nM in twofold serial dilutions, and 500 nM). Titrations with S18 at 600 nM displayed significant aggregation and did not yield interpretable results.

SHAPE (selective 2′-hydroxyl acylation analyzed by primer extension) analysis

RNA composed of the predicted RNA structure (rpsF_1-69) with a 3′ primer binding site appended (5′-TCGATCCGGTTCGCCGGATCCAAATCGGGCTTCGGTCCGGTTC) was prepared as described above using T7 transcription (Milligan et al. 1987). To form acylated products, 3 μL NMIA (130 mM) was added into RNA–protein binding reactions (with and without protein) as described above and incubated for 15 min at 25°C. Modified RNAs were purified by phenol-chloroform extraction followed by ethanol precipitation and redissolved in 10 μL TE buffer (0.5×). Primer extension was performed by incubating the redissolved RNA with a 5′ radiolabeled primer (5′-GAACCGGACCGAAGCCCG) for 30 min at 55°C in the presence of Superscript III (Invitrogen). Reverse transcription was stopped by addition of 1 μL NaOH (4 M) and 29 μL acid stop mix (Wilkinson et al. 2006). cDNAs were loaded on a 10% denaturing Acrylamide/Bis gel and run at 55W at 25°C. The gel was dried and examined using a GE Healthcare STORM phosphorimager and SAFA footprinting software (Das et al. 2005; Laederach et al. 2008). Data was normalized as described (Wilkinson et al. 2006; Deigan et al. 2009).

Nuclease protection analysis

For S6:S18 nuclease protection assays, the RNA–protein binding reactions identical to those used for EMSA assays above (50 μL reaction volume with S6:S18 dimer concentrations of 0, 50, 100, 300, and 600 nM) were assembled. For S18 nuclease protection assays binding reactions with S18 concentrations of 0, 25, 50, 100, 300, and 600 nM were similarly assembled. After protein-RNA incubation, each reaction was treated with 1 μL RNase A and V1 (purchased from Ambion: 20 pg and 0.002 units, respectively) for 10 min and 5 min, respectively, at 25°C. The nuclease was inactivated with inactivation/precipitation buffer (Ambion), and RNA fragments were recovered by ethanol precipitation. Precipitated RNAs were resuspended in 10 μL water and 10 μL urea loading solution (10 M urea, 1.5 mM EDTA). Ten microliters of each reaction was loaded on a 12% denaturing Acrylamide/Bis gel and run at 35W at 25°C. Partial alkaline hydrolysis reactions were generated by incubating RNA in 50 mM Na2CO3 pH 9.0, 1 mM EDTA for 10 min at 95°C. Denaturing T1 reaction was conducted by incubating 1U of RNAse T1 (Roche) in 25 mM Na Citrate pH 5.0, 5.5 M urea for 15 min at 55°C. The gel was dried and examined using a GE Healthcare STORM 820 phosphorimager and ImageQuant.

SUPPLEMENTAL MATERIAL

Supplemental material is available for this article.
  50 in total

1.  Ribosomal protein S1 of Escherichia coli is the effector for the regulation of its own synthesis.

Authors:  J Skouv; J Schnier; M D Rasmussen; A R Subramanian; S Pedersen
Journal:  J Biol Chem       Date:  1990-10-05       Impact factor: 5.157

2.  Unusual mRNA pseudoknot structure is recognized by a protein translational repressor.

Authors:  C K Tang; D E Draper
Journal:  Cell       Date:  1989-05-19       Impact factor: 41.582

Review 3.  Probing the structure of RNAs in solution.

Authors:  C Ehresmann; F Baudin; M Mougel; P Romby; J P Ebel; B Ehresmann
Journal:  Nucleic Acids Res       Date:  1987-11-25       Impact factor: 16.971

4.  Assembly mapping of 30 S ribosomal proteins from Escherichia coli. Further studies.

Authors:  W A Held; B Ballou; S Mizushima; M Nomura
Journal:  J Biol Chem       Date:  1974-05-25       Impact factor: 5.157

5.  Feedback regulation of ribosomal protein gene expression in Escherichia coli.

Authors:  D Dean; M Nomura
Journal:  Proc Natl Acad Sci U S A       Date:  1980-06       Impact factor: 11.205

6.  Escherichia coli ribosomal protein S8 feedback regulates part of spc operon.

Authors:  D Dean; J L Yates; M Nomura
Journal:  Nature       Date:  1981-01-01       Impact factor: 49.962

7.  Feedback regulation of ribosomal protein gene expression in Escherichia coli: structural homology of ribosomal RNA and ribosomal protein MRNA.

Authors:  M Nomura; J L Yates; D Dean; L E Post
Journal:  Proc Natl Acad Sci U S A       Date:  1980-12       Impact factor: 11.205

8.  Autogenous control of Escherichia coli ribosomal protein L10 synthesis in vitro.

Authors:  N Brot; P Caldwell; H Weissbach
Journal:  Proc Natl Acad Sci U S A       Date:  1980-05       Impact factor: 11.205

9.  Oligoribonucleotide synthesis using T7 RNA polymerase and synthetic DNA templates.

Authors:  J F Milligan; D R Groebe; G W Witherell; O C Uhlenbeck
Journal:  Nucleic Acids Res       Date:  1987-11-11       Impact factor: 16.971

10.  Infernal 1.1: 100-fold faster RNA homology searches.

Authors:  Eric P Nawrocki; Sean R Eddy
Journal:  Bioinformatics       Date:  2013-09-04       Impact factor: 6.937

View more
  12 in total

1.  Multiplex Aptamer Discovery through Apta-Seq and Its Application to ATP Aptamers Derived from Human-Genomic SELEX.

Authors:  Michael M Abdelsayed; Bao T Ho; Michael M K Vu; Julio Polanco; Robert C Spitale; Andrej Lupták
Journal:  ACS Chem Biol       Date:  2017-07-11       Impact factor: 5.100

2.  Pervasive Regulatory Functions of mRNA Structure Revealed by High-Resolution SHAPE Probing.

Authors:  Anthony M Mustoe; Steven Busan; Greggory M Rice; Christine E Hajdin; Brant K Peterson; Vera M Ruda; Neil Kubica; Razvan Nutiu; Jeremy L Baryza; Kevin M Weeks
Journal:  Cell       Date:  2018-03-15       Impact factor: 41.582

3.  Regulation of Ribosomal Protein Operons rplM-rpsI, rpmB-rpmG, and rplU-rpmA at the Transcriptional and Translational Levels.

Authors:  Leonid V Aseev; Ludmila S Koledinskaya; Irina V Boni
Journal:  J Bacteriol       Date:  2016-08-25       Impact factor: 3.490

4.  Ribosomal protein L10(L12)4 autoregulates expression of the Bacillus subtilis rplJL operon by a transcription attenuation mechanism.

Authors:  Helen Yakhnin; Alexander V Yakhnin; Paul Babitzke
Journal:  Nucleic Acids Res       Date:  2015-06-22       Impact factor: 16.971

5.  Regulation of the rplY gene encoding 5S rRNA binding protein L25 in Escherichia coli and related bacteria.

Authors:  Leonid V Aseev; Natalia S Bylinkina; Irina V Boni
Journal:  RNA       Date:  2015-03-06       Impact factor: 4.942

6.  Autogenous regulation in vivo of the rpmE gene encoding ribosomal protein L31 (bL31), a key component of the protein-protein intersubunit bridge B1b.

Authors:  Leonid V Aseev; Ludmila S Koledinskaya; Irina V Boni
Journal:  RNA       Date:  2020-03-24       Impact factor: 4.942

7.  Fitness advantages conferred by the L20-interacting RNA cis-regulator of ribosomal protein synthesis in Bacillus subtilis.

Authors:  Arianne M Babina; Darren J Parker; Gene-Wei Li; Michelle M Meyer
Journal:  RNA       Date:  2018-06-20       Impact factor: 4.942

8.  Comparative Metatranscriptomics of Periodontitis Supports a Common Polymicrobial Shift in Metabolic Function and Identifies Novel Putative Disease-Associated ncRNAs.

Authors:  Nikhil Ram-Mohan; Michelle M Meyer
Journal:  Front Microbiol       Date:  2020-04-09       Impact factor: 5.640

9.  An S6:S18 complex inhibits translation of E. coli rpsF.

Authors:  Arianne M Babina; Mark W Soo; Yang Fu; Michelle M Meyer
Journal:  RNA       Date:  2015-10-07       Impact factor: 4.942

10.  Transcriptome-Wide Cleavage Site Mapping on Cellular mRNAs Reveals Features Underlying Sequence-Specific Cleavage by the Viral Ribonuclease SOX.

Authors:  Marta Maria Gaglia; Chris H Rycroft; Britt A Glaunsinger
Journal:  PLoS Pathog       Date:  2015-12-08       Impact factor: 6.823

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.