Christopher P Jones1, Adrian R Ferré-D'Amaré1. 1. Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, USA.
Abstract
The bacterial alarmone 5-aminoimidazole-4-carboxamide riboside 5'-triphosphate (AICAR triphosphate or ZTP), derived from the monophosphorylated purine precursor ZMP, accumulates during folate starvation. ZTP regulates genes involved in purine and folate metabolism through a cognate riboswitch. The linker connecting this riboswitch's two subdomains varies in length by over 100 nucleotides. We report the cocrystal structure of the Fusobacterium ulcerans riboswitch bound to ZMP, which spans the two subdomains whose interface also comprises a pseudoknot and ribose zipper. The riboswitch recognizes the carboxamide oxygen of ZMP through an unprecedented inner-sphere coordination with a Mg(2+) ion. We show that the affinity of the riboswitch for ZMP is modulated by the linker length. Notably, ZMP can simultaneously bind to the two subdomains even when they are synthesized as separate RNAs. The ZTP riboswitch demonstrates how specific small-molecule binding can drive association of distant noncoding-RNA domains to regulate gene expression.
The bacterial alarmone5-aminoimidazole-4-carboxamide riboside 5'-triphosphate (AICAR triphosphate or ZTP), derived from the monophosphorylated purine precursor ZMP, accumulates during folate starvation. ZTP regulates genes involved in purine and folate metabolism through a cognate riboswitch. The linker connecting this riboswitch's two subdomains varies in length by over 100 nucleotides. We report the cocrystal structure of the Fusobacterium ulcerans riboswitch bound to ZMP, which spans the two subdomains whose interface also comprises a pseudoknot and ribose zipper. The riboswitch recognizes the carboxamideoxygen of ZMP through an unprecedented inner-sphere coordination with a Mg(2+) ion. We show that the affinity of the riboswitch for ZMP is modulated by the linker length. Notably, ZMP can simultaneously bind to the two subdomains even when they are synthesized as separate RNAs. The ZTP riboswitch demonstrates how specific small-molecule binding can drive association of distant noncoding-RNA domains to regulate gene expression.
Purine biosynthesis is an ancient metabolic pathway common to most organisms. Its final steps are the conversion of 5-amino-4-imidazolecarboxamide riboside 5'-monophosphate (AICAR or ZMP) into 5-formyl-AICAR (FAICAR), which is converted into inosine monophosphate, a precursor for AMP and GMP (reviewed in ref. 1). The bifunctional enzyme AICAR transformylase-IMP cyclohydrolase (PurHJ) catalyzes the formation of FAICAR from AICAR, using 10-formyl-tetrahydrofolate (10f-THF) as the formate source. During purine or folate starvation, cells accumulate ZMP or its triphosphate derivative (ZTP), which were proposed to act as “alarmones”, or signals for metabolic stress due to low amounts of either precursor[2]. Tetrahydrofolate (THF) is essential for the synthesis of purines, pyrimidines, and some amino acids, so the detection of low THF levels is a critical check on the cellular metabolic state.Recently, a riboswitch within the pfl operon was found to be the cellular means for sensing accumulation of Z nucleotides[3]. ZMP and ZTP were shown to bind to a conserved non-coding element of pfl mRNA transcripts from various bacterial species that function as genetic “ON” switches—transcription-promoting riboswitches when bound to ZMP or ZTP. Riboswitches are gene-regulatory mRNA domains that directly sense the intracellular concentration of their cognate ligands and regulate gene expression in cis. They are a widespread mechanism of gene control throughout bacteria (reviewed in refs. 4–6). In cells deprived of purines or folate, Z nucleotide accumulation leads to the activation of riboswitch-controlled genes. These include enzymes involved in purine biosynthesis like PurHJ or in the folate cycle like pyruvate formate lyase (the namesake enzyme of the pfl operon), as well as other genes[3]. Both ZMP and ZTP bind the pfl riboswitch and likely both control riboswitch function in vivo[3].The predicted secondary structure[3,7] of the ZTP riboswitch minimal ligand-binding (or “aptamer”) domain consists of two sub-domains that associate through formation of a pseudoknot[8]. Phylogenetic comparison suggested that the sequence-variable linker that covalently connects the two sub-domains of the RNA can comprise from a few to over one hundred nucleotides[3,7]. Such an arrangement of constituent elements is unprecedented among structurally characterized riboswitches. Even the spatially distant Stem I and antiterminator domains of T-box riboswitches are typically connected by a highly structured linker[9]. To understand at the atomic level how ZMP controls gene expression through the pfl motif, we have now determined the crystal structure of the F. ulceransZTP riboswitch bound to ZMP at 2.8 Å resolution. Our structure reveals how distant non-coding RNA elements coalesce to form a single ZMP-binding pocket, and suggests a mechanism for ligand-dependent RNA pseudoknot stabilization. The linker connecting the two sub-domains of the RNA is crystallographically disordered. We have demonstrated biochemically that its length modulates thermodynamically the stability of the aptamer domain and kinetically the intrinsic efficiency of transcriptional read-through. Since ZMP connects folate metabolism and purine biosynthesis, understanding how Z nucleotides regulate pfl gene expression would inform the design of Z nucleotide-based antibiotics. Sulfonamides and trimethoprim act through the inhibition of folate-synthesizing enzymes (reviewed in ref. 10), and ZTP riboswitches may offer an alternative avenue for the development of specific antibacterial compounds.
RESULTS
Screening of ZTP riboswitches
Initial studies of the ZTP riboswitch were plagued by RNA misfolding[3]. To avoid poorly behaved RNAs, we screened riboswitch aptamer domains from eight bacterial species (Supplementary Fig. 1 and Supplementary Table 1) for ZMP binding using isothermal titration calorimetry (ITC). Some of these RNAs bound ZMP with µM to sub-µM apparent dissociation constants (KD) (Supplementary Table 2), with the tightest binding RNAs from Halothermothrix orenii (KD = 110 nM) and F. ulcerans (KD = 487 nM). Several RNAs did not bind ZMP under the conditions tested (i.e., Klebsiella pneumoniae, Paenibacillus sp. HGF5, and Spirochaeta thermophila). RNAs competent for ZMP binding were screened for co-crystallization with ZMP and, in the case of the F. ulcerans riboswitch, produced well-diffracting crystals (Supplementary Fig. 2a,b).
Overall structure of the F. ulcerans ZTP riboswitch
The co-crystal structure was solved by the single-wavelength anomalous dispersion (SAD) method using data from an iridium hexammine-soaked crystal[11]. The electron density maps were of excellent quality (Supplementary Fig. 2c) and allowed unambiguous chain tracing. The bound ZMP molecule was readily identified but was not included in the crystallographic model until the final stages of refinement. The structure has been refined against diffraction data extending to 2.8 Å resolution (Methods and Table 1) to a free-R value of 22.1%. Two near-identical copies of the RNA were present in the crystallographic asymmetric unit. While this manuscript was under review, structures of Thermosinus carboxydivorans[12] and Actinomyces odontolyticus[13] ZTP riboswitches were published. They are in general agreement with the structure reported here.
Table 1
Data collection, phasing and refinement statistics
Native
Ir-Hexammine
Data collection
Space group
P3221
P3221
Cell dimensions
a, b, c (Å)
92.7, 92.7, 121.2
92.4, 92.4, 121.5
α, β, γ (°)
90, 90, 120
90, 90, 120
Resolution (Å)
30.3-2.82 (2.92-2.82)a
48.3-3.00 (3.05-3.00)a
Rmerge
0.114 (1.78)
0.126 (1.00)
I / σI
26.05 (1.79)
27.4 (0.62)
Completeness (%)
99.5 (96.2)
98.7 (87.0)
Redundancy
10.3 (10.9)
20.1 (14.7)
Refinement
Resolution (Å)
30.3-2.82 (2.92-2.82)
No. reflections
14947 (1404)
Rwork / Rfree
0.186 / 0.221
No. atoms
RNA
2653
ZMP
44
Ions/water
20
B factors
Protein
106.9
ZMP
73.1
Ions/water
77.9
r.m.s. deviations
Bond lengths (Å)
0.002
Bond angles (°)
0.829
Values in parentheses are for highest-resolution shell. Diffraction was collected from a single crystal for each dataset.
The ZTP riboswitch folds into a compact structure consisting of four helices (paired regions P1–P4) (Fig. 1). Two sub-domains associate through a pseudoknot (helix P4). Helices P1 and P2 form the 5' sub-domain of the RNA, while P3 and P4 form the 3' sub-domain (Fig. 1b). P1, P2, and P3 come together through formation of the pseudoknot helix P4 between the apical loop of P3 (nts 64–70) and the residues connecting helices P1 and P2 (nts 17–24). Sixty of the 75 residues of the crystallization construct are visible in our experimental electron density maps; residues at the 5' terminus and in the linker between helices P1 and P3 have no electron density and are presumed to be disordered (Fig. 1a,b). The riboswitch secondary structure agrees with sequence-based predictions[7], except for four base pairs in P1 formed between residues 13–16 and 40–43. These include two purine-purine pairs between conserved residues G13•A43 and A14•A42 and the U16•U40 pair (Fig. 1a). In-line probing experiments[3] suggested that these eight nucleotides are single-stranded in the absence of ZMP; their pairing may be part of the ligand-stabilized interactions responsible for the gene regulatory decision (Discussion).
Figure 1
Structure of the F. ulcerans ZTP riboswitch. (a) Secondary structure of the riboswitch based on the crystal structure. Paired regions P1 (blue), P2 (yellow), P3 (green), and pseudoknot helix P4 (orange) are labeled. Residues disordered in the crystal structure are in gray. The bound ZMP is boxed in magenta and indicated by a “Z” with nearby interacting residues G63, U70, and G71 colored magenta. Residues participating in dimerization and crystal contacts are colored pink. Lines with arrowheads indicate chain connectivity (gray line denotes crystallographic disorder). Dotted lines indicate interactions between ZMP and phosphates (U16 and C35) and A-minor interactions between A34 and the G17-C69 base pair. (b) Cartoon representation of the crystal structure of the ZTP riboswitch, colored as in A. Gray spheres denote the break in the crystallographic model at the disordered linker. (c) Chemical structure of ZMP; key atoms numbered (i.e., carboxamide carbon labeled “6”). (d) Cartoon representation of the ZTP riboswitch crystal structure, colored as in A, emphasizing association of sub-domain 1 (sticks) and sub-domain 2 (surface).
ZMP binding site stitches together four helical elements
ZMP joins the cores of the two halves of the riboswitch, being recognized through interactions with residues adjacent to all four helical segments (Fig. 2). These interactions are distributed across three layers of highly conserved nucleotides that incorporate ZMP into a continuous helical stack from P3 into P4 (Fig. 2a). On the bottom layer, G63 and G71 stack on the Z nucleobase and ribose, respectively, and pair with each other through a single hydrogen bond. The 2'-OH of G63 also contributes a hydrogen bond to the 3'-OH of ZMP (Fig. 2a,c). Mutation of either G63 or G71 to A or U completely abolished ZMP binding as measured by ITC (Supplementary Table 2). In the middle layer, the Watson-Crick face of U70 interacts with the ZMP via two hydrogen bonds, which would not be provided by an A, G, or C at this position, consistent with the high phylogenetic conservation[3] of U70 and the complete loss of ZMP binding observed in the U70A and U70C mutants (Supplementary Table 2). The top layer is formed by the longest-range base pair in RNA, formed between the highly conserved[3] G17 and C69 (Fig. 2a). These two residues contribute three interactions to ZMP binding—a hydrogen bond from N7 of G17 to the 2'-OH of ZMP, a hydrogen bond from N4 of C69 to the ZMP phosphate, and stacking interactions between G17 and the Z nucleobase. Mutation of the G17-C69 pair to either C17-G69 or A17-U69 also resulted in a complete loss in ZMP binding, as measured by ITC (Supplementary Table 2).
Figure 2
Critical ligand-RNA and RNA-RNA interactions in the ZTP riboswitch. (a) Phosphate-side view of the bound ZMP with hydrogen bonds indicated by black dashed lines. The G17-C69 (orange) and G71•G63 (magenta) base pairs are above and below ZMP, respectively. The Watson-Crick face of U70 (magenta) contacts ZMP. (b) Nucleobase-side view of the bound ZMP, showing the critical Mg2+ ion (green) and two coordinated waters WA and WB. The Mg2+ also coordinates to the O4 carbonyl oxygen of ZMP, the phosphate of C35 (yellow), and the phosphate of U16 (blue). (c) View of the ZMP binding site in the plane of the ligand. |Fo|−|Fc| residual electron density map prior to ligand placement is shown, contoured at 3.0 and 7.5 σ (green and blue mesh, respectively). (d) Tertiary interactions between loop L2 residues A30 and A34 (yellow) with helix P4 residue A18 and the G17-C69 pair, respectively. (e) View of the interactions of the G17-C69 pair with ZMP and A34. (f) View of the two copies of RNA in the crystallographic asymmetric unit, which interact via residues A23 and A64 (pink).
In addition to these interactions, the O6 carbonyl oxygen of the carboxamide moiety of ZMP makes an inner-sphere coordination to a highly ordered Mg2+ ion that brings all four helices together. This Mg2+ ion is also coordinated by non-bridging phosphateoxygen atoms from U16 (helix P1) and C35 (helix P2) (Fig. 2b). Each phosphate also contributes one hydrogen bond to N4 and carboxamide amino group N6 of ZMP, respectively. Two water molecules (WA and WB) coordinate the Mg2+ ion and form hydrogen bonds with O4' of G17 and the 2'-OH of G71, respectively (Fig. 2b). Together with U70, these interactions create a binding site that surrounds the Z nucleobase on three sides (Fig. 2c). Consistent with the importance of the inner-sphere coordination between this Mg2+ and ZMP, iridium hexammine (whose amino ligands are kinetically inert, and can thus participate only in outer-sphere coordination) did not bind appreciably at this site (not shown). Hydrogen bonds from the RNA contact the ZMP ribose and phosphate moieties; however the distal end of the phosphate of the alarmone points out into bulk solution, which is consistent with the observation that ZMP and ZTP bind the riboswitch similarly[3].On the nucleobase side of the bound ZMP, the base of the highly conserved[3] A34 from loop L2 makes type I A-minor interactions[14] with the G17-C69 base pair (Fig. 2d,e), the same pair that contacts ZMP. The P2 helix is typically a 5-base pair stem with a loop of four or five residues[3], but in the F. ulcerans riboswitch, the stem is four base pairs with a six-residue AAUUAA loop L2. Interestingly, both of the tightest binding species of riboswitches examined, F. ulcerans and H. orenii, possess this loop sequence (Supplementary Fig. 1a). In the F. ulcerans RNA, mutating loop L2 residues 29–34 to either a U1A binding site, a GAAA tetraloop, or a UCUG tetraloop abolished ZMP binding, emphasizing its importance (Supplementary Table 2). In the F. ulcerans crystal structure, U32 is looped out, A33 stacks on A34, and A29 stacks on A30 (Fig. 2d). Both A29U and A33U mutations weaken ZMP binding, by ~56-fold and ~4-fold, respectively, while the A33G mutation (KD = 599 nM) binds similarly to the wild-type RNA, consistent with stacking being its primary function. A34 is more than 90% conserved[3] among pfl species (Supplementary Fig. 3), suggesting that this tertiary contact is a feature shared among ZTP riboswitches even if the sequence and connectivity vary (Supplementary Fig. 1). Mutation of A34 to a U or G results in 69-fold (KD = 33.5 µM) and 100-fold (KD = 50.2 µM) weaker binding, respectively, relative to wild-type (Supplementary Table 2). A second tertiary contact within loop L2 occurs between A30 and the backbone of P4 in hydrogen bonds between N1 of A30 to the 2'-OH of A18 and N6 of A30 to the 3'-O of A18 (Supplementary Fig. 3).
ZMP binding stabilizes the pfl riboswitch
Although the three-dimensional structure of the ZTP riboswitch is relatively compact (153 Å2 solvent-accessible surface per residue) when compared to other RNAs[15], its secondary structure suggests that the RNA is initially extended during transcription and folds in response to ZMP or ZTP binding. To investigate this biophysically, we purified the F. ulcerans, H. orenii, and T. carboxydivorans riboswitches by size-exclusion chromatography (SEC, Supplementary Fig. 4a,b) and examined the effect of ligand by small-angle X-ray scattering (SAXS), which reports on the overall shape of a macromolecule in solution (reviewed in ref. 16).In the absence of ZMP, SAXS analyses frequently indicated non-specific aggregation by the three RNAs. Aggregation results in increased, skewed scattering at low angles (Supplementary Fig. 4c), which compromises calculation of the radius of gyration (Rg) and end-to-end distance (Dmax). For the F. ulcerans RNA, samples in the absence of ZMP that did not display overt aggregation were characterized by an Rg = 25.3 Å and Dmax = 86 Å (Table 2). Notably, these samples still had ~25% larger Porod volumes (VP) than ZMP-bound samples, suggesting that less severe aggregation or dimerization may still be occurring. Thus, the change in Rg between the bound and free states could reflect RNA-RNA association in addition to ligand binding. In the presence of ZMP, scattering curves showed no signs of aggregation (Supplementary Fig. 4c). All three ZMP-bound RNAs are compact in solution, with Rg ~ 21–24 Å and Dmax ~75 Å (Table 2). These values are similar to those observed[17] for tRNAs, which are approximately the same size as the pfl RNAs examined herein. The Kratky plot portrays the compactness of a macromolecule[16] and changes in it can reflect global conformational changes in riboswitches[18]. Kratky plots of the F. ulcerans riboswitch in the presence and absence of ligand are practically identical, suggesting that after transcription and purification, the RNA may be globally prefolded (Supplementary Fig. 4d).
Table 2
Small-angle X-ray scattering analysis of ZTP riboswitches
RNA
Rg, Å
Dmax, Å
VP, Å3
F. ulcerans
25.3
85.8
26300
F. ulcerans + ZMP
22.1
72.8
21100
H. orenii
-a
-
-
H. orenii + ZMP
24.1
77.0
30000
T. carboxydivorans
-a
-
-
T. carboxydivorans + ZMP
20.8
72.9
31200
Calculations of Rg, Dmax, and VP were not possible due to aggregation.
Common among the pfl riboswitch species examined is presence of dimeric or higher order species. In our crystals, a non-crystallographic dimer interface is observed between the two copies of pfl RNA in the asymmetric unit involving residues A23 and A64 (Fig. 2f), which are not highly conserved (Supplementary Fig. 3). A23 and A64 stack on the G21-C65 pair, and A23 nucleobases from each RNA stack on one another. The Watson-Crick face of A64 from one RNA hydrogen bonds to the sugar edge of A23 of the other RNA (Fig. 2f). Nearby residue A22 is involved in crystal packing and stacks with A22 from the adjacent dimer (Supplementary Fig. 4e). SEC peaks corresponding to the monomeric and dimeric F. ulcerans riboswitch were purified to test the contribution of each oligomeric state to ZMP binding (Supplementary Fig. 4f). Both oligomeric states bind ZMP with similar KD of ~400 nM (Supplementary Table 2). SEC analysis of the ITC samples revealed that the dimer converts to monomer during the titration while the monomer remains monomeric (Supplementary Fig. 4f), indicating that the riboswitch functions as a monomer. Together, both SEC and SAXS results suggest that ZMP binding to pfl RNA prevents misfolding by stabilizing the monomeric state. The increased Rg and Dmax observed by SAXS in the absence of ligand (Table 2) are likely due to varying amounts of dimerization, which is alleviated by addition of ZMP.
Linker length modulates ZMP affinity and antitermination
The aptamer domain of the ZTP riboswitch is composed of two sub-domains that associate through space by formation of the pseudoknot helix P4 and binding of ZMP or ZTP, but which can be separated by a linker of over 100 nt (Fig. 1). To examine the consequences of the naturally observed variation in linker length, we tested ZMP binding by ITC to F. ulcerans riboswitch variants with either 10 or 20 adenosines inserted between residues 55 and 56 (Fig. 3). Additionally, we inserted a naturally occurring 102-nt linker from environmental sample 3278 (env3278, Fig. 3a and Supplementary Fig. 1h), which appears in a subset of pfl variants[3]. The KD for ZMP is increased by ~6-fold for the 10A insertion (KD = 2.99 µM), ~10-fold for the 20A insertion (KD = 4.74 µM), and ~20-fold for the 102-nt insertion (KD = 9.70 µM) (Supplementary Table 2 and Supplementary Fig. 5a–e). These observations suggest that increased linker length imposes an entropic penalty on folding of the RNA. To examine the effect of an arbitrarily long linker, we prepared an isolated 5' subdomain of the ZTP riboswitch (Δ55–75). This RNA did not bind ZMP by itself (Supplementary Table 2 and Supplementary Fig. 5f). However, when the 3' sub-domain (nts 59–75) was supplied in trans, the RNA bound ZMP with a dissociation constant even larger than that of the construct with the 102-nt linker (KD = 15.2 µM). These experiments confirm that association of the two sub-domains is required for ZMP binding and, given that our SAXS data shows that the two sub-domains can associate prior to ZMP binding, imply that linker length modulates stability of the aptamer domain.
Figure 3
Effect of linker length on ZMP binding affinity. (a) Cartoons of F. ulcerans riboswitches with varying linker lengths for the wild-type F. ulcerans, +10 adenosine linker, +20 adenosine linker, +102-nt linker derived from environmental sample 3278, and Δ55–75 plus 59–75 in trans. (b) Apparent dissociation constants (KD) of riboswitch linker variants from isothermal titration calorimetric analysis. For wild-type ZTP riboswitch (n = 5), the average is shown, with s.d. For other experiments (n = 2 or 3), all independent experiments are shown.
The naturally occurring variation in the linker connecting the riboswitch sub-domains may serve to modulate the threshold of ZMP required to activate gene expression, or to fine-tune riboswitch response to RNA polymerase elongation speed. The Clostridium beijerinckiiZTP riboswitch has been previously shown to function as a transcriptional “ON” switch by preventing the formation of a transcription termination hairpin in the ZMP-bound state[3]. In those experiments, the concentration of ZMP required to terminate transcription was much greater than the KD determined from in-line probing, suggesting that the ZTP riboswitch is kinetically controlled[3]—that is, the speed of transcription by RNA polymerase is too fast for ligand binding to reach equilibrium[19].The F. ulceransZTP riboswitch we have studied is also predicted to form a terminator hairpin in the “OFF” state, employing alternative base pairing of residues that form helices P3 and P4 in the “ON” (or antiterminator) state (Fig. 4a). To test the contribution of linker length to the transcriptional readout of the riboswitch, we examined the same linker variants as above in single-round transcription assays (Fig. 4 and Supplementary Fig. 5g) in the presence and absence of 0.1 mM ZMP (which was previously shown to be saturating for the C. beijerinckiiZTP riboswitch[3]). In the absence of ZMP, the wild-type riboswitch antiterminates 17% (Fig. 4b), while in the presence of ZMP, production of full-length RNA increased by 130% (Fig. 4c). For transcription assays with ZTP riboswitch linker variants, two effects were observed. First, for all three longer variants, more full-length RNA was produced in the absence of ZMP compared to the wild-type riboswitch (Fig. 4b). This enhancement in the basal level of transcription antitermination suggests that increased transit time of the polymerase through the longer linkers may allow more time for the P1 and P2 helices to fold (and thus increases their efficiency in "capturing" P3 and P4 as they are transcribed). Second, as the linker length separating the two sub-domains was increased, addition of ZMP resulted in progressively smaller relative enhancement of transcriptional antitermination (Fig. 4c). The reduction in the effect of ZMP is consistent with the progressively weaker binding (Supplementary Table 2 and Supplementary Fig. 5a–f) of the small molecule resulting from decreased stability of the aptamer domain as linker length is increased.
Figure 4
Transcription antitermination by ZTP riboswitch linker variants. (a) Predicted secondary structures of the two states of the F. ulcerans ZTP riboswitch. “PT” denotes transcription terminator hairpin predicted to form in the “OFF” state in the absence of ZMP. Nucleotides within the terminator hairpin PT involved in alternative base pairing in helices P3 and P4 are colored blue. (b) Fraction of antiterminated RNA transcribed in the absence ZMP, as observed in single-round transcription assays. (c) Fractional increase of antiterminated RNA induced by the presence of 0.1 mM ZMP. For all experiments, n = 2. Both independent experiments are shown for each linker variant.
DISCUSSION
Many of the metabolites recognized by naturally occurring riboswitches are purines, are purine-related, or contain purine moieties (reviewed in refs. 6,20). Atomistic structures have been reported for riboswitches that recognize simple purines[21,22], cyclic purine dinucleotides[23-29], S-adenosyl methionine[30-32], vitamin B12 (refs. 33,34), flavin mononucleotide[35], THF (refs. 36,37), and the hypermodified purine preQ1 (refs. 38–41). The Mg2+-mediated recognition of the ketoneoxygen of the ZMP carboxamide moiety we have uncovered is thus far unique (Fig. 2b,c), but the residues forming the ZMP binding pocket share some similarity to those of other riboswitch ligand-binding sites. Ligand-Mg2+ coordination typically occurs at ligand phosphate groups, as in the FMN and TPP riboswitches[35,42-44]. Examples of coordination between carboxylate groups of bound amino acids and cations are seen in the glycine riboswitch[45,46] (with Mg2+) and lysine riboswitch (with K+)[47,48]. The guanine and THF riboswitches[21,22,36,37] both recognize purines using three hydrogen bonds between the Watson-Crick face of a U residue of the RNA and the minor groove face of guanine (Fig. 5a) or the guanine-like pterin ring of THF (Fig. 5b). The corresponding interaction between U70 and the Hoogsteen face of ZMP involves only two hydrogen bonds, one of which is donated by the carboxamide amino group N6 (Fig. 5c). Like the THF riboswitch, the ZTP riboswitch entirely surrounds the nucleobase moiety of its ligand while the opposite end of the molecule points into solution (Fig. 2), allowing for binding to both ZMP and ZTP. Both ZMP’s biosynthetic precursor (succinyl-ZMP) and successors (e.g., FAICAR, inosine monophosphate, and purines) would be unable to bind the ZTP riboswitch due to steric clash with the phosphates and coordinated Mg2+. Even the addition of a single methyl group to the carboxamide moiety of the Z nucleobase eliminates ligand binding[3].
Figure 5
Ligand recognition by purine and purine-like riboswitches. (a) Ligand binding site of the guanine riboswitch[22] bound to guanine (G). (b) Ligand binding site at the pseudoknot region of the THF riboswitch[36]. (c) Ligand binding site of the F. ulcerans ZTP riboswitch.
Since the phosphate end of the bound ZMP is exposed to bulk solvent (Fig. 2), ZMP could enter a largely preformed binding site and stabilize the binding pocket after the pseudoknot has already folded. SAXS data show little to no global conformational change upon ZMP binding (Table 2), so the pseudoknot may form independently from ligand binding. This would be reminiscent of the FMN riboswitch, whose global structure was unaffected by ligand binding according to SAXS experiments[49] and enzymatic probing[35]. However, in-line probing of the Clostridium bartletti ZTP riboswitch suggests that helix P4 and four base pairs of P1 are less reactive upon ZMP binding[3]. These four pairs correspond to residues 13–16 and 40–43 in the F. ulcerans riboswitch, which, aside from U40, are highly conserved (Supplementary Fig. 3) and may constitute part of a ligand-dependent conformational switch. It is possible that SEC purification of the riboswitch prior to SAXS eliminated the flexibility and heterogeneity observed in probing experiments. Alternatively, while the overall shape of the riboswitch may be unchanged, the residues surrounding ligand binding site could be more flexible in the absence of ligand. In either case, formation of the pseudoknot helix is predicted to compete with the downstream transcription termination hairpin (Fig. 4a), so in the absence of ZMP the riboswitch aptamer domain must remain plastic enough to unfold once the downstream terminator sequence has been fully transcribed.Among the purine and purine-related riboswitches, many contain pseudoknots, but none have an internal segment as long and as variable as the linker separating the two sub-domains of the ZTP riboswitch. Engineering longer linkers weakens ZMP binding (Fig. 3, Supplementary Table 2), but an RNA with the longest known naturally-occurring linker binds to its ligand with dissociation constant comparable to that of a trans-acting (i.e., split) riboswitch (Fig. 3). Riboswitches are thought to function through either kinetic or thermodynamic control depending on the competition between ligand binding and transcription termination that ultimately results in the gene control decision (reviewed in refs. 6,19). The longer linker variants of ZTP riboswitches often contain stem-loops, as exemplified by the T. bispora linker separating helices P4 and P2 (Supplementary Fig. 1e) and environmental sample 3278 linker separating helices P1 and P3 (Supplementary Fig. 1h). These stem-loops could serve as transcriptional pause sites that act to retard elongation and allowing the RNA more time to fold, thereby altering the mode of thermodynamic or kinetic control. Consistent with the two sub-domains of the riboswitch associating prior to ZMP binding, single-round transcription experiments (Fig. 4) imply that linker length primarily controls the antitermination efficiency of the F. ulcerans riboswitch in the absence of ZMP, rather than the dynamic range of the genetic switch.Several examples of trans-acting riboswitches have been previously described[50-52]. In these cases, the trans-association is analogous to sRNA-target binding and is mediated by Watson-Crick base pairing. The pfl riboswitch is distinctly different in that the association of its two constituent sub-domains is ligand mediated, and, in addition to pseudoknot formation (by definition involving Watson-Crick base pairing[8]), other tertiary interactions are involved. We have demonstrated that the two sub-domains of the ZTP riboswitch can associate in trans to bind ZMP (Fig. 3). This opens the way to the design of split variants of the ZTP riboswitch that could in principle function in cells to coordinate the expression of two disjoint transcriptional units. Moreover, such ligand-mediated trans-association may underlie the function of as yet uncharacterized cellular non-coding RNAs.
ONLINE METHODS
Preparation of pfl riboswitch RNAs
RNAs were in vitro transcribed from DNA templates prepared by PCR from plasmids (Integrated DNA Technologies) encoding pfl cDNAs, and purified using denaturing polyacrylamide gel electrophoresis as previously described[53]. The insert of the Fusobacterium ulcerans wild-type plasmid pFulpfl is identical to GenBank ACDH02000027.1, nts 25274–25200 (antisense). The wild-type pfl plasmids from other bacterial species are described in Supplementary Table 1, as well as the pfl plasmid containing the 102-nt env3278 insertion sequence. Plasmids and DNA templates were designed to obtain homogeneous RNA 5' and 3' ends during transcription, as previously described[23]. Site-directed mutagenesis was performed using QuikChange kits (Agilent) with oligonucleotide primers (Integrated DNA Technologies). Prior to use, RNAs were brought up in a buffer containing 50 mM HEPES-KOH, pH 7.4 and 150 mM KCl, and heated at 80 °C for 2 min and 60 °C for 2 min. MgCl2 was then added to 10 mM final concentration, and the solution was placed on ice for at least 30 min. For crystallization, pfl RNAs were folded in the presence of 1.25-fold excess ZMP (Sigma Aldrich).
Crystallization and diffraction data collection
Crystals were grown by hanging-drop vapor diffusion at 20°C by adding 1.4 µL of a 200 µM RNA solution containing 50 mM HEPES-KOH, pH 7.4, 150 mM KCl, 10 mM MgCl2, and 250 µM ZMP to 0.7 µL of a reservoir solution containing 50 mM Bis-Tris-HCl, pH 8.0, 0.2 M LiSO4, and 30% (w/v) polyethylene glycol (PEG) 4000, and equilibrated against 500 µL of reservoir solution. Crystals were grown with the less expensive ZMP rather than ZTP since both Z nucleotides bind to the riboswitch with near-equivalent affinity[54]. Hexagonal bipyramid-shaped crystals appeared within 2 days, and grew to maximum dimensions of 250 × 250 × 250 µm3 over a week. Crystals primarily nucleated out of precipitate (Supplementary Fig. 2a). A single crystal was washed three times with stabilizing solution, and the washes and dissolved crystal were analyzed via denaturing polyacrylamide gel electrophoresis. Staining with ethidium bromide confirmed the presence of intact RNA in the crystal (Supplementary Fig. 2b). For heavy-atom derivative preparation, 1 µL of reservoir solution supplemented with 10 mM iridium (III) hexammine was added directly to the drops, which were then incubated for up to 3 hours at 20°C prior to vitrification in liquid nitrogen. Native and SAD data were collected at 1.0 Å and at the iridium L-III edge (1.105 Å), respectively, at 100 K at beamline 5.0.2 of the Advanced Light Source (ALS), Lawrence Berkeley National Laboratory. Data were integrated and scaled using the HKL (ref. 55) package.
Structure determination and refinement
The heavy atom substructure was determined with SHELX (ref. 56) and confirmed using HySS (ref. 57) implemented in Phenix (ref. 58). Phase-probability distributions were calculated using PHASER (ref. 59) with heavy atom sites from SHELX, and density-modified using RESOLVE (ref. 60) (Supplementary Fig. 2c). Satisfactory CCall/CCweak SHELXD scores were obtained for 9–15 iridium sites, but solutions for 11 iridium sites provided the best electron density maps. Manual model building was performed in Coot (ref. 61) with RCrane (ref. 62). Rounds of model building interspersed with refinement were then performed, using experimental phases and Ir-hexammine amplitudes until the free-R factor was 0.290. ZMP and coordinated Mg2+ and water ions were placed into the |Fo|−|Fc| residual electron density (Fig. 2c), and final model building and refinement was performed against the native structure factor amplitudes, to a final free-R factor of 0.221. The mean coordinate precision of the crystallographic model is estimated[58] to be 0.37 Å. The average solvent-accessible surface area per residue was calculated over both copies of RNA in the asymmetric unit using StrucTools provided by Helix Systems at the National Institutes of Health, Bethesda, MD (http://helix.nih.gov).
Small-angle X-ray scattering
For SAXS experiments, 250 µL samples of 200 µM RNA (~5 g/L) were folded and purified by size-exclusion chromatography using a Superdex 200 Increase 10/300 GL column (GE). The column was equilibrated in a buffer matching the RNA folding conditions, and fractions corresponding to the monomeric peak were concentrated by centrifugal ultrafiltration and stored for no more than 48 hours at 4°C until data collection at beamline 12-ID-C of the Advanced Photon Source (APS), Argonne National Laboratory. Samples of the RNAs were judged for final purity by native and denaturing polyacrylamide gel electrophoresis (Supplementary Fig. 4b). SAXS data were collected as previously described[63] on RNA samples at several concentrations (0.2–1 g/L) in the presence and absence of 100 µM ZMP. Data were analyzed as previously described[23].
Isothermal titration calorimetry
ITC measurements were made using a MicroCal iTC200 microcalorimeter (GE), and data were fit using NITPIC and SEDPHAT (refs. 64,65). All experiments were performed at 37 °C in RNA refolding buffer (50 mM HEPES-KOH, pH 7.4, 150 mM KCl, 10 mM MgCl2). Typically, 20 µM RNA in the cell was titrated with 400 µM ZMP in the syringe, but higher concentrations of RNA were necessary for weaker binding RNAs. For examination of F. ulcerans Δ55–75 and 59–75 in trans, 40 µM of each RNA was folded together and titrated with 800 µM ZMP. For ITC experiments on SEC-purified RNA samples, ITC measurements were recorded first, and 100 µL of the ~300 µL cell solutions, which were bound to ZMP as judged by ITC, were used for post-ITC SEC analysis.
Single-round transcription termination assays
DNA templates containing a lambda phage promoter and 29-nucleotide C-less sequence[66] followed by the F. ulceransZTP riboswitch, termination hairpin, and downstream sequence (50 nts) were PCR amplified from wild-type and mutant plasmids (Supplementary Table 1) using primers described in Supplementary Table 3. Experiments were performed as previously described[54], with minor alterations. Halted transcription complexes were formed using a reaction mix containing 0.08 pmol µL−1 DNA template, 20 mM Tris-HCl, pH 8, 20 mM NaCl, 14 mM MgCl2, 0.1 mM DTT, 0.1 mM EDTA, 1% glycerol, 130 µM ApU, 2.5 µM GTP, 2.5 µM UTP, 1 µM ATP, ~10 µCi 32P-[α]-ATP, and 0.04 U µL−1
E. coli RNA polymerase holoenzyme (Epicenter), incubated at 37 °C for 10 min. Transcription was restarted by addition of 8 µL of halted complex to tubes containing 1 µL of 10× final concentration of ZMP and 1 µL of 150 µM ATP, 150 µM UTP, 150 µM GTP, 50 µM CTP, 1 mg mL−1 heparin, 20 mM Tris-HCl, pH 8, 20 mM NaCl, 14 mM MgCl2, 0.1 mM DTT, 0.1 mM EDTA, and 1% glycerol. Transcription was allowed to proceed for 20 min at 37 °C. To remove DNA template, 1 µL of RNase-free DNase (Promega) was added, and the mixture was incubated at 37 °C for 30 min. 10 µL of loading dye (8 M urea, 20% sucrose, 0.1% SDS, 0.01% bromophenol blue, 0.01% xylene cyanol, 90 mM Tris-HCl, 90 mM borate, and 1 mM EDTA, pH 8) was added, and the mixture was separated by 8% denaturing PAGE (Supplementary Fig. 7). Band intensities were measured using ImageQuant software (GE), background corrected, and normalized to account for the amount of 32P-[α]-ATP incorporated into each band. The amounts of ZMP-induced anti-terminated RNA were calculated by dividing the total anti-terminated RNA in the presence of ZMP by the amount of anti-terminated RNA in the absence of ZMP, minus 100%.
Authors: Changrui Lu; Angela M Smith; Ryan T Fuchs; Fang Ding; Kanagalaghatta Rajashankar; Tina M Henkin; Ailong Ke Journal: Nat Struct Mol Biol Date: 2008-09-21 Impact factor: 15.369
Authors: Kathryn D Smith; Sarah V Lipchock; Tyler D Ames; Jimin Wang; Ronald R Breaker; Scott A Strobel Journal: Nat Struct Mol Biol Date: 2009-11-08 Impact factor: 15.369
Authors: Airlie J McCoy; Ralf W Grosse-Kunstleve; Paul D Adams; Martyn D Winn; Laurent C Storoni; Randy J Read Journal: J Appl Crystallogr Date: 2007-07-13 Impact factor: 3.304
Authors: Brandon Tran; Patricio Pichling; Logan Tenney; Colleen M Connelly; Michelle H Moon; Adrian R Ferré-D'Amaré; John S Schneekloth; Christopher P Jones Journal: Cell Chem Biol Date: 2020-08-13 Impact factor: 8.116