Decades of study of the RNA folding problem have revealed that diverse and complex structured RNAs are built from a common set of recurring structural motifs, leading to the perspective that a generalizable model of RNA folding may be developed from understanding of the folding properties of individual structural motifs. We used single-molecule fluorescence to dissect the kinetic and thermodynamic properties of a set of variants of a common tertiary structural motif, the tetraloop/tetraloop-receptor (TL/TLR). Our results revealed a multistep TL/TLR folding pathway in which preorganization of the ubiquitous AA-platform submotif precedes the formation of the docking transition state and tertiary A-minor hydrogen bond interactions form after the docking transition state. Differences in ion dependences between TL/TLR variants indicated the occurrence of sequence-dependent conformational rearrangements prior to and after the formation of the docking transition state. Nevertheless, varying the junction connecting the TL/TLR produced a common kinetic and ionic effect for all variants, suggesting that the global conformational search and compaction electrostatics are energetically independent from the formation of the tertiary motif contacts. We also found that in vitro-selected variants, despite their similar stability at high Mg2+ concentrations, are considerably less stable than natural variants under near-physiological ionic conditions, and the occurrence of the TL/TLR sequence variants in Nature correlates with their thermodynamic stability in isolation. Overall, our findings are consistent with modular but complex energetic properties of RNA structural motifs and will aid in the eventual quantitative description of RNA folding from its secondary and tertiary structural elements.
Decades of study of the RNA folding problem have revealed that diverse and complex structured RNAs are built from a common set of recurring structural motifs, leading to the perspective that a generalizable model of RNA folding may be developed from understanding of the folding properties of individual structural motifs. We used single-molecule fluorescence to dissect the kinetic and thermodynamic properties of a set of variants of a common tertiary structural motif, the tetraloop/tetraloop-receptor (TL/TLR). Our results revealed a multistep TL/TLR folding pathway in which preorganization of the ubiquitous AA-platform submotif precedes the formation of the docking transition state and tertiary A-minorhydrogen bond interactions form after the docking transition state. Differences in ion dependences between TL/TLR variants indicated the occurrence of sequence-dependent conformational rearrangements prior to and after the formation of the docking transition state. Nevertheless, varying the junction connecting the TL/TLR produced a common kinetic and ionic effect for all variants, suggesting that the global conformational search and compaction electrostatics are energetically independent from the formation of the tertiary motif contacts. We also found that in vitro-selected variants, despite their similar stability at high Mg2+ concentrations, are considerably less stable than natural variants under near-physiological ionic conditions, and the occurrence of the TL/TLR sequence variants in Nature correlates with their thermodynamic stability in isolation. Overall, our findings are consistent with modular but complex energetic properties of RNA structural motifs and will aid in the eventual quantitative description of RNA folding from its secondary and tertiary structural elements.
Essential and ubiquitous
biological functions, including pre-mRNA
splicing and translation, require RNA molecules to fold into well-defined
structures, bind specific proteins and ligands, and undergo precise
conformational changes.[1−4] These properties of RNA are ultimately dictated by its sequence,
and the importance of RNA’s sequence–structure–function
relationship to modern day biology and, likely, to early evolution
has motivated the in-depth investigation of the RNA folding problem.[5−9]Over the past decades, general features of RNA structure have
emerged
that simplify the dissection of RNA folding. First, the high stability
of RNA secondary structure results in hierarchical folding, such that
tertiary folding can be considered to arise from a conformational
search among largely prefolded secondary structure elements.[10−15] Second, phylogenetic and structural studies have revealed recurring
structural motifs in RNA.[16−20] These structural motifs include junction elements, such as four-way
junctions and kink turns that steer emanating helices in preferred
orientations, and tertiary motifs that connect regions of RNA distant
in primary and secondary structure.[21−23]Observation of
RNA motifs with superimposable structures embedded
in different folded RNAs has led to the perspective that RNA is structurally
modular and has suggested that understanding the folding properties
of isolated RNA motifs might provide generalizable insights into RNA
folding.[12,24−32] Thus, there has been considerable effort to dissect the properties
and behaviors of certain common and tractable structural elements.
For example, single-molecule and ensemble studies of isolated kink
turns and four-way junctions have provided insights into their conformational
preferences and dynamics and have revealed a high sensitivity of their
folding properties to ionic conditions.[33−36] Particularly relevant to the
current study are the pioneering single-molecule FRET (smFRET) studies
of the canonical GAAA/11ntR tetraloop/tetraloop-receptor (TL/TLR)
in isolation by Nesbitt, Pardi, and colleagues.[37−41] Here, we build on these studies to describe the molecular
events that underlie the formation of this tertiary structural motif
and to provide insights into the kinetic and thermodynamic properties
of different sequence variants and how these properties may influence
their use in natural structured RNAs.The GAAA/11ntR TL/TLR
is a tertiary structural motif found in group
I and II introns, RNase P, and cyclic-di-GMP riboswitches that consists
of long-range hydrogen bond and base-stacking interactions between
a GAAA tetraloop and an 11 nucleotide tetraloop-receptor (11ntR) (Figure A).[16,24,42] Comparison of crystal structures
of the GAAA/11ntR embedded within different RNAs show that the three-dimensional
structure of the GAAA/11ntR motif is largely independent of structural
context (Figure B),
and solution NMR studies suggest that the crystallographically observed
structure of the GAAA/11ntR is also maintained in solution.[43] Although most studies have focused on the canonical
sequence of the GAAA/11ntR TL/TLR motif, phylogenetic comparisons
of structured RNAs reveal a wealth of variants of the 11ntR tetraloop-receptor
(Figure C),[16,44] and additional variants have been obtained through in vitro selection.[44−46]
Figure 1
Structural modularity and sequence diversity of the GAAA/11ntR
tertiary motif. (A) Secondary structure of the canonical 11ntR tetraloop-receptor
(cyan) and GAAA tetraloop (red). Watson–Crick (WC) and noncanonical
base-pairs are marked as colored dashed lines and circles, respectively.
Tertiary interactions are marked in black. UA_handle[29] and AA-platform[19] submotifs
are boxed. Residue numbering is used throughout. (B) Overlay of four
crystal structures of the GAAA/11ntR motif. Backbone of tetraloop
and tetraloop-receptor are colored as in (A). The structure of the
TL/TLRs were extracted from crystal structures of the P4–P6
domain (PDB 1GID,[47] blue), RNase P (PDB 1NBS,[48] green), and Azoarcus group I intron (PDB 1ZZN,[49] yellow and orange). PDB 1NBS contains a tetraloop-receptor variant
in which the canonical AA-platform is replaced by an AC-platform.
(C) Sequence variation in 515 11ntR-like tetraloop-receptors identified
in the sequence and secondary structure database of aligned group
I introns.[50] Numbers in parentheses correspond
to the total number of receptors containing the specified sequence
variation. The complete sequences of the variants along with their
observed frequencies are provided in Table S1. It is not known whether A–U and G–C at positions
6 and 7 (marked with *) form canonical WC base-pairs.
Structural modularity and sequence diversity of the GAAA/11ntR
tertiary motif. (A) Secondary structure of the canonical 11ntR tetraloop-receptor
(cyan) and GAAA tetraloop (red). Watson–Crick (WC) and noncanonical
base-pairs are marked as colored dashed lines and circles, respectively.
Tertiary interactions are marked in black. UA_handle[29] and AA-platform[19] submotifs
are boxed. Residue numbering is used throughout. (B) Overlay of four
crystal structures of the GAAA/11ntR motif. Backbone of tetraloop
and tetraloop-receptor are colored as in (A). The structure of the
TL/TLRs were extracted from crystal structures of the P4–P6
domain (PDB 1GID,[47] blue), RNase P (PDB 1NBS,[48] green), and Azoarcus group I intron (PDB 1ZZN,[49] yellow and orange). PDB 1NBS contains a tetraloop-receptor variant
in which the canonical AA-platform is replaced by an AC-platform.
(C) Sequence variation in 515 11ntR-like tetraloop-receptors identified
in the sequence and secondary structure database of aligned group
I introns.[50] Numbers in parentheses correspond
to the total number of receptors containing the specified sequence
variation. The complete sequences of the variants along with their
observed frequencies are provided in Table S1. It is not known whether A–U and G–C at positions
6 and 7 (marked with *) form canonical WC base-pairs.We used smFRET to dissect the kinetic and thermodynamic
properties
of a set of sequence variants of the GAAA/11ntR in isolation, including
variants found in natural structured RNAs and variants evolved previously
by in vitro selection. Our results, in combination
with prior studies of the canonical GAAA/11ntR, revealed multiple
steps in the TL/TLR folding pathway. Differences in the ion sensitivity
of the docking and undocking rate constants revealed sequence-dependent
local conformational rearrangements prior to and after the formation
of the docking transition state. These rearrangements and their accompanying
electrostatic effects are separable from a common global conformational
search and the electrostatics associated with global compaction. Finally,
comparison of the thermodynamic stability of the TL/TLR variants at
near-physiological ionic conditions provided insights into the relative
occurrence of TL/TLR sequences in Nature.
Materials
and Methods
smFRET Instrumentation
smFRET data were collected by
two independent methods using (1) a conventional prism-based TIRF
microscope equipped with an Andor Ultra iXon 897 EMCCD camera and
(2) a confocal microscope equipped with avalanche photodiodes for
single-photon counting (Supporting Methods). For the first setup, time-lapse images were recorded at an exposure
time of 5 ms. Control experiments with varying acquisition frame rates
suggested a slight underestimation (∼15%) of the rate constants
measured by the camera at conditions in which the dynamics were greater
than ∼70 s–1. We therefore used single-photon
counting for conditions in which TL/TLR docking dynamics were expected
to exceed 70 s–1. For these conditions, single-photon
arrival times were recorded with ns accuracy and binned offline at
1 ms (Supporting Methods). Herein, data
points obtained with the single-photon counting setup are marked with
an internal “X” (seeFigure for example). Excellent agreement between
data collected by camera (5 ms) and single-photon counting (1 ms)
(Tables S4–S9) at overlapping conditions
indicates that there were no significant time binning effects in the
data analyzed (see Figure for example), as did analysis of the overall relaxation rate
constant (kdock + kundock) obtained directly from photon counting (data not shown).
Figure 4
Docking
kinetics, thermodynamics and Mg2+ dependence
of TL/TLR variants in U7-tether minimal construct. (A–C)
Median kdock, kundock, and Kobs values for each TL/TLR variant
at a range of Mg2+ concentrations. All measurements were
carried out in a background of 140 mM K+. Data and errors
are summarized in Table S4. Standard errors
calculated from bootstrapping were smaller than the size of the symbols.
Data points marked with an internal “X” were determined
using single-photon counting. Linear fits were determined by least-squares.
Mg2+ uptake prior to and after formation of the docking
transition state (D and E, respectively) and net Mg2+ uptake
(F) were obtained from the linear fits in panels A–C as given
by eq and eqs and 2B.
Preparation of Constructs for smFRET Experiments
Synthetic
oligonucleotides (Table S2) with 5′
amino-C6 modifications were purchased from Integrated DNA Technologies,
and a standard protocol was followed to incorporate N-hydroxysuccinamide (NHS) functionalized Cy3B and Cy5 (GE Healthcare)
fluorophores into the oligos.[51] After labeling,
the oligos were purified by denaturing PAGE without the use of UV
shadowing, which can damage RNA and result in artifactual covalent
heterogeneity.[52,53] Prior to smFRET experiments,
the corresponding oligos were mixed at 1 μM and annealed by
heating to 70 °C for 3 min in 100 mM NaMOPS, pH 7.0, and 0.1
mM EDTA, followed by slow cooling to 4 °C at 0.1 °C/sec.
smFRET Experiments
Imaging buffers contained specified
concentration of MgCl2, BaCl2, KCl, NaCl, and/or
RbCl in addition to 50 mM MOPS, pH 8.0 (titrated with KOH, NaOH, or
RbOH), 0.01 mM NaEDTA, 1 mM TROLOX, 2 mM protocatechuic acid, and
0.001 units/μL protocatechuate-3,4-dioxygenase. All smFRET experiments
were performed at 23 ± 2 °C. Before taking smFRET measurements
at a specified salt condition, the flow channel was washed with 20
volumes of buffer containing the specified salt concentration followed
by 2 volumes of imaging buffer with the specified salt concentration.
Salt Activity Calculations
The mean activity coefficients
(γ±) of the monovalent salt solutions were calculated
as described by Leipply et al.,[54] using
the tables of empirical salt activity coefficients reported by Robinson
and Stokes.[55] As there are no activity
estimates for mixed chloride and MOPS salts, we calculated the mean
activity using the total [M+] as if it were present solely
as the chloride salt; because the MOPS salts were present at low concentrations
(40 mM cation), we expect this effect to be insignificant.
Data Analysis
Single-molecule traces were selected
based on anticorrelation of donor and acceptor intensities, consistent
total intensity throughout the trace, and single step photobleaching
as previously described.[25] Consistent with
prior smFRET studies of the canonical GAAA/11ntR, ∼30% of the
single molecules did not show FRET transitions within the observation
time window.[37,38] This inactive population may
be caused by synthesis errors and/or chemical damage, although a population
of molecules trapped in an inactive conformation cannot be ruled out.
Inactive molecules were removed from further analysis as done previously.[38]Bayesian analysis (Table S3) indicated that the smFRET traces are best described
by two FRET states, consistent with previous observations of the canonical
GAAA/11ntR smFRET minimal construct.[37,40] To extract
kinetics, each single-molecule trace was fit to a two-state hidden
Markov model (HMM) using the Single Molecule Analysis Research Tool
(SMART), which fits the donor and acceptor intensities directly, as
described previously.[25,56−58] Fitting the
intensities directly allows for appropriately accounting for the noise
in the donor and acceptor channels and reduces the amount of required
preprocessing of the data. The rate and equilibrium constants and
their errors for each constructs and each solution condition are summarized
in Tables S4–S9. smFRET data summaries
for all TL/TLR variants and conditions are provided (Figures S9–S275). All raw data are available for download.[59]The agreement between data obtained with
the camera (5 ms binning)
and single-photon counting (1 ms binning) setups at overlapping conditions
demonstrated that there were no significant time binning effects (see
for example Figure ). Because of differences in background subtraction between the camera
and the single-photon counting setups, the apparent average FRET states
differed between the two experimental setups. There were sufficient
overlapping data to indicate that the low FRET state value of 0.28
± 0.01 with the camera setup corresponded to an apparent FRET
state value of 0.44 ± 0.01 with the single-photon counting setup,
and similarly for the high FRET state, 0.57 ± 0.01 in camera
setup corresponded to an apparent FRET state of 0.68 ± 0.02 in
the single-photon counting setup.
Identification of Natural
Variants of the GAAA/11ntR in Group
I Introns
The sequence alignments contained in the Group
I Intron Sequence and Structure Database (GISSD)[50] were searched for 11ntR-like tetraloop-receptors at locations
that are predicted to form TL/TLR interactions: L5b-P6, L9-P5, and
L2-P8.[24,60] Based on previous structural characterization
of the GAAA/11ntR and its component submotifs,[29,44,45,61] 11ntR-like
tetraloop-receptors were defined as sequences containing (1) a putative
UA_handle submotif which consists of a noncanonical U·A base-pair
that stacks over a Watson–Crick (WC) base-pair with an intervening
bulging residue (Figure A),[29] (2) a conserved C2–G10 base-pair, and (3) and an internal asymmetrical loop capable
of forming a dinucleotide platform submotif. For simplicity and to
avoid ambiguity in the selection, sequences with insertions and deletions
were not considered.A list of all possible single to quintuple
mutants of the canonical 11ntR were generated keeping C2-G10 and U3·A8 constant and
with a WC or wobble base-pair at positions 1–11 (Figure C). These sequences were searched
for in the P5, P6, and P8 helices of the 1789 group I introns contained
in the database. To eliminate false positives, the criteria for 11ntR-like
tetraloop-receptors described above were verified by visually inspecting
the predicted secondary structures[50] of
the introns containing the putative tetraloop-receptors. The database
was also searched for the in vitro-selected tetraloop-receptors
C7.10, C7.2, and C7.9 described below (Figure A). These in vitro-selected
motifs were not found in the group I introns investigated, consistent
with previous observations of the absence of these motifs in natural
structured RNAs.[24]
Figure 2
Natural and in
vitro-selected TL/TLR variants
were inserted into a minimal U7-tethered smFRET construct.
(A) Sequence of the six tetraloop-receptor variants investigated.
Residues that differ from the canonical 11ntR are boxed. Secondary
structures of 11ntR and 11ntR_AC in the docked state are known experimentally.[47,48] (B) Schematic of smFRET construct in which tetraloop-receptors in
panel A were embedded. GAAA tetraloop (red) and residues in cyan are
common to all of the TL/TLR variants investigated. This smFRET construct
has been used in previous studies to characterize the canonical GAAA/11ntR.[38,40] The complete sequence of all constructs is provided in Table S2.
Natural and in
vitro-selected TL/TLR variants
were inserted into a minimal U7-tethered smFRET construct.
(A) Sequence of the six tetraloop-receptor variants investigated.
Residues that differ from the canonical 11ntR are boxed. Secondary
structures of 11ntR and 11ntR_AC in the docked state are known experimentally.[47,48] (B) Schematic of smFRET construct in which tetraloop-receptors in
panel A were embedded. GAAA tetraloop (red) and residues in cyan are
common to all of the TL/TLR variants investigated. This smFRET construct
has been used in previous studies to characterize the canonical GAAA/11ntR.[38,40] The complete sequence of all constructs is provided in Table S2.
Results
Choice of TL/TLR Variants
Comparison
of the available
sequences of group I and II introns, RNase P, and cyclic-di-GMP riboswitches
reveals substantial sequence variation in the TL/TLR motif.[44,50,62−65] To examine this variation, we
analyzed 1789 aligned group I intron sequences and found 515 identifiable
11ntR-like tetraloop-receptors. Figure C summarizes the most frequent base pairs, dinucleotide
platforms, and residues that comprise the 11ntR-like tetraloop-receptors,
and Table S1 lists the individual sequences.
From these natural tetraloop-receptor variants, we chose to study
the canonical 11ntR as well as tetraloop-receptors 11ntR_AC and 11ntR_A/CC
(Figure A), as they
contain some of the most common mutations relative to the canonical
11ntR and have been shown to fold outside of their natural context.[65] Variant 11ntR_AC differs from the canonical
11ntR only by the replacement of the canonical AA-platform with an
AC-platform, and has a nearly identical three-dimensional conformation
(Figure B and Figure S1). Variant 11ntR_ACC differs from the
canonical 11ntR by the replacement of the canonical G6·U7 by A6·C7 and replacement of the
bulging U9 by C9. Based on their position within
the TL/TLR (Figure A) and their high occurrence in group I intron sequence variants
(Figure C), these
mutations are not expected to substantially alter the three-dimensional
structure of the TL/TLR relative to the canonical GAAA/11ntR.Several additional 11ntR-like tetraloop-receptor variants have been
discovered through in vitro selection.[44−46] Of these, we chose C7.10, C7.2, and C7.9 for our studies (Figure A).[44] Previous studies showed that C7.10 and C7.2 bind to GAAA
tetraloops with affinities comparable to that of the canonical 11ntR
tetraloop-receptor within an engineered artificial dimer construct.[45] Nevertheless, they have not been found in natural
RNAs.[24,45,65] Variant C7.10
differs from the canonical 11ntR by the replacement of the canonical
AA-platform with a GU-platform. GAAA/11ntR and GAAA/C7.10 are expected
to form similar tertiary interactions and conformations as they can
both be inserted into the same structural scaffolds and have shown
identical patterns of discrimination for different tetraloop sequences.[45] A computationally predicted structure supported
by experimental chemical probing suggests that the tertiary interactions
and global conformations of GAAA/C7.2 and GAAA/11ntR are similar,
but that the insertion of an additional residue into the platform
of C7.2 alters the conformation of the residues around the platform
of these two motifs (Figure S2).[66] We also studied the in vitro-selected C7.9 tetraloop-receptor that had not previously been characterized
but differs in sequence relative to the canonical 11ntR only by the
insertion of a G residue between positions 3 and 4 relative to the
canonical 11ntR (Figure A). In total, we studied six tetraloop-receptor variants, three natural
(11ntR, 11ntR_AC, and 11ntR_ACC) and three in vitro-selected (C7.10, C7.2, and C7.9) (Figure A).
Measuring TL/TLR Docking Kinetics and Thermodynamics
by smFRET
The docking/undocking properties of the TL/TLR
variants were investigated
within the context of a minimal smFRET construct designed and used
previously by Nesbitt, Pardi, and co-workers to characterize the folding
properties of the canonical GAAA/11ntR.[38,40] In this construct
(Figure B), the GAAA
tetraloop is connected to its cognate tetraloop-receptor by a flexible
U7 linker and TL/TLR docking/undocking is monitored by
time-dependent fluctuations in FRET efficiency. Models with multiple
FRET states were evaluated and Bayesian statistical analysis indicated
that the FRET traces are best described by two FRET states, a low
FRET state corresponding to the undocked TL/TLR, and a high FRET state
corresponding to the docked TL/TLR (Table S3), consistent with previous observations for the canonical GAAA/11ntR.[37,40] Intermediates with too-short lifetimes or with FRET values only
slightly different from those of the docked and undocked states and
with similar lifetimes would not be distinguishable as additional
FRET states due to measurement limits and noise.Average low
and high FRET values were within error between the different TL/TLR
variants and nearly constant across all salt types and concentrations
investigated (Figure S3), suggesting similar
global architectures of the TL/TLRs within the resolution of smFRET.
Further, the absence of significant changes in the average FRET state
values across ionic conditions is consistent with two-state folding
behavior. Nevertheless, our results provide evidence for multiple
intermediates along the folding pathway that do not accumulate, as
described below.Rate constants for TL/TLR docking and undocking, kdock and kundock, respectively,
were extracted from the fluorescence traces of individual molecules
using hidden Markov models (HMMs) as described previously.[25,56,58,67] Representative traces and HMM fits at two different ionic conditions
for tetraloop-receptor variants 11ntR and C7.2 are shown in Figure A,B. For each construct
and condition, sample traces and complete data summaries are presented
in Supporting Information (Figures S9–S275).
The docking equilibrium constant (Kobs) for TL/TLR formation was determined from the ratio kdock/kundock and agreed with
values obtained from integration of the areas under cumulative FRET
distributions (Tables S4–S9). Our
measurements of kdock and kundock for the canonical GAAA/11ntR agreed within 2-fold
with those previously reported.[38,40]
Figure 3
smFRET reveals distinct
kinetic behaviors and Mg2+ dependences
of TL/TLR in the U7-tether minimal constructs (Figure ). Sample smFRET
traces of variants 11ntR (A) and C7.2 (B) at two Mg2+ concentrations.
Green and red traces are donor and acceptor intensities respectively
and black trace represents the probability of occupying the high FRET
state as determined by a two-state hidden Markov model (HMM). The
values of the fitted kdock and kundock for the sample traces are displayed above
each trace. Traces were truncated at 6 s for ease of comparison. Sample
traces for all variants and conditions are shown in Supporting Information (Figures S9–S275) and raw data
are available to download.[59] (C–D)
Median kdock and kundock values (large circles) for a population of individual
molecules (small circles) at a range of Mg2+ concentrations
(colors) for 11ntR (C) and C7.2 (D). For clarity, only a subset of
Mg2+ concentrations and 50 randomly selected molecules
at each [Mg2+] are shown. Typically more than 100 molecules
were measured at each Mg2+ concentration. The complete
population of molecules at 0.25 mM Mg2+ is shown in the
inset to better display the kinetic homogeneity. Data from measurements
carried out by single-photon counting are marked with an internal
“X”.
smFRET reveals distinct
kinetic behaviors and Mg2+ dependences
of TL/TLR in the U7-tether minimal constructs (Figure ). Sample smFRET
traces of variants 11ntR (A) and C7.2 (B) at two Mg2+ concentrations.
Green and red traces are donor and acceptor intensities respectively
and black trace represents the probability of occupying the high FRET
state as determined by a two-state hidden Markov model (HMM). The
values of the fitted kdock and kundock for the sample traces are displayed above
each trace. Traces were truncated at 6 s for ease of comparison. Sample
traces for all variants and conditions are shown in Supporting Information (Figures S9–S275) and raw data
are available to download.[59] (C–D)
Median kdock and kundock values (large circles) for a population of individual
molecules (small circles) at a range of Mg2+ concentrations
(colors) for 11ntR (C) and C7.2 (D). For clarity, only a subset of
Mg2+ concentrations and 50 randomly selected molecules
at each [Mg2+] are shown. Typically more than 100 molecules
were measured at each Mg2+ concentration. The complete
population of molecules at 0.25 mM Mg2+ is shown in the
inset to better display the kinetic homogeneity. Data from measurements
carried out by single-photon counting are marked with an internal
“X”.
Kinetic heterogeneity
in the folding and/or activity of structured
RNAs has been reported previously and can arise from the coexistence
of chemically identical RNA molecules with distinct conformations
based on the interconversion of kinetic properties of individual RNA
molecules.[68−70] All of our TL/TLR variants displayed homogeneous
kinetic behavior, with tightly clustered values of kdock and kundock under all
conditions, consistent with previous observations for the canonical
GAAA/11ntR.[25] The insets of Figure C,D show this behavior for
tetraloop-receptor variants 11ntR and C7.2 at a single solution condition
(0.25 mM Mg2+ and 140 mM K+), and analogous
clusters for each TL/TLR variant under each solution condition tested
are provided (Figures S9–S275).
The observed homogeneous kinetic behavior provides no indication of
conformational heterogeneity at the level of individual TL/TLR motifs
and allowed us to precisely determine average kdock and kundock values and their
dependences on salt concentration.Figure C,D for tetraloop-receptors 11ntR and C7.2,
respectively, illustrates these data and some of the differences in
kinetic behavior that were observed. For example, at 0.25 mM Mg2+ (red points) kdock was 6-fold
larger for 11ntR than for C7.2 (23.4 ± 0.6 s–1 vs 4.2 ± 0.3 s–1) and kundock was 10-fold smaller (6.5 ± 0.2 vs 65 ± 7 s–1). The Mg2+ dependence of the kinetic behavior
also clearly differed between TL/TLR variants, with kundock essentially constant across [Mg2+] for
tetraloop-receptor 11ntR but decreasing sharply for tetraloop-receptor
C7.2. These differences in docking kinetics and Mg2+ dependences
indicate differences in the folding pathway and electrostatic properties
of the TL/TLR variants and motivated us to more closely examine and
compare the kinetics, thermodynamics, and salt-dependence of the TL/TLR
variants with the objective of developing physical models for the
observed sequence effects on TL/TLR docking behavior and energetics.
Comparison of Docking Kinetics and Thermodynamics of the TL/TLR
Variants in Mg2+
The values of kdock, kundock, and Kobs were determined for each of the six TL/TLR
variants across a range of Mg2+ concentrations (Figure A–C). All kdock values
were slow (range of ∼3–300 s–1) relative
to an estimated rate of collision of ∼105–106 s–1 between the tetraloop and the tetraloop-receptor
based on measurements of end-to-end collisions in short single stranded
oligonucleotides analogous to the flexible U7 tether joining
the TL/TLR.[71] The relatively slow kdock values suggest that all six tetraloop-receptor
variants must undergo substantial unfavorable conformational reorganization
prior to the formation of the docking transition state, as suggested
previously for the 11ntR,[41] and differences
in the magnitude of kdock between variants
indicate sequence-dependent differences in this reorganization and/or
in the contacts that need to form prior to the docking transition
state. Increasing Mg2+ substantially increased kdock and did so similarly for all of the TL/TLR
variants as evidenced by the near-parallel slopes in Figure A. In contrast, the Mg2+ dependence of kundock was shallower
than that for kdock and exhibited larger
variations in slope between TL/TLR variants (Figure B). Undocking of the in vitro-selected tetraloop-receptors (C7.10, C7.2, and C7.9; purple, red,
and black symbols, respectively) exhibited stronger dependence on
Mg2+ concentration than the naturally occurring variants.Docking
kinetics, thermodynamics and Mg2+ dependence
of TL/TLR variants in U7-tether minimal construct. (A–C)
Median kdock, kundock, and Kobs values for each TL/TLR variant
at a range of Mg2+ concentrations. All measurements were
carried out in a background of 140 mM K+. Data and errors
are summarized in Table S4. Standard errors
calculated from bootstrapping were smaller than the size of the symbols.
Data points marked with an internal “X” were determined
using single-photon counting. Linear fits were determined by least-squares.
Mg2+ uptake prior to and after formation of the docking
transition state (D and E, respectively) and net Mg2+ uptake
(F) were obtained from the linear fits in panels A–C as given
by eq and eqs and 2B.Figure C shows
the docking equilibrium, Kobs, for each
variant across the range of Mg2+ concentrations. The variations
in slope translate into different relative stabilities at different
Mg2+ concentrations. Interestingly, the stabilities were
more similar between TL/TLR variants at higher Mg2+. Overall,
the different Mg2+-dependences of kdock, kundock and Kobs reveal complexities in the folding behavior of the
TL/TLR variants. Before exploring this further, we first lay out the
classic thermodynamic framework for the interpretation of Mg2+ uptake in the following section.
Interpreting Mg2+ Effects on Kobs, kdock, and kundock
RNA’s
high negative charge density
leads to the formation of a dynamic sheath of ions around the RNA,
referred to as the ion atmosphere, in which cations are accumulated
and anions are depleted relative to bulk solution.[72−76] Conformational changes, such as a folding transition,
result in ions being taken up or released from the ion atmosphere
to compensate for changes in the charge density of RNA (Figure ). According to a thermodynamic
framework described by Record, Draper and co-workers,[54,77] the relationship between Mg2+ uptake and the thermodynamic
equilibrium for an RNA conformational transition is given by the following:where ΔΓMg is the average number of Mg2+ ions taken into the ion
atmosphere as the RNA undergoes a conformational transition and Kobs is the apparent equilibrium constant for
that conformational transition. This framework makes no assumptions
about the nature of the RNA-Mg2+ interactions and therefore
cannot distinguish between nonspecific electrostatic interactions
and specific binding.[54,78] The physical interpretation of eq is valid when the Mg2+ titration is carried out in a high background concentration
of monovalent salt such that the total concentration of anions and
the activity coefficient of Mg2+ do not change significantly
as the Mg2+ salt is added. Previous studies have shown
that a ratio of 30:1 K+:Mg2+ is sufficient to
meet these requirements,[79] and our measurements
with Mg2+ concentrations ranging from 0.15 to 4.5 mM in
a background of 140 mM K+ (100 mM KCl and 40 mM K-MOPS)
meet these requirements. Thus, the slopes of the linear fits shown
in Figure C correspond
to the net number of Mg2+ ions taken up upon TL/TLR docking.
Figure 5
Physical
interpretation of the sensitivity of Kobs, kdock, and kundock to Mg2+. (A) Schematic of a model RNA
undergoing a folding transition from an extended unfolded (U) state,
through a compact transition state (‡), to a final folded (F)
state. Mg2+ is taken up to compensate for changes in charge
density as the RNA folds. To maintain charge neutrality, Mg2+ uptake must be accompanied by the uptake of anions and/or release
of K+ into ion atmosphere (not shown). The net number of
Mg2+ taken up (ΔΓMg) is the sum of Mg2+ taken up prior to (ΔΓd,Mg‡) and after (ΔΓu,Mg‡) formation of the transition
state (‡; ΔΓMg = ΔΓd,Mg‡ + ΔΓu,Mg‡). (B) Folding process in panel A is
represented in a free-energy diagram. (C) Mg2+ uptake of
TL/TLR variants in U7 smFRET construct. Overall bar gives
ΔΓMg and is divided into contributions
from ΔΓd,Mg‡ (filled) and ΔΓu,Mg‡ (open). Error bars are standard errors of ΔΓMg.
Physical
interpretation of the sensitivity of Kobs, kdock, and kundock to Mg2+. (A) Schematic of a model RNA
undergoing a folding transition from an extended unfolded (U) state,
through a compact transition state (‡), to a final folded (F)
state. Mg2+ is taken up to compensate for changes in charge
density as the RNA folds. To maintain charge neutrality, Mg2+ uptake must be accompanied by the uptake of anions and/or release
of K+ into ion atmosphere (not shown). The net number of
Mg2+ taken up (ΔΓMg) is the sum of Mg2+ taken up prior to (ΔΓd,Mg‡) and after (ΔΓu,Mg‡) formation of the transition
state (‡; ΔΓMg = ΔΓd,Mg‡ + ΔΓu,Mg‡). (B) Folding process in panel A is
represented in a free-energy diagram. (C) Mg2+ uptake of
TL/TLR variants in U7 smFRET construct. Overall bar gives
ΔΓMg and is divided into contributions
from ΔΓd,Mg‡ (filled) and ΔΓu,Mg‡ (open). Error bars are standard errors of ΔΓMg.By measuring the Mg2+ dependence of the docking and
undocking rate constants the net uptake of Mg2+ can be
broken down into Mg2+ taken up prior to and after the formation
of the docking transition state using the following relationships:where ΔΓd,Mg‡ is the average
number of Mg2+ ions taken up prior to the formation of
the docking transition state and ΔΓu,Mg‡ is the
average number of Mg2+ ions taken up after the formation
of the docking transition state as illustrated schematically in Figure A.[80,81] The values of ΔΓd,Mg‡ and ΔΓu,Mg‡ are given by the slopes of the linear fits shown in Figure A,B, respectively. For simplicity,
RNA is represented as a static molecule and folding is depicted as
a simple two-state process with an integer number Mg2+ taken
up in Figure A. In
reality, the statistical nature of the ion atmosphere and RNA conformational
ensembles typically leads to a noninteger Mg2+ uptake.[82]The linearity displayed in Figure A–C for all TL/TLR variants
indicate that the
number of Mg2+ ions taken up is constant across the range
of Mg2+ concentrations investigated and suggests that for
a given TL/TLR variant the docking process involves essentially the
same undocked, transition, and docked states across Mg2+ concentrations. Figure D–F summarizes the values of ΔΓMg, ΔΓd,Mg‡, and ΔΓu,Mg‡ obtained from the linear fits in Figure A–C.Differences in Mg2+ uptake between TL/TLR variants suggest
differences in the electrostatic component of their folding energy
landscape, as shown schematically in Figure B and discussed below.
Using Ba2+ to Test for Specific Ion Effects
The nature of
the interactions between RNA and cations is diverse,
ranging from long-range nonspecific electrostatic interactions to
specific interactions in which the cation is chelated by RNA ligands.[76] In some cases it is possible to infer ion-specific
biding sites in RNA by measuring folding or catalytic activity in
the presence of ions of different size or charge.[74,83] To learn more about the specificity of the Mg2+ effects
observed above we measured docking kinetics of the TL/TLR variants
in the presence of Ba2+. Due to the differences in size
and coordination preferences of Mg2+ and Ba2+, M2+-specific binding sites are expected to typically
discriminate between these two divalent cations. For example, the
metal core of the P4–P6 domain of the Tetrahymena group I intron discriminates between Mg2+ and Ba2+ by 2 orders of magnitude.[84]Differences in kdock and kundock between Mg2+ and Ba2+ (Figure ) were small for
all variants (less than 3-fold), suggesting that the observed M2+ effects on kdock and kundock are largely due to nonspecific electrostatic
interactions between the RNA and the divalent cations. The largely
nonspecific nature of M2+ interactions with the TL/TLRs
is consistent with the observation that the TL/TLR variants fold in
the absence of divalent cations when enough M+ is present,
as described below. However, small consistent differences between
Mg2+ and Ba2+ that depended on the sequence
of the TL/TLR variant were observed. For example, whereas Ba2+ decreased kundock relative to Mg2+ in variants 11ntR, 11ntR_AC, and 11ntR_A/CC by up to 3-fold,
it increased kundock for variants C7.10
and C7.2 by 2-fold, and had no effect on kundock for variant C7.9. These small idiosyncratic effects on the magnitude
and M2+ concentration dependence of the docking kinetics
suggest that differences in conformation and electrostatic potential
between TL/TLR variants lead to small but measurable discrimination
between divalent cations that differ in size and solvation properties.
Figure 6
Comparison
of TL/TLR docking kinetics in Mg2+ and Ba2+ reveals
weak but consistent sequence-dependent discrimination
for different types of divalent cations. Median kdock (A) and kundock (B) values
over a range of Mg2+ (filled symbols) and Ba2+ (open symbols) for each TL/TLR variant. Mg2+ values are
reproduced from Figure . Data and errors are summarized in Tables S4 and S5. Standard errors calculated from bootstrapping were
smaller than the size of the symbols. Linear fits were determined
by least-squares. All measurements were carried out in a background
of 140 mM K+. Data collected with single-photon counting
setup are marked with and internal “X”.
Comparison
of TL/TLR docking kinetics in Mg2+ and Ba2+ reveals
weak but consistent sequence-dependent discrimination
for different types of divalent cations. Median kdock (A) and kundock (B) values
over a range of Mg2+ (filled symbols) and Ba2+ (open symbols) for each TL/TLR variant. Mg2+ values are
reproduced from Figure . Data and errors are summarized in Tables S4 and S5. Standard errors calculated from bootstrapping were
smaller than the size of the symbols. Linear fits were determined
by least-squares. All measurements were carried out in a background
of 140 mM K+. Data collected with single-photon counting
setup are marked with and internal “X”.
Conformational Preferences and Electrostatic
Properties of Connecting
Tether Produce a Common, Separable Effect on TL/TLR Variants
We measured the docking kinetics for four of the TL/TLR variants
with the tetraloop and tetraloop-receptor components connected by
both an A7 and U7 tether (Figure ). Consistent with prior results,[37,38,40] the elements connecting the tetraloop
and the tetraloop-receptor did not significantly alter kundock or its Mg2+ dependence (Figures B, Undocking). Thus, the ratio kundockU7/kundockA7 is essentially unity across the range of
Mg2+ concentrations investigated (Figure D). In contrast, there were differences in kdock between the U7 and A7 tethers (Figure A, Docking). Nevertheless, the effect from the U7 versus
A7 tether was identical for all of the variants at each
Mg2+ concentration, as demonstrated by the superimposable kundockU7/kundockA7 ratios for all four TL/TLRs at each Mg2+ concentration (Figure C). Thus, the differences in kdock and its Mg2+ dependence between U7 and A7 constructs are intrinsic to the conformational
preferences and Mg2+ dependence of the tether itself and
separable from the energetics and distinct Mg2+ dependences
of the TL/TLR variants.
Figure 7
Conformational preferences and Mg2+-dependence of connecting
tether produces a common kinetic and electrostatic effect on TL/TLR
docking. (A,B) Median kdock (A) and kundock (B) values of four TL/TLR variants as
a function of Mg2+ in a background of 140 mM K+ with an A7 (circles) or U7 (triangles) tether
connecting the tertiary contact partners. Data and errors are summarized
in Tables S4 and S6. Standard errors calculated
from bootstrapping were smaller than the size of the symbols. Data
collected with single-photon counting setup are marked with an internal
“X”. Linear fits were determined by least-squares. (C–D)
Ratio of the kdock (C) and kundock (D) values with the U7 and A7 tethers. Open symbols were obtained from interpolation using the
linear fits in panels A and B. Dashed lines at a constant value of
one are shown for reference.
Conformational preferences and Mg2+-dependence of connecting
tether produces a common kinetic and electrostatic effect on TL/TLR
docking. (A,B) Median kdock (A) and kundock (B) values of four TL/TLR variants as
a function of Mg2+ in a background of 140 mM K+ with an A7 (circles) or U7 (triangles) tether
connecting the tertiary contact partners. Data and errors are summarized
in Tables S4 and S6. Standard errors calculated
from bootstrapping were smaller than the size of the symbols. Data
collected with single-photon counting setup are marked with an internal
“X”. Linear fits were determined by least-squares. (C–D)
Ratio of the kdock (C) and kundock (D) values with the U7 and A7 tethers. Open symbols were obtained from interpolation using the
linear fits in panels A and B. Dashed lines at a constant value of
one are shown for reference.The slower docking with the A7 relative to the
U7 tether (Figure A) is consistent with the less frequent end-to-end collisions
and
the higher persistence length observed for poly-A oligonucleotides
relative to poly-U.[71,85,86] Presumably some of the base-stacking interactions between A bases
in the A7 tether break prior to the docking transition
state, or their presence alters the conformational landscape to make
collisions less probable. In contrast, weaker base-stacking interactions
between U bases within the U7 tether may not contribute
significantly to the conformational preferences of the tether and,
hence, to the activation energy for the formation of the compact docking
transition state.The common tether effects across TL/TLR sequence
variants at each
Mg2+ concentration (Figure C,D), contrasted with the idiosyncratic Mg2+-dependence of the TL/TLR variants with a given tether (Figure A–C), indicate
that the contribution to folding from the conformational search of
the tether is independent from the intrinsic energetics of the TL/TLR.
These results are consistent with the model that RNA motifs behave
as separable modules. This observation and previous studies[25,31] underscore the possibility of developing a reconstitution model
for RNA folding from the isolated properties of its junctions, helices,
and tertiary motifs (see Discussion).
Comparison
of Docking Kinetics and Thermodynamics of the TL/TLR
Variants in Monovalent Salts
To further investigate the electrostatic
energy landscape of the TL/TLR variants, we explored the docking kinetics
and thermodynamics of the TL/TLR variants across a range of K+, Na+, and Rb+ concentrations (Figure and Figures S4 and S5). Previous studies provided
evidence for a K+ binding site below the AA-platform of
the canonical GAAA/11ntR tertiary motif that has a significant thermodynamic
effect,[87−89] and other sequence-specific cation binding sites,
whether inner or outer sphere, could form within the grooves and crevices
of the TL/TLR variants and affect the kinetics and thermodynamics
of TL/TLR formation.
Figure 8
Docking kinetics and thermodynamics of TL/TLR variants
in KCl.
To account for nonideal electrolyte interactions, the data are plotted
as a function of the mean activity (a±) of the salt solution. (A–C) Median kdock, kundock, and Kobs values for each of the TL/TLR variants over a range
of KCl concentrations. Coloring scheme is shown in panel A. Data and
errors are summarized in Table S7. Standard
errors determined from bootstrapping were smaller than the size of
the symbols. Data marked with an internal “X” were determined
by single-photon counting. Linear fits were determined by least-squares.
Docking kinetics and thermodynamics of TL/TLR variants
in KCl.
To account for nonideal electrolyte interactions, the data are plotted
as a function of the mean activity (a±) of the salt solution. (A–C) Median kdock, kundock, and Kobs values for each of the TL/TLR variants over a range
of KCl concentrations. Coloring scheme is shown in panel A. Data and
errors are summarized in Table S7. Standard
errors determined from bootstrapping were smaller than the size of
the symbols. Data marked with an internal “X” were determined
by single-photon counting. Linear fits were determined by least-squares.The six TL/TLR variants exhibited
qualitatively similar behavior
in the presence of K+ as observed with Mg2+,
i.e., increasing kdock, decreasing kundock, and higher stability with increasing
K+ concentration. As observed in Mg2+, the 11ntR
and 11ntR_A/CC variants docked with greater kdock values compared to other variants over the measured salt
concentration range, followed closely by variant C7.9 (Figure A). Interestingly, these three
tetraloop-receptor variants have a pair of adjacent A residues in
their asymmetric loops that can form AA-platforms, whereas the other
variants have alternative sequences in those positions (Figure A); as discussed below, organization
of the dinucleotide platform may influence the docking rate of TL/TLR
motifs. The undocking behavior of the TL/TLR variants was also qualitatively
similar in K+ and Mg2+. For example, 11ntR displayed
relatively low kundock values (<10
s–1) over the entire salt concentration range, whereas
C7.9 displayed kundock values that exceeded
100 s–1 over the entire concentration range (Figure B). Also as in Mg2+, the natural TL/TLRs were more stable than the in
vitro-selected variants in K+ (Figure C).Despite these similarities,
there are differences between TL/TLRs
in K+ and Mg2+ that reveal a complex relationship
between the sequence of the tetraloop-receptor and the relative effects
of divalent and monovalent cations. For example, relative to K+ only, addition of 1 mM Mg2+ accelerates docking
for all TL/TLR variants but it does so to slightly different extents
for different TL/TLR variants (Figure A). Notably, relative to K+, Mg2+ produces a stronger effect on the kdock values of variants 11ntR_AC (green) and C7.10 (purple) compared
to other variants, and these two variants also had higher kdock values in Mg2+ relative to Ba2+ (Figure ). These observations are consistent with small but measurable specific
interactions between Mg2+ and these two tetraloop-receptor
variants prior to the formation of the docking transition state.
Figure 9
Relative
effects of Mg2+ vs K+ on the docking
kinetics and thermodynamics of the TL/TLR variants. kdock (A), kundock (B), and Kobs (C) in 140 mM K+ were compared
with and without 1 mM Mg2+. (kdock)rel is defined as kdock at
140 mM K+ and 1 mM Mg2+ divided by kdock at 140 mM K+; (kundock)rel and (Kobs)rel are similarly defined. Values for variants that were
too weak to measure in 140 mM K+ (a±,KCl = 103 mM) were extrapolated from the linear fits
in Figure .
Relative
effects of Mg2+ vs K+ on the docking
kinetics and thermodynamics of the TL/TLR variants. kdock (A), kundock (B), and Kobs (C) in 140 mM K+ were compared
with and without 1 mM Mg2+. (kdock)rel is defined as kdock at
140 mM K+ and 1 mM Mg2+ divided by kdock at 140 mM K+; (kundock)rel and (Kobs)rel are similarly defined. Values for variants that were
too weak to measure in 140 mM K+ (a±,KCl = 103 mM) were extrapolated from the linear fits
in Figure .Similarly, there were sequence
dependent differences in the relative
effect of Mg2+ and K+ in kundock (Figure B). For example, whereas kundock was
essentially the same in K+ and Mg2+ for variant
11ntR, in vitro-selected variants C7.10, C7.2, and
C7.9 displayed much slower undocking in the presence of Mg2+. Corresponding to these effects on kdock and kundock, there were ∼10-fold
differences in the relative effect of Mg2+ vs K+ on the stability of the TL/TLR variants (Figure C).The differential effects of Mg2+ and K+ suggest
differences in the electrostatic potential of the variants that may
arise from local conformational differences between them. For example,
conformations which place phosphate groups in close proximity may
create high charge density “pockets” that favor Mg2+ uptake over K+ more than conformations without
such electrostatic pockets.[78,90] The modest sequence-dependent
differences in divalent (Figure ) and monovalent (Figure S5) cation identity effects suggest that differences in the electrostatic
potentials of the variants or in the positions of functional groups
can discriminate slightly for the size or the solvation properties
of the cations.
Discussion
The growing database
of RNA structures has revealed that complex
RNAs are commonly built from a set of recurring structural motifs
that largely preserve their three-dimensional structure irrespective
of the context in which they are embedded.[26−30,61,91] From an energetic perspective, recent studies have shown that the
canonical GAAA/11ntR tertiary motif undocks with the same energetic
barrier, i.e., has the same rate constant, when embedded in RNAs of
varying complexity.[25] These and additional
observations suggest that kinetic and thermodynamic properties of
isolated motifs may be applicable across structured RNAs containing
those motifs, thereby providing a means to generalize and predict
RNA folding behavior.[24,25,31,92] Below, we describe insights from our in-depth
investigation of TL/TLR variants into the order of rearrangement in
the folding pathway of TL/TLR docking, implications for the potential
to quantitatively predict RNA folding kinetics and thermodynamics,
and insights into the basis of Nature’s choice of TL/TLR variants.
Dissecting
the Multistep Folding Pathway of TL/TLRs
RNA folding studies
to date have led to the suggestion that tertiary
contact formation generally involves early transition states.[93] For example, ablation of individual tertiary
interactions in the hairpin ribozyme increased kundock without significant effects on kdock, suggesting that these contacts form late in the folding
process.[80] For GAAA/11ntR docking, a near-zero
enthalpy change in going from the undocked state to the transition
state and a substantial negative enthalpy change after the transition
state provided support for the late formation of tertiary contacts
in the overall folding of this RNA tertiary motif.[40] Tertiary folding of P4–P6 RNA, an independently
folding component from the Tetrahymena group I intron,
was also suggested to have an early transition state, although the
reaction step or steps that were reported on by the attached fluorescent
dye could not be ascertained.[94]Notwithstanding
the evidence for the late formation of tertiary contacts, RNA tertiary
folding events are much slower than diffusional collision, strongly
suggesting a need for substantial rearrangements preceding the formation
of the transition state that would likely involve the breaking and
formation of interactions.[95]For
GAAA/11ntR docking, the bimolecular association rate constant
(kassoc) is 5 orders of magnitude slower
than diffusional collision.[95] Indeed, NMR
and X-ray crystal structures reveal that the undocked and docked conformations
of the 11ntR tetraloop-receptor differ substantially, whereas the
GAAA tetraloop appears to be unchanged (Figure A).[43,96] Thus, a reasonable
model invokes contributions to the observed slow docking from some
of these conformational rearrangements prior to the formation of the
docking transition state. Other tetraloop-receptor variants docked
with similar or lower kdock values compared
to 11ntR (Figures and 8), suggesting similar unfavorable conformational
rearrangements prior to the attainment of the docking transition state.
To test this model and to ascertain which rearrangements might occur
prior to the docking transition state, we compared the docking kinetics
of these tetraloop-receptor variants along with three 11ntR single-point
mutants studied previously (Figure B).[31]
Figure 10
Conformational rearrangements
of canonical 11ntR tetraloop-receptor
and kinetic effect of sequence variations relative to 11ntR. (A) Schematic
of the experimentally determined conformations of the undocked (left)
and docked (right) canonical 11ntR tetraloop-receptor.[24,47,96] The GAAA tetraloop does not undergo
significant conformational changes upon docking.[24,30] For the 11ntR tetraloop-receptor, in the undocked state A4, A8, and A5 form a series of stacking interactions
(A-zipper) that break upon formation of the AA-platform present in
the docked state.[96] Base-stacking and hydrogen-bond
interactions in the undocked state that need to break prior to attaining
the docked state are colored in red, and interactions that form in
the docked state are colored in green. Residues that were mutated
to U in a previous study[31] are colored
in purple. kdock (B) and kundock (C) of variants relative to the canonical 11ntR
at 1 mM Mg2+ and 140 mM K+ (except for single
point mutants marked with ‘*’, which were measured previously[31] in 1 mM Ba2+ and 140 mM K+; measurements across a range of [M2+] suggest that TL/TLRs
display only minor kinetic differences in Mg2+ vs Ba2+ (Figure )). The sequences of the residues that comprise the putative platforms
are shown in small boxes on the top of left panel. Variants that are
identical to the 11ntR except for a single mutation to the AA-platform
are marked with a red star.
Conformational rearrangements
of canonical 11ntR tetraloop-receptor
and kinetic effect of sequence variations relative to 11ntR. (A) Schematic
of the experimentally determined conformations of the undocked (left)
and docked (right) canonical 11ntR tetraloop-receptor.[24,47,96] The GAAA tetraloop does not undergo
significant conformational changes upon docking.[24,30] For the 11ntR tetraloop-receptor, in the undocked state A4, A8, and A5 form a series of stacking interactions
(A-zipper) that break upon formation of the AA-platform present in
the docked state.[96] Base-stacking and hydrogen-bond
interactions in the undocked state that need to break prior to attaining
the docked state are colored in red, and interactions that form in
the docked state are colored in green. Residues that were mutated
to U in a previous study[31] are colored
in purple. kdock (B) and kundock (C) of variants relative to the canonical 11ntR
at 1 mM Mg2+ and 140 mM K+ (except for single
point mutants marked with ‘*’, which were measured previously[31] in 1 mM Ba2+ and 140 mM K+; measurements across a range of [M2+] suggest that TL/TLRs
display only minor kinetic differences in Mg2+ vs Ba2+ (Figure )). The sequences of the residues that comprise the putative platforms
are shown in small boxes on the top of left panel. Variants that are
identical to the 11ntR except for a single mutation to the AA-platform
are marked with a red star.If residues rearrange or make interactions prior to attainment
of the docking transition state, then mutations of those residues
are predicted to have an effect on kdock. Conversely, mutations to residues that participate in rearrangements
or make interactions only after the docking transition state are predicted
to affect kundock and to have no significant
effect on kdock. Despite the limitations
in the interpretation of this type of mutational effects (i.e., Φ-value
analysis as developed for protein folding) that have been discussed
previously,[97,98] the results herein for TL/TLR
docking suggest a single dominant pathway and provide evidence for
events that occur early and late in this pathway. For example, had
multiple disparate pathways been present, any single mutation would
be expected to affect only one or a few of these pathways and give
a small rate effect. Instead, our kinetic comparisons between tetraloop-receptor
variants reveal substantial effects and provide evidence for substantial
rearrangements of the residues that constitute the dinucleotide platform
prior to attaining the docking transition state and for formation
of tertiary hydrogen bonds after the formation of the docking transition
state. A model summarizing these results is presented in Figure and described
below and in the next section.
Figure 11
Schematic model for TL/TLR folding for
the GAAA/11ntR. The canonical
11ntR is shown starting with a schematic of the experimentally determined
unbound form of the tetraloop-receptor and ending with the know structure
of the docked GAAA/11ntR.[47,96] The features of this
pathway are supported by mutational and ionic effects described in
the text. Nevertheless, this model simplifies the many rearrangements
that must occur in folding, and the experimental data distinguish
only between effects that occur prior and subsequent to the rate-limiting
transition state (‡). The U·U base pair in the unbound
state is expected to be weak[43] and in the
simplest model it breaks to allow rearrangement of the A residues
to form the dinucleotide platform.
Schematic model for TL/TLR folding for
the GAAA/11ntR. The canonical
11ntR is shown starting with a schematic of the experimentally determined
unbound form of the tetraloop-receptor and ending with the know structure
of the docked GAAA/11ntR.[47,96] The features of this
pathway are supported by mutational and ionic effects described in
the text. Nevertheless, this model simplifies the many rearrangements
that must occur in folding, and the experimental data distinguish
only between effects that occur prior and subsequent to the rate-limiting
transition state (‡). The U·U base pair in the unbound
state is expected to be weak[43] and in the
simplest model it breaks to allow rearrangement of the A residues
to form the dinucleotide platform.Relative to the canonical 11ntR, single point mutations to
residues
that make up the AA-platform substantially reduce kdock (Figure B, red stars). Tetraloop-receptor C7.10 and C7.2 which have
GU and GUA platforms respectively, also docked more slowly than 11ntR
(Figure B, hashed
purple and red). In contrast, variants 11ntR_A/CC, C7.9, and C2U that
each have two adjacent A residues capable of forming AA-platforms
exhibited kdock values within 2–3-fold
of the canonical 11ntR value (Figure B, filled orange, hashed black, and open black). These
results suggest that rearrangements and interactions that involve
the dinucleotide platform occur, at least in part, prior to the docking
transition state. These rearrangements presumably involve breaking
base-stacking and hydrogen bond interactions between residues of the
tetraloop-receptor to expose the residues that form the dinucleotide
platform and allow docking to occur (e.g., A-zipper in the undocked
conformation of 11ntR, Figure A). The previous observation of near-zero net enthalpy
change in the transition from the undocked to the transition state
of the GAAA/11ntR[40] most simply suggests
that the magnitude of the enthalpy changes of broken and newly formed
interactions prior to the formation of the docking transition state
are comparable. The proposed rearrangements of the dinucleotide platform
of the 11ntR (i.e., AA-platform) prior to the docking transition state
are illustrated in Step 1 of the model in Figure .Mutations to the dinucleotide platform
relative to the canonical
11ntR also increased kundock, albeit modestly
(2–8-fold), and these effects could arise from further reorganization
of or additional interactions with the dinucleotide platform after
the attainment of the docking transition state (Figure , Step 4). Nevertheless, effects
from mutations at other positions had larger effects on kundock.We consider these larger effects on kundock in terms of the last step in the folding
model of Figure (Step 5). Mutation C2U is
predicted to affect the network of hydrogen bonds (i.e., A-minor motif[61]) between the GAAA tetraloop and the C2–G10 base pair in the docked state of the GAAA/11ntR
(Figure A). Relative
to the canonical 11ntR, this mutation had a large effect on kundock but a negligible effect on kdock, suggesting that the hydrogen bond network to the
wild-type C2–G10 base pair forms after
the transition state in the folding pathway (Figure , Step 5). Late formation of the tertiary
hydrogen bonds between the tetraloop and the tetraloop-receptor is
also consistent with the previous finding of a large favorable enthalpy
change after the docking transition state.[40]Additional information can be obtained from the insertion
of a
G residue between residues 3 and 4 relative to 11ntR, in variant C7.9,
which is expected to alter the relative position of the U3·A8 noncanonical base pair and the AA-platform (Figure A). As the U3·A8 and the AA-platform both make tertiary
interactions with the GAAA tetraloop, the ∼50-fold increase
in kundock with only ∼2.5-fold
effect on kdock from the insertion of
G between U3·A8 and the AA-platform suggest
that the tertiary interactions at one or both of these sites form
or rearrange subsequent to the docking transition state, consistent
with the order of events in the folding model of Figure .In summary, our kinetic
analysis at near-physiological ionic conditions
in combination with previous studies of the canonical GAAA/11ntR suggests
a dominant pathway for the formation of the GAAA/11ntR in which reorganization
of the dinucleotide platform occurs early in the folding pathway,
before the formation of the docking transition state, followed by
additional rearrangements and formation of tertiary hydrogen bonds
after the docking transition state. For simplicity, we depict formation
of the tertiary hydrogen bond network involving the conserved C2–G10 as the final step in docking (Figure , Step 5). As described
in the next section, differential ionic effects provide evidence for
additional rearrangements prior to and subsequent to the formation
of the docking transition state.
Common and Specific Electrostatic
Effects in the TL/TLR Folding
Pathway
In a seminal study, the rate constant for docking
of the hairpin ribozyme was shown to depend strongly on the concentration
of Mg2+, whereas the rate constant for undocking was largely
independent of [Mg2+].[80] These
observations, along with electrostatic modeling, led to a simple model
in which Mg2+ is taken up as the helices containing the
tertiary contact partners come together to form a compact transition
state, with the charge density of the RNA not changing significantly
in subsequent folding steps. Previous studies of the isolated GAAA/11ntR
showed similar Mg2+ effects, i.e., a strong dependence
of docking on Mg2+ and little or no dependence of undocking
on Mg2+, suggesting that this simple model might be broadly
applicable in RNA folding.[37,40]Our systematic
comparative analysis of multiple TL/TLR variants under a range of
ionic conditions has allowed us to test and extend this model. As
described below, our results indicate a more complex electrostatic
landscape in which Mg2+ uptake occurs at multiple steps
along the folding pathway; nevertheless, contributions from electrostatics
and ion interactions can be assigned to separate energetic terms within
a framework that ultimately may allow quantitative prediction of RNA
folding kinetics and thermodynamics.[25,31]Figure C summarizes
the average number of Mg2+ ions taken up prior to (ΔΓd,Mg‡) and subsequent to (ΔΓu,Mg‡) the docking transition
state, and the overall Mg2+ uptake (ΔΓMg) for each of the TL/TLR variants studied herein.For all variants, a large fraction of the total Mg2+ uptake occurs prior to the formation of the docking transition state,
with similar values of ΔΓd,Mg‡ (1.02 ± 0.16)
between TL/TLR variants (Figure C, filled). This observation suggests a common large
change in charge density as the tertiary contact partners come together
to form a compact transition state (Figure , Step 2) and is consistent with the previous
studies of the hairpin ribozyme and the canonical GAAA/11ntR mentioned
above.[37,40,80] Nevertheless,
we observe small but significant differences in ΔΓd,Mg‡ values between variants (Figure C, filled) and small sequence-dependent differences
in the relative effects of distinct divalent (Figure ) and monovalent (Figure S6) cations on kdock. These observed
sequence-dependent effects suggest that, in addition to compaction,
local conformational rearrangements prior to the transition state
alter the charge density of the TL/TLR and result in cation uptake
(Figure , Steps
1 and 3).Idiosyncratic effects are even more evident in Mg2+ uptake
subsequent to the formation of the docking transition state (0.40
± 0.40, Figure C, open bars). In contrast to early results that suggested minimal
charge density changes after the formation of the docking transition
state, our analysis revealed the presence of such effects for most
of the sequence variants. These effects are caused by the combined
effect of local conformational rearrangements, alignment of the tertiary
contact partners and ion binding that occurs after the formation of
the docking transition state (Figure , Step 4).Despite sequence dependent differences
in ionic effects before
and after the formation of the docking transition state, substituting
the U7 tether with A7 gave a uniform effect
on kdock and no effect on kundock (Figure ), suggesting that it will be possible to consider the electrostatics
of junctions and compaction separately from the intrinsic electrostatics
and ion-binding of the tertiary contacts. In essence, this separability
represents the core of the Reconstitution Hypothesis for RNA folding,
which posits that the folding of a complex RNA can be understood from
the isolated behavior of its constituent helices, junctions, and tertiary
structural motifs.[31] The findings herein
provide additional evidence in support of the Reconstitution Hypothesis,
and indicate that additional sequence-dependent electrostatic effects
will need to be incorporated in the Reconstitution model.
Correspondence
between Thermodynamic Stability and Biological
Occurrence of Tetraloop-Receptors
The in vitro-selected variants studied herein are not observed in natural structured
RNAs, even though the selection experiments that yielded these variants
also produced the naturally occurring 11ntR variants and these variants
were shown to have similar thermodynamic stability.[24,44,65] As the in vitro selection
and prior thermodynamic measurements were carried out at high Mg2+ (15–100 mM) and at low K+ (0–50
mM) relative to physiological salt conditions,[44,45] we compared the thermodynamic stability of the TL/TLR variants at
salt conditions closer to physiological.Our measurements with
the minimal smFRET TL/TLR construct support the prior observations
of similar thermodynamic stability of natural and in vitro-selected variants at high Mg2+ concentrations (Figure A, left); however,
under salt concentrations that approximate physiological conditions
(1 mM Mg2+; 140 mM K+), the naturally occurring
canonical 11ntR was >10-fold more stable than the in vitro-selected variants (Figure A, right). Under these conditions, two natural variants that
are less common in natural RNAs than the canonical 11ntR (11nR_AC
and 11ntR_A/CC) were also less stable than the 11ntR, though still
more stable than the tetraloop-receptors not observed in Nature. Similarly,
the three previously studied 11nR single-point mutants that have not
been observed in Nature are also highly destabilizing (Figure A, open bars). Overall, there
appears to be a relationship between the thermodynamic stability of
a tetraloop-receptor variant and its usage in natural structured RNAs,
as shown in our analysis of group I introns (Figure B).
Figure 12
Thermodynamic stability of TL/TLR variants
in isolated smFRET construct
at near physiological ionic conditions correlates with observed frequency
in group I introns. (A) Stability of TL/TLR variants in smFRET construct
relative to the canonical 11ntR at high and near-physiological Mg2+ concentrations (values with “*” were measured
previously[31] in Ba2+). Values
at 20 mM Mg2+ (left) for TL/TLR variants studied here were
extrapolated from linear fits in Figure C. All measurements were carried out in a
background of 140 mM K+. (B) Relative stability of TL/TLR
variants in 1 mM Mg2+ and 140 mM K+ (except
for variants with “*” in figure legend which were measured
at 1 mM Ba2+ and 140 mM K+ previously[31]) plotted against their observed frequency in
the Group I Intron Sequence and Structure Database (GISSD).[50]
Thermodynamic stability of TL/TLR variants
in isolated smFRET construct
at near physiological ionic conditions correlates with observed frequency
in group I introns. (A) Stability of TL/TLR variants in smFRET construct
relative to the canonical 11ntR at high and near-physiological Mg2+ concentrations (values with “*” were measured
previously[31] in Ba2+). Values
at 20 mM Mg2+ (left) for TL/TLR variants studied here were
extrapolated from linear fits in Figure C. All measurements were carried out in a
background of 140 mM K+. (B) Relative stability of TL/TLR
variants in 1 mM Mg2+ and 140 mM K+ (except
for variants with “*” in figure legend which were measured
at 1 mM Ba2+ and 140 mM K+ previously[31]) plotted against their observed frequency in
the Group I Intron Sequence and Structure Database (GISSD).[50]Though thermodynamic stability can account for the relative
usage
of the 11ntR-like TL/TLRs (Figure B), other types of TL/TLR motifs that are commonly
found in natural RNA sequences, such as GNRA tetraloops docking into
tandem Watson–Crick base pairs, are thermodynamically much
weaker than the naturally occurring TL/TLRs described herein.[24] These distinct TL/TLRs may be used by Nature
to favor different tertiary orientations, to facilitate conformational
rearrangements, and in cases where there is sufficient stability from
other structural elements.
Authors: Kathryn D Smith; Sarah V Lipchock; Tyler D Ames; Jimin Wang; Ronald R Breaker; Scott A Strobel Journal: Nat Struct Mol Biol Date: 2009-11-08 Impact factor: 15.369
Authors: Xuesong Shi; Namita Bisaria; Tara L Benz-Moy; Steve Bonilla; Dmitri S Pavlichin; Daniel Herschlag Journal: J Am Chem Soc Date: 2014-04-28 Impact factor: 15.419
Authors: Steve L Bonilla; Sarah K Denny; John H Shin; Aurora Alvarez-Buylla; William J Greenleaf; Daniel Herschlag Journal: Proc Natl Acad Sci U S A Date: 2021-08-17 Impact factor: 11.205
Authors: Thomas Zettl; Xuesong Shi; Steve Bonilla; Steffen M Sedlak; Jan Lipfert; Daniel Herschlag Journal: Nucleic Acids Res Date: 2020-08-20 Impact factor: 16.971