Travis Walton1, Jack W Szostak1. 1. Howard Hughes Medical Institute, Department of Molecular Biology, and Center for Computational and Integrative Biology, Massachusetts General Hospital , Boston, Massachusetts 02114, United States.
Abstract
The nonenzymatic polymerization of RNA may have enabled copying of functional sequences during the origin of life. Recent progress utilizing 5'-phosphoro-2-aminoimidazole activation has reinvigorated the possibility of using nonenzymatic RNA polymerization for copying arbitrary sequences. However, the reasons why 2-aminoimidazole (AI) is a superior activation group remain unclear. Here we report that the predominant mechanism of polymerization using cytidine-5'-phosphoro-2-aminoimidazolide (Cp*) involves a 2-aminoimidazolium-bridged dinucleotide (Cp*pC) intermediate. To explore the role of this intermediate, we first identify and quantify four reactions involving the synthesis and breakdown of Cp*pC that occur in the absence of the primer-template duplex. We then analyze the dependence of the rate of polymerization on the concentration of the Cp*pC intermediate in the presence and absence of the competitive inhibitor Cp. We also show that the contribution of the monomer Cp* to the polymerization rate is negligible under our primer extension conditions. Finally, we use the experimentally determined rate constants of these reactions to develop a kinetic model that helps explain the changing rate of nonenzymatic RNA polymerization over time. Our model accounts for the concentration of Cp*pC formed by Cp* under primer extension conditions. The model does not completely account for the decline in polymerization rate observed over long times, which indicates that additional important inhibitory processes have not yet been identified. Our results suggest that the superiority of 2-aminoimidazole over the traditional 2-methylimidazole activation is mostly due to the higher level of accumulation of the imidazolium-bridged intermediate under primer extension conditions.
The nonenzymatic polymerization of RNA may have enabled copying of functional sequences during the origin of life. Recent progress utilizing 5'-phosphoro-2-aminoimidazole activation has reinvigorated the possibility of using nonenzymatic RNA polymerization for copying arbitrary sequences. However, the reasons why 2-aminoimidazole (AI) is a superior activation group remain unclear. Here we report that the predominant mechanism of polymerization using cytidine-5'-phosphoro-2-aminoimidazolide (Cp*) involves a 2-aminoimidazolium-bridged dinucleotide (Cp*pC) intermediate. To explore the role of this intermediate, we first identify and quantify four reactions involving the synthesis and breakdown of Cp*pC that occur in the absence of the primer-template duplex. We then analyze the dependence of the rate of polymerization on the concentration of the Cp*pC intermediate in the presence and absence of the competitive inhibitor Cp. We also show that the contribution of the monomer Cp* to the polymerization rate is negligible under our primer extension conditions. Finally, we use the experimentally determined rate constants of these reactions to develop a kinetic model that helps explain the changing rate of nonenzymatic RNA polymerization over time. Our model accounts for the concentration of Cp*pC formed by Cp* under primer extension conditions. The model does not completely account for the decline in polymerization rate observed over long times, which indicates that additional important inhibitory processes have not yet been identified. Our results suggest that the superiority of 2-aminoimidazole over the traditional 2-methylimidazole activation is mostly due to the higher level of accumulation of the imidazolium-bridged intermediate under primer extension conditions.
The RNA world
hypothesis proposes
that early stages of life may have involved the self-replication of
RNA oligonucleotides. However, experimental demonstration of RNA self-replication
has been extremely difficult, leading some to abandon this hypothesis
and suggest alternative scenarios.[1,2] Both the lack
of regioselectivity and the limited template generality of nonenzymatic
RNA polymerization have been cited as major obstacles to the RNA world.
In addition, the prebiotic formation of the canonical RNA nucleosides
through glycosylation of ribose is very inefficient, which has raised
questions about the availability of RNA in prebiotic environments.[3]Recent work has revived the hypothesis
of a primarily RNA-based
foundation to the origin of life. First, the synthesis of both ribonucleotides
and amino acids from cyanide and other simple molecular precursors
offers a plausible set of prebiotic chemical reactions leading to
both RNA and peptides.[4] Second, potential
solutions to several steps in the nonenzymatic replication of RNA
have been proposed. For example, replacing uridine with 2-thiouridine
increases the rate of template-directed nonenzymatic polymerization
by increasing the affinity of monomers for the template.[5,6] In addition, the reannealing of a large RNA duplex has been greatly
slowed by increased solvent viscosity and RNA secondary structure,
allowing information transfer.[7]Nevertheless,
the one-pot synthesis of a functional RNA sequence
through template-directed nonenzymatic polymerization has not been
achieved and remains a critical goal for establishing a physicochemical
approach to primitive genetic inheritance. Recent progress in nonenzymatic
template-directed RNA polymerization has utilized RNA monomers combined
with downstream oligomers, all activated with a substituted imidazole
group on the 5′-phosphate.[8,9] This system
iteratively extends the 3′ end of a primer to synthesize a
complementary strand, even over difficult mixed-template sequences.
Furthermore, the increased length and yield of primer extension reactions
have been observed using 2-aminoimidazole instead of 2-methylimidazole
activation.[9]Continued improvement
of nonenzymatic RNA polymerization is likely
to benefit from a detailed understanding of the chemical mechanism.
Our recent report on the kinetics of primer extension by 2-methylimidazole-activated
monomers suggested that the first step of the mechanism involves the
formation of a 2-methylimidazolium-bridged dinucleotide.[10] Upon subsequent binding to the template, this
dimer intermediate reacts with the 3′-hydroxyl of the primer,
extending it by one nucleotide and releasing an activated monomer
as the leaving group. This mechanism contrasts with a previous proposal
that two monomers bind the template such that the downstream monomer
catalyzes primer extension through noncovalent interactions with the
upstream monomer.[8,11] Recent thermodynamic studies
have cast doubt on the latter model because the affinity of the monomer
in the downstream (+2) position is relatively low.[12] In addition, crystallographic studies of monomers bound
to a template revealed a variety of conformations, some of which are
not productive for polymerization reactions.[13] In contrast, structural studies of GpppG, a stable analogue of the
intermediate, suggest that the imidazolium bridge may help to preorganize
the reaction center for polymerization.[14] However, the relative contribution of each of these two mechanisms
to the rate of primer extension has not yet been clarified.Kinetic analysis has been a powerful tool for understanding the
reaction mechanism of primer extension systems. For instance, kinetic
studies have elucidated the mechanism of 2-MeImpG hydrolysis, a simplified
model of template-directed polymerization.[15,16] In addition, two earlier kinetic models identified key determinants
of the yield and rate in nonenzymatic RNA polymerization that depend
on the particular system studied. Kanavarioti et al.[17,18] account for the synthesis of poly-G catalyzed by poly-C through
the quantification of three reactions: monomer hydrolysis, off-template
oligomerization, and template-directed primer extension. Also, Kervio
et al.[19] have shown that the hydrolysis
of 1-hydroxybenzotriazole-activated monomers and competitive inhibition
by inactivated monomers are the main limitations for primer extension
in that system. However, neither of these models can likely be applied
to primer extension by nucleotides activated by 2-aminoimidazole because
they do not account for the potential kinetic effects of the imidazolium-bridged
dinucleotide intermediate.Here, we present a kinetic model
of nonenzymatic RNA polymerization
by cytidine-5′-phosphoro-2-aminoimidazolide (Cp*) (Chart ). Our results show
that the predominant mechanism of polymerization involves a dinucleotide
intermediate in which the two 5′-phosphates are bridged by
a 2-aminoimidazolium moiety (Cp*pC). To understand the factors that
control the concentration of the intermediate, we studied the kinetics
of four reactions that affect Cp*pC levels. We first examined the
formation of Cp*pC and 2-aminoimidazole (AI) from Cp* under primer
extension conditions by 31P nuclear magnetic resonance
(NMR), as well as the reverse process in which AI reacts with Cp*pC
to form Cp*. We also measured the rates of hydrolysis of the intermediate
Cp*pC and the monomer Cp*. To understand the relationship between
the concentration of Cp*pC and nonenzymatic polymerization, we analyzed
the rate of primer extension by Cp*pC and competitive inhibition by
cytidine 5′-monophosphate (Cp) using Michaelis–Menten
kinetics. We combined our empirical rate constants into a kinetic
model that relates a series of off-template reactions, including synthesis
and decay of Cp*pC, to template-directed primer extension. Our model
accounts for the long-term behavior of the off-template reactions
and for observations of primer extension rate over the first 2 h.
At longer times, additional uncharacterized effects lead to a gradual
slowing of the rate of primer extension. Our results suggest that
the improved polymerization yield of 2-amino- versus 2-methylimidazole-activated
monomers is likely due to the higher level of accumulation of the
intermediate under primer extension conditions.
Chart 1
Chemical Structure
and Cartoon Representation of the Molecules Investigated
in This Study
Materials and Methods
All reagents were purchased from Sigma-Aldrich unless specified.
2-Aminoimidazole hemisulfate was purchased from Combi-Blocks, Inc.
All other exceptions are specified in our previous report.[10] All syntheses involving the attachment of imidazole
to the 5′-phosphate have been previously reported.[9,10]With the following exceptions, all nonenzymatic primer extension
reactions were performed as previously described.[10] For all reactions, the template concentration is 3 μM
and the primer concentration is 2 μM. Except where noted below,
primer extension occurs in 90 mM MgCl2 and 90 mM Tris (pH
8.3–8.4). In Figure b, the reactions including 45 mM Cp used 160 mM Tris to improve
pH buffering. Also, in Figure e, primer extension occurs in 100 mM MgCl2 and
100 mM Tris (pH 8.3–8.4).
Figure 5
Cp competitively inhibits primer extension by Cp*pC. (a) During
the primer extension reaction, Cp*pC binds the template with an affinity
related to the KM. The Cp and Cp* monomers
also bind the template and competitively inhibit the primer extension
reaction. (b) Michaelis–Menten plot of Cp*pC with 0–45
mM Cp. Lines represent the Michaelis–Menten equation evaluated
with empirical determinations of kobsmax and effective KM. Error bars indicate
±SD (n = 3).
Figure 6
Primer extension
in the presence of Cp* is largely due to the formation
of Cp*pC. (a) Pseudo-first-order plot of primer extension initiated
by the Cp* stock at pH 9.6. (b) Schematic illustrating that addition
of AI favors Cp* at the expense of Cp*pC. (c) Gel image of nonenzymatic
RNA primer extension with 20 mM Cp* and no additional AI. Samples
are at 3, 10, 30, 60, 120, and 180 min. (d) Gel image of nonenzymatic
RNA primer extension with 20 mM Cp* and 100 mM AI. (e) Pseudo-first-order
plot of nonenzymatic RNA primer extension with various concentrations
of AI. The observed values of rate constant k1h are recorded in the key. Error bars indicate ±SD (n = 3).
All NMR spectra were recorded
on a Varian INOVA 400 MHz NMR spectrometer
at 25 °C. For all kinetic analyses, samples were prepared in
H2O, and a coaxial insert containing D2O was
used for locking. This eliminates possible solvent isotope effects.
All peak assignments were confirmed by addition of standards. For
characterization, samples were prepared in D2O, and a coaxial
insert was not used. Peaks are referenced to internal trimethyl phosphate
(δ 0.00) for 31P NMR, internal acetone (δ 30.89)
for 13C NMR, and HOD (δ 4.79) for 1H NMR.
NMR spectra of Cp*pC characterization are included (Figure S1).Cp*pC: 1H NMR (400 MHz) δ
7.66 (d, J = 7.6 Hz, 1H), 6.96 (m, 1H), 6.03 (d, J = 7.5 Hz,
1H), 5.85 (d, J = 3.2 Hz, 1H), 4.12 (m, 5H); 31P NMR (161 MHz) δ −12.84 (s); 13C
NMR (100 MHz) δ 166.58 (s), 157.90 (s), 150.91 (t, J = 7.1 Hz), 141.62 (s), 116.83 (dd, J = 3.7, 6.7
Hz), 96.77 (s), 90.73 (s), 82.19 (d, J = 8.5 Hz),
74.68 (s), 69.49 (s), 66.16 (d, J = 5.8 Hz); calcd m/z −692.12, observed m/z −692.1.All NMR spectra were analyzed
and quantified using MestReNova software.
All kinetic analyses were performed using Microsoft Excel and Prism
7. Parameter fitting of the off-template reactions was performed using
code written in R. All values are presented as means ± the standard
deviation unless specified otherwise.
Results and Discussion
Cp* Monomers
Self-React To Form Cp*pC in Primer Extension Buffer
We began
our analysis of Cp* by determining whether dinucleotide
intermediate Cp*pC can form in primer extension buffer (Figure a). On the basis of previous
studies of 2-MeImpG, we expected that the Mg2+ concentration
and pH would likely affect our observations.[15] Recent studies of nonenzymatic RNA polymerization have used 50–200
mM MgCl2 and pH 8–9.[8−10] For these experiments,
we used 100 mM MgCl2 and 100 mM Tris (pH 8.3–8.4)
as the primer extension buffer.
Figure 1
Measuring the reaction of two Cp* monomers
to form AI and Cp*pC.
(a) Schematic of the reaction being measured for calculation of rate
constant k1. (b) 31P NMR spectra
of 24 mM Cp* incubated in primer extension buffer over time. Peaks
at −11.38 ppm correspond to Cp* and −12.86 ppm to Cp*pC.
(c) Analysis of the 31P NMR spectra at 24 mM Cp* in triplicate
by a second-order rate plot. Error bars indicate ±SD (n = 3). The slope is equal to 2k1.
Measuring the reaction of two Cp* monomers
to form AI and Cp*pC.
(a) Schematic of the reaction being measured for calculation of rate
constant k1. (b) 31P NMR spectra
of 24 mM Cp* incubated in primer extension buffer over time. Peaks
at −11.38 ppm correspond to Cp* and −12.86 ppm to Cp*pC.
(c) Analysis of the 31P NMR spectra at 24 mM Cp* in triplicate
by a second-order rate plot. Error bars indicate ±SD (n = 3). The slope is equal to 2k1.We used 31P NMR to
observe the formation of Cp*pC from
Cp* in primer extension buffer (Figure a). Because incubation of monomers near the pKa of the imidazole group is known to promote
formation of the intermediate, solutions of Cp* were kept at pH ∼10
to prevent the reaction from occurring before addition to the primer
extension buffer. Incubation of 24 mM Cp* in primer extension buffer
resulted in the time-dependent appearance of a new peak in 31P NMR spectra (Figure b). The new peak corresponds to intermediate Cp*pC, as confirmed
by the addition of a synthetically prepared standard. We did not observe
significant formation of Cp during the course of the experiment, suggesting
that Cp*pC was not immediately hydrolyzing.Next, we determined
rate constant k1 for the reaction of two
Cp* monomers to form Cp*pC and AI in primer
extension buffer. Samples were prepared in H2O to avoid
potential solvent isotope effects and concentrations measured by integration
of 31P NMR peaks. The formation of AI was not directly
observed because AI does not contain a phosphorus atom. We assumed
that the observed kinetics would be entirely due to the reaction of
two Cp* molecules to form Cp*pC. First, we confirmed that the formation
of Cp*pC does indeed follow second-order kinetics by measuring the
initial rate of Cp*pC synthesis at four initial concentrations of
Cp*. As expected, the initial rate of Cp*pC synthesis increased with
the square of the initial concentration of Cp* (Figure S2). Using the 12 experiments with initial Cp* concentrations
from 15 to 40 mM, we calculated the second-order rate constant (k1) to be [4.49 ± 0.47 (standard deviation)]
× 10–3 h–1 mM–1 and to range from 3.65 to 5.22 × 10–3 h–1 mM–1 (Figure c and Figure S3).
AI Reacts with Cp*pC To Form Two Molecules of Cp*
Having
determined that intermediate Cp*pC can form in primer extension buffer,
we began to consider additional reactions that would affect the concentration
of the intermediate during a primer extension experiment. We first
examined the reverse of its synthetic reaction, i.e., the nucleophilic
attack of AI on Cp*pC to generate two Cp* monomers (Figure a). This reaction has not been
previously described or measured.
Figure 2
Cp*pC reacts with AI to form two molecules
of Cp*. (a) Schematic
of the reaction measured to determine k2. (b) 31P NMR spectra of 5 mM Cp*pC and 11.5 mM AI incubated
in primer extension buffer. The peaks at −11.41 ppm correspond
to monomer Cp* and −12.86 ppm to dinucleotide intermediate
Cp*pC. (c) Analysis of the reaction between 5 mM Cp*pC and 11.5 mM
AI in a second-order kinetic plot. In the plot, k2 is equal to the slope divided by [AI]0 –
[Cp*pC]0. Error bars indicate ±SD (n = 3).
Cp*pC reacts with AI to form two molecules
of Cp*. (a) Schematic
of the reaction measured to determine k2. (b) 31P NMR spectra of 5 mM Cp*pC and 11.5 mM AI incubated
in primer extension buffer. The peaks at −11.41 ppm correspond
to monomer Cp* and −12.86 ppm to dinucleotide intermediate
Cp*pC. (c) Analysis of the reaction between 5 mM Cp*pC and 11.5 mM
AI in a second-order kinetic plot. In the plot, k2 is equal to the slope divided by [AI]0 –
[Cp*pC]0. Error bars indicate ±SD (n = 3).To follow the reaction of AI with
Cp*pC, 11.5 mM AI was incubated
with 5 mM Cp*pC in primer extension buffer, and the progress of the
reaction was monitored by 31P NMR spectroscopy (Figure b). We observed that
the magnitude of the peak for Cp*pC rapidly decreased and that the
magnitude of the peak for Cp* increased over time. In addition, the
concentration of Cp did not significantly change during the course
of the reaction. This indicates that the increase in the level of
Cp* is not due to hydrolysis of Cp*pC. Instead, these results indicate
that AI reacts with Cp*pC to form two molecules of Cp*.We measured
the rate of the reaction between AI and Cp*pC at five
different concentrations to determine the reaction order. On the basis
of the reaction order, the kinetics should be overall second-order
for this reaction, with first-order for both AI and Cp*pC. A reaction
order of ∼0.8 was determined for both AI and Cp*pC (Figure S4). We suspect that the empirical determinations
of the reaction order might be slight underestimations because of
the technical difficulty of measuring the fast rate of the reaction.
The 15 experiments with varying concentrations of 5.75–17.25
mM AI and 2.5–7.5 mM Cp*pC were analyzed using second-order
kinetic plots to calculate a k2 of 0.238
± 0.020 (SD) h–1 mM–1 with
a range of 0.190–0.271 h–1 mM–1 (Figure c and Figure S5).
Cp*pC Hydrolyzes to Cp*
and Cp
In addition to reacting
with AI, we expected that Cp*pC would also decay through hydrolysis
to form Cp and Cp* (Figure a). This reaction has been previously observed for the 2-methylimidazolium-bridged
dinucleotide.[10] To measure this reaction, 31P NMR spectra of 5 mM Cp*pC in primer extension buffer were
recorded every 31 min for 4.1 h (Figure b). In the absence of free AI, the observed
changes in concentration during this time frame should be entirely
due to hydrolysis of Cp*pC. As expected, the intermediate decayed
to approximately equal amounts of Cp and Cp*, consistent with hydrolysis
of Cp*pC (Figure S6). We calculated a pseudo-first-order
rate constant of intermediate hydrolysis [k3 = 0.171 ± 0.006 h–1 (Figure c)]. This value corresponds to a half-life
of 4.06 ± 0.15 h.
Figure 3
Intermediate Cp*pC hydrolyzes to Cp* and Cp. (a) Schematic
of the
reaction measured to determine k3. (b) 31P NMR spectrum of 5 mM Cp*pC incubated in primer extension
buffer for 3.9 h. The peak at 0.28 ppm corresponds to Cp, that at
−11.41 ppm to Cp*, and that at −12.86 ppm to Cp*pC.
(c) Quantification of hydrolysis in a first-order kinetic plot. The
negative slope is equal to k3 = 0.171
± 0.006 h–1. Error bars indicate ±SD (n = 3).
Intermediate Cp*pC hydrolyzes to Cp* and Cp. (a) Schematic
of the
reaction measured to determine k3. (b) 31P NMR spectrum of 5 mM Cp*pC incubated in primer extension
buffer for 3.9 h. The peak at 0.28 ppm corresponds to Cp, that at
−11.41 ppm to Cp*, and that at −12.86 ppm to Cp*pC.
(c) Quantification of hydrolysis in a first-order kinetic plot. The
negative slope is equal to k3 = 0.171
± 0.006 h–1. Error bars indicate ±SD (n = 3).As a comparison to the
traditional 2-methylimidazole activation,
the hydrolysis rate of the 2-methylimidazolium-bridged dicytidine
intermediate was also measured under similar reaction conditions (Figure S7). We used a partially purified 2-methylimidazolium
intermediate as described in our previous report.[10] The observed rate constant of hydrolysis for the 2-methylimidazolium
intermediate was determined to be 4.40 ± 0.07 h–1. This value is 26 times higher than the hydrolysis rate constant
of 2-aminoimidazolium Cp*pC, indicating that 2-aminoimidazolium Cp*pC
is much more stable than the 2-methylimidazolium intermediate.
The Cp*
Monomer Slowly Hydrolyzes to Cp and AI
We also
measured the rate of hydrolysis of Cp* in primer extension buffer
(Figure a). Measuring
the hydrolysis rate of Cp* was complex because Cp* also self-reacts
to form Cp*pC (Figure ), which can then hydrolyze to Cp, the product of Cp* hydrolysis.
To minimize the concentration of Cp*pC, the hydrolysis of Cp* in primer
extension buffer was measured at a series of concentrations of AI.
We hypothesized that the excess AI would react with Cp*pC to form
two Cp* molecules (Figure ), thereby reducing the Cp*pC concentration without affecting
Cp* hydrolysis.
Figure 4
Monomer Cp* decays slowly to Cp and AI. (a) Schematic
of the reaction
measured to determine k4. (b) 31P NMR spectrum of 5 mM Cp* and 20 mM AI incubated in primer extension
buffer for 72 h. The peak at 0.29 ppm corresponds to Cp and that at
−11.39 ppm to Cp*. The peak at −4.92 ppm corresponds
to cyclic cytidine 3′,5′-monophosphate as verified by
spike-in. (c) Hydrolysis of 5 mM Cp* in 40 mM AI quantified by 31P NMR spectra in a first-order rate plot. The negative slope
is equal to k4 = (3.33 ± 0.03) ×
10–3 h–1. Error bars indicate
±SD (n = 3).
Monomer Cp* decays slowly to Cp and AI. (a) Schematic
of the reaction
measured to determine k4. (b) 31P NMR spectrum of 5 mM Cp* and 20 mM AI incubated in primer extension
buffer for 72 h. The peak at 0.29 ppm corresponds to Cp and that at
−11.39 ppm to Cp*. The peak at −4.92 ppm corresponds
to cyclic cytidine 3′,5′-monophosphate as verified by
spike-in. (c) Hydrolysis of 5 mM Cp* in 40 mM AI quantified by 31P NMR spectra in a first-order rate plot. The negative slope
is equal to k4 = (3.33 ± 0.03) ×
10–3 h–1. Error bars indicate
±SD (n = 3).We began our analysis of the hydrolysis of Cp* by recording 31P NMR spectra of 5 mM Cp* and 20 mM AI over 5 days. Cp* gradually
decayed into Cp as well as trace amounts of other products that were
detected on the third day (Figure b). A peak corresponding to Cp*pC was not observed,
suggesting that Cp formed directly by hydrolysis of Cp*. In the presence
of 20 mM AI, we calculated the observed Cp* hydrolysis rate constant
[k4 = (4.04 ± 0.16) × 10–3 h–1].Given that k3 is ∼40 times greater
than k4, our results could easily be affected
by trace levels of Cp*pC. We repeated our experiment using 40 mM AI
and 5 mM Cp* in primer extension buffer. This time we observed that k4 = (3.33 ± 0.03) × 10–3 h–1 (Figure c), suggesting that our previous results overestimated k4 because of trace formation of Cp*pC. Again,
our experiments were repeated using 60 mM AI. However, a precipitate
was observed on the third day of the experiment, and these data were
not analyzed. Overall, these results place an upper limit on the value
of k4 and suggest that the half-life of
Cp* is >8.7 days because of hydrolysis in primer extension buffer.
In addition, we observed that the rate constant of 2-MeImpC hydrolysis
equals (2.93 ± 0.26) × 10–3 h–1 under comparable conditions (Figure S8).
Cp*pC Is a Substrate for Primer Extension
Because Cp*pC
both forms and decays in primer extension buffer, we began investigating
how the concentration of Cp*pC affects the rate of primer extension.
This relationship can be approximated through Michaelis–Menten
kinetics because of the separate binding and reaction steps that occur
on the primer–template complex. The primer–template
complex is present at very low concentrations relative to those of
substrates, which show saturation binding.[19−21] In addition,
Michaelis–Menten kinetics offers a framework for studying the
effect of competitive inhibition on the polymerization rate (Figure a).Cp competitively inhibits primer extension by Cp*pC. (a) During
the primer extension reaction, Cp*pC binds the template with an affinity
related to the KM. The Cp and Cp* monomers
also bind the template and competitively inhibit the primer extension
reaction. (b) Michaelis–Menten plot of Cp*pC with 0–45
mM Cp. Lines represent the Michaelis–Menten equation evaluated
with empirical determinations of kobsmax and effective KM. Error bars indicate
±SD (n = 3).We determined the rate of primer extension using five concentrations
of Cp*pC from 0.5 to 20 mM. We observed that the pseudo-first-order
rate constant of polymerization during the first 3 min, kobs, increased with the concentration of the intermediate
(Figure S9). kobs plateaus at Cp*pC concentrations above 5 mM, suggesting that the
template becomes saturated by Cp*pC binding (Figure b, red line). We analyzed these data using
a double-reciprocal plot and substituted Vmax with kobsmax for the purpose of comparison.
For primer extension by Cp*pC in the absence of competitive inhibition,
we observed that kobsmax = 19.5 ±
2.1 h–1 and KM = 1.06
± 0.12 mM.Having determined that Cp*pC is a substrate
of the primer extension
reaction, we next analyzed how the monomer competitively inhibits
primer extension by Cp*pC. We repeated our Michaelis–Menten
analysis of Cp*pC in the presence of 15–45 mM Cp and observed
increasingly lower rates for this set of reactions compared to the
set with 0 mM Cp (Figure b). Overall, this set of data is consistent with competitive
inhibition as analyzed by a double-reciprocal plot (Figure S10). As expected, the effective KM increases with Cp concentration. On the basis of these
values, we calculated that the Ki of Cp
inhibition of primer extension by Cp*pC is 24.7 ± 11.6 mM. This
value agrees with the Kd of 15–19
mM of Cp measured for a different RNA duplex.[20] In addition, this value is consistent with the Kd of 19 mM for Cp and the Kd of 27 mM for 2-MeImpC previously determined for an RNA hairpin.[21]We also note that our calculated kobsmax slightly decreases at higher concentrations
of Cp, suggesting possible
mixed inhibition. However, we cannot fully exclude potential nonspecific
effects of adding Cp, such as changing the ionic strength of the primer
extension reaction. In total, the data indicate that kobsmax = 17.6 ± 1.6 h–1, which
is within error of our values obtained with both 0 mM Cp and 45 mM
Cp.
Cp* Does Not Discernibly Contribute to the Primer Extension
Rate
We next sought to determine the rate of primer extension
by Cp* directly reacting with the primer. The interpretation of these
experiments is complicated by the fact that Cp* forms Cp*pC under
primer extension conditions (Figure ). However, we have quantified Cp*pC formation and
the rate of primer extension by Cp*pC via competitive inhibition (Figure ). Therefore, we
can account for the rate of primer extension due to Cp*pC and then
determine the rate due to Cp*, including possible noncovalent interactions
between template-bound Cp*. We decided on a final Cp* concentration
of 30 mM for these experiments to be above the Kd of the monomer, but not too high to limit Cp*pC formation
at short time intervals.The primer extension assay was initiated
by adding a 45 mM Cp* stock at pH 9.6 to a reaction mix for final
concentrations of 30 mM Cp*, 90 mM Tris, 90 mM MgCl2, 2
μM primer, and 3 μM template, identical to our analysis
of Cp*pC. We observed the reaction over the first 8 min, during which
time the reaction rate noticeably increased, likely because of the
formation of Cp*pC (Figure a). A similar
phenomenon has been reported for primer extension by 2-MeImpG.[10] Because of this “speed-up” effect,
the calculated rate of primer extension is only approximated by pseudo-first-order
kinetic plots. For the first 3 min of the primer extension time course,
we observed that the initial rate constant of primer extension kobs = 0.93 ± 0.11 h–1. In comparison, kobs = 2.6 ± 0.5
h–1 for the last 3 min of this time course.Primer extension
in the presence of Cp* is largely due to the formation
of Cp*pC. (a) Pseudo-first-order plot of primer extension initiated
by the Cp* stock at pH 9.6. (b) Schematic illustrating that addition
of AI favors Cp* at the expense of Cp*pC. (c) Gel image of nonenzymatic
RNA primer extension with 20 mM Cp* and no additional AI. Samples
are at 3, 10, 30, 60, 120, and 180 min. (d) Gel image of nonenzymatic
RNA primer extension with 20 mM Cp* and 100 mM AI. (e) Pseudo-first-order
plot of nonenzymatic RNA primer extension with various concentrations
of AI. The observed values of rate constant k1h are recorded in the key. Error bars indicate ±SD (n = 3).To account for the primer
extension rate due to Cp*pC, we first
calculated the concentration of Cp*pC using the previously determined k1. On the basis of this value, we expect an
average concentration of 65 ± 3 μM Cp*pC over the first
3 min of the primer extension reaction. By using the KM and Ki values previously
obtained and propagating the errors, we approximate that the kobs due to Cp*pC is ≈0.47 ± 0.13
h–1. This calculation suggests that part of the
observed rate may be due to Cp* directly extending the primer. However,
a low initial concentration of Cp*pC in our Cp* stock at pH 9.6 could
explain the discrepancy. We repeated our calculations for the 6–8
min interval and computed a kobs of 2.0
± 0.5 h–1, which is within error of the observed
value.We sought another approach to directly measure primer
extension
by Cp* by reducing the Cp*pC concentration. Previously, excess AI
was used to decrease the concentration of Cp*pC in Cp* solutions when
studying hydrolysis (Figures and 6b). We adopted a similar approach
by adding 0–100 mM AI to a series of primer extension reaction
mixtures containing a final Cp* concentration of 20 mM. Because of
the low rates, we observed the primer extension reactions for 3 h.
In addition, we used 100 mM MgCl2 and 100 mM Tris.Addition of AI greatly inhibited the primer extension reaction
(Figure c,d). For
this set of experiments, we compared a pseudo-first-order rate constant
calculated from the first hour of the primer extension reaction, k1h. For 0 mM AI, we observed that k1h = 2.70 h–1. At 100 mM AI, we observed
that k1h = 0.38 h–1.
Notably, the rate markedly declines between 50 and 100 mM AI, suggesting
that our observed rate constant is an upper limit to the polymerization
rate of Cp*.From these data, we extrapolated the primer extension
rate to infinitely
high concentrations of AI, or when the Cp*pC concentration is zero.
For the purpose of this calculation, we assumed that Cp* is in equilibrium
with Cp*pC and AI (Figure b). To estimate the equilibrium constant, we noted that Keq = k1/k2 = 0.019 ± 0.004 (Figure S11). Because Keq = [Cp*pC][AI][Cp*]−2, [AI] is reciprocal to [Cp*pC], and this equation
can be used to approximate the concentration of Cp*pC when [AI] is
varied. The k1h values were plotted versus
the approximated [Cp*pC] values in a Michaelis–Menten plot
(Figure S12). We then analyzed this plot
using nonlinear regression to a Michaelis–Menten equation modified
with a constant term to represent the possible reaction rate due to
20 mM Cp*. We observed that the y-intercept, when
[Cp*pC] is zero, is equal to −0.286 ± 0.159 h–1 standard error. This negative value of k1h is likely due to an overestimation of [Cp*pC] from the Keq value, because Cp*pC also hydrolyzes. However, our
analysis suggests that the contribution of Cp* to the primer extension
rate is indistinguishable from zero under these conditions.
Kinetic
Modeling of the Primer Extension Reaction
We
sought to combine our kinetic data into a kinetic model to develop
and test our understanding of the primer extension reaction. The model
needed to explain both the concentration of the intermediate formed
over time under primer extension conditions and how this affects the
rate of polymerization. We reasoned that solutions of Cp* would undergo
a series of off-template reactions involving Cp*pC, AI, and Cp (Scheme ). Only intermediate
Cp*pC binds the template and subsequently reacts with the primer.
As observed in our analysis of inhibition by AI, the reaction of Cp*
with the primer is too slow to significantly affect the rate of polymerization
(Figure ). Finally,
we assume that the concentrations of AI, Cp, Cp*, and Cp*pC are not
greatly affected by the primer–template duplex on short time
scales.
Scheme 1
Kinetic Model of Primer Extension by Cp*
On the left, the off-template
reactions form and destroy Cp* and Cp*pC. Cp is formed as a hydrolysis
product. On the right, the primer is extended by only Cp*pC. Primer
extension is also competitively inhibited by Cp* and Cp.
Kinetic Model of Primer Extension by Cp*
On the left, the off-template
reactions form and destroy Cp* and Cp*pC. Cp is formed as a hydrolysis
product. On the right, the primer is extended by only Cp*pC. Primer
extension is also competitively inhibited by Cp* and Cp.We modeled the four off-template reactions using four
kinetic rate
equations to describe how the concentration of each molecule will
change over time.In these equations, rate constants k1–k4 correspond to the reactions
studied by 31P NMR (Figures –4 and Table ). By iteratively solving these
equations following initial conditions, we can model the concentrations
of all four molecules over time.
Table 1
Off-Template Reaction
Rate Constants
± SD Observed Experimentally (Figures –4) or Computationally
Fit to 24 mM Cp* Solutions (Figure )
observed
fitted
k1 (h–1 mM–1)
(4.49 ± 0.47) × 10–3
(4.55 ± 0.48) × 10–3
k2 (h–1 mM–1)
0.238 ± 0.020
0.183 ± 0.026
k3 (h–1)
0.171 ± 0.006
0.167 ± 0.012
k4 (h–1)
(3.33 ± 0.03) × 10–3
(2.11 ± 0.84) × 10–3
We modeled the rate of the primer extension reaction based on our
Michaelis–Menten kinetic analysis of primer extension by Cp*pC
(Figure ). The rate
constant of polymerization can be calculated using the relationshipwhere kobsmax, KM, and Ki are empirically determined
constants (Figure ). These equations combine
the Michaelis–Menten equation with competitive inhibition by
the monomers, Cp and Cp*. Under our reaction conditions, both Cp and
Cp* are assumed to bind the template with approximately equal affinity
to competitively inhibit primer extension. Using the calculated concentrations
of Cp, Cp*, and Cp*pC (eqs –4) to evaluate kcalc (eqs and 6), we can model how the polymerization
rate of nonenzymatic primer extension changes over time due to the
off-template reactions.
Analyzing Long-Term Behavior of Off-Template
Reactions with
the Kinetic Model
We first tested our kinetic model of the
off-template reactions against experimental data obtained by 31P NMR (Figure ). Beginning with 24 mM Cp* in primer extension
buffer, we observed the concentrations of Cp, Cp*, and Cp*pC every
7 min and 44 s for 12.5 h. Because AI cannot be observed by 31P NMR, we inferred that the concentration of AI is equal to the concentration
of Cp plus Cp*pC.
Figure 7
Kinetic model describing the long-term behavior of off-template
reactions in 24 mM Cp* solutions in primer extension buffer. Concentrations
over time for experimentally observed (blue), empirical model (red),
and fitted model (green) are presented for (a) Cp, (b) Cp*, (c) Cp*pC,
and (d) AI. Concentrations were experimentally determined using 31P NMR. Modeled concentrations were iteratively calculated
using eqs –4 and the rate constants listed in Table .
Kinetic model describing the long-term behavior of off-template
reactions in 24 mM Cp* solutions in primer extension buffer. Concentrations
over time for experimentally observed (blue), empirical model (red),
and fitted model (green) are presented for (a) Cp, (b) Cp*, (c) Cp*pC,
and (d) AI. Concentrations were experimentally determined using 31P NMR. Modeled concentrations were iteratively calculated
using eqs –4 and the rate constants listed in Table .Our experimental observations of off-template reactions displayed
dynamic changes in concentration for some of the molecules. For Cp,
the concentration steadily increased over time from 0.2 to 4.0 mM
over 12.5 h (Figure a, blue line). In contrast, the concentration of Cp* rapidly decreased
from 22.4 to 18.0 mM over the first 2 h and then slowly decreased
to 15.9 mM over the next 10.5 h (Figure b, blue line). For Cp*pC, we observed a sharp
increase over the first 2 h from 0.4 to 2.2 mM (Figure c, blue line). At this peak concentration,
Cp*pC accounts for 18% of the cytidine present in the mixture. Over
the next 10.5 h, the concentration of Cp*pC gradually decreases to
1.2 mM. Finally, the AI concentration was inferred to sharply increase
during the first 2 h from 0.7 to 3.1 mM and then gradually increase
to 5.2 mM by the end of the NMR series (Figure d, blue line). In addition, we also observed
four peaks from trace products (Figure S13).We compared these data to our kinetic model initialized
with the
concentrations observed in the first spectrum of the NMR series and
a step size of 1 min for dt in our calculations.
Overall, we observed similar trends between our model and the experimental
observations of the four molecules (Figure , red lines). For Cp*, Cp*pC, and AI, the
model predicts a 2 h initial phase of rapid concentration change,
followed by a second phase of gradual changes. However, there are
important discrepancies between the model and the experimental data.
Most notable is the 14% difference between the calculated and observed
concentrations of Cp at the end of the time course (Figure a). This may be due in part
to the 12% underestimation of Cp*pC at its peak concentration near
2 h (Figure c). In
addition, a 1.9 mM difference develops between the model and the observed
concentration of Cp* (Figure b). Part of this discrepancy is likely due to the formation
of trace materials that amount to ∼1.8 mM by 12.5 h. Future
studies of the off-template reactions should account for the formation
of these trace materials.Given the differences between the
model and the observed concentrations,
we computationally fit the off-template k values
to this experimental data set. Our approach randomly varied k values within ±20% for 100 iterations and then computed
the concentrations of all four molecules for 12.5 h. The k values were kept if they simultaneously improved the square of normalized
residuals for all of the molecules. As our input k values for the first iteration, we used the experimentally determined k values randomly multiplied or divided by up to 2 times.We repeated this calculation 100 times and then averaged the fitted k parameters (Table ). As expected, the fitted k values improve
our agreement between the kinetic model and experimental observations
(Figure , green lines).
In general, the fitted k values agree with our experimental
determinations of k1 and k3 but significantly differ from our determinations of k2 and k4 (Table ). Notably, this analysis
suggests that we systematically overestimated rate constant k2, which caused our empirical model to underestimate
the level of Cp*pC and subsequently underestimate the levels of Cp
and AI.
The Kinetic Model Explains the Rate of Primer Extension at Early
Times
Next, we compared our predicted kcalc against the experimentally determined kobs of primer extension reactions over time. To determine
how the rate of polymerization changes over 10 h, we incubated 24
mM Cp* in primer extension buffer and periodically removed aliquots
to initiate primer extension reactions and determine kobs. After addition of Cp*, the primer extension mixture
contains 90 mM MgCl2 and 90 mM Tris, which is identical
to the conditions of our KM determination
(Figure ).We
observed that the change in the kobs of
primer extension displayed two stages (Figure a). In the first 2 h, the kobs of primer extension increases from 5.0 to 9.7 h–1. Subsequently, kobs steadily
declines to 3.4 h–1 after incubation for 10 h in
primer extension buffer. On the basis of our model using either the
empirical or fitted off-template parameters listed in Table , the change in kcalc also displayed two stages over 10 h. For the fitted
parameters, kcalc increases during the
first 2 h from 2.9 to 9.0 h –1 and then gradually
decreases to 7.2 h–1 by the end of the time course
(Figure a, green line).
By comparing kcalc with kobs, we observed that this model agrees with observation
for the first 2 h but subsequently overestimates the polymerization
rate for the next 8 h. The difference between kcalc and kobs grows over time and
cannot be explained by a difference between observed and calculated
concentrations. These results indicate that the decline in kobs is not fully explained by Cp*pC polymerization
and competitive inhibition by monomers Cp and Cp*.
Figure 8
kobs drifts lower than kcalc over
long-term primer extension experiments. (a)
A 24 mM Cp* solution incubated in primer extension buffer is used
to initiate primer extension reactions at various times to obtain kobs (blue). kcalc is obtained by evaluating eqs and 6 with concentrations calculated
using the empirically determined (red) or computationally fit (green)
rate constants of the off-template reactions. (b) Experimentally determined kobs values of primer extension are plotted vs
the concentration of Cp*pC observed by 31P NMR.
kobs drifts lower than kcalc over
long-term primer extension experiments. (a)
A 24 mM Cp* solution incubated in primer extension buffer is used
to initiate primer extension reactions at various times to obtain kobs (blue). kcalc is obtained by evaluating eqs and 6 with concentrations calculated
using the empirically determined (red) or computationally fit (green)
rate constants of the off-template reactions. (b) Experimentally determined kobs values of primer extension are plotted vs
the concentration of Cp*pC observed by 31P NMR.To better understand the reasons why the model
does not fully explain
the rate of primer extension, we analyzed a plot of kobs versus the concentration of Cp*pC observed by 31P NMR (Figure b). This plot was obtained by combining the data from Figures c and 8a and eliminating the time variable. We observed a positive correlation
between kobs and the concentration of
the intermediate (Pearson r = 0.836, and two-tailed p < 0.0002), suggesting that the concentration of Cp*pC
is a key factor that determines the polymerization rate. In addition,
we noticed that the later time points generally displayed a value
for kobs lower than those of earlier time
points. For instance, the observed concentration of Cp*pC at both
30 min and 10 h is 1.3 mM. However, kobs = 6.2 h–1 at 30 min, and kobs = 3.4 h–1 at 10 h. These results suggest
that an additional factor is changing over time to decrease the rate
of primer extension. We hypothesize that trace amounts of potent inhibitors,
possibly short oligomers, may accumulate over time. Alternatively,
hydrolysis products AI and Cp might inhibit primer extension in ways
not represented by our analysis. Future improvement of the kinetic
model should identify and analyze the cause of decreased rates in
long-term experiments.
Conclusions
We have provided a kinetic
model that recapitulates several features
of the primer extension reaction over time and establishes that the
predominant mechanism of nonenzymatic RNA polymerization of cytidine-5′-phosphoro-2-aminoimidazolide
involves reaction of the primer with Cp*pC. Although we cannot entirely
exclude the traditional mechanism of a nucleophilic attack from the
primer on Cp*, the contribution of this proposed mechanism to the
reaction rate is negligible (<0.4 h–1) under
these conditions. This is likely due to the enhanced reactivity of
Cp*pC relative to Cp*, as well as the higher affinity of the dinucleotide
intermediate for the template. In addition, the alternative mechanism
of a downstream monomer mediating a noncovalent “leaving group–leaving
group” interaction under these conditions is unlikely based
upon measured binding affinities and analogue studies.[12,13]Although this study clarifies the chemical mechanism of polymerization
solely using monomers, the effect of downstream oligomers on the mechanism
remains to be investigated. Possibly, noncovalent interactions occur
using a stably bound downstream oligomer activated with AI. In addition,
we have yet to explicitly investigate the catalysis of Cp*pC formation
by a complementary template. Previous reports indicate that complementary
oligonucleotides act as a template for the reaction between 2-methylimidazole-activated
monomers and ribonucleotide 5′-monophosphates to form dinucleotide
5′,5′-pyrophosphates.[22,23] This work
suggests that two template-bound Cp* might locally react to form Cp*pC
and then polymerize.[24]The differences
between the kinetic model and the experimental
observations identify gaps in our understanding of nonenzymatic RNA
polymerization. First, the kinetic model does not include the formation
of the trace materials observed after extended incubation in primer
extension buffer. Second, the rate of polymerization decreases more
quickly than our model predicts. This leads us to speculate that these
trace compounds might be the inhibitors that explain why the polymerization
rate decreases more quickly than expected. However, we note that the
potential formation of inhibitors may be a peculiarity of the specific
reaction system we have studied. Future extensions of the kinetic
model should address additional, more complex systems, including multiple
monomers and mixed-template sequences.Lastly, our results address
the reasons why 5′-phosphoro-2-aminoimidazole
activation outperforms that of 2-methylimidazole. When we incubated
24 mM 2-MeImpC in primer extension buffer, we did not observe detectable
levels of the corresponding 2-methylimidazolium-bridged dinucleotide
by 31P NMR (Figure S14). Because
it is readily detectable at lower pH, we hypothesize that it is present
at low concentrations.[10] We suspect two
reasons for the accumulation of the 2-aminoimidazolium Cp*pC. First,
the formation of the intermediate is pH-dependent and is optimal when
pH = pKa.[10] The higher pKa of the 2-aminoimidazole
group is more favorable than 2-methylimidazole for formation of the
intermediate at pH 8.3–8.4.[9] Second,
the increased stability of the 2-aminoimidazolium Cp*pC relative to
the 2-methylimidazolium dinucleotide also favors accumulation of this
molecule under primer extension conditions (Figure and Figure S7). We note that the 2-methylimidazolium intermediate was too reactive
to effectively purify in our previous study,[10] but we were readily able to synthesize the 2-aminoimidazolium Cp*pC
with >90% purity in this work. Unexpectedly, the hydrolysis rates
of the 2-MeImpC and the 2-amino Cp* were very similar when free imidazole
was used to limit formation of the intermediate (Figure and Figure S8). This observation differs from a previous result suggesting
that the 2-amino monomer hydrolyzes twice as fast as the 2-methyl
monomer under primer extension conditions.[9] Because of the comparable stabilities of these monomers, we suspect
that the nucleophilicity of the imidazole group also plays an important
role in the strongly enhanced reaction rate of 2-aminoimidazole-activated
monomers. Continuing investigations of the reaction mechanism will
help to clarify the reasons for the superiority of 2-aminoimidazole
and identify the reaction pathways that will need to be optimized
to further improve nonenzymatic RNA polymerization.
Authors: Constantin Giurgiu; Ziyuan Fang; Harry R M Aitken; Seohyun Chris Kim; Lydia Pazienza; Shriyaa Mittal; Jack W Szostak Journal: Angew Chem Int Ed Engl Date: 2021-09-14 Impact factor: 16.823
Authors: Travis Walton; Saurja DasGupta; Daniel Duzdevich; Seung Soo Oh; Jack W Szostak Journal: Proc Natl Acad Sci U S A Date: 2020-03-02 Impact factor: 11.205
Authors: Lijun Zhou; Seohyun Chris Kim; Katherine H Ho; Derek K O'Flaherty; Constantin Giurgiu; Tom H Wright; Jack W Szostak Journal: Elife Date: 2019-11-08 Impact factor: 8.140
Authors: Wen Zhang; Seohyun Chris Kim; Chun Pong Tam; Victor S Lelyveld; Saikat Bala; John C Chaput; Jack W Szostak Journal: Nucleic Acids Res Date: 2021-01-25 Impact factor: 16.971