Literature DB >> 29022704

A Kinetic Model of Nonenzymatic RNA Polymerization by Cytidine-5'-phosphoro-2-aminoimidazolide.

Abstract

The nonenzymatic polymerization of RNA may have enabled copying of functional sequences during the origin of life. Recent progress utilizing 5'-phosphoro-2-aminoimidazole activation has reinvigorated the possibility of using nonenzymatic RNA polymerization for copying arbitrary sequences. However, the reasons why 2-aminoimidazole (AI) is a superior activation group remain unclear. Here we report that the predominant mechanism of polymerization using cytidine-5'-phosphoro-2-aminoimidazolide (Cp*) involves a 2-aminoimidazolium-bridged dinucleotide (Cp*pC) intermediate. To explore the role of this intermediate, we first identify and quantify four reactions involving the synthesis and breakdown of Cp*pC that occur in the absence of the primer-template duplex. We then analyze the dependence of the rate of polymerization on the concentration of the Cp*pC intermediate in the presence and absence of the competitive inhibitor Cp. We also show that the contribution of the monomer Cp* to the polymerization rate is negligible under our primer extension conditions. Finally, we use the experimentally determined rate constants of these reactions to develop a kinetic model that helps explain the changing rate of nonenzymatic RNA polymerization over time. Our model accounts for the concentration of Cp*pC formed by Cp* under primer extension conditions. The model does not completely account for the decline in polymerization rate observed over long times, which indicates that additional important inhibitory processes have not yet been identified. Our results suggest that the superiority of 2-aminoimidazole over the traditional 2-methylimidazole activation is mostly due to the higher level of accumulation of the imidazolium-bridged intermediate under primer extension conditions.

Entities: Chemical Disease Species

Mesh：

Substances：

Year: 2017 PMID： 29022704 PMCID： PMC6340129 DOI： 10.1021/acs.biochem.7b00792

Source DB: PubMed Journal: Biochemistry ISSN： 0006-2960 Impact factor: 3.162

The RNA world hypothesis proposes that early stages of life may have involved the self-replication of RNA oligonucleotides. However, experimental demonstration of RNA self-replication has been extremely difficult, leading some to abandon this hypothesis and suggest alternative scenarios.[1,2] Both the lack of regioselectivity and the limited template generality of nonenzymatic RNA polymerization have been cited as major obstacles to the RNA world. In addition, the prebiotic formation of the canonical RNA nucleosides through glycosylation of ribose is very inefficient, which has raised questions about the availability of RNA in prebiotic environments.[3] Recent work has revived the hypothesis of a primarily RNA-based foundation to the origin of life. First, the synthesis of both ribonucleotides and amino acids from cyanide and other simple molecular precursors offers a plausible set of prebiotic chemical reactions leading to both RNA and peptides.[4] Second, potential solutions to several steps in the nonenzymatic replication of RNA have been proposed. For example, replacing uridine with 2-thiouridine increases the rate of template-directed nonenzymatic polymerization by increasing the affinity of monomers for the template.[5,6] In addition, the reannealing of a large RNA duplex has been greatly slowed by increased solvent viscosity and RNA secondary structure, allowing information transfer.[7] Nevertheless, the one-pot synthesis of a functional RNA sequence through template-directed nonenzymatic polymerization has not been achieved and remains a critical goal for establishing a physicochemical approach to primitive genetic inheritance. Recent progress in nonenzymatic template-directed RNA polymerization has utilized RNA monomers combined with downstream oligomers, all activated with a substituted imidazole group on the 5′-phosphate.[8,9] This system iteratively extends the 3′ end of a primer to synthesize a complementary strand, even over difficult mixed-template sequences. Furthermore, the increased length and yield of primer extension reactions have been observed using 2-aminoimidazole instead of 2-methylimidazole activation.[9] Continued improvement of nonenzymatic RNA polymerization is likely to benefit from a detailed understanding of the chemical mechanism. Our recent report on the kinetics of primer extension by 2-methylimidazole-activated monomers suggested that the first step of the mechanism involves the formation of a 2-methylimidazolium-bridged dinucleotide.[10] Upon subsequent binding to the template, this dimer intermediate reacts with the 3′-hydroxyl of the primer, extending it by one nucleotide and releasing an activated monomer as the leaving group. This mechanism contrasts with a previous proposal that two monomers bind the template such that the downstream monomer catalyzes primer extension through noncovalent interactions with the upstream monomer.[8,11] Recent thermodynamic studies have cast doubt on the latter model because the affinity of the monomer in the downstream (+2) position is relatively low.[12] In addition, crystallographic studies of monomers bound to a template revealed a variety of conformations, some of which are not productive for polymerization reactions.[13] In contrast, structural studies of GpppG, a stable analogue of the intermediate, suggest that the imidazolium bridge may help to preorganize the reaction center for polymerization.[14] However, the relative contribution of each of these two mechanisms to the rate of primer extension has not yet been clarified. Kinetic analysis has been a powerful tool for understanding the reaction mechanism of primer extension systems. For instance, kinetic studies have elucidated the mechanism of 2-MeImpG hydrolysis, a simplified model of template-directed polymerization.[15,16] In addition, two earlier kinetic models identified key determinants of the yield and rate in nonenzymatic RNA polymerization that depend on the particular system studied. Kanavarioti et al.[17,18] account for the synthesis of poly-G catalyzed by poly-C through the quantification of three reactions: monomer hydrolysis, off-template oligomerization, and template-directed primer extension. Also, Kervio et al.[19] have shown that the hydrolysis of 1-hydroxybenzotriazole-activated monomers and competitive inhibition by inactivated monomers are the main limitations for primer extension in that system. However, neither of these models can likely be applied to primer extension by nucleotides activated by 2-aminoimidazole because they do not account for the potential kinetic effects of the imidazolium-bridged dinucleotide intermediate. Here, we present a kinetic model of nonenzymatic RNA polymerization by cytidine-5′-phosphoro-2-aminoimidazolide (Cp*) (Chart ). Our results show that the predominant mechanism of polymerization involves a dinucleotide intermediate in which the two 5′-phosphates are bridged by a 2-aminoimidazolium moiety (Cp*pC). To understand the factors that control the concentration of the intermediate, we studied the kinetics of four reactions that affect Cp*pC levels. We first examined the formation of Cp*pC and 2-aminoimidazole (AI) from Cp* under primer extension conditions by 31P nuclear magnetic resonance (NMR), as well as the reverse process in which AI reacts with Cp*pC to form Cp*. We also measured the rates of hydrolysis of the intermediate Cp*pC and the monomer Cp*. To understand the relationship between the concentration of Cp*pC and nonenzymatic polymerization, we analyzed the rate of primer extension by Cp*pC and competitive inhibition by cytidine 5′-monophosphate (Cp) using Michaelis–Menten kinetics. We combined our empirical rate constants into a kinetic model that relates a series of off-template reactions, including synthesis and decay of Cp*pC, to template-directed primer extension. Our model accounts for the long-term behavior of the off-template reactions and for observations of primer extension rate over the first 2 h. At longer times, additional uncharacterized effects lead to a gradual slowing of the rate of primer extension. Our results suggest that the improved polymerization yield of 2-amino- versus 2-methylimidazole-activated monomers is likely due to the higher level of accumulation of the intermediate under primer extension conditions.

Chart 1

Chemical Structure and Cartoon Representation of the Molecules Investigated in This Study

Materials and Methods

All reagents were purchased from Sigma-Aldrich unless specified. 2-Aminoimidazole hemisulfate was purchased from Combi-Blocks, Inc. All other exceptions are specified in our previous report.[10] All syntheses involving the attachment of imidazole to the 5′-phosphate have been previously reported.[9,10] With the following exceptions, all nonenzymatic primer extension reactions were performed as previously described.[10] For all reactions, the template concentration is 3 μM and the primer concentration is 2 μM. Except where noted below, primer extension occurs in 90 mM MgCl2 and 90 mM Tris (pH 8.3–8.4). In Figure b, the reactions including 45 mM Cp used 160 mM Tris to improve pH buffering. Also, in Figure e, primer extension occurs in 100 mM MgCl2 and 100 mM Tris (pH 8.3–8.4).

Figure 5

Cp competitively inhibits primer extension by Cp*pC. (a) During the primer extension reaction, Cp*pC binds the template with an affinity related to the KM. The Cp and Cp* monomers also bind the template and competitively inhibit the primer extension reaction. (b) Michaelis–Menten plot of Cp*pC with 0–45 mM Cp. Lines represent the Michaelis–Menten equation evaluated with empirical determinations of kobsmax and effective KM. Error bars indicate ±SD (n = 3).

Figure 6

Primer extension in the presence of Cp* is largely due to the formation of Cp*pC. (a) Pseudo-first-order plot of primer extension initiated by the Cp* stock at pH 9.6. (b) Schematic illustrating that addition of AI favors Cp* at the expense of Cp*pC. (c) Gel image of nonenzymatic RNA primer extension with 20 mM Cp* and no additional AI. Samples are at 3, 10, 30, 60, 120, and 180 min. (d) Gel image of nonenzymatic RNA primer extension with 20 mM Cp* and 100 mM AI. (e) Pseudo-first-order plot of nonenzymatic RNA primer extension with various concentrations of AI. The observed values of rate constant k1h are recorded in the key. Error bars indicate ±SD (n = 3).

All NMR spectra were recorded on a Varian INOVA 400 MHz NMR spectrometer at 25 °C. For all kinetic analyses, samples were prepared in H2O, and a coaxial insert containing D2O was used for locking. This eliminates possible solvent isotope effects. All peak assignments were confirmed by addition of standards. For characterization, samples were prepared in D2O, and a coaxial insert was not used. Peaks are referenced to internal trimethyl phosphate (δ 0.00) for 31P NMR, internal acetone (δ 30.89) for 13C NMR, and HOD (δ 4.79) for 1H NMR. NMR spectra of Cp*pC characterization are included (Figure S1). Cp*pC: 1H NMR (400 MHz) δ 7.66 (d, J = 7.6 Hz, 1H), 6.96 (m, 1H), 6.03 (d, J = 7.5 Hz, 1H), 5.85 (d, J = 3.2 Hz, 1H), 4.12 (m, 5H); 31P NMR (161 MHz) δ −12.84 (s); 13C NMR (100 MHz) δ 166.58 (s), 157.90 (s), 150.91 (t, J = 7.1 Hz), 141.62 (s), 116.83 (dd, J = 3.7, 6.7 Hz), 96.77 (s), 90.73 (s), 82.19 (d, J = 8.5 Hz), 74.68 (s), 69.49 (s), 66.16 (d, J = 5.8 Hz); calcd m/z −692.12, observed m/z −692.1. All NMR spectra were analyzed and quantified using MestReNova software. All kinetic analyses were performed using Microsoft Excel and Prism 7. Parameter fitting of the off-template reactions was performed using code written in R. All values are presented as means ± the standard deviation unless specified otherwise.

Results and Discussion

Cp* Monomers Self-React To Form Cp*pC in Primer Extension Buffer

We began our analysis of Cp* by determining whether dinucleotide intermediate Cp*pC can form in primer extension buffer (Figure a). On the basis of previous studies of 2-MeImpG, we expected that the Mg2+ concentration and pH would likely affect our observations.[15] Recent studies of nonenzymatic RNA polymerization have used 50–200 mM MgCl2 and pH 8–9.[8−10] For these experiments, we used 100 mM MgCl2 and 100 mM Tris (pH 8.3–8.4) as the primer extension buffer.

Figure 1

Measuring the reaction of two Cp* monomers to form AI and Cp*pC. (a) Schematic of the reaction being measured for calculation of rate constant k1. (b) 31P NMR spectra of 24 mM Cp* incubated in primer extension buffer over time. Peaks at −11.38 ppm correspond to Cp* and −12.86 ppm to Cp*pC. (c) Analysis of the 31P NMR spectra at 24 mM Cp* in triplicate by a second-order rate plot. Error bars indicate ±SD (n = 3). The slope is equal to 2k1. We used 31P NMR to observe the formation of Cp*pC from Cp* in primer extension buffer (Figure a). Because incubation of monomers near the pKa of the imidazole group is known to promote formation of the intermediate, solutions of Cp* were kept at pH ∼10 to prevent the reaction from occurring before addition to the primer extension buffer. Incubation of 24 mM Cp* in primer extension buffer resulted in the time-dependent appearance of a new peak in 31P NMR spectra (Figure b). The new peak corresponds to intermediate Cp*pC, as confirmed by the addition of a synthetically prepared standard. We did not observe significant formation of Cp during the course of the experiment, suggesting that Cp*pC was not immediately hydrolyzing. Next, we determined rate constant k1 for the reaction of two Cp* monomers to form Cp*pC and AI in primer extension buffer. Samples were prepared in H2O to avoid potential solvent isotope effects and concentrations measured by integration of 31P NMR peaks. The formation of AI was not directly observed because AI does not contain a phosphorus atom. We assumed that the observed kinetics would be entirely due to the reaction of two Cp* molecules to form Cp*pC. First, we confirmed that the formation of Cp*pC does indeed follow second-order kinetics by measuring the initial rate of Cp*pC synthesis at four initial concentrations of Cp*. As expected, the initial rate of Cp*pC synthesis increased with the square of the initial concentration of Cp* (Figure S2). Using the 12 experiments with initial Cp* concentrations from 15 to 40 mM, we calculated the second-order rate constant (k1) to be [4.49 ± 0.47 (standard deviation)] × 10–3 h–1 mM–1 and to range from 3.65 to 5.22 × 10–3 h–1 mM–1 (Figure c and Figure S3).

AI Reacts with CppC To Form Two Molecules of Cp

Having determined that intermediate Cp*pC can form in primer extension buffer, we began to consider additional reactions that would affect the concentration of the intermediate during a primer extension experiment. We first examined the reverse of its synthetic reaction, i.e., the nucleophilic attack of AI on Cp*pC to generate two Cp* monomers (Figure a). This reaction has not been previously described or measured.

Figure 2

Cp*pC reacts with AI to form two molecules of Cp*. (a) Schematic of the reaction measured to determine k2. (b) 31P NMR spectra of 5 mM Cp*pC and 11.5 mM AI incubated in primer extension buffer. The peaks at −11.41 ppm correspond to monomer Cp* and −12.86 ppm to dinucleotide intermediate Cp*pC. (c) Analysis of the reaction between 5 mM Cp*pC and 11.5 mM AI in a second-order kinetic plot. In the plot, k2 is equal to the slope divided by [AI]0 – [Cp*pC]0. Error bars indicate ±SD (n = 3). To follow the reaction of AI with Cp*pC, 11.5 mM AI was incubated with 5 mM Cp*pC in primer extension buffer, and the progress of the reaction was monitored by 31P NMR spectroscopy (Figure b). We observed that the magnitude of the peak for Cp*pC rapidly decreased and that the magnitude of the peak for Cp* increased over time. In addition, the concentration of Cp did not significantly change during the course of the reaction. This indicates that the increase in the level of Cp* is not due to hydrolysis of Cp*pC. Instead, these results indicate that AI reacts with Cp*pC to form two molecules of Cp*. We measured the rate of the reaction between AI and Cp*pC at five different concentrations to determine the reaction order. On the basis of the reaction order, the kinetics should be overall second-order for this reaction, with first-order for both AI and Cp*pC. A reaction order of ∼0.8 was determined for both AI and Cp*pC (Figure S4). We suspect that the empirical determinations of the reaction order might be slight underestimations because of the technical difficulty of measuring the fast rate of the reaction. The 15 experiments with varying concentrations of 5.75–17.25 mM AI and 2.5–7.5 mM Cp*pC were analyzed using second-order kinetic plots to calculate a k2 of 0.238 ± 0.020 (SD) h–1 mM–1 with a range of 0.190–0.271 h–1 mM–1 (Figure c and Figure S5).

CppC Hydrolyzes to Cp and Cp

In addition to reacting with AI, we expected that Cp*pC would also decay through hydrolysis to form Cp and Cp* (Figure a). This reaction has been previously observed for the 2-methylimidazolium-bridged dinucleotide.[10] To measure this reaction, 31P NMR spectra of 5 mM Cp*pC in primer extension buffer were recorded every 31 min for 4.1 h (Figure b). In the absence of free AI, the observed changes in concentration during this time frame should be entirely due to hydrolysis of Cp*pC. As expected, the intermediate decayed to approximately equal amounts of Cp and Cp*, consistent with hydrolysis of Cp*pC (Figure S6). We calculated a pseudo-first-order rate constant of intermediate hydrolysis [k3 = 0.171 ± 0.006 h–1 (Figure c)]. This value corresponds to a half-life of 4.06 ± 0.15 h.

Figure 3

Intermediate Cp*pC hydrolyzes to Cp* and Cp. (a) Schematic of the reaction measured to determine k3. (b) 31P NMR spectrum of 5 mM Cp*pC incubated in primer extension buffer for 3.9 h. The peak at 0.28 ppm corresponds to Cp, that at −11.41 ppm to Cp*, and that at −12.86 ppm to Cp*pC. (c) Quantification of hydrolysis in a first-order kinetic plot. The negative slope is equal to k3 = 0.171 ± 0.006 h–1. Error bars indicate ±SD (n = 3). As a comparison to the traditional 2-methylimidazole activation, the hydrolysis rate of the 2-methylimidazolium-bridged dicytidine intermediate was also measured under similar reaction conditions (Figure S7). We used a partially purified 2-methylimidazolium intermediate as described in our previous report.[10] The observed rate constant of hydrolysis for the 2-methylimidazolium intermediate was determined to be 4.40 ± 0.07 h–1. This value is 26 times higher than the hydrolysis rate constant of 2-aminoimidazolium Cp*pC, indicating that 2-aminoimidazolium Cp*pC is much more stable than the 2-methylimidazolium intermediate.

The Cp* Monomer Slowly Hydrolyzes to Cp and AI

We also measured the rate of hydrolysis of Cp* in primer extension buffer (Figure a). Measuring the hydrolysis rate of Cp* was complex because Cp* also self-reacts to form Cp*pC (Figure ), which can then hydrolyze to Cp, the product of Cp* hydrolysis. To minimize the concentration of Cp*pC, the hydrolysis of Cp* in primer extension buffer was measured at a series of concentrations of AI. We hypothesized that the excess AI would react with Cp*pC to form two Cp* molecules (Figure ), thereby reducing the Cp*pC concentration without affecting Cp* hydrolysis.

Figure 4

Monomer Cp* decays slowly to Cp and AI. (a) Schematic of the reaction measured to determine k4. (b) 31P NMR spectrum of 5 mM Cp* and 20 mM AI incubated in primer extension buffer for 72 h. The peak at 0.29 ppm corresponds to Cp and that at −11.39 ppm to Cp*. The peak at −4.92 ppm corresponds to cyclic cytidine 3′,5′-monophosphate as verified by spike-in. (c) Hydrolysis of 5 mM Cp* in 40 mM AI quantified by 31P NMR spectra in a first-order rate plot. The negative slope is equal to k4 = (3.33 ± 0.03) × 10–3 h–1. Error bars indicate ±SD (n = 3). We began our analysis of the hydrolysis of Cp* by recording 31P NMR spectra of 5 mM Cp* and 20 mM AI over 5 days. Cp* gradually decayed into Cp as well as trace amounts of other products that were detected on the third day (Figure b). A peak corresponding to Cp*pC was not observed, suggesting that Cp formed directly by hydrolysis of Cp*. In the presence of 20 mM AI, we calculated the observed Cp* hydrolysis rate constant [k4 = (4.04 ± 0.16) × 10–3 h–1]. Given that k3 is ∼40 times greater than k4, our results could easily be affected by trace levels of Cp*pC. We repeated our experiment using 40 mM AI and 5 mM Cp* in primer extension buffer. This time we observed that k4 = (3.33 ± 0.03) × 10–3 h–1 (Figure c), suggesting that our previous results overestimated k4 because of trace formation of Cp*pC. Again, our experiments were repeated using 60 mM AI. However, a precipitate was observed on the third day of the experiment, and these data were not analyzed. Overall, these results place an upper limit on the value of k4 and suggest that the half-life of Cp* is >8.7 days because of hydrolysis in primer extension buffer. In addition, we observed that the rate constant of 2-MeImpC hydrolysis equals (2.93 ± 0.26) × 10–3 h–1 under comparable conditions (Figure S8).

Cp*pC Is a Substrate for Primer Extension

Because Cp*pC both forms and decays in primer extension buffer, we began investigating how the concentration of Cp*pC affects the rate of primer extension. This relationship can be approximated through Michaelis–Menten kinetics because of the separate binding and reaction steps that occur on the primer–template complex. The primer–template complex is present at very low concentrations relative to those of substrates, which show saturation binding.[19−21] In addition, Michaelis–Menten kinetics offers a framework for studying the effect of competitive inhibition on the polymerization rate (Figure a). Cp competitively inhibits primer extension by Cp*pC. (a) During the primer extension reaction, Cp*pC binds the template with an affinity related to the KM. The Cp and Cp* monomers also bind the template and competitively inhibit the primer extension reaction. (b) Michaelis–Menten plot of Cp*pC with 0–45 mM Cp. Lines represent the Michaelis–Menten equation evaluated with empirical determinations of kobsmax and effective KM. Error bars indicate ±SD (n = 3). We determined the rate of primer extension using five concentrations of Cp*pC from 0.5 to 20 mM. We observed that the pseudo-first-order rate constant of polymerization during the first 3 min, kobs, increased with the concentration of the intermediate (Figure S9). kobs plateaus at Cp*pC concentrations above 5 mM, suggesting that the template becomes saturated by Cp*pC binding (Figure b, red line). We analyzed these data using a double-reciprocal plot and substituted Vmax with kobsmax for the purpose of comparison. For primer extension by Cp*pC in the absence of competitive inhibition, we observed that kobsmax = 19.5 ± 2.1 h–1 and KM = 1.06 ± 0.12 mM. Having determined that Cp*pC is a substrate of the primer extension reaction, we next analyzed how the monomer competitively inhibits primer extension by Cp*pC. We repeated our Michaelis–Menten analysis of Cp*pC in the presence of 15–45 mM Cp and observed increasingly lower rates for this set of reactions compared to the set with 0 mM Cp (Figure b). Overall, this set of data is consistent with competitive inhibition as analyzed by a double-reciprocal plot (Figure S10). As expected, the effective KM increases with Cp concentration. On the basis of these values, we calculated that the Ki of Cp inhibition of primer extension by Cp*pC is 24.7 ± 11.6 mM. This value agrees with the Kd of 15–19 mM of Cp measured for a different RNA duplex.[20] In addition, this value is consistent with the Kd of 19 mM for Cp and the Kd of 27 mM for 2-MeImpC previously determined for an RNA hairpin.[21] We also note that our calculated kobsmax slightly decreases at higher concentrations of Cp, suggesting possible mixed inhibition. However, we cannot fully exclude potential nonspecific effects of adding Cp, such as changing the ionic strength of the primer extension reaction. In total, the data indicate that kobsmax = 17.6 ± 1.6 h–1, which is within error of our values obtained with both 0 mM Cp and 45 mM Cp.

Cp* Does Not Discernibly Contribute to the Primer Extension Rate

We next sought to determine the rate of primer extension by Cp* directly reacting with the primer. The interpretation of these experiments is complicated by the fact that Cp* forms Cp*pC under primer extension conditions (Figure ). However, we have quantified Cp*pC formation and the rate of primer extension by Cp*pC via competitive inhibition (Figure ). Therefore, we can account for the rate of primer extension due to Cp*pC and then determine the rate due to Cp*, including possible noncovalent interactions between template-bound Cp*. We decided on a final Cp* concentration of 30 mM for these experiments to be above the Kd of the monomer, but not too high to limit Cp*pC formation at short time intervals. The primer extension assay was initiated by adding a 45 mM Cp* stock at pH 9.6 to a reaction mix for final concentrations of 30 mM Cp*, 90 mM Tris, 90 mM MgCl2, 2 μM primer, and 3 μM template, identical to our analysis of Cp*pC. We observed the reaction over the first 8 min, during which time the reaction rate noticeably increased, likely because of the formation of Cp*pC (Figure a). A similar phenomenon has been reported for primer extension by 2-MeImpG.[10] Because of this “speed-up” effect, the calculated rate of primer extension is only approximated by pseudo-first-order kinetic plots. For the first 3 min of the primer extension time course, we observed that the initial rate constant of primer extension kobs = 0.93 ± 0.11 h–1. In comparison, kobs = 2.6 ± 0.5 h–1 for the last 3 min of this time course. Primer extension in the presence of Cp* is largely due to the formation of Cp*pC. (a) Pseudo-first-order plot of primer extension initiated by the Cp* stock at pH 9.6. (b) Schematic illustrating that addition of AI favors Cp* at the expense of Cp*pC. (c) Gel image of nonenzymatic RNA primer extension with 20 mM Cp* and no additional AI. Samples are at 3, 10, 30, 60, 120, and 180 min. (d) Gel image of nonenzymatic RNA primer extension with 20 mM Cp* and 100 mM AI. (e) Pseudo-first-order plot of nonenzymatic RNA primer extension with various concentrations of AI. The observed values of rate constant k1h are recorded in the key. Error bars indicate ±SD (n = 3). To account for the primer extension rate due to Cp*pC, we first calculated the concentration of Cp*pC using the previously determined k1. On the basis of this value, we expect an average concentration of 65 ± 3 μM Cp*pC over the first 3 min of the primer extension reaction. By using the KM and Ki values previously obtained and propagating the errors, we approximate that the kobs due to Cp*pC is ≈0.47 ± 0.13 h–1. This calculation suggests that part of the observed rate may be due to Cp* directly extending the primer. However, a low initial concentration of Cp*pC in our Cp* stock at pH 9.6 could explain the discrepancy. We repeated our calculations for the 6–8 min interval and computed a kobs of 2.0 ± 0.5 h–1, which is within error of the observed value. We sought another approach to directly measure primer extension by Cp* by reducing the Cp*pC concentration. Previously, excess AI was used to decrease the concentration of Cp*pC in Cp* solutions when studying hydrolysis (Figures and 6b). We adopted a similar approach by adding 0–100 mM AI to a series of primer extension reaction mixtures containing a final Cp* concentration of 20 mM. Because of the low rates, we observed the primer extension reactions for 3 h. In addition, we used 100 mM MgCl2 and 100 mM Tris. Addition of AI greatly inhibited the primer extension reaction (Figure c,d). For this set of experiments, we compared a pseudo-first-order rate constant calculated from the first hour of the primer extension reaction, k1h. For 0 mM AI, we observed that k1h = 2.70 h–1. At 100 mM AI, we observed that k1h = 0.38 h–1. Notably, the rate markedly declines between 50 and 100 mM AI, suggesting that our observed rate constant is an upper limit to the polymerization rate of Cp*. From these data, we extrapolated the primer extension rate to infinitely high concentrations of AI, or when the Cp*pC concentration is zero. For the purpose of this calculation, we assumed that Cp* is in equilibrium with Cp*pC and AI (Figure b). To estimate the equilibrium constant, we noted that Keq = k1/k2 = 0.019 ± 0.004 (Figure S11). Because Keq = [Cp*pC][AI][Cp*]−2, [AI] is reciprocal to [Cp*pC], and this equation can be used to approximate the concentration of Cp*pC when [AI] is varied. The k1h values were plotted versus the approximated [Cp*pC] values in a Michaelis–Menten plot (Figure S12). We then analyzed this plot using nonlinear regression to a Michaelis–Menten equation modified with a constant term to represent the possible reaction rate due to 20 mM Cp*. We observed that the y-intercept, when [Cp*pC] is zero, is equal to −0.286 ± 0.159 h–1 standard error. This negative value of k1h is likely due to an overestimation of [Cp*pC] from the Keq value, because Cp*pC also hydrolyzes. However, our analysis suggests that the contribution of Cp* to the primer extension rate is indistinguishable from zero under these conditions.

Kinetic Modeling of the Primer Extension Reaction

We sought to combine our kinetic data into a kinetic model to develop and test our understanding of the primer extension reaction. The model needed to explain both the concentration of the intermediate formed over time under primer extension conditions and how this affects the rate of polymerization. We reasoned that solutions of Cp* would undergo a series of off-template reactions involving Cp*pC, AI, and Cp (Scheme ). Only intermediate Cp*pC binds the template and subsequently reacts with the primer. As observed in our analysis of inhibition by AI, the reaction of Cp* with the primer is too slow to significantly affect the rate of polymerization (Figure ). Finally, we assume that the concentrations of AI, Cp, Cp*, and Cp*pC are not greatly affected by the primer–template duplex on short time scales.

Scheme 1

Kinetic Model of Primer Extension by Cp*

On the left, the off-template reactions form and destroy Cp* and Cp*pC. Cp is formed as a hydrolysis product. On the right, the primer is extended by only Cp*pC. Primer extension is also competitively inhibited by Cp* and Cp. We modeled the four off-template reactions using four kinetic rate equations to describe how the concentration of each molecule will change over time. In these equations, rate constants k1–k4 correspond to the reactions studied by 31P NMR (Figures –4 and Table ). By iteratively solving these equations following initial conditions, we can model the concentrations of all four molecules over time.

Table 1

Off-Template Reaction Rate Constants ± SD Observed Experimentally (Figures –4) or Computationally Fit to 24 mM Cp* Solutions (Figure )

	observed	fitted
k₁ (h^–1 mM^–1)	(4.49 ± 0.47) × 10^–3	(4.55 ± 0.48) × 10^–3
k₂ (h^–1 mM^–1)	0.238 ± 0.020	0.183 ± 0.026
k₃ (h^–1)	0.171 ± 0.006	0.167 ± 0.012
k₄ (h^–1)	(3.33 ± 0.03) × 10^–3	(2.11 ± 0.84) × 10^–3

We modeled the rate of the primer extension reaction based on our Michaelis–Menten kinetic analysis of primer extension by Cp*pC (Figure ). The rate constant of polymerization can be calculated using the relationshipwhere kobsmax, KM, and Ki are empirically determined constants (Figure ). These equations combine the Michaelis–Menten equation with competitive inhibition by the monomers, Cp and Cp*. Under our reaction conditions, both Cp and Cp* are assumed to bind the template with approximately equal affinity to competitively inhibit primer extension. Using the calculated concentrations of Cp, Cp*, and Cp*pC (eqs –4) to evaluate kcalc (eqs and 6), we can model how the polymerization rate of nonenzymatic primer extension changes over time due to the off-template reactions.

Analyzing Long-Term Behavior of Off-Template Reactions with the Kinetic Model

We first tested our kinetic model of the off-template reactions against experimental data obtained by 31P NMR (Figure ). Beginning with 24 mM Cp* in primer extension buffer, we observed the concentrations of Cp, Cp*, and Cp*pC every 7 min and 44 s for 12.5 h. Because AI cannot be observed by 31P NMR, we inferred that the concentration of AI is equal to the concentration of Cp plus Cp*pC.

Figure 7

Kinetic model describing the long-term behavior of off-template reactions in 24 mM Cp* solutions in primer extension buffer. Concentrations over time for experimentally observed (blue), empirical model (red), and fitted model (green) are presented for (a) Cp, (b) Cp*, (c) Cp*pC, and (d) AI. Concentrations were experimentally determined using 31P NMR. Modeled concentrations were iteratively calculated using eqs –4 and the rate constants listed in Table . Our experimental observations of off-template reactions displayed dynamic changes in concentration for some of the molecules. For Cp, the concentration steadily increased over time from 0.2 to 4.0 mM over 12.5 h (Figure a, blue line). In contrast, the concentration of Cp* rapidly decreased from 22.4 to 18.0 mM over the first 2 h and then slowly decreased to 15.9 mM over the next 10.5 h (Figure b, blue line). For Cp*pC, we observed a sharp increase over the first 2 h from 0.4 to 2.2 mM (Figure c, blue line). At this peak concentration, Cp*pC accounts for 18% of the cytidine present in the mixture. Over the next 10.5 h, the concentration of Cp*pC gradually decreases to 1.2 mM. Finally, the AI concentration was inferred to sharply increase during the first 2 h from 0.7 to 3.1 mM and then gradually increase to 5.2 mM by the end of the NMR series (Figure d, blue line). In addition, we also observed four peaks from trace products (Figure S13). We compared these data to our kinetic model initialized with the concentrations observed in the first spectrum of the NMR series and a step size of 1 min for dt in our calculations. Overall, we observed similar trends between our model and the experimental observations of the four molecules (Figure , red lines). For Cp*, Cp*pC, and AI, the model predicts a 2 h initial phase of rapid concentration change, followed by a second phase of gradual changes. However, there are important discrepancies between the model and the experimental data. Most notable is the 14% difference between the calculated and observed concentrations of Cp at the end of the time course (Figure a). This may be due in part to the 12% underestimation of Cp*pC at its peak concentration near 2 h (Figure c). In addition, a 1.9 mM difference develops between the model and the observed concentration of Cp* (Figure b). Part of this discrepancy is likely due to the formation of trace materials that amount to ∼1.8 mM by 12.5 h. Future studies of the off-template reactions should account for the formation of these trace materials. Given the differences between the model and the observed concentrations, we computationally fit the off-template k values to this experimental data set. Our approach randomly varied k values within ±20% for 100 iterations and then computed the concentrations of all four molecules for 12.5 h. The k values were kept if they simultaneously improved the square of normalized residuals for all of the molecules. As our input k values for the first iteration, we used the experimentally determined k values randomly multiplied or divided by up to 2 times. We repeated this calculation 100 times and then averaged the fitted k parameters (Table ). As expected, the fitted k values improve our agreement between the kinetic model and experimental observations (Figure , green lines). In general, the fitted k values agree with our experimental determinations of k1 and k3 but significantly differ from our determinations of k2 and k4 (Table ). Notably, this analysis suggests that we systematically overestimated rate constant k2, which caused our empirical model to underestimate the level of Cp*pC and subsequently underestimate the levels of Cp and AI.

The Kinetic Model Explains the Rate of Primer Extension at Early Times

Next, we compared our predicted kcalc against the experimentally determined kobs of primer extension reactions over time. To determine how the rate of polymerization changes over 10 h, we incubated 24 mM Cp* in primer extension buffer and periodically removed aliquots to initiate primer extension reactions and determine kobs. After addition of Cp*, the primer extension mixture contains 90 mM MgCl2 and 90 mM Tris, which is identical to the conditions of our KM determination (Figure ). We observed that the change in the kobs of primer extension displayed two stages (Figure a). In the first 2 h, the kobs of primer extension increases from 5.0 to 9.7 h–1. Subsequently, kobs steadily declines to 3.4 h–1 after incubation for 10 h in primer extension buffer. On the basis of our model using either the empirical or fitted off-template parameters listed in Table , the change in kcalc also displayed two stages over 10 h. For the fitted parameters, kcalc increases during the first 2 h from 2.9 to 9.0 h –1 and then gradually decreases to 7.2 h–1 by the end of the time course (Figure a, green line). By comparing kcalc with kobs, we observed that this model agrees with observation for the first 2 h but subsequently overestimates the polymerization rate for the next 8 h. The difference between kcalc and kobs grows over time and cannot be explained by a difference between observed and calculated concentrations. These results indicate that the decline in kobs is not fully explained by Cp*pC polymerization and competitive inhibition by monomers Cp and Cp*.

Figure 8

kobs drifts lower than kcalc over long-term primer extension experiments. (a) A 24 mM Cp* solution incubated in primer extension buffer is used to initiate primer extension reactions at various times to obtain kobs (blue). kcalc is obtained by evaluating eqs and 6 with concentrations calculated using the empirically determined (red) or computationally fit (green) rate constants of the off-template reactions. (b) Experimentally determined kobs values of primer extension are plotted vs the concentration of Cp*pC observed by 31P NMR. To better understand the reasons why the model does not fully explain the rate of primer extension, we analyzed a plot of kobs versus the concentration of Cp*pC observed by 31P NMR (Figure b). This plot was obtained by combining the data from Figures c and 8a and eliminating the time variable. We observed a positive correlation between kobs and the concentration of the intermediate (Pearson r = 0.836, and two-tailed p < 0.0002), suggesting that the concentration of Cp*pC is a key factor that determines the polymerization rate. In addition, we noticed that the later time points generally displayed a value for kobs lower than those of earlier time points. For instance, the observed concentration of Cp*pC at both 30 min and 10 h is 1.3 mM. However, kobs = 6.2 h–1 at 30 min, and kobs = 3.4 h–1 at 10 h. These results suggest that an additional factor is changing over time to decrease the rate of primer extension. We hypothesize that trace amounts of potent inhibitors, possibly short oligomers, may accumulate over time. Alternatively, hydrolysis products AI and Cp might inhibit primer extension in ways not represented by our analysis. Future improvement of the kinetic model should identify and analyze the cause of decreased rates in long-term experiments.

Conclusions

We have provided a kinetic model that recapitulates several features of the primer extension reaction over time and establishes that the predominant mechanism of nonenzymatic RNA polymerization of cytidine-5′-phosphoro-2-aminoimidazolide involves reaction of the primer with Cp*pC. Although we cannot entirely exclude the traditional mechanism of a nucleophilic attack from the primer on Cp*, the contribution of this proposed mechanism to the reaction rate is negligible (<0.4 h–1) under these conditions. This is likely due to the enhanced reactivity of Cp*pC relative to Cp*, as well as the higher affinity of the dinucleotide intermediate for the template. In addition, the alternative mechanism of a downstream monomer mediating a noncovalent “leaving group–leaving group” interaction under these conditions is unlikely based upon measured binding affinities and analogue studies.[12,13] Although this study clarifies the chemical mechanism of polymerization solely using monomers, the effect of downstream oligomers on the mechanism remains to be investigated. Possibly, noncovalent interactions occur using a stably bound downstream oligomer activated with AI. In addition, we have yet to explicitly investigate the catalysis of Cp*pC formation by a complementary template. Previous reports indicate that complementary oligonucleotides act as a template for the reaction between 2-methylimidazole-activated monomers and ribonucleotide 5′-monophosphates to form dinucleotide 5′,5′-pyrophosphates.[22,23] This work suggests that two template-bound Cp* might locally react to form Cp*pC and then polymerize.[24] The differences between the kinetic model and the experimental observations identify gaps in our understanding of nonenzymatic RNA polymerization. First, the kinetic model does not include the formation of the trace materials observed after extended incubation in primer extension buffer. Second, the rate of polymerization decreases more quickly than our model predicts. This leads us to speculate that these trace compounds might be the inhibitors that explain why the polymerization rate decreases more quickly than expected. However, we note that the potential formation of inhibitors may be a peculiarity of the specific reaction system we have studied. Future extensions of the kinetic model should address additional, more complex systems, including multiple monomers and mixed-template sequences. Lastly, our results address the reasons why 5′-phosphoro-2-aminoimidazole activation outperforms that of 2-methylimidazole. When we incubated 24 mM 2-MeImpC in primer extension buffer, we did not observe detectable levels of the corresponding 2-methylimidazolium-bridged dinucleotide by 31P NMR (Figure S14). Because it is readily detectable at lower pH, we hypothesize that it is present at low concentrations.[10] We suspect two reasons for the accumulation of the 2-aminoimidazolium Cp*pC. First, the formation of the intermediate is pH-dependent and is optimal when pH = pKa.[10] The higher pKa of the 2-aminoimidazole group is more favorable than 2-methylimidazole for formation of the intermediate at pH 8.3–8.4.[9] Second, the increased stability of the 2-aminoimidazolium Cp*pC relative to the 2-methylimidazolium dinucleotide also favors accumulation of this molecule under primer extension conditions (Figure and Figure S7). We note that the 2-methylimidazolium intermediate was too reactive to effectively purify in our previous study,[10] but we were readily able to synthesize the 2-aminoimidazolium Cp*pC with >90% purity in this work. Unexpectedly, the hydrolysis rates of the 2-MeImpC and the 2-amino Cp* were very similar when free imidazole was used to limit formation of the intermediate (Figure and Figure S8). This observation differs from a previous result suggesting that the 2-amino monomer hydrolyzes twice as fast as the 2-methyl monomer under primer extension conditions.[9] Because of the comparable stabilities of these monomers, we suspect that the nucleophilicity of the imidazole group also plays an important role in the strongly enhanced reaction rate of 2-aminoimidazole-activated monomers. Continuing investigations of the reaction mechanism will help to clarify the reasons for the superiority of 2-aminoimidazole and identify the reaction pathways that will need to be optimized to further improve nonenzymatic RNA polymerization.

15 in total

Review 1. Protocells and RNA Self-Replication.

Authors: Gerald F Joyce; Jack W Szostak
Journal: Cold Spring Harb Perspect Biol Date: 2018-09-04 Impact factor: 10.005

2. Deep sequencing of non-enzymatic RNA primer extension.

Authors: Daniel Duzdevich; Christopher E Carr; Jack W Szostak
Journal: Nucleic Acids Res Date: 2020-07-09 Impact factor: 16.971

3. Structure-Activity Relationships in Nonenzymatic Template-Directed RNA Synthesis.

Authors: Constantin Giurgiu; Ziyuan Fang; Harry R M Aitken; Seohyun Chris Kim; Lydia Pazienza; Shriyaa Mittal; Jack W Szostak
Journal: Angew Chem Int Ed Engl Date: 2021-09-14 Impact factor: 16.823

Review 4. Enzyme-free genetic copying of DNA and RNA sequences.

Authors: Marilyne Sosson; Clemens Richert
Journal: Beilstein J Org Chem Date: 2018-03-12 Impact factor: 2.883

5. Crystallographic observation of nonenzymatic RNA primer extension.

Authors: Wen Zhang; Travis Walton; Li Li; Jack W Szostak
Journal: Elife Date: 2018-05-31 Impact factor: 8.140

6. Eighty routes to a ribonucleotide world; dispersion and stringency in the decisive selection.

Authors: Michael Yarus
Journal: RNA Date: 2018-05-21 Impact factor: 4.942

7. In vitro selection of ribozyme ligases that use prebiotically plausible 2-aminoimidazole-activated substrates.

Authors: Travis Walton; Saurja DasGupta; Daniel Duzdevich; Seung Soo Oh; Jack W Szostak
Journal: Proc Natl Acad Sci U S A Date: 2020-03-02 Impact factor: 11.205

8. Non-enzymatic primer extension with strand displacement.

Authors: Lijun Zhou; Seohyun Chris Kim; Katherine H Ho; Derek K O'Flaherty; Constantin Giurgiu; Tom H Wright; Jack W Szostak
Journal: Elife Date: 2019-11-08 Impact factor: 8.140

9. Nonenzymatic Template-Directed Synthesis of Mixed-Sequence 3'-NP-DNA up to 25 Nucleotides Long Inside Model Protocells.

Authors: Derek K O'Flaherty; Lijun Zhou; Jack W Szostak
Journal: J Am Chem Soc Date: 2019-06-20 Impact factor: 15.419

10. Structural interpretation of the effects of threo-nucleotides on nonenzymatic template-directed polymerization.

Authors: Wen Zhang; Seohyun Chris Kim; Chun Pong Tam; Victor S Lelyveld; Saikat Bala; John C Chaput; Jack W Szostak
Journal: Nucleic Acids Res Date: 2021-01-25 Impact factor: 16.971