Martin Pfeiffer1,2, Bernd Nidetzky3,4. 1. Institute of Biotechnology and Biochemical Engineering, Graz University of Technology, NAWI Graz, Petersgasse 12, 8010, Graz, Austria. 2. Austrian Centre of Industrial Biotechnology (acib), Petersgasse 14, 8010, Graz, Austria. 3. Institute of Biotechnology and Biochemical Engineering, Graz University of Technology, NAWI Graz, Petersgasse 12, 8010, Graz, Austria. bernd.nidetzky@tugraz.at. 4. Austrian Centre of Industrial Biotechnology (acib), Petersgasse 14, 8010, Graz, Austria. bernd.nidetzky@tugraz.at.
Abstract
C-Analogues of the canonical N-nucleosides have considerable importance in medicinal chemistry and are promising building blocks of xenobiotic nucleic acids (XNA) in synthetic biology. Although well established for synthesis of N-nucleosides, biocatalytic methods are lacking in C-nucleoside synthetic chemistry. Here, we identify pseudouridine monophosphate C-glycosidase for selective 5-β-C-glycosylation of uracil and derivatives thereof from pentose 5-phosphate (D-ribose, 2-deoxy-D-ribose, D-arabinose, D-xylose) substrates. Substrate requirements of the enzymatic reaction are consistent with a Mannich-like addition between the pyrimidine nucleobase and the iminium intermediate of enzyme (Lys166) and open-chain pentose 5-phosphate. β-Elimination of the lysine and stereoselective ring closure give the product. We demonstrate phosphorylation-glycosylation cascade reactions for efficient, one-pot synthesis of C-nucleoside phosphates (yield: 33 - 94%) from unprotected sugar and nucleobase. We show incorporation of the enzymatically synthesized C-nucleotide triphosphates into nucleic acids by RNA polymerase. Collectively, these findings implement biocatalytic methodology for C-nucleotide synthesis which can facilitate XNA engineering for synthetic biology applications.
C-Analogues of the canonical N-nucleosides have considerable importance in medicinal chemistry and are promising building blocks of xenobioticnucleic acids (XNA) in synthetic biology. Although well established for synthesis of N-nucleosides, biocatalytic methods are lacking in C-nucleoside syntheticchemistry. Here, we identify pseudouridine monophosphateC-glycosidase for selective 5-β-C-glycosylation of uracil andderivatives thereof from pentose 5-phosphate (D-ribose, 2-deoxy-D-ribose, D-arabinose, D-xylose) substrates. Substrate requirements of the enzymatic reaction are consistent with a Mannich-like addition between the pyrimidinenucleobase and the iminium intermediate of enzyme (Lys166) and open-chain pentose 5-phosphate. β-Elimination of the lysine and stereoselective ring closure give the product. We demonstrate phosphorylation-glycosylation cascade reactions for efficient, one-pot synthesis of C-nucleoside phosphates (yield: 33 - 94%) from unprotectedsugar andnucleobase. We show incorporation of the enzymatically synthesizedC-nucleotide triphosphates into nucleic acids by RNA polymerase. Collectively, these findings implement biocatalytic methodology for C-nucleotide synthesis which can facilitate XNA engineering for synthetic biology applications.
Most natural and syntheticnucleosides (e.g., uridine, U; Fig. 1a) have their sugar andnucleobase parts connected via a hemiaminal ether (N-glycosidic) linkage. Nucleoside analogs replacing the canonical N- by a rare-to-nature C-glycosidic linkage resist the common routes of biological degradation by N-glycoside hydrolases and phosphorylases[1-4]. Since in vivo stability is crucial for compound efficacy and applicability in therapy, syntheticC-nucleosides have considerable importance in medicinal chemistry. Nucleoside analogs generally represent promising lead structures in drug design for antiviral, antibacterial and antitumor activity[5-9]. SyntheticC-nucleosides, such as BCX4430 (Galidesivir) andGS-6620 (Supplementary Fig. 1), can be remarkably effective against filovirus infections andhepatitis C virus, respectively[7,10-13]. Originally developed for use against filoviruses, GS-5734 (Remdesivir; Supplementary Fig. 1) is now also evaluated against infections of the SARS-CoV-2 virus[10,13,14]. Other C-nucleosides, such as showdomycin, formycin, pseudouridimycin andminimycin (Supplementary Fig. 1), constitute an emerging class of microbial natural products that exhibit unusual structures anddiverse biological activities, including antibiotic efficacy[15-21].
Fig. 1
Proposed reaction mechanism of YeiN and the enzymatic phosphorylation-glycosylation cascade used in this study.
The N-nucleoside uridine (U) (a) compared to the C-nucleoside pseudouridine (Ψ) (b) and reverse reaction of YeiN for synthesis of Ψ 5’-phosphate (ΨMP) from d-Rib5P (1) and Ura (12) (c). In panel c, B is the base residue from the enzyme. The key catalytic steps are the following: formation of a covalent iminium ion intermediate between the enzyme (Lys166) and the open-chain d-Rib5P, assisted by Glu31 (I); Mannich-like reaction for C-C coupling (II); β-elimination of the amine of Lys166 (III); cyclization through an oxa-Michael addition-like reaction (IV). For further discussion of the enzyme mechanism, see the text under the Results and the Discussion. d Phosphorylation-glycosylation cascades for one-pot synthesis of ΨMP and Ψ 5’-triphosphate (ΨTP) from unphosphorylated d-Rib are shown. Dephosphorylation of ΨMP yields Ψ. RbsK d-Rib-5-kinase, PK pyruvate kinase, CIP calf intestine phosphatase, NDK nucleoside diphosphate kinase. PEP phosphoenolpyruvate, PYR pyruvate.
Proposed reaction mechanism of YeiN and the enzymatic phosphorylation-glycosylation cascade used in this study.
The N-nucleoside uridine (U) (a) compared to the C-nucleoside pseudouridine (Ψ) (b) and reverse reaction of YeiN for synthesis of Ψ 5’-phosphate (ΨMP) from d-Rib5P (1) andUra (12) (c). In panel c, B is the base residue from the enzyme. The key catalytic steps are the following: formation of a covalent iminium ion intermediate between the enzyme (Lys166) and the open-chain d-Rib5P, assisted by Glu31 (I); Mannich-like reaction for C-Ccoupling (II); β-elimination of the amine of Lys166 (III); cyclization through an oxa-Michael addition-like reaction (IV). For further discussion of the enzyme mechanism, see the text under the Results and the Discussion. d Phosphorylation-glycosylation cascades for one-pot synthesis of ΨMP and Ψ 5’-triphosphate (ΨTP) from unphosphorylatedd-Rib are shown. Dephosphorylation of ΨMP yields Ψ. RbsKd-Rib-5-kinase, PK pyruvate kinase, CIPcalf intestine phosphatase, NDK nucleosidediphosphate kinase. PEPphosphoenolpyruvate, PYR pyruvate.C-nucleosides are furthermore important as natural or synthetic building blocks of nucleic acids, RNA in particular. The U isomer pseudouridine (Ψ; Fig. 1b) occurs naturally in all types of RNA (tRNA, rRNA, snRNA, snoRNA, scaRNA, and mRNA)[22-26]. The Ψ formation is catalyzed by Ψ-synthase, which post-translationally isomerizes U to Ψ [27,28]. Owing to additional hydrogen bonding from the nucleobase N1 in Ψ compared to U (Fig. 1a, b), positional substitution of U with Ψ can increase RNA stability by locally enhancing the macromolecular rigidity[23]. Chemically synthesized Ψ andderivatives thereof are important building blocks for study of C-nucleoside biology as well as for numerous synthetic biology applications based on xenobioticnucleic acids (XNA)[29-36]. For example, the uniform substitution of U by Ψ can render synthetic mRNA non-immunogenic and was shown to enhance the translation efficiency[30-32]. Additionally, Ψ-modified mRNA can boost the gene-cutting efficiency in CRISPR-based gene editing[37,38].To advance C-nucleosides for their emerging applications in XNA engineering or in drug design, the target compounds must be made available through efficient and structurally precise synthetic methods. Chemical routes to C-nucleosides typically follow the synthetic rationale of (1) reconstructing the heteroaryl nucleobase on a C1′-functionalizedsugar or (2) coupling the prepared heteroaryl with a suitable glycosyl reagent, such as a hydroxy group-protectedglycal or pentonolactone[5,39-43]. However, despite much progress from research over decades, a purely chemical approach to definedC-nucleosides remains challenging to establish[44-50]. A synthesis that exploits the high stereo- and regiocontrol of enzymatic reactions for installing the C-glycosidic linkage would be highly desirable. Natural enzymes of C-glycoside biosynthesis, such as tRNA pseudouridine synthase[27,28,51,52] andC-glycosynthase[15,18-21,53,54] have complex substrate requirements, leading to perceived limitations on their applicability. Therefore, unlike N-nucleosides for which N-nucleoside phosphorylases present excellent tools of glycoside synthesis[55-60], biocatalytic methods are lacking for C-nucleoside syntheticchemistry.Here, we identify pseudouridine monophosphate (ΨMP) C-glycosidase for C-nucleoside synthesis. The enzyme’s natural role is the recycling of Ψ via degradation of its 5′-phosphate into uracil (Ura(12)) andd-ribose 5-phosphate (d-Rib5P(1)), as shown in Supplementary Fig. 2[61,62]. Despite early evidence suggesting applicability[63-65], the reverse C-glycosidase reaction (Fig. 1c) is unexplored synthetically. Multiplicity of enzymes in C-nucleoside turnover and overlapped enzyme naming used previously complicate clear assignment of enzyme type to function. For this study, we use the biochemically and structurally well-characterized ΨMP C-glycosidase from E. coli (YeiN). We show that YeiNcatalyzes the selective 5-β-C-glycosylation of Ura (12) andderivatives thereof from several pentose 5-phosphate substrates, including the 5-phosphate of d-ribose, 2-deoxy-d-ribose, 2-deoxy-2-fluoro-d-ribose, d-arabinose or d-xylose. The ΨMP and its corresponding sugar or nucleobase variants are obtained in a single-step, highly efficient transformation. Remarkably, although formally a reversion of hydrolysis, the overall enzymaticconversion benefits from the favorable thermodynamics of its central catalytic step, a carbon-carbon bond-forming (Mannich-like) addition. It thus proceeds with excellent synthetic yields (≥90%), even in water. Finally, we show incorporation of the enzymatically synthesizedC-nucleotide triphosphates into RNA. Collectively, therefore, our study implements biocatalytic methodology for C-nucleotide synthesis, which can facilitate XNA engineering for synthetic biology applications and establish new routes to natural product C-glycosides.
Results
The substrate scope of YeiN in reverse C-glycosidase reaction
Figure 2 shows the substrates used. Besides the native substrates d-Rib5P (1) andUra (12), we tested a series of sugar phosphates (in combination with Ura, 12) andnucleobase analogs (in combination with d-Rib5P, 1) as alternative substrates (Fig. 2) of C-glycosylation by YeiN. Based on product formation detected with HPLC, 2-deoxy-d-Rib5P (d-dRib5P, 2), d-xylose5-phosphate (d-Xyl5P, 3), d-arabinose5-phosphate (d-Ara5P, 4) and2-deoxy-2-fluoro-d-ribose5-phosphate (2F-d-Rib5PP, 5) are active. l-Ribose 5-phosphate (l-Rib5P, 6), d-glucose 6-phosphate (d-Glc6P, 7), d-mannose 6-phosphate (d-Man6P, 8), d-fructose 6-phosphate (d-Fru6P, 9), d-ribulose 5-phosphate (d-Rul5P, 10) andd-erythrose 4-phosphate (d-Ery4P, 11) are inactive. We also considered the 5-phosphates of 2-amino-2-deoxy- and 3-amino-3-deoxy-d-ribose, l-arabinose, d-lyxose, andl-xylose (Supplementary Fig. 3), but enzymatic phosphorylation of these pentoses from ATP proved not possible. The nucleobase analogs used for reaction with YeiN feature substitutions at each Uracarbon (Fig. 2). 3-Methyl-Ura (13), 6-amino-Ura (14), 4-thio-Ura (15) and2-thio-Ura (16) are active. Absence of activity with 1-methyl-Ura (17), 5-fluoro-Ura (18) and6-trifluoro-methyl-Ura (19) can be mechanistically relevant, as discussed below. Additionally, thymine (5-methyl-Ura, 20), 6-aza-Ura (21), 6-Cl-Ura (22), cytosine (23), 2,4-thio-Ura (24), 5-ethynyl-Ura (25), maleimide (26), adenine (27), guanine (28), 4-pyrimidone (29) andpyrimidine (30) were evaluated but found to be inactive. Combinatorial assessment of all sugar 5-phosphates active with Ura (12) against the full panel of Ura analogs used with d-Rib5P (1) confirms the substrate scope of YeiN (Supplementary Table 1).
Fig. 2
Substrate scope of YeiN.
Substrates tested for C-glycosylation by YeiN. The compounds framed in blue are active while those framed in magenta are not.
Substrate scope of YeiN.
Substrates tested for C-glycosylation by YeiN. The compounds framed in blue are active while those framed in magenta are not.We obtained kinetic parameters for the substrates (Fig. 2 and Supplementary Table 2). Compared with d-Rib5P (1) andUra (12), catalytic efficiencies (kcat/KM) for alternative substrates are decreased between ~10 to ~103-fold, reflecting substantial (≥5-fold) changes in both kcat (decrease) and KM (increase).
Characterization of the YeiN reaction
The substrate scope of YeiN is rationalized in terms of structural and mechanistic requirements of the enzymatic reaction. Molecular-docking results show that, with the terminal phosphate group accommodated in the enzyme binding pocket (Fig. 3a), a five-carbonchain is needed to precisely position the substrates carbonyl for iminium ion formation with the ε-amino group of Lys166. Docking of d-Rib5P (1) results in a potentially catalyticcomplex, with the ε-amino group of Lys166 positioned (3.3 Å) for nucleophilic attack at the C1 carbonyl carbon. Thr139 (2.6 Å) andGlu31 (3.2 Å) are placed suitably to stabilize negative charge developing on the C1 oxygen atom during carbinol intermediate formation (Fig. 3a, c). The carbinol intermediate may be stabilized by Thr131 and the main chain carbonyl oxygen of Gly132 (Fig. 3c). Glu31 is furthermore positioned to promote expulsion of water from the carbinol to give the iminium ion (protonatedSchiff base). In terms of structural arrangement and proposedcatalytic function, Thr139 andGlu31 represent a clear analogy to aldolase active sites[66-72]. In terms of sequence and three-dimensional structure, however, YeiN shows hardly any similarity with well-studied aldolases, such as deoxy-ribose phosphate aldolase[67,71], d-fructose bisphosphate aldolase[69], and 2-dehydro-3-deoxy-phosphogluconate aldolase[70,72]. Mechanistically, YeiNdiffers from these aldolases in that it requires the iminium ion as an electrophile for C–Ccoupling (Fig. 1c). The aldolases convert the iminium ion to an enamine (carbanion iminium by resonance) so that it can function as a nucleophile in their aldol reactions[66-70,72].
Fig. 3
The substrate specificity of YeiN analyzed with molecular docking.
Docking poses of a
d-Rib5P (1, cyan) and b Ura (12, white). The enzyme bound Mn2+ is shown as a violet sphere. Distances are shown in Å. c Suggested mechanism of formation of the key covalent iminium ion intermediate that serves as electrophile in the subsequent Mannich-like addition, as shown in Fig. 1c.
The substrate specificity of YeiN analyzed with molecular docking.
Docking poses of a
d-Rib5P (1, cyan) and b Ura (12, white). The enzyme boundMn2+ is shown as a violet sphere. Distances are shown in Å. c Suggested mechanism of formation of the key covalent iminium ion intermediate that serves as electrophile in the subsequent Mannich-like addition, as shown in Fig. 1c.Contrary to d-Rib5P (1), the docking poses for d-Ery4P (11), d-Rul5P (10), andd-Fru6P (9) have Thr139 andGlu31 not properly aligned to the carbonyl oxygen, geometrically (d-Ery4P, 11; Supplementary Fig. 4a) or in terms of interaction distance (d-Rul5P, 10; d-Fru6P, 9; Supplementary Fig. 4b, d, respectively). Additionally, the poses of d-Ery4P (11) andd-Rul5P (10) suggest loss of the stabilizing interaction with Gly132. The d-Glc6P (7) pose (Supplementary Fig. 4c) shows an enlargeddistance between the ε-amino of Lys166 and the C1 carbon. Lack of activity of these sugar phosphates is thus plausibly explained.Docking of Ura (12) into the structure of the covalent enzyme-d-Rib5P adduct (iminium ion form) places the reactive sugarC1′ andnucleobase C5 at a distance (3.4 Å) and orientation (91° angle between C5-C1′ andpyrimidine ring plane; Fig. 3b) well suited for carbon-carbon bond formation. Mechanistically, the catalytic reaction on the enzyme resembles a Mannich-like addition, for it involves the addition of a nucleophile (the cyclic enamine of the nucleobase) to an iminium ion (the activatedaldehyde of d-Rib5P, 1) (Figs. 1c and 3c).Except for positioning, there seems to be limitedcatalytic facilitation from the enzyme to the agency of the Ura (12) C5 as a nucleophile of the reaction. The crystal structure of a covalent intermediate between YeiN and ΨMP shows just a single hydrogen bond between the C2 oxygen atom of Ura andAsn289 (Supplementary Fig. 5)[62]. The docking pose of Ura (12) suggests additional polar contacts (Fig. 3b) that can contribute to the precise positioning of the C5 of Ura above the C1 of the covalently boundd-Rib5P (1). Further docking analysis summarized in Supplementary Fig. 6 shows that lack of reactivity with d-Rib5P (1) can reflect steric restraint on the productive binding of certain nucleobases, such as 6-trifluoromethyl-Ura (19), 6-Cl-Ura (22) and1-methyl-Ura (17) (Supplementary Fig. 6a, b, d). Other Ura (12) analogs, including 5-fluoro-Ura (18) as well as 5-methyl-Ura (20) and6-aza-Ura (21) (Supplementary Fig. 6e, f, c) are accommodated in the enzyme binding pocket almost identically as the parent Ura (12). Positioned properly for catalysis, these Ura (12) analogs seem to be rendered inactive intrinsically. The proposed path for the catalytic reaction of YeiN (Fig. 1c) suggests mechanistic rationale for why the particular substitutions are incompatible with the enzyme activity. The 1-methyl substituent precludes formation of the mechanistically important cyclic imine tautomer in the catalytic step of C–Ccoupling (Fig. 1c). Owing to electronic effect, the 6-aza-Ura (21) is an unlikely nucleophile for reaction with the iminium ion of the covalent enzyme intermediate. In the subsequent catalytic step, as shown in Fig. 1c, the Mannich adduct undergoes β-elimination of the amine of the lysine (Fig. 1c). Ura (12) analogs bearing substituents at the C5 are chemically incompetent to release the lysine analogously.In the last step of the synthetic reaction, cyclization to a β-C-pentofuranosyl ring occurs and this proceedchemically through a kind of oxa-Michael addition (Fig. 1c). The cyclization could happen on the enzyme, in which case it is likely to be stereospecific, or in solution. Anomeric stereoselectivity can be a distinguishing feature of enzymes that catalyze C-ribosylation. The C-glycosynthase AlnA produces a mixture of α- (16%) and β-glycosidic (84%) forms during C-ribosylation of the naphthoquinoneprealnumycin from d-Rib5P (1), suggesting ribosyl ring closure without enzymatic assistance (Supplementary Fig. 7)[54]. Here, we used in situ proton NMR in 2H2O solvent to monitor ΨMP formation from, anddegradation to, d-Rib5P (1) andUra (12). The time-resolvedNMR spectra for the synthesis direction (Fig. 4a–c; 14 mM Ura, 12 and 8 mM d-Rib5P, 1) show that only a single anomer (β-C-riboside) is formed. They also show that solvent deuterium is incorporated at the Ura (12) C5 at a rate (0.07 mM/min) about 3-fold lower than the rate of conversion of Ura (12) into ΨMP (0.22 mM/min). This suggests that the deuteriumuptake does not happen immediately during ΨMP synthesis but occurs only later when ΨMP andUra (12) equilibrate dynamically. Suitable controls show that deuteriumuptake into Ura (12) necessitates ΨMP formation (Supplementary Fig. 8). Inactive sugar phosphates (d-Ery4P, 11; d-Rul5P, 10; d-Glc6P, 7; andd-Fru6P, 9) do not promote deuteriumuptake into the C5 of Ura (12). Analyzing the hydrolysis of ΨMP (Fig. 4d–f), we confirm that the Ura (12) released from ΨMP incorporates solvent deuterium at its C5. The hydrolysis reaction approaches equilibrium after about 100 min and an equilibrium constant of 8.9 × 10−5 M can be calculated from the data (0.9 mM Ura, 12 andd-Rib5P, 1; 9.1 mM ΨMP).
Fig. 4
Enzymatic synthesis and hydrolysis of ΨMP monitored by 1H-NMR.
a Scheme of enzymatic synthesis of ΨMP. Solvent deuterium is incorporated into Ura (12) C5 due to enzymatic hydrolysis of ΨMP. b
1H-NMR traces of ΨMP synthesis from 14 mM Ura (12) and 8 mM d-Rib5P (1), using 3 µM YeiN. c Time course of ΨMP synthesis as determined from the NMR spectra (n = 1). d Scheme of solvent deuterium incorporation into Ura C5 due to the enzymatic hydrolysis of ΨMP e
1H-NMR traces of hydrolysis of ΨMP (10 mM) by 3 µM YeiN. f Time course of ΨMP hydrolysis as determined from the NMR spectra (n = 1). Relative amounts of β-d-Rib5P and α-d-Rib5P are displayed on the secondary axis.
Enzymatic synthesis and hydrolysis of ΨMP monitored by 1H-NMR.
a Scheme of enzymatic synthesis of ΨMP. Solvent deuterium is incorporated into Ura (12) C5 due to enzymatic hydrolysis of ΨMP. b
1H-NMR traces of ΨMP synthesis from 14 mM Ura (12) and 8 mM d-Rib5P (1), using 3 µM YeiN. c Time course of ΨMP synthesis as determined from the NMR spectra (n = 1). d Scheme of solvent deuterium incorporation into Ura C5due to the enzymatic hydrolysis of ΨMP e
1H-NMR traces of hydrolysis of ΨMP (10 mM) by 3 µM YeiN. f Time course of ΨMP hydrolysis as determined from the NMR spectra (n = 1). Relative amounts of β-d-Rib5P and α-d-Rib5P are displayed on the secondary axis.The NMR analysis (Fig. 4e, f) additionally shows that at all times, the d-Rib5P (1) released from ΨMP involves α-furanose (33%) and β-furanose (67%) forms at anomeric equilibrium. Plausible interpretation is that the open-chain form of d-Rib5P is released into solution where the furanose structures are formed spontaneously. However, the mutarotation of d-Rib5P (1)[73] occurs at the limits of temporal resolution of the experiment, leaving the conformation of the releasedd-Rib5P (1) undecided.
Phosphorylation-glycosylation cascade for the synthesis of C-nucleoside monophosphates
We considered that limitation on the syntheticutility of the YeiN-catalyzed formation of C-nucleoside monophosphatescan arise from the requirement for a pentose 5-phosphate as the substrate. The unphosphorylatedpentose, however, is an attractive starting material for synthesis. We therefore designed an enzymaticcascade reaction (Fig. 1d) in which pentose phosphorylation[74] andC-glycosylation of the nucleobase are telescoped into a one-pot transformation. We show efficient enzymatic synthesis of ΨMP from d-Rib andUra (12) anddemonstrate applicability of the biocatalytic system to the synthesis of all sugar andnucleobase variants of the C-glycoside that were shown in Fig. 2 to be accessible via the YeiN reaction. As shown in Supplementary Fig. 9 and summarized in Table 1, the yield of synthesis ranges between useful and excellent (33-95%), and product (10–60 mg) can be recovered generally with high purity from small-scale reactions (15 mL). Product structures are confirmed by NMR (Supplementary Fig. 10–29) and agree well with published spectra[75].
Table 1
Enzymatic syntheses with yields and product purities indicated.
Enzymatic syntheses with yields and product purities indicated.n.d. no product formation detected.n.i. not isolated.*NMR spectra not recorded. Purity based on HPLC.§Based on Ura (12) converted.#Based on pentoseconverted.i5-(2-deoxy-2-fluoro-β-d-ribofuranosyl 5-phosphate)uracil, ii5-(β-d-arabinofuranosyl 5-phosphate)uracil, iii5-(β-d-xylofuranosyl 5-phosphate)uracil.
Dephosphorylation and phosphorylation for structural diversification of the C-nucleoside triphosphates
We considered that chemical biology and medicinal chemistry applications can require the unphosphorylatedC-nucleoside or the C-nucleoside triphosphate. We therefore show enzymaticdephosphorylation of ΨMP with non-specific phosphatase to obtain the C-nucleoside in excellent yield (Table 1 and Supplementary Fig. 30). Dephosphorylation can be analogously applied to the other C-glycoside monophosphates. To prepare the C-nucleoside triphosphate (e.g., ΨTP), we set up a cascade reaction in which the monophosphate is phosphorylated by nucleotide monophosphate kinase and pyruvate kinase in the presence of ATP andPEP (Fig. 1d). Interestingly, the addition of a nucleotide diphosphate kinase did not improve efficiency of the overall phosphorylation (Supplementary Fig. 31)[76]. We show synthesis of ΨTP, 2-deoxy-ΨTP (dΨTP) and 6-amino ΨTP in a one-pot reaction starting from unphosphorylatedsugar andUra (12) (Fig. 5 and Supplementary Fig. 32). Products were recovered in mg amounts and in suitable purity (≥85%) for characterization and further studies.
Fig. 5
Enzymatic cascade synthesis of ΨTP and dΨTP.
Panels a and b show ΨTP and dΨTP, respectively. Pentoses (100 mM) were phosphorylated applying RbsK (15 µM for d-Rib; 30 µM for d-dRib) and pyruvate kinase (4 µM) using 100 mM PEP and 2 mM ATP as phosphate donors (see Supplementary Fig. 33). ΨMP synthesis was initiated by addition of 15 mM Ura, 1 mM MnCl2, YeiN (1 µM for d-Rib, 3 µM for d-dRib) and 10 mM HEPES buffer, resulting in a concentration of 25 mM pentose 5-phosphate (time point zero a, b. The arrow indicates the start of the phosphorylation reaction by addition of 15 mM PEP, 15 µM CMPK, and 1 mM ATP, resulting in a volume increase of 33% (n = 1).
Enzymatic cascade synthesis of ΨTP and dΨTP.
Panels a and b show ΨTP anddΨTP, respectively. Pentoses (100 mM) were phosphorylated applying RbsK (15 µM for d-Rib; 30 µM for d-dRib) and pyruvate kinase (4 µM) using 100 mM PEP and 2 mM ATP as phosphatedonors (see Supplementary Fig. 33). ΨMP synthesis was initiated by addition of 15 mM Ura, 1 mM MnCl2, YeiN (1 µM for d-Rib, 3 µM for d-dRib) and 10 mM HEPES buffer, resulting in a concentration of 25 mM pentose 5-phosphate (time point zero a, b. The arrow indicates the start of the phosphorylation reaction by addition of 15 mM PEP, 15 µM CMPK, and 1 mM ATP, resulting in a volume increase of 33% (n = 1).
C-Nucleoside triphosphates for enzymatic synthesis of xenobiotic RNA
Using the YeiN-encoding gene as the template, we examined amplification of the corresponding RNA andDNA using ΨTP, 6-amino-ΨTP anddΨTP in place of, or together with, the native UTP anddTTP, respectively. Applying T7 RNA polymerase as shown in Fig. 6a, we succeeded in synthesizing RNA built solely from Ψ instead of U.
Fig. 6
Polymerase-catalyzed amplification of RNA and DNA.
Panels a and b show RNA and DNA, respectively. RNA was transcribed from YeiN-encoding template DNA (n = 1). a In vitro translation was performed using standard conditions, 30 U of T7 RNA polymerase, 700 ng template DNA, transcription buffer (40 mM Tris-HCl, pH 7.9, at 25 °C; 6 mM MgCl2, 10 mM DTT, 10 mM NaCl and 2 mM spermidine) and 2 mM of each NTP (ATP, GTP, UTP, CTP). Modified RNA was synthesized either by addition of 2 mM of ΨTP to the NTPs (NTPs + ΨTP) or by fully replacing UTP by 2 mM ΨTP (NTPs + ΨTP − UTP). b The yein gene was amplified from 50 ng expression vector (pet15_yein) using 1.25 U of Dream-Taq DNA polymerase, 0.2 mM of each dNTP (dATP, dCTP, dGTP, dTTP) and 0.5 µM of forward and reverse primer. To incorporate dΨ into the DNA either 0.2 mM of ΨTP were additionally added to dNTP mix (dNTPs + dΨTP) or TTP was fully replaced by dΨTP (dNTPs + YTP − TTP). Further details are described in “Methods”.
Polymerase-catalyzed amplification of RNA and DNA.
Panels a and b show RNA andDNA, respectively. RNA was transcribed from YeiN-encoding template DNA (n = 1). a In vitro translation was performedusing standardconditions, 30 U of T7 RNA polymerase, 700 ng template DNA, transcription buffer (40 mM Tris-HCl, pH 7.9, at 25 °C; 6 mM MgCl2, 10 mM DTT, 10 mM NaCl and 2 mM spermidine) and 2 mM of each NTP (ATP, GTP, UTP, CTP). Modified RNA was synthesized either by addition of 2 mM of ΨTP to the NTPs (NTPs + ΨTP) or by fully replacing UTP by 2 mM ΨTP (NTPs + ΨTP − UTP). b The yein gene was amplified from 50 ng expression vector (pet15_yein) using 1.25 U of Dream-Taq DNA polymerase, 0.2 mM of each dNTP (dATP, dCTP, dGTP, dTTP) and 0.5 µM of forward and reverse primer. To incorporate dΨ into the DNA either 0.2 mM of ΨTP were additionally added to dNTP mix (dNTPs + dΨTP) or TTP was fully replaced by dΨTP (dNTPs + YTP − TTP). Further details are described in “Methods”.Densitometric analysis of gel bands suggested that RNA polymerase transcription efficiency (expressed as gel band intensity) was similar when ΨTP was used to substitute UTP. Unexpectedly, the band intensity decreased to about half when ΨTP andUTP were added at the same time. Overloading of the RNA polymerase with NTP substrate may have caused inhibition, perhaps due to limitation in the concentration of Mg2+used. 6-Amino-ΨTP anddΨTP were not accepted by the used RNA polymerase. Applying DreamTaq-polymerase that is known for its relaxeddNTP substrate specificity, we observed that dΨTP cannot substitute for dTTP (Fig. 6b). However, when used together with dTTP, the dΨTP appeared to inhibit the amplification slightly (∼25%).
Discussion
We here demonstrate the reverse reaction of ΨMP C-glycosidase for efficient biocatalytic synthesis of C-nucleoside monophosphates. Besides the natural ΨMP, a panel of sugar andnucleobase analogs of the C-nucleoside monophosphate are synthesized with excellent regiocontrol and stereoselectivity. The substrate requirements of the enzymatic reaction are consistent with the mechanistic proposal (Fig. 1c) of a Mannich-like C-C addition between the iminium ion of an enzyme bound open-chain pentose 5-phosphate intermediate and the pyrimidinenucleobase. β-Elimination of the enzyme residue and stereospecificcyclization via oxa-Michael-like addition give the C-nucleoside monophosphate product. Use of the iminium ion intermediate as an electrophile of the reaction distinguishes YeiN from well-studied aldolases that form essentially the same covalent intermediate but convert it to an enaminenucleophile for aldol coupling.Chemical studies of C-glycoside synthesis have shown that the condensation product is favored thermodynamically, even in water[77,78], and the syntheticuse of the chemical reaction was demonstrated[73-77]. Equilibrium on the YeiN reaction far on the side of the C-nucleoside monophosphate, which would seem unusual for a nominal glycosidase reaction, is thus explained. EnzymaticC–Ccoupling reactions are synthetically important in the carbohydrate sciences [65,78-86] and the current study adds new application-relevant feature to them.The development of enzymatic phosphorylation-glycosylation cascade reactions enable expedient, one-pot biocatalytic syntheses of the C-nucleoside 5′-phosphates from unphosphorylatedpentoses as the substrates (Fig. 1d). We show that the C-nucleoside 5′-phosphates are readily dephosphorylated to give the C-nucleoside and that they can also be converted into the corresponding 5′-triphosphates. The non-phosphorylatedC-nucleoside and its 5′-triphosphate are promising substrates for chemical and polymerase-catalyzedXNA synthesis, respectively. Phosphorylation-glycosylation cascades extend the syntheticutility of the ΨMP C-glycosidase reaction, enabling C-nucleoside phosphates to be obtained from unprotectedsugar andnucleobase.The ΨMP C-glycosidases are related evolutionary to a class of C-glycosynthases, represented by the well-characterized AlnA from Streptomyces sp. CM020, that catalyze β-C-riboside formation from d-Rib5P (1) anddifferent naphthoquinones (Supplementary Fig. 7)[53,54,87]. The herein developed idea of modular phosphorylation-glycosylation cascades could be analogously applied to AlnA-type C-glycosynthases, to also harness these enzymes’ catalytic proficiency and substrate promiscuity for practical synthesis.
Methods
Materials
ATP ( > 95% purity) was from Roth (Karlsruhe, Germany). Deuterium oxide (99.96%) was from Euriso-Top (Saint-Aubin Cedex, France). Calf intestine phosphatase (CIP) was from New England Biolabs (Frankfurt am Main, Germany). Rabbit muscle pyruvate kinase (PK), adenosine triphosphate (ATP), d-ribose (d-Rib), d-2-deoxy-ribose (d-dRib), d-xylose (d-Xyl), d-arabinose (d-Ara), d-mannose (d-Man), d-glucosamine (d-GlcN), N-acetyl-d-glucosamine (d-GlcNAc), d-galactose (d-Gal), d-galactosamine (d-GalN), d-lyxose (d-Lyx), l-arabinose (l-Ara), l-xylose (l-Xyl), d-ribose-5-phosphate (d-Rib5P(1)), d-fructose 6-phosphated-Fru6P(9), d-mannose 6-phosphate (d-Man6P (8)), d-glucose 6-phosphate (d-Glc6-P (7)), d-erythrose 4-phosphate (d-Ery4P (11)), d-ribulose 5-phosphate (d-Rul5P (10)), uracil (Ura (12)), 6-amino-Ura (14), 4-thio-Ura (15), 2-thio-Ura (16), 6-trifluoromethyl-Ura (19), 1-methyl-uracil (17), pyrimidine (29), pyrimidone (30), maleimide (26), adenine (27), guanin (28), thymine (20), andcytosine (23) were from Sigma Aldrich (Vienna, Austria). 3-Methyl-Ura (13), 6-aza-Ura (21), 6-Cl-Ura (22), andl-ribose 5-phosphate (6) were from Carbosynth (Compton, UK). 2-Amino-2-deoxy-d-ribose (2A-d-Rib), 3-amino-3-deoxy-d-ribose (3A-d-Rib), and2-deoxy-2-fluoro-d-ribose (2F-d-Rib)) were from Synthose (Concord, Ontario, Canada). Expression vectors (pet15b) containing the genes for ribokinase (RbsK) or β-pseudouridine 5′-phosphate glycosidase (YeiN) were obtained from Genescript (Leiden, The Netherlands). Restriction enzymes, Phusion polymerase, T7 RNA Polymerase, Dream Taq DNA polymerase, and GenJET® PCR clean-up kit were from Thermo Fisher Scientific (Altham, MA, USA). DNA sequencing was done at LGC Genomics (Berlin, Germany).
Ψ-5’-phosphate glycosidase (YeiN)
An N-terminally His6-taggedE. coliYeiN was used. The expression vector (pet15_yein) as obtained from Genescript was transformed into the E. coliBL21_DE3 (pLys) expression strain by electroporation and transformants were selected on 0.1 mg/mL ampicillin LB-agar plates.YeiN was expressed in 1-L baffled shaken flasks at 37 °C and 110 rpm using 250 mL Lennox-mediacontaining 0.1 mg/mL of ampicillin. Inoculation was to an initial OD600 of 0.1. At an OD600 of 0.8, the temperature was decreased to 18 °C and gene expression was induced with 0.4 mM IPTG (isopropyl β-d-1-thiogalactopyranoside) for 20 h. Cells were harvested by centrifugation at 2000 × g at 4 °C for 30 min using a Sorvall RC-5B refrigerated superspeedcentrifuge (Du Pont Instruments, Newtown, CT, USA). The supernatant was discarded, and the pellet was suspended in 50 4-(2-hydroxyethyl) piperazine-1-ethanesulfonic acid (HEPES) buffer (pH 8.0). Cells were disruptedusing sonication (Fisherbrand SonicDismembrator, model Ultrasonic Processor FB-505; Fisher Scientific, Vienna, Austria; 5 min) on ice and the cell-free supernatant was recovered by centrifugation at 4 °C and 2000 × g for 30 min. Pre-treatedcell lysate (20 mL) was loaded on 2 × 5 mL HisTrap FF column (GE Healthcare, Little Chalfont, UK), equilibrated with buffer A (50 mM HEPES, pH 8.0 containing 125 mM NaCl, 20 mM imidazole) and mounted on an ÄKTA prime plus (GE Healthcare) system. Protein was elutedusing an imidazole gradient from 0% to 80% buffer B (50 mM HEPES, pH 8.0, containing 125 mM NaCl and 500 mM imidazole). The temperature was 10 °C and a flow rate of 3 mL/min was used. Fractions containing the target protein were pooled, concentrated and buffer-exchanged with Amicon Ultra-15 Centrifugal Filter Units (Millipore; Billerica, MA, USA). The final protein concentration was 30 mg/mL in 50 mM HEPES buffer containing 2 mM MgCl2 (pH 7.0). Enzyme was stored at −20 °Cuntil further use. Protein purification was monitored by sodium dodecyl sulphate–polyacrylamide gel electrophoresis (SDS PAGE) (Supplementary Fig. 34).The YeiN activity assay was performed at 37 °C in 50 mM HEPES buffer supplemented with 2 mM MnCl2 (pH 7.0). About 1.5 µM YeiN was used in the reaction. The substrates used were d-Rib5P (1) (2 mM) andUra (12) (1 mM). One unit of YeiN activity is the amount of enzyme producing 1 μmol of ΨMP/min under the conditions used. Enzyme preparations had typically a specific activity of ∼3 U/mg.
Ribokinase (RbsK)
The expression vector (pet15_RbsK) as obtained from Genescript was transformed into the E. coliBL21_DE3 (pLys) expression strain by electroporation and transformants were selected on 0.1 mg/mL ampicillin LB-agar plates. RbsK was expressed and purified as described for YeiN, except the protein was stored in 50 mM HEPES supplemented with 2 mM MgCl2 (pH 7.0).An N-terminally His6-taggedE. coliRbsK was used. The RbsK activity assay was performed at 37 °C in 50 mM HEPES buffer (pH 7.0, pH adjusted with KOH) containing 2 mM MgCl2 and 3 µM RbsK. The substrates used were ATP (10 mM) andd-Rib (10 mM). One unit of RbsK activity is the amount of enzyme producing 1 μmol of d-Rib5P (1)/min under the conditions used. The enzyme preparation had a specific activity of ∼50 U/mg.
Pentose-5-phosphate phosphatase BT4131
The sugar phosphate phosphatase BT4131 from Bacteroides thetaiotaomicron VPI-5482 was used as an N-terminal fusion with the positively charged binding module Zbasic2[88]. Expression, harvest andcell disruption was performed as described for YeiN except 50 µg/mL kanamycin was used instead of ampicillin.Cell-free extract (20 mL) was loaded on 2 × 5 mL HiTrap SP FF column (GE Healthcare), equilibrated with buffer A (50 mM MES buffer, pH 7.0, containing 100 mM NaCl) and mounted on an ÄKTA prime plus (GE Healthcare) system. Protein was elutedusing an NaCl gradient from 0% to 100% buffer B (50 mM HEPES buffer supplemented with 2 M NaCl, pH 7.0) at 10 °Cusing a flow rate of 3 mL/min. Fractions containing the target protein were pooled, concentrated and buffer-exchanged with Amicon Ultra-15 Centrifugal Filter Units (Millipore; Billerica, MA, USA). The final protein concentration was 10 mg/mL in 50 mM HEPEScontaining 2 mM MgCl2 (pH 7.0). The enzyme was stored at 4 °Cuntil further use. Protein purification was monitoredusing SDS PAGE.The specific activity was determined in 50 mM HEPES buffer supplemented with 5 mM MgCl2 (pH 7.0) at 37 °Cusing 20 mM d-Glc6P as the substrate. The reaction was started with 1.5 µM BT4131 and samples taken were quenched by heat treatment. Inorganic phosphate was measured at 850 nm[88]. One unit of BT4131 activity is the enzyme amount releasing 1 μmol of phosphate from d-Glc6P/min under the conditions applied. The specific activity of the phosphatase preparation was typically ∼16 U/mg.
Nucleoside monophosphate kinases (NPK)
The uridinemonophosphate kinase (umpk) andcytidine monophosphate kinase (cmpk) genes were amplified from genomicE. coliBL21_DE3 (pLys) DNA by PCR using Phusion polymerase and the following set of primers (restriction sites underlined): UMPK_NdeI_fw (CATATGGCTACCAATGCAAAACCCGTC), UMPK_XhoI_rv (CTCGAGTTATTCCGTGATTAAAGTCCCTTCTTTTTC); CMPK_NdeI_fw (CATATGACGGCAATTGCCCCGGTTATTACC), CMPK_XhoI_rv (CTCGAGTTATGCGAGAGCCAATTTCTGGCGC). Doubly digested (NdeI and XhoI) PCR products were purified from agarose gels treated with shrimp alkaline phosphatase (NEB, Ipswich, MA, USA) and ligated (T4 ligase) into the linearized (NdeI and XhoI) pET-STREP3 expression vector. The constructed vector was transformed into electro-competent E. coliBL21_DE3 (pLys) and transformants were selected on LB-agar plates (0.05 mg/mL kanamycin). Isolated vectors were sequenced (LGC Genomics). NPK expression, cell harvesting andcell disruption was performed as described for YeiN, except that 0.05 mg/mL kanamycin instead of ampicillin was used andcells were dissolved in wash buffer (100 mM Tris-Cl pH 8.0, 150 mM NaCl, 1 mM EDTA). Cell-free extract (20 mL) was loaded (2 mL/min) on 1 × 5 mL StrepTrap HP column (GE Healthcare), equilibrated with wash buffer and mounted on an ÄKTA prime plus (GE Healthcare) system. Unspecifically bound protein was washed off with 10 column volumes of wash buffer at a flow of 3 mL/min. Protein was elutedusing elution buffer (100 mM Tris-Cl pH 8.0, 150 mM NaCl, 1 mM EDTA, 2.5 mM desthiobiotin). Fractions containing the target protein were pooled, concentrated and buffer-exchanged with Amicon Ultra-15 Centrifugal Filter Units (Millipore). The final protein concentration was 15 mg/mL in 50 mM HEPEScontaining 2 mM MgCl2 (pH 7.0). Enzyme was stored at −20 °Cuntil further use. Protein purification was monitoredusing SDS PAGE (Supplementary Fig. 34). The UMPK was only weakly expressed and most of the recombinant enzyme was found in inclusion bodies.An N-terminally strep-taggedCMPK was used. The NPK-assay was performed at 37 °C in 50 mM HEPES buffer (pH 7.0) containing 1 mM MgCl2 and 5 µM CMPK. The substrates used were 3 mM UMP and 3 mM ATP. One unit of NPK activity is the amount of enzyme producing 1 μmol of UDP/min under the conditions used. CMPK preparation typically exhibited a specific activity of 20 U/mg.
Nucleoside diphosphate kinase (NDK)
An N-terminally Strep-tagged variant was used. The ndk gene was amplified from genomicE. coliBL21_DE3 (pLys) DNA by PCR using Phusion polymerase and the following set of primers (overlapping 5'-overhangs in lower case letters): NDK_fwd (ATGGCTATTGAACGTACTTTTTCCATC), NDK_rv (TTAACGGGTGCGCGGGC). The PCR product was cloned into the pET-STREP3 expression vector by circular polymerase extension cloning. Briefly, using the primer pairs CEPEC_NDK_fwd (GTTCGAGAAAGGCTTAATTAACCATATGgctattgaacgtactttttccatc), CPEC-NDK_rv (GTGGTGGTGGTGGTGCTCGAGttaacgggtgcgcgggc) andCPEC-NDK-Strep_fwd (ggaaaaagtacgttcaatagcCATATGGTTAATTAAGCCTTTCTCGAAC) CPEC-NDK-Strep_rv (gcccgcgcacccgttaaCTCGAGCACCACCACCACCAC) in two independent PCR reactions, complementary overlaps are introduced into the ndk gene and into the target vector.The purified PCR products were used in an overlap-extension-PCR to prime each other and to create the final expression vector, which was transformed into E. coli XL1 Blue. Transformants were selected on LB-agar plates containing 0.1 mg/mL of ampicillin, and the isolated plasmid was sequenced. The sequenced vector was transformed into the expression strain E. coliBL21_DE3 (pLys) and transformants were selected on LB-agar plates containing 0.1 mg/mL of ampicillin. NDK expression and purification were as described for NDKs (Supplementary Fig. 34).The NDK activity assay was performed at 37 °C in 50 mM HEPES buffer (pH 7.0) containing 1 mM MgCl2 and 5 µM NDK. The substrates used were 3 mM uridine diphosphate (UDP) and 3 mM ATP. One unit of NDK activity is the amount of enzyme producing 1 μmol of UTP/min under the conditions used. The enzyme preparation had a specific activity of 21 U/mg.
Determination of protein concentration
Protein concentrations were measuredusing DS-11 Spectrophotometer (DeNovix; Wilmington, DE, USA) at 280 nm. Protein concentration was calculatedusing the corresponding molar extinction coefficient and the molecular weight computed by protparam (https://web.expasy.org/protparam/). RbsK (12,490 M−1cm−1, 32,290 Da), YeiN (10,095 M−1cm−1, 32,909 Da), CMPK (13,075 M−1cm−1, 25,970 Da), NDK (4470 M−1cm−1, 15,463 Da), BT4131 (14,690 M−1cm−1, 28,856 Da).
Determination of nucleotide concentration
Uracilcontent was measured photometrically using a DS-11 Spectrophotometer (DeNovix). Ura (12) (260 nm, 10 absorbance units (AU) mM−1 cm−1), 4-thio-Ura (15) (270 nm, 22 AU mM−1 cm−1), 2-thio-Ura (16) (273 nm, 12 AU mM−1 cm−1), thymidine (20) (260 nm, 8.9 AU mM−1 cm−1), guanine (28) (246 nm, 10.7 mM−1 cm−1), cytosine (23) (267 nm, 6.1 mM−1 cm−1), adenine (27) (260 nm, 13 mM−1 cm−1).
High-performance liquid chromatography (HPLC)
Nucleosides were quantified by reversed-phase ion-pairing HPLC. Typically, 5 µL of sample containing an overall nucleosideconcentration of about 1 mM were loaded on a Kinetex C18 column (Phenomenex, Aschaffenburg, Germany; 5 µm, 100 Å, 50 × 4.6 mm). Analytes were separated in 5-min long isocratic runs using 20 mM phosphate buffer, pH 5.9, containing 40 mM tetra-n-butylammonium bromide (TBAB) and 12.5% acetonitrile. The flow rate was 2 mL/min and the temperature set to 35 °C. Eluents were detected at 260 nm. A 5-min HPLC trace is shown in Supplementary Fig. 35. Typical retention times were as follows: Ura (12) (0.3 min), ΨMP (0.4 min), ΨDP (0.9 min), ADP (1.1 min), ΨTP (2.7 min), andATP (4 min).
Thin-layer chromatography (TLC)
About 1–2 µL sample were spotted onto a TLC plate (Merk, Darmstadt, Germany). The mobile phase was composed of 2-BuOH:AcOH:H2O (2:1:1). The development solution consisted of 0.5 g thymol, 95 mL EtOH and 5 mL H2SO4. Reducing sugar spots were developed by heating the plate to approximately 70 °C for 2 min using a heat-gun (Supplementary Fig. 33).
Synthes is and isolation of pentose 5-phosphates
Prior to synthesis, the substrate scope of RibK was evaluatedusing mixture of 100 mM sugar (d-Rib, d-dRib, d-Xyl, d-Ara, 2F-d-Rib, d-Glc, 2F-d-Rib, 3A-d-Rib, l-Xyl, l-Ara, d-Lyx, d-Man, d-GlcN, d-GlcNAc, d-GlcA, d-Gal, d-GalN), 100 mM PEP, 2 mM ATP and 10 mM HEPES buffer (pH 7.0) containing 1 mM MgCl2 and enzymes (60 µM RbsK, 0.4 µM PK) in a total volume of 100 µL. The reactions were incubated at 37 °C without agitation for 20 h and 5 µL sample were analyzed by TLC. Of the testedsugars, d-Rib, d-dRib, d-Xyl, d-Ara,2F-d-Rib, d-Glc were accepted.Synthesis was performed at 5 mL scale, except for 2-deoxy-2-fluoro-d-Rib 5-phosphate (2F-D-Rib5P (5) that was prepared in a 1 mL reaction. Reaction mixtures contained 100 mM pentose (d-Rib, d-dRib, d-Xyl, d-Ara, 2F-d-Rib), 100 mM PEP, 2 mM ATP in 10 mM HEPES buffer (pH 7.0) with 1 mM MgCl2 and 0.4 µM PK. For the phosphorylation of d-Rib, 15 µM RbsK, for d-dRib 30 µM RbsK, and for other pentoses 60 µM RbsK were used. The reactions were incubated at 37 °C without agitation for 20 h and 5 µL sample were analyzed by TLC. The reaction was stopped by heat treatment at 99 °C for 10 min and precipitated protein was removed by centrifugation for 10 min at 4 °C and 16,000 × g. The reactions were diluted with twice volume of water and applied (20 mL; 2 mL/min) to a FliQ FPLCcolumn (10 mL; 104 × 11.0 mm; Generon, Maidenhead, U.K.) packed with SuperQ-650M (Tosh, Tokyo, Japan) anion exchange resin mounted onto an ÄKTA prime plus FPLC system (GE Healthcare). The column was equilibrated in 10 mM sodium acetate (NaAc), pH 4.5. Products were elutedusing a linear gradient of 1 M NaAc (pH 4.5), from 0% to 50% in 120 mL with a flow of 4 mL/min. Eluting fractions were evaluated for the presence of sugar phosphatesusing TLC. Following the product elution, the column was regenerated with 100% of 1 M NaAc buffer for 10 min. The fractions containing pentose 5-phosphates were pooled andconcentratedunder reduced pressure (20 mbar, 40 °C) to a total volume of 1 mL using a rotary evaporator (Laborta 500-efficient; Heidolph Instruments, Schwabach, Germany). The concentration of pentose 5-phosphates were evaluated as follows: sugar phosphates were completely hydrolyzed by applying 10 µM of sugar phosphate phosphatase (AGP)[88] for 20 h. The releasedinorganic phosphate was determined in a colorimetric assay based on the formation of a blue phospho-molybdatecomplex at 850 nm[89].
Screening sugar phosphate and nucleobase substrates
All reactions were performed at 37 °C in 50 mM HEPES buffer (pH 7.0) containing 2 mM MnCl2 in a total volume of 100 μL. Nucleobase screening was performed by incubating 5 mM d-Rib5P (1) and 1 mM of nucleobase (Ura (12), 3-methyl-Ura (13), 6-amino-Ura (14), 2-thio-Ura (15), 4-thio-Ura (16), 1-methyl-Ura (17), 5-fluoro-Ura (18), 6-trifluoro-methyl-Ura (19), 6-aza-Ura (21), 6-Cl-Ura (22), 2,4-dithio-Ura (24), 5-ethynyl-Ura (25), maleimide (26), cytosine(23), thymine (20), adenine (27), guanine (28), 4-pyrimidone (29) andpyrimidine (30)) with 30 µM YeiN. Sugar phosphates were screened by incubating 1 mM Ura and 5 mM of sugar phosphate (d-Rib5P (1), d-dRib5P (2), d-Xyl5P (3), d-Ara5P (4), d-Fru6P (9), d-Man6P (8), d-Glc6P (7), d-Rul (10), d-Ery4P (11), 2F-d-Rib5P (5), l-Rib5P (6) with 30 µM YeiN. Samples (20 µL) were withdrawn after 0, 3, 24 h and quenched by methanol addition (100 µL). Precipitated protein was removed by centrifugation at 20,000 × g for 5 min and supernatant was analyzed by HPLC.Five millimolar of pentosephosphates (previously isolated as described above) accepted by YeiN in the initial screen (d-Rib5P (1), d-dRib5P (2), d-Xyl5P (3), d-Ara5P (4), d-2F-Rib5P(5)) were incubated with 2 mM of each nucleobase (Ura (12), 3-methyl-Ura (13), 6-amino-Ura (14), 2-thio-Ura (15), 4-thio-Ura (16), 1-methyl-Ura (17), 5-fluoro-Ura (18), 6-trifluoro-methyl-Ura (19), 6-aza-Ura (21), 6-Cl-Ura (22), 2,4-dithio-Ura (24), 5-ethynyl-Ura (25), maleimide (26), cytosine(23), thymine (20), adenine (27), guanin (28), 4-pyrimidone (29) andpyrimidine (30)) and 30 µM of YeiN for 3 h. The reactions were quenched by methanol addition (500 µL) and precipitated protein was removed by centrifugation at 20,000 × g for 5 min prior to HPLC analysis.
YeiN kinetics
The rate constant kcat and the Michaelis constant KM were determined by measuring the initial rate of ΨMP synthesis catalyzed by YeiN (1 µM–120 µM) in 50 mM HEPES (pH 7.0) supplemented with 2 mM MnCl2. Reactions were incubated at 37 °C on a Thermomixer comfort (Eppendorf, Hamburg, Germany) with agitation of 300 rpm. Initial rates for ΨMP synthesis with nucleobases (Ura (12), 6-amino-Ura (13), 4-thio-Ura (14), 2-thio-Ura (15), 3-methy-Ura(16)) were determined at a constant d-Rib5P (1) concentration of 5 mM and at nucleobase concentrations between 0.05 mM and 1.00 mM. Initial rates for ΨMP synthesis with pentose 5-phosphates (d-Rib5P (1), d-dRib5P (2), d-Ara5P (4), d-Xyl5P(3) were determined at a constant Ura (10) concentration of 5 mM and 1.00−0.05 mM of the corresponding pentose 5-phosphate. Reactions were started by YeiN (1.00 µM−100 µM) added in a concentration adjusted to the enzyme’s specific activity with the substrate used. The reaction mixture was incubated for 20 min and quenched by methanol addition (1:1, v:v). Precipitated protein was removed by centrifugation at 20,000 × g for 5 min and samples were analyzed by HPLC. It was ensured that conversion was below 20%. Kinetic parameters were obtained from non-linear least squares fits (SigmaPlot; Systat, Erkrath, Germany) of initial ΨMP-synthesis rates to Michaelis–Menten Eq. (1). V is the initial rate, Vmax is the maximum initial rate, KM is the Michaelis constant and [S] is the initial substrate concentration. The kcat was determined from the relationship kcat = Vmax/[E], where [E] is the molar enzyme concentration. [E] was determined from the protein concentration and enzyme molecular weight.
Synthesis of ΨMP derivatives
Synthesis of ΨMP derivatives was performed in two steps in one pot. Reactions were started by addition of enzyme. All reactions were incubated at 37 °C without agitation.Step one: pentose phosphorylation. The reaction mixture contained 100 mM pentose (d-Rib, d-dRib, d-Xyl, d-Ara, 2F-d-Rib), 100 mM PEP, 2 mM ATP and 10 mM HEPES buffer (pH 7.0), 1 mM MgCl2 and 0.4 µM pyruvate kinase. For the phosphorylation of d-Rib 15 µM RbsK, for d-dRib 30 µM RbsK and for d-Ara andd-Xyl 60 µM RbsK was used. Reactions were performed in a total volume of 5 mL. Samples (5 µL) were taken at reaction start and then after 2 h and 20 h. They were analyzed by TLC.Step two: C-glycosylation using YeiN. After phosphorylation, the reaction mixture was diluted to a final concentration of 25 mM pentose 5-phosphate (d-Rib5P (1), d-dRib5P (2), d-Xyl5P (3), d-Ara5P(4)) using 10 mM HEPES (pH 7.0). Ten mM of nucleobase (Ura (12), 6-amino-Ura (13), 2-thio-Ura (14), 4-thio-Ura (15), 3-methyl-Ura (16)) were added to the phosphorylation reaction. ΨMP-synthesis was started with 15 µM YeiN. To compensate for the lower activity of YeiN towards different pentose 5-phosphates andnucleobases, the YeiNconcentration was adjusted as follows: for the synthesis of dΨMP 30 µM YeiN was used; for synthesis of Ara-ΨMP, Xyl-ΨMP, 6-amino-ΨMP, 2-thio-ΨMP and 4-thio-ΨMP, 90 µM YeiN was used. Samples (5 µL) were taken at start and after 5, 15, 30, 60, 120, 180, 240, 300, 360 min, and 10 h for 2-thio-ΨMP, Ara-ΨMP, Xyl-ΨMP. Each sample was quenched by 100 µL methanol. Precipitated protein was removed by centrifugation for 20,000 × g for 5 min and the sample was analyzed by HPLC. After completion of the reaction or latest after 10 h of incubation, the reactions were stopped by heat inactivation for 5 min at 99 °C. Precipitated protein was removed by centrifugation for 10 min at 4 °C and 16,000 × g.
ΨMP dephosphorylation
ΨMP (5 mM) was dissolved in 15 mL 10 mM HEPES buffer, pH 8.0, supplemented with 1 mM MgCl2 and incubated with 2 U/mL of CIP at 37 °C for 600 min. Ten microliters samples were withdrawn at start, and after 30, 90, 600 min, and quenched in 100 µL methanol. Precipitated enzyme was removed by centrifugation (16,000 × g, 4 °C, 5 min) and supernatant was analyzed by HPLC.
CMPK, NDK, PK, phosphorylation cascade
The reaction set-up consisted of 0.5 mM ATP, 5 mM purified ΨMP, 12 µM CMPK, 1 mM MgCl2, 30 µM PK, 12 mM PEP, 50 mM HEPES, pH 7.0, in a total volume of 250 µL. Ten µL samples taken at certain times were quenched in 100 µL methanol. The precipitated enzyme was removed by centrifugation (20,000 × g, 4 °C, 5 min) and supernatant was analyzed by HPLC. To test the effect of NDK removal for ΨTP-synthesis the same reaction set up without addition of 40 µM NDK was used (Supplementary Fig. 31).
One-pot multistep enzymatic synthesis of Ψ-triphosphates
Pentose phosphorylation was performed as described above under “Step one: pentose phosphorylation”. For ΨMP synthesis, 15 mM nucleobase (Ura (12), 6-amino-Ura (13), 2-thio-Ura (14), 4-thio-Ura (15), 3-methyl-Ura (16)), 1 mM MnCl2, and 10 mM HEPES buffer were added to the reaction mixture to a final concentration of 25 mM pentose 5-phosphate. The reaction was started with YeiN (1 µM for the ΨMP synthesis, 3 µM for the dΨMP synthesis and 90 µM for the synthesis using Uraderivatives). ΨMP phosphorylation was started with 15 mM PEP, 15 µM CMPK, and 1 mM ATP, resulting in a volume increase of 33%. Ten microliters of samples were quenched by addition of 100 μL methanol and precipitated protein was removed by centrifugation for 20,000 × g for 5 min prior to HPLC analysis. Enzymes were heat inactivated (5 min, 99 °C) after 21 h reaction and precipitated protein was removed by centrifugation for 10 min at 4 °C and 16,000 × g.
Dephosphorylation of pentose 5-phosphates
BT4131 (10 µM) and 1 mM MgCl2 were added to the reaction mixtures and incubated for 60 min of at 37 °C. The reaction was stopped by heat inactivation (5 min, 99 °C) and precipitated protein was removed by centrifugation for 10 min at 4 °C and 16,000 × g.
Anion exchange chromatography
Enzyme-free reaction mixture was applied (20 mL; 2 mL/min) to a FliQ FPLCcolumn (10 mL; 104 × 11.0 mm; Generon, Maidenhead, U.K.) packed with SuperQ-650M (Tosh, Tokyo, Japan) anion exchange resin mounted onto an ÄKTA prime plus FPLC system (GE Healthcare.). The column was equilibrated in 10 mM NaAc, pH 4.5. Products were elutedusing a linear gradient of 1 M NaAc, pH 4.5, from 0% to 50% in 120 mL with a flow of 4 mL/min and monitored at 260 nm. Following product elution, buffer B was set to 100% for 10 min to elute unspecifically bound substances. The fractions containing the desiredC-nucleoside were pooled andconcentratedunder reduced pressure (20 mbar, 40 °C) to a total volume of 10 mL using a rotary evaporator (Laborta 500-efficient; Heidolph Instruments, Schwabach, Germany).
Size exclusion chromatography
To remove excess NaAc, concentrated products were split in 5 × 2 mL portions and loaded onto a size exclusion column (16 × 1000 mm; XR 16/100 column; GE Healthcare) packed with Sephadex G10 (exclusion limit <700 Da) equilibrated in double distilledH2O and mounted onto an ÄKTA prime plus FPLC system (GE Healthcare). Compound elusion was performedusing doubly distilledwater and monitored at 260 nm. Fractions containing salt-free product were collected, pooled andconcentratedunder reduced pressure (20 mbar, 40 °C) to a total volume of 10 mL using a rotary evaporator (Laborta 500-efficient; Heidolph Instruments) and lyophilized.
Docking
Molecular docking was used to structurally evaluate the substrate scope of YeinN. AutoDock13[90] as implemented in Yasara v. 18.2.7 was used for enzyme-liganddocking. The AmberFB-15[91] force field and the default parameters provided by the standarddocking macro were used. The structure of YeiN (pdb code: 4GIL)[62] was used as a receptor in molecular-docking experiments. For dockings of sugar phosphates (d-Rib5P (1), d-Rul5P (10), d-Fru6P (9), d-Glu6P (7), d-Ery4P(11)), the ring-opened intermediate was deleted and a simulation cell of 15 Å × 10 Å × 10 Å was placed at the former location of d-Rib5P (1). Sugar phosphates were docked in the ring-open conformation. The protonation state of key catalytic residues (Glu31 protonated; Lys166deprotonated) was set prior to the docking experiment. The protonation states of all other protein residues were set automatically for a pH of 7.00. For dockings of nucleobases (Ura (12), 6-trifluoro-methyl-Ura (19), 6-Cl-Ura (22), 6-aza-Ura (21), 1-methyl-Ura (17), 5-fluoro-Ura (18), 5-methyl-Ura(20)), the uracil moiety was deleted and the charge of lysine was set to fit the covalent iminium intermediate. The protonation states of all other protein residues were set automatically for a pH of 7.00. A simulation cell of 15 Å × 10 Å × 10 Å was placed at the former location of Ura (11). Following the docking runs, the obtained poses were optimized by energy using the standard macro included in Yasara. All ligands were generatedusing the Grade Web Server (http://grade.globalphasing.org/cgi-bin/grade/server.cgi)[92]. Docking poses were evaluated by their associated free energy and mechanistic plausibility.
Synthesis of pseudouridinated RNA
In vitro translation (IVT) using T7 RNA polymerase (Thermo Fisher Scientific) was performed following the manufacturer’s protocol. Shortly, linear DNA template containing the T7 promoter and the yein gene was amplified from the YeiNexpression vector (peT15_yein) by PCR using Phusion polymerase and a set of primers complementary to the T7 promoter (TAATACGACTCACTATAGGG) or terminator (GCTAGTTATTGCTCAGCGG). The PCR product was purified via column purification (GenJET® PCR clean up; Thermo Fisher Scientific) and eluted in DEPC-water. IVT was performedusing 30 U of T7 RNA polymerase, 600 ng template DNA, Transcription Buffer (40 mM Tris-HCl pH 7.9, at 25 °C), 6 mM MgCl2, 10 mM DTT, 10 mM NaCl and 2 mM spermidine) and 2 mM of each NTP (ATP, GTP, UTP, CTP) in a total volume of 50 µL. Synthesis of modified RNA was performed in two ways; either 2 mM of ΨTP (or: dΨTP, 6-amino-ΨTP) was additionally added to the conventional NTPs or, UTP was fully replaced by 2 mM of either ΨTP, dΨTP or 6-amino-ΨTP. The reaction was incubated at 37 °C in the thermocycler (Doppio-Dual; VWR, Radnor, PA, USA) and the resulting PCR products were evaluated by agarose gel electrophoresis.
Synthesis of pseudouridinated DNA
To evaluate the potential of PCR catalyzed generation of pseudouridinatedDNA, the yein gene was amplified from 50 ng expression vector (pet15_yein) using 1.25 U of Dream-Taq DNA polymerase, 0.2 mM of dNTPs (dATP, dCTP, dGTP, dTTP), 0.5 µM T7_fwd, 0.5 µM T7_bwd primers andDreamTaq Buffer as recommended by the supplier (Thermo Fisher Scientific). To incorporate Ψ into the DNA either 0.2 mM of ΨTP were additionally added to dNTP mix or fully replaceddTTP by dΨTP. The temperature profile consisted of a preheating step at 95 °C for 1 min, followed by 30 reaction cycles of denaturation at 95 °C for 30 s, annealing at 55 °C for 30 s, and elongation at 72 °C for 2 min. The final extension step was carried out at 72 °C for 5 min. PCRs were performedusing the Doppio-Dual thermocycler (VWR). The resulting PCR products were evaluated by agarose gel electrophoresis.
In situ 1H-NMR of ΨMP synthesis or hydrolysis
YeiN-catalyzed synthesis of ΨMP from d-Rib5P (1) andUra (12), as well as hydrolysis of ΨMP, were analyzed by 1H-NMR. All reagents including the enzyme were dissolved in D2O and the reaction was set up in a 5 mm high-precision NMR sample tube (Promochem, Wesel, Germany). YeiN was buffered by washing the enzyme with 10-fold excess of 10 mM potassium phosphate buffer (pD 7.5) supplemented with 0.1 mM MnCl2, using centrifugation (16,000 × g, 15 min, 4 °C) with Vivaspin 500 centrifugal concentrators (10 kDa cut-off). The synthesis reaction contained 14 mM Ura (12) and 8 mM d-Rib5P (1), while 10 mM ΨMP were used for the hydrolysis reaction. Each reaction was initiated by adding 3 µM enzyme. The reactions were incubated at 30 °C without agitation in the electromagnetic field.Measurements were performed with a Varian INOVA 500-MHz NMR spectrometer (Agilent Technologies; Santa Clara, CA, USA). The VNMRJ 2.2D software was used for all measurements. 1H-NMR spectra (499.98 MHz) were recorded on a 5 mm indirect detection PFG-probe with pre-saturation of the water signal by a shaped pulse. The following standard pre-saturation sequence was used: 2 s relaxation delay; 90° proton pulse; 2.048 s acquisition time; 8 kHz spectral width; number of points 32 k.
1H-NMR analysis of YeiN-catalyzed deuterium exchange
The potential of YeiN-catalyzeddeuterium wash-in into Ura (12) without transient formation of ΨMP was evaluated by 1H-NMR. All reagents including the enzyme were dissolved in D2O and prepared as described above. The 600 µL reactions contained 5 mM Ura (12) and 5 mM of sugar phosphate (d-Rib5P (1), d-Ery4P (11), d-Rul5P (10), d-Glc6P (7), d-Fru6P(9)) in 10 mM potassium phosphate buffer (pD 7.5) supplemented with 0.1 mM MnCl2. Conversions were started by addition of 3 µM enzyme and incubated at 30 °C in a Thermomixer comfort (Eppendorf, Hamburg, Germany) with agitation of 300 rpm for 3 h. The reactions were stopped by heat treatment (5 min, 99 °C) and precipitated protein was removed by centrifugation for 10 min at 4 °C and 16,000 × g. The supernatant was transferred into a 5 mm NMR sample tube and analyzed by 1H-NMR.
NMR analysis of purified reaction products
Lyophilized products were dissolved to 3−10 mM in 50 mM potassium phosphate buffer (pD 7.9) in a total volume of 600 μL and transferred to a 5 mm high-precision NMR sample tube. They were analyzedusing a Varian INOVA 500-MHz NMR spectrometer (Agilent Technologies). The VNMRJ 2.2D software was used for measurements. 1H-NMR spectra (499.98 MHz) were measured on a 5 mm indirect detection PFG probe. Standard pre-saturation sequence was used: relaxation delay 2 s; 908 proton pulse; 2.048 s acquisition time; spectral width 8 kHz; number of points 32,000 13CNMR spectra (125.71 MHz). Standard pre-saturation sequence was used: relaxation delay 2 s; 908 proton pulse; 2.048 s acquisition time; spectral width 8 kHz; number of points 32 k. MNova 12.0.3 (Mestrelab Research S.L. Bajo, Santiago de Compostela, Spain) was used for data processing and analysis (Supplementary Figs. 10–29). Recorded spectra were in good agreement with publisheddata[75].
Authors: Erwin R van Rijssel; Pieter van Delft; Daan V van Marle; Stefan M Bijvoets; Gerrit Lodder; Herman S Overkleeft; Gijsbert A van der Marel; Dmitri V Filippov; Jeroen D C Codée Journal: J Org Chem Date: 2015-04-13 Impact factor: 4.354
Authors: Terhi Oja; Laila Niiranen; Tatyana Sandalova; Karel D Klika; Jarmo Niemi; Pekka Mäntsälä; Gunter Schneider; Mikko Metsä-Ketelä Journal: Proc Natl Acad Sci U S A Date: 2013-01-07 Impact factor: 11.205
Authors: Daniel E Eyler; Monika K Franco; Zahra Batool; Monica Z Wu; Michelle L Dubuke; Malgorzata Dobosz-Bartoszek; Joshua D Jones; Yury S Polikanov; Bijoyita Roy; Kristin S Koutmou Journal: Proc Natl Acad Sci U S A Date: 2019-10-31 Impact factor: 11.205