Vipender Singh1, Bogdan I Fedeles1, John M Essigmann2. 1. Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA Center for Environmental Health Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. 2. Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA Center for Environmental Health Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA jessig@mit.edu.
Abstract
Heterocyclic nucleic acid bases and their analogs can adopt multiple tautomeric forms due to the presence of multiple solvent-exchangeable protons. In DNA, spontaneous formation of minor tautomers has been speculated to contribute to mutagenic mispairings during DNA replication, whereas in RNA, minor tautomeric forms have been proposed to enhance the structural and functional diversity of RNA enzymes and aptamers. This review summarizes the role of tautomerism in RNA biochemistry, specifically focusing on the role of tautomerism in catalysis of small self-cleaving ribozymes and recognition of ligand analogs by riboswitches. Considering that the presence of multiple tautomers of nucleic acid bases is a rare occurrence, and that tautomers typically interconvert on a fast time scale, methods for studying rapid tautomerism in the context of nucleic acids under biologically relevant aqueous conditions are also discussed.
Heterocyclic nucleic acid bases and their analogs can adopt multiple tautomeric forms due to the presence of multiple solvent-exchangeable protons. In DNA, spontaneous formation of minor tautomers has been speculated to contribute to mutagenic mispairings during DNA replication, whereas in RNA, minor tautomeric forms have been proposed to enhance the structural and functional diversity of RNA enzymes and aptamers. This review summarizes the role of tautomerism in RNA biochemistry, specifically focusing on the role of tautomerism in catalysis of small self-cleaving ribozymes and recognition of ligand analogs by riboswitches. Considering that the presence of multiple tautomers of nucleic acid bases is a rare occurrence, and that tautomers typically interconvert on a fast time scale, methods for studying rapid tautomerism in the context of nucleic acids under biologically relevant aqueous conditions are also discussed.
Nucleic acid bases can exist in multiple tautomeric forms due to the presence of solvent-exchangeable protons. Tautomers are structural isomers that differ from one another based on the position of proton(s) and double bonds (Antonov 2014). The presence of multiple tautomers is expected to increase the structural and chemical diversity of nucleic acid bases, as shown in Figure 1. Structural diversification by tautomerism of canonical bases, their metabolic products and related analogs, can be utilized by many RNA enzymes and aptamers to execute some of their biological functions (Cochrane et al. 2007; Klein et al. 2007; Cochrane and Strobel 2008a; Thore et al. 2008; Gilbert et al. 2009; Wilcox et al. 2011; Singh et al. 2014). This review focuses on the importance of tautomerism in catalysis of small self-cleaving ribozymes and in the recognition of certain ligand analogs by the purine and TPP riboswitches. Additionally, this review also discusses methods that can be utilized for studying tautomerism under biologically relevant aqueous conditions, an area that has traditionally posed many experimental challenges, because typically, minor tautomeric forms of nucleic acid bases are rare and interconvert rapidly in aqueous solution.
FIGURE 1.
Tautomers of (A) adenosine, (B) guanosine, (C) uridine, (D) xanthosine, (E) cytidine, and (F) oxythiamine.
Tautomers of (A) adenosine, (B) guanosine, (C) uridine, (D) xanthosine, (E) cytidine, and (F) oxythiamine.The term tautomerism was first used by Laar in 1886 to describe a dynamic equilibrium between two compounds containing weakly bonded hydrogen atoms (Laar 1886). This type of tautomerism that occurs from repositioning or movement of protons is called prototropic tautomerism. Nucleic acid bases have carbonyl and amino functional groups, which contain solvent-exchangeable (weakly bonded) hydrogen atoms that can participate in keto–enol and amino–imino types of tautomerism (Fig. 1; Watson and Crick 1953; Topal and Fresco 1976). Specifically, adenine has the ability to adopt amino and imino tautomeric forms involving the exocyclic group at the 6-position; uracil and thymine have carbonyl functional groups that can participate in keto–enol tautomerism. Guanine and cytosine have both amino and carbonyl groups, thus they can exhibit both amino–imino and keto–enol types of tautomerism (Fig. 1). Of all possible tautomeric forms of the canonical nucleobases, the keto- and amino-forms predominate under physiological conditions, and are thus considered “major” tautomers. The imino- and enol forms are considered “minor” tautomers and are typically very rare. This review covers specifically the instances where the minor tautomers of nucleobases and base analogs have been shown to play critical roles in biochemical processes pertaining to, or involving RNA molecules.Although all nucleic acid bases can potentially adopt minor tautomeric forms, only the tautomeric equilibria of guanine have been proposed to play essential roles in RNA biochemistry. Specifically, minor tautomeric and/or ionized forms of catalytic guanosines have been implicated in the acid–base catalysis of the autolytic cleavage reactions of self-cleaving ribozymes, such as hairpin, hammerhead, and glmS (Cochrane and Strobel 2008a). Tautomerism has also been invoked as a molecular rationale for the interaction of certain base analogs with RNA aptamers. For example, the purine riboswitch (Gilbert et al. 2009), and the thiamine pyrophosphate (TPP) riboswitch (Thore et al. 2008; Gilbert et al. 2009; Singh et al. 2014) are proposed to bind to the minor tautomeric forms of their nonnative ligands, xanthine, and oxythiamine pyrophosphate (OxyTPP), respectively.Studying tautomerism in aqueous conditions is challenging (Singh et al. 2014). While tautomerism of nucleic acid bases has been studied in aprotic solvents, gas phase or excited state conditions, the conclusions derived are less relevant for RNA biochemistry. Under these conditions, tautomeric equilibria are significantly altered, and the relative proportion of major and minor tautomeric forms is changed, even for the canonical nucleobases (Nir et al. 2002). Challenges of studying tautomerism in aqueous conditions include the fast rates of the tautomeric equilibria, the low abundance of minor tautomers, and the high chemical and structural similarity, between the corresponding minor and major tautomeric species. Traditional spectroscopic methods, although sensitive to certain chemical and structural properties, have been of limited use for studying tautomerism under aqueous conditions (Peng and Tokmakoff 2012; Singh et al. 2014). However, recent important developments in spectroscopic methods are beginning to overcome historical limitations and are starting to allow comprehensive characterization of tautomeric equilibria of base analogs free in solution or in the context of nucleic acids (Singh et al. 2014). Such methods (discussed later in this review) are based on variable temperature NMR and 2D IR spectroscopies, binding isotope effect measurements combined with theoretical calculations (Singh et al. 2014), and high-resolution crystallography (Bebenek et al. 2011; Wang et al. 2011; Demeshkina et al. 2012).This review article examines the following topics: (i) the role of tautomerism of catalytic guanosines in the catalysis of hammerhead, hairpin, and glmS small self-cleaving ribozymes; (ii) the importance of ligand tautomerism in the recognition of xanthine by the purine riboswitch and OxyTPP by the TPP riboswitch; and (iii) current methods available for studying tautomerism of nucleobases and analogs free in solution and in RNA context under biologically relevant aqueous conditions. We also discuss the contribution of nucleobase tautomerism in creating structural and functional diversity in RNA biochemistry.
BIOCHEMICAL FUNCTIONS AND MECHANISMS OF CHEMICAL DIVERSIFICATION IN RNA
Despite the limited complexity of the RNA building blocks (the four canonical ribonucleotides), RNA enzymes and aptamers are capable of efficiently performing a wide range of biochemical functions (for review, see Cochrane and Strobel 2008a; Hiller and Strobel 2011). Small self-cleaving ribozymes promote acid–base-catalyzed autolytic reactions at rates comparable to protein enzymes (Nakano et al. 2000; Bevilacqua et al. 2004; Cochrane and Strobel 2008a). These ribozymes were originally found in the genomes of many RNA viruses, where they are thought to participate in resolving products of rolling circle replication, but studies have shown that they are more widely distributed (Ferre-D'Amare and Scott 2010). In addition to naturally occurring ribozymes, artificial ribozymes have also been obtained through selection; such artificial ribozymes can catalyze a wide range of chemistries including carbon–carbon bond formation (Diels–Alder reaction) (Seelig and Jäschke 1999), isomerization (Prudent et al. 1994), redox chemistry (Tsukiji et al. 2003), and even small molecule biosynthesis (Huang et al. 2000). In addition, RNA molecules also play diverse regulatory roles in many biological processes. For example, RNA aptamers, riboswitches and T-box RNAs recognize their target ligands with high affinities (micromolar to picomolar) and specificities to regulate expression of genes involved in the biosynthesis and transport of their respective ligands. Additionally, the active site of the ribosome, which catalyzes the peptidyl transferase reaction, is entirely composed of RNA (Nissen et al. 2000).To increase their chemical versatility, RNA enzymes and aptamers use multiple mechanisms. One such mechanism involves nucleobase tautomerism and/or nucleobase ionization. Many small-cleaving ribozymes and riboswitches are believed to utilize tautomerism and/or ionization to execute their functions. Another general mechanism involves covalent modifications of the RNA nucleotides with alkyl group and other functional groups, which greatly expand the chemical repertoire of the oligoribonucleotides. For example, tRNA molecules have been shown to feature over a hundred of different modified nucleosides (Su et al. 2014).RNA aptamers or riboswitches also harness the chemical proficiency of cofactors such as flavin mononucleotide (Winkler et al. 2002a), S-adenosylmethionine (Winkler et al. 2003), lysine (Sudarsan et al. 2003), glycine, thiamine pyrophosphate (TPP), and glucosamine-6-phosphate to perform their biochemical functions. In addition, divalent cations such as Mg2+ and backbone phosphate oxygens also play key functional roles in catalysis of many ribozymes (Cochrane and Strobel 2008a; Donghi and Schnabl 2011). Biochemical roles of cofactors and metal ions in RNA biochemistry have been studied elsewhere (for review, see Cochrane and Strobel 2008b; Donghi and Schnabl 2011). Furthermore, as indicated above, chemical modifications of nucleobases (such as the ones observed in tRNA and rRNA) are also known to contribute to an increased chemical versatility of RNA, as detailed in Björk et al. (1999) and Czerwoniec et al. (2009). In contrast, tautomerism as a mechanism of chemical diversification in RNA has not been well studied.
MECHANISM OF TAUTOMERIZATION OF RNA BASES
In aqueous solution, the interconversion among different prototropic tautomers is acid–base catalyzed. Using model systems for nucleic acid bases, it has been shown that the mechanism involves water-mediated exchange of proton(s) between the donor and the acceptor atoms, with equilibrium being established on a nanosecond time scale (Peng and Tokmakoff 2012; Peng et al. 2013). Acid catalyzed interconversion involves formation of a cation intermediate through protonation, followed by deprotonation at a different location to generate another tautomeric form. Base catalyzed tautomeric interconversions involve deprotonation to form an anion, followed by protonation at another location to generate a neutral alternative tautomeric form. Ionized (cationic or anionic) states of nucleic acid bases have been well characterized in many RNA systems (Colominas et al. 1996; Bevilacqua et al. 2004; Das and Piccirilli 2005; Perrotta et al. 2006; Gong et al. 2007, 2011; Suydam and Strobel 2008; Viladoms et al. 2011; Wilcox et al. 2011; Viladoms and Fedor 2012; Wilcox and Bevilacqua 2013). Given the observed mechanism of tautomerization, such ionized states suggest the existence of tautomeric equilibria, as they can be intermediates in the acid- or base-mediated conversion of one tautomeric form to another.Tautomeric equilibria are influenced by chemical and physical factors including the presence of metals, temperature (Peng et al. 2013; Singh et al. 2014) and pH (CS Peng, V Singh, BI Fedeles, D Li, T Amariuta, JM Essigmann, and A Tokmakoff, in prep.). The effect of metals on nucleobase tautomerism has been reviewed previously (Lippert and Gupta 2009), while the influence of temperature on tautomeric equilibria is discussed later in this review. Given the acid–base catalytic mechanism of tautomerism in aqueous solutions, pH is expected to dictate the relative amounts of different tautomers at equilibrium. Therefore, the concentration of ionized intermediates that promote tautomerism, regardless of their charge state, is maximal when the pH is close to the pKa of the functional groups involved in tautomerism. Consequently, under physiological conditions (pH 7), compounds with pKa’s ∼7 are expected to display a maximum diversity of tautomers. However, the functional groups in RNA bases have unperturbed pKa values that are either significantly lower than neutrality (pKa = 4.2 for N3 of cytidine and pKa = 3.5 for N1 of adenosine), or higher than the neutral pH (pKa = 9.2 for N1 of guanosine and pKa = 9.2 for N3 of uridine) (Bevilacqua et al. 2004). Therefore, under physiological conditions, the tautomeric equilibria are shifted toward the keto and amino forms, which are the predominant (major) tautomeric forms observed. As a result, the proportion of minor tautomers of canonical nucleic acid bases is small around physiological pH. The presence of only one major tautomeric form at physiological pH (pH ∼7) is critical for maintaining the genomic information integrity during DNA and RNA replication and the structural integrity of RNA enzymes and aptamers.If the pKa’s of nucleic acid bases, however, are perturbed toward physiological pH, tautomerism is expected to play a more significant role in the biochemistry of nucleic acids. An example of a nucleoside analog with a pKa ∼7.0 is 5-aza-5,6-dihydro-2′-deoxycytidine (KP1212) (Li et al. 2014; CS Peng, V Singh, BI Fedeles, D Li, T Amariuta, JM Essigmann, and A Tokmakoff, in prep.), an anti-HIV agent currently in phase-IIa clinical trials. KP1212 has been shown to exist in as many as five different tautomeric forms (Li et al. 2014). One of the consequences of having multiple tautomeric forms is ambiguous base-pairing, as different tautomeric forms are expected to have different and distinct base-pairing preferences. Indeed, KP1212 is a mutagenic compound, causing G to A and A to G mutations, with this property being successfully exploited as an antiviral strategy (Li et al. 2014). Considering that pKa shifts of up to ∼4 units toward the physiological pH have been observed in RNA enzymes (Legault and Pardi 1997; Ravindranathan et al. 2000; Gong et al. 2007; Wilcox and Bevilacqua 2013), formation of minor tautomers of nucleic acid bases is expected to occur to a significant extent and play a functional role in these systems.Besides the prototropic tautomerism that stems from proton repositioning, other types of prototropic tautomerism exist including annular (Minkin et al. 2000; Alkorta et al. 2006), valence (Sato et al. 2007) and ring-chain (Baker et al. 1924) tautomerism. The annular tautomerism explores a quantum mechanical property of a proton in which the proton can simultaneously reside on multiple positions on a heterocycle, such as a purine or pyrimidine. Valence tautomerism involves electronic exchanges without the repositioning of protons and is more common in transition metal complexes (Sato et al. 2007). The ribose or deoxyribose in nucleosides can also participate in ring-chain tautomerism (Jones 1963), a type of tautomerism in which one of the tautomeric forms is cyclic and the interconversion is mediated by the transfer of a proton. An example of ring-chain tautomerism enables mutarotation of sugars (Baker et al. 1924). As the functional importance of valence, annular and ring-chain tautomerism in nucleic acids remains to be established, this review only focuses on examples of prototropic tautomerism that arises from the repositioning of protons in the context of RNA systems.
ROLE OF TAUTOMERISM IN RNA BIOCHEMISTRY
Self-cleaving ribozymes such as hammerhead, hairpin, and glmS are proposed to utilize minor tautomeric or ionized forms of catalytic guanosines to perform their catalytic function (Figs. 4, 5, below; Cochrane and Strobel 2008a). Tautomerism has also been proposed to influence recognition of xanthine and OxyTPP by the purine and TPP riboswitches, respectively (Thore et al. 2008; Gilbert et al. 2009; Singh et al. 2014). Although not discussed below, tautomeric preference of DNA and RNA polymerases is another area where formation of minor tautomeric forms may have important physiological consequences; specifically, the minor tautomeric forms of canonical nucleobases have been long speculated to induce spontaneous mutations, both during DNA replication (Watson and Crick 1953; Topal and Fresco 1976; Bebenek et al. 2011; Wang et al. 2011) and during transcription (Shugar and Kierdaszuk 1985). Considering different tautomers of nucleic acid bases and analogs have different chemical properties and base-pairing preferences (Topal and Fresco 1976; Peng et al. 2013; Li et al. 2014; Singh et al. 2014), formation of alternative tautomeric forms is expected to influence RNA biochemistry.
Role of tautomerism in recognition of ligands by riboswitches
Structural and biochemical studies have predicted that purine and the thiamine pyrophosphate riboswitches bind to their target ligand analogs by preferentially recognizing their minor tautomeric forms (Thore et al. 2008; Gilbert et al. 2009; Singh et al. 2014). Riboswitches are structured RNA regulatory elements found in the noncoding region of many bacterial mRNAs (Roth and Breaker 2009; Breaker 2011). They bind to small molecules to regulate the expression of the downstream genes. Below we review studies where specific minor tautomeric forms of xanthine and OxyTPP ligands are proposed to be recognized preferentially by the purine and the TPP riboswitches, respectively.
Recognition of xanthine by the purine riboswitch
The purine riboswitch regulates the expression of genes involved in the metabolism of adenine and guanine (Mandal et al. 2003; Mandal and Breaker 2004). In addition to binding to normal purines, this riboswitch has been shown to also bind purine metabolites (hypoxanthine, xanthine) and synthetic analogs (such as 2,6-diaminopurine) (Gilbert et al. 2009). Structural and biochemical studies of xanthine bound to the purine riboswitch indicated that the heterocyclic base binds as the 2-enol minor tautomer (Gilbert et al. 2009). Crystal structures of the riboswitch with various ligands show that a single pyrimidine at the 74-position is critical for determining ligand specificity (Fig. 2A; Batey et al. 2004; Serganov et al. 2004; Gilbert et al. 2006a, 2009). The xanthine ligand forms a Watson–Crick base pair with C74 (Gilbert et al. 2009). Conserved residues at the U51, U47, and U22 positions in the aptamer form part of the binding pocket and contribute to ligand recognition (Fig. 2). The carbonyl oxygens (O2) of C74 and U51 form hydrogen bonds with the 2-amino functional group of the native ligand guanine (Serganov et al. 2004). Hypoxanthine does not have the 2-amino group, thus binding to the purine riboswitch with almost 200-fold less affinity compared with guanine (Batey et al. 2004). Besides the 2-amino functional group, the 6-keto, and N1 groups are also recognized by the riboswitch to form a Watson–Crick type of base-pairing (Batey et al. 2004; Serganov et al. 2004; Gilbert et al. 2006b).
FIGURE 2.
(A) The 2F−F omit map of xanthine bound to the purine riboswitch (coloring scheme: carbons, green; nitrogens, blue; and oxygens, red). (B) Proposed interactions for the 2-enol tautomer of xanthine bound to the purine riboswitch. (C) Proposed interactions for the 2-keto tautomer of xanthine bound to the purine riboswitch.
(A) The 2F−F omit map of xanthine bound to the purine riboswitch (coloring scheme: carbons, green; nitrogens, blue; and oxygens, red). (B) Proposed interactions for the 2-enol tautomer of xanthine bound to the purine riboswitch. (C) Proposed interactions for the 2-keto tautomer of xanthine bound to the purine riboswitch.The carbonyl oxygens of C74 and U51 surround the 2-position of ligands to create a highly negatively charged binding pocket. Therefore, the riboswitch strongly prefers ligands that have hydrogen bond donor groups at their 2-position, such as the amine in guanine or 2,6-diaminopurine. Xanthine, with an oxygen at the 2-position was not expected to bind to the riboswitch, yet it binds albeit weakly with a KD of 32 µM (Gilbert et al. 2009). The crystal structures of the riboswitch bound to xanthine or guanine reveal no significant structural differences between guanine and xanthine binding (Fig. 2; Gilbert et al. 2009). To rationalize the biochemical basis of xanthine recognition, it was proposed that the 2-enol minor tautomeric form of xanthine would partially alleviate the repulsive interactions that exist due to the presence of three negatively charged oxygens in close proximity. This proposal was also consistent with the biochemical observation that xanthine binds the riboswitch at pH 6 (KD = 33 µM), but no binding is detected at pH 8.5, because enol tautomers are expected to be more stable at lower pH (Gilbert et al. 2009). On the basis of these observations, it was speculated that the purine riboswitch preferentially binds to the 2-enol tautomer of xanthine (Gilbert et al. 2009). However, no direct evidence exists showing the presence of minor tautomeric forms of xanthine either in isolation or bound to the purine riboswitch. As it is shown in the Methods section of this review, techniques based on 18O binding isotope effects combined with the density functional theory (DFT) calculation could be used to characterize the tautomeric form of xanthine bound to the riboswitch. The tautomeric equilibria of the unbound form of xanthine could also be studied using 2D IR and variable temperature NMR methods. We applied a combination of these methods to characterize the tautomeric form(s) of oxythiamine in the context of the TPP riboswitch, as described in the next section (Singh et al. 2014).
Recognition of oxythiamine pyrophosphate by the thiamine pyrophosphate riboswitch
Our work on the tautomerism of OxyTPP in the context of the TPP riboswitch utilized a combination of spectroscopic, biochemical, and theoretical techniques to identify all three tautomeric forms of oxythiamine (OxyT) in the unbound form and the tautomeric form of oxythiamine pyrophosphate (OxyTPP) in the bound form (Singh et al. 2014). The TPP riboswitch binds to TPP, providing negative regulation of the expression of genes involved in the biosynthesis and transport of this essential vitamin (Winkler et al. 2002b, 2004). Crystal structures of the TPP riboswitch from thale cress (Arabidopsis thaliana) in complex with various ligands have been solved showing that the G28 within the riboswitch is critical for determining ligand specificity (Thore et al. 2006, 2008). The X-ray structure of the riboswitch with the TPP ligand shows that the amino group at the 4′-position of TPP acts as a hydrogen bond donor to the N3 position of G28 (Fig. 3A; Thore et al. 2006). OxyTPP, a presumably natural metabolite of TPP, features a keto group instead of the 4′-amino functionality and could play a role in maintaining TPP homeostasis via its interaction with the TPP riboswitch (Singh et al. 2014). Structural studies of OxyTPP with the TPP riboswitch indicate that, under crystalline conditions, its hydrogen bonding interactions to G28 are almost identical to those of TPP (Fig. 3A; Thore et al. 2008). Since the 4′- position of OxyTPP can only act as a hydrogen bond donor in its enol form, it was speculated that OxyTPP binds to the riboswitch as an enol tautomer (Thore et al. 2008).
FIGURE 3.
(A) Interactions of thiamine and oxythiamine pyrophosphate (TPP and OxyTPP, respectively) with the G28 of the TPP riboswitch (generated using the pdb 3D2X: coloring scheme; nitrogens are shown in blue, carbons in white, oxygens in red, and sulfur in yellow). (B) (Top) Tautomeric forms of oxythiamine (OxyT), as identified using 2D IR and variable temperature NMR spectroscopy (shown in Fig. 6B); (bottom) tautomeric form of OxyTPP bound to the TPP riboswitch as determined using binding isotope effects and DFT calculations (shown in Fig. 6B). (C) Proposed interactions for the keto and enol forms of OxyTPP with the G28 of the TPP riboswitch.
(A) Interactions of thiamine and oxythiamine pyrophosphate (TPP and OxyTPP, respectively) with the G28 of the TPP riboswitch (generated using the pdb 3D2X: coloring scheme; nitrogens are shown in blue, carbons in white, oxygens in red, and sulfur in yellow). (B) (Top) Tautomeric forms of oxythiamine (OxyT), as identified using 2D IR and variable temperature NMR spectroscopy (shown in Fig. 6B); (bottom) tautomeric form of OxyTPP bound to the TPP riboswitch as determined using binding isotope effects and DFT calculations (shown in Fig. 6B). (C) Proposed interactions for the keto and enol forms of OxyTPP with the G28 of the TPP riboswitch.
Spectroscopic approaches based on variable temperature NMR and 2D IR spectroscopies were used to show that OxyT, under physiologically relevant aqueous conditions, exists in three possible tautomeric forms: 4′-keto-N1′H-OxyT, 4′-keto-N3′H-OxyT, and 4′-enol-OxyT (Fig. 3B; Singh et al. 2014). Application of these methods also allowed quantitation of the relative distribution of tautomers at equilibrium. The spectroscopic data showed that 4′-keto-N1′H-OxyT is the predominant tautomeric state in the unbound form followed by 4′-keto-N3′H-OxyT and 4′-enol-OxyT (Fig. 6B, below). Experimental binding isotope effects (BIE) measured under aqueous conditions, combined with DFT calculations showed that the TPP riboswitch actually binds to OxyTPP in its keto tautomeric form, namely 4′-keto-N1′H-OxyT (Fig. 3B). The presence of the keto tautomeric form of OxyTPP, however, is expected to form unfavorable interactions with the residues in the binding pocket of the riboswitch, a finding consistent with the observation that OxyTPP binds 33 times weaker than TPP (Singh et al. 2014). Because the TPP cellular concentration is tightly regulated, the lower binding affinity of OxyTPP allows activation of the riboswitch in response to oxidative damage to the TPP pool.The presence of similar hydrogen bonding interactions for the keto form of OxyTPP and TPP ligands with the G28 of the TPP riboswitch suggested that the N3 of G28 is protonated in its interaction with OxyTPP (Fig. 3C; Singh et al. 2014). Protonation of N3 of G28 would either generate an ionized cationic form or a neutral minor tautomeric form produced through subsequent deprotonation of the cationic G28 elsewhere on the base. Further work is needed to determine whether these forms (cationic or minor tautomeric) of G28 exist and/or interact with the 4′-keto-OxyTPP ligand. The elucidation of the details of this interaction will significantly enhance our understanding of the role played by the structural diversification through tautomerism/ionization of the canonical nucleic acid bases.
Tautomerism in small autolytic ribozymes catalysis
Small self-cleaving (autolytic) ribozymes catalyze nucleolytic intramolecular self-scission reactions. Well-studied examples of small self-cleaving ribozymes include hammerhead, hairpin, glmS, Varkud Satellite (VS), Hepatitis delta virus (HDV), HDV-like, and twister ribozymes. With the exception of glmS, small autolytic ribozymes do not require participation of an external coenzyme to catalyze the self-cleavage reaction (Cochrane and Strobel 2008a). The self-cleavage reaction is driven by the base-catalyzed activation of a key 2′-hydroxyl nucleophile. The activation is followed by an internal transesterification reaction, in which the nucleophilic 2′-oxygen attacks the adjacent scissile 3′-phosphate to form two pieces of RNA, one containing the 2′,3′-cyclic phosphate and the other the 5′-OH functional group (Cochrane and Strobel 2008a).Tautomerism and/or ionization of catalytic guanosines have been invoked in the initial step of the activation of the 2′-hydroxyl nucleophile. A catalytic guanosine with unprotonated N1 has been proposed to participate in the reaction as a general base to activate the 2′-hydroxyl for nucleophilic attack. Consistent with this mechanism, structural and biochemical studies of hammerhead, hairpin, and glmS ribozymes show that the N1 of catalytic G is in close proximity, within hydrogen bonding distance, to the 2′-hydroxyl nucleophile adjacent to the scissile phosphate (Figs. 4A, 5; Rupert and Ferré-D'Amaré 2001; Martick and Scott 2006). A similar mechanism, based on biochemical observations, has also been proposed for the VS ribozyme; however, in the absence of crystal structure information, the type of nucleic acid base (adenine or guanine) in close structural proximity to the catalytic 2′-hydroxyl in the VS ribozymes is uncertain.
FIGURE 4.
(A) The active site of the glmS ribozyme in the presence of glucosamine-6-phosphate (GlcN6P), showing the N1 of catalytic guanosine (G33) in close proximity to the 2′-OH nucleophile (generated using pdb 2NZ4). (B) Schematic of the autolytic reaction catalyzed by the glmS ribozyme. (C) Guanosine and its tautomeric or ionized forms in which the N1 position is not protonated.
FIGURE 5.
(A) General catalytic mechanism used by self-cleaving ribozymes—glmS, hairpin, and hammerhead with either the anionic form (top) or the minor tautomeric form (bottom) of catalytic guanosines participating in the reaction as a general base. (B) Structure of the active site of the precleaved hairpin ribozyme, emphasizing the role of G8 in the activation of the 2′-OH nucleophile (generated from pdb 1M5K). (C) Active site of the hammerhead ribozyme (generated from pdb 3ZD5). A similar mechanism is also used by the glmS ribozyme. The active site structure is shown in Figure 4A. Coloring scheme: The nucleotides flanking the scissile phosphate are in blue; the nucleotides that interact with the reactive atoms are shown in yellow. Hydrogen bonds between the catalytic guanosines (shown in green [carbon], blue [nitrogen], and red [oxygen]) and the 2′-OH nucleophile are shown as dashed lines.
(A) The active site of the glmS ribozyme in the presence of glucosamine-6-phosphate (GlcN6P), showing the N1 of catalytic guanosine (G33) in close proximity to the 2′-OH nucleophile (generated using pdb 2NZ4). (B) Schematic of the autolytic reaction catalyzed by the glmS ribozyme. (C) Guanosine and its tautomeric or ionized forms in which the N1 position is not protonated.(A) General catalytic mechanism used by self-cleaving ribozymes—glmS, hairpin, and hammerhead with either the anionic form (top) or the minor tautomeric form (bottom) of catalytic guanosines participating in the reaction as a general base. (B) Structure of the active site of the precleaved hairpin ribozyme, emphasizing the role of G8 in the activation of the 2′-OH nucleophile (generated from pdb 1M5K). (C) Active site of the hammerhead ribozyme (generated from pdb 3ZD5). A similar mechanism is also used by the glmS ribozyme. The active site structure is shown in Figure 4A. Coloring scheme: The nucleotides flanking the scissile phosphate are in blue; the nucleotides that interact with the reactive atoms are shown in yellow. Hydrogen bonds between the catalytic guanosines (shown in green [carbon], blue [nitrogen], and red [oxygen]) and the 2′-OH nucleophile are shown as dashed lines.Although the N1 of G has been implicated in the activation of the catalytic 2′-hydroxyl, the discrepancy in pKa’s between the N1 of G and the 2′-OH nucleophile has been difficult to explain. The N1 of guanosine has a pKa of ∼10, and therefore at physiological pH, it is expected to be fully protonated and thus, not chemically suitable to deprotonate the 2′-OH, which has a pKa of ∼13 (Velikyan et al. 2001). However, a tautomeric and/or ionized form of catalytic G in which the N1 is unprotonated is anticipated to be more nucleophilic, and thus chemically capable of activating the 2′-hydroxyl nucleophile. Moreover, it is possible that the pKa of N1 would be perturbed in a minor tautomeric form, because of different electronic distribution around N1.As explained earlier, tautomerization and/or ionization of G to generate a form with unprotonated N1 would occur to a greater extent at pKa’s around the physiological pH. Significant alterations in pKa’s are not uncommon and changes of up to 4 units have been observed in RNA enzymes (Legault and Pardi 1997; Wilcox and Bevilacqua 2013). Furthermore, in the context of RNA, minor tautomeric forms of nucleosides are expected to be more stable than the ionized (anionic) forms. Formation of minor tautomer through protonation at an alternative site would stabilize the active (unprotonated N1) anionic form of the catalytic guanosine, by removing the negative charge of an anionic intermediate, which in a polyanionic system like RNA would be unfavorable. The next section discusses specific experimental evidence suggestive of the involvement of the N1 of catalytic G, through ionization or tautomerism, in the chemistry of small self-cleaving ribozymes.
Potential for tautomerism in the glmS ribozyme-catalyzed nucleolytic reaction
The glmS riboswitch/ribozyme binds to glucosamine-6-phosphate (GlcN6P) to regulate expression of the gene that encodes for glutamine-fructose-6-phosphate amidotransferase, the biosynthetic enzyme for GlcN6P (Milewski 2002; Winkler et al. 2004; Collins et al. 2007; McCown et al. 2012). The glmS riboswitch is also a ribozyme that utilizes the self-cleavage reaction as a mode of genetic regulation. The reaction is ligand-dependent, the ligand participating in the reaction as a general acid (McCarthy et al. 2005; Cochrane et al. 2007). Its reaction mechanism is similar to RNase A and involves an attack of the vicinal 2′-OH on the scissile phosphate to generate two fragments of RNA, one containing the 2′,3′ cyclic phosphate and the other having a free 5′-OH group (Klein and Ferré-D'Amaré 2006; Cochrane et al. 2007; Cochrane and Strobel 2008a). Structural and biochemical analysis supports a mechanism in which the transphosphorylation reaction is initiated by the base-catalyzed activation of the vicinal 2′-hydroxyl nucleophile by the N1 of a catalytic guanine at the 33-position (G33) (Klein and Ferré-D'Amaré 2006; Cochrane et al. 2007; Klein et al. 2007). The amine of GlcN6P, through fine tuning of its pKa (Davis et al. 2011; Gong et al. 2011; Wilcox et al. 2011), participates in the transesterification reaction as a general acid by donating a proton to the 5′-oxygen of the leaving group RNA fragment (Klein and Ferré-D'Amaré 2006; Cochrane et al. 2007; Viladoms and Fedor 2012). Brønsted coefficient of 0.7 measured for the 5′-oxygen supports a late transition state in which its protonation is rate-limiting (Viladoms and Fedor 2012).The G33 residue of glmS is a critical residue for the glmS catalyzed self-cleavage reaction (Cochrane et al. 2007; Klein et al. 2007; Cochrane and Strobel 2008a), because the N1 position of G33 is structurally in close proximity to the 2′-OH nucleophile (Fig. 4A). Replacement of the G33 with any other canonical nucleobase significantly reduces the rate of the self-cleavage reaction by a factor of 103–105 (Cochrane et al. 2007; Klein et al. 2007). Given the functional importance of G33 in the cleavage reaction and the structural proximity to the 2′-hydroxyl group, it was proposed that the N1 of G33 directly activates the 2′-OH nucleophile (Cochrane et al. 2007). However, as mentioned earlier the N1 of guanine has a pKa ∼10, making it a poor base to abstract a proton from the 2′-OH, with a pKa of 13. Deprotonation of the N1 of G33 through formation of either the minor tautomeric form (6-enol) or the anionic form, as shown in Figures 4C and 5A, could generate a form of G33 more suitable to participate in the glmS catalyzed reaction as a general base (Cochrane et al. 2007; Cochrane and Strobel 2008a). Deprotonation of N1 of G either through the formation of an anionic state or a minor tautomeric form would however require significant alteration in the pKa of the G33. Given that ionized states of catalytic guanosines have been identified (Wilcox et al. 2011), we suspect that minor tautomers are also formed and play a role in the mechanism of self-cleaving ribozymes; however, identifying such minor tautomers in the context of RNA has been experimentally challenging. Further work is needed to characterize the functional state of the catalytic G33 guanosine in the glmS ribozyme. Methods based on binding isotope effects, which are described later in this review, could be useful to characterize the functional state of the G33 catalytic guanosine.
Possible role of tautomerism in catalysis of hammerhead ribozyme reaction
The hammerhead ribozyme is a subcategory of satellite RNAs originally found in the genomes of plant viruses, but now known to be present in many eukaryotic genomes (McKay 1996; Hammann et al. 2012). It participates in catalyzing the cleavage and ligation of RNA as part of the rolling circle replication to resolve the replicated products (McKay 1996), and may contribute to RNA processing such as RNA-catalyzed ligation and trans-cleavage (Hammann et al. 2012).Structural and biochemical studies of the hammerhead ribozyme are consistent with the observation that the N1 of the catalytic guanosine G12 is in close proximity to the 2′-hydroxyl nucleophile and plays an important role in its activation (Fig. 5C; McKay 1996; Martick and Scott 2006; Thomas and Perrin 2008). The N1 of the catalytic guanosine is within hydrogen bonding distance (3.5 Å) from the 2′-OH nucleophile (Martick and Scott 2006). G12 exhibits a log-linear trend of activity with pH that does not plateau even at the highest pH, indicating general base catalysis (Han and Burke 2005). Replacement of G12 with any other canonical nucleobase results in 102- to 105-fold decrease in the reaction rate (Ruffner et al. 1990) with maximal reduction of 105-fold observed for the adenine substitution (Chi et al. 2008). Structural studies have shown that substitution of G12 to A does not perturb its structure, suggesting that the decrease in the rate of cleavage is primarily from the pKa shift, which changes from ∼9.5 to ∼3.5 upon G12A substitution (Chi et al. 2008). In addition, even though the A12 (with unprotonated N1) is a much weaker base than G12 (protonated N1) because of its lower pKa, G12A substitution results in a better alignment of N1 of the purine with the 2′-OH nucleophile (3.5 Å for G versus 2.5 Å for A), indicating the preference for the deprotonated N1 as a general base catalyst (Chi et al. 2008). Furthermore, substituting G12 with analogs that specifically disrupt the pKa of N1 of G, such as inosine, diaminopurine, or 2-aminopurine, affects the overall rate of the reaction with maximum reduction of 103 observed with 2-aminopurine (Han and Burke 2005), which suggests a direct role of the N1 of G in the chemistry of hammerhead ribozyme. In addition, replacement of the nucleophilic 2′-hydroxyl group with the electrophilic 2′-bromoacetamide group results in alkylation of the N1 position of G12 in a pH and Mg2+-dependent manner that is consistent with the direct involvement of the N1 of G12 in the cleavage reaction (Thomas and Perrin 2008).Taken together, the biochemical and structural data are consistent with a mechanism in which the N1 of the G12 residue directly participates as a general base in the hammerhead ribozyme-catalyzed self-cleavage reaction. Similarly to the mechanism discussed earlier for glmS ribozyme, the unprotonated form of N1 is more suitable to participate in the reaction as a general base, which can be an anionic G form, generated through deprotonation at N1, or a minor tautomer of G, generated by the subsequent reprotonation of anionic G at another site. It remains to be determined the extent to which the anionic form or the minor tautomeric form of the catalytic guanosine participate in the catalytic step of the cleavage reaction.
Possible tautomerism in hairpin ribozyme catalysis
The hairpin ribozymes like hammerhead constitute a subset of RNA satellites in the genomes of RNA viruses and are thought to catalyze processes that precede genomic RNA replication (Fedor 2000). They catalyze reversible cleavage of sequence-specific regions of RNA to form a 2′,3′-cyclic phosphate and a 5′-OH terminus.Similar to hammerhead and glmS ribozymes, the first step in the hairpin ribozyme-catalyzed cleavage reaction involves activation of a 2′-OH nucleophile by a catalytic guanosine (G8 for hairpin) (Fedor 2000; Bevilacqua and Yajima 2006). The N1 of G8 is within hydrogen bonding distance from the 2′-hydroxyl nucleophile (Fig. 5B; Rupert and Ferré-D'Amaré 2001). The G8 residue is essential for catalysis, replacement of G8 with an abasic site resulting in a 103-fold reduction in the rate of cleavage reaction (Kuzmin et al. 2004). Perturbation of pKa of the N1 of G8 through its substitution with either inosine or diaminopurine changes the pH profile of the self-cleavage reaction (Pinard et al. 2001), suggesting a direct role of the N1 of G8 in the cleavage reaction (Pinard et al. 2001). However, the decrease in rate observed with the abasic substitution did not influence the pH dependence of the reaction, indicating that the apparent pKa could be due to another source (Kuzmin et al. 2004). While the catalytic role of the G8 residue, as a general base, in catalysis of hairpin ribozyme has been established, further studies are needed to examine whether G8 participates in the reaction as a minor tautomer or in its anionic form.
Varkud Satellite (VS) ribozyme catalysis
The Varkud Satellite (VS) ribozyme is the largest auto-nucleolytic ribozyme found in the mitochondrial satellite plasmid of Neurospora species, playing a role in the processing and resolution of replication intermediates (Griffiths 1995; Lilley 2004). It catalyzes reversible site-specific cleavage and ligation reactions (Saville and Collins 1990). Since the crystal structure of VS ribozyme is not yet available, the mechanism of VS ribozyme catalysis remains uncertain. Biochemical data suggest the involvement of a site-specific adenosine (A756) and guanosine (G638) in acid–base catalyses. Substitution of A756 with any other base results in a 300-fold or more reduction of the catalytic activity of the VS ribozyme, whereas G638 to A substitution results in a 104-fold reduction in rate (Lafontaine et al. 2001; Sood and Collins 2002). Complete removal of A756 causes 900-fold reduction in activity, without affecting ribozyme folding suggesting that it directly participates in the ribozyme catalysis (Lafontaine et al. 2001, 2002). Biochemical cross-linking experiments using 4-thiouridine also demonstrated a direct interaction between A756 and the cleavage site (Hiley et al. 2002). Nucleotide analog interference mapping (NAIM) studies, using a series of ionization sensitive adenosine and cytosine analogs, indicated that ionization of A756 plays a direct role in the VS ribozyme-catalyzed ligation reaction (Jones and Strobel 2003). Although NAIM studies are consistent with the ionization of A756 during catalysis, systematic studies to structurally identify the ionized or tautomeric form of A756 have been lacking. Methods based on BIEs could be adapted to study amino–imino tautomerism of A756 in the VS ribozyme. The VS ribozyme is particularly suitable for studying amino–imino tautomerism in RNA, as it uses adenosine in its catalysis. For this purpose, double labeled adenosines, [6-N15, 1′-C14] and [6-N14, 1′-H3], can be used to characterize the tautomeric form A756 in the active site of the VS ribozyme by using a strategy that is briefly described later in this review.
Significant structural and biochemical evidence exists to suggest that tautomerism plays an important role in the biochemistry of many self-cleaving ribozymes and riboswitches. Given that tautomerism is an intrinsic property of ribonucleotides, one can imagine that other biological processes involving RNA could also be influenced by the structural diversity generated by tautomeric equilibria. Additionally, tautomerism is important in other areas of nucleic acid biochemistry. Spontaneous formation of minor tautomeric forms of nucleic acid bases, either in the nucleotide pool or in the template strand, has been proposed to generate mutations, owing to suspected noncanonical base-pairing properties of minor tautomers. Tautomeric nucleoside analogs also have therapeutic applications as antiviral drugs because of their ability to induce lethal mutagenesis (Li et al. 2014), increasing the viral mutation rates above the error catastrophe limit of a virus. Furthermore, because small self-cleaving ribozymes and riboswitches are considered as evolutionary remnants that have persisted in the modern genomes (Ferre-D'Amare and Scott 2010), biochemical versatility of RNA enhanced by tautomerism may have also played a crucial role in the ancestral RNA World (Gilbert 1986). Tautomerism of nucleic acid bases is perhaps also relevant to the origin of life question, and pertinent for understanding why Nature selected A, T, G, C, and U as nucleic acid bases for encoding the genetic information of all life forms.
Authors: Marija Košutić; Sandro Neuner; Aiming Ren; Sara Flür; Christoph Wunderlich; Elisabeth Mairhofer; Nikola Vušurović; Jan Seikowski; Kathrin Breuker; Claudia Höbartner; Dinshaw J Patel; Christoph Kreutz; Ronald Micura Journal: Angew Chem Int Ed Engl Date: 2015-10-16 Impact factor: 15.336
Authors: Aiming Ren; Xin C Wang; Colleen A Kellenberger; Kanagalaghatta R Rajashankar; Roger A Jones; Ming C Hammond; Dinshaw J Patel Journal: Cell Rep Date: 2015-03-26 Impact factor: 9.423
Authors: Aamir Mir; Ji Chen; Kyle Robinson; Emma Lendy; Jaclyn Goodman; David Neau; Barbara L Golden Journal: Biochemistry Date: 2015-10-02 Impact factor: 3.162