Chun Kit Kwok1, Aleksandr B Sahakyan1, Shankar Balasubramanian2. 1. The University of Cambridge, Department of Chemistry, Lensfield Road, Cambridge, CB2 1EW, UK. 2. The University of Cambridge, Department of Chemistry, Lensfield Road, Cambridge, CB2 1EW, UK. sb10031@cam.ac.uk.
Abstract
RNA G-quadruplex (rG4) structures are of fundamental importance to biology. A novel approach is introduced to detect and structurally map rG4s at single-nucleotide resolution in RNAs. The approach, denoted SHALiPE, couples selective 2'-hydroxyl acylation with lithium ion-based primer extension, and identifies characteristic structural fingerprints for rG4 mapping. We apply SHALiPE to interrogate the human precursor microRNA 149, and reveal the formation of an rG4 structure in this non-coding RNA. Additional analyses support the SHALiPE results and uncover that this rG4 has a parallel topology, is thermally stable, and is conserved in mammals. An in vitro Dicer assay shows that this rG4 inhibits Dicer processing, supporting the potential role of rG4 structures in microRNA maturation and post-transcriptional regulation of mRNAs.
RNA G-quadruplex (rG4) structures are of fundamental importance to biology. A novel approach is introduced to detect and structurally map rG4s at single-nucleotide resolution in RNAs. The approach, denoted SHALiPE, couples selective 2'-hydroxyl acylation with lithium ion-based primer extension, and identifies characteristic structural fingerprints for rG4 mapping. We apply SHALiPE to interrogate the human precursor microRNA 149, and reveal the formation of an rG4 structure in this non-coding RNA. Additional analyses support the SHALiPE results and uncover that this rG4 has a parallel topology, is thermally stable, and is conserved in mammals. An in vitro Dicer assay shows that this rG4 inhibits Dicer processing, supporting the potential role of rG4 structures in microRNA maturation and post-transcriptional regulation of mRNAs.
RNA G‐quadruplexes (rG4s) appear to be important in gene regulation and disease.1 This structural motif comprises G‐quartets (Figure 1 A) with connecting loops, and can be stabilized by cations, especially potassium (K+). Recently, rG4s have been visualized in human cells,2 and ligands, such aspyridostatin (PDS)3 (Figure 1 B), have been shown to further stabilize rG4 structures.2a
Figure 1
Structure probing of rG4s using SHALiPE. Chemical structure of A) a G‐quartet and B) pyridostatin (PDS). C) Overview of SHALiPE. RNA is folded under Li+, K+, or K++PDS conditions, followed by selective 2′‐hydroxyl acylation (SHA) using NAI. Li+‐based primer extension (LiPE) is conducted to analyze the NAI‐modified RNA. Controls are performed with no NAI.
Structure probing of rG4s using SHALiPE. Chemical structure of A) a G‐quartet and B) pyridostatin (PDS). C) Overview of SHALiPE. RNA is folded under Li+, K+, or K++PDS conditions, followed by selective 2′‐hydroxyl acylation (SHA) using NAI. Li+‐based primer extension (LiPE) is conducted to analyze the NAI‐modified RNA. Controls are performed with no NAI.Our understanding of rG4 structure and function is still in its infancy. Biophysical characterization of rG4s has typically involved circular dichroism (CD) spectroscopy,4 UV‐thermal melting,5 and NMR spectroscopy,6 which report rG4 features, but largely without considering the effects of flanking sequences and an extended sequence context on rG4 formation. In‐line probing can address these shortcomings and identifies rG4s;7 however, the long experimental time (two‐day reaction) poses some practical limitations. Reverse transcriptase stalling (RTS)8 can map the 3′‐end of the rG4 in a short time (15‐minute reaction), but does not provide nucleotide resolution details of the G‐quartets and loops.SHAPE9 (selective 2′‐hydroxyl acylation analyzed by primer extension) exploits differential, structure‐dependent kinetics of acylation of 2′‐OH of RNA, which is measured by reverse transcriptase stop at the 2′‐O adduct. SHAPE has mapped various RNA structures,10 including motifs such as RNA hairpins, but not rG4s. Herein, we show a new method to map rG4s that overcomes limitations of the classical SHAPE method. Specifically, we introduce lithium ion (Li+)‐based primer extension (LiPE) with 2‐methylnicotinic acid imidazolide (NAI) to map rG4s at single nucleotide resolution (Figure 1 C). Importantly, we apply it to reveal rG4 formation in biologically important RNAs, and report the role of rG4s in Dicer processing.We first evaluated the current SHAPE approach on an rG4 structure, using a construct comprising 5′ and 3′ stem loops (SL)11 and the telomeric TERRA
12 rG4, UUAGGGUUAGGGUUAGGGUUAGGGUUA (Supporting Information, Table S1 and Figure S1). Using NAI‐acylation (5‐minute reaction),13 we observed that patterns of NAI‐induced modifications are obscured by strong RTS by TERRA rG4 formation, in the standard K+‐containing PE buffer (Figure 2, lanes 1–6, green). This suggests that K+ should be avoided in the PE buffer14 to enable SHAPE analysis of rG4s.
Figure 2
SHALiPE development and analysis. TERRA RNA was probed with NAI under Li+, K+, and K++PDS conditions, followed by primer extension (PE) using either the SSIII commercial (com.) K+‐containing PE buffer14 (left gel) or home‐made Li+‐containing PE buffer (right gel). On the left, intense stalling was observed (lanes 1–6), caused by formation of TERRA rG4 (stabilized by K+ in the commercial PE buffer) in the PE step. On the right, stalling was reduced using the Li+‐based PE buffer. Lanes 9–14 show (−) and (+) NAI signals under Li+, K+, and K++PDS conditions. TERRA rG4 was unstructured in Li+ conditions. Under physiological K+ conditions, TERRA rG4 displayed distinct NAI profiles (low NAI signals in several Gs involved in rG4; purple asterisks, see lanes 10 and 12). Blue asterisks indicate Gs that were protected upon PDS addition (lanes 12 and 14). The loops (L1, L2, and L3) and loop nucleotides (UUA) are shown in red. Lanes 7–8 show sequencing of G. 5′ SL, 5′ stem loop (Supporting Information, Figure S1).
SHALiPE development and analysis. TERRA RNA was probed with NAI under Li+, K+, and K++PDS conditions, followed by primer extension (PE) using either the SSIII commercial (com.) K+‐containing PE buffer14 (left gel) or home‐made Li+‐containing PE buffer (right gel). On the left, intense stalling was observed (lanes 1–6), caused by formation of TERRA rG4 (stabilized by K+ in the commercial PE buffer) in the PE step. On the right, stalling was reduced using the Li+‐based PE buffer. Lanes 9–14 show (−) and (+) NAI signals under Li+, K+, and K++PDS conditions. TERRA rG4 was unstructured in Li+ conditions. Under physiological K+ conditions, TERRA rG4 displayed distinct NAI profiles (low NAI signals in several Gs involved in rG4; purple asterisks, see lanes 10 and 12). Blue asterisks indicate Gs that were protected upon PDS addition (lanes 12 and 14). The loops (L1, L2, and L3) and loop nucleotides (UUA) are shown in red. Lanes 7–8 show sequencing of G. 5′ SL, 5′ stem loop (Supporting Information, Figure S1).AsrG4 structures are less stable in Li+,15 we prepared and evaluated a Li+‐based PE buffer to reduce rG4 stalling and unmask patterns of rG4‐dependent 2′‐OH acylation. When the reaction of RNA with NAI was performed in the presence of Li+, the 2′‐OH of most nucleotides in the TERRA rG4 region were strongly modified (Figure 2, lane 10), indicating that the rG4 region is unstructured. In contrast, NAI reaction in the presence of K+, led to protection of some Gs in the rG4 region (Figure 2, lanes 10 and 12, purple asterisks), suggestive of rG4 formation. Interestingly, the modifications at the remaining Gs in the rG4 motif were more intense in K+ compared to Li+ (Figure 2, lanes 10 and 12, blue asterisks), but more protected in the presence of both K+ and the rG4 stabilizing ligand, namely PDS (K++PDS) (Figure 2, lanes 12 and 14, blue asterisks). These Gs were mostly located at the 3′‐end of each G‐tract. Similar findings were observed for TERC rG4 and NRAS rG4 (Supporting Information, Figures S2, S3), suggesting that the 2′‐OHs of the 3′‐G‐quartet are more flexible than those of the 5′‐G‐quartet. These data show that SHALiPE provides a distinctive pattern for rG4 structure that can be used to fine map rG4s, along with other RNA structures (Supporting Information, Figure S4) at single nucleotide resolution.RNA structures are dynamic and hairpin to G‐quadruplex conformational transition16 may have regulatory roles in biological processes. We surveyed the miRBase17 to identify precursor microRNA (pre‐miRNA) hairpins that contain putative rG4s (Supporting Information, Table S2), and selected a conserved candidate (Supporting Information, Table S3), pre‐miRNA 149, for further investigation. The processing of pre‐miRNA 149 by Dicer produces miR149 and miR149*, both of which have been shown to function as oncogenic regulators in cancers.18Using comparative sequence analysis, we identified a conserved rG4 sequence that partially overlaps with miRNA 149* (Figure 3 A). We confirmed rG4 formation in the putative rG4 sequence by CD and UV‐melting spectroscopic analyses (Figure 3 B; Supporting Information, Figure S5). Under 150 mm K+ conditions, the CD spectrum of the wildtype rG4 sequence, but not the mutated sequence, showed a distinct CD profile consistent with a parallel rG4 topology4 (Figure 3 B; Supporting Information, Table S1). UV‐melting analysis of the wildtype rG4 sequence displayed a hypochromic shift at 295 nm (Supporting Information, Figure S5), a hallmark for rG4 formation.5 Furthermore, the melting temperature for the rG4 sequence under the 150 mm K+ condition was found to be >90 °C, suggestive of a highly thermostable rG4 under physiological K+ conditions.
Figure 3
Computational analysis and biophysical characterization of a putative rG4 in human pre‐miRNA 149. A) Comparative sequence analysis of pre‐miRNA 149. The miR149, miR149*, and putative rG4 are highlighted. Conserved nucleotides are marked by asterisks and the length of RNAs are shown. The name of the species, presented here with three‐letter codes, can be found in the Supporting Information, Table S3. B) CD spectrum of wildtype rG4 sequence under K+ condition shows a characteristic signature that indicates formation of a parallel topology rG4 structure. Mutant shows no such signature. The Y axis is molar ellipticity per nucleotide (Δϵ). C) The secondary structure model for the human pre‐miRNA 149 is derived using the sequences in (A) and TurboFold.19
Computational analysis and biophysical characterization of a putative rG4 in human pre‐miRNA 149. A) Comparative sequence analysis of pre‐miRNA 149. The miR149, miR149*, and putative rG4 are highlighted. Conserved nucleotides are marked by asterisks and the length of RNAs are shown. The name of the species, presented here with three‐letter codes, can be found in the Supporting Information, Table S3. B) CD spectrum of wildtype rG4 sequence under K+ condition shows a characteristic signature that indicates formation of a parallel topology rG4 structure. Mutant shows no such signature. The Y axis is molar ellipticity per nucleotide (Δϵ). C) The secondary structure model for the human pre‐miRNA 149 is derived using the sequences in (A) and TurboFold.19Next, we predicted the secondary structure of the human pre‐miRNA 149 by TurboFold,19 which takes into consideration the sequences of orthologues and generates a consensus RNA structural model (Figure 3 C). Since the rG4 partially overlaps with the canonical hairpin structure in pre‐miRNA 149 (Figure 3 C), we hypothesize that the formation of an rG4 structure may affect the folding of the hairpin structure in this pre‐miRNA.We applied SHALiPE to probe the structure of pre‐miRNA 149 under Li+, K+, and K++PDS conditions (Figure 4 A). The results revealed a number of important findings. First, NAI modification (reactivity) profiles for Li+ versus K+ and Li+ versus K++PDS were weakly correlated, with Pearson's correlation coefficients (PCC) of 0.10 and 0.19 respectively (Figure 4 B, left and middle plots), suggesting that the major RNA conformations under Li+ and K+ (or K++PDS) conditions are different.
Figure 4
SHALiPE and Dicer cleavage assays on human pre‐miRNA 149 reveal an rG4 that inhibits in vitro Dicer processing. A) SHALiPE on pre‐miRNA 149 under Li+, K+, and K++PDS conditions (lanes 1–6). Lanes 7–10 show sequencing of U, C, G, and A respectively. The rG4 loops are in red. G that are protected with the addition of PDS (G55, G59, G63) are marked with blue asterisks. B) The PCC of the normalized NAI reactivity for K+ versus Li+ (left), K++PDS versus Li+ (middle), and K++PDS versus K+ (right). The PCC increases from 0.65 to 0.95 by removing G55, G59 and G63 (right). C) The hairpin conformation of pre‐miRNA 149. Some nucleotides are marked with green asterisks. D) The rG4‐containing conformation of pre‐miRNA 149. G55, G59, and G63 are marked with blue asterisks. E) Results of in vitro Dicer assay on pre‐miRNA 149 wildtype (WT) and mutant (MUT). Relative miRNA ratio (K+/Li+ or K++PDS/Li+) are reported. Values are from 3 biological replicates and error bars depict standard deviations. The constructs for the Dicer assay do not contain the 5′ and 3′SL (Supporting Information, Table S1).
SHALiPE and Dicer cleavage assays on human pre‐miRNA 149 reveal an rG4 that inhibits in vitro Dicer processing. A) SHALiPE on pre‐miRNA 149 under Li+, K+, and K++PDS conditions (lanes 1–6). Lanes 7–10 show sequencing of U, C, G, and A respectively. The rG4 loops are in red. G that are protected with the addition of PDS (G55, G59, G63) are marked with blue asterisks. B) The PCC of the normalized NAI reactivity for K+ versus Li+ (left), K++PDS versus Li+ (middle), and K++PDS versus K+ (right). The PCC increases from 0.65 to 0.95 by removing G55, G59 and G63 (right). C) The hairpin conformation of pre‐miRNA 149. Some nucleotides are marked with green asterisks. D) The rG4‐containing conformation of pre‐miRNA 149. G55, G59, and G63 are marked with blue asterisks. E) Results of in vitro Dicer assay on pre‐miRNA 149 wildtype (WT) and mutant (MUT). Relative miRNA ratio (K+/Li+ or K++PDS/Li+) are reported. Values are from 3 biological replicates and error bars depict standard deviations. The constructs for the Dicer assay do not contain the 5′ and 3′SL (Supporting Information, Table S1).Second, we found that in Li+ (Figure 4 A, lanes 1–2), NAI modifications mapped consistently to the predicted hairpin structure (Figure 4 A, C, green asterisks). In contrast, under K+ and K++PDS conditions (Figure 4 A, lanes 3–6), NAI modifications did not agree with the hairpin structure. For example, A56, A60, and A64 were strongly modified by NAI (Figure 4 A, lanes 3–4), despite being assigned as base‐paired nucleotides in the hairpin structure (Figure 4 C). To verify the SHALiPE results, a nucleobase‐specific reagent dimethyl sulfate (DMS) was used, which modifies the Watson–Crick face of As (N1A) and Cs (N3C)20 in a manner that stalls reverse transcriptase. As such, DMSLiPE provides a measure of nucleobase‐specific DMS reactivity with RNA. Similar to NAI, we observed that A56, A60, and A64 were strongly modified by DMS under K+ and K++PDS conditions but not in Li+ conditions (Supporting Information, Figure S6), suggesting that A56, A60, and A64 are not base‐paired and are the loop nucleotides of the rG4 (Figure 4 A,D). Our results support that the major RNA conformation under K+ and K++PDS conditions contains an rG4 (Figure 4 D).Furthermore, we performed SHALiPE on a mutant construct that cannot form an rG4, and found that the NAI modification profiles were almost identical under Li+, K+, and K++PDS conditions (Supporting Information, Figure S7). Also, the NAI modifications mapped well to the predicted structure of this construct (Supporting Information, Figure S7). This verified that the change in NAI profiles observed in the natural sequence (Figure 4 A) was indeed due to rG4 formation.Third, we observed that the PCC for NAI reactivity values under K+ and K++PDS conditions was 0.65 (Figure 4 B, right plot). Visually, three nucleotides (G63, G55, and G59) stood out, and by removing them from the correlation analysis yielded a PCC of 0.95 (Figure 4 B, right plot), similar to the PCC of 0.93 for DMS reactivity (only considers As and Cs)20 (Supporting Information, Figure S6). Interestingly, these three nucleotides were the 3′‐G of each G‐tract in the rG4 (Figure 4 D, blue asterisks). They were highly modified by NAI under K+, and protected with the addition of PDS (Figure 4 A, lanes 4 and 6), as for the other rG4 examples we tested (Figure 2; Supporting Information, Figures S2, S3). Different acylation reagents and rG4 ligands yielded the same results (Supporting Information, Figure S8), suggesting that this feature is not reagent‐ or rG4 ligand‐dependent, but rather due to the flexible nature of the 3′‐G‐quartet and/or the unique conformation of the 3′‐Gs of the rG4, making them susceptible for 2′‐OH acylation attack.Taken together, these results provide strong evidence for the formation of an rG4 under K+ and K++PDS conditions (Figure 4 D), whereas a hairpin conformation predominates under Li+ conditions (Figure 4 C) in the pre‐miRNA 149 wildtype. It is possible that two (or more) RNA conformations exist under K+ and K++PDS conditions.To assess the role of the rG4 structure in this pre‐miRNA 149, we performed an in vitro Dicer cleavage assay on the wildtype and mutant constructs (without 5′ and 3′ SL) under Li+, K+, and K++PDS conditions (Figure 4 E; Supporting Information, Figure S9, Table S1). Our results showed that the rG4 inhibits Dicer processing activity. The yield of miRNA under K+ and K++PDS conditions were 2.5–2.7‐fold less than under the Li+ condition for the wildtype (Figure 4 E, blue bars, 0.37±0.05 for K+/Li+ and 0.39±0.04 for K++PDS/Li+), in contrast to 1.1–1.2‐fold for the mutant (Figure 4 E, red bars, 0.91±0.12 for K+/Li+ and 0.82±0.08 04 for K++PDS/Li+). The repressive role of the rG4 in pre‐miRNA 149 is consistent with recently reported cases of pre‐miRNA 92b and pre‐miRNA let‐7e,21 suggesting a broader role for rG4s in miRNA maturation. Our bioinformatics analysis shows 85 putative rG4s in human pre‐miRNAs (using G3+L1‐12‐based motif, see the Supporting Information) and many others across organisms (Supporting Information, Table S2), suggested the phenomenon may be widespread.In summary, SHALiPE (and DMSLiPE) has enabled fine mapping of rG4 structures in RNAs. The approach has numerous benefits over prior methods and we have used it to reveal an rG4 structure in the biologically important pre‐miRNA 149, and showed that the rG4 inhibits Dicer processing in vitro. Our findings provide insights into the biological roles of rG4 structures and the approach may ultimately have utility in transcriptome‐wide probing of rG4s.As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors.SupplementaryClick here for additional data file.
Authors: Robert C Spitale; Pete Crisalli; Ryan A Flynn; Eduardo A Torre; Eric T Kool; Howard Y Chang Journal: Nat Chem Biol Date: 2012-11-25 Impact factor: 15.040
Authors: Chun Kit Kwok; Giovanni Marsico; Aleksandr B Sahakyan; Vicki S Chambers; Shankar Balasubramanian Journal: Nat Methods Date: 2016-08-29 Impact factor: 28.547
Authors: Yueying Zhang; Minglei Yang; Susan Duncan; Xiaofei Yang; Mahmoud A S Abdelhamid; Lin Huang; Huakun Zhang; Philip N Benfey; Zoë A E Waller; Yiliang Ding Journal: Nucleic Acids Res Date: 2019-12-16 Impact factor: 16.971