Literature DB >> 28090421

RNA G-quadruplexes and their potential regulatory roles in translation.

Jingwen Song1, Jean-Pierre Perreault2, Ivan Topisirovic3, Stéphane Richard1.   

Abstract

DNA guanine (G)-rich 4-stranded helical nucleic acid structures called G-quadruplexes (G4), have been extensively studied during the last decades. However, emerging evidence reveals that 5'- and 3'-untranslated regions (5'- and 3'-UTRs) as well as open reading frames (ORFs) contain putative RNA G-quadruplexes. These stable secondary structures play key roles in telomere homeostasis and RNA metabolism including pre-mRNA splicing, polyadenylation, mRNA targeting and translation. Interestingly, multiple RNA binding proteins such as nucleolin, FMRP, DHX36, and Aven were identified to bind RNA G-quadruplexes. Moreover, accumulating reports suggest that RNA G-quadruplexes regulate translation in cap-dependent and -independent manner. Herein, we discuss potential roles of RNA G-quadruplexes and associated trans-acting factors in the regulation of mRNA translation.

Entities:  

Keywords:  Aven; DHX36; FMRP; G-quadruplexes; RNA binding proteins; translation

Year:  2016        PMID: 28090421      PMCID: PMC5173311          DOI: 10.1080/21690731.2016.1244031

Source DB:  PubMed          Journal:  Translation (Austin)        ISSN: 2169-0731


Structure of G-quadruplexes

In 1910, Bang demonstrated that guanylic acid forms gels at high concentration, providing the first evidence that guanine (G)-rich sequences may form higher-order structures. Fifty years later, Gellert et al. (1962) reported that guanylic acid has the ability to form tetrameric structures by self-association. These non-canonical structures were identified in conserved DNA sequences of telomeres and shown to form so-called G-quadruplexes (G4) structures in vitro. Since then, increasing evidence shows that both DNA and RNA containing spaced guanine repeats form G4 structures. G-quadruplexes are folded in G-quartets that are square planar arrangements formed via Hoogsten pairing of adjacent guanines. The G-quartets lay on the top of each other to form 4-stranded structures which are stabilized by a cation positioned in the middle of the tetrads with preference for potassium (Fig. 1). The ability of potential G4 sequences to form G-quadruplexes is therefore influenced by the nature of the central cation, the number of stacking G-quartets, the length of the sequences connecting the strands (at least in DNA G-quadruplexes), the direction of the strands, and the presence of an alternative Watson-Crick pair-based stable structure.
Figure 1.

Structure of G-quadruplexes. Guanine-rich sequences fold into G-quadruplex structures (G4 structures), composed of planar G-quartets. This representation is a G-quadruplex parallel structure that could be observed in RNA as well as DNA molecules. The guanines are represented as blue, while cations are yellow.

Structure of G-quadruplexes. Guanine-rich sequences fold into G-quadruplex structures (G4 structures), composed of planar G-quartets. This representation is a G-quadruplex parallel structure that could be observed in RNA as well as DNA molecules. The guanines are represented as blue, while cations are yellow.

DNA G-quadruplexes

Bioinformatic search using a script that identifies “regular” G-quadruplexes (i.e. GxN1-7GxN1-7GxN1-7Gx, where G stands for guanine, N can be any nucleotide (A, G, C, U) and x≥3 ) has shown that the human genome harbors ∼376,000 potential G-quadruplex sequences (pG4). With the development of high resolution sequencing-based methods > 716,000 G4 structures were identified, where ∼451,000 were not predicted by computational methods. These sequences are highly conserved in mammals, with limited conservation in evolutionarily lower organisms. Genome-wide searches using G-quadruplex-specific probes and structure-based pull-down strategies have revealed that the pG4 sequences are not randomly distributed, being enriched at telomeres, promoter regions and replication origins. The highest abundance of pG4 is in human telomeres, where the G4 formation protects the end of chromosomes by inhibiting telomerase activity. Abnormal telomerase overexpression has been observed in > 85% of cancers. Thus, extensive investigations were performed to obtain anti-cancer therapies using small molecules to stabilize telomeric G-quadruplexes as a means to inhibit telomerase. Moye et al. (2015) characterized the stable human telomeric G-quadruplexes and demonstrated that these G4s are able to extend into parallel, intermolecular conformations, aligning with the intrinsic RNA moiety of the human telomerase RNA (hTR). They also showed that telomerase colocalizes with a subset of telomeric G4 structures in vivo. Additionally, structural and computational analysis revealed that pG4 sequences are found in ∼40% of gene promoter regions, mostly acting as transcriptional repressors. It has been shown that pG4 sequences occur with high frequency in the promoter regions of genes encoding oncogenes such as c-Myc, c-Kit, KRAS, PDGF-A (platelet derived growth factor-A), and hTERT (human telomerase reverse transcriptase), while tumor suppressor genes correlate with low pG4 abundance in their promoter regions. Recently, Onel et al. (2016) showed that Bcl-2 forms G-quadruplexes in its promoter by nuclear magnetic resonance (NMR) spectroscopy and dimethylsulfate (DMS) footprinting assays, and these G4s were shown to inhibit transcription by promoter-driven luciferase assay. Another study also demonstrated that the human tyrosine hydroxylase (hTH) gene harbors a G4 structure in the 3′ proximal promoter region, acting as a necessary element for transcriptional regulation. Since hTH is linked to several neurological and psychiatric disorders such as Parkinson disease and Schizophrenia, these findings suggest that promoter G4 sequences may be linked with these diseases. Furthermore, emerging evidence suggests that ∼90% of DNA replication origins contain pG4 sequences. Recently, it was discovered by performing nascent strand sequencing that G4 sequences may also position nucleosomes at a subset of human replication origins. Moreover, it was reported that Potential DNA:RNA Hybrid G-Quadruplex Sequences (PHQS) are present in > 97% of human genes, and these PHQS may modulate transcription. Additional studies show that helicases, which are the molecular motors to unwind DNA and RNA, are involved in the active resolution of G4s. The best-characterized DNA G4 helicase are Pif1, RecQ, FANCJ, DDX11, BLM (Bloom syndrome protein) and WRN (Werner syndrome protein). Known dual RNA-DNA G4 helicases are DHX9 and RHAU (DHX36).

RNA G-quadruplexes

G-quadruplexes also form in RNA and are more stable than their DNA counterparts. RNA G-quadruplexes almost exclusively adopt a parallel conformation in which the 4 strands all have the same directionality. The 2′-hydroxyl group of the ribose locks the RNA in an anti-conformation, which favors the parallel topology. Consequently, RNA G-quadruplexes have less topological diversity than DNA G-quadruplexes. In eukaryotes, RNA G4 structures are enriched at telomeres and within specific protein encoding transcripts i.e., mRNAs. In telomeres, the G-rich TERRA (telomeric repeat-containing RNAs, or TelRNA) RNA which is transcribed from the human C-rich telomeric DNA, was shown to form G4 structures in vitro and in cellulo. The high resolution of the G4 structure in TERRA revealed that the 2′-hydoxyl group provides intramolecular hydrogen bonding within the parallel-stranded structures, and this is important for ligand targeting and higher-order arrangement. The human telomerase RNA (hTR) forms G4 structures at its 5′ end in the presence of potassium, as visualized using gel electrophoresis and UV, CD, and NMR spectroscopy. RNA G-quadruplexes are frequently present in mRNAs. Kumari et al. (2007) demonstrated that among all the genes in the human transcriptome available at that time, ∼3,000 of the 5′-UTRs were identified to possess at least one pG4 sequence. This was revised, when Beaudoin and Perreault (2010) identified 9,979 5′-UTRs to contain at least one pG4 sequence, among the 124,315 transcripts from the human UTRfull dataset based on a “regular” definition of G-quadruplex. Subsequently, it was discovered that 1,453 human pG4 sequences possess 2 short distal loops of 1 nucleotide in length and a long central loop of up to 70 nucleotides long in the 5′-UTRs, which significantly expands the number of pG4s in the transcriptome. A bioinformatic search for “regular” G-quadruplex in 3′-UTRs of the human transcriptome, detected 8,903 pG4 sequences showing that the enrichment for G-quadruplex is not limited to the 5′-UTRs. Additionally, other bioinformatic analyses, in-line probing and luciferase reporter assays revealed the existence of pG4 sequences in both the 5′- and 3′-UTR of transcripts, indicating G4 structures function in the post-transcription regulation in cellulo, and neighboring C-rich sequences that affect G4 folding. Moreover, a bioinformatics search for pG4 sequences in mRNA-coding regions was performed, which revealed that ∼1600 pG4s are present in human ORFs. Recently, the existence of G-quadruplex formation in RNA was further confirmed in human cells by using stabilizing ligands that specifically trap RNA G-quadruplexes. The authors were able to visualize G4 structures in the cytoplasm of human cells using G-quadruplex-specific antibodies. RNA G-quadruplexes in TERRA and hTR RNAs regulate telomerase function, whereas G4 formation in mRNAs has been implicated in several gene expression steps, including pre-mRNA processing, mRNA turnover and targeting, and translation. While recent reviews have broadly covered the G4 structures in DNA and RNA molecules as well as their functions in telomere maintenance and RNA metabolism in physiology and pathology conditions, we will discuss in detail the emerging areas of RNA G4 structures in mRNA translation and known trans-acting factors that bind these structures.

G-quadruplexes in the control of mRNA translation

mRNA translation is one of the most fundamental processes in RNA metabolism, and its regulation is tightly controlled. Protein synthesis is composed of 4 main steps: translation initiation, elongation, and termination as well as ribosome recycling. For most eukaryotic mRNAs, translation initiation involves the association of the 7-methylguanosine cap with the cap-binding complex called eukaryotic initiation factor 4F (eIF4F,Fig. 2A). eIF4F complex contains the cap-binding protein eIF4E, the scaffold protein eIF4G that bridges interaction between eIF4F and multifactor complex (MFC), thus allowing recruitment of the mRNA to the ribosome, and the ATP-dependent DEAD box RNA helicase eIF4A required to unwind secondary structures in the 5′-UTRs. The association between the mRNA and eIF4F is the first step of translation, followed by the recruitment of 43S initiation complex, composed of the 40S ribosomal subunit, the eukaryotic initiation factors eIF3, eIF1, eIF1A and eIF5, as well as the ternary complex containing methionine-loaded tRNA, eIF2 and GTP (Fig. 2A). The 43S complex recognizes the initiation codon, where it is joined by a 60S ribosomal subunit to form the 80S ribosome. Excessive secondary structures in 5′-UTRs impede mRNA translation in a cap-dependent manner in eukaryotes. The 3′-UTRs also participate in translational regulation, where the added poly (A) tail is bound by the poly(A)-binding protein (PABP) and eIF4G, resulting in the circularization of mRNAs and enhanced overall initiation rate. An alternative mode of translation is driven by IRES (Internal Ribosome Entry Sites), and it occurs in a cap-independent mode. Independent of the presence or integrity of several canonical initiation factors (especially eIF4E), IRES directly recruits ribosomes, bypassing the requirement for the 5′cap and eIF4E. Efficient IRES-driven translation is facilitated by the IRES trans-acting factors (ITAFs,Fig. 2B). Collectively, secondary structure in 5′-UTR is thought to have a major impact on translation efficiency.
Figure 2.

Cap-dependent and Cap–independent translation initiation. (A) In cap-dependent translation, eIF4E binds to the 5′-m7GpppN of the mRNA (m7G). The capped 5′-end is associated with 43S complex by a bridging protein called eIF4G. eIF4G is also bound to eIF4A, the RNA helicase that unwinds 5′ secondary structures. PABP binds the poly (A) tail and brings the 5′-end and 3′-end of the mRNA together through the interaction with eIF4G. eIF3, eIF5, eIF1/eIF1A and ternary complex are shown as represented. (B) ITAFs and eIG4GI (also known as p97/DAP5/NAT1, purple) facilitates IRES cap-independent translation.

Cap-dependent and Cap–independent translation initiation. (A) In cap-dependent translation, eIF4E binds to the 5′-m7GpppN of the mRNA (m7G). The capped 5′-end is associated with 43S complex by a bridging protein called eIF4G. eIF4G is also bound to eIF4A, the RNA helicase that unwinds 5′ secondary structures. PABP binds the poly (A) tail and brings the 5′-end and 3′-end of the mRNA together through the interaction with eIF4G. eIF3, eIF5, eIF1/eIF1A and ternary complex are shown as represented. (B) ITAFs and eIG4GI (also known as p97/DAP5/NAT1, purple) facilitates IRES cap-independent translation.

G-quadruplexes in 5′-UTR and translational control

Kumari et al. (2007) reported that pG4 sequences in the human NRAS mRNA are conserved in different organisms. They documented the formation of G4 structures in vitro by circular dichroism (CD) spectroscopy and UV-melting experiments, while luciferase reporter assays revealed that the RNA G4 in 5′-UTR of NRAS inhibits translation by ∼80% in rabbit reticulocyte lysates. Moreover, it was established that the human ZIC-1 mRNA forms a 27 nucleotide G4 structure within its 5′-UTR and represses protein production by ∼80% in HeLa cells using the dual-luciferase plasmid based assay. The presence of G4 structures in 5′-UTR of various human mRNAs and multiple strategies such as bioinformatic analyses, mutagenesis and reporter gene-based expression assays showed that G4s in 5′-UTRs correlate with translational repression of various mRNAs including MT3-MMP, ERS1, BCL-2, TRF2, ADAM10 and TGFβ2 (Fig. 3). Moreover, in-depth analysis using CD spectroscopy and in-line probing, identified several 5′-UTRs that harbor pG4 sequences including EBAG9, AASDHPPT, FZD2, BARHL1, NCAM2, and THRA (Fig. 3). Most of these genes are involved in transcriptional regulation, protein modification, G-protein-mediated signaling, cation transport and developmental processes. The C-to-A substitution, known to destabilize G4 formation, was able to rescue the repressed translation of all but one gene. Recently, G4 structures with longer central loops (> 7 nucleotides) in the HIRA, TOM112 and APC 5′-UTR were also shown to have the ability to repress translation when tested by luciferase reporter assays (Fig. 3). Similar conclusions were reached in the study where the “irregular” G4 structures were discovered in the H2AFY and AKIRIN 5′-UTR (Fig. 3). It was also shown that antisense oligonucleotides can be used to inhibit or promote the formation of RNA G4 structures. Additionally, by using ribosome footprinting on a transcriptome-wide scale, Wolfe et al. (2014) reported that the 12-nucleotide guanine quartet motifs that can form G4 structures in 5′-UTRs rendering mRNAs exceptionally sensitive to eIF4A. As a key factor in cap-dependent translation initiation, eIF4A plays a role in scanning the 5′-UTR of the mRNAs for start codons. Notably, eIF4A inhibitors including silvestrol, hippuristanol and pateamine A exhibit potent anticancer activity. By using silvestrol in murine T-cell acute lymphoblastic leukemia (T-ALL) models and primary human T-ALL samples, Wolf et al. observed that eIF4A promotes the T-ALL development and maintenance by unwinding the G4 structures and stimulating translation of mRNAs encoding oncogenes, superenhancers-associated transcription factors and epigenetic regulators including MYC, NOTCH1, MYB, CDK6, MDM2, CCND3, ETS1, and BCL-2 (Figs. 3, 4B). It was, however, suggested that motifs other than 5′-UTR G4 structures may be required to render mRNA translation sensitive to eIF4A. To this end, mRNAs with long, but not short 5′-UTRs, appear to exhibit eIF4A-sensitivity, thereby suggesting that the length of 5′-UTR may also determine eIF4A requirement.
Figure 3.

Possible roles of G-quadruplexes in mRNA translation and mRNAs that harbor G4 structures. G-quadruplexes in 5′-UTRs, ORF and 3′-UTRs mainly repress cap-dependent translation, whereas G-quadruplexes in 5′-UTR near IRESs likely enhance the IRES-mediated translation. The genes harboring G4 structures in 5′-UTRs, ORF and 3′-UTRs are listed below.

Figure 4.

Schematic illustration of the functions of RNA binding proteins that bind RNA G4 structures in mRNAs. (A) Phosphorylated FMRP binds ORFs of mRNAs and inhibits translation. It stalls ribosomes in the elongation stage, resulting in the repressed translation of transcripts related to FXS/ASD. It recruits the co-factor CYFIP in synaptoneurosomes. By interacting with CYFIP, FMRP prevents the complex assembly between eIF4E, eIF4G and PABP, thereby inhibiting translation initiation. RGG/RG motif is denoted as vv. (B) eIF4A unwinds the G4 structures in 5′-UTR of many key transcription factors and oncogenes, thereby contributing to the T cell-acute lymphoblastic leukemia development. (C) Methylated Aven binds G4 structures in the ORFs of MLL1 and MLL4 mRNAs in an RGG/RG motif (denoted as vv) dependent manner. Aven also recruits DHX36 onto the polysomes that may facilitate unwinding of G4 structures. Thus Aven favors the translation of oncogenic proteins to increase leukemic cell proliferation.

Possible roles of G-quadruplexes in mRNA translation and mRNAs that harbor G4 structures. G-quadruplexes in 5′-UTRs, ORF and 3′-UTRs mainly repress cap-dependent translation, whereas G-quadruplexes in 5′-UTR near IRESs likely enhance the IRES-mediated translation. The genes harboring G4 structures in 5′-UTRs, ORF and 3′-UTRs are listed below. G4 structures in 5′-UTRs also influence cap-independent, IRES-driven-translation. The IRES within the 5′-UTR of the FGF2 mRNA forms a G4 structure affecting cap-independent translation. Deletion analysis in human liver adenocarcinoma cells showed that the pG4 sequences are sufficient to facilitate IRES activity. Another example was shown by Morris et al. (2010) who reported that the hVEGF (human vascular endothelial growth factor) mRNA forms a G4 structure essential for IRES-mediated translation. Interestingly, it was also shown that the stabilization of the G4 structure leads to inhibition of IRES-mediated translation of VEGF-A. These findings show that G4 structures may influence IRES-mediated cap-independent translation, although the mechanism on how this is achieved is unclear.

G-quadruplexes in open reading frames and translational control

The role of G4 structures in translational control has been focused mainly on G4 sequences in the 5′-UTRs. However, ORFs also contain G4 sequences and these sequences frequently encode low complexity amino acid sequences, amino acid repeats or short motifs. While G-quadruplexes in 5′-UTRs decreases protein expression by inhibiting ribosome scanning processes, G4 structures in ORFs contribute to other translation-related processes, such as elongation, ribosomal frameshift, no-go mRNA decay and translational folding of newly synthesized proteins. Originally, it was demonstrated that the Herpes virus thymidine kinase ORF contains pG4 sequences, leading to the expression of full-length thymidine kinase. Subsequently, the ORFs containing G-quadruplexes were found in FMR1 and APP (amyloid precursor protein) mRNAs. Murat et al. (2014) revealed that the mRNA encoding Epstein-Barr virus-encoded nuclear antigen 1 (EBNA1) forms a G4 in its ORF and using antisense oligonucleotides to G4 sequences to destabilize G4 formation, they observed that the structure impairs translation elongation (Fig. 3). Recently, a bioinformatics analysis identified 2 mixed lineage leukemia (MLL) proto-oncogenes KMT2A (lysine methyltransferase 2A or mixed lineage leukemia, MLL1) and KMT2B (MLL4) to harbor pG4 sequences in their ORFs ∼200 nucleotides downstream of the start codon (Fig. 3). By performing in-line probing analysis with G-A mutagenesis and the dual luciferase reporter assay, it was confirmed that MLL1 and MLL4 mRNAs form bona-fide G4 structures that block translation by >75%. Endoh & Sugimoto (2016) observed that the positioning of the G4 in ORFs, but not in 5′-UTR has a dramatic impact on translational efficiency. These results suggest that by acting as ‘roadblocks’, G-quadruplexes in ORFs may play a significant role in protein synthesis by inhibiting ribosomal progression during elongation.

G-quadruplexes in 3′-UTR and translational control

3′ end processing is an essential process in post-transcriptional regulation through which the mRNA is endonucleolytically cleaved and a poly (A) tail is added downstream of a canonical polyadenylation signal (AAUAAA) . G-quadruplexes are located in the 3′-UTR of some mRNAs and their presence affects translational output. The 3′-UTR of the proto-oncogene PIM1 harbors a conserved pG4 sequence (Fig. 3) and Arora & Suess (2011) showed using reporter assays that this pG4 sequence inhibits translation. A pG4 structure in TP53 3′-UTR maintains the 3′-end processing under DNA damage and the G4 formation is critical for p53 protein expression contributing to p53-induced apoptosis. More recently, a study combining in silico, in vitro and in cellulo approach, demonstrated that LRP5 and FXR1 mRNAs form G4 structures in their 3′-UTR, affecting the ratio of short/long isoforms produced (Fig. 3). Moreover, via a bioinformatic approach and CD spectrophotometry, Crenshaw et al. (2015) identified a candidate G-quadruplex in the 3′-UTR of APP mRNA, whose overexpression leads to Alzheimer disease. The authors also showed that this G-quadruplex inhibits APP protein expression by dual luciferase reporter assay(Fig. 3). In addition to translational control, G-quadruplexes in the 3′-UTR also play a role in alternative splicing, polyadenylation and mRNA targeting. For instance, G4 structures in the 3′-UTR of TP53 and hTERT mRNAs regulate alternative splicing. G-quadruplex in the intron 6 of hTERT mRNA appears to act as a splicing silencer, inasmuch as a potent G-quadruplex-stabilizing agent impaired the splicing of hTERT. Similarly, Marcel et al. (2011) reported that the G-quadruplex in intron 3 of TP53 mRNA modulates the splicing of intron 2. Additionally, G4 structures within the LRP5 and FXR1 3′-UTRs increase the efficiency of alternative polyadenylation site selection, leading to the expression of shorter transcripts. Finally, G4 structures in 3′-UTR of 2 dendritic mRNAs PSD95 and CaMKIIa facilitate their localization in neurites.

G-quadruplexes and trans-acting proteins

Many cellular RNAs associate with RNA-binding proteins to form ribonucleoprotein (RNP) complexes. A large number of RNA binding proteins contain RGG/RG motif or the related RGG/YGG motif. The RGG/RG motifs of nucleolin and fragile X mental retardation protein (FMRP) have been shown to associate with RNA G4 forming sequences. Some RNA G-quadruplex binding proteins are required for the unwinding of G4 structures during translation progression. The trans-acting factors that bind G4 sequences and their links with diseases are discussed below.

Nucleolin

Nucleolin is a 100kDa nucleolar phosphoprotein that contains an RGG/RG motif. Nucleolin interacts with both DNA and RNA G-rich sequences and plays a role in translational repression of specific mRNAs. C9orf72, a disease-related gene, harbors the expansion of a (GGGGCC)n(GGCCCC)n repeats within the first intron, leading to amyotrophic lateral sclerosis and frontotemporal dementia. It has been reported that the DNA/RNA hexanucleotide repeat expansion (HRE), (GGGGCC)n, forms extremely stable G-quadruplex structures and this may play a role in C9orf72 gene activity, protein binding as well as translation into pathologic dipeptides. It was also elucidated that the DNA of C9orf72 HRE forms antiparallel- and parallel-G-quadruplexes, while the HRE RNA sequence only adopts the parallel-stranded G4s. Nucleolin preferentially binds the HRE G-quadruplexes, which is mislocalized in patient cells carrying the C9orf72 mutation. These studies show DNA/RNA G4 sequences bound by nucleolin as a determinant of repeat-associated neurodegenerative diseases.

FMRP

Fragile X syndrome (FXS) is caused by expansion of (CGG)n repeats in the FMR1 gene, and subsequent hypermethylation of the FMR1 gene promoter, leading to the loss of FMRP expression. FXS is recognized to be the most-frequent heritable syndrome of mental insufficiency. This implicates the role of RNA G-quadruplexes in neuronal function. In addition to binding to the G4s, the RGG/RG motif of FMRP also modulates its association with polysomes, being consistent with its role in translational regulation. Originally, it was demonstrated that 432 mRNAs were co-immunoprecipitated with the FMRP RNP complex from mouse brain, nearly 70% of which contain a G4 sequence. Dramatic changes in polysome-association of these mRNAs were observed in the absence of FMRP, indicating that translational dysregulation of these mRNAs may underpin FXS. Recently, this was redefined by large-scale CLIP (crosslinking immunoprecipitation) studies, where 34,218 consensus FMRP binding sites in 3,703 genes were identified. It was reported that a G4 sequence in the FMR1 mRNA provides the docking site for FMRP. Luciferase reporter assays demonstrated that the binding of FMRP to a G4 sequence in 5′-UTR of a reporter gene strongly represses translation initiation in vitro. Strikingly, FMR1 G4 structures are bound by several FMRP isoforms, thereby suggesting the presence of a feedback mechanism regulating different FMRP isoform production. Additionally, MAP1B, PP2A and Shank1 genes that are essential for neural development have been shown to harbor one or more G-quadruplexes in their 5′-UTRs and/or 3′-UTRs, whereby the absence of FMRP increases production of the corresponding proteins, indicating that FMRP plays pivotal roles in neonatal brain development (Fig. 3). Furthermore, to explore the functions of FMRP in translational control in neurons, Napoli et al. (2008) found that FMRP inhibits cap-dependent translation by recruiting and/or stabilizing CYFIP1 in synaptoneurosomes. As a novel eIF4E binding protein, CYFIP1 binds eIF4E independently of other factors to impede cap-dependent translation of many mRNAs (Fig. 4A). FMRP is dephosphorylated in response to synaptic activation, resulting in the release from mRNAs encoding synaptic proteins and their derepression. In contrast, Bechara et al. (2009) reported a novel role of FMRP in translation activation. They observed that FMRP binds SOD1 mRNA through a motif called SoSLIP (SOD1 mRNA Stem Loops Interacting with FMRP), which competes with FMRP binding to G4 structures favoring translation. The solution structure of FMRP in complex with the in vitro selected G-rich RNA sequence, sc1 RNA, revealed that arginines within the RGG/RG motif of FMRP are positioned in the major groove of the G4 structure. Recently, X-ray crystallography analysis demonstrated that an RGG peptide of human FMRP was able to bind to the selected G-rich RNA in vitro. The RGG peptide was shown to stabilize G-quartets and facilitate G4 formation. It was also revealed that the specific binding of these RNAs with FMRP likely also involves the hydrogen binding with RNA duplexes, shown by mutagenesis and footprinting. High throughput sequencing of RNAs collected by HITS-CLIP (high-throughput sequencing-crosslinking immunoprecipitation) identified that FMRP also interacts with ORFs and stalls ribosomes on mRNA encoding presynaptic and postsynaptic proteins implicated in autism spectrum disorders (ASD) (Fig. 4A). Moreover, FMRP was shown to repress translation by direct binding to the L5 protein on the 80S ribosome, and this activity is depended on the integrity of the RGG/RG motif. Schematic illustration of the functions of RNA binding proteins that bind RNA G4 structures in mRNAs. (A) Phosphorylated FMRP binds ORFs of mRNAs and inhibits translation. It stalls ribosomes in the elongation stage, resulting in the repressed translation of transcripts related to FXS/ASD. It recruits the co-factor CYFIP in synaptoneurosomes. By interacting with CYFIP, FMRP prevents the complex assembly between eIF4E, eIF4G and PABP, thereby inhibiting translation initiation. RGG/RG motif is denoted as vv. (B) eIF4A unwinds the G4 structures in 5′-UTR of many key transcription factors and oncogenes, thereby contributing to the T cell-acute lymphoblastic leukemia development. (C) Methylated Aven binds G4 structures in the ORFs of MLL1 and MLL4 mRNAs in an RGG/RG motif (denoted as vv) dependent manner. Aven also recruits DHX36 onto the polysomes that may facilitate unwinding of G4 structures. Thus Aven favors the translation of oncogenic proteins to increase leukemic cell proliferation.

RHAU (DHX36)

RHAU, the RNA helicase associated with AU-rich elements, also known as DHX36, is a member of ATP-dependent DEAH-box RNA helicase family. DHX36 has been shown to be the predominant G-quadruplex resolving helicase in cellulo, with high activity in unwinding RNA G4 structures. The solution structure of DHX36 recognizing a G-quadruplex was resolved and identified a 3-anchor-point electrostatic interaction. It was also elucidated that DHX36 uses a local, non-processive mechanism to unwind G4 structures, mimicking the DEAD-box RNA helicase eIF4A. By applying genome-wide analysis, DXH36 was shown to bind ∼106 RNAs, the majority harboring pG4 sequences including the human telomerase RNA. Sexton et al. (2011) also discovered the association between DHX36 and TERC (hTR) pG4 sequences in the HEK293 cells. Another study confirmed that the interaction between DHX36 and the TERC G-quadruplex disrupts the formation of P1 helix, a structure which defines template boundary for reverse transcription. DHX36 was sufficient to unwind the quadruplexes and promote the formation of a stable P1 helix, and DHX36 depletion led to a reduction in average telomere length. Besides the function on telomere maintenance, DHX36 regulates translation, although the precise mechanism has not yet been elucidated. By performing a RNA-co-immunoprecipitation screen, Booy et al. (2014) discovered that DHX36 interacts with G-quadruplexes in 3′-UTR of PITX1 mRNA in cellulo, and depletion of DHX36 results in increased PITX1 expression, a transcription factor with roles in development and cancer. Finally, it was also demonstrated that the DHX36-mediated regulation of PITX1 involves miRNA regulatory components. Moreover, the impact of Aven on mRNA translation regulation is dependent on DHX36 (see below).

Aven

Aven is an anti-apoptotic protein first shown to interact and stabilize the pro-survival factor Bcl-xl, while inhibiting Apaf-1, thereby stimulating survival in particular when the cells are exposed to stress. Aven is overexpressed in acute myeloid and acute lymphoblastic leukemia where its expression has been associated with poor prognosis. Recently, Aven was shown to bind RNAs with G4 structures via its RGG/RG motif. The RGG/RG motif of Aven is arginine methylated by PRMT1 and this promotes association with the methylarginine interactors SMN and TDRD3. The arginine methylation and binding to SMN and TDRD3 is required for association of Aven with polysomes. In vitro binding and photocrosslinking immunoprecipitation assays revealed that Aven associates with MLL1 and MLL4 mRNAs. Interestingly, the G4 sequences bound by Aven were in the coding regions regulating their mRNA translation (Fig. 4C). Mechanistically, Aven recruits DHX36 onto the polysomes likely to facilitate unwinding of the G4 structures during translation. Depletion of Aven/DHX36 inhibits KMT2A/KMT2B translation, whereas luciferase assays with G-to-A mutagenesis revealed that both Aven and DHX36 are required to rescue translation in the presence of G4 motifs. Thus, Aven represents a G4 interacting protein that recruits DHX36 to unwind G4 structures containing mRNAs during translational elongation. These findings foster our understanding that RNA G4 binding proteins can play a key role in modulating mRNA translation.

Conclusions and perspectives

Increasing evidence shows that G-quadruplexes play essential roles in RNA metabolism. Advances in computational analyses and genome-wide sequencing have facilitated the characterization of RNA quadruplexes and their implication in translational control. Further advancements are required to confirm their occurrence in vivo. Moreover, the trans-acting factors that bind G4 structures are the key to understanding the role of G-quadruplexes in the regulation of mRNA fate. Nucleolin, FMRP, DHX36 and Aven represent the tip of the iceberg of trans-acting factors that recognize G4 RNA structures. Large scale analyses including HITS-CLIP-seq, SHAPE-seq, and HITS-RAP (high-throughput RNA affinity profiling) hold great promise for knowledge advancement as to the role of G4 structures and interacting proteins in vivo. It is possible that multiple RNA binding proteins recognize one G-quadruplex, and there could be functional interplay. The binding of different RNA binding proteins are likely to: 1) stabilize the G4 structures, 2) unwind the G4s by recruiting helicases, 3) act as chaperones to transport G4 containing mRNAs, and 4) serve as scaffold proteins to recruit other proteins or RNAs. A growing number of disease-related genes are regulated by G4 structures and their RNA binding proteins. This includes mRNAs encoding the tumor suppressor TP53, oncogene NRAS, oncogenes KMT2A/KMT2B, anti-apoptotic Bcl-2, FMR1, and the telomerase hTERT. This indicates that by affecting mRNA translation, splicing, and polyadenylation, G4 structures may be implicated in numerous human diseases, especially cancer. Thus, targeting G4 structures with synthesized small molecules is attractive to modify oncogene expression. Extensive studies have evaluated some “drug-like” molecules including the pyridostatins (PDS) , cationic porphyrins and derivatives and bisquinolinium compounds. Short antisense oligonucleotides affect RNA G4 folding and translation regulation of specific mRNAs and may eventually be of therapeutic potential. A new polyaromatic molecule, RGB-1, was recently shown to specifically stabilize RNA G-quadruplexes, but not DNA G-quadruplexes or other RNA structures. RGB-1 inhibits translation in mammalian cells and decreases NRAS expression, providing a new tool to understand G4 structures and therapeutic applications. Thus, a better understanding of the various contributions of G4 structures in the human transcriptome may provide important insights into strategies targeting G4s in molecular medicine.
  160 in total

1.  The 5' guanosine tracts of human telomerase RNA are recognized by the G-quadruplex binding domain of the RNA helicase DHX36 and function to increase RNA accumulation.

Authors:  Alec N Sexton; Kathleen Collins
Journal:  Mol Cell Biol       Date:  2010-12-13       Impact factor: 4.272

Review 2.  Making sense of G-quadruplex and i-motif functions in oncogene promoters.

Authors:  Tracy A Brooks; Samantha Kendrick; Laurence Hurley
Journal:  FEBS J       Date:  2010-07-29       Impact factor: 5.542

3.  Bulges in G-quadruplexes: broadening the definition of G-quadruplex-forming sequences.

Authors:  Vineeth Thachappilly Mukundan; Anh Tuân Phan
Journal:  J Am Chem Soc       Date:  2013-03-22       Impact factor: 15.419

4.  A G-quadruplex structure within the 5'-UTR of TRF2 mRNA represses translation in human cells.

Authors:  Dennis Gomez; Aurore Guédin; Jean-Louis Mergny; Bernard Salles; Jean-François Riou; Marie-Paule Teulade-Fichou; Patrick Calsou
Journal:  Nucleic Acids Res       Date:  2010-06-22       Impact factor: 16.971

Review 5.  The scanning mechanism of eukaryotic translation initiation.

Authors:  Alan G Hinnebusch
Journal:  Annu Rev Biochem       Date:  2014-01-29       Impact factor: 23.643

Review 6.  Targeting G-quadruplexes in gene promoters: a novel anticancer strategy?

Authors:  Shankar Balasubramanian; Laurence H Hurley; Stephen Neidle
Journal:  Nat Rev Drug Discov       Date:  2011-04       Impact factor: 84.694

7.  Structure-function studies of FMRP RGG peptide recognition of an RNA duplex-quadruplex junction.

Authors:  Anh Tuân Phan; Vitaly Kuryavyi; Jennifer C Darnell; Alexander Serganov; Ananya Majumdar; Serge Ilin; Tanya Raslin; Anna Polonskaia; Cynthia Chen; David Clain; Robert B Darnell; Dinshaw J Patel
Journal:  Nat Struct Mol Biol       Date:  2011-06-05       Impact factor: 15.369

8.  G-quadruplex structures are stable and detectable in human genomic DNA.

Authors:  Enid Yi Ni Lam; Dario Beraldi; David Tannahill; Shankar Balasubramanian
Journal:  Nat Commun       Date:  2013       Impact factor: 14.919

9.  New scoring system to identify RNA G-quadruplex folding.

Authors:  Jean-Denis Beaudoin; Rachel Jodoin; Jean-Pierre Perreault
Journal:  Nucleic Acids Res       Date:  2013-10-10       Impact factor: 16.971

10.  Exploring mRNA 3'-UTR G-quadruplexes: evidence of roles in both alternative polyadenylation and mRNA shortening.

Authors:  Jean-Denis Beaudoin; Jean-Pierre Perreault
Journal:  Nucleic Acids Res       Date:  2013-04-22       Impact factor: 16.971

View more
  44 in total

Review 1.  Functional 5' UTR mRNA structures in eukaryotic translation regulation and how to find them.

Authors:  Kathrin Leppek; Rhiju Das; Maria Barna
Journal:  Nat Rev Mol Cell Biol       Date:  2017-11-22       Impact factor: 94.444

Review 2.  Potential Roles for G-Quadruplexes in Mitochondria.

Authors:  Micol Falabella; Rafael J Fernandez; F Brad Johnson; Brett A Kaufman
Journal:  Curr Med Chem       Date:  2019       Impact factor: 4.530

Review 3.  G-Quadruplex Assembly by Ribosomal DNA: Emerging Roles in Disease Pathogenesis and Cancer Biology.

Authors:  Arindam Datta; Kevin J Pollock; Karen A Kormuth; Robert M Brosh
Journal:  Cytogenet Genome Res       Date:  2021-09-01       Impact factor: 1.941

4.  Assessing the Potential for DNA Quadruplex Formation in the Predatory Bacterium Bdellovibrio bacteriovorus.

Authors:  Lucille H Tsao; Sally Shepardson-Fungairiño; Hikari Murayama; Amelia Cecere; Elizabeth Wren; Megan Núñez
Journal:  Biochemistry       Date:  2022-09-15       Impact factor: 3.321

Review 5.  Action and function of helicases on RNA G-quadruplexes.

Authors:  Marco Caterino; Katrin Paeschke
Journal:  Methods       Date:  2021-09-10       Impact factor: 4.647

6.  G-quadruplexes Stabilization Upregulates CCN1 and Accelerates Aging in Cultured Cerebral Endothelial Cells.

Authors:  Brian Noh; Maria P Blasco-Conesa; Yun-Ju Lai; Bhanu Priya Ganesh; Akihiko Urayama; Ines Moreno-Gonzalez; Sean P Marrelli; Louise D McCullough; Jose Felix Moruno-Manchon
Journal:  Front Aging       Date:  2022-01-12

7.  Alternatively spliced variants of the 5'-UTR of the ARPC2 mRNA regulate translation by an internal ribosome entry site (IRES) harboring a guanine-quadruplex motif.

Authors:  Munir A Al-Zeer; Mariola Dutkiewicz; Annekathrin von Hacht; Denise Kreuzmann; Viola Röhrs; Jens Kurreck
Journal:  RNA Biol       Date:  2019-08-14       Impact factor: 4.652

8.  Ion Binding Properties and Dynamics of the bcl-2 G-Quadruplex Using a Polarizable Force Field.

Authors:  Brian D Ratnasinghe; Alexa M Salsbury; Justin A Lemkul
Journal:  J Chem Inf Model       Date:  2020-12-02       Impact factor: 4.956

9.  Sugar modulation of anaerobic-response networks in maize root tips.

Authors:  Maria-Angelica Sanclemente; Fangfang Ma; Peng Liu; Adriana Della Porta; Jugpreet Singh; Shan Wu; Thomas Colquhoun; Timothy Johnson; Jiahn-Chou Guan; Karen E Koch
Journal:  Plant Physiol       Date:  2021-03-15       Impact factor: 8.340

10.  Constrained G4 structures unveil topology specificity of known and new G4 binding proteins.

Authors:  A Pipier; A Devaux; T Lavergne; A Adrait; Y Couté; S Britton; P Calsou; J F Riou; E Defrancq; D Gomez
Journal:  Sci Rep       Date:  2021-06-29       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.