Literature DB >> 34977886

G-Quadruplexes in Neurobiology and Virology: Functional Roles and Potential Therapeutic Approaches.

Jinglei Xu1, Haiyan Huang2, Xiang Zhou1,2.   

Abstract

A G-quadruplex (G4) is a four-stranded nucleic acid secondary structure maintained by Hoogsteen hydrogen bonds established between four guanines. Experimental studies and bioinformatics predictions support the hypothesis that these structures are involved in different cellular functions associated with both DNA and RNA processes. An increasing number of diseases have been shown to be associated with abnormal G4 regulation. Here, we describe the existence of G4 and then discuss G4-related pathogenic mechanisms in neurodegenerative diseases and the viral life cycle. Furthermore, we focus on the role of G4s in the design of antiviral therapy and neuropharmacology, including G4 ligands, G4-based aptamers, G4-related proteins, and CRISPR-based sequence editing, along with a discussion of limitations and insights into the prospects of this unusual nucleic acid secondary structure in therapeutics. Finally, we highlight progress and challenges in this field and the potential G4-related research fields.
© 2021 The Authors. Published by American Chemical Society.

Entities:  

Year:  2021        PMID: 34977886      PMCID: PMC8715485          DOI: 10.1021/jacsau.1c00451

Source DB:  PubMed          Journal:  JACS Au        ISSN: 2691-3704


Introduction

G-quadruplexes (G4s) are noncanonical guanine-rich secondary structures that exist in both DNA and RNA. The four guanine nucleotides bound through Hoogsteen hydrogen base-pairing to form a planar tetrad, called G-quartet, and two or more tetrads then interact through π–π stacking to form a G-quadruplex (Figure A). G4s are stabilized by monovalent or divalent cations, typically potassium ions, under physiological conditions.[1] The orientation of strands and isomerization of glycosidic bonds lead to the polymorphism of G4s with parallel, antiparallel, and hybrid topological structures. The most fundamental differences between RNA and DNA G4s are the presence of uracil instead of thymine and a ribose sugar instead of a deoxyribose sugar. Combining with the presence of 2′-hydroxyl group in the ribose sugar inducing steric hindrance, resulted in anticonformation of glycosidic bond. The presence of 2′-hydroxyl group also enhanced stability for RNA G4.[2,3] Consequently, nearly all naturally occurring RNA G4s adopt a parallel topology, except for a fluorogenic RNA, Spinach, contains a G4 structure with antiparallel folding topology.[4] While DNA G4s adapt parallel, antiparallel, or hybrid conformations (Figure B). The distinction between DNA and RNA G4s not only displays in genome localization but also shows in regulation processing. Detailed descriptions of G4 structure and conformation are provided in other reviews.[5]
Figure 1

The G-quadruplex structure. (A) G-quartet, a planar array of four guanines stabilized by pairwise hydrogen-bonding and coordination by a cation at the center of the tetrad (M+). Preferential binding of monovalent cations to G4s and consensus sequence of G4 are shown below. (B) Differences between DNA and RNA G4 structure. Arrows indicate strand polarity. The composition of the G4 structure is telomere sequence stabilized by K+. The 2′-hydroxyl group is highlighted in red for clarity.

The G-quadruplex structure. (A) G-quartet, a planar array of four guanines stabilized by pairwise hydrogen-bonding and coordination by a cation at the center of the tetrad (M+). Preferential binding of monovalent cations to G4s and consensus sequence of G4 are shown below. (B) Differences between DNA and RNA G4 structure. Arrows indicate strand polarity. The composition of the G4 structure is telomere sequence stabilized by K+. The 2′-hydroxyl group is highlighted in red for clarity. Genome-wide mapping of G4s has been achieved by computational prediction and experimental tools. On the basis of the consensus sequence features of canonical G4s, “G3-N1–7G3-N1–7G3-N1–7G3 (where N = A, C, G, T or U)”, researchers have developed many algorithms for computational prediction of G4s. These in silico analyses include Quad-Parser,[6] QGRS Mapper,[7] G4P Calculator,[8] QuadBase,[9] cGcC score,[10] and G4Hunter.[11] Considering the lack of predictions for other noncanonical G4 structures (long loops, bugles, 2 quartets), more recent computational methods added flanking sequence variants[12] and used a machine learning program[13] for better analysis. The results revealed that G4s are enriched in promoters, CpG islands, 5-untranslated regions (5′-UTRs), and nuclease-hypersensitive sites,[6,8,11] suggesting a regulatory role for G4s in key biological processes. Experimental tools for G4 identification and analyzing their distribution in a high-throughput manner are implemented with three typical approaches: reverse transcription stalling, chemical reactive sequencing, and antibody immunoprecipitation. In the presence of potassium ions and/or G4-stabilizing ligands, G4s in DNA or RNA can cause DNA polymerase (G4-seq)[14] or reverse transcriptase stalling (rG4-seq),[15] which is combined with next-generation sequencing to enable the detection of G4s. G4-seq identified more than 700 000 G4s in DNA in the human genome, and rG4-seq showed that G4s were distributed in more than 3000 human mRNAs. G4-specific antibodies have been utilized in chromatin immunoprecipitation (ChIP) and immunohistochemical analyses to detect G4s in endogenous chromatin, and ChIP-seq peak analysis identified a positive and dynamic relationship between G4s and elevated transcriptional activity.[16] This following study identified a lower number of G4 structures (∼10 000 G4 ChIP–seq peaks) than the previous G4-seq result, reflecting a suppressive role of heterochromatin for G4 formation.[17] Two chemical probes that interact with unprotected residues of RNA G4s have been applied to investigate the folding state of G4s in the cellular environment. One is dimethyl sulfate sequencing (DMS-seq). Surprisingly, DMS-seq analysis of mouse, human, and yeast cells suggests that most RNA G4 regions are unfolded.[18] The other method is selective 2’–OH acylation analyzed by primer extension (SHAPE). The SHAPE reagent reacts with all four nucleotides, which provides structural information on RNA at single-nucleotide resolution. The combination of SHAPE with lithium ions (SHALiPE) can probe RNA G4s in vitro.[19] Structure profiling of plant cells using SHALiPE showed that RNA G4s are mainly folded in vivo.[20] However, due to the complex cellular context, dynamic structural conversion, different transcript levels and heterogeneity between cells, improved methods must be developed to further confirm the folding state of RNA G4s in vivo.[21] Nevertheless, cellular G4 structures have been detected using NMR spectroscopy,[22,23] various fluorescent probes based on G4 ligands and aptamers. Other reviews provide extensive detailed description of G4 biosensing and bioimaging.[24−26] Taken together, the abundance and functional distribution of G4s, revealed using these methods, suggest that G4 motifs may confer special regulatory properties to cells. In this Perspective, we summarize and assess the recently reported emerging roles of G4s in pathological processes of neurobiology and virology. An understanding of these roles not only provides critical insights into fundamental molecular mechanisms that control nervous system function and the viral life cycle but also provides opportunities to identify novel therapeutic targets to treat these diseases. We also discuss the recent progress in G4-targeting techniques for biomedical applications, with an emphasis on the emerging roles of G4s for examination in future studies.

FUNCTIONS OF THE G-QUADRUPLEX IN VIVO

Computational analyses of various organisms allows the prediction of putative G-quadruplex sequences (PQS), and it has revealed that the localization of PQS is nonrandom: G4 motifs usually colocalize with functional regions of the genome and are highly conserved among different species.[27] Given these results, G4 motifs may have positive functions in cells. The in vivo functions of G4s include biological processes such as telomere maintenance, DNA replication, transcriptional regulation, alternative splicing, and translational regulation. Many relatively recent studies have reported that G4 plays a functional role in phase separation,[28] epigenetic control,[29,30] and mRNA localization.[31] For a more comprehensive review on the overall topic, we refer you to other articles.[3,32,33] The multiple functions of G4s provide an important link between G4s and many diseases, which has increased research interest in the past few years, particularly in cancer. While the biological role of G4s in cancer concentrated on their regulation of telomere maintenance, gene expression, and genome duplication, making them a potential therapeutic target against tumor cells.[34,35] However, accumulating evidence suggests important functional roles for G4 in neurological and viral diseases.[36,37] In the development and homeostasis of normal tissue, G4s have been implicated in more RNA-related mechanisms, such as mRNA splicing and localization, even regulating noncoding RNAs. Beyond different G4-related pathogenetic mechanisms of these diseases, the emerging role of G4s as therapeutic intervention of neurological disorder and antiviral is less explored. Here, we focus on recent progress in these fields, providing clarification of G4 alternative regulation from molecular mechanisms to disease states.

THE ROLE OF THE G-QUADRUPLEX IN NEUROLOGY

In recent years, G4s have been deemed to function as regulatory structural elements in many neurodegenerative diseases. Two primary mechanisms have been proposed for the contribution of G4s to neurological disorders. (i) Toxic transcribed RNA foci and dipeptide repeat proteins are generated and accumulated through the expansion of G-rich sequences (Figure B). (ii) Gene mutations affect pathological gene expression and G4 binding protein affinity, along with downstream regulation of defects (Figure A). Here, we summarize the established mechanisms of several typical neurodegenerative diseases, such as amyotrophic lateral sclerosis (ALS), frontotemporal dementia (FTD), fragile X syndrome (FXS), Alzheimer’s disease (AD), and Parkinson’s disease (PD), which have been proven to be related to G4 formation, with a discussion of the potential functional roles of G4s and future research directions. The overview of G4 in these five neurodegenerative diseases is shown in Table .
Figure 2

Proposed molecular mechanisms of G4-related neurological pathogenesis. (A) Abnormal repeat sequence in neuronal gene folded into G4 structure, inducing R-loop formation, RNA polymerase stalling, and ribosome progression stalling, resulted in genome instability, aberrant transcriptions and gene dysregulation. Multiple G4 binding protein (G4BP) involved in pathogenetic processes. (B) Transcribed RNA G4 can lead mRNA mislocalization, sequester RNA binding protein and regulate alternative splicing, cause RNA foci formation through liquid–liquid phase separation (LLPS), toxic protein aggregation through G4-induced repeat-associated non-AUG (RAN) translation, splicing dysregulation, and nucleolar stress.

Table 1

Overview of G-Quadruplexes in Neurological Diseases: Position, Proposed Function, and Related Diseases

genepositionbiological role of G4diseasemechanisma
C9orf72intron 1decrease replication efficiency[48]frontotemporal dementia/amyotrophic lateral sclerosis (FTD/ALS)(i)
  impede transcription[44] (i)
  influence mRNA splicing[49] (ii)
  inducing RNA foci formation[38,44,50] (i)
tiRNA repress translation[51] (ii)
FMR15′-UTRrepress translation,[52,53]fragile X syndrome (FXS)(i)
  reducing RAN translation,[54]  
  mRNA mislocalization[54,55]  
 exon 15influence mRNA splicing[52] (i)
PSD953′-UTRrepress translation[56,57] (ii)
BACE1exon 3influence mRNA splicing[58,59]Alzheimer’s disease (AD)(ii)
APP3′-UTRrepress translation[60] (ii)
ADAM105′-UTRrepress translation[61] (ii)
APOEexon 4control gene expression[62] (ii)
MIR1229 control miRNA maturation[63,64] (i)
SNCA5′-UTRrepress translation[65]Parkinson’s disease (PD)(ii)
  reducing RAN translation[65] (i)

Mechanism (i) represents G4 intrinsic affect, (ii) represents G4-related regulation processes. Detailed mechanisms are summarized in the text.

Mechanism (i) represents G4 intrinsic affect, (ii) represents G4-related regulation processes. Detailed mechanisms are summarized in the text. Proposed molecular mechanisms of G4-related neurological pathogenesis. (A) Abnormal repeat sequence in neuronal gene folded into G4 structure, inducing R-loop formation, RNA polymerase stalling, and ribosome progression stalling, resulted in genome instability, aberrant transcriptions and gene dysregulation. Multiple G4 binding protein (G4BP) involved in pathogenetic processes. (B) Transcribed RNA G4 can lead mRNA mislocalization, sequester RNA binding protein and regulate alternative splicing, cause RNA foci formation through liquid–liquid phase separation (LLPS), toxic protein aggregation through G4-induced repeat-associated non-AUG (RAN) translation, splicing dysregulation, and nucleolar stress.

DNA G4 Regulates Neural Gene Transcription

The most fundamental molecular basis of neurological disorders is repeat sequence expansion in neuronal genes. For example, a familial mutation causing a G-rich sequence expansion in chromosome 9 open reading frame 72 (C9orf72) was identified as the prevailing genetic cause of two related neurodegenerative syndromes, FTD and ALS.[38,39] Abnormal expansion of the hexanucleotide repeat GGGGCC (G4C2) in the noncoding intron of the C9orf72 gene is usually present in approximately 2–23 units in healthy individuals,[40] but patients with these two diseases have pathogenic G4C2 repeats ranging from 500 to 3500 units.[41,42] Several studies of G4C2 repeat DNA showed that disease-relevant DNA repeats form a mixture of parallel and antiparallel DNA G4 structures in C9orf72 impede transcription,[43−45] inducing the generation of abortive transcripts and resulting in the loss of C9orf72 function.[44] Researchers have shown that the G4C2 repeat also forms transcriptionally induced DNA·RNA hybrids, known as R-loops. An in vitro study showed that R-loops caused RNA polymerase stalling and the accumulation of abortive C9orf72 transcripts. Recent studies implied that G4 motifs in the nontemplate DNA strand were induced by transcription initiation and that R-loop formation occurred prior to G4 formation and was stabilized by G4s.[46,47] The mechanisms of G4 and R-loop regulatory feedback in the C9orf72 gene still require further investigation in the future. In addition, DNA G4s have also been shown to the inhibit transcription of BRCA1, a gene encoding a critical DNA repair factor that has also been linked to Alzheimer’s disease (AD). Two G4 structures in the Brca1 gene promoter are stabilized by PDS and TPMyP4, promoting neurotoxicity and the formation of DNA double–strand breaks (DSBs) in cultured neurons.[66] Recently, a ChIP-seq analysis showed that BMI1, a Polycomb group protein associated with sporadic late-onset AD, is enriched at heterochromatin regions containing putative G4 DNA sequences in neuronal cells. The accumulation of DNA G4s reduces BMI1 expression levels in AD neurons, and G4 structures essentially form only in actively transcribed LINE1s elements. This result suggests that G4 structures alter gene expression and are drastically reduced in AD neurons by more tightly compacting heterochromatin.[67] The G4 motifs in the C9orf72 gene also exhibit close relationships with genome stability and epigenetic regulation. G4C2 repeats lead to a decrease in replication efficiency and an increase in instability in a length-dependent manner.[48] Although G4 has been proven to cause replication fork stalling and alter epigenetic modifications, an understanding of the organizational role of G4 as a key regulator will improve therapeutic targeting in the future.

RNA G4 Regulates Neural mRNA Localization and Gene Expression

In addition to the proposed mechanisms described above that repeat expansions lead to neural gene loss of function, two other potential mechanisms of neurological disease pathogenesis are the toxicity of nuclear RNA foci and toxic dipeptide repeat (DPR) proteins generated by repeat-associated non-AUG (RAN) translation of repeat-containing RNA. The structural role of G4s in these mechanisms is either disrupting the interaction between G4 binding proteins and disease-associated RNAs or recruiting G4 binding proteins to evaluate DPR protein levels. Both full-length and abortive transcripts of the G4C2 repeat expansion fold into RNA G4s, and RNA G4 accumulation leads to an naturally liquid–liquid phase separation (LLPS) process, forming nuclear aggregates, or RNA foci, in the brains of patients with FTD/ALS.[38] These RNA G4s are specifically recognized by nucleolin (NCL). An in vivo study indicated that these G4-containing toxic nucleic acids cause NCL sequestration and mislocalization, leading to functional defects associated with nucleolar stress in patients with ALS/FTD.[44,50] In addition, several independent studies showed that G4C2 RNA repeats sequester several other proteins including splicing factors (hnRNP A1, hnRNP H); the mRNA export receptor ALYREF; the single strand DNA binding protein Pur α; the RNA-binding proteins hnRNPA3, hnRNP-U, hnRNP-K; and the RNA editing factor ADARB2. Extensive and detailed descriptions are provided in another review.[49] In patients with FXS, early studies estimated that approximately 4% of brain mRNAs bind to FMRP,[68] while subsequent high-throughput methods identified several thousand mRNAs that potentially bind to FMRP.[69] Further biochemical assays only validated that G4s distinctly regulated the binding activity of FMRP to its own Fmr1 mRNA, as well as to the MAP1B, APP, Task3, and PP2Ac mRNAs.[52,70] The G4s located in the 5′-UTR of these mRNAs are recognized and stabilized by FMRP, preventing ribosome scanning and repressing their translation. Thus, mRNAs with G4 motifs are authentic targets of FMRP. Nevertheless, FMRP might also interact indirectly with some mRNAs that do not contain G4s by either binding to adapter proteins or noncoding RNAs that recognize these mRNAs. However, some mRNAs harboring G4s in the 3′-UTR are also binding targets of FMRP, including the Sem3F, NR2B, and Shank1 mRNAs.[53,71,72] These mRNAs have been shown to form one or more G4 structures in their 3′-UTR s that are recognized specifically by FMRP, suggesting a common mechanism of recognition. Conversely, FMRP recognizes and unwinds the G4 structure of the Sema3F mRNA through the RGG box domain.[71] FMRP binds to two stable intramolecular G4s within the Shank1 mRNA and probably regulates dendritic mRNA translation and synaptic protein synthesis.[53] A very recent study further investigated the G4s in the 3′-UTR s of these transcripts by analyzing FMRP CLIP data, and the results revealed that the RGG domain of FMRP and the G4s in target mRNAs are important for efficient mRNA transport to neurites.[55] Parkinson’s disease (PD) is a common neurodegenerative disorder caused by the degeneration of neurons in the area of the brain that controls movement.[73] α-Synuclein (SNCA) accumulation has been reported to play a central role in the pathogenesis of PD. Bioinformatics analysis showed that three nonoverlapping G4 motifs are present in the 5′-UTR of SNCA, and mutations in two of the G4s promoted translation by either enhancing transcription or stabilizing the SNCA 5′-UTR reporter mRNA. Hence, these predicted G-q motifs, if preserved in the native SNCA mRNA, are likely to mediate a negative response in SNCA mRNA translation.[65] Taken together, these results identified G4 as an authentic target for PD therapy, but the effect of G4 regulation on PD remains to be elucidated. The RAN translation products of FXTAS, a polyglycine-containing protein (FMRpolyG), are critical for the formation of inclusions observed in the brains of patients with FXTAS.[74] According to a recent study, FMRpolyG preferentially binds to CGG RNA G4s, which promotes the liquid-to-solid transition and aggregation of FMRpolyG, thereby eliciting neuronal dysfunction.[54] A recent study revealed that several G4 sequences in the 5′-UTRs of PRKN and VPS35 mRNAs associated with Parkinson’s disease were folded in vitro by performing biochemical assays. Moreover, the authors identified a new G4 binding protein, GNL1, which specifically interacts with G4s in the VPS35 and PRKN mRNAs.[75] Alzheimer’s disease (AD) is the most common age-related neurodegenerative disease. The pathogenic mechanism of AD is typically characterized as a protein misfolding disease in which abnormal plaques form due to the accumulation of tau and β-amyloid (Aβ) proteins.[76] Aβ protein is produced by cleavage of the Aβ precursor protein (APP) by the β-site APP cleaving enzyme 1 (BACE1) and γ-secretase.[59] Biochemical assays revealed that a G-rich sequence within exon 3 of BACE1 folds into a G4 structure, and this G4 prevented alternative splicing by hnRNP H. Following hnRNP H knockdown, a decrease in the level of the full-length BACE1 mRNA and a decrease in Aβ production were observed.[58] Additionally, G4-related regulation of several other genes contributed to the pathogenetic ways for AD, including APP,[60]ADAM10,[61] and APOE.[62]

The Role of G4 in Regulating Noncoding RNAs

Sortilin-related receptor 1 (SORL1) is responsible for the processing and trafficking of Aβ. The levels of mature microRNA-1229-3p (miR-1229-3p) have been shown to regulate SORL1 translation in neurons.[77] Various biophysical techniques have shown that premature microRNA-1229 (pre-miR-1229) forms a G4 structure that coexists in equilibrium with the canonical hairpin structure, regulating the production of mature miR-1229-3p.[63] Similarly, the presence of the G4 structure in pre-miR-92b regulates the maturation of miR-92b, as G4 impedes Dicer-mediated maturation both in vivo and in vitro.[64] The PSD-95 mRNA G-rich region folds into G4, and FMRP binds to this region that also encompasses the binding site for miR-125a. This G4 adopts multiple conformations, some increasing the accessibility of the miR-125a-binding site, suggesting that G4 regulates the access of miR-125a to its binding site.[56] A subsequent study showed that phosphorylated FMRP functions as a switch that regulates the translation of the PSD-95 mRNA together with miR-125.[57] These findings indicate a potential role of G4-related miRNA regulatory pathways in the pathogenesis of neurological disorders. Translation interfering tRNAs (tiRNAs) are tRNA fragments formed under stress conditions that may have roles in cancer progression. A G4 structure in tiRNA-Ala or tetramolecular G4s (formed from four individual tiRNAs) appears to mediate stress granule formation[51] and translation inhibition by interacting with Y-box binding protein 1, subsequently displacing eIF4F from mRNAs. DNA analogues of tiRNA-Ala may trigger a neuroprotective response in motor neurons, suggesting new possibilities for interventions in neurodegenerative diseases. These studies proved that G4 represents a new avenue for neurodegeneration and brain aging research, and more G4-related pathogenic mechanisms of neurological disorders might require further study.

THE ROLE OF G-QUADRUPLEX IN VIROLOGY

Viruses, which are simple acellular organisms mainly composed of nucleic acids genome and capsids, cause infectious diseases. Virus-borne infectious diseases tend to spread abruptly over wide geographical areas, such as the recent outbreak of coronavirus pneumonia 2019 (COVID-19) caused by the novel coronavirus SARS-CoV-2.[78] In addition to humans, G4 structures are also proposed to fold in viral life cycle (Figure ). Various bioinformatics tools are now adopted to predict PQS in viruses. Most of the reported viral G4s are identified first by computational prediction and then experimental tools. Two research groups have analyzed PQS in the genome of all known viruses that with the viral genome available in NCBI database.[79,80] They found that the distribution of G4s in viruses is statistically relevant, suggesting a specific biological role of G4s in viruses. The number of PQS differ across evolutionary groups and families,[79] but the distribution of PQS is highly conserved among viral strains, despite the high recombination rates of viruses.[80]
Figure 3

Role of G4 structures in viruses. Traditionally, the viral life cycle is depicted as four major steps: attachment and entry into the target cell (step 1 and 2), replication of the viral genome (step 3 and 4), translation of viral proteins (step 5 and 6) and assembly of the viral genome into infectious progeny, and egress to infect the next target cell (step 7 and 8).[81] (A) Formation of a G4 in the genome (or previral DNA) of viruses, can regulate the genome replication (or revers transcription), gene transcription, and genome integration of viruses. (B) Formation of a G4 in the mRNA of viruses can modulate the translation process, and the interaction of viral mRNA G4 with host protein or its translated protein self regulated the viral latency. (C) Intermolecular G-quartet structure increase the recombination rate of viral genome. (D) The interaction of G4s and G4-binding proteins between viruses and host, which modulate the expression of viral protein and cause viral latency.

Role of G4 structures in viruses. Traditionally, the viral life cycle is depicted as four major steps: attachment and entry into the target cell (step 1 and 2), replication of the viral genome (step 3 and 4), translation of viral proteins (step 5 and 6) and assembly of the viral genome into infectious progeny, and egress to infect the next target cell (step 7 and 8).[81] (A) Formation of a G4 in the genome (or previral DNA) of viruses, can regulate the genome replication (or revers transcription), gene transcription, and genome integration of viruses. (B) Formation of a G4 in the mRNA of viruses can modulate the translation process, and the interaction of viral mRNA G4 with host protein or its translated protein self regulated the viral latency. (C) Intermolecular G-quartet structure increase the recombination rate of viral genome. (D) The interaction of G4s and G4-binding proteins between viruses and host, which modulate the expression of viral protein and cause viral latency. The reports that describing the presence of G4s in virus genomes have increased substantially in the recent years, and are attracting increasingly attention as a potential target for treatment and diagnosis. Several reviews have been reported focusing on the G4s in viruses.[81−84] Here, we provide an insight into the biological roles of G4s in viruses with an emphasis on the recently reported examples (all belong to human viruses), and discuss the potential of utilizing G4s as antiviral targets. The overview of G4 in viruses is summarized in Table .
Table 2

Summury of G-Quadruplexes in Viruses: Name/Genome, Position, Proposed Biological Role of G4s

virus/genomepositionbiological role of G4s
HSV-1/ds DNAinverted repeats region[88,93]regulating viral replication and transcription[8892]
packaging signal (pac1)[89]virus entry[92]
immediate early promoters[90,91]determine the landscape of recombination[111]
unique long (UL) region[111]
HPV/ds DNAlong control region (LCR)[94]regulating viral transcription[94]
 L1, E1, and E4 coding regions[94] 
CMV/ds DNApromoter or gene regulatory regions[112]affecting gene expression[112]
KSHV/ds DNAterminal repeats[113]alter latent DNA replication and episomal persistence[113]
 LANA mRNA[114]causing latency[113,114]
HHV-6/ds DNAdirect repeat regions[115]modulating the interaction of viral genome into host genome to cause latency[115]
EBV/ds DNAEBNA1 mRNA[116,117]regulating viral replication and transcription[116118]
  modulating immune evasion[119,120]
HIV-1/(+) ssRNAlong terminal repeats[97101,9498]modulate the interaction of viral genome into host genome[97100]
 U3 region[97,100,101]regulating viral transcription[97,100,101]
  regulating revers transcription[101,121]
  causing latency[99,122125]
SARS-CoV/(+) ssRNAnucleocapsid gene[108]repressing translation[121]
HCV/(+) ssRNAHCV core gene[18,126]suppressing viral gene eplication[126]
IAV/(−) ssRNApromoter region of TMPRSS2[127]reducing viral gene expression[127]
HBV/gapped dsDNAenvelope gene promoter[128]regulating transcription and virion secretion[128]
 precore promoter region of cccDNA[129]influencing HBV replication[129]

Viral G4s Regulate Gene Replication and Expression

Under physiological conditions, the stable G4 structure leads to knot formation in the genome, obstructing replication, transcriptional (Figure A), and translational machinery (Figure B), thereby playing a major role in gene expression regulation. This pattern of regulation is reported in almost all kinds of virus. The Herpesviridae family, which contains long linear dsDNA genomes, is extremely widespread and responsible for the onset of several diseases including different types of cancer.[85] Most of the studies conducted on the role of G4s in human herpesviruses (HHVs) focused on herpes simplex virus type 1 (HSV-1), as the genome is characterized by a 68% GC content, with multiple and highly stable G4-forming sequences.[86,87] G4 structures form in the inverted repeats region,[88] the packaging signal (pac1),[89] and the immediate early promoters,[90] have been demonstrated to may carry out regulatory functions in HSV-1. Furthermore, the HSV-1 viral protein named ICP4 was reported to bind and unfold G4s in its own promoter, thereby regulating HSV-1 viral gene expression via a G4-mediated interaction.[91] A recently reported article showed that TMPyP4, a porphyrin known to interact with G4s, displayed significant anti HSV-1 activity, which, however, was independent of inhibition of virus DNA replication or entry.[92] In addition, the stable formation of multiple clustered G4s in the repeat sequences of HSV-1 was visualized in HSV-1 infected cells using immunofluorescence and immune electronmicroscopy.[88,93] Human papillomavirus (HPV), which is a small DNA virus with double-stranded closed loop DNA genome, led to the onset of cervical cancers and several oropharyngeal cancers. G-rich regions with the capacity to form G4s are located in the long control region (LCR) and in the coding regions of the E1, E4, and L2 proteins in eight HPV types,[94,95] all belonging to the “high risk” HPV subgroup. Therefore, G4s might play a regulatory role in these viruses. Human immunodeficiency virus, type 1 (HIV-1) is a retrovirus that cause AIDS, which is one of the most serious public health challenges.[96] The genome of HIV-1 consists of two identical or nearly identical (+) ssRNA. Studies of G4s in the viral genomic RNA and the integrated provirus of HIV-1 have been quite productive. Several G-rich sequences were reported to form G4s in the long terminal repeats (LTRs) that enable integration of the HIV-1 provirus into the host genome.[97−100] Three overlapping quadruplex motifs were identified in the U3 region of LTRs.[101] Furthermore, G4s formed in the LTR might function as silencing elements to regulate viral transcription.[99] In addition, a bioinformation analysis of the Retroviridae family showed that most retroviruses contain highly conserved PQS in their promoter regions.[102] The severe acute respiratory syndrome coronavirus (SARS-CoV) is a member of the Coronaviridae family and has a (+) ssRNA genome, which can lead to severe acute respiratory syndrome. Since the COVID-19 pandemic, the reports of G4s in coronaviruses have been highly productive. It has been predicted that all seven human coronaviruses harbor G-quadruplex sequences, and conserved G-quadruplex sequences in SARS-CoV and SARS-CoV-2 were analyzed and verified.[103−107] Recently, Qu et al.[108] searched for a small-molecule drug therapy by targeting the SARS-CoV-2 RNA secondary structure to fight the COVID-19 pandemic and found that PQSs located in the coding sequences region of nucleocapsid protein, named RG-1 and RG-2, are able to fold into G-quadruplexes in vitro. Furthermore, RG-1 actually forms the G4 structure in live cells, and the G4 ligand PDP stabilized RG-1 G4 and significantly reduced the level of nucleocapsid protein by inhibiting its translation both in vitro and in vivo. Thus, RG-1 G4 might be a promising therapeutic target for SARS-CoV-2. The influenza A virus (IAV) consist of a (−) ssRNA genome, causing seasonal epidemics which can be deadly pathogens to humans. Bioinformation analysis in H1N1 influenza genomes reveals that PQS were identified in all genomes of the recently emerged genotype.[109,110] A guanine-rich tract in the promoter region of TMPRSS2 gene, which is the essential protease of IAV, was identified to form G-quadruplex in the presence of potassium ions. More importantly, compounds that are able to stabilize G4s can down-regulate TMPRSS2 gene expression as well as protein levels, and consequently suppress influenza A virus propagation in vitro.[127] Viral G4s regulated gene replication and transcription have also been reported in the genome of human cytomegalovirus (CMV),[112] Kaposi’s sarcoma associated herpesvirus (KSHV),[113] and hepatitis B virus (HBV).[128,129] In addition, PQS in the genome of HHVs is conserved in regulatory and repeated regions, corroborating that G4s are functional elements regulating the HHVs life cycle. Treatment of HSV-1-infected cells with G4 ligands such as BRACO-19 and TMPyP4, inhibit virus production.[88,92] Consequently, G4s may represent efficient pharmacological targets and alternatives to the current antiherpetic therapies.

G4s Involved in The Evolution of Viruses

The bioinformation analysis reveal that the PQS frequency, motif size, loop length, and nucleotide compositions show a correlation between the genome of virus and host, suggesting a coevolution of virus and host.[79,130,131] HIV-1 consists of two copies of (+) ssRNA that dimerize by forming hairpin loops in the dimerization site (DIS). Several G-rich sequences have been identified in proximity to DIS, and the DIS is associated with dimerization and primer-strand transfer during reverse transcription, events that promote recombination.[121,132] Even though there’s still little known about the mechanisms involved in dimerization of viral genomes, the increased rate of recombination in this region is very likely to derive from the presence of intermolecular G-quartet structure (Figure C).[111] The investigation of the G4-motifs in recombination among HSV-1 found that the PQS of the genome are enriched in recombination breakpoints and in regions flanking the recombination breakpoints. The terminal guanosine of G4 clusters at the boundaries of unique long (UL) region represent the most common breakpoint among the HSV-1 recombinants.[133] On the basis of the accumulating evidence, G4s in these regions might play a role in the evolution of viruses, suggesting that RNA G4s are fundamental for the evolution of viruses.

The Interaction of G4s and G4-Binding Proteins between Viruses and Host (Figure D)

Human herpesvirus 6A (HHV-6A) is a double-stranded DNA virus belong to the subfamily Betaherpesvirinae, which is the pathogen of the febrile illness roseola infantum. The HHV-6 genome is characterized by the presence of variable-length telomere-like repeat regions at its termini region, which integrate into the human chromosome at telomeres by homologous recombination. The stabilization of G4s by BRACO-19 affect the ability of HHV-6A to integrate its genome into the host chromosomes.[115] Incubation of telomerase-expressing cells with BRACO-19 caused a significant reduction in the HHV-6A integration frequency; in contrast, it had no effect on the HHV-6 integration frequency in U2OS cells that lack telomerase activity. Thus, cellular G4 formation in telomeres is important for efficient chromosomal integration of HHV-6A. Hepatitis C virus (HCV) consists of a single-stranded, positive-sense RNA genome approximately 9600 nucleotide bases in length, is a representative single-stranded RNA virus. HCV infection is a major cause of human chronic liver disease and hepatocellular carcinoma. The core gene of HCV contains a G4 RNA structure,[18] and a cellular protein, nucleolin (NCL), which was reported to bind and stabilize the HCV core RNA G4 structure.[126] HCV infection induces NCL mRNA and protein expression. While NCL suppresses wild-type viral replication, silencing of NCL substantially increases viral RNA replication. This finding provides new insights that NCL may function as a host factor for antiviral innate immunity, and binding of cellular NCL with the viral core RNA G4 structure is involved in suppressing HCV replication. Additionally, cellular nucleolin modulating EBNA1 mRNA G4s was reported, which directly mediates Epstein–Barr virus immune evasion.[119] The SARS-unique domain (SUD), which is exclusively present in the nonstructural protein (nsp3) of SARS-CoV, is postulated to contribute to the higher pathogenicity of SARS-CoV than other human coronaviruses.[134] Interestingly, the SUD was reported to preferentially bind G4-forming oligonucleotides in the 3′-UTR of mRNAs encoding host cell proteins implicated in the regulation of several processes, such as apoptosis and signal transduction.[135] In addition, human cellular protein CNBP, which is a reported G4 helicase, was reported to bind to two PQSs in the positive-sense (+gRNA) and negative-sense (-gRNA) RNA of SARS-CoV-2.[136] Therefore, the G4 binding protein/G4 interaction between viruses and host cells is speculated to be involved in the modulation of genome replication and the host cell response to viral infection.

G4s Play a Role in Viral Latency

The latency program of viruses allows them to survive inside the host and protects them from the immune surveillance of the host. Interestingly, G4s in mRNAs of EBV and KSHV were reported to modulate the mechanism of immune evasion with two different mechanisms, suggesting that G4s may play a role in the strategies of viral latency.[114,116,117,119] The first is that G4s in mRNA repress the translation of viral proteins, some of which exert immunomodulatory effects by restricting antigen presentation to cytotoxic T cells, allowing the virus to persist in infected cells. EBNA1 is a virus-encoded protein that is critical for the replication and maintenance of the genome during latency in proliferating cells. The EBNA1 mRNA itself is able to fold into parallel G4s.[117] The destabilization of EBNA1 mRNA G4s using antisense oligonucleotides increases EBNA1 mRNA translation. In contrast, pretreatment with a G4–stabilizing small molecule reduces EBNA1 synthesis.[116,117] Destabilization of these G4 structures limits both the presentation of MHC class I-restricted CD8+ T cell epitopes by CD11c+ dendritic cells in draining lymph nodes and early priming of antigen-specific CD8+ T-cells.[120] In addition, cellular nucleolin, which was reported to interact with G4s in EBNA1 mRNA, was also involved in the latency of EBV.[119] The KSHV is a human gamma herpesvirus that has been implicated in several lymphoproliferative diseases and is responsible for AIDS associated morbidities and mortalities. The genome of KSHV is organized into a 137 kb long unique region, flanked by the terminal repeats, which are rich in G residues and able to form stable G4s, both in the forward and reverse strands.[113] The latency-associated nuclear antigen 1 (LANA1) protein is the master regulatory protein of KSHV latency. It was reported that host cell protein hnRNP A1 regulate the translation of LANA mRNA through binding to the mRNA G4 structures. These findings highlight the importance of G4s within virally encoded transcripts as unique regulatory signals for translational control and immune evasion. The second is that the viral latency related proteins with the ability to bind to G4 structures is causing viral latency. KSHV achieves latent infection by tethering its epigenome to the host chromosome by LANA, and LANA thus self-regulates its expression by mRNA sequestration in the nucleus and competing with hnRNP A1 for association with G4s at the LANA mRNA.[114] It has been reported that EBNA1 recruits the cellular origin recognition complex (ORC) through a G4 structure-related mechanism;[118] G4 interacting moleculars disrupt the interaction between EBNA1 and ORC, inhibit EBNA1-dependent stimulation of viral DNA replication, and preferentially block proliferation of EBV-positive cells. In addition to the immune evasion, integration into the host genome is another strategy employed by HHVs for latent. As mentioned above, stabilization of telomere G-quadruplexes reduces HHV-6A chromosomal integration, suggesting that G4 structures mediate viral latency via interference with viral genome integration.[115] Similarly, G4s in HIV were also reported to modulate HIV latency with two different mechanisms: the HIV-1 LTR promoter is processed by G4 stabilizing (nucleolin)[99] and destabilizing proteins (hnRNP A2/B1),[137] and the LTR promoter has been suggested to be the region where viral latency is regulated,[122] indicating that the G4 switch may play a role in its shift to latency. Additionally, it was reported that latently infected cells incorporated silent HIV-1 provirus, resulting in dysfunctional DNA damage response (DDR). These cells are susceptible to long-term exposure to G-quadruplex stabilizing agents,[123−125] with no affect on HIV-1 promoter activity in cell culture, and this effect is enhanced when the agent is combined with an inhibitor targeting DNA-Pκ, which is crucial for repair of DNA breaks (DSB) and telomere maintenance. These findings suggest the possibility of eliminating HIV latency by targeting telomere maintenance and inhibitors targeting DDR.

THERAPIES TARGETING G-QUADRUPLEXES

As discussed above, the formation and unfolded state of G4 motifs are always in a dynamic balance, such as the potential to form G4s or block G4 structures that are not favorable for normal life processes. However, strategies that take advantage of the diverse regulatory functions of G4s by further enhancing the G4-mediated regulation of cellular processes through chemical biological tools might lead to beneficial regulation of life processes. In this section, we examine and discuss the application of G4 regulation in biomedical studies, focusing on promising techniques for targeting G4s as a therapeutic modality.

Targeting G4 Using Small Molecules

To date, numerous small molecules that are able to bind and stabilize G4s have been developed for G4 detection and regulation. Most of their molecular structures are similar, as planar aromatic rings interact with G-tetrads through π–π stacking and electrostatic interactions. The advantage of using small molecules as therapeutics for diseases is that these compounds target not only pathogenic proteins but also cellular processes affected by the underlying pathology of G4 structures. A set of small molecules with very similar chemical structures that target G4C2 repeat RNA G4 was found to ameliorate the two key pathologies associated with FTD/ALS in Drosophila and mammalian cell models.[138] Treatment with PDS and BRACO-19 also independently targets DNA G4 and RNA G4, resulting in the downregulation of neuronal autophagy and reducing proliferation of adult neural stem cells.[139,140] According to a previous study, oral administration of 5-ALA and TMPyP4 improves cognitive dysfunction caused by mutations in the G4-binding protein ATRX, which have been observed in a thalassemia intellectual disability X-linked syndrome mouse model and human patients. 5-ALA produces the porphyrins protoporphyrin IX (PpIX) and hemin in cells.[141] Recent studies have shown that PpIX may unfold G4s within CGG repeats in the Fmr1 transcript, resulting in the inhibition of RNA translation and ameliorating aberrant synaptic plasticity and behavior in FXTAS model mice.[54] The stabilization of these G4s with ligands has also been investigated as a potential mechanism for inhibiting viruses (Figure A). Richter et al. focused on the presence and proposed function of G4s in virus genomes and presented the classes of G4 ligands that have been reported to display efficacious antiviral activity, with a special emphasis on the structural and physicochemical properties that characterize the viral G4/G4 ligand interaction.[81,83] For example, in HCV, TMPyP4 stabilizes RNA G4s and inhibits intracellular HCV replication at both RNA and protein levels,[18] further suppressing intracellular HCV levels in an infectious HCV culture system. Several studies have shown that small molecules that selectively target G4s in the viral genome have the potential to remove not only the replicating virus but also the latent virus, therefore preventing the development of virus-induced diseases.[117−119]
Figure 4

Potential techniques for G4-mediated therapeutic target. (A) Small molecules can specifically interact with the G4 structure, reducing transcription and translation efficiency downstream. (B) Oligonucleotide probes can form a complementary strand to G-rich sequences and prevent G4 structure formation. (C) G-rich sequences can be targeted by designed sgRNA, followed by Cas protein cleavage, abolishing G4 structure formation. (D) Regulation of G4-related protein level can influence G4 activity in cellular environment.

Potential techniques for G4-mediated therapeutic target. (A) Small molecules can specifically interact with the G4 structure, reducing transcription and translation efficiency downstream. (B) Oligonucleotide probes can form a complementary strand to G-rich sequences and prevent G4 structure formation. (C) G-rich sequences can be targeted by designed sgRNA, followed by Cas protein cleavage, abolishing G4 structure formation. (D) Regulation of G4-related protein level can influence G4 activity in cellular environment. However, because of the structural similarity of RNA G4s to DNA G4s, selective targeting of RNA G4s remains challenging. To date, few selective RNA G4 ligands have been developed. Because of the conformational similarity among different G4s and the complexity of cells, complete target selectivity is difficult to achieve. In addition, small molecules have limited interaction sites, and the use of these compounds in clinical trials is far from being adopted because of their low selectivity profile and poor drug-like properties. Hence, the ability of G4 ligands to discriminate single G4 targets remains challenging in complicated cellular environments.

Targeting G4s with Oligonucleotides

Obviously, G4s are oligonucleotide-based structures, and the high specificity of base-pair recognition can be exploited as a targeting method. In contrast to quartets or tetrads, which are the “fixed” elements of the quadruplex, the nucleobase of each G4 can be quite distinct. Antisense oligonucleotides (ASOs) targeting G4 have been tested in different studies to overcome the issues with the selectivity of small molecule-based therapeutics.[142,143] Nakatani et al. reported guanine-tethered antisense oligonucleotides (g-ASs) that consisted of two functionally independent domains: one was an antisense domain that bound to the target RNA at its binding site, similar to conventional antisense DNAs, and the other was a contiguous guanine run at the 5′-end of the antisense domain. When the DNA probe hybridizes to the target, the G-rich tail composed of only three G-runs is positioned in the target to form an RNA-DNA hybrid quadruplex structure together with a guanine-rich region in the target RNA. This oligonucleotide rearrangement procedure in practice effectively inhibits reverse transcription of a variety of RNA sequences, including the HIV-1 RNA genome.[144] The remarkable ability of guanine-tethered antisense oligonucleotides to inhibit reverse transcription, translation, and replication, might facilitate the development of novel antiretroviral gene therapies based on blocking the gene replication and expression. In contrast, an alternative approach targeting the sequence-specific disruption of a G4 was established using cytosine-rich (C-rich) nucleotide probes or and their analogues that designed to hybridize to the G-rich sequence (Figure B). Targeting EBNA1 mRNA with an antisense oligonucleotide complementary to the EBNA1 mRNA G4 motif destabilizes the G4 structures, stimulates EBNA1 synthesis, and enhances antigen presentation.[117] γPNA (peptide nucleic acid) oligomers, which are analogues of oligonucleotides, were reported to show high-affinity invasion to the PQS derived from the WNV NS5 gene and were previously predicted to fold into a two-tetrad RNA G-quadruplex structure.[145] After treatment of FTD/ALS cells in vitro[146,147] and in vivo with ASOs,[148] a reduction in RNA foci was observed. In addition, treatment with ASOs also led to a decrease in DPRs.[148,149] G4-based aptamers have been identified against a wide variety of protein targets. The high affinity and selectivity for the target make aptamers very useful in many applications. Prion diseases are neurodegenerative disorders in mammals attributed to the accumulation of soluble normal cellular prion protein in the brain.[150] Researchers discovered an RNA aptamer targeting bovine prion protein with a sequence of r(GGAGGAGGAGGA) (R12). It folds into a dimeric G4 structure and exhibits antiprion activity in mouse neurons by reducing the abnormal protein level.[151] Interestingly, DNA aptamers composed of two distinct classes of G4s and non-G4s forming sequences against the SARS-CoV helicase were isolated using SELEX.[152,153] While the possibility of nucleic acid therapeutics has been a promising idea for the treatment of neurological disorders, the underlying weaknesses, such as the delivery efficiency and maintenance of biostability, increase the difficulty of achieving therapeutic success.

Targeting G4 Using the CRISPR-Cas System

The CRISPR (clustered regularly interspaced short palindromic repeats)/Cas (CRISPR-associated protein) bacterial immune system has revolutionized the field of molecular biology and generated excitement for new and improved gene therapies. The simplicity and flexibility of the site-specific system has led to its widespread use in many biological research fields, including the development of model cell lines, discovery of mechanisms of disease, identification of disease targets, and transcriptional modulation.[154] As G4 possesses a consensus sequence, targeting G4 by the CRISPR-Cas system with the designed sgRNA is a reasonable approach (Figure C). Hence, researchers developed a CRISPR-Cas12a-based biosensor for the detection of SARS-CoV-2, as Cas12a has collateral cleavage sites in nontarget ssDNA,[155] and the authors found that the activated Lachnospiraceae bacterium ND2006 Cas12a (LbCas12a) also has trans-cleavage activity toward DNA G4s. Moreover, different cation concentrations resulted in efficient cleavage of G4, which was strongly related to its stability.[156] Another study investigated the structural characteristics of the complex formed by CRISPR–Cas9 and target DNA. The stability of G4 and the position of G4 in the target/nontarget strand both contributed to CRISPR–Cas9 activity.[157] As the CRISPR-Cas system has been thoroughly developed and applied in virus diagnostic tests,[158] a comprehensive approach that takes full advantage of this gene editing ability should be developed to target G4 sequences in the cellular environment. Through the elaborate design of sgRNAs with high efficiency for the target and low off-target scores, the CRISPR-Cas system will become a powerful tool for some sequence-based therapeutics for pathogenic diseases. Additionally, the addition of chemical modifications to the CRISPR-Cas system may achieve the bidirectional regulation of G4s, such as editing activated by light or chemical reagents.[159,160]

Targeting G4 Using G4-Related Proteins

The functional role of G4 is based on its efficient recognition elements for several related proteins in the cellular environment. G4 unfolding by helicase enzymes has been reported since the 1900s.[161] Interestingly, the potential of a coronavirus therapy using G4s as targets could also be achieved using molecules or aptamers that block the activity of RNA helicases (Figure D).[162] The nsp13 helicase in SARS-CoV unwinds the 3′- end of the nascent RNA primer, thus regulating virus replication.[163,164] The adamantane-derived bananins exhibit activity as noncompetitive inhibitors of nsp13 helicase, probably through an allosteric mechanism.[165] A novel small-molecule inhibitor was reported to block the activity of SARS coronavirus helicase by inhibiting ATP hydrolysis and dsDNA unwinding activities, without any cytotoxicity up to 80 μM.[166] In addition, researchers found that the G4 helicase DHX36 has a high affinity for G4C2 repeat RNA and efficiently unwinds the G4 structure; knockdown of DHX36 significantly decreases the levels of DPR proteins. The observation of aberrant DHX36 upregulation in tissues from patients with C9orf72-linked ALS indicates that DHX36 is a positive regulator of C9orf72 repeat-associated RNA translation.[167] Conversely, another RNA helicase, DDX3X, was found to be a negative regulator of C9orf72 repeat-associated RNA translation, and a reduction in DDX3X expression increased DPR levels in cells from patients with ALS/FTD and enhanced G4C2-mediated toxicity in Drosophila.[168] While DDX3X recognizes the same hairpin in the G4C2 repeat, the different effects of various secondary structures remain controversial. As G4-binding proteins are important for regulating function, the identification of new G4-related proteins involved in these diseases will improve diagnosis and therapy. Although many proteins identified in neuronal and viral diseases tend to interact with G4s in 5′- and 3′-UTRs, a recent study identified 15 proteins whose binding sites are enriched for PQS in two cell lines, several of which have been reported to be involved in viral gene expression.[169] Further studies of these proteins will reveal the possible roles of G4s in the mechanism for coopting host cell machinery.

CONCLUSIONS AND OUTLOOK

In the past few years, G4s have attracted increasing attention, as they represent some of the most important nucleic acid secondary structures with extensive regulatory functions in cells. This widespread utilization of G4s provides an opportunity for these structures to contribute to the molecular pathology of diseases, including neurodegenerative diseases and viral pathogenesis. The common regulatory mechanisms described in this Perspective highlight several factors involved in G4s based on their structural composition, localization, and binding proteins. On the basis of the different regulatory roles of G4, we can develop different techniques for effective targeted therapies. The flat G-tetrads of G4s enable structure-based recognition by small molecules. G4-based drugs may represent a significant turning point in the management of viral infections, especially in people who cannot access immunization, such as immunocompromised patients or elderly people. Although a number of G4 ligands have been developed, few derivatives are bioavailable and can be further utilized in clinical trials. As G4s are present in both cellular and virus genomes, one challenge is developing specific G4 ligands for pathogenic structures that do not recognize cellular G4s. Another challenge is validating the mechanism of G4-involved pathological processes. Once the effect of G4s is determined, the time points for antiviral reagent delivery and therapeutic interventions can be further investigated. Moreover, G4 is considered related to i-motif formation, epigenetic modification, and oxidative stress, which are related to the development of several diseases. Therefore, an understanding of their complex and dynamic mechanisms might provide insights into important therapeutic strategies. Our recommendations for future research in these fields are as follows: (i) visualize the formation of G4s, (ii) explore G4 interaction network, (iii) investigate the function of G4s, and (iv) specifically manipulate G4s. First, a prerequisite is to confirm G4 structure formation in critical regions of genes associated with these diseases. Although specific antibodies, in-cell NMR analysis and many fluorescent probes are available to detect G4s in the cellular environment, contrasting observations of the G4 dynamic folded state have been reported in vivo,[21−23] despite different dynamic situations, cell cycles, and populations. PQS with various sequences may form different topological structures in vitro and in vivo, including parallel, antiparallel, hybrid, or interact with each other to form a higher-order structures.[170,171] Therefore, further studies utilizing newer technologies may be needed to unambiguously detect G4s in vivo and in real time. Second, DNA and RNA G4 distribution in cells have been studied by multiple methods, the formation of G4 demonstrated to be related with cellular stress,[28] molecular crowding,[172] and torsional strain,[16] but without general observation about which conditions do influence G4s formation in cells. Moreover, the key protein regulators that mediate G4 formation and depletion remain unknown. There might be interactions between G4s and other epigenetic features to form this complex regulation system. These crucial questions will help us to get a new insight for G4 biological roles. Third, based on a model in which structure determines function and function influences phenotype, DNA/RNA binding proteins either modulate the G4 conformation or serve to recruit additional protein regulators that block the processing machinery. Few efficient methods are available to identify G4-related proteins in a high-throughput manner, as most pull-down assays are performed in vitro based on G4 sequences.[173,174] We tried to perform G4-related proteomics in vivo on the basis of the G4 ligand,[175] but further study is needed to reveal the G4-protein recognition model and protein interaction network. Additionally, further study is needed to reveal the G4-protein recognition model and protein interaction network. A better understanding of the G4 binding protein will help define the regulatory role of G4 and elucidate the mechanism. Similarly, therapeutic approaches developed to manipulate the activities of bona fide G4-binding proteins will be promising. Fourth, manipulation of G4 formation in pathogenetic process deemed to be effective for therapeutic interventions, while the potential problems are target specificity and biocompatibility. Moreover, it is unclear whether the compounds actually affect the structure or just impair their detection. Unlike acidic extracellular pH and the limitless replication potential of tumor cells, there are fewer differences between diseased neurons and normal cells for molecule recognition. Furthermore, viral infection will trigger a robust and usually well-coordinated immune response including several pathways, which get weakened during cancer treatment. Developing a prodrug with low off-target possibilities and high delivery efficiency to the target region will meet therapeutics needed for advancement of this area.
  170 in total

1.  Electrospray mass spectrometry of telomeric RNA (TERRA) reveals the formation of stable multimeric G-quadruplex structures.

Authors:  Gavin W Collie; Gary N Parkinson; Stephen Neidle; Frédéric Rosu; Edwin De Pauw; Valérie Gabelica
Journal:  J Am Chem Soc       Date:  2010-07-14       Impact factor: 15.419

2.  A hexanucleotide repeat expansion in C9ORF72 is the cause of chromosome 9p21-linked ALS-FTD.

Authors:  Alan E Renton; Elisa Majounie; Adrian Waite; Javier Simón-Sánchez; Sara Rollinson; J Raphael Gibbs; Jennifer C Schymick; Hannu Laaksovirta; John C van Swieten; Liisa Myllykangas; Hannu Kalimo; Anders Paetau; Yevgeniya Abramzon; Anne M Remes; Alice Kaganovich; Sonja W Scholz; Jamie Duckworth; Jinhui Ding; Daniel W Harmer; Dena G Hernandez; Janel O Johnson; Kin Mok; Mina Ryten; Danyah Trabzuni; Rita J Guerreiro; Richard W Orrell; James Neal; Alex Murray; Justin Pearson; Iris E Jansen; David Sondervan; Harro Seelaar; Derek Blake; Kate Young; Nicola Halliwell; Janis Bennion Callister; Greg Toulson; Anna Richardson; Alex Gerhard; Julie Snowden; David Mann; David Neary; Michael A Nalls; Terhi Peuralinna; Lilja Jansson; Veli-Matti Isoviita; Anna-Lotta Kaivorinne; Maarit Hölttä-Vuori; Elina Ikonen; Raimo Sulkava; Michael Benatar; Joanne Wuu; Adriano Chiò; Gabriella Restagno; Giuseppe Borghero; Mario Sabatelli; David Heckerman; Ekaterina Rogaeva; Lorne Zinman; Jeffrey D Rothstein; Michael Sendtner; Carsten Drepper; Evan E Eichler; Can Alkan; Ziedulla Abdullaev; Svetlana D Pack; Amalia Dutra; Evgenia Pak; John Hardy; Andrew Singleton; Nigel M Williams; Peter Heutink; Stuart Pickering-Brown; Huw R Morris; Pentti J Tienari; Bryan J Traynor
Journal:  Neuron       Date:  2011-09-21       Impact factor: 17.173

3.  Constitutive translation of human α-synuclein is mediated by the 5'-untranslated region.

Authors:  Pelagia Koukouraki; Epaminondas Doxakis
Journal:  Open Biol       Date:  2016-04-20       Impact factor: 6.411

4.  CRISPR-Cas12a has both cis- and trans-cleavage activities on single-stranded DNA.

Authors:  Shi-Yuan Li; Qiu-Xiang Cheng; Jia-Kun Liu; Xiao-Qun Nie; Guo-Ping Zhao; Jin Wang
Journal:  Cell Res       Date:  2018-03-12       Impact factor: 25.617

5.  R-loop induced G-quadruplex in non-template promotes transcription by successive R-loop formation.

Authors:  Chun-Ying Lee; Christina McNerney; Kevin Ma; Walter Zhao; Ashley Wang; Sua Myong
Journal:  Nat Commun       Date:  2020-07-07       Impact factor: 14.919

6.  CNBP Binds and Unfolds In Vitro G-Quadruplexes Formed in the SARS-CoV-2 Positive and Negative Genome Strands.

Authors:  Georgina Bezzi; Ernesto J Piga; Andrés Binolfi; Pablo Armas
Journal:  Int J Mol Sci       Date:  2021-03-05       Impact factor: 5.923

7.  Identification and characterization of RNA guanine-quadruplex binding proteins.

Authors:  Annekathrin von Hacht; Oliver Seifert; Marcus Menger; Tatjana Schütze; Amit Arora; Zoltán Konthur; Peter Neubauer; Anke Wagner; Christoph Weise; Jens Kurreck
Journal:  Nucleic Acids Res       Date:  2014-04-25       Impact factor: 16.971

8.  U3 region in the HIV-1 genome adopts a G-quadruplex structure in its RNA and DNA sequence.

Authors:  Dorota Piekna-Przybylska; Mark A Sullivan; Gaurav Sharma; Robert A Bambara
Journal:  Biochemistry       Date:  2014-04-15       Impact factor: 3.162

9.  Differential inhibitory activities and stabilisation of DNA aptamers against the SARS coronavirus helicase.

Authors:  Ka To Shum; Julian A Tanner
Journal:  Chembiochem       Date:  2008-12-15       Impact factor: 3.164

10.  FMRP promotes RNA localization to neuronal projections through interactions between its RGG domain and G-quadruplex RNA sequences.

Authors:  Raeann Goering; Laura I Hudish; Bryan B Guzman; Nisha Raj; Gary J Bassell; Holger A Russ; Daniel Dominguez; J Matthew Taliaferro
Journal:  Elife       Date:  2020-06-08       Impact factor: 8.140

View more
  3 in total

1.  Characterization of G-Quadruplexes in Enterovirus A71 Genome and Their Interaction with G-Quadruplex Ligands.

Authors:  Lu Lv; Leiliang Zhang
Journal:  Microbiol Spectr       Date:  2022-04-21

2.  Identification of G-quadruplex DNA sequences in SARS-CoV2.

Authors:  Amit K Maiti
Journal:  Immunogenetics       Date:  2022-03-18       Impact factor: 3.330

Review 3.  Acute Cerebellar Inflammation and Related Ataxia: Mechanisms and Pathophysiology.

Authors:  Md Sorwer Alam Parvez; Gen Ohtsuki
Journal:  Brain Sci       Date:  2022-03-10
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.