| Literature DB >> 27324275 |
Rohini Garg1, Jyoti Aggarwal1, Bijal Thakkar1.
Abstract
DNA, in addition to the canonical B-form, can acquire a variety of alternate structures, such as G-quadruplexes. These structures have been implicated in several cellular processes in animals. In this study, we identified different types of G-quadruplex forming sequences (GQSes) in 15 sequenced plants and analyzed their distribution in various genomic features, including gene body, coding, intergenic and promoter regions. G2-type GQSes were most abundant in all the plant species analyzed. A strong association of G3-type GQSes with intergenic, promoter and intronic regions was found. However, G2-type GQSes were enriched in genic, CDS, exonic and untranslated regions. Further, we identified GQSes present in the conserved genes among monocots and dicots. The genes involved in development, cell growth and size, transmembrane transporter, and regulation of gene expression were found to be significantly enriched. In the promoter region, we detected strong co-occurrence of Telobox, ERF, MYB, RAV1B and E2F motifs with GQSes. Further, we validated the structure formation of several plant GQSes, demonstrated their effect on stalling in-vitro replication and revealed their interaction with plant nuclear proteins. Our data provide insights into the prevalence of GQSes in plants, establish their association with different genomic features and functional relevance.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27324275 PMCID: PMC4914980 DOI: 10.1038/srep28211
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Number of putative G-quadruplex motifs identified in selected plant species.
| Plant species | G2L1 | G2L1-2 | G2L1-4 | G3L1-3 | G3L1-7 |
|---|---|---|---|---|---|
| 31138 | 50368 | 140903 | 3524 | 8906 | |
| 20755 | 40814 | 135897 | 3990 | 8106 | |
| 31856 | 66476 | 240084 | 1985 | 7892 | |
| 11487 | 25564 | 105861 | 996 | 4556 | |
| 64877 | 112705 | 390940 | 10761 | 20189 | |
| 7518 | 11661 | 42220 | 260 | 1219 | |
| 15310 | 25512 | 95785 | 658 | 2785 | |
| 134993 | 275171 | 862476 | 11709 | 42066 | |
| 149073 | 280208 | 743556 | 12698 | 40937 | |
| 185375 | 309779 | 730718 | 13437 | 40759 | |
| 89189 | 165547 | 475214 | 8437 | 26044 | |
| 31223 | 54234 | 166629 | 5997 | 12138 | |
| 35998 | 61982 | 224849 | 3381 | 10019 | |
| 32735 | 67757 | 243632 | 8722 | 17444 | |
| 27334 | 52095 | 167013 | 4001 | 9025 |
Figure 1Frequency and distribution of various GQSes in the plant genomes.
(a) The bar graph displays the frequency (per Mb) of GQS motifs (G3L1-7, G3L1-3, G2L1-4, G2L1-2 and G2L1) in different plant genomes. Blue boxes represent classification of species as depicted on left side. (b) Box-plots showing the enrichment of different GQS motifs (G3L1-7, G3L1-3, G2L1-4, G2L1-2 and G2L1) within different genomic features (various gene components and intergenic regions) across all plant species.
Figure 2Gene ontology (GO) enrichment of orthologous genes harboring putative GQSes in genic region of all dicot species (a) and monocot species (b).
The genes were analyzed using BiNGO and the terms showing significant enrichment are shown. Significantly enriched GO categories in genes are shown. Node size is proportional to the number of genes in each category and colors shade represent significance level (white - no significant difference; color scale, yellow - P-value = 0.05, orange - P-value < 0.0000005). (c) GO enrichment of orthologous genes harboring GQSes in 1 kb promoter region of monocot species.
Figure 3Biological processes conserved among orthologous genes harboring GQSes in monocots and dicots.
Orthologous genes harbouring GQSes were analyzed for gene ontology (GO) enrichment using Cytoscape (p ≤ 0.005). Nodes highly enriched in monocot orthologous genes are shown in blue edges while those in dicots are shown in green edges. Node size represents number of genes. Colour of the node and border corresponds to the significance based on the p-value of the gene set. Edge thickness represents the degree of overlap between two gene-sets. Nodes were grouped according to GO definition. The clusters with both green and blue edges are highlighted and annotated with the group names.
Figure 4Transcription factor (TF) binding sites identified in GQSes present in promoters of orthologous genes.
(a) Left panel: G3L1-7 sequences in three promoter associated GQSes of orthologous genes conserved across all monocot species with annotation of rice genes. Os, Oryza sativa; Si, Setaria italica, Sb, Sorghum bicolor, Bd, Brachypodium distachyon. Middle panel: Logos showing the TF binding sites in the three OsG3L1-7. Right panel: CD spectroscopy of the rice GQSes. (b) TF binding sites (red circles) enriched within 1 kb promoters of Arabidopsis genes harboring GQSes (G3: G3L1-7, G3L1-3; G2: G2L1-4, G2L1-2 and G2L1; blue circles) or within the GQS motifs present in 1 kb promoters in Arabidopsis (AtG2/AtG3; blue circles).
Oligo sequences used for GQS validation.
| Oligo ID | Sequence (5′-3′) | Molecularity | Structural characteristics |
|---|---|---|---|
| Os3 | GGGAAGAGGGAAGAGGGGAAGAGGGG | Intramolecular (KCl) | (3+1) and parallel (KCl): Positive peak at 290 and 260 nm |
| Os4 | GGGGGTTGGGGGAGGGTGGGGAAAGTCGGGG | Intramolecular (NaCl) | Anti-parallel (NaCl): Positive peak at 295 nm and negative peak 260 nm |
| Os6 | GGGAGGAGGGAGAAGGGTGGG | Intermolecular (KCl, NaCl): bimolecular and tetramolecular) | Parallel (NaCl): Positive peak at 260 nm (3+1) and parallel (KCl): Positive peak at 290 and 260 nm |
| Os9 | GGGCGCGAGGGAGGAGGGCGCGGG | Intramolecular (KCl, NaCl) | (3+1) and parallel (KCl, NaCl): Positive peak at 290 and 260 nm |
| Os11 | GGGACACGGGGGAGAACTTGGGCATGGGGAGGGTGGGCAGGG | Intramolecular (KCl) | Parallel (KCl): Positive peak at 260 nm |
| Os5 | GGGTCCTAGGGGTGGGGTGGGAAGGGTGGGAGGGGAAGGGGGAGGAGGGA | Intramolecular (KCl) | Parallel (KCl): Positive peak at 260 nm |
| Os7 | GGGTGTGGGGAGGGTGGGG | Intramolecular (KCl) | Parallel (KCl): Positive peak at 260 nm |
| Os10 | GGGGAGGAGGGAGGGTGGGTAGGGGGGGGAGGG | Intramolecular (KCl) | Parallel (KCl): Positive peak at 260 nm |
| Os13 | GGGGAGAGGGAGAAGGGGGAGGAGAAGGGAGAAGGGAGGG | Intramolecular (KCl, NaCl) | Parallel (KCl): Positive peak at 260 nm |
| Os18 | GGGGGGAGGGTGGGGAGTAGGG | Intramolecular (KCl, NaCl) | Parallel (KCl): Positive peak at 260 nm (3+1) and parallel (NaCl): Positive peak at 290 and 260 nm |
| At3 | GGGTGGCGGGAAAATTGGGGACTTAGGG | Intramolecular (KCl) | Parallel (KCl): Positive peak at 260 nm (3+1) and parallel (NaCl): Positive peak at 290 and 260 nm |
| At8 | GGGACGGGTTTGGCGGGACGGG | Intramolecular (NaCl) | (3+1) and parallel (NaCl): Positive peak at 290 and 260 nm |
| At12 | GGGTTGGTTGGATGG | Intermolecular (NaCl) | (3+1) and parallel (KCl, NaCl): Positive peak at 290 and 260 nm |
| At15 | GGTTTGGTTAGGGAGGG | Intramolecular (KCl) Intermolecular (NaCl) | (3+1) and parallel (KCl): Positive peak at 290 and 260 nm Anti-parallel (NaCl): Positive peak at 295 nm and negative peak 260 nm |
| At17 | GGTGGCGTGGCGG | Intramolecular (KCl) Intermolecular (NaCl) | (3+1) and parallel (KCl): Positive peak at 290 and 260 nm Anti-parallel (NaCl): Positive peak at 295 nm and negative peak 260 nm |
| Ca1 | GGGAGAAGGGAGAAGGGAGAAGGGAGAAGGG | Intramolecular (KCl, NaCl) | Parallel (KCl, NaCl): Positive peak at 260 nm |
| Ca2 | GGGGTGGGTGGGTAAGGTGGGG | Intermolecular (KCl, NaCl) | Parallel (KCl): Positive peak at 260 nm (3+1) and parallel (NaCl): Positive peak at 290 and 260 nm |
| Gm1 | GGGAGAAGGGAGAAGGGATGGGGTGGG | Intramolecular (KCl, NaCl) | Parallel (KCl, NaCl): Positive peak at 260 nm |
| Gm2 | GGGAGAAGGGAGAAGGGAAGGG | Intermolecular (KCl, NaCl) | Parallel (KCl): Positive peak at 260 nm (3+1) and parallel (NaCl): Positive peak at 290 and 260 nm |
| Gm4 | GGGTGGGGTTGGGAAGGTGGGAGGAGAGGGTGAGGG | Intramolecular (KCl) Intermolecular (NaCl) | Parallel (KCl): Positive peak at 260 nm Parallel (NaCl): Positive peak at 260 nm |
Os: Oryza sativa; At: Arabidopsis thaliana; Gm: Glycine max; Ca: Cicer arietinum.
Figure 5Validation of GQS structure and conformation.
Native gel electrophoresis (left panel) and CD spectra (right panel) of selected putative GQS forming oligos in presence of various cations. Li-150 mM LiCl, K-150 mM KCl, Na- 150 mM NaCl. X-axis of CD-spectra depicts wavelength (nm) and Y-axis depicts Ellipticity. Os: Oryza sativa; At: Arabidopsis thaliana; Gm: Glycine max; Ca: Cicer arietinum. Oligo ID are given on top right side of CD spectra. All the gels were run under the same experimental conditions and presented by using cropped images. Stars represent shift in mobility of oligos in particular lane.
Oligo sequences containing GQSes used for polymerase stalling.
| Oligo ID | Sequence (5′-3′) |
|---|---|
| Os3_CT | TCCAACTATGTATACTGAAG |
| Os3_MCT | TCCAACTATGTATACTGAAG |
| Os4_CT | TCCAACTATGTATACTGAAG |
| Os4_MCT | TCCAACTATGTATACTGAAG |
| Os6_CT | TCCAACTATGTATACTGAAG |
| Os6_MCT | TCCAACTATGTATACTGAAG |
| Os9_CT | TCCAACTATGTATACTGAAG |
| Os9_MCT | TCCAACTATGTATACTGAAG |
| Os11_CT | TCCAACTATGTATACTGAAG |
| Os11_MCT | TCCAACTATGTATACTGAAG |
| Bio_Primer | [Biotin]TAATACGACTCACTATAGCAATTGCGTG |
Sequences in bold and underline represent G residues predicted to be involved in GQS formation and their mutated counterparts in MCT oligos. CT: control oligos; MCT: mutated oligos.
Figure 6DNA polymerase stalling by GQS formation.
Taq polymerase stalling assay in presence of LiCl, NaCl, KCl or (NH4)2SO4 with control (CT) vs mutated (MCT) oligos using biotinylated primers (Primer). Amplified products were loaded on 15% denaturating urea-PAGE transferred to biodyne B membrane and blotted with avidin-HRP. Arrowheads indicate full-length product, FLP; truncated products, TP and biotinylated-primer, Primer. All the gels were run under the same experimental conditions and blots are presented by using cropped images.
Figure 7Binding of GQSes to plant nuclear proteins.
Electrophoretic mobility shift assay was performed with biotinylated oligos forming G-quadruplex structures and rice nuclear proteins (5 and 10 μg) in the presence of different ions (NaCl and KCl). Protein-DNA complexes were resolved on 6% native acrylamide gel, transferred to biodyne B membrane and blotted with avidin-HRP. All the gels were run under the same experimental conditions and blots are presented by using cropped images.