| Literature DB >> 24885782 |
Scott Frees1, Camille Menendez, Matt Crum, Paramjeet S Bagga.
Abstract
BACKGROUND: Nucleic acids containing guanine tracts can form quadruplex structures via non-Watson-Crick base pairing. Formation of G-quadruplexes is associated with the regulation of important biological functions such as transcription, genetic instability, DNA repair, DNA replication, epigenetic mechanisms, regulation of translation, and alternative splicing. G-quadruplexes play important roles in human diseases and are being considered as targets for a variety of therapies. Identification of functional G-quadruplexes and the study of their overall distribution in genomes and transcriptomes is an important pursuit. Traditional computational methods map sequence motifs capable of forming G-quadruplexes but have difficulty in distinguishing motifs that occur by chance from ones which fold into G-quadruplexes.Entities:
Mesh:
Year: 2014 PMID: 24885782 PMCID: PMC4017754 DOI: 10.1186/1479-7364-8-8
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Figure 1G-quadruplex structure. Left: an intramolecular G-quadruplex formed by a G3AG3CUG3CG3 RNA motif. Right: a G-tetrad structure.
Figure 2Overlap percentage computation. The extension of sequence length using 50% of the total sequence length as padding inflates overlap scores for proximate sequences.
Figure 3Algorithm stages for SNCA mRNA. Overview of the algorithm stages for QGRS identification (independently) on each mRNA, performing a semi-global alignment on the homologs, and finally evaluating the QGRS pair for conservation.
Figure 4Overlapping QGRS yielding different G-scores. A small sample of hundreds of overlapping QGRS found in the ‘GGATGAGCCTGTGGGTGGAGGGCAGTGGAGCGGGCTGGG’ sequence of human GREB1 gene.
Figure 5Choosing QGRS considered for conservation calculations. The number next to each QGRS motif represents its calculated G-score. This figure demonstrates how it is possible that QGRS with highest G-score do not necessarily yield the highest conservation score.
An overview of human QGRS conservation across seven orthologous exomes
| | | | |
| Entire mRNA | 255453 (75%) | 16731 (68%) | 612 (61%) |
| 5′-UTR | 28398 (65%) | 3512 (58%) | 165 (53%) |
| CDS | 170536 (78%) | 6887 (75%) | 195 (75%) |
| 3′-UTR | 61244 (72%) | 6747 (68%) | 260 (59%) |
| | | | |
| Entire mRNA | 132440 (36%) | 5121 (19%) | 144 (13%) |
| 5′-UTR | 6249 (14%) | 635 (10%) | 25 (7%) |
| CDS | 121915 (52%) | 3744 (38%) | 91 (33%) |
| 3′-UTR | 5996 (7%) | 858 (8%) | 33 (7%) |
| | | | |
| Entire mRNA | 156152 (41%) | 6461 (23%) | 200 (17%) |
| 5′-UTR | 13600 (29%) | 1367 (20%) | 63 (17%) |
| CDS | 128755 (52%) | 3238 (31%) | 75 (25%) |
| 3′-UTR | 16543 (18%) | 2010 (18%) | 67 (12%) |
| | | | |
| Entire mRNA | 29001 (10%) | 198 (1%) | (0%) |
| 5′-UTR | 391 (1%) | 6 (<1%) | (0%) |
| CDS | 28580 (16%) | 191 (3%) | (0%) |
| 3′-UTR | 188 (<1%) | 4 (<1%) | (0%) |
| | | | |
| Entire mRNA | 5970 (6%) | 29 (<1%) | (0%) |
| 5′-UTR | 153 (1%) | 1 (<1%) | (0%) |
| CDS | 5798 (9%) | 26 (1%) | (0%) |
| 3′-UTR | 42 (<1%) | 2 (<1%) | (0%) |
| | | | |
| Entire mRNA | 1543 (2%) | (0%) | (0%) |
| 5′-UTR | 13 (<1%) | (0%) | (0%) |
| CDS | 1534 (4%) | (0%) | (0%) |
| 3′-UTR | 5 (<1%) | (0%) | (0%) |
| | | | |
| Entire mRNA | 444 (2%) | (0%) | (0%) |
| 5′-UTR | 1 (<1%) | (0%) | (0%) |
| CDS | 443 (3%) | (0%) | (0%) |
| 3′-UTR | (0%) | (0%) | (0%) |
Human QGRS are conserved in untranslated as well as translated regions of orthologous mRNAs. The conservation is more prevalent in mammalian orthologs. Values in parentheses in the first column represent the number of human mRNAs with an established ortholog in the given species. Values under ‘QGRS distribution’ represent data for QGRS types with two tetrads, three tetrads, or four and higher number of tetrads. For each tetrad column, the first number represents the number of QGRS in H. sapiens mRNA conserved within the given species exome. Data for only QGRS with conservation score ≥0.95 are presented. The second number in parenthesis is the percentage of H. sapiens QGRS conserved in the given species.
Mapping QGRS orthologs of previously published human G-quadruplexes
| LRP5 | Receptor-mediated endocytosis | NM_002335.2 | 3′-UTR (+136) | GGGGTGGGCAGGGCTGGG | Beaudoin et al. 2013 [ | NM_008513.3 | 86 | 1.00 |
| PIM1 | Cancer and apoptosis | NM_002648.3 | 3′-UTR (+279) | GGGGTGGGGGGTGGGGGTGGG | Arora and Suess 2011 [ | NM_008842.3 | 52 | 1.00 |
| IGF2 | Wilms' tumor | NM_001127598.1 | 3′-UTR (+2194) | GGGGTGGGTGGGGGGCAGTGGGGGCTGGGCGGGG | Christiansen et al. 1994 [ | NM_010514.3 | 56 | 1.00 |
| PSD95 | Synaptic junction formation | NM_001365.3 | 3′-UTR (+685) | GGGAGGGAGGGTGGG | Subramanian et al. 2011 [ | NM_001109752.1 | 64 | 1.00 |
| KISS1 | Metastasis suppression | NM_002256.3 | 3′-UTR (+53) | GGGGCGGGGGCGGGGGGCGGGGACGTAGGGCTAAGGGAGGGG | Huijbregts et al. 2012 [ | NM_178260.3 | 49 | 0.706b |
| NRAS | Rectal cancer | NM_002524.4 | 5′-UTR (-240) | GGGAGGGGCGGGTCTGGG | Kumari et al. 2007 [ | NM_010937.2 | 64 | 1.00c |
| ADAM10 | Anti-amyloidogenic activity | NM_001110.2 | 5′-UTR (-78) | GGGGACGGGTATGGGCGGG | Lammich et al. 2011 [ | NM_007399.3 | 71 | 1.00c |
| MECP2 | Rett syndrome/autism | NM_004992.3 | 5′-UTR (-108) | GGAGGAGGAGGAGGCGAGG | Bagga and D′Antonio 2013 [ | NM_001081979.1 | 74 | 1.00 |
| YY1 | Transcription | NM_003403.3 | 5′-UTR (-203) | GGGCGCGGGCGCACCGAGGCGAGGGAGGCGGG | Huang et al. 2011 [ | NM_009537.3 | 89 | 0.944c |
| TERF2 | Telomeric stabilization | NM_005652.2 | 5′-UTR (-20) | GGGAGGGCGGGGAGGG | Gomez et al. 2010 [ | NM_001286200.1 | 65 | 1.00 |
A sampling of human G-quadruplexes reported in the published literature, along with their mouse QGRS orthologs mapped with QGRS-Conserve. (Many of these conserved motifs were also successfully identified in additional mammalian species; data not shown). aValues obtained by semi-global alignments between human and mouse orthologous mRNAs. bRelatively lower conservation score noted due to slight differences in the number of tetrads, loop lengths, and overall lengths of aligned motifs. However, sequence alignment showed high location conservation. cHomologous QGRS mapped upstream of the mouse orthologous mRNA TSS.