| Literature DB >> 35628656 |
Dorota Magner1, Rafal Nowak1, Elzbieta Lenartowicz Onyekaa1, Anna Pasternak1, Ryszard Kierzek1.
Abstract
Among types of trinucleotide repeats, there is some disproportion in the frequency of their occurrence in the human exome. This research presents new data describing the folding and thermodynamic stability of short, tandem RNA repeats of 23 types, focusing on the rare, yet poorly analyzed ones. UV-melting experiments included the presence of PEG or potassium and magnesium ions to determine their effect on the stability of RNA repeats structures. Rare repeats predominantly stayed single-stranded but had the potential for base pairing with other partially complementary repeat tracts. A coexistence of suitably complementary repeat types in a single RNA creates opportunities for interaction in the context of the secondary structure of RNA. We searched the human transcriptome for model RNAs in which different, particularly rare trinucleotide repeats coexist and selected the GABRA4 and CHIC1 RNAs to study intramolecular interactions between the repeat tracts that they contain. In vitro secondary structure probing results showed that the UAA and UUG repeat tracts, present in GABRA4 3' UTR, form a double helix, which separates one of its structural domains. For the RNA CHIC1 ORF fragment containing four short AGG repeat tracts and the CGU tract, we proved the formation of quadruplexes that blocked reverse transcription.Entities:
Keywords: CHIC1 mRNA quadruplexes; GABRA4 3′UTR secondary structure; RNA repeats; RNA thermodynamics; RNA trinucleotide repeats interaction; RNA trinucleotide repeats secondary structure; UAA-UUG RNA duplexes
Mesh:
Substances:
Year: 2022 PMID: 35628656 PMCID: PMC9144543 DOI: 10.3390/ijms23105850
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
RNA repeats potential for folding into different secondary structures.
| Unstructured | Duplex | Quadruplex |
|---|---|---|
| A(UUA)3U | A(CAA)3C | G(AGG)3A |
| G(AAG)3A | A(CGA)3C | G(UGG)3U |
| G(UUG)3U | U(GCU)3G | |
| A(GAA)3G | U(CGU)3C | |
| A(UAA)3U | C(GCC)3G | |
| U(CUU)3C | G(GCG)3G | |
| U(CCU)3C | A(GCA)3G | |
| A(CCA)3C | G(CCUG)3C | |
| A(CUA)3C | ||
| U(CAU)3C | ||
| G(UAG)3U | ||
| G(AUG)3A | ||
| U(AUUCU)3A |
Figure 1Thermodynamic stability of self-duplexing RNA trinucleotide repeats.
Figure 2Thermodynamic stability of inter-strand duplexes of non-self-duplexing RNA repeats.
Figure 3Thermodynamic stability of intermolecular duplexes formed by G(AGG)3A repeats (A) and G(GCG)3G repeats (B).
Top results of megaBLAST search for RNAs with multiple TNR tracts in human transcriptome. Records marked in red were chosen for detailed analysis of RNA secondary structure to follow trinucleotide repeat tracts interactions. Records marked with an asterix contain discontinuous tracts of repeats.
| Gene ID | Variant | Repeat Type | Repeats Number | Transcript Region | Region Lenght | Distance between Repeats | Codon Type | |
|---|---|---|---|---|---|---|---|---|
|
|
| NM_000809 | UUG | 10 | 3′UTR | 9325 | 318 | - |
| UAA | 14 | 3′UTR | - | |||||
| ALG13 | NM_001099922 | CCA | 14 | ORF | 3414 | Pro | ||
| CCU | 13 | ORF | 0 | Pro | ||||
| HRC | NM_002152 | AGG | 8 | ORF | 2100 | Asp | ||
| AUG | 14 | ORF | 138 | Glu | ||||
| ARID3A | NM_005224 | UGG | 10 | 5′UTR | 292 | |||
| AGG | 6 | ORF | 1782 | 434 | Glu | |||
|
| ZFHX3 | NM_001164766 | GCC | 10 | 5′UTR | 515 | ||
| AAC | 9 | ORF | 8370 | 2642 | ||||
| GCA | 7 | ORF | 4369 | Gly | ||||
| GCG | 7 | ORF | 947 | |||||
| MAFA | NM_201589 | GCG | 5 | ORF | 1062 | His | ||
| CCA | 10 | ORF | 352 | Gly | ||||
| ATXN8OS | NR_002717 | CUA | 10 | ncRNA | 1472 | |||
| GCU | 14 | 0 | ||||||
| RPS6KA6 | NM_014496 | GCG | 11 | 5′UTR | 300 | |||
| CUA | 12 | 3′UTR | 5927 | 3637 | ||||
| AR | NM_000044 | GCA | 23 | ORF | 2763 | Gln | ||
| GCG | 17 | ORF | 1130 | Gly | ||||
| VEZF1 | NM_001330393 | GCC | 6 | 5′UTR | 382 | |||
| GCA | 13 | ORF | 1539 | 1298 | Gln | |||
| HTT | NM_002111 | GCA | 21 | ORF | 9435 | Gln | ||
| GCC | 7 | ORF | 3 | Pro | ||||
| AAK1 | NM_014911 | GCA | 6 | ORF | 2886 | Gln | ||
| GUU | 10 | 3′UTR | 17,872 | 12,815 | ||||
| POU4F2 | NM_004575 | GCG | 11 | ORF | 1230 | Gly | ||
| CCA | 6 | ORF | 331 | His | ||||
| HMGB3 | NM_001301228 | GCC | 17 | 5′UTR | 262 | |||
| AGG | 7 | ORF | 603 | 663 | Glu | |||
| ** SKIDA1 | NM_207371 | GCC | 10, 8 | ORF | 2727 | Ala | ||
| GCG | 8, 4 | ORF | 174 | Ala | ||||
| CCA | 7, 6 | ORF | 65 | His | ||||
| AGG | 5, 4 | ORF | 188 | Glu | ||||
| ** ZSWIM6 | NM_020928 | GCC | 10, 4 | ORF | 3648 | Ala | ||
| GCG | 6, 6, 6, 6 | ORF | 44 | Gly | ||||
| ** KCNMA1 | NM_001014797 | GCG | 6, 5, 7 | ORF | 3549 | Gly | ||
| CCU | 7 | ORF | 60 | Ser | ||||
| CUU | 4 | ORF | 0 | Ser | ||||
|
| NM_001039840 | CGU | 4, 5 | ORF | 675 | Ser | ||
| AGG | 4, 4, 3 | ORF | 23 | Glu |
**: The repeat tracts are discontinuous.
Figure 4RNA repeats coexistence in a single transcript. The values on the crossing of particular repeats type row and column represent the number of RNAs found with the four (yellow–green scale) and the six (yellow–red scale) repeat number.
Figure 5Experimental secondary structure of GABRA4 3′UTR fragment, including normalized nucleotide reactivity’s and base-pairing probability.
Figure 6Analysis of potential miRNA target site regions within the GABRA4 3′UTR fragment. The sites of interactions significantly overlap with the reactive sites determined by chemical structure probing.
Figure 7(A) Structure prediction for CHIC1 ORF RNA fragment (RNAstructure). Putative quadruplexes are marked black, and CGU repeats are marked green; red—single stranded regions, blue—duplex regions. (B) Native 8% polyacrylamide gel stained with NMM, showing two CHIC1 isoforms containing quadruplex structures (II). Control RNA (I) of similar length is G4-free and therefore invisible.