| Literature DB >> 32441456 |
Abstract
It has been nearly 63 years since the first characterization of an RNA triple helix in vitro by Gary Felsenfeld, David Davies, and Alexander Rich. An RNA triple helix consists of three strands: A Watson-Crick RNA double helix whose major-groove establishes hydrogen bonds with the so-called "third strand". In the past 15 years, it has been recognized that these major-groove RNA triple helices, like single-stranded and double-stranded RNA, also mediate prominent biological roles inside cells. Thus far, these triple helices are known to mediate catalysis during telomere synthesis and RNA splicing, bind to ligands and ions so that metabolite-sensing riboswitches can regulate gene expression, and provide a clever strategy to protect the 3' end of RNA from degradation. Because RNA triple helices play important roles in biology, there is a renewed interest in better understanding the fundamental properties of RNA triple helices and developing methods for their high-throughput discovery. This review provides an overview of the fundamental biochemical and structural properties of major-groove RNA triple helices, summarizes the structure and function of naturally occurring RNA triple helices, and describes prospective strategies to isolate RNA triple helices as a means to establish the "triplexome". This article is categorized under: RNA Structure and Dynamics > RNA Structure and Dynamics RNA Structure and Dynamics > RNA Structure, Dynamics and Chemistry RNA Structure and Dynamics > Influence of RNA Structure in Biological Systems.Entities:
Keywords: RNA stability element; RNA triple helix; base triples; catalytic triplex; riboswitch; telomerase; triplexome
Year: 2020 PMID: 32441456 PMCID: PMC7583470 DOI: 10.1002/wrna.1598
Source DB: PubMed Journal: Wiley Interdiscip Rev RNA ISSN: 1757-7004 Impact factor: 9.957
FIGURE 1Composition and structural arrangement of major‐groove RNA triple helices. Hydrogen‐bonding interactions (gray dashed lines) are shown for the two canonical major‐groove base triples: (a) U•A‐U and C+•G‐C (ball‐and‐stick representation of base triples from PDB 6SVS; Ruszkowska, Ruszkowski, Hulewicz, Dauter, & Brown, 2020). Interactions are shown along with the Hoogsteen and Watson–Crick faces. (b) Schematics are shown for strand polarity and nucleotide composition in parallel and anti‐parallel motifs. Interbase interactions are represented as a circle (•) for Hoogsteen base pair, a single dash (−) for Watson–Crick base pair, and a square (■) for reverse Hoogsteen base pair. (c) Major‐groove RNA triple helices are commonly found in H‐type or I‐type pseudoknot structures. The three strands of triple helix are identified by the following tricolor scheme: Hoogsteen strand is blue, Watson strand is purple, and Crick strand is green
Summary of three‐dimensional structures solved for natural RNAs containing a major‐groove triple helix
| RNA source | Base triples | Structure | Part of RNP? | References | |
|---|---|---|---|---|---|
| No. | Identity | ||||
|
| |||||
| Human TR | 3 | U•A‐U | NMR (PDB 1YMO, 2K95, 2K96) | Yes, cryo‐EM structure of holoenzyme (EMD‐7521) | Kim et al. ( |
|
|
1 5 |
C•G‐C U•A‐U | NMR (PDB 2M8K) | Yes, but no structure of holoenzyme | Cash et al. ( |
|
|
2 1 |
U•A‐U A•G‐C | NMR (PDB 5KMZ) | Yes, cryo‐EM structure of holoenzyme (EMD‐6443) | Jiang et al. ( |
|
| |||||
|
|
1 1 |
G•G•U C•C‐G | Cryo‐EM (PDB 5G2Y) | Yes, cryo‐EM structure with LtrA (PDB 5G2X) | Qu et al. ( |
|
Group IIB intron |
1 1 1 |
U•A‐U G•G•U A•C‐G | X‐ray crystallography (PDB 6CHR) | Yes, but no structure with maturase | Chan et al. ( |
|
Group IIC intron |
2 1 |
C•C‐G G•G•U | X‐ray crystallography (PDB 3BWP) | Yes, but no structure with maturase | Toor, Keating, Taylor, and Pyle ( |
|
Group IIC intron |
1 1 1 |
A•A‐U G•G•U A•C‐G | Cryo‐EM (PDB 6MEC) | Yes, cryo‐EM structure with maturase | Haack et al. ( |
| Human U2‐U6 snRNAs |
1 1 1 |
A•A‐U G•G‐C U•C‐G | Cryo‐EM (PDB 5MQF) | Yes, cryo‐EM structures of spliceosome | Bertram et al. ( |
|
|
1 1 1 |
A•A‐U G•G‐C U•C‐G | Cryo‐EM (PDB 5GMK) | Yes, cryo‐EM structures of spliceosome | Wan, Yan, Bai, Huang, and Shi ( |
|
|
1 1 1 |
A•A‐U G•G‐C U•C‐G | Cryo‐EM (PDB 3JB9) | Yes, cryo‐EM structures of spliceosome | Hang, Wan, Yan, and Shi ( |
|
| |||||
|
c‐di‐GMP‐II riboswitch |
1 1 2 1 |
U•U‐A G•G‐C A•C‐G c‐di‐GMP•A‐U | X‐ray crystallography (PDB 3Q3Z) | Unknown | Smith, Shanahan, Moore, Simon, and Strobel ( |
|
Guanidine‐III riboswitch |
1 1 2 1 |
GAI•(G•A‐U•C) (A•C‐G)•GAI G•G‐C U•C‐G | X‐ray crystallography (PDB 5NWQ) | Unknown | L. Huang, Wang, Wilson, and Lilley ( |
|
PreQ1‐II riboswitch |
1 2 |
A•U•PreQ1•C U•A‐U | X‐ray crystallography (PDB 4JF2) | Unknown | Liberman, Salim, Krucinska, and Wedekind ( |
|
PreQ1‐II riboswitch |
1 2 |
A•U•PreQ1•C U•A‐U | NMR (PDB 2MIY) | Unknown | Kang, Eichhorn, and Feigon ( |
|
PreQ1‐III riboswitch |
1 2 1 |
A•U•PreQ1•C U•A‐U A•A‐U | X‐ray crystallography (PDB 4RZD) | Unknown | Liberman et al. ( |
|
Sargasso Sea metagenome SAM‐II riboswitch |
1 1 1 1 2 |
A•A•U G•G‐C A•C‐G U•U•ASAM U•A‐U | X‐ray crystallography (PDB 2QWY) | Unknown | Gilbert, Rambo, Van Tyne, and Batey ( |
|
SAM‐V riboswitch |
1 1 1 2 1 |
G•A‐U A•G‐C U•U•ASAM U•A‐U G•U•A | X‐ray crystallography (PDB 6FZ0) | Unknown | L. Huang and Lilley ( |
|
| |||||
| Human MALAT1 |
9 1 |
U•A‐U C•G‐C | X‐ray crystallography (PDB 4PLX) | Yes, but no structure with METTL16 | Brown et al. ( |
| Kaposi's sarcoma‐associated herpesvirus PAN RNA | 5 | U•A‐U | X‐ray crystallography (PDB 3P22) | Unknown | Mitton‐Fry, DeGregorio, Wang, Steitz, and Steitz ( |
Abbreviations: c‐di‐GMP, bis‐(3′‐5′)‐cyclic dimeric guanosine monophosphate; GAI, guanidine; PreQ1, 7‐aminomethyl‐7‐deazaguanine (a precursor of the modified nucleoside, queuosine), and ASAM, S‐adenosylmethionine.
Only a representative PDB ID is provided because there are too many to list. Refer to Galej, Toor, Newman, and Nagai (2018) for a more comprehensive list.
None detected in the presence of HEK293 and HeLa cell lysate using an in vitro gel‐shift assay (Brown et al., 2016b).
FIGURE 2Structural views of triple and double helices. The top and side views are shown for X‐ray crystal structures (cartoon representation) solved for an RNA triple helix, A‐RNA double helix, A′‐RNA double helix, and B‐DNA double helix. Hoogsteen strand is blue, Watson (or purine‐rich) strand is purple, and Crick (or pyrimidine‐rich) strand is green. Select structural parameters are defined by red text and pictures. The PDB IDs for displayed structures are listed below each name
Computed values of various structural parameters
| Parameter | RNA triple helix | A‐RNA double helix | A′‐RNA double helix | B‐DNA double helix |
|---|---|---|---|---|
| Helical sense | Right handed | Right handed | Right handed | Right handed |
| Diameter (Å) | 24 | 22.9 | 24 | 23.9 |
| Helix pitch (Å) | 35.1 | 27.5 | 32.8 | 33.7 |
| Helical rise per base pair or base triple (Å) | 2.9 | 2.6 | 2.9 | 3.4 |
| Helical twist (°) | 29.7 | 33.7 | 31.4 | 36.2 |
| Base pairs per helical turn or base triples per helical turn | 12 | 10.7 | 11.5 | 10 |
| Axial displacement (Å) | −5.0 | −3.9 | −4.9 | 0.3 |
| Major‐groove dimensions | ||||
| Width (Å) | 9.2 (2.3) | 3.1 | 8.4 | 11.4 |
| Depth (Å) | 11.0 | 9.2 | 9.3 | 4.5 |
| Minor‐groove dimensions | ||||
| Width (Å) | 9.0 | 9.9 | 9.7 | 4.9 |
| Depth (Å) | 1.5 | 0.5 | 0.5 | 5.3 |
| Inclination angle (°) | 8.4 | 17.6 | 13.6 | −0.5 |
| Glycosidic bond conformation |
|
|
|
|
| Sugar pucker conformation | C3′ | C3′ | C3′ | C2′ |
Note: Values obtained from Ruszkowska et al. (2020).
All values computed using only the double helix portion of RNA triple helix except for parenthetical 2.3, which is major‐groove width between Hoogsteen and Crick strands.
FIGURE 3Triple helix in telomerase RNA. Telomerase RNA forms a nearly continuous triple helix (tricolor cartoon representation) in which a U•A‐U major‐groove triple helix is adjacent to a minor‐groove triple helix
FIGURE 4Catalytic triplexes involved in splicing. (a) The catalytic triplex of both the group II self‐splicing intron (tricolor stick representation) and the spliceosome (not shown) coordinate two catalytic metal ions (orange spheres) via electrostatic interactions with phosphate backbone. Two views are shown. (b) His5 of CWC2P (orange stick representation) may form hydrogen bonds (gray dashed lines) with two of the three base triples (tricolor sticks) in the catalytic triplex observed for the S. cerevisiae spliceosome
FIGURE 5Structural basis of ligand recognition by an RNA triple helix. The 3D structures are shown for six ligand‐bound riboswitches: (a) SAM‐II, SAM‐V, (b) PreQ1‐II, PreQ1‐III, (c) c‐di‐GMP‐II, and (d) guanidine‐III. For each riboswitch, there is a panel that displays the entire experimentally determined structure with PDB in parenthesis (left panel, cartoon representation), the triple helix interacting with ligand (middle panel, cartoon representation), and noncovalent interactions of ligand and nucleotides (right panel, ball‐and‐stick representation). The ligand is shown in orange and triple helix components are shown in tricolor scheme
FIGURE 6Structures of triple helices that function as RNA stability elements. (a) A generalized schematic diagram is shown for a single‐domain stability element alongside the X‐ray crystal structure of the KSHV PAN RNA triple helix (cartoon representation). Important structural regions are labeled. (b) A generalized schematic diagram is shown for the predicted structure of a double‐domain stability element: TWIFB1_Osa. Gray circles (•) and dashed lines (−) indicate putative nucleotide interactions. (c) A generalized schematic diagram and X‐ray crystal structure is shown for a blunt‐ended RNA triple helix: Human MALAT1 (cartoon representation). The tricolor scheme of strands and symbols for their interactions are described in Figure 1. The PDB IDs are listed below each RNA label
FIGURE 7Chemical structures of triplex‐binding small molecules. Chemical structures are shown for small molecules that (a) broadly recognize RNA triple helices and (b) bind to the MALAT1 RNA triple helix