| Literature DB >> 29914113 |
Dmitri D Pervouchine1,2,3.
Abstract
The ability to form an intramolecular structure plays a fundamental role in eukaryotic RNA biogenesis. Proximate regions in the primary transcripts fold into a local secondary structure, which is then hierarchically assembled into a tertiary structure that is stabilized by RNA-binding proteins and long-range intramolecular base pairings. While the local RNA structure can be predicted reasonably well for short sequences, long-range structure at the scale of eukaryotic genes remains problematic from the computational standpoint. The aim of this review is to list functional examples of long-range RNA structures, to summarize current comparative methods of structure prediction, and to highlight their advances and limitations in the context of long-range RNA structures. Most comparative methods implement the “first-align-then-fold” principle, i.e., they operate on multiple sequence alignments, while functional RNA structures often reside in non-conserved parts of the primary transcripts. The opposite “first-fold-then-align” approach is currently explored to a much lesser extent. Developing novel methods in both directions will improve the performance of comparative RNA structure analysis and help discover novel long-range structures, their higher-order organization, and RNA⁻RNA interactions across the transcriptome.Entities:
Keywords: DST; Dscam; Nmnat; RNA processing; RNA structure; RNA–RNA interaction; folding; long-range; mutually exclusive splicing; polyadenylation
Year: 2018 PMID: 29914113 PMCID: PMC6027157 DOI: 10.3390/genes9060302
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Functional long-range RNA structures in Drosophila and Human.
| Species | Gene | Function | Length * | Spread * | References |
|---|---|---|---|---|---|
|
|
| Exon 4 cluster | 13 | 4500 | [ |
|
| Exon 6 cluster | 16 | 11,000 | [ | |
|
| Exon 9 cluster | 16 | 14,000 | [ | |
|
| Exon 17 cluster | 15 | 1000 | [ | |
|
| Exon 7 cluster | 14 | 2500 | [ | |
|
| Exon 9 cluster | 14 | 1600 | [ | |
|
| Exon 11 cluster | 15 | 2600 | [ | |
|
| Exon 5 and polyA site | 14 | 400 | [ | |
|
| Exon 10 | 16 | 350 | [ | |
|
| Exon 4 cluster | 21 | 450 | [ | |
|
| Exon 5 cluster | 22 | 1200 | [ | |
| Human |
| Exon 10 | 17 | 100 | [ |
|
| Exon 11a | 18 | 1800 | [ | |
|
| Exons 47-52 | 15 | 10,000 | [ | |
|
| Exon 7 | 8 + 7 + 8 | 280 | [ | |
|
| Exon 3 | 10 + 5 | 600 | [ | |
|
| Exons 7 and 8 | Repeat | 6500 | [ | |
|
| Paraspeckle formation | N/A | 10,000 | [ |
(*) Length: approximate number of base pairs in complementary regions; Spread: loop size, i.e., sequence distance between complementary parts; N/A: not applicable.
Figure 1A “commutative diagram” of the alignment and folding tradeoff. Top left: unaligned RNA sequences. Bottom left: their structure-agnostic alignment; conserved regions are shown in gray. Top right: sparse folding identifies candidate helices shown as arcs. Bottom right: conserved helices are matched by structure-aware alignment or identified in a multiple sequence alignment.