Microtubule-based mRNA transport is widely used to restrict protein expression to specific regions in the cell and has important roles in defining cell polarity and axis determination as well as in neuronal function. However, the structural basis of recognition of cis-acting mRNA localization signals by motor complexes is poorly understood. We have used NMR spectroscopy to describe the first tertiary structure to our knowledge of an RNA element responsible for mRNA transport. The Drosophila melanogaster fs(1)K10 signal, which mediates transport by the dynein motor, forms a stem loop with two double-stranded RNA helices adopting an unusual A'-form conformation with widened major grooves reminiscent of those in B-form DNA. Structure determination of four mutant RNAs and extensive functional assays in Drosophila embryos indicate that the two spatially registered A'-form helices represent critical recognition sites for the transport machinery. Our study provides insights into the basis for RNA cargo recognition and reveals a key biological function encoded by A'-form RNA conformation.
Microtubule-based mRNA transport is widely used to restrict protein expression to specific regions in the cell and has important roles in defining cell polarity and axis determination as well as in neuronal function. However, the structural basis of recognition of cis-acting mRNA localization signals by motor complexes is poorly understood. We have used NMR spectroscopy to describe the first tertiary structure to our knowledge of an RNA element responsible for mRNA transport. The Drosophila melanogasterfs(1)K10 signal, which mediates transport by the dynein motor, forms a stem loop with two double-stranded RNA helices adopting an unusual A'-form conformation with widened major grooves reminiscent of those in B-form DNA. Structure determination of four mutant RNAs and extensive functional assays in Drosophila embryos indicate that the two spatially registered A'-form helices represent critical recognition sites for the transport machinery. Our study provides insights into the basis for RNA cargo recognition and reveals a key biological function encoded by A'-form RNA conformation.
In eukaryotes, asymmetric localization of mRNAs plays widespread roles in protein targeting and is crucial for many processes, including patterning of embryonic axes, polarized cell functions and synaptic plasticity1,2. In most cases, mRNAs are localized asymmetrically by directed transport along the cytoskeleton by molecular motors1. Transport of specific mRNAs depends on cis-acting RNA elements commonly located in their 3′ untranslated region (3′UTR). These RNA signals are recognized by trans-acting protein factors, which link the mRNA to the motors. However, the molecular basis underlying the recognition of localizing mRNAs is poorly understood.An emerging model for elucidating the molecular principles of mRNA localization is the delivery of developmentally important transcripts to the minus-ends of microtubules during early Drosophila development. This process is dynein-dependent and can be accessed by microinjection of in vitro synthesized fluorescent transcripts3,4. Several minus-end-directed transport signals have been mapped in Drosophila mRNAs4-10. These signals are all predicted to adopt stem-loop structures comprising ~ 40-65 nucleotides (nt), but do not share primary sequence or any obvious RNA motifs. Thus, it is unclear what features within any of these mRNAs are recognized by the transport machinery.We have used NMR spectroscopy to describe the structure of the 44 nt RNA element responsible for dynein-mediated localization of Drosophila fs(1)K10 (K10) transcripts9. This maternal transcript is transported from the nurse cells into the oocyte, where it localizes at the anterior, and its product regulates dorsoventral polarity 9,11. The K10 signal adopts a stem-loop with unexpected structural features. Stacking interactions of purine bases within canonical, double-stranded (ds) RNA helices give rise to base pair inclinations and widened major grooves, consistent with stem regions adopting a so-called A′-form conformation. The results of structural determination of mutant RNAs and functional assays in Drosophila embryos suggest that two spatially registered, widened major grooves represent the binding sites for the transport machinery. The present study also demonstrates that dsRNA with regular base pairs has unappreciated structural complexity capable of mediating selective recognition, and thereby assigns a key biological function to the A′-form RNA conformation.
RESULTS
Structure of the K10 transport and localization signal
To reveal the specific RNA features that mediate recognition by the transport machinery, we used NMR spectroscopy to determine the structure of the Drosophila melanogasterK10 transport and localization signal (TLS), a 44 nt sequence in the K10 3′UTR that is essential for patterning the dorsoventral axis9,10. RNA molecules larger than 30 nucleotides often display substantial resonance overlap, which makes unambiguous resonance assignments impossible12. In the TLS RNA, over 80% of the base paired helices are formed by A-U or U-A Watson-Crick base pairs (Fig. 1a). Nonetheless, resonance overlap could be resolved using a combination of homonuclear and heteronuclear NMR spectroscopy including site-specific deuteration of pyrimidineH-5 protons (Supplementary Fig. 1). The final ensemble of K10 TLS structures is well defined (r.m.s. deviation of 1.15 Å) and both its local and global precision greatly depended on 115 angular restraints derived from experimental residual dipolar couplings (RDCs)13,14 (Fig. 1b, Table 1 and Supplementary Fig. 2a).
Figure 1
Solution structure of K10 TLS RNA.
(a) Secondary structure of wild-type K10 TLS RNAs. Numbering according to ref.9. Outlined nucleotides were added to improve transcription efficiency. The three helical segments are separated by single nucleotide bulges (C33 and A37). The lower helix (blue) comprises nt 1-7 and 38-44, the middle helix (green) nt 8-10 and 34-36, and the upper helix (red) nt 11-17 and 26-32, respectively.
(b) Heavy-atom superposition of the 12 lowest-energy K10-WT RNAs refined with RDCs. Bases are red and the ribose-phosphate backbone is pink. The three helical regions are indicated beside the structures using the same color scheme as in a. C33 and A37 are shown in green.
(c) Electrostatic surface potential of K10-WT RNA. Both the upper and lower helical regions display widened major grooves with a relative orientation of 90° along the helical axis. The widened major grooves are indicated beside the structures using the same color scheme as in a.
(d) Representative structures of A-form and A′-form dsRNA and B-dsDNA compared to upper and lower helical regions of K10-WT RNA. All helices are shown from the major groove side to visualize the differences in inclination angles and groove widths. The helical axis is shown in blue and the inclination angle of base pairs is indicated by a dashed line. Bases are red and the ribose-phosphate backbone is pink. The PDB IDs are 1SDR (A-form RNA), 413D (A′-form RNA) and 1BNA (B-form DNA).
(e) View down the lower and upper helix of K10-WT RNA displaying continuous stacking of purine bases. Five purine bases in the lower helix (A5-G44) and seven adenine bases in the upper helix (A17-A32) display continuous base-base stacking giving rise to A′-form inclination angles and widened major grooves (see Supplementary Tables 1 and 2). Pyrimidine bases are blue and purine bases are pink; ribose-phosphate backbone and chemical groups on the bases are omitted for clarity. Numbering according to a.
Table 1
NMR and refinement statistics for K10-WT and mutant K10 RNAs
WT
au-up
2gc-up
2gc-low
A-low
NMR distance and dihedral constraints
Distance restraintsa
Total NOE
766
789
745
638
723
Intra-residue
284
286
257
193
283
Inter-residue
482
503
488
435
440
Sequential (|i − j| = 1)
376
396
374
331
328
Non-sequential (|i − j| > 1 )
106
107
114
104
112
Hydrogen bonds
17
17
17
18
21
Total dihedral angle restraints
337
338
349
339
339
Base pair
18
18
18
19
19
Sugar pucker
180
180
180
180
180
Backbone
139
140
141
140
140
Total RDCsb
115
100
117
95
103d
Structure statistics
Violations (mean ± s.d.)
Distance constraints (Å)
0.017±0.0005
0.016±0.001
0.018±0.001
0.019±0.003
0.018±0.001
Dihedral angle constraints (°)
0.73±0.05
0.65±0.05
0.68±0.05
0.94±0.31
0.74±0.06
RDCs (Hz)
1.03±0.05
0.89±0.04
0.96±0.01
0.92±0.1
1.1±0.04d
Max. dihedral angle violation (°)
5.8
8.1
6.9
9.6
6.7
Max. distance constraint violation (Å)
0.22
0.28
0.20
0.37
0.29
Deviations from idealized geometry
Bond lengths (Å)
0.003±0.00004
0.003±0.00004
0.004±0.0001
0.004±0.00007
0.004±0.00003
Bond angles (°)
0.93±0.001
0.91±0.01
0.96±0.01
0.91±0.01
0.95±0.006
Impropers (°)
0.48±0.006
0.41±0.02
0.41±0.01
0.49±0.06
0.48±0.05
Average pairwise r.m.s. deviationc (Å)
All RNA heavy
1.15
1.28
1.08
1.54
1.21
Lower helix (nt 1-7 and 38-44)
0.33
0.46
0.49
0.61
0.70
Middle helix (nt 8-10 and 33-37)
0.30
0.62
0.53
0.70
0.62
Upper helix (nt 11-17 and 26-31)
0.55
0.52
0.39
0.59
0.33
Only meaningful, nonfixed distance constraints were used.
The axial (Da) and rhombic (R) component of the alignment tensor used in the final structure calculations are Da = −26.43 and R = 0.111 (WT), Da = −28.10 and R = 0.097 (au-up), Da = −31.78 and R = 0.098 (2gc-up), Da = −30.29 and R = 0.132 (2gc-low) and Da = −27.11 and R = 0.034 (A-low), respectively.
Refinement of A-low also includes a separate set of 47 local RDCs for the lower helix (Da = −15.17 and R = 0.029).
The K10 TLS RNA adopts a stem-loop structure capped by an octanucleotide loop (5′-A(18)UUAAUUC(25)-3′), which displays a compact fold (Supplementary Fig. 3a). The helical part of the TLS can be divided into three regions: an upper helix composed of seven Watson-Crick A-U or U-A base pairs, a middle helix of three Watson-Crick base pairs, flanked at each end by single nucleotide bulges on the 3′ side, and a lower helix consisting of a G-U and six Watson-Crick base pairs (Fig. 1a,b). The two unpaired bases adopt different orientations relative to the helices. The base moiety of C33 resides in the major groove, maintaining the helical twist between the adjacent base pairs, while the base of A37 is stacked in between the middle and the lower helix and increases the helical twist between the adjacent base pairs (Fig. 1b and Supplementary Fig. 3b).Both the upper and lower helical regions display major grooves that are unusually widened relative to typical A-form RNA, such that the groove widths are reminiscent of B-form DNA (Fig. 1c,d; see also Fig. 3 and Supplementary Tables 1 and 2). This is very surprising in the context of the K10 signal whose double helical regions are composed of Watson-Crick base pairs and a G-U base pair that should maintain A-form helical geometry normally seen in dsRNA15. A-form dsRNA is characterized by a positive inclination angle of the Watson-Crick base pairs relative to the helical axis resulting in a deep and narrow major groove inaccessible for ligand interaction (Fig. 1d). The upper and lower helical regions of the K10 RNA, in contrast, display lower inclination angles and accessible major grooves (Fig. 1b-d and Supplementary Tables 1 and 2), consistent with A′-form RNA conformation previously deduced from X-ray fibre diffraction data15,16 and observed in a crystal structure of a model RNA duplex that includes both non-canonical and Watson-Crick base pairs17. Within helix IV of 5S ribosomal RNA, non-canonical G-A base pairs induce cross-strand purine-purine stacking in adjacent G-U wobble base pairs and thereby A′-form conformation with major groove widths similar to B-form DNA18.
Figure 3
Solution structure of mutant K10 TLS RNAs.
(a-d) Heavy-atom superposition of the lowest-energy mutant K10 RNAs refined with RDCs. Bases are red and the ribose-phosphate backbone is pink. The mutated bases are green and the naming and sequence of each mutant corresponds to Fig. 2a. The widest opening of the major groove in the upper and lower helix is shown in the left and right view (rotation by 90° relative to the helical axis) of each ensemble.
(e) Plot of the major groove width (Å) at each base pair in wild-type and mutant K10 RNAs. Base pairs are indicated corresponding to their 5′ nucleotide numbered according to Fig. 1a. Mean values are displayed for each RNA. Standard errors of the mean for each value are below 1.0 Å and summarized in Supplementary Table 2. Idealized A-form (dotted line) and B-form (solid line) values from ref.17 are also displayed. The corresponding mean base pair inclination angles are listed in Supplementary Table 1.
The two unusually widened grooves in the K10 TLS, the widest parts of which are orientated at 90° to one another (Fig. 1c), derive from continuous stacking interactions of purine bases that lower inclination angles and unwind the helix (Fig. 1e). In the lower helix, low inclination angles are caused by a continuous stack of five purine bases (Fig. 1e) on one side of the helix, which allows formation of four Watson-Crick base pairs, but distorts the G-U base pairing, so that it cannot adopt the wobble conformation with two imino-carbonyl hydrogen bonds usually seen in conventional helical regions (Supplementary Fig. 3c). In the upper helix, there is continuous stacking of seven Watson-Crick base paired adenine bases including a cross-strand stacking between bases of A16 and A28 that positions each of the adenineH-2 protons above the other base moiety (Fig. 1e). This unusual placement results in strong upfield shifts of their H-2 proton resonance frequencies (6.18 and 6.22 p.p.m., respectively) due to the ring current of the neighbouring base (Supplementary Fig. 1b). This resonance frequency alteration provides additional NMR spectroscopic evidence for unusual local adenine-adenine stacking, since adenineH-2 protons in Watson-Crick base pairs usually display proton resonance frequencies between 7-8 p.p.m19. The presence of A′-form conformations of the upper and lower helix in the K10 TLS is further supported by B-form DNA-like circular dichroism (CD) spectra with a peak at 280 nm instead of 260 nm as usually observed for A-form RNA20 (Supplementary Fig. 4a-d).
To investigate the importance of the structural features of the TLS for signal activity, we exploited a robust in vivo assay, which monitors dynein-dependent localization upon injection of fluorescently-labelled, in vitro synthesized RNAs into the cytoplasm of Drosophila blastoderm embryos3. Injected RNAs assemble into particles and those containing an active localization signal are transported within ~ 6 min to the apical cytoplasm, the site of microtubule minus-end nucleation21. RNA species differentially regulate the persistence of minus-end-directed movement on microtubules by controlling the average number of the Egalitarian (Egl), Bicaudal-D (BicD) proteins and, possibly, dynein molecules assembled on the transported particles22.We examined the activity of mutant K10 TLSs within the context of a ~ 2300 nt fragment of the K10 transcript that depends on the TLS for efficient, apical localization23 (Table 2 and Fig. 2a,b; wild-type (WT) versus scrambled). Replacing the octaloop with a stable UUCG tetraloop closed by a C-G base pair24 had no discernible affect on the efficiency of apical K10 localization (Table 2 and Supplementary Fig. 5). Thus, the loop is not a mediator of signal activity.
Table 2
Localization efficiency of wild-type and mutant K10 TLS RNAs
Transcripta
Localization efficiencyb
Transcripta
Localization efficiencyb
WT
++ (47)
2cg-up (A′/A′)
++ (18)
scrambledc
− (60)
A-low-2cg-up (A/A′)
+ (31)
tetraloop
++ (64)
A-low-h44-up (A/A′)
+ (57)
2gc-low (A′/A′)d
++ (19)
h44-up (A′/A′)
+ (38)
A-low (A/A′)
+ (83)
ΔC33
+ (55)
A-up (A′/A)
+ (58)
ΔA37
+ (33)
A-low-A-up (A/A)
− (33)
ΔC33+ΔA37
− (30)
au-up (A′/A′)
++ (29)
C33Ae
+ (29)
A-low-au-up (A/A′)
+ (30)
C33U
++ (14)
2gc-up (A′/int)
++ (30)
C33G
++ (23)
2gc-low-2gc-up (A′/int)
++ (29)
A37U
++ (22)
A-low-2gc-up (A/int)
− (75)
A37C
++ (23)
2gc-low-au-up (A′/A′)
++ (26)
A37G
++ (15)
2gc-low-A-up (A′/A)
+ (75)
C33A+A37C
++ (23)
5cg-up (A′/A′)
++ (16)
ΔA37+gcf
++ (46)
A-low-5cg-up (A/A′)
+ (18)
ΔC33+gcf
+ (28)
The sequence of the specific mutations are shown in Fig. 2a and 4a.
++, strong localization; + weak localization; −, no localization. See Online Methods for more details of scoring system. Number of injected embryos is shown in parentheses.
This is a randomized version of the constituent bases of the 44 nt K10 TLS which is predicted not to form extensive secondary structure (5′-UUUAUACUCAUAUAUUUAUUAAUGUAAUUAAAUCUAGAACAAUG-3′).
For mutants designed to interfere with stacking interactions, the determined, or predicted, helical geometries (lower helix/upper helix) are shown in parentheses as follows: A′, widened major groove (A′-form); A, narrow major groove (A-form); int, intermediate groove width between A′-form and A-form.
In blind experiments, the C33A mutant was consistently scored as slightly more active than ΔC33.
Residue C33 or A37 are deleted and an additional G-C base pair is inserted in the middle helix above the G8-C36 base pair.
Figure 2
Localization activity of wild-type and lower and upper stem mutant K10 RNAs.
(a) Secondary structure of wild-type and mutant K10 TLS RNAs. Numbering according to ref.9. Outlined nucleotides were added to improve transcription efficiency. The sequences of K10 RNA mutations and the corresponding names used throughout the text are displayed. (b) Representative confocal images of blastoderm embryos injected with transcripts as indicated. TLS mutations were introduced within the context of a 2300 nt K10 sequence (see Online Methods). Transcripts were visualized by virtue of directly incorporated fluorochrome-coupled UTP. Arrow indicates the approximate site of injection in all experiments. Apical is to the top and basal is to the bottom in all images. Images of injections of additional transcripts are shown in Supplementary Fig. 5. Scale bar, 50μm.
We then tested whether specific structural or sequence motifs within the upper and lower helices are required for transport. First, we focussed on the distorted G-U base pair in the lower helix, which could contribute to signal activity through participation in purine base-base stacking or direct sequence-specific recognition by the transport machinery. To attempt to distinguish between these possibilities, we analyzed a mutant RNA that alters base identities but maintains purine stacking by replacing the G-U base pair by a Watson-Crick G-C base pair and the U-A base pair below with a C-G base pair (Fig. 2a; 2gc-low). Despite the alteration of base identity, NMR structure determination reveals that the A′-form inclination angles and the widened major groove are largely preserved (Fig. 3 and Supplementary Tables 1 and 2) and the signal drives transport that is indistinguishable from K10-WT (Table 2 and Fig. 2b; 2gc-low versus WT). Thus, the G-U base pair is also not a determinant of signal activity.Next we replaced the entire lower A′-form section with a model A-form RNA sequence derived from the brain cytoplasmic 1 RNA25 (Fig. 2a and Supplementary Fig. 2; A-low). The NMR structure of this A-low mutant reveals a deep and narrow major groove and steep inclination angles in the lower stem (Supplementary Tables 1 and 2). A′-form features in the upper helix are preserved in this mutant (Fig. 3), indicating that the local conformations in the upper and lower stems are independent (Supplementary Fig. 6). The A-low mutant supports only inefficient apical transport (Table 2 and Fig. 2b), demonstrating that the presence of a widened major groove in the lower helix correlates with full signal activity.To investigate if the upper widened major groove also contributes to signal activity, this region was replaced with the same model A-form RNA helix (A-up), while preserving the K10-WT base pairs adjacent to the hairpin loop and the C33 bulge (Fig. 2a; A-up). This mutation weakens signal activity to the same extent as A-low (Table 2 and Fig. 2b; A-low versus A-up). Replacing both helices with A-form regions (A-low-A-up) completely inactivates the TLS (Table 2 and Fig. 2b; A-low-A-up), indicating that each A′-form region contributes to full signal activity and is recognized by the localization machinery as a distinct feature.To further test the importance of the upper A′-form helix, we determined the structure of two additional mutants that partially interrupt the contiguity of stacked purines in this region (Fig. 2a). Transversion of a single U-A base pair in the upper stem (au-up) does not disrupt the A′-form geometry of the TLS (Fig. 3c) and maintains full transport activity (Table 2 and Fig. 2b, au-up). In addition, combining the weakly localizing lower stem A-form mutant with this transversion (to produce A-low-au-up) does not further reduce signal activity (Table 2 and Supplementary Fig. 5; A-low-au-up), presumably because A′-form geometry is maintained in the upper stem.Transversion of two U-A base pairs in the upper stem to G-C base pairs (2gc-up) leads to higher inclination angles and reduced major groove widths compared to the wild-type TLS (Fig. 3d,e), albeit distinguishable from regular A-form because of the residual stacking interactions of adenines below the octaloop (Supplementary Fig. 7). This partial reduction in A′-form geometry impairs but does not abolish the activity of the upper stem. It is sufficient to support localization of transcripts in which the lower stem forms an A′-form helix (as in 2gc-up and the double mutant 2gc-low-2gc-up; Table 2 and Fig. 2b). However, when combined with the lower A-form stem mutant, which localizes weakly in the context of the wild-type K10 upper helix, apical localization is completely abolished (Table 2 and Fig. 2b; A-low-2gc-up). These data, together with the analysis of two other double mutants (Table 2 and Fig. 2b; 2gc-low-au-up and 2gc-low-A-up) demonstrate that the efficiency of transport correlates strongly with the overall extent of A′-form structure within the TLS, and that the two A′-form helices co-operate in order to achieve full activity.In agreement with our data, site-specific mutagenesis experiments previously indicated that the base-paired stems are critical determinants of K10 TLS activity during oogenesis10. This study also suggested that specific base pairs are important for TLS activity during these stages and that localization of the K10 transcript is strongly inhibited only when mutations substantially alter their stereochemistry in the minor groove. Mutations that have little effect on the number or arrangement of hydrogen-bond (H-bond) donor and acceptor groups in the minor groove (e.g. A-U to U-A or U-A to A-U)26, decreased K10 localization in the oocyte only modestly10. In contrast, several mutations which more dramatically alter minor groove stereochemistry, by displaying amino groups instead of adenineH-2 protons (e.g. A-U to G-C or U-A to C-G)26, greatly reduced localization10.To test whether apical localization of K10 in blastoderm embryos also depends on minor groove features, we mutated U-A to C-G base pairs, thereby altering the number of H-bond donors and acceptors in the minor groove whilst maintaining the A′-form-inducing purine runs (Fig. 4a). Transversion of two U-A base pairs in the upper stem to C-G base pairs (2cg-up) maintains wild-type levels of apical transcript localization and combination with the lower A-form stem mutant (A-low-2cg-up) supports weak localization as observed for the A-low mutant with the wild-type K10 upper helix (Table 2 and Fig. 4; 2cg-up and A-low-2cg-up). An even more drastic change in the upper helix of 5 U-A to 5 C-G base pairs (5cg-up) also fully drives apical transport in the context of the wild-type lower helix (Table 2 and Fig. 4; 5cg-up). Importantly, 5cg-up also functions like a wild-type upper helix in supporting weak localization in combination with the A-low mutation (Table 2 and Fig. 4; A-low-5cg-up). As described above, an upper helix containing mutations of just two U-A base pairs to G-C base pairs is unable to complement the A-form lower helix, resulting in an inactive signal (A-low-2gc-up), despite G-C and C-G base-pairs having a very similar arrangement of H-bond acceptors and donors in the minor groove26. Collectively, these observations strongly argue against recognition of minor groove features in the upper helix of K10 in the embryo. Instead recognition of features associated with the widened major grooves induced by purine-purine stacking is likely to underpin apical K10 transport.
Figure 4
Localization activity of A′-form and bulge mutant K10 RNAs.
(a) Secondary structure of wild-type and mutant K10 TLS RNAs. Numbering according to ref.9. The sequences of K10 RNA mutations and the corresponding names used throughout the text are displayed; ΔC33 and ΔA37 denote deletion of the correspond nucleotides. Open arrowhead indicates position where an additional base pair is inserted in ΔC33+gc and ΔA37+gc. In the h44-up mutant 4 U-A base pairs are replaced by nucleotides 1420-1427 and 1473-1479 of the 16S rRNA helix 44 shown in b.
(b) Secondary and tertiary structure of an A′-form helix 44 segment from 16S rRNA (PDB ID 2J00)27. Numbering according to (PDB ID 2J00)27. Pyrimidine bases are blue and purine bases are pink; ribose-phosphate backbone and chemical groups on the bases are omitted for clarity. The inclination angles (in degrees) are given for each base pair.
(c and d) Representative confocal images of blastoderm embryos injected with transcripts as indicated. TLS mutations were introduced within the context of a 2300 nt K10 sequence (see Online Methods). Transcripts were visualized by virtue of directly incorporated fluorochrome-coupled UTP. Arrow indicates the approximate site of injection in all experiments. Apical is to the top and basal is to the bottom in all images. Images of injections of additional transcripts are shown in Supplementary Fig. 5. Scale bar, 50μm.
To test whether a heterologous A′-form helix is sufficient to support apical transport, we inspected helices with runs of purines in the crystal structure of the Thermus thermophilus 30S small ribosomal subunit (PDB ID 2J00)27. Although not previously commented on, several helices of 16S rRNA display A′-form inclination angles and widened major grooves. All of these helices are associated with runs of three or more base-paired purines on one side of the stem (Supplementary Fig. 8), and one, helix 44, contains a segment whose inclination angles are particularly reminiscent of the K10 upper helix, despite having a very different sequence composition (Fig. 4b and Supplementary Table 1).To test whether this 16S rRNA A′-form helix also supports apical transport we used it to replace 4 U-A base pairs in the upper helix of the K10 TLS in the context of an A-form lower helix (Fig. 4a; A-low-h44-up). This heterologous upper helix behaves indistinguishably from the wild-type K10 upper helix in this situation, driving weak apical localization of K10 (Table 2 and Fig. 4c, A-low-h44-up). These data provide further evidence of the correlation between A′-form geometry and localization activity. Interestingly, unlike all other upper helix mutants tested in this study, the extent of localization supported by the helix 44 segment is not improved by replacing the A-form helix with the wild-type K10 A′-form lower helix (Table 2 and Fig. 4c; h44-up). The h44-up mutant has a longer upper helix relative to the wild-type and mutant K10 TLSs, implying that the register or spacing of widened major grooves in A′-form helices could be important for full activity of a localization element.Our functional analysis of the bulged nucleotides C33 and A37 also supports this notion. Deletion of each bulge individually reduces the efficiency of apical K10 localization dramatically and deletion of both renders the signal inactive (Table 2 and Fig. 4d; ΔC33, ΔA37 and ΔC33+ΔA37). Any nucleotide at positions 33 and 37 gives more effective localization than the respective bulge deletion (Table 2, Fig. 4d and Supplementary Fig. 5), indicating that specific recognition of functional groups within the bulged nucleotides is not important. Instead the two bulges could fine-tune the relative spacing and/or orientation of the widened major grooves in the K10 lower and upper helix (Supplementary Fig. 3b).Within the wild-type K10 TLS the bulge C33, which does not alter the helical twist between the adjacent base pairs, could serve as a hinge to modulate the relative angle of the upper and lower helix upon interaction with the transport machinery, while the helical twist contributed by A37 might assist to orient them at 90° to one another (Fig. 1c). Consistent with this hypothesis, insertion of an additional Watson-Crick base pair in the middle helix can rescue the deletion of A37 but not deletion of C33 (Table 2 and Fig. 4d, ΔA37+gc, ΔC33+gc). The finding that deletion of both bulges within the wild-type K10 TLS (ΔC33+ΔA37), which would strongly perturb the relative orientation of the two A′-form helices, has a greater inhibitory effect than having an aligned A-form and A′-form helix (e.g. A-up or A-low) argues that the upper and lower helix are not recognized independently. We conclude that the localization machinery recognizes the widened K10 major grooves only when correctly oriented in a longer RNA structure.
DISCUSSION
The results of systematic in vivo analysis and structure determination of mutant RNAs are consistent with a model in which the key factors for K10 signal activity in the embryo are two spatially oriented A′-form RNA helices with widened major grooves. When these helices are correctly aligned, the efficiency of transport correlates with the overall extent of A′-form structure.It was very surprising to detect A′-form conformation in the double helical regions of the K10 TLS, since they are composed of regular Watson-Crick and a G-U base pair which usually maintain A-form helical geometry15. A′-form RNA conformation is characterized by lower inclination angles of the base pairs compared to A-form RNA and a concomitantly widened major groove17. To achieve this, no torsion angle along the backbone has to change substantially, and the angle between the ribose and base (χ angle) is only lowered by ~10 degrees. Thus, there can be no directly observable NOE or torsion angles diagnostic of A′-form versus A-form RNA.Nonetheless, the presence of A′-form helicity in the TLS is further supported by B-form-like CD spectra of RNAs, very strong upfield shifts of adenineH-2 protons and the ability to detect reduced major groove widths in calculated NMR structures of mutant RNAs with disrupted purine-purine stacking. In addition, our functional experiments reveal a strong correlation between the extent of purine-purine stacking and the degree of localization signal activity. We also detected A′-form geometry in several Watson-Crick or G-U base paired 16S rRNA helices again associated with runs of three or more consecutive purines on either side of the stem. Collectively, these observations provide a compelling case that A′-form conformation can be adopted by dsRNAs with regular base pairs undergoing contiguous purine-purine stacking and that such structures are functionally important for recognition of the K10 TLS by the mRNA localization machinery.A′-form conformation results in increased major groove widths in the TLS, which could accommodate positively-charged protein loops, α-helices or beta-hairpins from proteins that link the TLS to dynein complexes. U-A or C-G base pairs can be tolerated in at least some positions of the A′-form helices, arguing against base pair specific contacts by the apical localization machinery. Instead it is likely that interactions occur with the ribose-phosphate backbone, which is accessible from the major groove in A′-form helices, although recognition of consecutive N-7 positions from the stacked purines could also conceivably contribute to TLS activity.The secondary structures of other signals that mediate mRNA transport towards the minus-ends of microtubules in Drosophila suggest that they could contain features similar to the K10 TLS. The orb localization signal preserves the lower and upper stem stacking of base paired purines as found in the K10 TLS (Supplementary Fig. 6e), but lacks the upper bulged nucleotide10. Instead, the upper U-A base paired helix is extended, which could maintain the relative orientation of the widened major grooves. Stem-loops within the other mapped minus-end-directed signals active in the embryo—from gurken—contain 2 or more stretches of at least three contiguous purines on the same side of the stem (Supplementary Fig. 9). Indeed, extensive mutagenesis has revealed that at least some of these purines are essential for signal activity5,6,8. Consistent with a shared structural basis of recognition of minus-end-directed RNA signals, several of these elements, including the K10 TLS, are known to be directly contacted by the Egl protein, despite its lack of a canonical RNA-binding motif28. Future experiments will be aimed at elucidating the molecular basis of RNA cargo recognition by Egl and how the RNA signals control stoichiometry of transport complexes.Finally, our results underscore the importance of comparative structural studies of wild-type and mutant elements to reveal features encoding RNA function. Mutational analysis alone, which is usually based on computationally and biochemically derived secondary structures, could not have identified the presence of A′-form helices that appear critical for signal activity.
Authors: Y Tanaka; S Fujii; H Hiroaki; T Sakata; T Tanaka; S Uesugi; K Tomita; Y Kyogoku Journal: Nucleic Acids Res Date: 1999-02-15 Impact factor: 16.971
Authors: Maria Selmer; Christine M Dunham; Frank V Murphy; Albert Weixlbaumer; Sabine Petry; Ann C Kelley; John R Weir; V Ramakrishnan Journal: Science Date: 2006-09-07 Impact factor: 47.728
Authors: Franziska Theresia Edelmann; Andreas Schlundt; Roland Gerhard Heym; Andreas Jenner; Annika Niedner-Boblenz; Muhammad Ibrahim Syed; Jean-Christophe Paillart; Ralf Stehle; Robert Janowski; Michael Sattler; Ralf-Peter Jansen; Dierk Niessing Journal: Nat Struct Mol Biol Date: 2017-01-16 Impact factor: 15.369