Literature DB >> 24837192

Core structure of the U6 small nuclear ribonucleoprotein at 1.7-Å resolution.

Eric J Montemayor1, Elizabeth C Curran2, Hong Hong Liao3, Kristie L Andrews2, Christine N Treba4, Samuel E Butcher3, David A Brow4.   

Abstract

The spliceosome is a dynamic assembly of five small nuclear ribonucleoproteins (snRNPs) that removes introns from eukaryotic pre-mRNA. U6, the most conserved of the spliceosomal small nuclear RNAs (snRNAs), participates directly in catalysis. Here, we report the crystal structure of the Saccharomyces cerevisiae U6 snRNP core containing most of the U6 snRNA and all four RRM domains of the Prp24 protein. It reveals a unique interlocked RNP architecture that sequesters the 5' splice site-binding bases of U6 snRNA. RRMs 1, 2 and 4 of Prp24 form an electropositive groove that binds double-stranded RNA and may nucleate annealing of U4 and U6 snRNAs. Substitutions in Prp24 that suppress a mutation in U6 localize to direct RNA-protein contacts. Our results provide the most comprehensive view to date of a multi-RRM protein bound to RNA and reveal striking coevolution of protein and RNA structure.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24837192      PMCID: PMC4141773          DOI: 10.1038/nsmb.2832

Source DB:  PubMed          Journal:  Nat Struct Mol Biol        ISSN: 1545-9985            Impact factor:   15.369


U6 snRNA endows substrate specificity and catalytic function to the spliceosome, and is thought to derive from domain 5 of group II self-splicing introns[1-4]. The U6 snRNP in the budding yeast S. cerevisiae contains the 112-nucleotide U6 snRNA, 51 kDa Prp24 protein, and 94 kDa Lsm2–8 heteroheptamer[5-9]. Incorporation of U6 into the spliceosome requires unwinding of the internal stem loop (ISL) within the U6 snRNP and pairing to U4 snRNA, forming a U4/U6 di-snRNP (Fig. 1a). Prp24 acts as a chaperone for annealing of the U4 and U6 snRNPs[10-15], and is displaced from U6 after U4/U6 pairing is complete[6,8,16]. The Lsm ring, which binds the uracil-rich 3′ end of U6, also promotes U4/U6 annealing but remains bound to U6 in the U4/U6 di-snRNP[7,17-19]. During spliceosome activation, U6 is transferred from U4 to U2 snRNA and the U6 ISL reforms (Fig. 1a), creating a structure that binds two catalytic metal ions required for the splicing reaction[3]. After intron excision, U6 snRNA dissociates from U2 and reforms the U6 snRNP, which can enter another splicing cycle by re-annealing with U4 snRNA.
Figure 1

Conformational changes in U6 snRNA during the splicing cycle. (a) Current models of secondary structure in free U6, U4/U6 and U2/U6 snRNAs. A pre-mRNA is shown base-paired to U2/U6. Prp24 is thought to stably bind only free U6 snRNA. Boxes indicate structures shown in detail below. Dashes and circles represent Watson-Crick and non-Watson-Crick base-pairs, respectively, and “+” denotes protonation of A62 in the A62–C85 base-pair. (b) Sequence of the yeast U6 ISL present in the U6 snRNP. A slightly truncated version of the U6 ISL is present in U2/U6. Black arrow denotes the A62G mutation. Red arrow denotes the A62U,C85A double mutation in the “U6-UA” mutant. (c) The U4/U6 base-paired region is shown in detail. The arrows denote the positions of mutations shown in panel b.

Yeast Prp24 contains four RNA recognition motifs (RRMs) and a C-terminal conserved sequence that interacts with the Lsm ring[13]. RRMs are ubiquitous, ∼80 amino acid-long RNA-binding domains that typically recognize four single-stranded nucleotides[20-23]. Many RRM-containing proteins have multiple RRMs to enhance specificity and affinity for cognate RNAs in vivo[23], and can contain up to 6 RRMs[24]. In the case of Prp24, it is not clear how the multiple RRMs functionally cooperate to recognize U6 snRNA and promote annealing with U4 snRNA. However, we have shown that deletion of either the N- or C-terminal RRMs disrupts stable, stoichiometric binding of Prp24 to U6 RNA[25]. Furthermore, we previously determined the crystal and/or NMR structures of all four RRMs, including the non-canonical occluded RRM4 (oRRM4)[15], and identified the binding site of RRM2 in U6 RNA[14,26]. The strong temperature-dependence of RNA duplex stability makes cold-sensitive mutations a powerful tool for probing the dynamics of RNA base-pairing in vivo[1]. In previous work, we showed that the A62G mutation in U6, which stabilizes the base of the ISL (Fig. 1b), inhibits growth at low temperatures and decreases U4/U6 di-snRNP levels in vivo[12,27]. This cold-sensitive growth defect is suppressed by certain cis-acting mutations in U6 snRNA and trans-acting mutations in RRMs 2 and 3 of Prp24 (refs. 12,27). However, the mechanism of action of these suppressor mutations is poorly understood. To investigate the mechanism by which Prp24 chaperones recycling of U6 snRNA, we determined the 1.7 Å-resolution crystal structure of the highly conserved central region of U6 RNA bound to 83% of the 444-residue Prp24 protein, including all four RRMs. The structure reveals a unique RNP architecture, with three RRMs encircling a large RNA loop to form interlocked protein and RNA rings. We also identified a large number of additional trans-acting U6-A62G-suppressor mutations in Prp24, most of which are predicted to disrupt specific protein-RNA contacts evident in the crystal structure. In addition to providing the first image of four contiguous RRMs bound to their RNA target and validating a genetic approach to defining an RNA-protein interface, the core U6 snRNP structure suggests how Prp24 may facilitate dissociation of U6 from U2 in post-catalytic spliceosomes and promote annealing of U6 with U4 RNA.

Results

Structure of U6 snRNA bound to Prp24

Based on our previously determined structures of domains of Prp24, and the locations of U6-A62G-suppressor mutations in U6 and Prp24 (see below), we designed a core U6 snRNP construct for crystallization. A complex consisting of U6-A62G nucleotides 30–101 (with U-to-C substitutions at positions 100 and 101) bound to wild-type Prp24 residues 34–400 was stable to purification by anion-exchange chromatography and formed crystals that diffracted X-rays to 1.7 Å resolution. The phase problem was solved by molecular replacement, using structures of individual RRM domains in Prp24[14] (Fig. 2, Table 1, and Supplementary Figs. 1 and 2). This is the first reported crystal structure of U6 RNA, and the first structure of RNA bound to a protein with more than 2 RRMs (Supplementary Table 1).
Figure 2

Structure of the yeast U6–Prp24 complex. (a) Secondary structure of S. cerevisiae U6(30-101)-A62G,U100C,U101C mutant snRNA bound to Prp24, as observed in the crystal structure. Dashed gray lines indicate regions of the RNA that were deleted to facilitate crystallization. Nucleotides 71–76 and 101 are disordered in the crystal, and red nucleotides are mutated relative to the wild-type U6 sequence. Dashes represent Watson-Crick base-pairing, while open and closed circles denote non-Watson-Crick pairing. (b) Domain architecture of the Prp24 protein from S. cerevisiae. The first 33 and last 44 amino acids of Prp24 (in white) were deleted from the construct used in crystallization trials. (c,d) Two views of the crystal structure of the U6-A62G–Prp24 complex, rotated 90° relative to one another. U6 snRNA is colored salmon and the Prp24 domains are colored as in panel b. A cartoon schematic of the entwined protein/RNA topology is shown in (c). A stereo view of the structure is presented in Supplementary Figure 1.

Table 1

Data collection and refinement statistics (molecular replacement)

U6-A62G-Prp24
Data collection
Space groupP21
Cell dimensions
a, b, c (Å)61.8, 71.4, 82.1
 α, β, γ (°)90, 109.6, 90
Resolution (Å)50-1.7(1.73-1.70)*
Rsym0.06(0.90)
I / σI28.3(2.15)
Completeness (%)99.9(100)
Redundancy7.6(7.5)
Refinement
Resolution (Å)26.3-1.70
No. reflections74201
Rwork / Rfree18.4(27.0)/21.1(30.1)
No. atoms5032
 Protein2942
 RNA1389
 Ligand/ion5
 Water696
B factors (Å2)49.5
 Protein41.8
 RNA69.7
 Ligand/ion57.1
 Water41.8
r.m.s. deviations
 Bond lengths (Å)0.006
 Bond angles (°)1.03

All diffraction data were obtained from a single crystal.

Values in parentheses are for highest-resolution shell.

The U6Prp24 structure confirms the existence of the proposed telestem region in U6[12,19,28], spanning nucleotides 30–40 and 91–101 and including three non-canonical A-A or A-G pairs (Fig. 2 and Supplementary Movie). The ISL is highly similar to previous NMR structures (Supplementary Fig. 3a,b) and extends to include the invariant “AGC triad” (U6 residues 59-61)[29,30]. The telestem and ISL are roughly perpendicular to one another and are separated by an asymmetric internal loop or “bulge” spanning nucleotides A41–C58 (Fig. 2a). This bulge forms an extensive interface (∼2,200 Å2) with RRMs 2, 3 and oRRM4, as well as the region immediately preceding RRM1 in Prp24. This interface induces a highly distorted conformation of RNA that includes several novel ribonucleoprotein motifs that fall outside of the known 46 consensus clusters of RNA backbone suite conformations[31-33] (Fig. 3 and Supplementary Fig 3c). These novel motifs include a “skip-stack turn” (G50–A53) (Fig. 3a). This tight turn is in a region of the RNA that contains 4 consecutive C2′-endo sugar puckers (A49–G52). The skip-stack turn is located adjacent to the 5′ splice site-binding region of U6 (ref. 1) and is reminiscent of the “Z-anchor” motif that stabilizes RNA structure near the 5′ splice site of a group II self-splicing intron[34]. Both the skip-stack turn and Z-anchor have alternating stacked bases, but the former is protein-stabilized while the latter is stabilized by RNA. Another novel motif is the “dinucleotide bulge turn” (Fig. 3b), which bulges U57 and C58 to allow stacking of A56 and A59. The bulged U57 and C58, along with the 3′ side of the skip-stack turn, form a hydrophobic cage around Phe154 in RRM2 (Fig. 3c).
Figure 3

Novel structural motifs in the U6–Prp24 complex. RNA and protein are colored as in Figure 2. (a) A “skip-stack turn” motif stabilized by contacts with residues N-terminal to RRM1 (gray), RRM2 (orange), and oRRM4 (purple). In this motif, U6 nucleotides 50-53 form alternating i to i+2 and i+1 to i+3 stacking interactions, resulting in a tight turn. (b) A “dinucleotide bulge turn” at the base of the ISL spans U6 nucleotides 56-59. The first and last bases stack on one another, while the central two bases point the opposite direction and are bound by Prp24. The remainder of the ISL and telestem are omitted for clarity. (c) A hydrophobic cage of nucleotides surrounds Phe154 in RRM2 and mediates tertiary contacts between RRM2 and oRRM4. (d) The occluding alpha-helices (cartoon under semi-transparent surface) at the N- and C-termini of oRRM4 bridge the ISL and telestem.

As observed in structures of the free protein[14,15,26], the beta-sheet of RRM1 is buried by extensive contacts with RRM2 and the beta-sheet of oRRM4 is occluded by a pair of flanking alpha-helices (Fig. 3d). Therefore, only RRM2 and RRM3 of Prp24 bind single-stranded RNA in the canonical RRM fashion[22]. RRM1 does not contact RNA within the crystallographic asymmetric unit (Fig. 2c), but does contact RNA belonging to a neighboring complex in the crystal (see below). RRM2 interacts with nucleotides 46–58, a region of U6 RNA that contains the highly conserved “ACAGA-box”, which binds the intron 5′ splice site in assembled spliceosomes[1] (Fig. 4, Supplementary Movie). The RRM3 binding site spans U6 nucleotides 39–44, consistent with chemical modification studies predicting binding of Prp24 to U6 residues 40–43 (ref. 16). The occluding alpha-helices of oRRM4 contact both the ISL and telestem of U6, forming non-canonical contacts with double-stranded RNA and fixing the angle between the RNA helices (Figs. 2d and 3d).
Figure 4

Prp24 interactions with the U6 asymmetric bulge. (a) Two views of the U6-A62G–asymmetric bulge, rotated 90° relative to one another. U6 nucleotides that comprise the ACAGA-box motif are colored green. The remaining U6 nucleotides are colored salmon and Prp24 is colored as in Figure 2. The RRM2-3 linker and RRM3 are shown in cartoon representation to allow visualization of the RNA, including the “aspartate bridge” motif involving Asp288. (b) Base-pairing of the U6 snRNA ACAGA-box (nucleotides 47-51) with the intron 5′ splice site in the activated spliceosome[1].

The orientation of RRM3 is dramatically different than in the RRM1-3 crystal structure without U6 RNA[14], where an alpha-helix of RRM3 (residues 262-272) contacts the beta-sheet of RRM2 and partially occludes its canonical RNA-binding surface. Instead, in the U6Prp24 structure, RRM3 undergoes an approximately 180° rotation and 20 Ångstrom displacement relative to RRM2, allowing RRM2 contacts with U6 RNA that would not be possible in the protein only structure. This finding is consistent with our earlier NMR studies, which showed that RRM2–RRM3 contacts present in the RNA-free crystal structure are not present in solution, and that RRM3 does not interfere with RNA binding to RRM2[26].

An interlocked RNP topology

The C-terminus of RRM3 is threaded through the asymmetric bulge of U6, placing oRRM4 on the opposite face of U6. Tertiary contacts between RRM2 and oRRM4 generate a “ring” of protein surrounding the U6 asymmetric bulge, which is also a closed ring due to the flanking telestem and ISL (schematic in Fig. 2c). The interlocked topology is further stabilized by formation of a novel “aspartate-bridged base pair” anchored by the last residue of RRM3 (Asp288), which forms hydrogen bonds with both A42 and G55 on opposite sides of the asymmetric bulge (Fig. 4a, see Fig. 6a). The aspartate bridge likely explains a previously reported UV cross-link between U6 nucleotide G55 and an unidentified amino acid in Prp24 (ref. 28).
Figure 6

U6-A62G-suppressor mutations are expected to destabilize the interaction between Prp24 and U6 snRNA. RNA and protein are colored as in Figure 2. (a) Hydrogen-bonds between residues comprising an “aspartate bridge” between U6 nucleotides A42 and G55. Asp288, C43, A42 and G55 are all sites of suppressor mutations, which should disrupt the protein-mediated base pair. (b) Interactions between RRM3 and the telestem. Substitutions at Asn216, Asn253 and Ser283, and U6 nucleotides G39, A40 and C92 should disrupt this H-bond network. (c) Interaction of the Prp24 N-terminal domain with the U6 ACAGA-box. Arg38 interacts with G50 and G52; suppressor mutation R38T should disrupt both these interactions. (d) Intersection of RRM2, oRRM4, and the U6 asymmetric loop. Suppressor substitutions at Phe154, Arg295, Ser350, Asp351, and U6 nucleotide C58 should disrupt H-bonds and/or stacking interactions.

The interlocked topology helps to explain the low nM apparent dissociation constant for U6 and Prp24 (ref. 25), and the stability of the complex to 2 M monovalent salt[6]. To our knowledge, this is the first reported observation of an interlocked RNP topology. However, multi-RRM proteins are very common[22,24,35,36]. Since the U6Prp24 complex is the first determined structure of an RNP containing more than two linked RRM domains (Supplementary Table 1), it is likely that other examples of an interlocked RNP topology will be found. The interlocked topology highlights the important role of cooperativity in RNA recognition by multi-domain proteins. For example, an NMR structure of isolated RRM2 bound to U6 nucleotides 49–54 (ref. 26) displays a binding mode that is shifted by 1 nucleotide relative to the crystal structure presented here. The shift can be explained by flanking RNA-protein interactions that were not present in the NMR study, including extensive contacts with the RRM2-3 linker, the R36 and R38 side chains that are N-terminal to RRM1, a tertiary contact between RRM2 and RRM4, and even the aspartate-bridged base pair (Figs. 2c, 3a,b,c, 4a and Supplementary Movie). Similarly, prior NMR chemical shift mapping studies of the isolated RRM3 domain indicated that this domain can bind to a double stranded U6 ISL construct with flanking single-stranded sequences[15]; however, we found that RRM3 contacts the telestem and adjacent 3′ single stranded region instead. Thus, it is clear that cooperativity plays a major role in determining binding mode, and structures of highly truncated proteins bound to RNA should be interpreted with caution.

Suppressors of U6-A62G map to the protein-RNA interface

We previously isolated 93 independent pseudo-revertant strains that suppress the cold-sensitive growth phenotype of the U6-A62G mutation[27]. Twenty-nine of these suppressor strains contain cis-acting mutations in U6 and have been described previously[27]. We now report that 32 of the remaining 64 suppressor strains harbor one of 30 different amino acid substitutions in Prp24 (Fig. 5a, Supplementary Table 2 and Supplementary Fig. 2). Sixteen of these substitutions were cloned, and all 16 were found to be sufficient for cold-resistant growth in the presence of either the A62G or more cold-sensitive A62U/C85A double mutation (referred to as “U6-UA”)[12,27] (Figs. 1b and 5b).
Figure 5

Suppressors of U6-A62G cold-sensitivity map to the RNA-protein interface. (a) Substitutions in Prp24 selected as suppressors of U6-A62G cold-sensitivity. R158S and F257I were identified previously[12]. (b) Suppression of the cold-sensitivity of U6-A62G and U6-UA by cloned alleles of PRP24. An OD600 = 1 culture and three successive 10-fold serial dilutions are shown for each. All strains grow well at 30 °C. (c) Schematic of non-covalent interactions between amino acids in Prp24 and nucleotides in U6 snRNA (dotted lines; not intended to differentiate between backbone or side-chain interactions). Positions of suppressor substitutions in Prp24 are denoted by green asterisks, and in U6 by green residues. (d) Positions of suppressor substitutions in the U6-A62G–Prp24 complex. All substituted amino acids are shown in surface representation. All substituted nucleotides are shown in green. The U6-A62G mutation is shown in yellow.

Mapping of the U6-A62G-suppressor mutations in both U6 (ref. 27) and Prp24 reveals a striking correspondence between the suppressor sites and protein-RNA contacts, indicating that suppression is due to destabilization of the U6Prp24 complex (Fig. 5c,d). For example, all four components of the aspartate bridge that anchors the RRM3–oRRM4 junction to the U6 asymmetric bulge (A42, C43, G55 and Asp288, Fig. 6a) are sites of suppressor substitutions. The observation that a conservative Asp288Glu substitution is sufficient for suppression (Fig. 5a) suggests that precise positioning of the side-chain carboxylate group relative to the protein backbone is required for the integrity of the aspartate bridge, consistent with the intricate network of hydrogen bonds that connect these residues. At the telestem–asymmetric bulge junction there are also numerous suppressor mutations in both RNA and protein residues (Fig. 5c,d). RRM3 residues Asn216, Asn253, and Ser283 form numerous hydrogen bonds with the last two base pairs of the telestem (Fig. 6b), and alterations to either the amino acid side chains or nucleotide bases suppresses cold-sensitivity. RRM1 does not contact U6 RNA or contain suppressor mutations. However, Arg38, which is immediately adjacent to RRM1, contacts the U6 ACAGA-box (Fig. 6c) and is the site of a suppressor mutation. The absence of corresponding cis-acting suppressor mutations in this region of U6 (ref. 27) likely arises from the severe growth defects caused by mutation of the ACAGA-box[37]. Amino acid residues in the RRM2–oRRM4 interface that stabilize a “dinucleotide bulge turn” involving U6 residues U54 through A59 (Fig. 3b), including Phe154, Arg295, Ser350, and Asp351, are also altered in suppressor strains (Figs. 5a and 6d). All of the cis-acting suppressor substitution sites in the ISL, as well as U38 in the telestem, do not contact Prp24; these substitutions likely act by destabilizing their resident helix. Together, these findings validate the use of high-density suppressor selections for predicting an RNA-protein interface. Additionally, co-localization of suppressor sites with the protein-RNA interface suggests that global de-stabilization of the U6 snRNP can compensate for local hyperstabilization of the U6 ISL. This notion is in accord with the hypothesis of Guthrie and colleagues that mutations in PRP24 suppress a related cold-sensitive mutation in U4 snRNA (G14C, in U4/U6 Stem II) by weakening the interaction of Prp24 with U6 RNA, so that U6 can more readily pair with U4 (refs. 6,16). Indeed, the three Prp24 substitutions previously selected as suppressors of U4-G14C also suppress U6-A62G and U6-UA (Supplementary Fig. 4).

An electropositive groove in Prp24 binds double-helical RNA

Although RRM1 does not contact U6 in the U6Prp24 complex, it is important for Prp24 function as alanine substitution of a strictly conserved phenylalanine in the RRM1 core (F87A; Supplementary Fig. 2) is lethal at 37 °C and slow-growing at 30 °C[25]. We therefore inspected crystal-packing contacts for clues to the functional role of RRM1. Interestingly, a roughly 20 Å-wide electropositive groove bordered by RRMs 1, 2 and oRRM4 contacts the U6 ISL from a neighboring RNP complex in the crystal lattice (Fig. 7). We previously showed by NMR that the same electropositive surface in a truncated Prp24 protein containing only RRMs 1 and 2 binds to a 12-nucleotide single-stranded RNA corresponding to U6 residues 49–60, and proposed that RRM1 might chaperone annealing of unwound U6 with U4 snRNA[26]. The presence of this surface of RRM1 on one face of a double-stranded RNA-binding groove further supports the notion that the electropositive groove is a site for RNA annealing (see Discussion).
Figure 7

An electropositive groove in Prp24 binds double-stranded RNA. (a) RRMs 1, 2 and oRRM4 delineate a surface groove of approximately 20 Å in width. RNA and protein are colored as in Figure 2. (b) In the crystal, the electropositive groove of one U6 snRNP binds the ISL from an adjacent U6 snRNP (shown in gray). The electrostatic surface of Prp24 is contoured from +8 kT/e (blue) to -8 kT/e (red).

Discussion

Function of Prp24 as a U6 snRNA chaperone

U6 snRNA is at the catalytic center of the spliceosome[3]. Thus, its assembly into and disassembly from the spliceosome must be precisely executed to maintain the fidelity of pre-mRNA splicing. U6 snRNA is released from U2 snRNA at the completion of each splicing cycle, so that it can regenerate the U6 snRNP and allow subsequent assembly of the U4/U6 di-snRNP. The observed interactions of U6 nucleotides 39–44 with RRM3 of Prp24 and nucleotides 54–58 with RRM2 and oRRM4 are expected to be mutually exclusive with base pairing in the U2/U6 di-snRNA[38]. We therefore propose that Prp24 has two functions in chaperoning U6 through the splicing cycle: promoting dissociation of the U2/U6 RNA complex, and promoting association of the U4/U6 RNA complex (Fig. 1a). However, it is not yet clear if Prp24 binds to and actively unwinds the U2/U6 complex, or simply sequesters U6 RNA that has been displaced from U2 snRNA by other means, for example, Brr2 helicase activity[39,40]. The former mechanism is particularly attractive as U2/U6 base pairing does not preclude binding of Prp24 RRM2 to the U6 intron-binding site, raising the possibility that a Prp24–U2/U6 ternary complex acts to prevent reassociation of product RNA and reverse splicing[41]. Such a complex could be stabilized by binding of the U6 5′ stem into the electropositive groove delineated by RRMs 1, 2 and oRRM4, as it possesses electrostatic and spatial features compatible with binding of double-stranded RNA (Fig. 7). The RRM1–2–4 electropositive groove is also a prime candidate for stabilization of U4/U6 duplex RNA. After dissociation from U2, interaction of the U6 ACAGA-box with RRM2 holds the U6 component of U4/U6 Stem I in close proximity to this groove, suggesting U4/U6 pairing is nucleated by stabilizing Stem I within this electropositive scaffold. However, Stem I nucleotides of U6 RNA (54-62) are not accessible for pairing with U4 in the structure presented here (Fig. 8a). We propose that the previously determined RRM1–RRM2 co-structure with a fragment of U6 RNA[26], in which nucleotides 54–60 are bound along one face of the electropositive groove (superposition used to construct Figure 8b), represents an on-pathway intermediate during U4/U6 annealing. Thus, as the ISL is unwound by thermal motion, reannealing of the ISL could be inhibited by favorable interactions between the separated strands and the electropositive surface of Prp24. These interactions could include specific binding of the CCCU sequence in the 5′ strand of the upper ISL, which was previously identified as a top SELEX target of Prp24[10]. In addition to unwinding of the ISL, transition from the ground-state U6 snRNP structure to the proposed intermediate would require breaking Prp24 contacts with the Stem I region of U6 RNA. The presence of U6-A62G-suppressor mutations in Prp24 residues that contact these U6 residues (nucleotides 54–58; Figs. 5c,d and 6d) raises the possibility that at least some suppressors operate by promoting the transition between the ground-state U6 snRNP and the intermediate proposed in Figure 8b.
Figure 8

Proposed mechanism for Prp24-mediated annealing of U4 and U6 snRNAs. (a) Ground state structure of the U6–Prp24 complex. The electrostatic surface of Prp24 is contoured from +8 kT/e (blue) to -8 kT/e (red). The portion of U6 that forms U4/U6 Stem I is highlighted in gray. (b) An early U4/U6 RNA annealing intermediate. We propose transient unwinding of the U6 ISL allows the groove to bind single-stranded U6 snRNA in an orientation compatible with recognition by U4 snRNA nucleotides 56–65 (in green, see Fig. 1). The conformation of U6 nucleotides 49–60 is taken from the solution structure of RRMs 1 and 2 (PDB: 2GO9) in complex with the same region of RNA[26]. The remainder of non-cartoon RNA is taken from the crystal structure presented here. (c) A later U4/U6 RNA annealing intermediate. Stem I of the U4/U6 di-snRNA has formed as is shown in the Prp24 electropositive groove.

Mutational perturbation of equilibria in the splicing cycle

The stable interaction of Prp24 with U6 snRNA is at odds with its function as a U4/U6 annealing chaperone, since it must relinquish U6 to U4 snRNA, just as U4 must subsequently relinquish U6 to U2 snRNA. The splicing cycle is made up of numerous such equilibria, for which a productive direction is favored in vivo, but can be reversed in vitro[41]. This property of the splicing cycle likely explains its temperature-dependent sensitivity to minor perturbations in RNA-RNA interactions (exemplified by the U6-A62G and U6-UA mutations) and RNA-protein interactions (exemplified by the U6-A62G-suppressor mutations in Prp24). The fact that 32 independently selected suppressors in Prp24 resulted in 30 unique substitutions illustrates both the multitude of contacts that stabilize a protein-RNA interface, and the utility of this genetic approach in identifying such contacts. We are actively investigating the identity of the 32 trans-acting suppressors that do not map to the PRP24 gene. None of these suppressor strains contains mutations in the U2 or U4 genes, so the mutations most likely lie in another protein or proteins that influence U6 snRNA equilibria. By identifying appropriate conditional mutations with which to select suppressors, this approach can be extended to other equlibria in the splicing cycle[42,43]. In addition to genome-wide spontaneous suppressor selections, as used here, gene-targeted selections are useful to probe specific interactions in detail. We previously conducted a targeted selection for mutations in the Prp8 protein that suppress a cold-sensitive mutation in U4 RNA that blocks spliceosome activation[44]. Most of these suppressors map to one face of a recent crystal structure of a large portion of Prp8 (ref. 45). It will be of interest to see if these suppressor mutations also lie on an RNA-protein interface.

Architecture of the U6 snRNP

We have determined the first crystal structure of U6 snRNA bound to the Prp24 protein, containing seventy nucleotides of U6 RNA and all four RRM domains of Prp24. The observed structure is consistent with previous chemical modification data[28]. For example, nucleotides 40-42, 49, 50, 52, and 55 are protected from chemical modification despite being in a single-stranded region of RNA, which is explained by the fact that these nucleotides are buried by interactions with Prp24 (Supplementary Fig. 5). Conversely, the Watson-Crick faces of nucleotides 45, 47, 51, 53 and 54 of the asymmetric bulge are all solvent exposed in the complex, and reactive to chemicals[28]. The close agreement with solution studies, along with the fact that the genetic suppressors localize to the RNA-protein interface, provides compelling evidence that the crystal structure accurately reflects the in vivo core U6 snRNP. This complex likely acts as a structural foundation for the remaining RNA and protein elements in the U6 snRNP. The interlocked topology of the U6Prp24 complex raises questions regarding the assembly pathway of the U6 snRNP, as the single-stranded asymmetric bulge of U6 is of insufficient width to allow threading of oRRM4 or RRMs 1–3. We propose that transient unwinding of the relatively unstable telestem allows wrapping of Prp24 around an unclosed loop of single-stranded RNA, after which the telestem reforms to generate the interlocked ribonucleoprotein structure (Supplementary Fig. 6). Notably, such a process would require transient disruption of the RRM2–oRRM4 interaction as well, which is presumed to be considerably less stable in the absence of the U6 snRNA scaffold that mediates contacts between RRM2 and oRRM4 in the assembled snRNP. Another possible mechanism is direct recognition of the U2/U6 complex by Prp24 after release of the lariat intron, allowing U6 nucleotides 49–53 to bind to Prp24 RRM2. The telestem would then form upon dissociation of U2 RNA from U6, thereby generating the interlocked topology. Extant biochemical data and the recently reported Lsm structure[9,13,28,46] can be used to generate a model of the complete U6 snRNP (Supplementary Fig. 7). In this model, the Lsm ring binds the base of the U6 telestem, consistent with observed crosslinks between U6 nucleotide G30 and Lsm2 (ref. 9) and proximity of the ring to the 3′ tail of U6 (ref. 46). This placement of the Lsm ring is also compatible with observed interaction of the extreme C-terminus of Prp24 with Lsm5, 7 and 8 (ref. 13), which could explain additional crosslinks between U6 nucleotides U28 and U29 with unidentified residues in Prp24 (ref. 28). Correct placement of the U6 5′ stem (missing from the model in Supplementary Fig. 7) is not apparent when analyzing the U6Prp24 structure and extant biochemical data, although chemical protection studies strongly suggest the 5′ stem contacts Prp24 (ref. 28). We propose the 5′ stem could bind into the RRM1–2–4 electropositive groove, although this placement would likely require partial unwinding of the telestem. This proposal in turn raises the interesting possibility that binding of the U6 snRNP to the U4 snRNP could cause a conformational change that results in displacement of the U6 5′ stem from the electropositive groove, thereby allowing formation of U4/U6 Stem I (Fig. 8). The location of the U6 5′ stem will be resolved by the structure of the complete U6 snRNP, which will likely be elucidated in the near future since all individual components can now be made at yield and homogeneity suitable for crystallographic analysis.

Implications for ribonucleoprotein structure and function

An interesting feature of Prp24 revealed by our study is its ability to bind single- and double-stranded RNA. RRMs are generally considered single-stranded RNA binding domains, yet both RRM3 and oRRM4 bind the single-stranded asymmetric bulge as well as the adjacent helical regions. oRRM4 in particular contacts both the ISL and telestem, via its non-canonical N- and C-terminal alpha-helices. Thus, multi-RRM proteins are not restricted to direct read-out of adjacent 3-4 nucleotide base sequences, but rather can evolve to stabilize highly complex RNA secondary and tertiary structures. These mixed binding mechanisms could help direct the structural rearrangements necessary for U4/U6 di-snRNP assembly, which must involve unwinding of the U6 ISL[15] and may involve changes in U4 snRNA secondary structure as well. We predict that mixed single- and double-stranded RNA binding modes will be a common feature of proteins that act as RNA chaperones during spliceosome assembly, activation, and disassembly[47,48]. The U6Prp24 complex reveals a striking degree of structural co-evolution of protein and RNA. For example, tertiary contacts between Prp24 RRM2 and oRRM4 are stabilized by the asymmetric bulge of U6 (Fig. 3c), the conformation of which is stabilized by extensive contacts with RRM2, RRM3 and oRRM4 (Figs. 3a,b and 4a). Furthermore, the RNA components of several novel structural motifs presented here, including the “skip-stack turn”, “dinucleotide bulge turn” and “aspartate bridge” (Fig. 3a,b and Fig. 6a) are all built upon a protein scaffold. The intricate and cooperative nature of these contacts implies a lengthy process of evolutionary co-adaptation. There is also a striking lack of tightly bound divalent cation along the protein-RNA interface, suggesting Prp24 has supplanted the stabilizing role of magnesium typically observed in ribonucleoproteins thought to have originated in a protein-free, RNA-only world, such as the ribosome and Ribonuclease P. Potentially, this characteristic could give Prp24 more control over the stability of helices in U6 snRNA, aiding its function as a chaperone for the structural transitions of U6. Together, these findings advance our understanding of how multi-RRM proteins specifically recognize long RNAs and promote assembly of large ribonucleoproteins. The combined structural and genetic approach used here can serve as a model for future investigation of multi-RRM proteins.

Online Methods

Protein Production

E.coli STAR pLysS cells (Invitrogen) harboring a modified pET3a plasmid (Novagen) were used to synthesize residues 34-400 of Prp24 with a non-cleavable hexahistidine tag appended to the C-terminus of the protein. The cells were grown at 37 °C with shaking to mid-logarithmic phase and protein expression induced through addition of isopropyl β-D-thiogalactopyranoside (IPTG) to 1 mM. The induced cells were then grown for an additional 20 hours at 25 °C and harvested by centrifugation at 4,000 x g for 10 minutes. Five gram cell pellets were resuspended in 30 mL immobilized metal affinity chromatography (IMAC) buffer (500 mM NaCl, 50 mM HEPES acid, 50 mM imidazole base, 10 % (v/v) glycerol, 1 mM tris(2-carboxyethyl)phosphine (TCEP) HCl). Ten mg lysozyme and 0.25 mg DNase I (Sigma) were then added to 30 mL of the resuspended cells and the mixture stored at −80 °C. The cell suspension was thawed at room temperature and sonicated on ice prior to clarification of the lysate by centrifugation at 20,000 x g for 30 minutes. The soluble fraction was loaded onto a Ni2+-charged nitrilotriacetic acid (NTA) agarose resin (Qiagen) that had been pre-equilibrated with IMAC buffer. The column was washed with 50 mL fresh IMAC buffer and Prp24 desorbed using IMAC buffer supplemented with 500 mM imidazole pH 7.0. The purified protein was dialyzed overnight at 4 °C against 1 L of cation-exchange chromatography buffer (100 mM NaCl, 10 mM HEPES acid, 10 mM sodium HEPES, 10 % glycerol, 1 mM TCEP HCl, pH 7.0) supplemented with 1 mM ethylenediaminetetraacetic acid (EDTA), and further purified using cation-exchange chromatography with salt gradient elution on an AKTA FPLC system equipped with a HiTrapSP column (GE Healthcare). The protein eluted at approximately 300 mM salt and was concentrated to approximately 10 mg/mL using a spin concentrator with a 10 kDa cutoff. The protein was stored at −80 °C. UV absorption was used to estimate the final protein concentration, using an extinction coefficient of 20,400 M−1cm−1 at 280 nm[49]. The measured A280/A260 ratio was used to further estimate an extinction coefficient of 12,120 M−1cm−1 at 260 nm, which was subsequently used to estimate the concentration of the U6Prp24 complex (see below).

RNA Production

In vitro transcription was used to synthesize nucleotides 30-101 of an A62G/U100C/U101C mutant form of the U6 snRNA, using recombinant his-tagged T7 RNA polymerase and synthetic DNA oligonucleotide templates (Integrated DNA Technologies). After transcription, the target RNA was resolved from abortive transcripts and “n+1” additions via denaturing 8 % polyacrylamide gel electrophoresis in 8 M Urea. The RNA was extracted from the gel by passive diffusion into a solution containing 100 mM sodium acetate, pH 5.6. Ethanol precipitation was used to concentrate the RNA prior to further purification by anion exchange chromatography (High Q column, Bio-Rad) in 20 mM Tris pH 7.6, 300 mM sodium chloride, with gradient elution up to 1.5 M sodium chloride. Finally, the RNA was dialyzed using a 10 kDa membrane into a storage buffer containing 100 mM sodium chloride, 10 mM monobasic potassium phosphate, 10 mM dibasic potassium phosphate, 1 mM disodium EDTA and 1 mM sodium azide. 6 % native PAGE analysis showed a small fraction of the purified RNA (ca. 5 %) was in the form an intermolecular dimer. The final yield from a 20 mL transcription reaction was approximately 5 mg of pure RNA. The RNA concentration was estimated using UV absorption and an extinction coefficient of 891,266 M−1cm−1 at 260 nm[50].

Reconstitution of the U6-Prp24 complex

Equimolar amounts of purified protein and RNA were mixed together (1 mL RNA at 10 μM in RNA storage buffer added to 0.05 mL protein at 200 μM in cation-exchange elution buffer) and loaded directly onto a MonoQ column that had been pre-equilibrated with MonoQ buffer at 4 °C (100 mM potassium chloride, 10 mM Tris base, 10 mM Tris HCl, 2 mM MgCl2, 1 mM TCEP HCl and 5 % glycerol, pH ∼ 8). Under these conditions, free protein flows through the column while ribonucleoprotein and free RNA bind to the column. A linear gradient to 2 M potassium chloride in MonoQ buffer was applied to the column to differentially desorb ribonucleoprotein and RNA. Non-denaturing polyacrylamide gel electrophoresis was used to assess homogeneity of the ribonucleoprotein, and sequential staining/destaining with toluidine blue and coomassie G-250 confirmed the presence of both protein and RNA in a single gel band. The eluted complex was concentrated using a spin concentrator with a 10 kDa cutoff to 5 mg/mL in MonoQ elution buffer, with an approximate concentration of potassium chloride in the sample of 400 mM being determined from measured conductivity. The concentration of the complex was estimated by UV absorption, using the sum of protein and RNA extinction coefficients at 260 nm (903,386 M−1cm−1).

Crystallization and Data Collection

Crystallization conditions for the purified U6Prp24 complex were identified using a Mosquito high-throughput crystallization robot (TTP LabTech). Crystals were obtained by sitting drop vapor diffusion at 4 °C from an initial drop containing 0.2 uL concentrated U6Prp24 complex and 0.2 uL crystallization reagent (100 mM lithium sulfate, 100 mM sodium citrate, pH 5.5 and 20 % PEG 1,000). In order to achieve cryogenic preservation, the crystal-containing drop was diluted against excess crystallization reagent supplemented with 20 % (v/v) glycerol. Quickly after dilution of the drop, a single crystal was harvested and flash cooled in liquid nitrogen. Diffraction data were collected at 100 K on beamline 21-ID-F at the Advanced Photon Source. 360 images were recorded on a MarMosaic 225 detector (Rayonix) at 1 degree oscillations with X-rays tuned to a wavelength of 0.97872 Å. The diffraction data were indexed and scaled using HKL2000[51]. The space group was determined using POINTLESS[52] and Xtriage[53] was used to assay potential twinning of the data.

Structure Determination, Refinement and Analysis

Initial phases were obtained by molecular replacement in Phaser[54], using residues 41-196 and 206-291 from PDB 2GHP[14] as independent search models, which yielded an initial map of suitable quality for iterative, manual model building in Coot[55] with interspersed cycles of automated refinement and phase improvement in Phenix[53]. The final model contains approximately seven-hundred bound water molecules, many of which are visible along the single stranded RNA-protein interface and a single bound sulfate ion from the crystallization mixture. Typical magnesium bond lengths and geometries[56,57] were used to discriminate magnesium from water in the structure. There are residual densities in the final mFo-DFc maps associated with the backbone of U6 ISL (nucleotides 63-86) and the base of the telestem (nucleotides 30-36 and 97-100) that could not be readily modeled through alternate conformers of RNA or bound solutes. In these regions, a single conformer of RNA was fit into unbiased electron density derived from simulated-annealing omit maps and TLS[58] restraints were imposed to model apparent structural dynamics. The unusually high overall atomic displacement parameters in the final model are due in large part to regions of the RNA modeled under TLS restrains, which do not form extensive contacts with the Prp24 protein. The final model has an overall MolProbilty[33] score of 1.7 and over 98 % of all protein residues in allowed regions of a Ramachandran plot[59]. The interface area between single-stranded RNA (U6 nucleotides 41-56) and Prp24 was determined using the online PDBePISA[60] tool available at www.ebi.ac.uk/msd-srv/prot_int/pistart.html. All figures were generated in PyMOL[61]. The electrostatic surface was calculated using APBS[62] as implemented in PyMOL. A sample of the final electron density map is shown in Supplementary Figure 1.

Genetic Suppressor Selection and Analysis

Ninety-five independent spontaneous cold-resistant revertants of a yeast strain (DAB016) bearing a deletion of the chromosomal U6 RNA gene and containing the U6-A62G allele on a plasmid were previously selected[27]. Of these, 31 have a de novo mutation in the U6-A62G allele that is responsible for cold-resistance[27]. Genomic DNA was prepared from the remaining 64 strains and the PRP24 locus was amplified by PCR with primers that generate a 1782 bp DNA fragment spanning from 265 bp upstream of the start codon to 183 bp downstream of the stop codon. The PCR product from each strain was subjected to Sanger sequencing that spanned the entire protein-coding region on at least one strand to identify mutations (Supplementary Table 2). (The wild-type PRP24 coding region in DAB016 contains 16 single-nucleotide polymorphisms relative to the S288C sequence, 10 of which result in amino acid substitutions.) Sixteen of the mutant alleles (Supplementary Table 2) were cloned by digestion of the PCR product with BsrGI and SnaBI, followed by ligation into pRS313[63]. Mutant prp24 alleles in pRS313 were co-transformed with pRS314 containing wild-type SNR6, snr6-A62G, or snr6-A62U/C85A into the SNR6/PRP24 double-disruption strain LL200 (MATa, snr6ΔBsm∷LEU2, prp24ΔClaSnaB∷ADE2, ade2-1, can1-100, his3-11, leu2-3,112, lys2-Δ2, met2-Δ1, trp1-1, ura3-52;[pUN50-PRP24] and [YCp50-SNR6][12]). Transformants were selected on –trp –his medium and plated to synthetic complete medium containing 5-fluoroorotate to select for loss of the URA3-marked plasmids containing wild-type alleles of SNR6 and PRP24. Colonies were picked to 5 ml YEPD, grown overnight at 30°C and adjusted to an A600 = 1.0. The undiluted culture and successive 10-fold serial dilutions were spotted to YEPD plates using a 48-pin inoculation manifold and the plates were incubated at 16, 23, 30 and 37 °C.
  64 in total

1.  Electrostatics of nanosystems: application to microtubules and the ribosome.

Authors:  N A Baker; D Sept; S Joseph; M J Holst; J A McCammon
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-21       Impact factor: 11.205

2.  A catalytically active group II intron domain 5 can function in the U12-dependent spliceosome.

Authors:  Girish C Shukla; Richard A Padgett
Journal:  Mol Cell       Date:  2002-05       Impact factor: 17.970

3.  Both catalytic steps of nuclear pre-mRNA splicing are reversible.

Authors:  Chi-Kang Tseng; Soo-Chen Cheng
Journal:  Science       Date:  2008-06-27       Impact factor: 47.728

Review 4.  RRM-RNA recognition: NMR or crystallography…and new findings.

Authors:  Gerrit M Daubner; Antoine Cléry; Frédéric H-T Allain
Journal:  Curr Opin Struct Biol       Date:  2012-12-14       Impact factor: 6.809

5.  Suppressors of a cold-sensitive mutation in yeast U4 RNA define five domains in the splicing factor Prp8 that influence spliceosome activation.

Authors:  A N Kuhn; D A Brow
Journal:  Genetics       Date:  2000-08       Impact factor: 4.562

Review 6.  Scaling and assessment of data quality.

Authors:  Philip Evans
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2005-12-14

7.  The Pfam protein families database.

Authors:  Marco Punta; Penny C Coggill; Ruth Y Eberhardt; Jaina Mistry; John Tate; Chris Boursnell; Ningze Pang; Kristoffer Forslund; Goran Ceric; Jody Clements; Andreas Heger; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman; Robert D Finn
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

8.  MolProbity: all-atom structure validation for macromolecular crystallography.

Authors:  Vincent B Chen; W Bryan Arendall; Jeffrey J Headd; Daniel A Keedy; Robert M Immormino; Gary J Kapral; Laura W Murray; Jane S Richardson; David C Richardson
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2009-12-21

9.  Novel conformation of an RNA structural switch.

Authors:  Scott D Kennedy; Ryszard Kierzek; Douglas H Turner
Journal:  Biochemistry       Date:  2012-11-12       Impact factor: 3.162

10.  Phaser crystallographic software.

Authors:  Airlie J McCoy; Ralf W Grosse-Kunstleve; Paul D Adams; Martyn D Winn; Laurent C Storoni; Randy J Read
Journal:  J Appl Crystallogr       Date:  2007-07-13       Impact factor: 3.304

View more
  36 in total

1.  Substrate-assisted mechanism of RNP disruption by the spliceosomal Brr2 RNA helicase.

Authors:  Matthias Theuser; Claudia Höbartner; Markus C Wahl; Karine F Santos
Journal:  Proc Natl Acad Sci U S A       Date:  2016-06-27       Impact factor: 11.205

2.  An Allosteric Network for Spliceosome Activation Revealed by High-Throughput Suppressor Analysis in Saccharomyces cerevisiae.

Authors:  David A Brow
Journal:  Genetics       Date:  2019-03-21       Impact factor: 4.562

Review 3.  Methodologies for studying the spliceosome's RNA dynamics with single-molecule FRET.

Authors:  Clarisse van der Feltz; Aaron A Hoskins
Journal:  Methods       Date:  2017-05-18       Impact factor: 3.608

4.  Structure and conformational plasticity of the U6 small nuclear ribonucleoprotein core.

Authors:  Eric J Montemayor; Allison L Didychuk; Honghong Liao; Panzhou Hu; David A Brow; Samuel E Butcher
Journal:  Acta Crystallogr D Struct Biol       Date:  2017-01-01       Impact factor: 7.652

Review 5.  Molecular Mechanisms of pre-mRNA Splicing through Structural Biology of the Spliceosome.

Authors:  Chuangye Yan; Ruixue Wan; Yigong Shi
Journal:  Cold Spring Harb Perspect Biol       Date:  2019-01-02       Impact factor: 10.005

Review 6.  Understanding pre-mRNA splicing through crystallography.

Authors:  Sara Espinosa; Lingdi Zhang; Xueni Li; Rui Zhao
Journal:  Methods       Date:  2017-05-12       Impact factor: 3.608

7.  It takes two to tangle: Prp24 and spliceosome assembly.

Authors:  Daniel A Pomeranz Krummel; Andrew M MacMillan
Journal:  Nat Struct Mol Biol       Date:  2014-06       Impact factor: 15.369

Review 8.  Mechanisms and Regulation of Alternative Pre-mRNA Splicing.

Authors:  Yeon Lee; Donald C Rio
Journal:  Annu Rev Biochem       Date:  2015-03-12       Impact factor: 23.643

Review 9.  Lights, camera, action! Capturing the spliceosome and pre-mRNA splicing with single-molecule fluorescence microscopy.

Authors:  Alexander C DeHaven; Ian S Norden; Aaron A Hoskins
Journal:  Wiley Interdiscip Rev RNA       Date:  2016-05-20       Impact factor: 9.957

Review 10.  Specificity and nonspecificity in RNA-protein interactions.

Authors:  Eckhard Jankowsky; Michael E Harris
Journal:  Nat Rev Mol Cell Biol       Date:  2015-08-19       Impact factor: 94.444

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.