Piotr Wysoczanski1, Markus Zweckstetter1,2,3. 1. a Department for NMR-based Structural Biology ; Max Planck Institute for Biophysical Chemistry ; Am Fassberg 11, Göttingen , Germany. 2. b German Center for Neurodegenerative Diseases (DZNE) ; Göttingen , Germany. 3. c Center for Nanoscale Microscopy and Molecular Physiology of the Brain; University Medical Center ; Göttingen , Germany.
Abstract
One of the great challenges to structural biologists lies in explaining the complexities of the spliceosome - a ribosome-sized molecular machine that is assembled in a step-wise manner and is responsible for pre-mRNA splicing. The spliceosome is both fascinating and difficult to work with, because of its dynamic nature. At each discrete step of splicing tens of proteins come and go orchestrating the functional transition through massive structural rearrangements. The retention and splicing complex (RES) is an important splicing factor interacting with pre-mRNA at the onset of the first transesterification reaction. RES is a specific splicing factor for a number of genes and is important for controlling pre-mRNA retention in the nucleus. RES is a 71 kDa heterotrimer composed of the 3 proteins Pml1p, Bud13p and Snu17p. We solved the 3-dimensional structure of the core of the RES complex as well as the 2 dimers, Snu17p-Bud13p and Snu17p-Pml1p. Further biophysical analysis revealed an astounding cooperativity that governs the assembly of this trimeric complex as well as its interaction with pre-mRNA. The more than 100-fold cooperativity originates from the progressive rigidification of Snu17p upon coupled binding-and-folding of protein regions, which are disordered in the unbound state. Our work highlights the role of cooperativity in the spliceosome and poses new questions about the structure and assembly of the spliceosome.
One of the great challenges to structural biologists lies in explaining the complexities of the spliceosome - a ribosome-sized molecular machine that is assembled in a step-wise manner and is responsible for pre-mRNA splicing. The spliceosome is both fascinating and difficult to work with, because of its dynamic nature. At each discrete step of splicing tens of proteins come and go orchestrating the functional transition through massive structural rearrangements. The retention and splicing complex (RES) is an important splicing factor interacting with pre-mRNA at the onset of the first transesterification reaction. RES is a specific splicing factor for a number of genes and is important for controlling pre-mRNA retention in the nucleus. RES is a 71 kDa heterotrimer composed of the 3 proteins Pml1p, Bud13p and Snu17p. We solved the 3-dimensional structure of the core of the RES complex as well as the 2 dimers, Snu17p-Bud13p and Snu17p-Pml1p. Further biophysical analysis revealed an astounding cooperativity that governs the assembly of this trimeric complex as well as its interaction with pre-mRNA. The more than 100-fold cooperativity originates from the progressive rigidification of Snu17p upon coupled binding-and-folding of protein regions, which are disordered in the unbound state. Our work highlights the role of cooperativity in the spliceosome and poses new questions about the structure and assembly of the spliceosome.
Efforts to characterize the structure of a complete spliceosome at any stage of its catalytic cycle mostly failed so far, because of its huge size, high flexibility and time-variability, and an extremely high content of intrinsically disordered proteins (IDPs). Only very recently the first cryo-electron microscopy structure of a full spliceosome became available. Thus, divide and conquer strategies are often the method of choice for structural biologists seeking atomic resolution coupled with MS-crosslinking and single particle electron microscopy for the subsequent assembly of spliceosomal substructures. Though X-ray crystallography was successful for a subset of those substructures, it largely failed at higher levels of organization. Over the years, NMR spectroscopy demonstrated to be a suitable alternative to some of those otherwise intractable problems with the additional advantage of access to protein dynamics with single-residue resolution, which plays an important role in the spliceosome.Functionally, the spliceosome controls the flow and diversification of genetic information by transforming interrupted exons into mature mRNA molecules. Though a subset of spliceosomal components is pre-assembled such as small-nuclear ribonucleic proteins (snRNPs), by and large the spliceosome achieves its structural and catalytic competence through a rapid cascade of multiple interactions each time it encounters a single substrate molecule. The high complexity of the yeast spliceosome system, which comprises approximately 90 proteins (nearly 200 in human), necessitates cooperativity as a major player accelerating and controlling the assembly and disassembly of the spliceosome. The major steps of the spliceosome cycle include the protein/RNA complexes denoted as E, A (pre-spliceosome), B (pre-catalytic), Bact (activated) and B* (catalytically activated), followed by C (after step 1 transesterification) and the post-spliceosomal complex.We studied the structure, dynamics and assembly of the 71 kDa trimeric complex RES. RES interacts with the 3′ part of the intron and is close to the catalytic center at the onset of the first transesterification reaction. The recently revealed structural details. are consistent with earlier functional assays, which identified RES as a crucial factor for spliceosomal assembly and cell viability at elevated temperatures. Most importantly, RES offers a glimpse into the cooperative assembly of subunits of the spliceosome.
The 3D structures of RES and its dimeric intermediates
RES is composed of 3 proteins. Pml1p and Bud13p fold through their intrinsically disordered parts onto Snu17p (Fig. 1). In our efforts to investigate each step of the assembly, we first attempted to obtain the 3D structure of the isolated Snu17p. This proved to be impossible due to its instability. After establishing the minimal Bud13p and Pml1p fragments required to mimic their full-length parent proteins, we observed that addition of any of the 2 binding partners (Pml1p or Bud13p) stabilized the protein sufficiently. When both proteins were present, stability was increased even further.
Figure 1.
Cooperative RES assembly through folding and rigidification of disordered protein regions. (Upper panel) Regions, which are disordered in free Snu17p (blue), Pml1p (pink) and Bud13p (yellow), are schematically represented as 5-membered ensembles. The RRM domain of Snu17p and the FHA domain of Pml1p are shown using a surface representation. Scales and shapes of proteins are preserved but for clarity the Bud13p representation corresponds only to the Snu17p binding part (both panels). (Lower panel) In the trimeric RES complex, the dynamic/disordered regions of Snu17p, Pml1p and Bud13p form a rigid interface.
Cooperative RES assembly through folding and rigidification of disordered protein regions. (Upper panel) Regions, which are disordered in free Snu17p (blue), Pml1p (pink) and Bud13p (yellow), are schematically represented as 5-membered ensembles. The RRM domain of Snu17p and the FHA domain of Pml1p are shown using a surface representation. Scales and shapes of proteins are preserved but for clarity the Bud13p representation corresponds only to the Snu17p binding part (both panels). (Lower panel) In the trimeric RES complex, the dynamic/disordered regions of Snu17p, Pml1p and Bud13p form a rigid interface.By solving the 3D structure of the 2 dimers and the core of RES itself, we revealed a new type of interaction interface between all 3 partners (Fig. 1 and 3B). The interface is formed by a long C-terminal extension of the RRM domain of Snu17p, which folds into an α-helix along Pml1p, as well as an extended hairpin conformation of Bud13p. In particular, the Bud13p recognition mode was unusual as it did not fit the predicted so called UHM (U2AF homology motif)-ULM (UHM ligand motif) type of interaction. This finding was also independently confirmed in another structure of the Bud13p-Snu17p dimer. UHM-ULM interactions are frequently observed between a subset of RRM domains (denoted UHMs) and central tryptophan containing motifs – ULMs – to which Bud13p was believed to belong. The central tryptophan of the ULM is essential for the stability of such interactions. Frequently ULMs shuffle between different UHMs, and UHMs are capable of recognizing various ULMs. This suggested that Bud13p might recognize a genuine UHM domain given its striking sequence similarity to other ULMs.
The Dynamic Nature and Cooperativity in RES
The observation that the stability of Snu17p increases dramatically with the addition of its binding partners has its origin in the cooperativity and rigidification of this system. All 3 structures - 2 dimers with either Pml1p or Bud13p as well as the RES trimer - show subtle adjustments such as the position of Bud13p and Pml1p. In addition, pronounced differences in the mobility of the C-terminus of Pml1p and the L63–F73 loop of Snu17p were observed. Undoubtedly, the largest scale rearrangement is the folding of the C-terminal extension of Snu17p into a rigid α-helix. A binding-induced rigidification of Snu17p was further supported by hydrogen/deuterium exchange studies, which revealed differences in hydrogen/deuterium exchange rates between the complexes by 6 orders of magnitude. The rigidification of Snu17p from the monomer through the dimeric state to the trimer follows the strong cooperativity in this system where secondary to tertiary interactions show at least 100-fold differences in affinity. For example, the binding of Pml1p to free Snu17p has a Kd of 1328 nM, which decreases to 7.8 nM when Bud13p is present. Notably, no direct interaction between the Pml1p and Bud13p fragments was detected, suggesting that the cooperative effect is mediated by the central binding platform – Snu17p. In addition, Snu17p is most capable to bind to mRNA, when the protein is in complex with Pml1p and Bud13p.
Fig. 2 illustrates the pre-mRNA induced chemical shift perturbation, which was observed in Snu17p either in the free state or bound to Pml1p and/or Bud13p. Clearly, when the C-terminal α-helix of Snu17p is unfolded (in either the free state or bound to Bud13p) a smaller number of amide resonances was affected.
Figure 2.
Interaction of Snu17p with pre-mRNA. Mapping of the chemical shift perturbation (CSP) experienced by Snu17p upon addition of CUUCAUUCUUUUUG RNA to the Pml1p–Snu17p dimer (A), the Bud13p–Snu17p dimer (B) and the RES trimer (C). Gray, CSP < 1σ; light-blue, 1σ < CSP < 2σ; blue, 2σ < CSP < 3σ; dark blue, 3σ < CSP.
Interaction of Snu17p with pre-mRNA. Mapping of the chemical shift perturbation (CSP) experienced by Snu17p upon addition of CUUCAUUCUUUUUG RNA to the Pml1p–Snu17p dimer (A), the Bud13p–Snu17p dimer (B) and the RES trimer (C). Gray, CSP < 1σ; light-blue, 1σ < CSP < 2σ; blue, 2σ < CSP < 3σ; dark blue, 3σ < CSP.The three 3D structures in combination with the hydrogen/deuterium exchange data of Snu17p allowed us to formulate a model in which the binding-mediated rigidification of the Snu17p structure is responsible for the progressive increase in affinity, consistent with the hypothesis that most dynamic proteins exhibit cooperative behavior. Notably, entropic contributions to cooperativity were unfavorable and it was primarily enthalpy, which contributed to the increase in binding affinity. Cooperative behavior in pre-mRNA binding was also shown to be important in other snRNP complexes, as well as in transcription factors and operon binding. In addition, cooperativity was shown to be important for the function of IDPs. The ability of many IDPs to form promiscuous interaction hubs controlled by cooperativity is in line with the enrichment of the spliceosomal proteome by IDPs – 47% of all yeast spliceosomal proteins are predicted to be disordered whereas the average for the entire yeast proteome is only 13%. The advantage of IDPs in terms of reach and promiscuity can be complemented by different types of cooperative behavior. For instance, cooperativity induced by the presence of multiple identical or similar motifs (a form of pseudo-homotropic allostery), similar to the one described in, could be a general way by which rapid recruitment of proteins is achieved. In addition, heterotropic cooperativity, which involves different ligands and binding sites, has been reported.
Remaining questions
Pml1p and the phosphorylated motifs
Approximately 25% of all serine/threonine/tyrosine residues in the spliceosome, most of which are located in disordered parts, are believed to be phosphorylated. For example, phosphorylation of arginine/serine-rich (RS) proteins influences their structure and interaction with other spliceosomal proteins and RNA. In contrast, little is known about the importance of phosphorylation for the structure and function of the RES complex. The most obvious phosphorylation-specific site of regulation in RES might be the forkhead-associated domain (FHA domain) of Pml1p, because FHA domains are phosphopeptide recognition domains found in many regulatory proteins. Unfortunately, our efforts to identify phosphopeptides, which bind to the FHA domain of Pml1p, have failed so far. In a high throughput assay using a large selection of spliceosomal peptides, we identified a possible peptide sequence and could see specific sequence enrichment, consistent with a distinct recognition motif (Fig. 3A). However, binding of the same peptide sequences to the FHA domain of Pml1p in solution could not be detected by NMR spectroscopy (data not shown). It therefore remains an open question which phosphorylated protein is responsible for the interaction with the FHA domain of Pml1p and why it has not yet been picked up in pull-down assays. The latter suggests a transient interaction rather than a stable complex. Also, given the short length of the linker between the FHA domain of Pml1p and its Snu17p-interaction site (Fig. 3B), binding of a phosphopeptide at the predicted binding site in the FHA domain of Pml1p might modulate the interaction of Pml1p with Snu17p.
Figure 3.
Toward the function of RES. (A) High-throughput phosphopeptide assay aiming at identification of potential binding partners of the FHA domain of Pml1p (left). 14 top scoring peptides identified by the assay (right). Below a frequency-based consensus motif is shown listing the 4 residues downstream of the phosphorylated threonine. Residues were selected based on the 40 top scoring sequences with at least 2-fold higher frequency as compared to all (2553) sequences in the assay. (B) Model of the complex of full-length Pml1p (pink) with Snu17p (cyan) and Bud13p (yellow). The orientation of the structures is based on the 2MKC and 3ELV structures as deposited in the PDB. Pml1p residues in between the FHA domain of Pm1lp and the N-terminal end of the Pm1p peptide in complex with Snu17p were added using Pymol. (the “reverse” orientation was selected, because it results in the largest distance between the FHA domain of Pml1p and the RRM domain of Snu17p). The likely position of a phosphorylated threonine in a yet to be determined phosphopeptide is indicated with an arrow. (C) Schematic representation of full-length RES in the context of the Bact spliceosome (gray). The bound pre-mRNA is represented in green. The exact position of potential RES binding partners such as Spp2, Hsh155 or Prp2 is currently not known (indicated with question marks). (D) Diagram illustrating the current knowledge about the function of RES. Proteins, for which pre-mRNA splicing is controlled by RES, are indicated. The function of Pml1p as an inhibitor of pre-mRNA export is also represented.
Toward the function of RES. (A) High-throughput phosphopeptide assay aiming at identification of potential binding partners of the FHA domain of Pml1p (left). 14 top scoring peptides identified by the assay (right). Below a frequency-based consensus motif is shown listing the 4 residues downstream of the phosphorylated threonine. Residues were selected based on the 40 top scoring sequences with at least 2-fold higher frequency as compared to all (2553) sequences in the assay. (B) Model of the complex of full-length Pml1p (pink) with Snu17p (cyan) and Bud13p (yellow). The orientation of the structures is based on the 2MKC and 3ELV structures as deposited in the PDB. Pml1p residues in between the FHA domain of Pm1lp and the N-terminal end of the Pm1p peptide in complex with Snu17p were added using Pymol. (the “reverse” orientation was selected, because it results in the largest distance between the FHA domain of Pml1p and the RRM domain of Snu17p). The likely position of a phosphorylated threonine in a yet to be determined phosphopeptide is indicated with an arrow. (C) Schematic representation of full-length RES in the context of the Bact spliceosome (gray). The bound pre-mRNA is represented in green. The exact position of potential RES binding partners such as Spp2, Hsh155 or Prp2 is currently not known (indicated with question marks). (D) Diagram illustrating the current knowledge about the function of RES. Proteins, for which pre-mRNA splicing is controlled by RES, are indicated. The function of Pml1p as an inhibitor of pre-mRNA export is also represented.
The structure of full-length RES in solution and bound to the spliceosome
Our 3D structure of the core of the RES complex and a previously determined structure of the isolated FHA domain of Pml1p. puts tight constraints on the structure of the full-length RES complex. This allows preparation of a model of the 3D structure of full-length RES (Fig. 3B). At the same time, the model presented in Fig. 3B only represents a crude approximation and an experimentally derived structure of full-length RES is needed. Even more interesting would be 3D structures of RES in complex with possible partners such as mRNA, Hsh155 and Spp2, with the final goal of an atomic resolution structural description of the Bact complex.(Fig. 3C)
Further cooperativity
Equally important and related to the above static picture is to expand the quantitative RES interaction studies with a focus on cooperative contributions (Fig. 3C). Given the ever-increasing recognition of the role of cooperativity, which is often linked to coupled folding and binding, in the transcription factors–DNA interactome, similar investigation into spliceosomal interaction networks is surprisingly scarce. An interesting starting point could be a study of the stoichiometry of the interaction of Bud13p with the Pml1p-Snu17p dimer. Tripsianes et al. observed a 1:2 stoichiometry with a weaker second binding side being evident. We could not observe a similar behavior but our constructs were sufficiently different. Nonetheless, those results suggest that RES assembly might be even more complex opening a possibility for another molecule of Bud13p influencing the cooperative nature of the interaction.Considering the moderate binding affinity of RES to RNA. and given the still poorly characterized interaction with additional RES binding partners such as Spp2 and Hsh155, it is conceivable that the interaction with mRNA within the spliceosome is orders of magnitude stronger (Fig. 3C).
The function of RES
So far 4 pre-mRNAs, which correspond to the proteins actin, Tan1, Mer1 and Mata1, were demonstrated to exhibit RES-controlled splicing (Fig. 3D). Although it was suggested that a weak 5’ splice site might be a defining feature, the recognition motif of introns for RES is still unknown. Additional complication may arise from the fact that RES – beyond its specific activity – also seems to be a general (unspecific) splicing factor. To which degree those 2 roles overlap remains to be established.The exact role of Pml1p is even less clear. Though its involvement in mRNA retention is established, how and with which additional factors is currently unknown (Fig. 3D). Both Tripsianes et al. and we noticed the structural similarity between the complexes of REF2/ICP27 and Bud13p/Snu17p, as well as ORF57/REF2 and Bud13p/Snu17p. Considering that all of these proteins are involved in the same mRNA export pathway, a possible cross-talk with REF2 (Yra1 for Saccharomyces Cerevisiae) might occur. How and at which stage RES or constituents of RES influence mRNA, would require further attention.
Conclusion
Understanding the catalytic cycle of the spliceosome is one of the biggest challenges of structural biology in the 21st century. What appears to be the major hurdle for X-ray techniques – dynamics, flexibility and abundance of IDPs – is in fact what makes the properties of the spliceosome even more fascinating and highlights the functional importance of cooperativity. Our structural and dynamic studies of the RES complex, which is an essential part of this puzzle, is a step toward further understanding the function of the spliceosome at one of its crucial transition points.
Authors: Cameron D Mackereth; Tobias Madl; Sophie Bonnal; Bernd Simon; Katia Zanier; Alexander Gasch; Vladimir Rybin; Juan Valcárcel; Michael Sattler Journal: Nature Date: 2011-07-13 Impact factor: 49.962
Authors: Alaji Bah; Robert M Vernon; Zeba Siddiqui; Mickaël Krzeminski; Ranjith Muhandiram; Charlie Zhao; Nahum Sonenberg; Lewis E Kay; Julie D Forman-Kay Journal: Nature Date: 2014-12-22 Impact factor: 49.962
Authors: Piotr Wysoczański; Cornelius Schneider; ShengQi Xiang; Francesca Munari; Simon Trowitzsch; Markus C Wahl; Reinhard Lührmann; Stefan Becker; Markus Zweckstetter Journal: Nat Struct Mol Biol Date: 2014-09-14 Impact factor: 15.369
Authors: Zbigniew Warkocki; Cornelius Schneider; Sina Mozaffari-Jovin; Jana Schmitzová; Claudia Höbartner; Patrizia Fabrizio; Reinhard Lührmann Journal: Genes Dev Date: 2015-01-01 Impact factor: 11.361
Authors: Maria de Lourdes Coelho Ribeiro; Julio Espinosa; Sameen Islam; Osvaldo Martinez; Jayesh Jamnadas Thanki; Stephanie Mazariegos; Tam Nguyen; Maya Larina; Bin Xue; Vladimir N Uversky Journal: PeerJ Date: 2013-02-12 Impact factor: 2.984
Authors: Juan Pablo Fernandez; Miguel Angel Moreno-Mateos; Andre Gohr; Liyun Miao; Shun Hang Chan; Manuel Irimia; Antonio J Giraldez Journal: PLoS Genet Date: 2018-07-03 Impact factor: 5.917