| Literature DB >> 30737185 |
Chris H Hill1, Vytautė Boreikaitė1, Ananthanarayanan Kumar1, Ana Casañal1, Peter Kubík1, Gianluca Degliesposti1, Sarah Maslen1, Angelica Mariani1, Ottilie von Loeffelholz2, Mathias Girbig1, Mark Skehel1, Lori A Passmore3.
Abstract
Cleavage and polyadenylation factor (CPF/CPSF) is a multi-protein complex essential for formation of eukaryotic mRNA 3' ends. CPF cleaves pre-mRNAs at a specific site and adds a poly(A) tail. The cleavage reaction defines the 3' end of the mature mRNA, and thus the activity of the endonuclease is highly regulated. Here, we show that reconstitution of specific pre-mRNA cleavage with recombinant yeast proteins requires incorporation of the Ysh1 endonuclease into an eight-subunit "CPFcore" complex. Cleavage also requires the accessory cleavage factors IA and IB, which bind substrate pre-mRNAs and CPF, likely facilitating assembly of an active complex. Using X-ray crystallography, electron microscopy, and mass spectrometry, we determine the structure of Ysh1 bound to Mpe1 and the arrangement of subunits within CPFcore. Together, our data suggest that the active mRNA 3' end processing machinery is a dynamic assembly that is licensed to cleave only when all protein factors come together at the polyadenylation site.Entities:
Keywords: X-ray crystallography; baculovirus; cleavage; cryo-EM; hydrogen-deuterium exchange; mRNA; mass spectrometry; nuclease; polyadenylation; pre-mRNA
Mesh:
Substances:
Year: 2019 PMID: 30737185 PMCID: PMC6436931 DOI: 10.1016/j.molcel.2018.12.023
Source DB: PubMed Journal: Mol Cell ISSN: 1097-2765 Impact factor: 17.970
Figure 1The Mpe1 UBL Domain Binds to the Ysh1 Catalytic Domain Next to the Active Site Tunnel
(A) Domain diagram of Ysh1 and Mpe1 proteins, with truncations indicated by black lines. ZnK, zinc knuckle.
(B) SDS-PAGE analysis of pull-down experiments following baculovirus-driven co-expression of pairs of Ysh1 and Mpe1 constructs shown in (A), with full-length Cft2. The tagged Mpe1 constructs (asterisks) were captured by Strep-Tactin resin, and co-purification of Ysh1 and Cft2 was analyzed.
(C) SDS-PAGE analysis of complexes identified in (B) after anion exchange chromatography. The Ysh1-Mpe1 proteins remain associated, but Cft2 dissociates.
(D) X-ray crystal structure of the Ysh1 N-terminal catalytic domain (yellow) bound to the Mpe1 N-terminal UBL domain (orange). N and C termini of both models are indicated, zinc-coordinating residues are shown in sticks, and zinc ions are spheres. A slice through the complex (right) reveals a narrow tunnel leading to a large solvent-filled cavity adjacent to the active site. Inset: electrostatic surface potential at pH 7.4. A large basic patch comprising residues from both proteins lies adjacent to the active site tunnel.
(E) Details of metal ion coordination in the Ysh1 active site.
(F) Details of the interface between Ysh1 and Mpe1. Hydrogen bonds and electrostatic interactions are indicated by green dashed lines. Two orthogonal views are shown.
See also Figure S1.
Crystallographic Data Collection, Processing, and Refinement
| Space group | P21 |
| a, b, c (Å) | 43.38, 124.27, 63.45 |
| α, β, γ (°) | 90.0, 103.21, 90.0 |
| Number of reflections | 99,955 (4,970) |
| Resolution range (Å) | 62.13–2.28 (2.32–2.28) |
| Completeness (%) | 99.08 (99.38) |
| Redundancy | 3.39 (3.45) |
| 〈I/σ(I)〉 | 12.3 (1.3) |
| CC1/2 | 0.999 (0.578) |
| Rmerge | 0.065 (1.13) |
| Resolution range (Å) | 62.13–2.28 |
| Number of reflections in working set | 27,979 (2510) |
| Number of reflections in free set | 1,487 (144) |
| Rwork/Rfree | 0.1726/0.2219 |
| Number of atoms | 4,565 |
| Average B-factors (Å2) | 72.2 |
| Ramachandran | |
| Favored (%) | 95.91 |
| Outliers (%) | 0.37 |
| RMSDs | |
| Bonds (Å) | 0.003 |
| Angles ( | 0.56 |
Values for the outer shell are given in parentheses.
Mean I/σ(I) is >2.0 at resolutions >2.5 Å. The CC1/2 values (above) were used to decide resolution cutoff (Karplus and Diederichs, 2012).
Figure 2Mass Spectrometry and Cryo-EM Define the Interactions among Full-Length Ysh1, Mpe1, and Yjr141w
(A) Interactions among Ysh1, Mpe1, and Yjr141w mapped by crosslinking mass spectrometry of the Ysh1-Mpe1-Yjr141w trimer and Ysh1-Mpe1 and Ysh1-Yjr141w heterodimers. Lines are color-coded as indicated.
(B) Hydrogen-deuterium exchange mass spectrometry difference plot (Ysh1-Mpe1-Yjr141w versus Ysh1-Yjr141w) showing peptides of Ysh1 that are protected (negative) and exposed (positive) by Mpe1.
(C) Hydrogen-deuterium exchange mass-spectrometry difference plot (Ysh1-Mpe1-Yjr141w versus Ysh1-Mpe1) showing peptides of Ysh1 that are protected (negative) and exposed (positive) by Yjr141w.
In (B) and (C), triplicate data from four independent color-coded time-points are shown. The significance threshold is indicated by a dotted line. Gray shading indicates the SD of all charge states and replicates per peptide.
(D) Cryo-EM analysis of the Ysh1-Mpe1-Yjr141w heterotrimer. A representative micrograph at original magnification × 105,000 and −0.5 μm defocus.
(E) Selected 2D class averages of aligned particles.
(F) The crystal structure from Figure 1D was docked into the EM map filtered to 6 Å resolution. No density was observed for the Ysh1 CTD or the Yjr141w or Mpe1 CTDs.
See also Figure S2 and Table S1.
EM Data Collection and Processing
| Ysh1-Mpe1-Yjr141w | CPFcore | CPFpol + Cft2 | ||
|---|---|---|---|---|
| Cryo-EM | Cryo-EM | Negative-Stain EM | Negative-Stain EM | |
| Data collection | ||||
| Microscope | Titan Krios | FEI Tecnai Polara | FEI Tecnai Spirit | FEI Tecnai Spirit |
| Detector | K2 | Falcon III | Ultrascan 1000 | Ultrascan 1000 |
| Magnification | 105,000 × | 59,000 × | 26,000 × | 26,000 × |
| Pixel size (Å) | 1.09 | 1.78 | 3.98 | 3.98 |
| Voltage (keV) | 300 | 300 | 120 | 120 |
| Electron dose (e-/Å2) | ∼45 | ∼60 | ∼40–60 | ∼40–60 |
| Defocus range (μm) | −0.5 to −0.7 | −2.5 to −4.5 | −0.6 | −0.6 |
| Phase shift range (°) | 20–140 | N/A | N/A | N/A |
| Number of particles | 43,308 | 120,773 | 23,969 | 38,142 |
| Processing | ||||
| Resolution | 4.8 | N/A | 20 | N/A |
| Efficiency (Eod) | 0.29 | N/A | 0.79 | N/A |
N/A, not available.
Volta phase plate used during data collection.
3D reconstruction not performed.
Naydenova and Russo (2017).
Figure 3Ysh1 Is Primed for Activation by Assembly into an Eight-Subunit CPFcore Complex
(A) Schematic diagrams showing the expression and purification workflow, composition of recombinant complexes, and details of the in vitro activity assay. Proteins are represented by circles, with a yellow star to highlight an enzymatic subunit. S, StrepII tag; H, His6 tag; CPFpol, polymerase module.
(B) SDS-PAGE analysis of recombinant protein complexes after affinity, anion exchange, and size exclusion chromatography. Asterisks indicate contaminant proteins.
(C) The CYC1 model pre-mRNA is specifically cleaved by CPFcore with CF IA and CF IB, and the 5ʹ-cleavage product is polyadenylated in the presence of ATP, as shown by denaturing gel electrophoresis of RNA. The negative control reaction (−) contained CF IA and CF IB, but not CPFcore.
(D) Denaturing RNA gel electrophoresis of cleavage assay time courses performed using the protein complexes shown in (B). The negative control lanes (−) show no RNA cleavage when incubated with CF IA and CF IB (left) or buffer (right) for 90 min.
See also Figure S3.
Figure 4CPFcore Binds and Cleaves a 36-nt Minimal RNA Substrate
(A) Sequences of RNA substrates derived from the CYC1 3ʹ UTR. Each substrate carries both 5ʹ-FAM and 3ʹ-A647 labels (red and blue stars, respectively). The canonical cleavage site is highlighted in bold, and the minimal sequence required for efficient cleavage is represented by the gray box.
(B) Denaturing gel electrophoresis of the short RNA substrates after incubation with CPFcore, CF IA, and CF IB. The negative control reaction (−) contained CF IA and CF IB, but not CPFcore.
(C) Electrophoretic mobility shift assays (EMSAs) performed with CYC1d (cleaved by CPFcore) and CYC1f (not cleaved by CPFcore) RNAs.
See also Figures S4–S6 and Table S2.
Summary of EMSA Experiments (Figure S5) Testing All Stable Components and Subcomplexes for RNA Binding Activity
| Protein or complex | RNA | |||||
|---|---|---|---|---|---|---|
| A15 | U15 | C15 | G15 | |||
| CF IB | + | ++ | − | +++ | ++++ | ++++ |
| CF IA | − | +++++ | − | +++++ | +++++ | +++++ |
| Pcf11-Clp1 | − | − | − | − | +++ | ++ |
| Rna14-Rna15 | − | +++ | − | +++ | +++ | ++ |
| Cft2 | − | − | − | +++ | +++ | ++ |
| Pap1 | − | − | − | − | − | − |
| Ysh1-Mpe1-Yjr141w | − | − | − | +++ | + | + |
| Ysh1-Mpe1 | − | − | − | +++ | + | + |
| Ysh1-Yjr141w | − | − | − | − | − | − |
| CPFpol (no Pap1) | ++ | − | − | ++++ | +++ | + |
| CPFpol | ++ | − | − | ++++ | +++ | + |
| CPFpol + Cft2 | ++ | − | − | +++++ | ++++ | ++ |
| CPFcore | ++ | − | − | +++++ | ++++ | ++ |
CYC1d, CYC1f, and 15-mers of A, U, C, and G were used.
Figure 5The Enzymatic Subunits of CPFcore Assemble around a Central Scaffold
(A) Size-exclusion chromatography of CPFcore and SDS-PAGE analysis of fractions across the peak. Asterisks indicate contaminant proteins.
(B) Representative negative-stain micrograph of CPFcore.
(C) Negative-stain 2D class averages show a distinctive 21-nm particle with the polymerase module at one end.
(D) Representative cryo-EM micrograph of CPFcore.
(E) Selected 2D class averages from cryo-EM analysis of CPFcore. Approximately 80% of the particles are present in classes that comprise the 13-nm scaffold of the polymerase module only. Up to three additional subunits are visible in ∼0.5% of the particles.
(F) A model for the structure of CPFcore obtained from a 3D reconstruction of the negative-stain data. Three orthogonal views filtered to 25 Å are shown as insets. The cryo-EM structure of Cft1-Pfs2-Yth1 (Casañal et al., 2017) and X-ray crystal structures of Cft2 (Mandel et al., 2006), Pap1-Fip1 (Meinke et al., 2008), and Ysh1-Mpe1 (this work; Figure 1) are docked into the negative-stain map. Known disordered or flexible regions are indicated with colored lines. The weak interaction between Ysh1 and Cft2 CTDs is indicated with dashed lines.
Also see Figure S7.
Figure 6Model for 3ʹ End Formation on the Minimal CYC1 Pre-mRNA Substrate
(A) CF IA, CF IB, and CPF each preferentially bind certain RNA sequences. CF IA binds U-rich elements via interactions with Rna15 RRM domains, CF IB binds UA-rich sequences, and CPFcore binds the 5ʹ AAGAA element.
(B) When all of the correct sequence elements are present, the 3ʹ end processing machinery can assemble into an active complex, resulting in an opening of the active site cleft of Ysh1. Pre-mRNA cleavage occurs within a 3-nt window. Some of the interactions in the model are speculative.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Thermo Fisher Scientific | 18258012 | |
| Geneva Biotech | ||
| Thermo Fisher Scientific | C602003 | |
| Thermo Fisher Scientific | C404010 | |
| Thermo Fisher Scientific | C101010 | |
| Insect-XPRESS protein-free insect cell medium with L-glutamine | Lonza | 12-730Q |
| Protease Inhibitor Cocktail | Sigma-Aldrich | 11836170001 |
| Desthiobiotin | IBA | 2-1000-001 |
| Imidazole | Sigma-Aldrich | I5513 |
| Formamide | Sigma-Aldrich | 11814320001 |
| TEMED | Sigma-Aldrich | T9281 |
| Ammonium persulfate (APS) | Sigma-Aldrich | A3678 |
| Accugel 19:1 acrylamide:bis-acrylamide 40% w/v mix | National Diagnostics | EC-850 |
| Urea | VWR chemicals | 28877.260 |
| KOD Hot Start DNA Polymerase | Merck | 71086 |
| SYBR Safe DNA Gel Stain | Thermo Fisher Scientific | S33102 |
| SYBR Green II RNA Gel Stain | Thermo Fisher Scientific | S7586 |
| Ni-NTA Agarose | QIAGEN | 30210 |
| StrepTactin Sepharose high performance | GE Healthcare | 28-9356-00 |
| Deuterium oxide 99.9% | Millipore | 1133660009 |
| DiSuccinimidylSuberate (DSS) | Creative Molecules | 001S |
| DiSuccinimidyl Dibutyric Urea (DSBU) | This work | |
| Recombinant protein: | This work | N/A |
| Recombinant protein: | This work | N/A |
| Recombinant protein: | This work | N/A |
| Recombinant protein complex: | This work | N/A |
| Ysh1(1-462)-Mpe1(1-161)-3C-SII | ||
| Recombinant protein complex: | This work | N/A |
| Ysh1-Mpe1-3C-SII | ||
| Recombinant protein complex: | This work | N/A |
| Ysh1-Mpe1-3C-SII -Yjr141w | ||
| Recombinant protein complex: | This work | N/A |
| Ysh1- SII-3C-Yjr141w | ||
| Recombinant protein complex: | This work, based on | N/A |
| Cft1-Pfs2-3C-SII -Yth1-3C-8H -Fip1 | ||
| Recombinant protein complex: | This work, based on | N/A |
| “CPFpol” Cft1-Pfs2-3C-SII -Yth1-3C-8H -Fip1-Pap1 | ||
| Recombinant protein complex: | This work | N/A |
| Recombinant protein complex: | This work | N/A |
| Endogenous protein complex: | This work, based on | N/A |
| Recombinant protein: | This work, based on | N/A |
| Recombinant protein complex: | This work, based on | N/A |
| Recombinant protein complex: | This work, based on | N/A |
| Recombinant protein complex: | This work, based on | N/A |
| N/A | ||
| Mendeley raw data (e.g., uncropped gels, MS peptides) | This work | |
| Ysh1-Mpe1 crystal structure | This work | PDB: |
| Ysh1-Mpe1 cryo-EM map | This work | EMD: 0325 |
| CPFcore negative stain EM map | This work | EMD: 0324 |
| NMR structure of Rna14-Rna15 monkeytail-hinge (used for | PDB: | |
| Crystal structure of Rna14-Rna15 complex (used for | PDB: | |
| NMR structure of C-terminal domain pf CstF-64 (used for | PDB: | |
| NMR structure of Hrp1-Rna15 RRMs (used for | PDB: | |
| Crystal structure of Rna15 RRM with bound GU (used for | PDB: | |
| Crystal structure of Pcf11-Clp1 complex (used for | PDB: | |
| Crystal structure of Pcf11-Clp1 complex (used for | PDB: | |
| Crystal structure of Pcf11-RNA pol II CTD complex (used for | PDB: | |
| Cryo-EM structure of Cft1-Pfs2-Yth1 (used for | PDB: | |
| Crystal structure of Cft2 (used for | PDB: | |
| Crystal structure of Pap1-Fip1 complex (used for | PDB: | |
| Crystal structure of CPSF-73 (used as molecular replacement search model) | PDB: | |
| NMR structure of Rbbp6 (used as molecular replacement search model) | PDB: | |
| Sf9 | Oxford Expression Technologies Ltd. | 600100-SF9 cells |
| kanMX6 MATalpha pra1-1 prb1-1 prc1-1 cps1-3 ura3delta5 leu2-3 his- Parent strain JWY104 | ||
| RNA and DNA sequences, with details of end-labeling | This work | See |
| (modified) pBig1A | This work, based on | P24-63 |
| (modified) pBig1B | This work, based on | P24-64 |
| (modified) pBig1C | This work, based on | P25-1 |
| (modified) pBig1D | This work, based on | P25-2 |
| (modified) pBig1E | This work, based on | P25-6 |
| (modified) pBig2AB | This work, based on | P25-3 |
| Pap1-SII pACEBac1 (expression). Used to make protein: | This work | P25-8 |
| Pap1-SII | ||
| Cft2-SII pACEBac1 (expression) | This work | P25-7 |
| Used to make protein: | ||
| Cft2-SII | ||
| SII-3C-Yjr141w pIDS (assembly intermediate) | This work | P18-62 |
| SII-3C-Yjr141w pIDS/pACEBac1 (baculovirus expression) | This work | P19-1 |
| Used to make protein: | ||
| SII-3C-Yjr141w | ||
| Pta1_Cft2_Ysh1_Mpe1 pIDS (CPFcore assembly intermediate) | This work | P19-10 |
| Cft1_Pfs2-3C-SII_Yth1-3C-8H_Pap1_Fip1 pIDC/pACEBac1 (CPFcore assembly intermediate and baculovirus expression) Used to make two complexes: | P15-18 | |
| Cft1-Pfs2-3C-SII-Yth1-3C-8H-Fip1 | ||
| “CPFpol” (Cft1-Pfs2-3C-SII-Yth1-3C-8H-Fip1-Pap1) | ||
| Cft2_Ysh1_Mpe1_Pta1_Cft1_Pfs2-3C-SII_Yth1-3C-8H_Pap1_Fip1 pIDC/pIDS/pACEBac1 (baculovirus expression). Used to make: | This work | P19-11 |
| “CPFcore” (Cft1-Pfs2-3C-SII -Yth1-3C-8H -Fip1-Pap1-Cft2-Ysh1-Mpe1) | ||
| 8H-Cft2_Ysh1_Mpe1-3C-SII pIDS (assembly intermediate) | This work | P15-13 |
| 8H-Cft2_Ysh1_Mpe1-3C-SII_Yjr141w pIDS (assembly intermediate) | This work | P18-59 |
| SII-3C-Yjr141w_Ysh1 pIDS (assembly intermediate) | This work | P25-9 |
| 8H-Cft2_Ysh1_Mpe1-3C-SII pIDS/pACEBac1 (baculovirus expression) Used to make: | This work | P15-20 |
| Ysh1-Mpe1-3C-SII | ||
| 8H-Cft2_Ysh1_Mpe1-3C-SII_Yjr141w pIDS/pACEBac1 (baculovirus expression) Used to make: | This work | P18-63 |
| Ysh1-Mpe1-3C-SII -Yjr141w | ||
| SII-3C-Yjr141w_Ysh1 pIDS/pACEBac1 (baculovirus expression) Used to make: | This work | |
| Ysh1- SII-3C-Yjr141w | ||
| Cft1_Pfs2-3C-SII_Yth1_Pap1_Fip1 pBig1A (assembly intermediate) | This work | P20-3 |
| Cft2 pBig1B (assembly intermediate) | This work | P20-5 |
| Cft2_Cft1_Pfs2-3C-SII_Yth1_Pap1_Fip1 pBig2AB (baculovirus expression) Used to make: | This work | P20-15 |
| “CPFpol+Cft2” (Cft2-Cft1-Pfs2-3C-SII-Yth1-Fip1-Pap1) | ||
| Cft2_Ysh1_Mpe1-3C-SII pBig1B (baculovirus expression) | This work | P20-8 |
| Cft2_Ysh1_Mpe1-1(1-369)-3C-SII pBig1B (baculovirus expression) | This work | P20-29 |
| Cft2_Ysh1_Mpe1-2(1-270)-3C-SII pBig1B (baculovirus expression) | This work | P20-30 |
| Cft2_Ysh1_Mpe1-3(1-160)-3C-SII pBig1B (baculovirus expression) | This work | P20-31 |
| Cft2_Ysh1_Mpe1-4(81-441)-3C-SII pBig1B (baculovirus expression) | This work | P20-32 |
| Cft2_Ysh1_Mpe1-5(161-441)-3C-SII pBig1B (baculovirus expression) | This work | P20-33 |
| Cft2_Ysh1-N(1-474)_Mpe1-3C-SII pBig1B (baculovirus expression) | This work | P20-34 |
| Cft2_Ysh1-N(1-474)_Mpe1-1(1-369)-3C-SII pBig1B (baculovirus expression) | This work | P20-35 |
| Cft2_Ysh1-N(1-474)_Mpe1-2(1-270)-3C-SII pBig1B (baculovirus expression) | This work | P20-36 |
| Cft2_Ysh1-N(1-474)_Mpe1-3(1-160)-3C-SII pBig1B (baculovirus expression) | This work | P20-37 |
| Cft2_Ysh1-N(1-474)_Mpe1-4(81-441)-3C-SII pBig1B (baculovirus expression) | This work | P20-38 |
| Cft2_Ysh1-N(1-474)_Mpe1-5(161-441)-3C-SII pBig1B (baculovirus expression) | This work | P20-39 |
| Cft2_Ysh1-C(475-779)_Mpe1-3C-SII pBig1B (baculovirus expression) | This work | P20-40 |
| 6H-Hrp1 pOPINB (bacterial expression) Used to make: | P2-43 | |
| “CF IB” (6H-Hrp1) | ||
| 6H-Rna14_Rna15 pETduet (bacterial expression) Used to make: | P11-44 | |
| “CF IA” (6H-Rna14-Rna15- 6H-Pcf11-Clp1) | ||
| 6H-Rna14-Rna15 | ||
| 6H-Pcf11_Clp1 pSRFduet (bacterial expression) Used to make: | P11-45 | |
| “CF IA” (6H-Rna14-Rna15- 6H-Pcf11-Clp1) | ||
| 6H-Pcf11-Clp1 | ||
| DynamX 3.0 | Waters | |
| ProteinLynx Global Server | Waters | |
| Stavrox | ||
| msConvert | ProteoWizard | |
| XIA2 | N/A | |
| XDS | N/A | |
| AIMLESS | N/A | |
| Phaser | N/A | |
| phenix.autobuild | N/A | |
| COOT | N/A | |
| phenix.refine | N/A | |
| MolProbity | N/A | |
| ePISA, European Bioinformatics Institute, EBI | N/A | |
| PDB2PQR | N/A | |
| PROPKA | N/A | |
| APBS | N/A | |
| SerialEM | N/A | |
| EPU | FEI company | N/A |
| MotionCor 2 | N/A | |
| Gctf | N/A | |
| RELION 2 | N/A | |
| EMAN | N/A | |
| PyMOL 1.5.0.5 | Schrödinger LLC | N/A |
| UCSF Chimera | N/A | |
| HADDOCK 2.2 | N/A | |
| InkScape 0.92.3 | N/A | |
| Novex NuPAGE 4-12% Bis-Tris gels | Invitrogen | NP0323BOX |
| Amicon Ultra Centrifugal Filter Units | Millipore | UFC901096 |
NB. “SII” denotes a StrepII tag, “3C” denotes a protease cleavage site and “6H/8H” denotes a His6/His8 tag. These descriptors are positioned before or after a gene/protein name based on whether tag is N- or C-terminal