Olga Jasnovidova1, Magdalena Krejcikova2, Karel Kubicek2, Richard Stefl1. 1. CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic olga.jasnovidova@ceitec.muni.cz richard.stefl@ceitec.muni.cz. 2. CEITEC - Central European Institute of Technology, Masaryk University, Brno, Czech Republic.
Abstract
Phosphorylation patterns of the C-terminal domain (CTD) of largest subunit of RNA polymerase II (called the CTD code) orchestrate the recruitment of RNA processing and transcription factors. Recent studies showed that not only serines and tyrosines but also threonines of the CTD can be phosphorylated with a number of functional consequences, including the interaction with yeast transcription termination factor, Rtt103p. Here, we report the solution structure of the Rtt103p CTD-interacting domain (CID) bound to Thr4 phosphorylated CTD, a poorly understood letter of the CTD code. The structure reveals a direct recognition of the phospho-Thr4 mark by Rtt103p CID and extensive interactions involving residues from three repeats of the CTD heptad. Intriguingly, Rtt103p's CID binds equally well Thr4 and Ser2 phosphorylated CTD A doubly phosphorylated CTD at Ser2 and Thr4 diminishes its binding affinity due to electrostatic repulsion. Our structural data suggest that the recruitment of a CID-containing CTD-binding factor may be coded by more than one letter of the CTD code.
Phosphorylation patterns of the C-terminal domain (CTD) of largest subunit of RNA polymerase II (called the CTD code) orchestrate the recruitment of RNA processing and transcription factors. Recent studies showed that not only serines and tyrosines but also threonines of the CTD can be phosphorylated with a number of functional consequences, including the interaction with yeast transcription termination factor, Rtt103p. Here, we report the solution structure of the Rtt103pCTD-interacting domain (CID) bound to Thr4 phosphorylated CTD, a poorly understood letter of the CTD code. The structure reveals a direct recognition of the phospho-Thr4 mark by Rtt103p CID and extensive interactions involving residues from three repeats of the CTD heptad. Intriguingly, Rtt103p's CID binds equally well Thr4 and Ser2 phosphorylated CTD A doubly phosphorylated CTD at Ser2 and Thr4 diminishes its binding affinity due to electrostatic repulsion. Our structural data suggest that the recruitment of a CID-containing CTD-binding factor may be coded by more than one letter of the CTD code.
RNA polymerase II (RNAPII) utilizes a long and flexible carboxyl‐terminal domain (CTD) of its largest subunit to specifically recruit protein/RNA‐binding factors during transcription 1, 2, 3, 4, 5. The CTD consists of tandem repeats with conserved consensus Tyr1‐Ser2‐Pro3‐Thr4‐Ser5‐Pro6‐Ser7 that is repeated 26 times in yeast and 52 times in humans 6. The CTD sequence is post‐translationally phosphorylated at serines (Ser2, Ser5 and Ser7) and Tyr1 in a dynamic manner, yielding specific patterns that are recognized by appropriate factors in coordination with the transcription cycle events 3, 4, 5, 7.Additionally another highly conserved position, Thr4, was reported to be phosphorylated both in yeast and humans 8, 9, 10, 11, 12. However, the levels of pThr4 in cells remain controversial based on two recent mass‐spectrometry studies 10, 11. Substitution of Thr4 to Ala (T4A) or Val (T4V) is lethal for chicken and human cells 12, 13, 14; however, the same mutants are viable in yeast 9, 15, 16. In humans, genomewide studies revealed increasing levels of pThr4 throughout the gene body with the peak after the poly‐A site 12. In agreement with this, the T4A mutant showed defect in transcription elongation 12. In yeast, the pThr4‐mark is enriched along the whole gene body, similarly to the pTyr1‐mark 17. Both marks go down prior recruitment of transcription termination factors 17. Therefore, it was suggested that the pThr4 mark along with the pTyr1‐mark prevent binding of transcription termination factors during transcription elongation 17. However, recent high resolution ChiP‐nexus data suggested a different role for the pThr4 mark involved in transcription termination and post‐transcriptional splicing 18.It has been unclear for a long time what protein factors are recruited through the pThr4 signal. Interestingly, yeast transcription termination factor, Rtt103p, well known to be associated with the pSer2‐mark 17, 19, 20, was identified as a part of the interactome of RNAPII phosphorylated at Thr4 18. Based on the overlay of NET‐seq and ChIP‐nexus profiles, Rtt103p coincides with the pThr4 mark after poly‐A site. Both, deletion of the entire Rtt103p protein or expression of Rpb1 T4V CTD mutant, cause similar RNAPII pausing defect after poly‐A site. The authors suggested a model, in which both pSer2 and pThr4 marks can contribute to the recruitment of Rtt103p to the poly‐A site 18. This concept is also supported by recent mass‐spectrometry analyses of RNAPII CTD population pulled down by Rtt103p, which revealed simultaneous presence of pThr4 and pSer2 marks 11.To understand the puzzling roles of the pSer2 and pThr4 marks in recruitment of transcription termination factor Rtt103p, we solved NMR structure of the pThr4CTDpeptide in complex with Rtt103pCTD‐interacting domain (CID). Our structure reveals for the first time a direct readout of the pThr4 mark within the CTD. We also reveal significantly larger interaction area of Rtt103p with the CTDpeptide than previously reported 20. Next, we show that two adjacently positioned phosphorylations, pSer2 followed by pThr4, inhibit the binding of Rtt103p CID due to a charge–charge repulsion of the two closely positioned phosphate moieties. Finally, we propose that the CTD code is degenerated, as Rtt103p reads the pThr4 and pSer2 marks equally well using the same molecular mechanism.
Results and Discussion
Rtt103p CID binds equally well Thr4 and Ser2 phosphomarks
To test the binding affinity of Rtt103p CID towards pThr4‐CTD in vitro, we performed an equilibrium‐binding assay using fluorescence anisotropy (FA) (Fig 1B). The experiment revealed that Rtt103p binds pThr4‐CTD with a K
D of 15 ± 1 μM, which is 2.5 times weaker binding than to the CTD with the pSer2 mark (K
D = 6.0 ± 0.2 μM). This finding is in a good agreement with previous co‐immunoprecipitation studies, where Rtt103p was pulled down by RNAPII with the pThr4 mark and successfully competed out by pSer2‐CTD or pThr4‐CTD antisera 18. Doubly phosphorylated pThr4‐CTD at both Thr4 displayed increased binding affinity due to avidity effects (K
D = 6 ± 0.2 μM). Remarkably, if the pSer2 and pThr4 marks are positioned adjacently, binding affinity (K
D = 43 ± 2 μM) is lowered almost to the level of non‐phosphorylated CTD (K
D = 64 ± 2 μM). The pSer5 phosphorylation mark was also previously shown to abolish and lower the binding with Rtt103p or its close human homologue 20, 21. Next, we introduced the pTyr1 mark to the central heptad of the CTDpeptide, which completely abolished the binding with Rtt103p (Fig 1B). This suggests that Y1b is accommodated in the hydrophobic pocket following the previously established binding model for CIDs 17.
Figure 1
How CTD phosphorylations modulate binding to Rtt103p CID
Numbering of residues and order of heptad repeats of the CTD peptide used throughout the study.
Equilibrium binding of Rtt103p CID with fluorescently labelled CTD peptides monitored by fluorescence anisotropy (FA). Rtt103p CID titrated into 10 nM FAM‐labelled CTD peptides. Peptide sequences, corresponding binding isotherms and dissociation constant (K
D, ± standard deviation of the fit) are shown. FAM, 5,6‐carboxyfluorescein. N.D., not determined.
How CTD phosphorylations modulate binding to Rtt103p CID
Numbering of residues and order of heptad repeats of the CTDpeptide used throughout the study.Equilibrium binding of Rtt103p CID with fluorescently labelled CTDpeptides monitored by fluorescence anisotropy (FA). Rtt103p CID titrated into 10 nM FAM‐labelled CTDpeptides. Peptide sequences, corresponding binding isotherms and dissociation constant (K
D, ± standard deviation of the fit) are shown. FAM, 5,6‐carboxyfluorescein. N.D., not determined.
NMR structure of Rtt103p CID bound to CTD with phospho‐threonine mark
To reveal the structural basis of pThr4 recognition, we solved solution structure of a reconstituted complex that harbours Rtt103p CID (3‐131) and a 16‐amino acid peptide, pThr4‐CTD (PS YSP(pT)SPS YSPTSPS; Fig 2A–C; Table 1). We used this peptide with a single phosphorylation to avoid binding in multiple registers that would complicate NMR data analyses. The resulted structure of Rtt103p CID is formed by eight α‐helices in a right‐handed superhelical arrangement (Fig 2A and B), out of which helices α2, α4 and α7 contact the pThr4‐CTDpeptide at residues P6a, S7a, Y1b, P3b, pT4b, S7b and Y1c (Figs 2B and C, and EV1). This minimal CTD‐binding moiety binds Rtt103p CID with a K
D of 18 ± 1 μM (assayed by FA), which is almost identical as pThr4‐CTD used for structural determination. The structure is similar to the one of Rtt103p CID–pSer2‐CTD complex 20 in terms of the overall CID fold and the conformation of the N‐terminal part of the CTDpeptide, but entirely different for the C‐terminal part of the CTDpeptide (Figs 3, EV1, and EV2).
Figure 2
Solution structure of Rtt103p CID in complex with pThr4‐CTD
Overlay of the 20 lowest energy structures of Rtt103p CID (black ribbon) complexed with pThr4‐CTD (red ribbon) shown in stereo. N‐ and C‐termini of the protein and peptide are indicated.
Solution structure of Rtt103p CID (grey helices) bound to the pThr4‐CTD peptide (magenta sticks). Highlighted Rtt103p CID residues (grey sticks, blue labels) form hydrophobic contacts and putative hydrogen bonds (dashed black lines) with pThr4‐CTD peptide.
Schematic diagram of Rtt103p CID (blue) and pThr4‐CTD (black) interactions (hydrophobic contacts, spoked arcs; hydrogen bonds, dashed lines).
Equilibrium binding of Rtt103p CID mutants with pThr4‐CTD peptide monitored by fluorescence anisotropy (FA). Rtt103p CID mutants titrated into 10 nM FAM‐labelled CTD peptides. Corresponding binding isotherms and K
Ds (± standard deviation of the fit) are shown. FAM, 5,6‐carboxyfluorescein. N.D., not determined.
Table 1
NMR and refinement statistics for the Rtt103p CID‐pThr4 CTD complex
Rtt103p CID–pThr4 CTD complex
NMR distance & dihedral constraints
Distance restraints
Total NOEs
3,639
Intra‐residue
843
Inter‐residue
2,796
Short
1,691
Medium
1,104
Long
844
Hydrogen bonds
99
Intermolecular distance restraints
47
Total dihedral angle restraintsa
198
Structure statisticsb
Violations (mean and s.d.)
Number of distance restraint violations > 0.5 Å
0.10 ± 0.31
Number of dihedral angle restraint violations > 15°
α‐helical dihedral angle restraints imposed for the backbone based on the CSI.
Calculated for an ensemble of the 20 lowest energy structures.
Based on PROCHECK analysis 42.
Figure EV1
Comparison of Rtt103p CID structures bound to differently phosphorylated CTD
Comparison of Rtt103p CID (grey helices) bound to the (A) pThr4‐CTD (magenta sticks; PDB ID: 5LVF) or (B) to the pSer2‐CTD peptide (yellow sticks; PDB ID: 2L0I.). Rtt103p CID residues involved in the interaction with CTD are shown in grey sticks and labelled with blue font. Sequences of peptides used in structure determination are indicated below the structures. Peptide residues shown on the image are highlighted in red and black font; the residues highlighted in red have intermolecular contacts used for the structure calculation; residues in grey are not displayed for clarity reasons or are missing coordinates.
Figure 3
Degeneracy of the CTD code
Superposition of pThr4‐CTD (magenta; PDB ID: 5LVF) and pSer2‐CTD (yellow; PDB ID: 2L0I) peptides on the Rtt103p CID surface (grey). N‐ and C‐termini of the peptides are indicated.
Close view on the phospho‐recognition site of Rtt103p CID. Interaction of pThr4‐ (magenta sticks, left) and pSer2‐CTD (yellow sticks, right) peptides with Arg108 (grey). Hydrogen bonds of the phospho‐groups are indicated with black dashed lines.
Figure EV2
Rtt103p CID interacts with pSer2‐CTD and pThr4‐CTD using the same canonical interface
In order to identify the surface of the CID involved in interaction with pThr4‐CTD, we performed a 1H15N‐TROSY NMR titration experiment, where 15N‐labelled Rtt103 CID was titrated with pThr4 peptide. Titration confirmed that Rtt103p CID is interacting with pThr4‐CTD peptide in an almost identical fashion as with pSer2‐CTD, using the canonical surface of helices α2, α4 and α7 (Fig 1B and C). However, important rearrangements were observed for Val109 and Ile112, residues that lay in close proximity of P3b and T4b.
Chemical shift perturbations (CSP) of the Rtt103p CID upon interaction with FAM‐pSer2 CTD (red) or FAM‐pThr4 CTD (grey) peptides plotted against residue number of Rtt103p CID. Secondary structure elements are shown below the x‐axis. Helices involved in the interaction with phospho‐peptides are coloured in black. FAM, 5,6‐carboxyfluorescein.
Overlay of 1H‐15N TROSY spectra of complex of Rtt103p CID with FAM‐pSer2 (red) and FAM‐pThr4 (blue).
Solution structure of Rtt103p CID in complex with pThr4‐CTD
Overlay of the 20 lowest energy structures of Rtt103p CID (black ribbon) complexed with pThr4‐CTD (red ribbon) shown in stereo. N‐ and C‐termini of the protein and peptide are indicated.Solution structure of Rtt103p CID (grey helices) bound to the pThr4‐CTDpeptide (magenta sticks). Highlighted Rtt103p CID residues (grey sticks, blue labels) form hydrophobic contacts and putative hydrogen bonds (dashed black lines) with pThr4‐CTDpeptide.Schematic diagram of Rtt103p CID (blue) and pThr4‐CTD (black) interactions (hydrophobic contacts, spoked arcs; hydrogen bonds, dashed lines).Equilibrium binding of Rtt103p CID mutants with pThr4‐CTDpeptide monitored by fluorescence anisotropy (FA). Rtt103p CID mutants titrated into 10 nM FAM‐labelled CTDpeptides. Corresponding binding isotherms and K
Ds (± standard deviation of the fit) are shown. FAM, 5,6‐carboxyfluorescein. N.D., not determined.NMR and refinement statistics for the Rtt103p CID‐pThr4CTD complexα‐helical dihedral angle restraints imposed for the backbone based on the CSI.Calculated for an ensemble of the 20 lowest energy structures.Based on PROCHECK analysis 42.
Comparison of Rtt103p CID structures bound to differently phosphorylated CTD
Comparison of Rtt103p CID (grey helices) bound to the (A) pThr4‐CTD (magenta sticks; PDB ID: 5LVF) or (B) to the pSer2‐CTDpeptide (yellow sticks; PDB ID: 2L0I.). Rtt103p CID residues involved in the interaction with CTD are shown in grey sticks and labelled with blue font. Sequences of peptides used in structure determination are indicated below the structures. Peptide residues shown on the image are highlighted in red and black font; the residues highlighted in red have intermolecular contacts used for the structure calculation; residues in grey are not displayed for clarity reasons or are missing coordinates.
Degeneracy of the CTD code
Superposition of pThr4‐CTD (magenta; PDB ID: 5LVF) and pSer2‐CTD (yellow; PDB ID: 2L0I) peptides on the Rtt103p CID surface (grey). N‐ and C‐termini of the peptides are indicated.Close view on the phospho‐recognition site of Rtt103p CID. Interaction of pThr4‐ (magenta sticks, left) and pSer2‐CTD (yellow sticks, right) peptides with Arg108 (grey). Hydrogen bonds of the phospho‐groups are indicated with black dashed lines.
Rtt103p CID interacts with pSer2‐CTD and pThr4‐CTD using the same canonical interface
In order to identify the surface of the CID involved in interaction with pThr4‐CTD, we performed a 1H15N‐TROSY NMR titration experiment, where 15N‐labelled Rtt103 CID was titrated with pThr4 peptide. Titration confirmed that Rtt103p CID is interacting with pThr4‐CTDpeptide in an almost identical fashion as with pSer2‐CTD, using the canonical surface of helices α2, α4 and α7 (Fig 1B and C). However, important rearrangements were observed for Val109 and Ile112, residues that lay in close proximity of P3b and T4b.Chemical shift perturbations (CSP) of the Rtt103p CID upon interaction with FAM‐pSer2 CTD (red) or FAM‐pThr4CTD (grey) peptides plotted against residue number of Rtt103p CID. Secondary structure elements are shown below the x‐axis. Helices involved in the interaction with phospho‐peptides are coloured in black. FAM, 5,6‐carboxyfluorescein.Overlay of 1H‐15N TROSY spectra of complex of Rtt103p CID with FAM‐pSer2 (red) and FAM‐pThr4 (blue).
Recognition of the phospho‐threonine CTD by Rtt103p
The upstream part of the pThr4‐CTDpeptide adopts a β‐turn conformation at S2bP3bpT4bS5b and docks into a hydrophobic pocket of the Rtt103p CID that is formed by Ile22, Tyr62, His66, Val109 and Ile112, using Y1b and P3b residues (Fig 2B and C). The peptide conformation in the hydrophobic pocket is further stabilized by a hydrogen bond between hydroxyl of Y1b and the side‐chain amide of Asn65. This hydrophobic pocket of Rtt103p is highly conserved, and mutations of residues Tyr62 and His66 (not affecting the structural integrity; Fig EV3) completely abolish the binding with pThr4‐CTD (Figs 2D and EV4). P3b is inserted into the hydrophobic pocket next to Val109 and has a trans conformation of the S2bP3b peptidyl‐prolyl bond. As a result of this arrangement, both the S2b and pT4b side chains are positioned closely to each other in the solvent exposed area and form intramolecular hydrogen bond between the hydroxyl group of S2b and phospho‐group of T4b. The phospho‐group of pT4b forms a hydrogen bond with the guanidinium group of Arg108. This is a critical interaction with the pThr4 mark, as confirmed by the affinity data for the Arg108Asn mutant (Fig 2D). Akin to Rtt103p, also other CID‐containing proteins such as SCAF4/8 22, RPRD1A/1B/2 21 and CHERP 21 contain the equivalent arginine in the CID pocket. It will be interesting to see whether these human proteins really recognize pThr4‐CTD as well and whether the pThr4 mark is relevant to their functions. Other CID‐containing proteins in yeast, such as Nrd1p and Pcf11p, do not contain the equivalent arginine and these proteins were absent in the pThr4‐CTD interactome 18.
Figure EV3
Structural integrity of Rtt103p CID mutants
Comparison of 1H NMR spectra of the wild type (green), Y62A (blue) and H66A (red) mutants of Rtt103p CID; the region with NH backbone and side‐chain resonances is shown. Data were collected on 850 MHz Bruker AVANCE III spectrometer at 293 K.
Figure EV4
Structural sequence alignment of CIDs
Sequence alignment of CID based on superposition of the CID structures (PDB IDs: 2KM4, 4NAC, 4FLB, 3CLJ, 2BF0, 3D9I) using Align tool of UCSF Chimera 40. In this type of alignment, residue types are not used, only their spatial proximities. Yellow boxes highlight structured elements; red boxes show key residues responsible for the CTD recognition according to numbering of Rtt103p.
Structural integrity of Rtt103p CID mutants
Comparison of 1H NMR spectra of the wild type (green), Y62A (blue) and H66A (red) mutants of Rtt103p CID; the region with NH backbone and side‐chain resonances is shown. Data were collected on 850 MHz Bruker AVANCE III spectrometer at 293 K.
Structural sequence alignment of CIDs
Sequence alignment of CID based on superposition of the CID structures (PDB IDs: 2KM4, 4NAC, 4FLB, 3CLJ, 2BF0, 3D9I) using Align tool of UCSF Chimera 40. In this type of alignment, residue types are not used, only their spatial proximities. Yellow boxes highlight structured elements; red boxes show key residues responsible for the CTD recognition according to numbering of Rtt103p.Remarkably, we observed multiple strong intermolecular NOEs among the aromatics of Y1c in the downstream region of the CTDpeptide and the C‐terminal parts of helices α4 and α7 (Figs 2C, EV1, and EV5). The interaction of Y1c at the tip of helices α4 and α7 creates a second turn in the peptide at residues pT4bS5bP6bS7b, bringing two backbone carbonyl groups in close proximity and allows for their interaction with the guanidinium group of Arg116 (Figs 2B and C, and EV1). The side chain of Y1c forms numerous hydrophobic contacts with Lys72, Gly73, Ile118. Arg116Glu and Lys72Glu charge swapping mutants cause affinity drop of K
D = 107 ± 23 μM and K
D = 54 ± 3 μM, respectively (Fig 2D). The similar arrangement of the downstream region of the CTD was observed in the crystal structure of close human homologue of Rtt103p CID, RPRD1A, where the arginine forms a hydrogen bond with carbonyl of T4b and P6b
21. The Arg116 position is conserved in RPRD1A/1B (Arg114) and RPRD2 (Arg130) (Fig EV4) 21. Interestingly, the coordination of tyrosine from the third heptad repeat Y1c was not observed previously in the structure of Rtt103p CID bound to the CTD with Ser2 phosphorylation 20. The previous study used a CTDpeptide lacking the complete binding moiety (PS YSPTSPS Y) that possibly precluded the accommodation of the downstream part of CTDpeptide including the second tyrosine (Y1c; Fig EV1). The comparison of chemical shift perturbations of Rtt103p upon binding to the singly phosphorylated pSer2‐CTD and pThr4‐CTDpeptides with complete binding moiety suggests similar accommodation of downstream region of both peptides (Fig EV2).
Figure EV5
Intermolecular contacts between the Y1c and Rtt103p CID
Strip plots from 3D F1‐13C/15N‐filtered NOESY‐[13C‐1H]‐HSQC showing intermolecular contacts between Y1c and Arg116 (left) and Ile118 (middle, right).
Intermolecular contacts between the Y1c and Rtt103p CID
Strip plots from 3D F1‐13C/15N‐filtered NOESY‐[13C‐1H]‐HSQC showing intermolecular contacts between Y1c and Arg116 (left) and Ile118 (middle, right).
Cis‐trans equilibrium of the Ser–Pro prolyl‐peptidyl bond
We also tested as to whether two proximal phosphorylation marks (pSer2/pThr4) on the CTDpeptide can alter the cis‐trans equilibrium of the neighbouring prolyl‐peptidyl bond (Fig EV6A). It has been shown that the cis‐trans equilibrium of the CTD is critical for its recognition by cognate proteins 23, 24, 25 and the trans conformation of the Ser–Pro prolyl‐peptidyl bond is required for the β‐turn formation 20, 23, 26, 27. To exclude the possibility that a highly populated cis conformer would attenuate the binding of the CTDpeptide with two phospho‐marks, we assayed the conformational population of mono‐ and diphosphorylated peptides using the [1H,13C]‐HSQC spectra of PS Y(pS)13CP(pT)SPS YS and PS YS13CP(pT)SPS YSpeptides, where all P3b carbons were 13C‐isotopically labelled (Fig EV6B). In case of pThr4‐CTD, we observed 6.6% of the cis conformer. We obtained virtually identical number for the pSer2pThr4‐CTDpeptide, where the cis conformation was populated at 7.8%. Our data suggest that the double phosphorylation at pSer2/pThr4 of the CTD does not influence the ratio of cis‐trans conformers. Next, we titrated the PS YS13CP(pT)SPS YS peptide with Rtt103p CID and monitored the titration by [1H,13C]‐HSQC experiment (Fig EV6C). The spectra show the disappearance of peaks that correspond to the cis conformation during titration, indicating a shift in the cis‐trans equilibrium towards the trans conformation of the S2b–P3b prolyl‐peptidyl bond that is required for the β‐turn formation. The peaks corresponding to the trans conformation of P3b moved upon titration with protein, reflecting the accommodation of the proline in the hydrophobic pocket of Rtt103p CID.
Figure EV6
Double phosphorylation of CTD does not influence serine–proline peptide bond isomerization state population
Scheme of cis‐ and trans‐isomers of X‐proline peptide bond (X stands for any amino acid). Peptide bond is highlighted in blue.
To establish conformational populations of mono‐ and di‐phosphorylated peptides, [13C,1H] HSQC spectra of CTD peptides were measured. Comparison of [13C,1H] HSQC spectra of PSYS13CPpTSPSYS (left) and PSYpS13CPpTSPSYS (right). 1H‐13C correlations of β and γ C‐H pairs are shown. Trans‐chemical shift region of Cγ/β highlighted in yellow 41.
Overlay of [13C,1H] HSQC spectra from the NMR titration of the PSYS13CPpTSPSYS peptide with non‐labelled Rtt103p CID. Protein–peptide molar ratios for each titration step and corresponding colour of the spectrum are indicated on the right.
Double phosphorylation of CTD does not influence serine–proline peptide bond isomerization state population
Scheme of cis‐ and trans‐isomers of X‐prolinepeptide bond (X stands for any amino acid). Peptide bond is highlighted in blue.To establish conformational populations of mono‐ and di‐phosphorylated peptides, [13C,1H] HSQC spectra of CTDpeptides were measured. Comparison of [13C,1H] HSQC spectra of PSYS13CPpTSPSYS (left) and PSYpS13CPpTSPSYS (right). 1H‐13C correlations of β and γ C‐H pairs are shown. Trans‐chemical shift region of Cγ/β highlighted in yellow 41.Overlay of [13C,1H] HSQC spectra from the NMR titration of the PSYS13CPpTSPSYS peptide with non‐labelled Rtt103p CID. Protein–peptide molar ratios for each titration step and corresponding colour of the spectrum are indicated on the right.
CTD code degeneration
The complex of Rtt103p CID–pThr4‐CTD reported here represents the first structure capturing the recognition of the CTD phosphorylated at threonine and explains the structural basis of why Rtt103p can be a part of the pSer2‐ and pThr4‐CTD interactomes. Previous reports suggested that Thr4 phosphorylations could interfere with CTD binding by destabilizing the β‐turn conformation that is required for CTD binding 17, 26. However, our structure shows that the pThr4 mark is directly recognized by Rtt103p and also that the phosphate group of pThr4 forms intramolecular hydrogen bond stabilizing the bound CTD conformation. This conformation involves the β‐turn at S2bP3bpT4bS5b that is a prerequisite for an effective docking into the hydrophobic pocket of Rtt103p CID (Figs 2B and C, and 3). Interestingly, the intramolecular hydrogen bond that stabilizes the β‐turn mirrors the one of the Ser2 phosphorylated CTD bound to Rtt103p (Fig 3B). The Rtt103p Arg108Asn mutant has also a similar drop in affinity for pThr4‐CTD and pSer2‐CTD, K
D = 38 ± 2 μM and K
D = 44 ± 2 μM, respectively. These observations suggest that CTD modifications preventing intramolecular stabilization of the β‐turn should negatively affect CTD binding. Indeed, we observed that doubly phosphorylated CTD at Ser2 and Thr4 binds to Rtt103p as weak as unmodified CTD (Fig 1). Electrostatic repulsion between closely arranged phosphates of pSer2 and pThr4 interfere with the formation of the bound CTD conformation and the peptide with pSer2/pThr4 marks cannot be accommodated in the binding pocket of Rtt103p (Fig 3). In support of this, the coexistence of the pSer2/pThr4 marks in the same repeat has not been detected by recent mass‐spectrometry analysis of RNAPII CTD population pulled down by Rtt103p 11. Our structure also explains lethality of the Thr4Glu CTD mutant in yeast 18. Permanent substitution for glutamate mimics Thr4 phosphorylation that interferes with Ser2 phosphorylation, which consequently prevents binding of the CTD to cognate proteins as described above.The individual letters of the CTD code have so far been associated with unique information translated to stimulation or inhibition of recruitment of CTD readers. Comparison of Rtt103p CID–pThr4‐CTD structure with the Rtt103p CID–pSer2‐CTD complex shows fascinating feature that the same interaction pocket of Rtt103p can read two different phosphorylation patterns of the CTD (pSer2 and pThr4) using the same mechanism and involves the same residues (mainly Arg108; Fig 3). Based on our structural findings, we suggest that the CTD code can be degenerated when read by CID‐containing proteins. In other words, the recruitment of a single CTD‐binding factor may be coded by more than one letter of the CTD code. As a consequence of this redundancy, CID‐containing CTD‐binding factors can be recruited to the poorly conserved heptad repeats of the CTD (e.g. the CTD of fruit fly) or they can tolerate some errors or imperfections in phosphorylation of the CTD 1, 3, 28.
Materials and Methods
Cloning and protein purification
pET28b‐Rtt103p CID was a gift from B. Lunde 20. Rtt103p CID point mutants were obtained by QuikChange site‐directed mutagenesis kit (Stratagene). Resulting constructs were verified by DNA sequencing and then transformed into E. coli BL21‐Codon Plus (DE3)‐RIPL cells (Stratagene). Rtt103p CID (3‐131–6xHIS) was expressed and purified as previously described 20.
NMR measurements and structure determination
All NMR spectra for the backbone and side‐chain assignments were recorded on Bruker AVANCE III HD 950, 850 and 700 MHz spectrometers equipped with cryoprobes at a sample temperature of 20°C using 1 mM uniformly 15N,13C‐labelled Rtt103p CID in 35 mM KH2PO4, 100 mM KCl, pH 6.8 (20°C) (90% H2O/10% D2O). Initial backbone resonance frequency assignment was transferred from BMRB entries 17044 and 16411 and confirmed by HNCA spectrum. The spectra were processed using TOPSPIN 3.2 (Bruker Biospin), and the protein resonances were assigned manually using Sparky software (Goddard T.G. and Kneller D.G., University of California, San Francisco). For the assignment of the side‐chain proton and carbon resonances, 4D version of HCCH TOCSY 29 was measured with a non‐uniform sampling. Acquired data were processed and analysed analogously as described previously 30, 31.All distance constraints were derived from the three‐dimensional 15N‐ and 13C‐edited NOESYs collected on a 950 MHz spectrometer. Additionally, intermolecular distance constraints were obtained from the three‐dimensional F1‐13C/15N‐filtered NOESY‐[13C,1H]‐HSQC experiment 32, 33, with a mixing time of 150 ms on a 950 MHz spectrometer. The NOEs were semi‐quantitatively classified based on their intensities in the 3D NOESY spectra. The initial structure determinations of the Rtt103p‐CTD complex were performed with the automated NOE assignment module implemented in the CYANA 3.97 program 34. Then, the CYANA‐generated restraints along with manually assigned protein‐CTD intermolecular restraints were used for further refinement of the preliminary structures with AMBER16 software 35. These calculations employed a modified version (AMBER ff14SB) of the force field 36, using a protocol described previously 37, 38. The 20 lowest energy conformers were selected (out of 50 calculated) to form the final ensemble of structures. The atomic coordinates for the NMR ensemble of the Rtt103p CID‐pThr4‐CTD complex have been deposited in the Protein Data Bank under ID code 5LVF and in Biological Magnetic Resonance Bank under ID code 34041. Molecular graphics were generated using PyMOL (The PyMOL Molecular Graphics System, Version 1.8 Schrödinger, LLC).
Fluorescence anisotropy
The equilibrium binding of Rtt103p CID constructs to differently phosphorylated CTD was analysed by fluorescence anisotropy. The CTDpeptides were N‐terminally labelled with the 5,6‐carboxyfluorescein (FAM). The measurements were conducted on a FluoroLog‐3 spectrofluorometer (Horiba Jobin‐Yvon Edison, NJ). The instrument was equipped with a thermostatted cell holder with a Neslab RTE7 water bath (Thermo Scientific). Samples were excited with vertically polarized light at 467 nm, and both vertical and horizontal emissions were recorded at 516 nm. All measurements were conducted at 10°C in 35 mM KH2PO4, 100 mM KCl (pH 6.8). Each data point is an average of three measurements. The experimental binding isotherms were analysed by DynaFit using 1:1 model with non‐specific binding 39.
Cis‐trans population estimation
For the estimation of the cis‐trans population of conformers around the Ser‐Propeptide bond, aliphatic [13C,1H]‐HSQC was collected using 1 mM sample of peptide PSYS13CP(pT)SPSYS or PSY(pS)13CP(pT)SPSYS in 35 mM KH2PO4, 100 mM KCl (pH 6.8) in 90% H2O/10% D2O at 20°C on Bruker AVANCE III HD 700 MHz spectrometer. For the [13C,1H]‐HSQC titration experiment, 0.2 mM PSYS13CP(pT)SPSYS peptide was used and 1.2 mM Rtt103p CID stock was added. 1H‐13Cγ and 1H‐13Cβ peaks were integrated using Sparky routine (Goddard T.G. and Kneller D.G., University of California, San Francisco). Population was estimated as a ratio of the peak volume of a given conformation to the sum of volumes of all conformations.
Determination of chemical shift perturbation (CSP) value
Chemical shift perturbation (CSP) value is defined as the normalized length of a vector E
, whose components are differences δ
between observed chemical shifts (bound form) and chemical shifts from a reference experiment (free form). Index j represents the amino acid type within the primary sequence of the protein. Weight factors for each atom type w
H = 1 and w
N = 0.15 were used.
Peptides used in the study
The following peptides were synthesized by JPT (Berlin, DE) and Clonestar (Brno, CZ): FAM‐PSY(pS)PTSPSYSPTSPS; FAM‐PSYSP(pT)SPSYSPTSPS; FAM‐PSY(pS)P(pT)SPSYSPTSPS; FAM‐PSYSPTSPSYSPTSPS; FAM‐PSYSP(pT)SPSYSP(pT)SPS; FAM‐PSYSP(pT)SPSYS; FAM‐PS(pY)SPTSPSYSPTSPS; PSYSP(pT)SPSYSPTSPS; PSYS13CP(pT)SPSYS; PS Y(pS)13CP(pT)SPSYS.
Author contributions
OJ designed the experiments, prepared protein and peptide samples, measured and analysed FA, assigned spectra, calculated and refined structure, and wrote the manuscript; MK collected and processed 4D HCCH TOCSY spectra, and assisted with structure refinement; KK collected and processed NMR spectra, and assisted with structure calculation and refinement; RS designed the experiments, assisted with structure calculation and refinement, and wrote the manuscript.
Conflict of interest
The authors declare that they have no conflict of interest.Expanded View Figures PDFClick here for additional data file.Review Process FileClick here for additional data file.
Authors: Eric F Pettersen; Thomas D Goddard; Conrad C Huang; Gregory S Couch; Daniel M Greenblatt; Elaine C Meng; Thomas E Ferrin Journal: J Comput Chem Date: 2004-10 Impact factor: 3.376
Authors: Minkyu Kim; Nevan J Krogan; Lidia Vasiljeva; Oliver J Rando; Eduard Nedea; Jack F Greenblatt; Stephen Buratowski Journal: Nature Date: 2004-11-25 Impact factor: 49.962
Authors: Christian G Noble; David Hollingworth; Stephen R Martin; Valerie Ennis-Adeniran; Stephen J Smerdon; Geoff Kelly; Ian A Taylor; Andres Ramos Journal: Nat Struct Mol Biol Date: 2005-01-16 Impact factor: 15.369
Authors: Olga Jasnovidova; Tomas Klumpler; Karel Kubicek; Sergei Kalynych; Pavel Plevka; Richard Stefl Journal: Proc Natl Acad Sci U S A Date: 2017-10-04 Impact factor: 11.205
Authors: Thomas Evangelidis; Santrupti Nerli; Jiří Nováček; Andrew E Brereton; P Andrew Karplus; Rochelle R Dotas; Vincenzo Venditti; Nikolaos G Sgourakis; Konstantinos Tripsianes Journal: Nat Commun Date: 2018-01-26 Impact factor: 14.919