Literature DB >> 32469312

Structure of substrate-bound SMG1-8-9 kinase complex reveals molecular basis for phosphorylation specificity.

Lukas M Langer1, Yair Gat1, Fabien Bonneau1, Elena Conti1.   

Abstract

PI3K-related kinases (PIKKs) are large Serine/Threonine (Ser/Thr)-protein kinases central to the regulation of many fundamental cellular processes. PIKK family member SMG1 orchestrates progression of an RNA quality control pathway, termed nonsense-mediated mRNA decay (NMD), by phosphorylating the NMD factor UPF1. Phosphorylation of UPF1 occurs in its unstructured N- and C-terminal regions at Serine/Threonine-Glutamine (SQ) motifs. How SMG1 and other PIKKs specifically recognize SQ motifs has remained unclear. Here, we present a cryo-electron microscopy (cryo-EM) reconstruction of a human SMG1-8-9 kinase complex bound to a UPF1 phosphorylation site at an overall resolution of 2.9 Å. This structure provides the first snapshot of a human PIKK with a substrate-bound active site. Together with biochemical assays, it rationalizes how SMG1 and perhaps other PIKKs specifically phosphorylate Ser/Thr-containing motifs with a glutamine residue at position +1 and a hydrophobic residue at position -1, thus elucidating the molecular basis for phosphorylation site recognition.
© 2020, Langer et al.

Entities:  

Keywords:  Cryo-EM; PIKK; RNA quality control; human; molecular biophysics; nonsense-mediated mRNA decay; phosphorylation; structural biology

Mesh:

Substances:

Year:  2020        PMID: 32469312      PMCID: PMC7334022          DOI: 10.7554/eLife.57127

Source DB:  PubMed          Journal:  Elife        ISSN: 2050-084X            Impact factor:   8.140


Introduction

Family members of phosphatidylinositol 3-kinase-related kinases (PIKKs) activate distinct signaling pathways that promote cellular survival in different environmental and endogenous stress conditions (Baretić and Williams, 2014; Imseng et al., 2018; Lempiäinen and Halazonetis, 2009). <span class="Chemical">Specifically, PIKKs oversee translation machinery in the cytoplasm (<span class="Gene">mTOR, SMG1), or regulate DNA damage repair in the nucleus (ATM, ATR and DNA-PK) (Shimobayashi and Hall, 2014; Saxton and Sabatini, 2017; Yamashita, 2013; Yamashita et al., 2001; Blackford and Jackson, 2017; Elías-Villalobos et al., 2019). With the exception of the enzymatically inactive TRAPP/Tra1, which serves as a scaffold in chromatin modification complexes, all other members of the PIKK family are Ser/Thr-protein kinases, and are among the largest proteins in the eukaryotic kinome. Recent publications have revealed the organization of PIKK active sites at better than 4 Å resolution (Gat et al., 2019; Zhu et al., 2019; Yang et al., 2013; Jansma et al., 2020; Yates et al., 2020), but the key question of how members of this kinase family recognize their substrates remains unanswered. Human SMG1 is one of the largest PIKK family members (~410 kDa) and plays a crucial role in nonsense-mediated mRNA decay (NMD), a conserved pathway that regulates mRNA stability in the cytoplasm of eukaryotic cells (Kurosaki and Maquat, 2016; Karousis and Mühlemann, 2019). In its canonical surveillance function, the NMD pathway recognizes and degrades aberrant mRNAs containing premature translation termination codons, thus preventing the accumulation of truncated protein products. In addition, NMD also regulates the levels of a subset of normal, physiological transcripts, amounting to 5–10% of the transcriptome. In metazoans, SMG1 forms a stable complex with two additional proteins, SMG8 and SMG9, and specifically phosphorylates the RNA helicase UPF1. Phosphorylation of UPF1 is a crucial event in this pathway as it enables the recruitment of downstream NMD factors SMG5, SMG6 and SMG7, leading to ribonucleolytic cleavage of the RNA. SMG1 phosphorylates UPF1 specifically at Ser/Thr - Gln (SQ) motifs present in the unstructured N- and C-terminal regions that flank the helicase core (Yamashita et al., 2001; Denning et al., 2001). Specificity for a glutamine residue at the +1 position is shared by other PIKK family members, namely, ATM, ATR and DNA-PK kinases (Kim et al., 1999; Bannister et al., 1993). However, there is an additional layer of phosphorylation site specification. Among the 20 possible SQ motifs in UPF1, studies in vitro and in vivo have shown that only a selected few are effectively phosphorylated, including Ser1073, Ser1078, Ser1096 and Ser1116 (Yamashita et al., 2001; Ohnishi et al., 2003; Durand et al., 2016). Interestingly, these UPF1 phosphorylation sites share a Leu-Ser-Gln (LSQ) consensus sequence identical to the LSQ consensus motif identified in substrates of the ATM kinase (O'Neill et al., 2000; Kim et al., 1999). In this work, we studied the interaction between recombinant human SMG1-SMG8-SMG9 with UPF1 peptides using cryo-EM and mass spectrometry to identify the molecular basis with which SMG1, and potentially other PIKKs, recognizes specific phosphorylation sites in its substrate.

Results and discussion

Cryo-EM structure of the human SMG1-8-9 kinase complex bound to a UPF1 peptide

We used stably transfected HEK293T cells to express and purify a human wild-type SMG1-8-9 complex, as previously reported (Gat et al., 2019). The complex phosphorylated full-length recombinant UPF1 in a radioactive kinase assay (Figure 1—figure supplement 1A), confirming the enzymatic activity of purified SMG1 towards its physiological substrate. We selected a frequently phosphorylated site within UPF1 (Yamashita et al., 2001), Ser1078, and used a peptide spanning residues 1074–1084 (hereby defined as UPF1-LSQ) for subsequent structural and biochemical analysis (Figure 1—figure supplement 1B). We confirmed the ability of SMG1-8-9 to specifically phosphorylate UPF1-LSQ using a mass spectrometry-based phosphorylation assay (Figure 1—figure supplement 1C and D). This assay allowed us to monitor the relative amount of phosphorylation of a specific peptide over time. As a control, phosphorylation was abolished when Ser1078 was changed to Asp (Figure 1—figure supplement 1C and D). Hence, the reconstitutions used in this study recapitulate specific phosphorylation site selection.
Figure 1—figure supplement 1.

SMG1-8-9 activity and UPF1 SQ motifs.

(A) Radioactive phosphorylation assay using SMG1-8-9 and full-length UPF1. Coomassie-stained SDS-PAGE showing a change in migration behavior for UPF1 over time as phosphorylation proceeds. The corresponding radioactive signal is shown in the lower panel indicating an increase of UPF1 phosphorylation over time. (B) Alignment showing all SQ motifs present in UPF1 N- and C-terminus including positions −2 to +3. Note the high variance amongst position −1 residues. (C) Mass spectrometry-based phosphorylation assay with UPF1-LSQ and the indicated position 0 variations. The peptide sequence is indicated with the varying position marked as ‘X’. Error bars representing standard deviations calculated from independent experimental triplicates are shown. Phosphorylation was abolished when the phospho-acceptor residue was changed from Ser to Asp. This shows Ser1081 was not recognized for phosphorylation and confirms specificity toward the SQ motif. In addition, phosphorylation was decreased when Ser was changed to Thr, consistent with previous data for ATM, ATR and DNA-PK (Kim et al., 1999; O'Neill et al., 2000). (D). M/z spectra for representative single measurements at indicated time points of the experiment shown in C. The inset lists the expected sizes for the three peptides used in this experiment. Peaks corresponding to unphosphorylated and phosphorylated peptides are highlighted. Note the appearance and increase of intensity of peaks corresponding to phosphorylated peptides over time.

For structure determination, we incubated SMG1-8-9 with UPF1-LSQ and AMPPNP, a non-hydrolyzable ATP analogue, and subjected the sample to cryo-EM single particle analysis. The final reconstruction reached an overall resolution of 2.9 Å (Figure 1—figure supplements 2 and 3), and allowed us to further complete and refine the published model for SMG1-8-9 (Supplementary file 1; Gat et al., 2019). Briefly, SMG1 consists of an N-terminal solenoid 'arch' and a compact C-terminal 'head' region (Figure 1A and B). The C-terminal 'head' is formed by the tight interaction between the catalytic module, typical of Ser/Thr-kinases, and the so-called FAT and FATC domains (Imseng et al., 2018; Baretić and Williams, 2014; Bosotti et al., 2000). The N-terminal ‘arch’ provides binding sites for both SMG8 and SMG9 (Figure 1A and B). As we had previously reported, SMG9 contains an unusual G-fold domain that binds ATP rather than GTP or GDP (Gat et al., 2019). The local resolution of around 3 Å allowed us to model SMG9-bound ATP in the reconstructed density, revealing the molecular basis for how the adenosine nucleotide is recognized by this unusual G-fold domain (Figure 1B and D, Figure 1—figure supplement 4). Briefly, the G4 and G5 motifs responsible for the recognition of the base have rearranged to preferentially bind an adenine base rather than a guanine (Figure 1—figure supplement 4).
Figure 1—figure supplement 2.

Cryo-EM analysis of SMG1-8-9 bound to UPF1-LSQ.

(A) Coomassie-stained SDS-PAGE showing purified SMG1-8-9. The asterisk indicates contaminants. (B) Representative micrograph of the collected data set with some SMG1-8-9 particles indicated by blue circles. Scale bar ≈ 500 Å. (C) Representative 2D averages of picked particles. Scale bar ≈ 100 Å. (D) Spherical angular distribution of particles contributing to the final reconstruction with larger red rods indicating more prominent particle views and smaller blue rods indicating rarer particle views. (E) Map of SMG1-8-9 colored according to estimated local resolution shown in two different views as in Figure 1B. Large parts of the complex including the kinase domain and the bound UPF1 peptide are resolved to around 3 Å. Important features of the map are indicated. (F) Three-dimensional FSC plot and further analysis of orientation bias. The red line represents the estimated global masked half-map FSC curve indicating a nominal overall resolution of 2.97 Å according to the gold standard FSC cut off of 0.143 (Rosenthal and Henderson, 2003). The spread of directional resolution values is defined as ± 1σ (shown as dashed grey lines). Overall isotropy of the map is confirmed by a sphericity of 0.957 (out of 1) (Tan et al., 2017). (G) Model vs. map FSC plot for the real spaced refined model. (H) Model vs. map FSC plots for half map 1 ("work") used for real space refinement after displacing atoms of the final model (σ = 0.5 Å) and half map 2 not used for refinement ("free"). The good agreement of the two curves indicates that no substantial overfitting took place.

Figure 1—figure supplement 3.

Cryo-EM data processing scheme.

Processing steps are indicated in blue; particle numbers of classes used for downstream processing steps (in brackets: percentage with respect to initial candidate particles) and resolutions are in black. The class selected for the 3D refinement after the last step of 3D classification is indicated by a red rectangle. Density for the unmodelled, C-terminal part of SMG8 appearing in two other 3D classes not used for the final reconstruction is highlighted in magenta. This part of SMG8 has been suggested to contribute to kinase regulation (Zhu et al., 2019).

Figure 1.

Cryo-EM reconstruction of SMG1-8-9 bound to UPF1-LSQ.

(A) Domain organization of SMG1, SMG8, SMG9 and UPF1. White parts are not resolved in the reconstruction. The sequence and location of UPF1-LSQ is indicated with blue text and dotted lines. (B) Segmented cryo-EM reconstruction of substrate-bound SMG1-8-9. Two different views are shown; proteins and domains are colored as in A. (C) A zoomed-in view of SMG1 showing the kinase active site with bound AMPPNP and UPF1-LSQ. Reconstructed density for UPF1-LSQ is shown as a blue mesh. (D) Zoom-in showing ATP bound to SMG9 with reconstructed density displayed as a blue mesh.

(A) Radioactive phosphorylation assay using SMG1-8-9 and full-length UPF1. Coomassie-stained SDS-PAGE showing a change in migration behavior for UPF1 over time as phosphorylation proceeds. The corresponding radioactive signal is shown in the lower panel indicating an increase of UPF1 phosphorylation over time. (B) Alignment showing all SQ motifs present in UPF1 N- and C-terminus including positions −2 to +3. Note the high variance amongst position −1 residues. (C) Mass spectrometry-based phosphorylation assay with UPF1-LSQ and the indicated position 0 variations. The peptide sequence is indicated with the varying position marked as ‘X’. Error bars representing standard deviations calculated from independent experimental triplicates are shown. Phosphorylation was abolished when the phospho-acceptor residue was changed from Ser to Asp. This shows Ser1081 was not recognized for phosphorylation and confirms specificity toward the SQ motif. In addition, phosphorylation was decreased when Ser was changed to Thr, consistent with previous data for ATM, ATR and DNA-PK (Kim et al., 1999; O'Neill et al., 2000). (D). M/z spectra for representative single measurements at indicated time points of the experiment shown in C. The inset lists the expected sizes for the three peptides used in this experiment. Peaks corresponding to unphosphorylated and phosphorylated peptides are highlighted. Note the appearance and increase of intensity of peaks corresponding to phosphorylated peptides over time.

(A) Coomassie-stained SDS-PAGE showing purified SMG1-8-9. The asterisk indicates contaminants. (B) Representative micrograph of the collected data set with some SMG1-8-9 particles indicated by blue circles. Scale bar ≈ 500 Å. (C) Representative 2D averages of picked particles. Scale bar ≈ 100 Å. (D) Spherical angular distribution of particles contributing to the final reconstruction with larger red rods indicating more prominent particle views and smaller blue rods indicating rarer particle views. (E) Map of SMG1-8-9 colored according to estimated local resolution shown in two different views as in Figure 1B. Large parts of the complex including the kinase domain and the bound UPF1 peptide are resolved to around 3 Å. Important features of the map are indicated. (F) Three-dimensional FSC plot and further analysis of orientation bias. The red line represents the estimated global masked half-map FSC curve indicating a nominal overall resolution of 2.97 Å according to the gold standard FSC cut off of 0.143 (Rosenthal and Henderson, 2003). The spread of directional resolution values is defined as ± 1σ (shown as dashed grey lines). Overall isotropy of the map is confirmed by a sphericity of 0.957 (out of 1) (Tan et al., 2017). (G) Model vs. map FSC plot for the real spaced refined model. (H) Model vs. map FSC plots for half map 1 ("work") used for real space refinement after displacing atoms of the final model (σ = 0.5 Å) and half map 2 not used for refinement ("free"). The good agreement of the two curves indicates that no substantial overfitting took place.

Processing steps are indicated in blue; particle numbers of classes used for downstream processing steps (in brackets: percentage with respect to initial candidate particles) and resolutions are in black. The class selected for the 3D refinement after the last step of 3D classification is indicated by a red rectangle. Density for the unmodelled, C-terminal part of SMG8 appearing in two other 3D classes not used for the final reconstruction is highlighted in magenta. This part of SMG8 has been suggested to contribute to kinase regulation (Zhu et al., 2019).

(A) Detailed view of ATP purine ring recognition by SMG9 with important residues highlighted and reconstructed density indicated. (B) Two-dimensional sketch of adenine base recognition by SMG9 shown in panel A with G motifs, key residues and distances indicated. (C) Detailed view of GTP recognition by the GTPase RAS with important residues highlighted (PDB ID 1ZVQ). (D) Two-dimensional sketch of guanine base recognition shown in panel C by RAS with G motifs, key residues and distances indicated for comparison with panel B (based on PDB ID 1ZVQ). (E) Two-dimensional sketch of the overall recognition of ATP by SMG9. The G1-G3 motifs of SMG9 coordinate the phosphate moieties of ATP, similarly to the corresponding motifs of canonical GTPases, but with an important exception, in that they lack the typical residues crucial for catalysis. Another difference with canonical G-fold is that the G4 and G5 motifs responsible for the recognition of the base have rearranged to preferentially bind an adenine base rather than a guanine. (F) Sequence alignment of G motifs of RHEB and RAS GTPases with SMG9. Residues highlighted in A are indicated by a black dot. In the G4 motif, the aspartic acid that specifically engages the guanine moiety in GTPases has diverged to an arginine residue. Instead, Asn372 in the SMG9 G4 motif has shifted so that it engages the side-chain carbonyl group to interact with the adenine amino group (compare panels A and B with C and D). In the G5 motif, the SAK consensus sequence in RAS and RHEB has diverged in SMG9, with the main-chain carbonyl and amine groups of the upstream residues, Pro432 and Met434, forming adenine-specific interactions (compare panels A and B with C and D). (G) Three-dimensional sketches of SMG9 (top) and RAS/RHEB (bottom) highlighting similarities in their overall topology. Positions of the G motifs are highlighted.

(A) Active site density reported in this study. Densities attributed to either AMPPNP or UPF1 peptide are indicated. (B) Active site density of the apo SMG1-8-9 reconstruction reported previously (EMDB 10347). (C) Overlay of the active site densities shown in A and B with the active site density reported in this study shown in transparent grey and density of apo SMG1-8-9 (EMDB 10347) shown in blue. Densities attributed to either AMPPNP or UPF-LSQ are indicated. (D) Model of IP6 and UPF1-LSQ with the corresponding density shown as a blue mesh. Colors of the model as in Figure 1. (E) Additional parts of the model with the corresponding density shown as in B.

Figure 1—figure supplement 4.

SMG9 is a G-fold containing protein binding ATP and exhibits distinct differences to the bona fide GTPase RAS.

(A) Detailed view of ATP purine ring recognition by SMG9 with important residues highlighted and reconstructed density indicated. (B) Two-dimensional sketch of adenine base recognition by SMG9 shown in panel A with G motifs, key residues and distances indicated. (C) Detailed view of GTP recognition by the GTPase RAS with important residues highlighted (PDB ID 1ZVQ). (D) Two-dimensional sketch of guanine base recognition shown in panel C by RAS with G motifs, key residues and distances indicated for comparison with panel B (based on PDB ID 1ZVQ). (E) Two-dimensional sketch of the overall recognition of ATP by SMG9. The G1-G3 motifs of SMG9 coordinate the phosphate moieties of ATP, similarly to the corresponding motifs of canonical GTPases, but with an important exception, in that they lack the typical residues crucial for catalysis. Another difference with canonical G-fold is that the G4 and G5 motifs responsible for the recognition of the base have rearranged to preferentially bind an adenine base rather than a guanine. (F) Sequence alignment of G motifs of RHEB and RAS GTPases with SMG9. Residues highlighted in A are indicated by a black dot. In the G4 motif, the aspartic acid that specifically engages the guanine moiety in GTPases has diverged to an arginine residue. Instead, Asn372 in the SMG9 G4 motif has shifted so that it engages the side-chain carbonyl group to interact with the adenine amino group (compare panels A and B with C and D). In the G5 motif, the SAK consensus sequence in RAS and RHEB has diverged in SMG9, with the main-chain carbonyl and amine groups of the upstream residues, Pro432 and Met434, forming adenine-specific interactions (compare panels A and B with C and D). (G) Three-dimensional sketches of SMG9 (top) and RAS/RHEB (bottom) highlighting similarities in their overall topology. Positions of the G motifs are highlighted.

Cryo-EM reconstruction of SMG1-8-9 bound to UPF1-LSQ.

(A) Domain organization of SMG1, SMG8, SMG9 and UPF1. White parts are not resolved in the reconstruction. The sequence and location of UPF1-LSQ is indicated with blue text and dotted lines. (B) Segmented cryo-EM reconstruction of substrate-bound SMG1-8-9. Two different views are shown; proteins and domains are colored as in A. (C) A zoomed-in view of SMG1 showing the kinase active site with bound AMPPNP and UPF1-LSQ. Reconstructed density for UPF1-LSQ is shown as a blue mesh. (D) Zoom-in showing ATP bound to SMG9 with reconstructed density displayed as a blue mesh.

SMG1-8-9 activity and UPF1 SQ motifs.

(A) Radioactive phosphorylation assay using SMG1-8-9 and full-length UPF1. Coomassie-stained SDS-PAGE showing a change in migration behavior for UPF1 over time as phosphorylation proceeds. The corresponding radioactive signal is shown in the lower panel indicating an increase of UPF1 phosphorylation over time. (B) Alignment showing all SQ motifs present in UPF1 N- and C-terminus including positions −2 to +3. Note the high variance amongst position −1 residues. (C) Mass spectrometry-based phosphorylation assay with UPF1-LSQ and the indicated position 0 variations. The peptide sequence is indicated with the varying position marked as ‘X’. Error bars representing standard deviations calculated from independent experimental triplicates are shown. Phosphorylation was abolished when the phospho-acceptor residue was changed from Ser to Asp. This shows Ser1081 was not recognized for phosphorylation and confirms specificity toward the SQ motif. In addition, phosphorylation was decreased when Ser was changed to Thr, consistent with previous data for ATM, ATR and DNA-PK (Kim et al., 1999; O'Neill et al., 2000). (D). M/z spectra for representative single measurements at indicated time points of the experiment shown in C. The inset lists the expected sizes for the three peptides used in this experiment. Peaks corresponding to unphosphorylated and phosphorylated peptides are highlighted. Note the appearance and increase of intensity of peaks corresponding to phosphorylated peptides over time.

Cryo-EM analysis of SMG1-8-9 bound to UPF1-LSQ.

(A) Coomassie-stained SDS-PAGE showing purified SMG1-8-9. The asterisk indicates contaminants. (B) Representative micrograph of the collected data set with some SMG1-8-9 particles indicated by blue circles. Scale bar ≈ 500 Å. (C) Representative 2D averages of picked particles. Scale bar ≈ 100 Å. (D) Spherical angular distribution of particles contributing to the final reconstruction with larger red rods indicating more prominent particle views and smaller blue rods indicating rarer particle views. (E) Map of SMG1-8-9 colored according to estimated local resolution shown in two different views as in Figure 1B. Large parts of the complex including the kinase domain and the bound UPF1 peptide are resolved to around 3 Å. Important features of the map are indicated. (F) Three-dimensional FSC plot and further analysis of orientation bias. The red line represents the estimated global masked half-map FSC curve indicating a nominal overall resolution of 2.97 Å according to the gold standard FSC cut off of 0.143 (Rosenthal and Henderson, 2003). The spread of directional resolution values is defined as ± 1σ (shown as dashed grey lines). Overall isotropy of the map is confirmed by a sphericity of 0.957 (out of 1) (Tan et al., 2017). (G) Model vs. map FSC plot for the real spaced refined model. (H) Model vs. map FSC plots for half map 1 ("work") used for real space refinement after displacing atoms of the final model (σ = 0.5 Å) and half map 2 not used for refinement ("free"). The good agreement of the two curves indicates that no substantial overfitting took place.

Cryo-EM data processing scheme.

Processing steps are indicated in blue; particle numbers of classes used for downstream processing steps (in brackets: percentage with respect to initial candidate particles) and resolutions are in black. The class selected for the 3D refinement after the last step of 3D classification is indicated by a red rectangle. Density for the unmodelled, C-terminal part of <span class="Gene">SMG8 appearing in two other 3D classes not used for the final reconstruction is highlighted in magenta. T<span class="Chemical">his part of SMG8 has been suggested to contribute to kinase regulation (Zhu et al., 2019).

SMG9 is a G-fold containing protein binding ATP and exhibits distinct differences to the bona fide GTPase RAS.

(A) Detailed view of ATP purine ring recognition by SMG9 with important residues highlighted and reconstructed density indicated. (B) Two-dimensional sketch of adenine base recognition by SMG9 shown in panel A with G motifs, key residues and distances indicated. (C) Detailed view of GTP recognition by the GTPase RAS with important residues highlighted (PDB ID 1ZVQ). (D) Two-dimensional sketch of guanine base recognition shown in panel C by RAS with G motifs, key residues and distances indicated for comparison with panel B (based on PDB ID 1ZVQ). (E) Two-dimensional sketch of the overall recognition of ATP by SMG9. The G1-G3 motifs of SMG9 coordinate the phosphate moieties of ATP, similarly to the corresponding motifs of canonical GTPases, but with an important exception, in that they lack the typical residues crucial for catalysis. Another difference with canonical G-fold is that the G4 and G5 motifs responsible for the recognition of the base have rearranged to preferentially bind an adenine base rather than a guanine. (F) Sequence alignment of G motifs of RHEB and RAS GTPases with SMG9. Residues highlighted in A are indicated by a black dot. In the G4 motif, the aspartic acid that specifically engages the guanine moiety in GTPases has diverged to an arginine residue. Instead, Asn372 in the SMG9 G4 motif has shifted so that it engages the side-chain carbonyl group to interact with the adenine amino group (compare panels A and B with C and D). In the G5 motif, the SAK consensus sequence in RAS and RHEB has diverged in SMG9, with the main-chain carbonyl and amine groups of the upstream residues, Pro432 and Met434, forming adenine-specific interactions (compare panels A and B with C and D). (G) Three-dimensional sketches of SMG9 (top) and RAS/RHEB (bottom) highlighting similarities in their overall topology. Positions of the G motifs are highlighted.

Quality of the reconstructed density.

(A) Active site density reported in this study. Densities attributed to either AMPPNP or UPF1 peptide are indicated. (B) Active site density of the apo SMG1-8-9 reconstruction reported previously (EMDB 10347). (C) Overlay of the active site densities shown in A and B with the active site density reported in this study shown in transparent grey and density of apo SMG1-8-9 (EMDB 10347) shown in blue. Densities attributed to either AMPPNP or UPF-LSQ are indicated. (D) Model of IP6 and UPF1-LSQ with the corresponding density shown as a blue mesh. Colors of the model as in Figure 1. (E) Additional parts of the model with the corresponding density shown as in B. Importantly, compared to the previously published apo-SMG1-8-9 structure (Gat et al., 2019), the current reconstruction revealed extra density in the kinase active site accounting for both AMPPNP and UPF1-LSQ (Figure 1B and C, Figure 1—figure supplement 5A,B and C).
Figure 1—figure supplement 5.

Quality of the reconstructed density.

(A) Active site density reported in this study. Densities attributed to either AMPPNP or UPF1 peptide are indicated. (B) Active site density of the apo SMG1-8-9 reconstruction reported previously (EMDB 10347). (C) Overlay of the active site densities shown in A and B with the active site density reported in this study shown in transparent grey and density of apo SMG1-8-9 (EMDB 10347) shown in blue. Densities attributed to either AMPPNP or UPF-LSQ are indicated. (D) Model of IP6 and UPF1-LSQ with the corresponding density shown as a blue mesh. Colors of the model as in Figure 1. (E) Additional parts of the model with the corresponding density shown as in B.

Positioning of UPF1 Ser1078 in the SMG1 active site for phosphoryl transfer

The active site of SMG1 shows excellent density for residues 1075–1081 of UPF1-LSQ (Figure 1C, Figure 1—figure supplement 5). The directionality of the bound substrate peptide is consistent with that of other kinase structures, such as CDK2, and the arrangement of key active site residues is well conserved (Figure 2—figure supplement 1; Bao et al., 2011). The geometry of the catalytic loop (residues 2332 to 2340) and of the activation segment (residues 2352 to 2375) in the SMG1 kinase domain as well as orientation of important active site residues are very similar to those observed in mTOR (Figure 2—figure supplement 1B), indicative of an active kinase state (Yang et al., 2013). Specific recognition of the phosphorylation site is achieved via conserved residues contributed by the activation segment and the catalytic loop as well as by the FATC domain (Figure 2). The hydroxyl group of Ser1078, the phospho-acceptor residue in UPF1-LSQ, is positioned by residues of the catalytic loop, in particular Asp2335 and His2337 (Figure 2B and C, Figure 2—figure supplement 2A and B). Consistent with the structural observations, mutation of either of the corresponding residues in SMG1, mTOR and other protein kinases results in catalytically inactive enzyme (Bao et al., 2011; Madhusudan et al., 2002; Yang et al., 2013; Brown et al., 1995; Yamashita et al., 2001; Denning et al., 2001). Therefore, the overall architecture of the substrate-bound SMG1 catalytic module corroborates the structural conservation among PIKK active sites and reveals that positioning of the substrate phospho-acceptor is achieved by residues that are shared between a wide range of protein kinases.
Figure 2—figure supplement 1.

Comparison of SMG1 substrate-bound active site to other protein kinases.

(A) Superposition of substrate-bound SMG1 active site with substrate-bound CDK2 active site (PDB ID, 3QHW). SMG1 catalytic loop is colored in magenta, SMG1 activation segment is in pink, CDK2 is shown in salmon and the CDK2 substrate peptide is in orange. Directionality of the substrates and important catalytic residues are indicated and colored as described above. P-loops were omitted for clarity. The SMG1 kinase contains features that have previously been associated with a model preferring the presence of a ‘dissociative’ transition state, such as a positive charge closely involved in β-phosphate coordination, namely K2155 (residue not shown) (Wang and Cole, 2014). (B) Superposition of substrate-bound SMG1 active site with nucleotide-bound mTOR active site (PDB ID, 4JSP). SMG1 active site is colored as described in panel A and mTOR is in green. Important active site residues are indicated. P-loops were omitted for clarity. (C) Close-up of the active site hydrophobic cage after superposing the model of the substrate-bound SMG1 active site with Chaetomium thermophilum Tel1ATM (PDB ID, 6SKY). SMG1 active site is colored as described in panel A and CtTel1ATM is colored in light orange. Key residues are indicated. (D) Close-up of the active site hydrophobic cage based on the superposition of SMG1 and mTOR shown in B. Color scheme as in B and important residues are indicated. View is similar to C.

Figure 2.

Organization of the substrate-bound kinase active site.

(A) The structure of the entire complex is shown overlayed with transparent reconstructed density. The black box indicates the location of the kinase active site. (B) SMG1 active site with important residues shown as sticks. Activation segment and catalytic loop as indicated; the p-loop was omitted for clarity. UPF1-LSQ is shown in blue with positions of important residues highlighted. (C) Two-dimensional sketch of the SMG1 active site with key kinase-substrate interactions indicated. (D) Activation segment (pink) and catalytic loop (magenta) regions of a SMG1 sequence alignment are shown, indicating a high level of conservation across Homo sapiens, Bos taurus, Mus musculus, Gallus gallus, Xenopus laevis, Danio rerio and Caenorhabditis elegans. Key residues shown in B are highlighted by a black dot and activation segment and catalytic loop are indicated.

(A) Superposition of substrate-bound SMG1 active site with substrate-bound CDK2 active site (PDB ID, 3QHW). SMG1 catalytic loop is colored in magenta, SMG1 activation segment is in pink, CDK2 is shown in salmon and the CDK2 substrate peptide is in orange. Directionality of the substrates and important catalytic residues are indicated and colored as described above. P-loops were omitted for clarity. The SMG1 kinase contains features that have previously been associated with a model preferring the presence of a ‘dissociative’ transition state, such as a positive charge closely involved in β-phosphate coordination, namely K2155 (residue not shown) (Wang and Cole, 2014). (B) Superposition of substrate-bound SMG1 active site with nucleotide-bound mTOR active site (PDB ID, 4JSP). SMG1 active site is colored as described in panel A and mTOR is in green. Important active site residues are indicated. P-loops were omitted for clarity. (C) Close-up of the active site hydrophobic cage after superposing the model of the substrate-bound SMG1 active site with Chaetomium thermophilum Tel1ATM (PDB ID, 6SKY). SMG1 active site is colored as described in panel A and CtTel1ATM is colored in light orange. Key residues are indicated. (D) Close-up of the active site hydrophobic cage based on the superposition of SMG1 and mTOR shown in B. Color scheme as in B and important residues are indicated. View is similar to C.

(A) Close-up showing how residues of the SMG1 catalytic loop coordinate Ser1078 of UPF1-LSQ. (B) Alignment of catalytic loop sequences from Homo sapiens and Xenopus laevis for all PIKKs colored according to conservation. PIKKs are grouped by phosphorylation site specificity and residues highlighted in subfigure C are indicated by a black dot.

Figure 2—figure supplement 2.

Recognition of UPF1-LSQ phospho-acceptor residue Ser1078 by SMG1 catalytic loop residues.

(A) Close-up showing how residues of the SMG1 catalytic loop coordinate Ser1078 of UPF1-LSQ. (B) Alignment of catalytic loop sequences from Homo sapiens and Xenopus laevis for all PIKKs colored according to conservation. PIKKs are grouped by phosphorylation site specificity and residues highlighted in subfigure C are indicated by a black dot.

Organization of the substrate-bound kinase active site.

(A) The structure of the entire complex is shown overlayed with transparent reconstructed density. The black box indicates the location of the kinase active site. (B) SMG1 active site with important residues shown as sticks. Activation segment and catalytic loop as indicated; the p-loop was omitted for clarity. UPF1-LSQ is shown in blue with positions of important residues highlighted. (C) Two-dimensional sketch of the SMG1 active site with key kinase-substrate interactions indicated. (D) Activation segment (pink) and catalytic loop (magenta) regions of a SMG1 sequence alignment are shown, indicating a high level of conservation across Homo sapiens, Bos taurus, Mus musculus, Gallus gallus, Xenopus laevis, Danio rerio and Caenorhabditis elegans. Key residues shown in B are highlighted by a black dot and activation segment and catalytic loop are indicated.

Comparison of SMG1 substrate-bound active site to other protein kinases.

(A) Superposition of substrate-bound SMG1 active site with substrate-bound CDK2 active site (PDB ID, 3QHW). SMG1 catalytic loop is colored in magenta, SMG1 activation segment is in pink, CDK2 is shown in salmon and the CDK2 substrate peptide is in orange. Directionality of the substrates and important catalytic residues are indicated and colored as described above. P-loops were omitted for clarity. The SMG1 kinase contains features that have previously been associated with a model preferring the presence of a ‘dissociative’ transition state, such as a positive charge closely involved in β-phosphate coordination, namely K2155 (residue not shown) (Wang and Cole, 2014). (B) Superposition of substrate-bound SMG1 active site with nucleotide-bound mTOR active site (PDB ID, 4JSP). SMG1 active site is colored as described in panel A and mTOR is in green. Important active site residues are indicated. P-loops were omitted for clarity. (C) Close-up of the active site hydrophobic cage after superposing the model of the substrate-bound SMG1 active site with Chaetomium thermophilum Tel1ATM (PDB ID, 6SKY). SMG1 active site is colored as described in panel A and CtTel1ATM is colored in light orange. Key residues are indicated. (D) Close-up of the active site hydrophobic cage based on the superposition of SMG1 and mTOR shown in B. Color scheme as in B and important residues are indicated. View is similar to C.

Recognition of UPF1-LSQ phospho-acceptor residue Ser1078 by SMG1 catalytic loop residues.

(A) Close-up showing how residues of the SMG1 catalytic loop coordinate Ser1078 of UPF1-LSQ. (B) Alignment of catalytic loop sequences from Homo sapiens and Xenopus laevis for all PIKKs colored according to conservation. PIKKs are grouped by phosphorylation site specificity and residues highlighted in subfigure C are indicated by a black dot.

Crucial recognition of a glutamine residue at +1 position of the UPF1 consensus motif

A glutamine residue following the phospn>ho-acceptor site is the minimal requirement for UPF1 phosphorylation by SMG1 (Figure 3A; Yamashita et al., 2001). To validate the importance of this residue, we performed a mass spectrometry-based phosphorylation assay using a series of peptides based on UPF1-LSQ. We changed the residue at position +1 in the UPF1-LSQ peptide to test the effect of different side chain properties on phosphorylation. Only wildtype UPF1-LSQ was efficiently phosphorylated by SMG1 (Figure 3B and Figure 1—figure supplement 1C). In our structure, the glutamine at position +1 of UPF1-LSQ reaches into a hydrophobic cage formed by the SMG1 activation segment and FATC domain (Figure 3C). In particular, UPF1 Gln1079 stacks against Tyr3654 and Leu2365 and forms hydrogen bonds with the backbone of Val2367 (Figure 3C). The hydrophobic cage is highly conserved in other PIKKs that recognize SQ motifs (ATM, ATR and DNA-PK) but is different in mTOR (where Glu2369 is found at the equivalent position of SMG1 Leu2365) (Figure 3D). This difference is also apparent from the superposition of SMG1 with the 2.8 Å resolution structure of the ATM orthologue Chaetomium thermophilum (Ct) Tel1ATM and with mTOR (Figure 2—figure supplement 1C and D; Jansma et al., 2020; Yang et al., 2013). While the geometry of the hydrophobic cage is highly similar between SMG1 and CtTel1ATM, it deviates in mTOR due to the described Leu to Glu substitution. Indeed, mTOR has been found to prefer small or non-polar residues at position +1 of its phosphorylation consensus motif (Hsu et al., 2011). Taken together, these observations provide a rationale for the difference in phosphorylation site specificity between SMG1, ATM, ATR, DNA-PK and mTOR. Intriguingly, the structural superposition with CtTel1ATM shows that its PIKK regulatory domain (PRD) places a Gln residue in the corresponding hydrophobic cage, effectively occupying the substrate Gln binding site (Figure 2—figure supplement 1C). This explains the autoinhibitory function of the ATM PRD domain (Jansma et al., 2020; Yates et al., 2020). The corresponding PRD domain in SMG1 is a ~ 1100 amino-acid long insertion (Figure 1A) that negatively impacts its kinase activity (Deniaud et al., 2015). However, there is no ordered density for this region in neither the previous apo-structure (Gat et al., 2019) nor in the current substrate-bound structure (Figure 1A and B).
Figure 3.

Recognition of position +1 glutamine of UPF1-LSQ.

(A) Sequence logo derived from an alignment of all SQ motifs present in human UPF1 with the respective residue positions indicated. The heights of single letters correspond to the observed frequency at that position and the overall height of a stack of letters indicates the level of conservation (Figure 1—figure supplement 1B and Figure 4—figure supplement 2; Crooks et al., 2004). (B) Mass spectrometry-based phosphorylation assay comprising UPF1-LSQ and the indicated position +1 variations. The peptide sequence is indicated in the upper left with the varied position marked as ‘X’. Error bars representing standard deviations calculated from independent experimental triplicates are shown. (C) Zoom-in of the SMG1 active site showing the recognition of UPF1 position +1 glutamine by SMG1 residues located in the activation segment and FATC domain. Residues of interest are shown as sticks. Colors as in Figure 2. (D) Alignment of PIKK sequences from Homo sapiens and Xenopus laevis with the activation segment and FATC domain sequences shown and colored according to conservation. PIKKs are grouped by phosphorylation site specificity and residues highlighted in subfigure C are indicated by a black dot.

Recognition of position +1 glutamine of UPF1-LSQ.

(A) Sequence logo derived from an alignment of all SQ motifs present in human UPF1 with the respective residue positions indicated. The heights of single letters correspond to the observed frequency at that position and the overall height of a stack of letters indicates the level of conservation (Figure 1—figure supplement 1B and Figure 4—figure supplement 2; Crooks et al., 2004). (B) Mass spectrometry-based phosphorylation assay comprising UPF1-LSQ and the indicated position +1 variations. The peptide sequence is indicated in the upper left with the varied position marked as ‘X’. Error bars representing standard deviations calculated from independent experimental triplicates are shown. (C) Zoom-in of the SMG1 active site showing the recognition of UPF1 position +1 glutamine by SMG1 residues located in the activation segment and FATC domain. Residues of interest are shown as sticks. Colors as in Figure 2. (D) Alignment of PIKK sequences from Homo sapiens and Xenopus laevis with the activation segment and FATC domain sequences shown and colored according to conservation. PIKKs are grouped by phosphorylation site specificity and residues highlighted in subfigure C are indicated by a black dot.
Figure 4—figure supplement 2.

Alignment of UPF1 N- and C-terminal sequences from Homo sapiens, Bos taurus, Canis lupus, Mus musculus, Gallus gallus, Xenopus tropicalis, Danio rerio and Caenorhabditis elegans.

The alignment is colored according to conservation. (A) Sequence alignment of UPF1 N-terminus. SQ motifs are highlighted by orange boxes with the phosphorylation site position indicated on top. (B) As in A, but for UPF1 C-terminus and using red color to indicate SQ motifs. Location and range of UPF1-LSQ are indicated.

Preferred recognition of a leucine residue at −1 position of the UPF1 consensus motif

Previous results have indicated that SQ motifs preceded by a hydrophobic residue in position −1 are preferentially phosphorylated by SMG1 (Yamashita et al., 2001). In our model, the Leu residue at position −1 in the substrate forms a C-H⋅⋅⋅π-interaction with SMG1 Phe2215 and is further stabilized by hydrophobic interactions with SMG1 Pro2249 and Gly3656. The binding pocket is also restricted by the catalytic loop residues His2337 and Asp2339 (Figure 4A). To biochemically characterize the importance of position −1, we assayed a peptide library based on UPF1-LSQ, in which we varied the residue in position −1 to represent all those found in the 20 different SQ motifs of human UPF1. Following phosphorylation of the peptides over time, we could observe that SQ motifs carrying a hydrophobic residue in position −1 were more efficiently phosphorylated. Notably, a Leu elicited the highest phosphorylation rate (Figure 4B). An end-point measurement experiment using single peptides confirmed these observations (Figure 4—figure supplement 1). We conclude that a Leu at position −1 is optimal for the interaction with SMG1 at the structural level. This is reflected at the biochemical level, whereby decreasing hydrophobicity of the residue at the −1 position negatively affects phosphorylation efficiency.
Figure 4.

SMG1 preferentially selects for substrates with hydrophobic residues in position -1.

(A) Recognition of position -1 residue of UPF1-LSQ by SMG1. Important residues are shown as sticks and colored as in Figure 2. (B) Mass spectrometry-based phosphorylation assay with UPF1-LSQ and derivatives varied in position -1. The peptide sequence is indicated in the upper left with the varying position marked as “X”. Error bars representing standard deviations calculated from independent experimental triplicates are shown and curves are colored according to hydrophobicity of the position -1 residue (Eisenberg et al., 1984). The most hydrophobic peptides are in blue while non-hydrophobics are in red. (C) Final time points of experiment shown in B. Peptides grouped and colored according to the location of the respective position -1 residue in UPF1 N- or C-terminus. Individual data points are shown as circles and error bars representing standard deviations are indicated. (D) Mass spectrometry-based phosphorylation assay with UPF1 N-terminus phosphorylation site 28 and the indicated position -1 variation. The peptide sequence is shown in the upper left with the varied position marked as “X”. Error bars represent standard deviations resulting from independent experimental triplicates. A tyrosine residue was added to the C-terminal end of the wildtype sequence to increase absorbance at = 280 nm.

(A) Mass spectrometry-based end-point measurements of single peptides combined with a control. Data points corresponding to one experiment are shown in the same color. One preferred (LSQ) and two suboptimal (DSQ, KSQ) motifs were chosen to evaluate how the absence of more optimal, competing motifs affects phosphorylation efficiency toward single peptides. The results indicate that decreased phosphorylation of peptides with non-hydrophobic position −1 residues is not only caused by competition with more optimal substrates (compare assay shown in Figure 4B). A peptide with the sequence QPELDQDSYLG was used as a control in all three experiments. Reactions were quenched and analyzed after 80 min. The peptide sequence is shown with the varied position marked as ‘X’ (B) Mass spectrometry-based phosphorylation assay with UPF1-LSQ and derivatives varying in position +2. The peptide sequence is indicated with the varying position marked as ‘X’. Error bars representing standard deviations calculated from independent experimental triplicates are shown.

The alignment is colored according to conservation. (A) Sequence alignment of UPF1 N-terminus. SQ motifs are highlighted by orange boxes with the phosphorylation site position indicated on top. (B) As in A, but for UPF1 C-terminus and using red color to indicate SQ motifs. Location and range of UPF1-LSQ are indicated.

Figure 4—figure supplement 1.

In vitro phosphorylation of UPF1-LSQ and derivatives.

(A) Mass spectrometry-based end-point measurements of single peptides combined with a control. Data points corresponding to one experiment are shown in the same color. One preferred (LSQ) and two suboptimal (DSQ, KSQ) motifs were chosen to evaluate how the absence of more optimal, competing motifs affects phosphorylation efficiency toward single peptides. The results indicate that decreased phosphorylation of peptides with non-hydrophobic position −1 residues is not only caused by competition with more optimal substrates (compare assay shown in Figure 4B). A peptide with the sequence QPELDQDSYLG was used as a control in all three experiments. Reactions were quenched and analyzed after 80 min. The peptide sequence is shown with the varied position marked as ‘X’ (B) Mass spectrometry-based phosphorylation assay with UPF1-LSQ and derivatives varying in position +2. The peptide sequence is indicated with the varying position marked as ‘X’. Error bars representing standard deviations calculated from independent experimental triplicates are shown.

SMG1 preferentially selects for substrates with hydrophobic residues in position -1.

(A) Recognition of position -1 residue of UPF1-LSQ by SMG1. Important residues are shown as sticks and colored as in Figure 2. (B) Mass spectrometry-based phosphorylation assay with UPF1-LSQ and derivatives varied in position -1. The peptide sequence is indicated in the upper left with the varying position marked as “X”. Error bars representing standard deviations calculated from independent experimental triplicates are shown and curves are colored according to hydrophobicity of the position -1 residue (Eisenberg et al., 1984). The most hydrophobic peptides are in blue while non-hydrophobics are in red. (C) Final time points of experiment shown in B. Peptides grouped and colored according to the location of the respective position -1 residue in UPF1 N- or C-terminus. Individual data points are shown as circles and error bars representing standard deviations are indicated. (D) Mass spectrometry-based phosphorylation assay with UPF1 N-terminus phosphorylation site 28 and the indicated position -1 variation. The peptide sequence is shown in the upper left with the varied position marked as “X”. Error bars represent standard deviations resulting from independent experimental triplicates. A tyrosine residue was added to the C-terminal end of the wildtype sequence to increase absorbance at = 280 nm.

In vitro phosphorylation of UPF1-LSQ and derivatives.

(A) Mass spectrometry-based end-point measurements of single peptides combined with a control. Data points corresponding to one experiment are shown in the same color. One preferred (LSQ) and two suboptimal (DSQ, KSQ) motifs were chosen to evaluate how the absence of more optimal, competing motifs affects phosphorylation efficiency toward single peptides. The results indicate that decreased phosphorylation of peptides with non-hydrophobic position −1 residues is not only caused by competition with more optimal substrates (compare assay shown in Figure 4B). A peptide with the sequence QPELDQDSYLG was used as a control in all three experiments. Reactions were quenched and analyzed after 80 min. The peptide sequence is shown with the varied position marked as ‘X’ (B) Mass spectrometry-based phosphorylation assay with UPF1-LSQ and derivatives varying in position +2. The peptide sequence is indicated with the varying position marked as ‘X’. Error bars representing standard deviations calculated from independent experimental triplicates are shown.

Alignment of UPF1 N- and C-terminal sequences from Homo sapiens, Bos taurus, Canis lupus, Mus musculus, Gallus gallus, Xenopus tropicalis, Danio rerio and Caenorhabditis elegans.

The alignment is colored according to conservation. (A) Sequence alignment of UPF1 N-terminus. SQ motifs are highlighted by orange boxes with the phosphorylation site position indicated on top. (B) As in A, but for UPF1 C-terminus and using red color to indicate SQ motifs. Location and range of UPF1-LSQ are indicated. Interestingly, further analysis of the final time points in the time course phosphorylation experiment showed that the SQ motifs that carry rather hydrophobic residues at the −1 position (and are therefore more efficiently phosphorylated) reside exclusively in the <span class="Gene">UPF1 C-terminus (Figure 4C, Figure 4—figure supplement 2). To validate our hypothesis on the importance of position −1 hydrophobicity, we turned to a known phospn>horylation site in the N-terminus, <span class="Chemical">Thr28. We tested whether SMG1-8-9 phosphorylation activity toward this motif could be enhanced by changing the residue at position −1 from the naturally occurring Asp to a Leu. Indeed, mutating the residue upstream of this SQ motif (Asp27Leu) resulted in a gain-of-function effect in the phosphorylation assay (Figure 4D). These findings are in agreement with data for ATM and ATR (Kim et al., 1999), although the residues involved in the recognition of UPF1 Leu1077 have diverged, suggesting that the details of −1 recognition will differ in other PIKKs. Finally, we do not observe extensive interactions between SMG1 and the peptide residues preceding or following the LSQ motif in our structure. Consistently, we did not detect a marked effect on phosphorylation in a time course experiment where we changed the residue at position +2 of UPF1-LSQ (Figure 4—figure supplement 1B).

Conclusions

In this manuscript, we report the first structure of a substrate-bound PIKK active site, thereby revealing the basis for phospn>horylation site selection by SMG1 and other PIKK family members. The results elucidate the mechanism of phospho-acceptor recognition, and explain the specificity for Ser-containing substrates with a glutamine downstream residue at position +1 and an upstream hydrophobic residue at position −1 (particularly Leu). These findings can be extrapolated to other PIKK members, such as ATM and ATR, and suggest a specific mechanism for PRD function by acting as a pseudosubstrate. Our results provide molecular insights into a key step of the NMD pathway. Whether phosphorylation of full-length UPF1 by SMG1 involves additional elements of recognition and/or additional levels of regulation will be a subject for future studies.

Materials and methods

Protein expression and purification

The SMG1-SMG8-SMG9 complex was expressed and purified as previously described (Gat et al., 2019). Briefly, a pool of HEK293T cells (obtained from ATCC) stably expressing full length SMG1 (N-terminally fused to a TwinStrep-tag and a 3C protease cleavage site), SMG8 and SMG9 was established using the piggybac method by initially transfecting the cells using polyethylenimine (Yusa et al., 2011; Li et al., 2013). The source cells were authenticated by genotyping (Eurofins) and tested negative for mycoplasma contamination (LookOut Mycoplasma PCR Detection Kit, Sigma-Aldrich). For SMG1-SMG8-SMG9 expression, cultures were adjusted to a density of 1 × 106 cells per mL in FreeStyle 293 Expression Medium (Gibco, Thermo Fisher Scientific). The cells were induced by addition of doxycycline and were harvested 48 hr after induction. After lysis by douncing the cells in 1xPBS, 1 mM MgCl2 and 1 mM DTT supplemented with DNase I, Benzonase and EDTA-free cOmplete Protease Inhibitor Cocktail (Roche) the cleared lysate was applied to a StrepTrap HP column (Sigma-Aldrich) and the complex affinity purified using the N-terminal TwinStrep-tag on SMG1. After washing with 50 column volumes of 1xPBS, 1 mM MgCl2 and 1 mM DTT the complex was eluted using wash buffer supplemented with 2.5 mM desthiobiotin. SMG1-8-9 was further purified by size-exclusion chromatography using a Superose 6 Increase 10/300 GL column (Sigma-Aldrich) equilibrated with 1xPBS, 1 mM MgCl2 and 1 mM DTT (Aekta purifier FPLC system, GE Healthcare). Purified SMG1-8-9 was concentrated up to 6 μM and stored in gel filtration buffer. To obtain full-length unphosphorylated human UPF1, the protein was expressed in Escherichia coli BL21 STAR (DE3) pRARE fused to a C-terminal 6xHis-tag cleavable with Tobacco etch virus (TEV) protease, as described before (Chakrabarti et al., 2011; Chakrabarti et al., 2014). Bacteria were grown at 37°C in TB medium shaking at 180 rpm and induced using IPTG at an OD of 2 for overnight expression at 18°C. Harvested bacteria (6000 rpm, 10 min) were lysed by sonication in lysis buffer (50 mM Tris-Cl pH 7.5, 500 mM NaCl, 10 mM Imidazole, 1 mM β-mercaptoethanol, 10% (v/v) glycerol, 2 mM MgCl2 and 0.2% (v/v) NP-40) supplemented with DNase I and EDTA-free cOmplete Protease Inhibitor Cocktail (Roche). The lysate was cleared by centrifugation (25.000 rpm, 30 min), filtered and combined with TALON resin (Takara) equilibrated with lysis buffer for gravity-flow affinity purification. After washing with 70 column volumes of lysis buffer, the protein was eluted with lysis buffer supplemented with 300 mM imidazole pH 7.5 and the eluate was combined with His-tagged TEV protease and dialyzed overnight against 20 mM HEPES pH 7.5, 85 mM KCl, 1 mM MgCl2, 10% (v/v) glycerol and 2 mM DTT. The dialyzed sample was passed over another TALON column by gravity-flow, in order to separate cleaved protein from the cleaved-off His-tag, the His-tagged TEV protease and uncleaved UPF1 protein. The flow-through of this column contained cleaved UPF1 and was loaded on a HiTrap Heparin HP column (GE Healthcare). Following binding and washing with Heparin buffer A (as for dialysis), UPF1 was eluted by a gradient increasing salt concentration from 85 mM to 500 mM over 50 column volumes (Aekta prime FPLC system, GE Healthcare). The peak corresponding to full-length UPF1 was pooled and concentrated before a final sizing step using a Superdex 200 Increase 10/300 GL column (Sigma-Aldrich) equilibrated with Heparin buffer A (Aekta purifier FPLC system, GE Healthcare). Purified full-length UPF1 was pooled and concentrated up to 30 μM using an Amicon Ultra Centrifugal Filter (50 kDa MWCO, Merck). All described protein purification steps were carried out at 4°C and all purified proteins were flash frozen in size-exclusion buffer using liquid nitrogen and stored at −80°C until further usage.

Cryo-EM sample preparation and data collection

A sample of 0.5 μM (final concentration) purified <span class="Gene">SMG1-<span class="Gene">SMG8-SMG9 was mixed with 0.5 mM of UPF1-LSQ, 1 mM AMPPNP, 2 mM MgCl2, 2 mM DTT and 0.04% (v/v) n-octyl-beta-D-glucoside in 1xPBS and incubated for 30 min on ice. The UPF1-LSQ peptide (sequence: QPELSQDSYLG) was synthesized in-house as described for the mass spectrometry-based phosphorylation assay. A 4 μL sample was applied to a glow-discharged Quantifoil R1.2/1.3, Cu 200 mesh grid and incubated for 30 s at 4°C and approximately 100% humidity. Grids were subsequently plunge frozen directly after blotting using a liquid ethane/propane (37% ethane, temperature range when plunging: −170°C to −180°C) mixture and a ThermoFisher FEI Vitrobot IV set to a blot time of 3.5 s and a blot force of 4. Cryo-EM data were collected using a ThermoFisher FEI Titan Krios microscope operated at 300 kV equipped with a post-column GIF (energy width 20 eV) and a Gatan K3 camera operated in counting mode, the SerialEM software suite, and a beam-tilt based multi-shot acquisition scheme. Movies were recorded at a nominal magnification of 81.000x corresponding to a pixel size of 1.094 Å at the specimen level. The sample was imaged with a total exposure of 68.75 e-/Å2 evenly spread over 5.5 s and 79 frames. The target defocus during data collection ranged between −0.8 and −2.9 μm.

Cryo-EM data processing

Data processing was carried out using RELION 3.0 (Zivanov et al., 2018) unless stated otherwise. Beam-induced sample motions were corrected and dose-weighting was carried out using the RELION implementation of MotionCor2 (Zheng et al., 2017). Particles were picked using Gautomatch (https://www.mrc-lmb.cam.ac.uk/kzhang/Gautomatch/) and CTF estimation was done using the RELION wrapper for CtfFind4.1 (Rohou and Grigorieff, 2015). After extraction (box size: 320 pix, 1.094 Å/pix) and downsampling (box size: 80 pix, 4.376 Å/pix), 4,368,586 particles were submitted to two rounds of reference-free 2D classification. A subset of the cleaned candidate particles was used to generate an initial model using CryoSPARC v2 (Punjani et al., 2017). 1,524,355 selected particles were 3D classified before re-extracting 886,714 particles with original sampling followed by two additional rounds of 3D classification resulting in 481,754 final particles. All classification steps were carried out with the total amount of particles being distributed over multiple batches. After 3D auto-refinement, sharpening (b-factor = −119.5) and Ctf refinement in RELION 3.0, the final refined map (3D auto-refinement) was again submitted to RELIONs’ post-processing routine for automatic B-factor weighting and high-resolution noise substitution (b-factor = −102.6). The final reconstruction (EMD-11063) reached an overall resolution of 2.9 Å with local resolution ranging from 2.8 Å to 4.5 Å as estimated by RELION 3.0.

Model building and refinement

The reconstructed density was interpreted using COOT (version 1.0) and our previously published model of SMG1-8-9 (PDB: 6SYT) (Emsley et al., 2010). Model building was iteratively interrupted by real-space refinements using PHENIX (version 1.17) (Adams et al., 2010; Liebschner et al., 2019). Statistics assessing the quality of the final model (PDB ID 6Z3R) were generated using Molprobity (Chen et al., 2010; Supplementary file 1). FSC curves were calculated using PHENIX and the 3D FSC online application (Tan et al., 2017). Images of the calculated density and the built model were prepared using UCSF Chimera (Pettersen et al., 2004), UCSF ChimeraX (Goddard et al., 2018) and PyMOL (version 2.3.2).

Radioactive in vitro kinase assay

In vitro kinase assays were essentially carried out as before (Gat et al., 2019). 1 μM of full-length UPF1 was mixed with 50 nM SMG1-8-9, 10mM MgCl2 and 2mM DTT in 1xPBS. The reaction was started by adding 0.5mM ATP and 0.06 μM of -32P-labeled ATP. The reaction was incubated at 30°C and samples were taken at different time points to follow phosphorylation over time. The samples were immediately quenched by adding SDS-containing sample buffer and initially analyzed by SDS gel electrophoresis followed by Coomassie-staining. Phosphoproteins were subsequently detected using autoradiography and a Typhoon FLA7000 imager (GE Healthcare).

Mass spectrometry-based in vitro kinase assay

All peptides were synthesized in-house using solid-phase peptide synthesis and the quality of the product was assessed by electrospray ionization mass spectrometry (ESI MS). For the purpose of this study, peptides were dissolved in 1xPBS supplemented with 500 mM HEPES pH 7.4. Two types of experiments were carried out. Firstly, several peptides (typically comprising a library) and a control were mixed, and their individual phosphorylation ratios were determined at several time points (0, 10, 20, 40 and 80 min). Secondly, one end-point measurement experiment was carried out. In this setup, a single peptide was mixed with a control and the relative phosphorylation ratio was determined at a single, final time point (80 min). This type of experiment was used to assess whether effects on phosphorylation ratios observed in time course assays are caused by competition between several peptides for SMG1 binding. In both cases, 0.5 μM of kinase complex was combined with 0.1 mM of each peptide, 0.5 mM ATP, 20 mM MgCl2 and 2 mM DTT in 1xPBS. The reaction was started by addition of kinase and incubated at 30°C. Samples were taken at desired time points and immediately quenched after collection by adding EDTA to a final concentration of 50 mM on ice. In order to remove kinase complex and transfer the peptides into a compatible buffer, we made use of home-made StageTips (Rappsilber et al., 2007). Poly(styrenedivinylbenzene)copolymer (SDB-XC) was washed with methanol by centrifugation before being washed again with buffer B (0.1% (v/v) formic acid, 80% (v/v) acetonitrile). Buffer A (0.1% (v/v) formic acid) was used for equilibration of the SDB-XC material. Following sorbent equilibration, the sample was applied and the tips were washed using buffer A. Finally, the sample was eluted using buffer B. Using an Agilent 1290 HPLC, typically about 5 μL of the sample in 70% (v/v) acetonitrile and 0.05% (v/v) trifluoroacetic acid were flow-injected (250 μL/min) into a Bruker maXis II ETD mass spectrometer for ESI MS time-of-flight analysis. Peptides were ionized at a capillary voltage of 4500 V and an end plate offset of 500 V. Full scan MS spectra (200–1600 m/z) were acquired at a spectra rate of 1 Hz and a collision energy of 10 eV. All experiments were carried out as independent experimental triplicates. Raw data files were processed using Bruker Compass DataAnalysis software. The m/z spectra were deconvoluted by maximum entropy with an instrument resolving power of 10,000. The 12C peaks corresponding to individual peptides were identified in the resulting neutral spectra and integrated, both for masses accounting for unphosphorylated and phosphorylated peptides. To calculate a relative phosphorylation ratio, the area for phosphorylated peptide was divided by the sum of phosphorylated and unphosphorylated peptide. All time points were normalized to time point 0. Means of independent experimental triplicates and error bars indicating standard deviations were visualized using Prism (GraphPad). In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses. Acceptance summary: This study reports a high resolution structure using CryoEM of a PIKK family kinase in complex with a peptide substrate and an ATP analog. It is the first member of this family of protein kinases to be captured at atomic resolution with a peptide substrate. The structural interactions between enzyme and substrate were corroborated with very solid biochemical analysis including a range of kinase assays with various peptides. This work will serve as a model for related kinases including ATM, ATR, DNAPK, and mTOR. Decision letter after peer review: Thank you for submitting your article "Structure of substrate-bound SMG1-8-9 kinase complex reveals molecular basis for phospn>horylation specificity" for consideration by eLife. Your article has been reviewed by Reviewing/Senior Editor Philip Cole and three reviewers. The following individuals involved in review of your submission have agreed to reveal their identity: Nikolaus Grigorieff (Reviewer #1); Kacper Rogala (Reviewer #2); Michael B Yaffe (Reviewer #3). The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted t<span class="Chemical">his decision to help you prepare a revised submission. We would like to draw your attention to changes in our revision policy that we have made in response to <span class="Disease">COVID-19 (https://elifesciences.org/articles/57162). <span class="Chemical">Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below, although fairly numerous, are meant to address clarity and presentation. Summary: Langer et al. elucidate a cryo-EM structure of a human PIKK family member SMG-1-8-9 in complex with a short peptide substrate (from UPF1) and a non-hydrolyzable analog of ATP, AMPPNP. This is an extension of the cryo-EM work that they did in 2019 on apo SMG-1-8-9 (Gat et al., 2019). However, this study is in fact more interesting than the previous work because it captures the SMG1 kinase in a state with its substrate peptide bound, frozen in time just before the actual act of phosphorylation. Importantly, the authors do an extensive characterization of the peptide recognition sequence, and reveal the chemical compatibility of the LSQ sequence and its derivatives for phosphorylation by SMG1. They compare the binding site of SMG1 to other members of the PIKK family and explain the observed differences between them in terms of substrate recognition. Overall, we believe that this work is novel and interesting. In general, the claims are supported by solid data, and the structure is validated with substantial biochemical work. Although we do not think additional experimental work is needed, below we make the following suggestions for revisions to strengthen the manuscript. Essential revisions: -The structure and its interpretation seem plausible. However, we wonder if using an 11-amino acid peptide as a substrate captures all the contacts relevant for the kinase specificity that the authors are interested in. Is it possible that the full UPF1 molecule makes tertiary contacts with the kinase that are important for specificity? The authors should consider this possibility and explain why they can rule this out. Related to the above point, the authors state that "there are no extensive interactions between SMG1 and the residues preceding or following the LSQ motif in our structure." It is not clear how they can conclude this from their structure. If this is known from other work, they should cite it. -The authors seem to have side-stepped the other 3D classes that were generated during data processing. There are at least two classes (center top and center bottom in Figure 1—figure supplement 3) that show extra density that is not present in the final reconstruction presented. That density appears to extend from SMG8's stalk and reach towards the FRB of SMG1. According to Li et al., 2019), this extra density is likely the kinase inhibitory domain (KID) of SMG8. This density also appeared in the authors' previous map of the apo SMG-1-8-9 complex (EMDB 10348), but they avoided discussing it in their previous manuscript (Gat et al., 2019). If the authors feel that delving into this would be too speculative, adding a label and a note to point out what they represent would be helpful. -A discussion regarding nucleotide binding to SMG9 and its effects on SMG8 binding and SMG1 catalytic activity is lacking. The authors in their previous work revealed a GTP/GDP nucleotide binding pocket in C. elegans SMG1 (Li et al., 2017), and then realized that human SMG9 co-purifies with bound ATP instead of GTP (Gat et al., 2019). Can human SMG1 also associate with GTP/GDP? In Figure 1—figure supplement 4 of this manuscript, the authors talk extensively about the adaptations for adenine vs guanine binding. This topic appears to be rather confusing in the field, so please include a short discussion that deals with the nucleotide-binding matter, including the mutagenesis study from another SMG-1-8-9 structure paper by Li et al., 2019. -The methods are not sufficiently detailed, and it would be challenging to reproduce the authors' work by simply following them. -Figure 3A and Figure 3 legend, particularly the statement that sequence logo letter size reflects the frequency of occurrence is not technically correct. True sequence logos use bit scores that typically do not reflect the frequency of occurrence but rather indicate the information content (in bits) that is being provided by each residue in that position in terms of the informational entropy content in each position of a motif, at least as originally described by <span class="Gene">Tom Schneider and Mike Stephens in 1990. A good review of the concept is found in Crooks et al., 2004. (http://www.genome.org/cgi/doi/10.1101/gr.849004). The authors should clarify how they are using the sequence logos here. -Regarding the data in Figure 3B described in subsection “Crucial recognition of a glutamine residue at +1 position of the UPF1 consensus motif”, the claims of "peptide library" and "systematically" in the text go a bit beyond the data, since they are really looking at a small series of peptides, not a complete library of all residues in that position. The data is solid, and there is no need to do anything more, other than tone down the claims of this being a 'library'. We think the use of the term 'library' in Figure 4 and its description is fine, since in that case the authors really are looking at the set of all possible X-SQ sites that exist in UPF1. -The authors may want to expound a bit more on the similarity and differences in the hydrophobic cage between SMG1 and other PIKKs, particularly the Leu to Glu activation loop substitution seen in mTor (Figure 2—figure supplement 1B). This is important because the optimal kinase motif for mTor is NOT SQ as it is in ATM, ATR and DNA-PK, but rather more like SF or SP (see Figure 2 in Hsu et al., 2011). -Subsection “Preferred recognition of a leucine residue at -1 position of the UPF1 consensus motif” and Figure 4C- presumably the point the authors are making is that C-terminal SQ motifs are better phosphorylated than the N-terminal ones because they are better matches to the hydrophocis-Ser-Gln motif. Perhaps they can state this conclusion a bit more clearly. -Similarly, it appears from Figure 2C that the Gln side chain is actually helping stabilize the activation segment conformation. Is this correct, and if so, shouldn't this be alluded to somewhere in the Results section? -Regarding the catalytic mechanism, as the authors have a (not so common) ternary complex protein kinase crystal structure containing both a peptide substrate and a nucleotide substrate, it would be nice to say a bit more about the relevance to potential phospn>horyl transfer transition states. There has been a classical discussion in the field among physical organic chemists and enzymologists about whether phospn>horyl transfers involving monoesters like the γ phosphate of ATP are more associative or dissociative. High resolution structures have been used to inform these discussions. For examples, please see: PMID:9408938, PMID:21513457, PMID:25399640 Summary: Langer et al., elucidate a cryo-EM structure of a human PIKK family member SMG-1-8-9 in complex with a short peptide substrate (from UPF1) and a non-hydrolyzable analog of ATP, AMPPNP. This is an extension of the cryo-EM work that they did in 2019 on apo SMG-1-8-9 (Gat et al., 2019). However, this study is in fact more interesting than the previous work because it captures the SMG1 kinase in a state with its substrate peptide bound, frozen in time just before the actual act of phosphorylation. Importantly, the authors do an extensive characterization of the peptide recognition sequence, and reveal the chemical compatibility of the LSQ sequence and its derivatives for phosphorylation by SMG1. They compare the binding site of SMG1 to other members of the PIKK family and explain the observed differences between them in terms of substrate recognition. Overall, we believe that this work is novel and interesting. In general, the claims are supported by solid data, and the structure is validated with substantial biochemical work. Although we do not think additional experimental work is needed, below we make the following suggestions for revisions to strengthen the manuscript. Essential revisions: 1) The structure and its interpretation seem plausible. However, we wonder if using an 11-amino acid peptide as a substrate captures all the contacts relevant for the kinase specificity that the authors are interested in. Is it possible that the full UPF1 molecule makes tertiary contacts with the kinase that are important for specificity? The authors should consider this possibility and explain why they can rule this out. All the published in vivo and in vitro data at this point in time point to short phospn>horylation motifs in the UPF1 unstructured regions as the determinants for recognition. Consistently, our biochemical assays (Figure 3B and Figure 1—figure supplement 1C) show that the peptide we used in the structural analysis recapitulates the specificity of SMG1 towards UPF1 SQ phosphorylation motifs. So far, we have not been able to detect the involvement of any other region of UPF1 in our biochemical studies or cryo-EM data. However, we do not rule out that there may be additional (and possibly regulated) interactions with the UPF1 structural region that may play a role in the crowded cellular environment and in the context of large NMD complexes, and indeed we are currently pursuing these studies. We have clarified this by adding a sentence at the end of the Conclusions, stating: “Whether phosphorylation of full-length UPF1 by SMG1 involves additional elements of recognition and/or additional levels of regulation will be a subject for future studies.” 2) Related to the above point, the authors state that "there are no extensive interactions between SMG1 and the residues preceding or following the LSQ motif in our structure." It is not clear how they can conclude this from their structure. If this is known from other work, they should cite it. This statement was meant to describe the observations we make in our structure: The N- and C-terminal residues of the LSQ peptide do not show ordered density in our reconstruction, indicative of lack of extensive interactions with these residues (otherwise they would be ordered). This interpretation is supported by the assay in Figure 4—figure supplement 1B showing no significant effect of changing residues outside of the LSQ motif (in contrast to the prominent effect for changing residues within the motif itself). We have clarified that this is an observation by changing the text to: “we do not observe extensive interactions between SMG1 and the peptide residues preceding or following the LSQ motif in our structure. Consistently, we did not detect a marked effect on phosphorylation in a time course experiment where we changed the residue at position +2 of UPF1-LSQ (Figure 4—figure supplement 1B).” 3) The authors seem to have side-stepped the other 3D classes that were generated during data processing. There are at least two classes (center top and center bottom in Figure 1—figure supplement 3) that show extra density that is not present in the final reconstruction presented. That density appears to extend from SMG8's stalk and reach towards the FRB of SMG1. According to Li et al., 2019), this extra density is likely the kinase inhibitory domain (KID) of SMG8. This density also appeared in the authors' previous map of the apo SMG-1-8-9 complex (EMDB 10348), but they avoided discussing it in their previous manuscript (Gat et al., 2019). If the authors feel that delving into this would be too speculative, adding a label and a note to point out what they represent would be helpful. The described extra density indeed likely corresponds to the less well resolved, unmodelled C-terminal half of SMG8 (compare Figure 1A). We have now highlighted these classes and indicated a potential regulatory function in the figure caption (Figure 1—figure supplement 3). We refrain from further speculation on the precise function of this part of SMG8 in the context of the data presented here. 4) A discussion regarding nucleotide binding to SMG9 and its effects on SMG8 binding and SMG1 catalytic activity is lacking. The authors in their previous work revealed a GTP/GDP nucleotide binding pocket in C. elegans SMG1 (Li et al., 2017), and then realized that human SMG9 co-purifies with bound ATP instead of GTP (Gat et al., 2019). Can human SMG1 also associate with GTP/GDP? In Figure 1—figure supplement 4 of this manuscript, the authors talk extensively about the adaptations for adenine vs guanine binding. This topic appears to be rather confusing in the field, so please include a short discussion that deals with the nucleotide-binding matter, including the mutagenesis study from another SMG-1-8-9 structure paper by Li et al., 2019. The crystal structure of a C. elegans SMG8-SMG9 heterodimer purified from insect cells was initially solved as an apo-complex as well as bound to GDP following a crystal soaking experiment (Li et al., 2017). These soaking experiments had been guided by the overall G-domain fold of SMG9. More recently, mass spectrometry experiments using SMG1-8-9 as well as SMG8-9 complexes purified from HEK 293T cells revealed that SMG9 co-purifies bound to ATP, not with GTP (Gat et al. 2019). The higher resolution structural analysis carried out in the present manuscript confirms the presence of ATP in SMG8-SMG9 and explains the adaptions in the SMG9 G-fold making it indeed recognize an adenine base rather than a guanine base (Figure 1—figure supplement 4). Although uncommon, this is actually not the first instance where ATP binding to a G-fold domain has been observed. The speculation presented in the other SMG1-SMG8-SMG9 structure published last year (PMID: 31729466) was based on the (with hindsight wrong) assumption that SMG9 binds GTP from the earlier Li et al., paper (PMID: 28389433). While we would keep the majority of the details of ATP binding to the SMG9 G fold domain in the supplementary material (not to detract from the main take-home message of this paper), we have now added this paragraph: “The local resolution of around 3 Å allowed us to model SMG9-bound ATP in the reconstructed density, revealing the molecular basis for how the adenosine nucleotide is recognized by this unusual G-fold domain (Figure 1B and D, Figure 1—figure supplement 4). Briefly, the G4 and G5 motifs responsible for the recognition of the base have rearranged to preferentially bind an adenine base rather than a guanine (Figure 1—figure supplement 4).” We have no evidence that human SMG1 kinase can associate with GTP/GDP, and believe this is very unlikely. We would defer from discussing and speculating on the impact of ATP-bound SMG9 on SMG1 activity in this paper, as this is not what the focus of this paper is about. In this paper, we focus on substrate specificity. Understanding the regulation of catalytic activity is beyond the scope of this manuscript. 5) The methods are not sufficiently detailed, and it would be challenging to reproduce the authors' work by simply following them. We have expanded the respective Materials and methods section to improve clarity and included sufficient details to allow to reproduce t<span class="Chemical">his work. 6) Figure 3A and Figure 3 legend, particularly the statement that sequence logo letter size reflects the frequency of occurrence is not technically correct. True sequence logos use bit scores that typically do not reflect the frequency of occurrence but rather indicate the information content (in bits) that is being provided by each residue in that position in terms of the informational entropy content in each position of a motif, at least as originally described by <span class="Gene">Tom Schneider and Mike Stephens in 1990. A good review of the concept is found in Crooks et al., 2004. (http://www.genome.org/cgi/doi/10.1101/gr.849004). The authors should clarify how they are using the sequence logos here. We have clarified this and modified the sentence to more precisely describe the meaning of the sequence logo: "Sequence logo derived from an alignment of all SQ motifs present in human UPF1 with the respective residue positions indicated. The heights of single letters correspond to the observed frequency at that position and the overall height of a stack of letters indicates the level of conservation (Crooks et al., 2004)." 7) Regarding the data in Figure 3B described in subsection “Crucial recognition of a glutamine residue at +1 position of the UPF1 consensus motif”, the claims of "peptide library" and "systematically" in the text go a bit beyond the data, since they are really looking at a small series of peptides, not a complete library of all residues in that position. The data is solid, and there is no need to do anything more, other than tone down the claims of this being a 'library'. We think the use of the term 'library' in Figure 4 and its description is fine, since in that case the authors really are looking at the set of all possible X-SQ sites that exist in UPF1. We have changed this section to: "To validate the importance of this residue, we performed a mass spectrometry-based phosphorylation assay using a series of peptides based on UPF1-LSQ. We changed the residue at position +1 in the UPF1-LSQ peptide to test the effect of different side chain properties on phosphorylation." 8) The authors may want to expound a bit more on the similarity and differences in the hydrophobic cage between SMG1 and other PIKKs, particularly the Leu to Glu activation loop substitution seen in mTor (Figure 2—figure supplement 1B). This is important because the optimal kinase motif for mTor is NOT SQ as it is in ATM, ATR and DNA-PK, but rather more like SF or SP (see Figure 2 in Hsu et al., 2011). We have included two sentences to stress this aspect and make it more easily understandable: "While the geometry of the hydrophobic cage is highly similar between SMG1 and CtTel1ATM, it deviates in mTOR due to the described Leu to Glu substitution. Indeed, mTOR has been found to prefer small or non-polar residues at position +1 of its phosphorylation consensus motif (Hsu et al., 2011)." 9) Subsection “Preferred recognition of a leucine residue at -1 position of the UPF1 consensus motif” and Figure 4C- presumably the point the authors are making is that C-terminal SQ motifs are better phosphorylated than the N-terminal ones because they are better matches to the hydrophocis-Ser-Gln motif. Perhaps they can state this conclusion a bit more clearly. We have addressed t<span class="Chemical">his in the respn>ective sentence: "Interestingly, further analysis of the final time points in the time course phosphorylation experiment showed that the SQ motifs that carry rather hydrophobic residues at the -1 position (and are therefore more efficiently phosphorylated) reside exclusively in the <span class="Gene">UPF1 C-terminus (Figure 4C, Figure 4—figure supplement 2)." 10) Similarly, it appears from Figure 2C that the Gln side chain is actually helping stabilize the activation segment conformation. Is this correct, and if so, shouldn't this be alluded to somewhere in the Results section? We cannot conclude that there is a change in activation segment conformation upon substrate binding when comparing our reconstructions of apo- and substrate-bound SMG1 (see Figure 1—figure supplement 5). While the density of the substrate-bound active site appears more ordered compared to the apo reconstruction (Figure 1—figure supplement 5A vs. B), this is likely due to the overall increased quality of the map and we cannot objectively judge whether binding of Gln1079 stabilizes the conformation of the SMG1 hydrophobic cage. 11) Regarding the catalytic mechanism, as the authors have a (not so common) ternary complex protein kinase crystal structure containing both a peptide substrate and a nucleotide substrate, it would be nice to say a bit more about the relevance to potential phospn>horyl transfer transition states. There has been a classical discussion in the field among physical organic chemists and enzymologists about whether phospn>horyl transfers involving monoesters like the γ phosphate of ATP are more associative or dissociative. High resolution structures have been used to inform these discussions. For examples, please see: PMID:9408938, PMID:21513457, PMID:25399640 In the manuscript, we focus on phosphorylation motif recognition and tried avoiding making statements on detailed phosphoryl transfer mechanisms on purpose. The reason is that our cryo-EM structure was solved using a <span class="Chemical">phosphate-based buffer. T<span class="Chemical">his may lead to depletion of a good portion of free magnesium ions from the buffer solution and potentially alter the precise conformation of the nucleotide in the active site. With the experimental caveat in mind, we do not feel comfortable in making strong in-depth statements on the chemistry of phosphoryl transfer transition states. However, we have taken the comment on board, and now added a sentence to Figure 2—figure supplement 1 pointing out that: “The SMG1 kinase contains features that have previously been associated with a model preferring the presence of a "dissociative" transition state, such as a positive charge closely involved in β-phosphate coordination, namely K2155 (residue not shown) (Wang and Cole, 2014)."
Key resources table
Reagent type (species) or resourceDesignationSource or referenceIdentifiersAdditional information
Gene (Homo sapiens)SMG1Shigeo Ohno labUniprot Q96Q15
Gene (Homo sapiens)SMG8Shigeo Ohno labUniprot Q8ND04
Gene (Homo sapiens)SMG9Shigeo Ohno labUniprot Q9H0W8
Cell line (Homo sapiens)HEK293TATCC
Strain, strain background (Escherichia coli)BL21 Star (DE3) pRAREEMBL Heidelberg Core FacilityElectrocompetent cells
Peptide, recombinant proteinUPF1- LSQ (peptide 1078) and derivatives, UPF1- peptide 28 and derivativesin-house as described in the Materials and methods section
Chemical compound, drugAMPPNPSigma-Aldrich
Chemical compound, drugATPSigma-Aldrich
Software, algorithmRELIONdoi: 10.7554/eLife.42166RELION 3.0
Software, algorithmCryosparcdoi: 10.1038/nmeth.4169Cryosparc2
Software, algorithmCtfFinddoi: 10.1016/j.jsb.2015.08.008CtfFind4.1.9
Software, algorithmCryosparcdoi: 10.1038/nmeth.4169Cryosparc2
Software, algorithmUCSF ChimeraUCSF, https://www.cgl.ucsf.edu/chimera/
Software, algorithmUCSF ChimeraXUCSF, https://www.rbvi.ucsf.edu/chimerax/
Software, algorithmCOOThttp://www2.mrc-lmb.cam.ac.uk/personal/pemsley/coot/
Software, algorithmPHENIXhttps://www.phenix-online.org/PHENIX 1.17
Software, algorithmPyMOLPyMOL Molecular Graphics System, Schrodinger LLCPyMOL 2.3.2https://www.pymol.org/
  49 in total

Review 1.  ATM, ATR, and DNA-PK: The Trinity at the Heart of the DNA Damage Response.

Authors:  Andrew N Blackford; Stephen P Jackson
Journal:  Mol Cell       Date:  2017-06-15       Impact factor: 17.970

Review 2.  Nonsense-Mediated mRNA Decay Begins Where Translation Ends.

Authors:  Evangelos D Karousis; Oliver Mühlemann
Journal:  Cold Spring Harb Perspect Biol       Date:  2019-02-01       Impact factor: 10.005

3.  Utilization of oriented peptide libraries to identify substrate motifs selected by ATM.

Authors:  T O'Neill; A J Dwyer; Y Ziv; D W Chan; S P Lees-Miller; R H Abraham; J H Lai; D Hill; Y Shiloh; L C Cantley; G A Rathbun
Journal:  J Biol Chem       Date:  2000-07-28       Impact factor: 5.157

4.  A hyperactive piggyBac transposase for mammalian applications.

Authors:  Kosuke Yusa; Liqin Zhou; Meng Amy Li; Allan Bradley; Nancy L Craig
Journal:  Proc Natl Acad Sci U S A       Date:  2011-01-04       Impact factor: 11.205

5.  Cloning of a novel phosphatidylinositol kinase-related kinase: characterization of the human SMG-1 RNA surveillance protein.

Authors:  G Denning; L Jamieson; L E Maquat; E A Thompson; A P Fields
Journal:  J Biol Chem       Date:  2001-04-30       Impact factor: 5.157

Review 6.  mTOR Signaling in Growth, Metabolism, and Disease.

Authors:  Robert A Saxton; David M Sabatini
Journal:  Cell       Date:  2017-03-09       Impact factor: 41.582

7.  Phosphorylation of hUPF1 induces formation of mRNA surveillance complexes containing hSMG-5 and hSMG-7.

Authors:  Tetsuo Ohnishi; Akio Yamashita; Isao Kashima; Thomas Schell; Kirk R Anders; Andrew Grimson; Takahisa Hachiya; Matthias W Hentze; Philip Anderson; Shigeo Ohno
Journal:  Mol Cell       Date:  2003-11       Impact factor: 17.970

8.  Addressing preferred specimen orientation in single-particle cryo-EM through tilting.

Authors:  Yong Zi Tan; Philip R Baldwin; Joseph H Davis; James R Williamson; Clinton S Potter; Bridget Carragher; Dmitry Lyumkis
Journal:  Nat Methods       Date:  2017-07-03       Impact factor: 28.547

9.  InsP6 binding to PIKK kinases revealed by the cryo-EM structure of an SMG1-SMG8-SMG9 complex.

Authors:  Yair Gat; Jan Michael Schuller; Mahesh Lingaraju; Elisabeth Weyher; Fabien Bonneau; Mike Strauss; Peter J Murray; Elena Conti
Journal:  Nat Struct Mol Biol       Date:  2019-12-02       Impact factor: 15.369

10.  mTOR kinase structure, mechanism and regulation.

Authors:  Haijuan Yang; Derek G Rudge; Joseph D Koos; Bhamini Vaidialingam; Hyo J Yang; Nikola P Pavletich
Journal:  Nature       Date:  2013-05-01       Impact factor: 49.962

View more
  7 in total

1.  Autophosphorylation transforms DNA-PK from protecting to processing DNA ends.

Authors:  Lan Liu; Xuemin Chen; Jun Li; Huaibin Wang; Christopher J Buehl; Noah J Goff; Katheryn Meek; Wei Yang; Martin Gellert
Journal:  Mol Cell       Date:  2021-12-21       Impact factor: 17.970

2.  Structure of the human ATM kinase and mechanism of Nbs1 binding.

Authors:  Christopher Warren; Nikola P Pavletich
Journal:  Elife       Date:  2022-01-25       Impact factor: 8.140

Review 3.  No-nonsense: insights into the functional interplay of nonsense-mediated mRNA decay factors.

Authors:  Justine Mailliot; Mirella Vivoli-Vega; Christiane Schaffitzel
Journal:  Biochem J       Date:  2022-05-13       Impact factor: 3.766

4.  Structures of Mec1/ATR kinase endogenously stimulated by different genotoxins.

Authors:  Qingjun Zhang; Po Wang; Tengwei Wu; Yueyue Zhang; Zexuan Zheng; Shangzhi Zhou; Dong Qian; Xuejuan Wang; Gang Cai
Journal:  Cell Discov       Date:  2022-09-29       Impact factor: 38.079

5.  Structure of substrate-bound SMG1-8-9 kinase complex reveals molecular basis for phosphorylation specificity.

Authors:  Lukas M Langer; Yair Gat; Fabien Bonneau; Elena Conti
Journal:  Elife       Date:  2020-05-29       Impact factor: 8.140

Review 6.  Structural Insights into TOR Signaling.

Authors:  Lucas Tafur; Jennifer Kefauver; Robbie Loewith
Journal:  Genes (Basel)       Date:  2020-08-04       Impact factor: 4.096

7.  Cryo-EM reconstructions of inhibitor-bound SMG1 kinase reveal an autoinhibitory state dependent on SMG8.

Authors:  Lukas M Langer; Fabien Bonneau; Yair Gat; Elena Conti
Journal:  Elife       Date:  2021-10-26       Impact factor: 8.140

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.