| Literature DB >> 25483036 |
Ajla Hrle1, Lisa-Katharina Maier, Kundan Sharma, Judith Ebert, Claire Basquin, Henning Urlaub, Anita Marchfelder, Elena Conti.
Abstract
Upon pathogen invasion, bacteria and archaea activate an RNA-interference-like mechanism termed CRISPR (clustered regularly interspaced short palindromic repeats). A large family of Cas (CRISPR-associated) proteins mediates the different stages of this sophisticated immune response. Bioinformatic studies have classified the Cas proteins into families, according to their sequences and respective functions. These range from the insertion of the foreign genetic elements into the host genome to the activation of the interference machinery as well as target degradation upon attack. Cas7 family proteins are central to the type I and type III interference machineries as they constitute the backbone of the large interference complexes. Here we report the crystal structure of Thermofilum pendens Csc2, a Cas7 family protein of type I-D. We found that Csc2 forms a core RRM-like domain, flanked by three peripheral insertion domains: a lid domain, a Zinc-binding domain and a helical domain. Comparison with other Cas7 family proteins reveals a set of similar structural features both in the core and in the peripheral domains, despite the absence of significant sequence similarity. T. pendens Csc2 binds single-stranded RNA in vitro in a sequence-independent manner. Using a crosslinking - mass-spectrometry approach, we mapped the RNA-binding surface to a positively charged surface patch on T. pendens Csc2. Thus our analysis of the key structural and functional features of T. pendens Csc2 highlights recurring themes and evolutionary relationships in type I and type III Cas proteins.Entities:
Keywords: CRISPR; CRISPR, Clustered regulatory short interspaced palindromic repeats; Cas, CRISPR-associated; Cas7; H1 and H2 and H1-2, β-hairpins of insertion domain 1 (or lid domain); Mk, Methanopyrus kandleri; RAMP, Repeat associated mysterious protein; RNA binding; RNAi, RNA interference; RRM domain; RRM, RNA recognition motif; Rmsd, Root mean square deviation; SAD, Single-wavelength anomalous dispersion; Ss, Sulfolobus solfataricus; Tp, Thermofilum pendens; crRNA, CRISPR RNA; dCASCADE, interference complex subtype I-D; eCASCADE, interference complex subtype I-E; prokaryotic immune system
Mesh:
Substances:
Year: 2014 PMID: 25483036 PMCID: PMC4615900 DOI: 10.4161/rna.29893
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652
Data Collection and Structure Refinement Statstics of Tp Csc2
| Native | SeMet | |
| Space group | P 2 21 21 | P 2 21 21 |
| Unit cell (Å)a | a = 60.47 | a = 60.81 |
| Resolution range (Å)a | 46.22–1.82 (1.88–1.82) | 48.68–2.37 (2.46–2.37) |
| Unique reflectionsa | 50416 (7188) | 23518 (2402) |
| I/σ (I)a | 17.8 (1.6) | 31.9 (6.4) |
| Multiplicitya | 6.5 (6.0) | 13.1 (12.6) |
| Rmerge (%)a | 6.7 (97.7) | 7.3 (43.4) |
| CC(1/2) (%)a | 99.9 (50.5) | 99.9 (95.4) |
| Average B-factor | 32.70 | 34.28 |
| Rwork (%) | 18.15 (31.75) | 20.85 (24.14) |
| Rfree (%) | 21.21 (34.64) | 23.72 (25.13) |
| Rmsd bonds (Å) | 0.017 | 0.004 |
| Rmsd angles (°) | 1.36 | 0.789 |
| Ramachandran favored (%) | 97.0 | 96.7 |
| Ramachandran outliers (%) | 0.0 | 0.0 |
Values in parentheses correspond to the highest resolution shell; SeMet: Selenomethionine derivatized protein.
Figure 1.Crystal Structure of Thermofilum pendens Csc2. (A) Structure of Tp Csc2 can be divided into four distinct domains: a core domain (green), a lid domain (insertion 1, blue), a metal-binding domain (insertion 2, red) and a helical domain (insertion 3, yellow). Secondary structure elements of the core adopt a ferredoxin-like fold with β-α-β-β-α-β arrangement. Multiple insertions within the core define the accessory domains. Dashed lines indicate the disordered loops. The inset shows a detailed view of the zinc ion (gray sphere) with coordinating residues. (B) Topology diagram of TpCsc2. α-Helices are represented as circles and β-strands arrows. The secondary structure elements have been labeled numerically maintaining the nomenclature of RRM domains. The hairpins of insertion domain 1 are labeled as described in the text (H1, H2 and H1–2). The α-helices of in the insertion domains are labeled with letters (αA to αH).
Figure 2.Structure-based sequence alignment of Tp Csc2. The alignment includes four sequences from representative species of the Csc2 family, based on a comprehensive alignment. Secondary structure elements are indicted by the cartoon above the sequences, color-coded and labeled according to Figure 1A. Colors represent the percentage of sequence identity (dark > 60%, light 60–30%). U15 cross-linked residues are highlighted with yellow dots. Blue dots above the K179 and R183 mark the mutated amino acids, brackets indicate the boundaries of the sequence spanning (P197-L214), which was replaced by (GS)3 (Δloop mutant).
Figure 3.Structural comparison of Cas7 proteins. (A) Topology diagrams of Tp Csc2, Ss Cas7 and Mk Csm3 highlight the high structural conservation within the core RRM-like fold (boxed in gray) and show the connectivity of the insertion domains. The topological arrangement of the insertions 1–3 is similar in all proteins. Variations within secondary structure elements of the three proteins reflect subtype specificities. (B) Crystal structures of Cas7 orthologs, Tp Csc2, Ss Csa2, Mk Csm3, depicted according to the orientation in Figure 1A after optimal superposition of their RRM-like domains. The molecules are overall colored in gray. Significant structural similarities are colored according to the color-code of the respective proteins, Tp Csc2 (salmon), Ss Csa2 (orange), Mk Csm3 (blue). Numbers (1–5) refer to the significant structural elements discussed in the text. Dashed lines indicate the structurally unresolved loops. (C) Boxes highlight the structurally and sequence-conserved basic residues along β2 and the preceding insertion.
Figure 4.For figure legend, see page 1078.Figure 4 (See previous page). Mapping the RNA-binding surface of Thermofilum pendens Csc2. (A) Electrophoretic mobility shift assays (EMSA) with wild-type Tp Csc2. Left panel: EMSA were performed with P -5′-end labeled poly(U)15 or poly(U)15 RNAs and increasing concentrations of Tp Csc2 (0, 5, 25, 50μM). The positions of the free RNA probe (arrow head) and of the RNA-bound complexes (asterisks) are shown on the right. Right panel: EMSA assay with Tp Csc2 and P -5′-end labeled crRNA. (B) MS/MS mass spectra of Tp Csc2 peptides, carrying an additional mass corresponding to one (panel two and three) or two (panel one) uracil nucleotides associated with the respective amino-acid. Peptide sequence and the fragment ions are indicated on top. The direcly crosslinked residues are colored yellow. The peptide fragmentation occurs with the cleavage of amide bonds resulting in b-ions and y-ions when the charge is retained by the N-terminal and C-terminal fragments, respectively. #, #,1 #2 and #3 indicate the b- and y-ions that were observed with a mass shift corresponding to U’, U-H3PO4, U-H2O and U, respectively. IM: Immonium ions. U’: U marker ion adduct of 112.0273 Da. (C) Mapping RNA-binding properties on the Tp Csc2 crystal structure. Upper panel: a cartoon representation of Tp Csc2 is shown in gray (in the same orientation as in Figure 1A) with the crosslinked residues colored in yellow (stick representation) and regions targeted for mutagenesis colored in blue (K178E/R183E and Δloop, indicated with scissors, stick representation). Lower panel: surface representation of Tp Csc2 (in the same orientation as in panel C) depicting the electrostatic potential (red for electronegative and blue for electropositive). (D) Quantitative measurements of RNA-binding affinities. Upper panel: 13% SDS-PAGE with the wild-type (WT) an mutant proteins used in the fluorescence anisotropy (FA) assay and a table with the Kd values obtained. The Δloop mutant was engineered by replacing the segment between Pro197 and Leu214 with a (GS)3 sequence. Lower panel: FA measurements of WT and mutant Tp Csc2 with a 5′-6-carboxy-fluorescein-labeled poly(U)15 -RNA.
Figure 5.A structurally and functionally conserved surface groove in the Cas7 protein family. Upper panels: cartoon representation of the structures of the Cas7-like proteins Tp Csc2 (salmon), Ss Csa2 (orange), Mk Csm3 (blue) (in the same orientations as in Figure 3B). Lower panels: corresponding surface representations showing the electrostatic potential (red for electronegative and blue for electropositive). For all proteins positively charged patches are present at the interface between the core and insertion domain 2 (identified with a circle). Conserved lysines and arginines contribute to these patches (Fig. 2) and in Tp Csc2 (arrows point to pink residues) are involved in RNA binding (Fig. 4). Residues reported to have an effect on RNA binding in Ss Csa2 and Mk Csm3 (arrows point to pink residues) are located within positively charged surfaces of the respective lid domains.