| Literature DB >> 34580310 |
Arik Shams1, Sean A Higgins1,2,3, Christof Fellmann1,4,5, Thomas G Laughlin1,6, Benjamin L Oakes1,2,3, Rachel Lew4, Shin Kim1,2, Maria Lukarska1,2, Madeline Arnold1, Brett T Staahl1,2,3, Jennifer A Doudna1,2,4,7,8,9,10,11, David F Savage12,13.
Abstract
Proteins evolve through the modular rearrangement of elements known as domains. Extant, multidomain proteins are hypothesized to be the result of domain accretion, but there has been limited experimental validation of this idea. Here, we introduce a technique for genetic minimization by iterative size-exclusion and recombination (MISER) for comprehensively making all possible deletions of a protein. Using MISER, we generate a deletion landscape for the CRISPR protein Cas9. We find that the catalytically-dead Streptococcus pyogenes Cas9 can tolerate large single deletions in the REC2, REC3, HNH, and RuvC domains, while still functioning in vitro and in vivo, and that these deletions can be stacked together to engineer minimal, DNA-binding effector proteins. In total, our results demonstrate that extant proteins retain significant modularity from the accretion process and, as genetic size is a major limitation for viral delivery systems, establish a general technique to improve genome editing and gene therapy-based therapeutics.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34580310 PMCID: PMC8476515 DOI: 10.1038/s41467-021-25992-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Minimization by iterative size exclusion and recombination (MISER).
A MISER library construction. A 6-bp SpeI or NheI recognition site is inserted separately into a dCas9-encoding plasmid flanked by BsaI sites using plasmid recombineering. The resultant libraries are digested with BsaI and either SpeI or NheI, and the two fragment pools are combined and ligated together to generate a library of dCas9 ORFs possessing all possible deletions. B The MISER library is cloned into a vector and co-transformed in E. coli expressing RFP and GFP with a sgRNA targeting GFP. The library products are expressed, functional variants bind to the target, and repress the fluorophore. Repression activity in vivo is measured by flow cytometry. C Enrichment map of the MISER deletion landscape of S. pyogenes dCas9. A single pixel within the map represents an individual variant that contains a deletion beginning where it intersects with the horizontal axis moving to the left (N) and ends where it intersects with the axis moving to the right (C). Larger deletions are at the top, with some deletions almost spanning the whole protein. The heatmap shows relative repression activity of variants from two FACS sorts of a single replicate. The map is a composite of Slice 4 and Slice 5 in Supplementary Fig. 3A, B, which present variant ratios post- versus pre-FACS sorting.
Fig. 2Cas9 tolerates whole-domain deletions while maintaining target-binding activity.
A In vivo transcription repression activity of MISER-dCas9 variants with specified amino acids deleted, targeting either GFP (left) or RFP (right). dCas9s with REC2, REC3, HNH, or RuvC domain deletions have near-WT binding activity when targeted to GFP. When targeted to RFP, ΔREC2, and ΔREC3 show less robust binding activity. Data are normalized to vector-only control representing maximum fluorescence. Data are plotted as mean ± SD from biological triplicates. B Schema showing cloned MISER constructs with individual domain deletions corresponding to tolerated regions found in MISER screen. C Bio-layer interferometry (BLI) assay of MISER constructs. ΔREC2 and ΔREC3 exhibit weak binding against a fully complementary dsDNA target, while ΔHNH and ΔRuvC show intermediate binding. Binding is rescued to near-WT levels in ΔREC2 and ΔREC3, although at a slower rate, when the dsDNA contains a 3-bp bubble in the PAM-proximal seed region. Data are normalized to dCas9 binding to fully complementary dsDNA. D U-251 cells stably expressing the indicated MISER-dCas9 or WT-dCas9 KRAB fusion. Proteins were transduced with mCherry-tagged lentiviral vectors expressing sgRNAs targeting essential genes (sgPCNA, sgRPA1) or nontargeting controls (sgNT). At Day 2 post transduction, cells were mixed with the respective parental population; mCherry fluorescence was monitored over time. Data represent the mean and SD of triplicates (n = 3). Significance in cell depletion was assessed by comparing samples to their respective Day 2 controls using unpaired, two-tailed t tests (α = 0.01). E Measurement of CRISPRi efficacy of single-deletion MISER constructs in mammalian U-251 cells using RT-qPCR. U-251 cells were stably transduced with lentiviral vectors encoding dCas9 or MISER constructs fused with a KRAB repressor, along with lentivirus expressing sgRNA targeting PCNA. Cells were harvested 2 (left panel) or 5 (right) days post transduction of the sgRNA and assayed for PCNA expression. Bar graphs represent fold change of PCNA expression relative to a nontargeting sgRNA. Data presented as mean and SD (for triplicates where shown). Source data are provided as a Source Data file.
Fig. 3Stacking multiple domain deletions on Cas9 results in defective DNA-binding activity.
A In vivo transcription repression activity of MISER CRISPR effectors containing triple (Δ3CE) and quadruple (Δ4CE) deletion variants. Sublibraries of REC2, REC3, HNH, and RuvC were combined to build a library of stacked deletions, and the resulting library was assayed for high-performing variants using FACS (light blue bars). As none of the variants contained a REC2 deletion (~Δ167–307), we named the highest-performing triple-deletion variant in this library (Library 2; see Supplementary Fig. 6) Δ3CE. To force a library containing REC2 deletions, a sublibrary of REC2 deletions was added to Δ3CE, resulting in a library of quadruple deletion variants that contain Δ3CE and a REC2 deletion (dark blue bars). Data are plotted as mean ± SD from biological triplicates. B Expression constructs for Δ3CE and Δ4CE, with specified deletions manually cloned in. C BLI assay of CE constructs. Δ3CE and Δ4CE exhibit almost no binding against a fully complementary dsDNA target at 300 nM RNP (see Supplementary Fig. 10); and weak binding at 1000 nM RNP. Binding is rescued to near-WT levels when RNP concentration is 3.3× that of dCas9 if the dsDNA contains a 3-bp bubble in the PAM-proximal seed region. Data are normalized to 300 nM dCas9 binding to fully complementary dsDNA. D Measurement of CRISPRi efficacy of Δ3CE and Δ4CE in U-251 cells using RT-qPCR. Fold change in PCNA expression levels is measured by RT-qPCR, 2 and 5 days after KRAB-Δ3CE and KRAB-Δ4CE expressing cell lines are transduced with a sgRNA targeting PCNA. Δ3CE and Δ4CE exhibit weak DNA binding and transcriptional repression activity compared to dCas9. Bars represent the fold change of PCNA expression relative to a nontargeting sgRNA. Data are presented as mean and SD (for triplicates where shown). Source data are provided as a Source Data file.
Fig. 4Density map of Δ4CE compared to WT SpCas9.
A Single-particle cryo-electron microscopy was used to obtain a density map of the dsDNA-bound RNP complex of the Δ4CE construct at an overall resolution of 6.2 Å (EMD-22518). The light gray volume shows the Δ4CE density overlaid onto RNA–DNA hybrid R-loop (red and blue) and structure of WT SpCas9 (PDB 5Y36). The cartoon model corresponds to the WT SpCas9 structure, showing only the remaining residues and corresponding domains after the REC2, REC3, HNH, and RuvC deletions from the Δ4CE construct are manually removed from the model. Deletion termini are labeled with the distances between termini. B Density of Δ4CE cryo-EM overlaid with the WT SpCas9 clearly shows volumes representing dsDNA target and the sgRNA stem loop (black boxes). The red mesh represents the total WT SpCas9 density from EMD-8236.