| Literature DB >> 26644285 |
Abba Malina1, Christopher J F Cameron2, Francis Robert1, Mathieu Blanchette2, Josée Dostie1,3, Jerry Pelletier1,3,4.
Abstract
In CRISPR-Cas9 genome editing, the underlying principles for selecting guide RNA (gRNA) sequences that would ensure for efficient target site modification remain poorly understood. Here we show that target sites harbouring multiple protospacer adjacent motifs (PAMs) are refractory to Cas9-mediated repair in situ. Thus we refine which substrates should be avoided in gRNA design, implicating PAM density as a novel sequence-specific feature that inhibits in vivo Cas9-driven DNA modification.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26644285 PMCID: PMC4686818 DOI: 10.1038/ncomms10124
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Figure 1CRISPR-Cas9 genome editing is inhibited by increasing target site PAM density.
(a) Schematic representation of engineered target sites tested for repair efficiency by CRISPR-Cas9. Blue highlight denotes the target site PAM, red highlights additional PAMs in the target site gRNA homology region, while underlined purple text denotes PAMs on the opposite (and non-homologous to the gRNA) strand. (b) TLR flow cytometry analysis. Shown are representative fluorescence scatter plots, with RFP+ cells (NHEJ repair events) versus GFP+ cells (HDR repair events) for the indicated cell lines. One plot is for cells transfected with Cas9 and a gRNA that targets the engineered site compared to a plot for the same cells transfected with Cas9 and gRNA that targets a upstream control site (VF2468). (c) Cumulative inhibitory fold-difference in Cas9-driven genome editing efficiency observed at the engineered target site for cell lines expressing TLR loci with increasing number of PAMs relative to the VF2468 control upstream site. N=4±s.e.m. (d) Cas9 protein levels cannot account for inhibitory fold-difference effects seen in the TLR assay. Expression of Cas9 for the various transfected ‘all-in-one' vectors. Shown is a representative western blot measuring FLAG-epitope expression of the Cas9 protein in the various TLR lines (eEF2 serves as a loading control). (e) Differences in gRNA levels cannot account for the inhibitory effects observed in the TLR assay. Northern blot of RNA extracted from the indicated cell lines, each transfected with donor plasmid alone or with Cas9 and the labelled gRNA, either control VF2468 or 0x or 6x gRNAs. The probed transcripts are indicated on the right. Quantification of the fold-difference between the gRNA levels relative to control U6 RNA is shown underneath.
Figure 2Endogenous PAM-rich/G-Q motif sites are refractory to Cas9-driven editing.
(a) Schematic representation of the location for each target site sequence in their genic context for FOS and TGFB1 loci. The sequences of the target sites are shown. Red text indicates the gRNA complementary target site sequence, blue highlights the targeted PAM, and yellow highlights NGG motifs within the body of the target site (flanking non-targeted DNA sequences is written in black text). Primer binding sites for PCR amplification prior to performing the T7endI assay are marked as lines with half-arrowheads. (b,c) Genomic modification by Cas9 at PAM-rich G-Q local sites is strongly inhibited relative to nearby PAM poor low or high GC-content target sites as measured by a T7endI assay. Shown are representative experiments with the digested fragments, denoted by triangles, representing the fraction of the targeted genomic region by Cas9 that had been repaired by NHEJ. The 100 bp increments of the DNA ladder are shown to the left of the blot. Underneath each lane the % modification is indicated. (d) Sequences with G-Q motifs on the sense strand are disfavoured by CRISPR-Cas9 for editing. Box-and-whisker plots of the reported gRNA percent ranks separating sequences into those that exclude or include predicted G-Q motifs, for either sense or antisense gRNA targeting site sequences. The top, middle and bottom lines of the ‘box' represent the 25th, 50th and 75th percentiles, respectively, while the ‘whiskers' represent the 10th and 90th percentiles. ****, P value ≤0.0001 as determined by the Kolmogorov–Smirnov test.
NGG-rich sequences that are predicted to form G-Q structures.
| FOS low GC | (T)GTTATAAAAGCAGTGGCTG | 14q24 | 75278793-75278814 | negative | 40 | 0 | 0.2543 |
| FOS high GC | GGACGCGCTGCTGCAGCGGG | 14q24 | 75279023-75279045 | positive | 80 | 0 | 0.0750 |
| FOS G-Q | GGGGCTTGGGGTCGCGGAGG | 14q24 | 75279155-75279177 | positive | 80 | 17 | 0.0870 |
| TGFβ low GC | (A)GAGTGACTGGCAGAAGAAAC | 19q13 | 41352594-41352616 | negative | 45 | 0 | 0.1329 |
| TGFβ high GC 1 | GCTACCGCTGCTGTGGCTAC | 19q13 | 41352987-41353009 | negative | 65 | 0 | 0.0443 |
| TGFβ high GC 2 | (C)GCGCTCTCGGCAGTGCCGG | 19q13 | 41353059-41353081 | negative | 80 | 0 | 0.0477 |
| TGFβ G-Q | GGGCAGGGGAGCCGGGAGGG | 19q13 | 41352655-41352677 | negative | 85 | 39 | 0.0100 |
| KRAS high GC | (A)GCTGGGAGCGAGCGCGGCGC | 12p12 | 25250809-25250828 | positive | 85 | 0 | 0.1610 |
| KRAS G-Q | (C)GGGCGAAGGTGGCGGCGGCT | 12p12 | 25250860-25250879 | positive | 80 | 19 | 0.0172 |
| TNFα high GC | GGAGACGGCGATGCGGCTGA | 6p21 | 31577299-31577321 | negative | 70 | 0 | 0.0756 |
| TNFα G-Q | (A)GTTGGGGCAGGGGAGGCGTT | 6p21 | 31577560-31577582 | negative | 70 | 19 | 0.0287 |
| PIM1 high GC | (T)GCAGCGCTGCCCGACCCCGC | 6p21 | 37170342-37170361 | positive | 85 | 0 | 0.1501 |
| PIM1 G-Q | GTCGGTGGCAGCGGCGGCGG | 6p21 | 37170220-37170239 | positive | 85 | 18 | 0.0670 |
gRNA, guide RNA; TNFα, tumour necrosis factor-α.
*Brackets denote non-G starting nucleotide found in the genomic sequence which was altered to G in the gRNA sequence for proper U6 transcription initiation.
†Position numbers based on the assembly for the human genome GRCh38 from NCBI.
‡Based on the scoring system from http://bioinformatics.ramapo.edu/QGRS/ which ranks the likelihood for a given candidate sequence to form a G-Q structure18.
§Based on the scoring algorithm as reported8.