| Literature DB >> 23934178 |
Vikram Pattanayak1, Steven Lin, John P Guilinger, Enbo Ma, Jennifer A Doudna, David R Liu.
Abstract
The RNA-programmable Cas9 endonuclease cleaves double-stranded DNA at sites complementary to a 20-base-pair guide RNA. The Cas9 system has been used to modify genomes in multiple cells and organisms, demonstrating its potential as a facile genome-engineering tool. We used in vitro selection and high-throughput sequencing to determine the propensity of eight guide-RNA:Cas9 complexes to cleave each of 10(12) potential off-target DNA sequences. The selection results predicted five off-target sites in the human genome that were confirmed to undergo genome cleavage in HEK293T cells upon expression of one of two guide-RNA:Cas9 complexes. In contrast to previous models, our results show that guide-RNA:Cas9 specificity extends past a 7- to 12-base-pair seed sequence. Our results also suggest a tradeoff between activity and specificity both in vitro and in cells as a shorter, less-active guide RNA is more specific than a longer, more-active guide RNA. High concentrations of guide-RNA:Cas9 complexes can cleave off-target sites containing mutations near or within the PAM that are not cleaved when enzyme concentrations are limiting.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23934178 PMCID: PMC3782611 DOI: 10.1038/nbt.2673
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Figure 1In vitro selection overview
(a) Cas9 complexed with a short guide RNA (sgRNA) recognizes ~20 bases of a target DNA substrate that is complementary to the sgRNA sequence and cleaves both DNA strands. The white triangles represent cleavage locations. (b) A modified version of our previously described in vitro selection[21] was used to comprehensively profile Cas9 specificity. A concatemeric pre-selection DNA library in which each molecule contains one of 10^12 distinct variants of a target DNA sequence (white rectangles) was generated from synthetic DNA oligonucleotides by ligation and rolling-circle amplification. This library was incubated with a Cas9:sgRNA complex of interest. Cleaved library members contain 5′ phosphate groups (green circles) and therefore are substrates for adapter ligation and PCR. The resulting amplicons were subjected to high-throughput DNA sequencing and computational analysis.
Figure 2In vitro selection results for Cas9:CLTA1 sgRNA
Heat maps[21] show the specificity profiles of Cas9:CLTA1 sgRNA v2.1 under enzyme-limiting conditions (a, b), Cas9:CLTA1 sgRNA v1.0 under enzyme-excess conditions (c, d), and Cas9:CLTA1 sgRNA v2.1 under enzyme-excess conditions (e, f). Heat maps show all post-selection sequences (a, c, e) or only those sequences containing a single mutation in the 20-base pair sgRNA-specified target site and two-base pair PAM (b, d, f). Specificity scores of 1.0 (dark blue) and -1.0 (dark red) corresponds to 100% enrichment for and against, respectively, a particular base pair at a particular position. Black boxes denote the intended target nucleotides. (g) Effect of Cas9:sgRNA concentration on specificity. Positional specificity changes between enzyme-limiting (200 nM DNA, 100 nM Cas9:sgRNA v2.1) and enzyme-excess (200 nM DNA, 1000 nM Cas9:sgRNA v2.1) conditions are shown for CLTA1. Red lines indicate the maximum possible change in positional specificity for a given position. (h) Effect of sgRNA architecture on specificity. Positional specificity changes between sgRNA v1.0 and sgRNA v2.1 under enzyme-excess conditions are shown for CLTA1. Red lines indicate the maximum possible change in positional specificity for a given position. See Supplementary Figures S4-S6, S23, and S24 for corresponding data for CLTA2, CLTA3, and CLTA4.
Cellular modification induced by Cas9:CLTA4 sgRNA
33 human genomic DNA sequences were identified that were enriched in the Cas9:CLTA4 v2.1 sgRNA in vitro selections under enzyme-limiting or enzyme-excess conditions. Sites shown in red contain insertions or deletions (indels) that are consistent with significant Cas9:sgRNA-mediated modification in HEK293T cells. In vitro enrichment values for selections with Cas9:CLTA4 v1.0 sgRNA or Cas9:CLTA4 v2.1 sgRNA are shown for sequences with three or fewer mutations. Enrichment values were not calculated for sequences with four or more mutations due to low numbers of in vitro selection sequence counts. Modification frequencies (number of sequences with indels divided by total number of sequences) in HEK293T cells treated with Cas9 without sgRNA (“no sgRNA”), Cas9 with CLTA4 v1.0 sgRNA, or Cas9 with CLTA4 v2.1 sgRNA. P-values are listed for those sites that show significant modification in v1.0 sgRNA- or v2.1 sgRNA-treated cells compared to cells treated with Cas9 without sgRNA. P-values were calculated using a one-sided Fisher exact text. “Not tested (n.t.)” indicates that PCR of the genomic sequence failed to provide specific amplification products.
| modification frequency in | P-value | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| # of | sequence | gene | v1.0 | v2.1 | no | v1.0 | v2.1 | v1.0 | v2.1 | |
| CLTA4-0-1 | 0 | GCAGATGTAGTGTTTCCACAGGG | CLTA | 20 | 7.95 | 0.021% | 11% | 76% | <1E-55 | <1E-55 |
| CLTA4-3-1 | 3 | aCAtATGTAGTaTTTCCACAGGG | 16.5 | 12.5 | 0.006% | 0.055% | 24% | 6.0E-04 | <1E-55 | |
| CLTA4-3-2 | 3 | GCAtATGTAGTGTTTCCAaATGt | 2.99 | 6.97 | 0.017% | 0% | 0.014% | |||
| CLTA4-3-3 | 3 | cCAGATGTAGTaTTcCCACAGGG | CELF1 | 1.00 | 4.95 | 0% | 0% | 0.469% | 2.5E-21 | |
| CLTA4-3-4 | 3 | GCAGtTtTAGTGTTTtCACAGGG | BC073807 | 0.79 | 3.12 | 0% | 0% | 0% | ||
| CLTA4-3-5 | 3 | GCAGAgtTAGTGTTTCCACACaG | MPPED2 | 0 | 1.22 | 0.005% | 0.015% | 0.018% | ||
| CLTA4-3-6 | 3 | GCAGATGgAGgGTTTtCACAGGG | DCHS2 | 1.57 | 1.17 | 0.015% | 0.023% | 0.021% | ||
| CLTA4-3-7 | 3 | GgAaATtTAGTGTTTCCACAGGG | 0.43 | 0.42 | 0.005% | 0.012% | 0.003% | |||
| CLTA4-4-1 | 4 | aaAGAaGTAGTaTTTCCACATGG | n.t. | n.t. | n.t. | |||||
| CLTA4-4-2 | 4 | aaAGATGTAGTcaTTCCACAAGG | 0.004% | 0% | 0.005% | |||||
| CLTA4-4-3 | 4 | aaAtATGTAGTcTTTCCACAGGG | 0.004% | 0.009% | 0% | |||||
| CLTA4-4-4 | 4 | atAGATGTAGTGTTTCCAaAGGa | NR1H4 | 0.032% | 0.006% | 0.052% | ||||
| CLTA4-4-5 | 4 | cCAGAgGTAGTGcTcCCACAGGG | 0.005% | 0.006% | 0.007% | |||||
| CLTA4-4-6 | 4 | cCAGATGTgagGTTTCCACAAGG | XKR6 | 0.018% | 0% | 0.007% | ||||
| CLTA4-4-7 | 4 | ctAcATGTAGTGTTTCCAtATGG | HKR1 | 0.006% | 0% | 0.008% | ||||
| CLTA4-4-8 | 4 | ctAGATGaAGTGcTTCCACATGG | CDK8 | 0.009% | 0.013% | 0.730% | 9.70E-21 | |||
| CLTA4-4-9 | 4 | GaAaATGgAGTGTTTaCACATGG | 0% | 0% | 0.004% | |||||
| CLTA4-4-10 | 4 | GCAaATGaAGTGTcaCCACAAGG | 0.004% | 0% | 0% | |||||
| CLTA4-4-11 | 4 | GCAaATGTAtTaTTTCCACtAGG | NOV | 0% | 0.00% | 0% | ||||
| CLTA4-4-12 | 4 | GCAGATGTAGctTTTgtACATGG | 0% | 0.00% | 0% | |||||
| CLTA4-4-13 | 4 | GCAGcTtaAGTGTTTtCACATGG | GRHL2 | 0.020% | 0.02% | 0.030% | ||||
| CLTA4-4-14 | 4 | ttAcATGTAGTGTTTaCACACGG | LINC00535 | n.t. | n.t. | n.t. | ||||
| CLTA4-5-1 | 5 | GaAGAgGaAGTGTTTgCcCAGGG | RNH1 | 0.004% | 0.01% | 0.006% | ||||
| CLTA4-5-2 | 5 | GaAGATGTgGaGTTgaCACATGG | FZD3 | 0.004% | 0.00% | 0% | ||||
| CLTA4-5-3 | 5 | GCAGAaGTAcTGTTgttACAAGG | 0.002% | 0.00% | 0.003% | |||||
| CLTA4-5-4 | 5 | GCAGATGTgGaaTTaCaACAGGG | SLC9A2 | 0% | 0.00% | 0% | ||||
| CLTA4-5-5 | 5 | GCAGtcaTAGTGTaTaCACATGG | 0.004% | 0.00% | 0.005% | |||||
| CLTA4-5-6 | 5 | taAGATGTAGTaTTTCCAaAAGt | 0.007% | 0.01% | 0% | |||||
| CLTA4-6-1 | 6 | GCAGcTGgcaTtTcTCCACACGG | n.t. | n.t. | n.t. | |||||
| CLTA4-6-2 | 6 | GgAGATcTgaTGgTTCtACAAGG | 0.007% | 0.00% | 0.009% | |||||
| CLTA4-6-3 | 6 | taAaATGcAGTGTaTCCAtATGG | SMA4 | 0.015% | 0.00% | 0% | ||||
| CLTA4-7-1 | 7 | GCcagaaTAGTtTTTCaACAAGG | SEPHS2 | 0% | 0.00% | 0.007% | ||||
| CLTA4-7-2 | 8 | ttgtATtTAGaGaTTgCACAAGG | RORB | 0% | 0.00% | 0% | ||||