| Literature DB >> 26501275 |
Kentaro Ishida1, Peter Gee2, Akitsu Hotta3,4.
Abstract
Programmable nucleases, such as zinc finger nucleases (ZFNs), transcription activator like effector nucleases (TALENs), and clustered regularly interspersed short palindromic repeats associated protein-9 (CRISPR-Cas9), hold tremendous potential for applications in the clinical setting to treat genetic diseases or prevent infectious diseases. However, because the accuracy of DNA recognition by these nucleases is not always perfect, off-target mutagenesis may result in undesirable adverse events in treated patients such as cellular toxicity or tumorigenesis. Therefore, designing nucleases and analyzing their activity must be carefully evaluated to minimize off-target mutagenesis. Furthermore, rigorous genomic testing will be important to ensure the integrity of nuclease modified cells. In this review, we provide an overview of available nuclease designing platforms, nuclease engineering approaches to minimize off-target activity, and methods to evaluate both on- and off-target cleavage of CRISPR-Cas9.Entities:
Keywords: CRISPR Cas9; genome editing; mutagenesis; off-target effect
Mesh:
Substances:
Year: 2015 PMID: 26501275 PMCID: PMC4632775 DOI: 10.3390/ijms161024751
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Streptococcus pyogenes (Sp) Cas9 recognition of target DNA. Sp Cas9 consists of two lobes, one containing the HNH and RuvC nuclease domains, and an α helix lobe. Arginine residues (R1333 and R1335) of Sp Cas9 recognize the dinucleotide guanine bases of the “NGG” protospacer adjacent motif (PAM). Then, serine (S1109) and lysine (K1107) form a phosphate lock by interacting with a phosphate residue adjacent to the PAM site, which triggers the melting of the double stranded DNA so that sgRNA (green) can form Watson–Crick pairing with a complementary DNA strand. D10 and H840 are the main catalytic residues in the HNH and RuvC nuclease domains, respectively. Aspartic acid at 1135 (D1135) interacts with the third PAM guanine residue through a water molecule, and a modification of the residue to glutamic acid (D1135E) has been shown to improve the recognition of PAM sequence and target specificity.
Software to design CRISPR (clustered regularly interspersed short palindromic regions) sgRNA (single guide RNA).
| Tool | Organism | Input (Length) | off-Target Sites | Reference |
|---|---|---|---|---|
| ZiFiT | Hs, Rn, Mm, Dr, Dm, Ce, Aa, Ec | Target sequence (<1000 bp) | Mismatches | [ |
| CRISPR Design | >15 species | Target sequence (<250 bp) | Mismatches | [ |
| Cas9 design | >10 species | Target sequence (>10 kbp) | Mismatches | [ |
| E-CRISP | >15 species | Target sequence (>10 kbp), gene name | Mismatches | [ |
| CasOT | Any species | Target sequence (>10 kbp) | Mismatches | [ |
| Cas-OFFinder | >10 species | Designed sgRNA (10–25 nt) | Mismatches, insertion and deletions | [ |
| CHOPCHOP | >20 species | Target sequence (>10 kbp) | Mismatches | [ |
| GT-Scan | >20 species | Target sequence (<4000 bp) | Mismatches | [ |
| sgRNAcas9 | Any species | Target sequence (>10 kbp) | Mismatches | [ |
| CRISPR-P | >20 plants | Target sequence (<5 kbp) | Mismatches | [ |
| COSMID | Hs, Mm, Rn, Ce, Mam, Dr | Designed sgRNA (10–55 nt) | Mismatches, insertions and deletions | [ |
| sgRNA Designer | Hs, Mm | Target sequence (<10 kbp), gene ID | N.A. | [ |
| iGEATs | Hs, Mm | Chromosomal locus, target sequence (<25 kb), gene name, gRNA | N.A. | [ |
| CRISPRdirect | >15 species | Target sequence (<10 kbp) | Mismatches, insertions and deletions | [ |
| CRISPR-ERA | Hs, Mm, Rn, Dr, Dm, Ce, Sc, Ec, Bs | Target sequence (<5 kbp), gene name | Mismatches | [ |
| Protospacer Workbench | Any species | Target sequence (>10 kbp), gene name | Mismatches | [ |
Hs: Homo sapiens, Rn: Rattus norvegicus, Mm: Mus musculus, Dr: Danio rerio, Dm: Drosophila melanogaster, Aa: Aedes aegypti, Ec: Escherichia coli, Sc: Saccharomyces cerevisiae, Bs: Bacillus subtilis, Ce: Caenorhabditis elegans, Mam: Macaca mulatta.
Figure 2Tendency of identified off-target sites by sgRNA design and sequence search tools. (A) To demonstrate the sgRNA design process, we selected the human ILF3 gene as a target. To identify uniquely targetable region of sgRNAs, genomic region around the splicing acceptor site of exon 18 and peaks of unique k-mer sequences are shown as red histogram. Thirteen candidate sgRNA sequences (blue arrow with a box, indicating the position of PAM sequence) were identified within the region and the identified sgRNAs are ordered based on the rank provided by the CRISPR design tool; (B) With the same set of 13 sgRNAs as shown in (A), we calculated the number of potential off-target sites in human genome (hg19) by using various tools as indicated. For the calculation, the following options are used: GT-Scan, allowing three mismatches; CRISPRdirect, potential targets with 12 nt plus PAM; Cas-OFFinder, allowing three mismatches, two DNA bulge size, and two RNA bulge size; stand alone Blastn+, “blastn-short” task, e-value < 1.0; stand alone Bowtie, allowing three mismatches, seed length 16 nt; GGGenome, allowing three mismatches.
Figure 3Modified Cas9 strategies to reduce off-target mutagenesis. (A) Wild type Sp Cas9 nuclease can bind and cleave either sgRNA-complementary (green) or -mismatched (red) DNA with canonical PAM (cPAM) or noncanonical PAM (ncPAM), respectively; (B) PAM-optimized Cas9 nuclease induces cleavage specifically to cPAM complementary DNA; (C) A pair of SpCas9 nickases (i.e., D10A mutant) must bind to two closely spaced DNA sequences on opposite strands in order to induce single strand nicking on each strand and result in full cleavage of the target DNA. The binding of a Cas9 nickase to a non-specific site will induce nicking but not a double strand break; (D) Deactivated Sp dCas9 (i.e., D10A and H840A mutant) fused with a FokI nuclease must bind to two closely spaced DNA sequences in order for dimerization of the FokI nuclease and cleavage to occur. Nonspecific binding at an off-target site will not induce cleavage.
Summary of studies detecting off-target mutagenesis using different cell lines.
| Cell Types | Target Gene | Programmable Nuclease Used | Detection Assay of off-Target Mutagenesis | off-Target Mutagenesis Detected | Ref. |
|---|---|---|---|---|---|
| U2OS, HEK293 and K562 cells | CRISPR | EGFP reporter, T7EI | Yes | [ | |
| 293T | TALENs, CRISPR | amplicon seq, Surveyor assay | Yes | [ | |
| 293T | CRISPR | T7EI, Sanger sequencing | Yes | [ | |
| 293T | CRISPR | T7EI, Sanger sequencing | Yes | [ | |
| 293 and U2OS cells | CRISPR | GUIDE-seq, amplicon seq (AMP-based seq) | Yes | [ | |
| HAP1 cells, K562 cells, | CRISPR | Digenome-seq | Yes | [ | |
| 293T cells, A549 cells | TALENs, CRISPR | HTGTS method | Yes | [ | |
| iPS cells |
| ZFNs | CGH, SNP array, exome seq | No | [ |
| myoblasts | TALENs | exome seq | No | [ | |
| iPS cells | TALENs, CRISPR | whole genome seq | No | [ | |
| ES cells, iPS cells | TALENs, CRISPR | whole genome seq | No | [ | |
| iPS cells | TALENs, HDAdV mediated HR | whole genome seq | No | [ | |
| 293FT cells, iPS cells | CRISPR | amplicon seq | No | [ | |
| iPS cells | TALENs and CRISPR | T7EI, amplicon seq, Karyotyping, CNV analysis, exome seq | No | [ | |
| iPS cells | TALEN | exome seq | No | [ | |
| iPS cells | CRISPR | amplicon seq | No | [ |