| Literature DB >> 29229813 |
Samuel Lessard1,2, Laurent Francioli3,4, Jessica Alfoldi3,4, Jean-Claude Tardif1,2, Patrick T Ellinor4,5, Daniel G MacArthur3,4, Guillaume Lettre1,2, Stuart H Orkin6,7,8,9,10, Matthew C Canver6,7,8,9.
Abstract
The CRISPR-Cas9 nuclease system holds enormous potential for therapeutic genome editing of a wide spectrum of diseases. Large efforts have been made to further understanding of on- and off-target activity to assist the design of CRISPR-based therapies with optimized efficacy and safety. However, current efforts have largely focused on the reference genome or the genome of cell lines to evaluate guide RNA (gRNA) efficiency, safety, and toxicity. Here, we examine the effect of human genetic variation on both on- and off-target specificity. Specifically, we utilize 7,444 whole-genome sequences to examine the effect of variants on the targeting specificity of ∼3,000 gRNAs across 30 therapeutically implicated loci. We demonstrate that human genetic variation can alter the off-target landscape genome-wide including creating and destroying protospacer adjacent motifs (PAMs). Furthermore, single-nucleotide polymorphisms (SNPs) and insertions/deletions (indels) can result in altered on-target sites and novel potent off-target sites, which can predispose patients to treatment failure and adverse effects, respectively; however, these events are rare. Taken together, these data highlight the importance of considering individual genomes for therapeutic genome-editing applications for the design and evaluation of CRISPR-based therapies to minimize risk of treatment failure and/or adverse outcomes.Entities:
Keywords: CRISPR-Cas9; human genetic variation; off-target specificity; on-target specificity; therapeutic genome editing
Mesh:
Substances:
Year: 2017 PMID: 29229813 PMCID: PMC5748207 DOI: 10.1073/pnas.1714640114
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Summary of therapeutically implicated loci
| Gene/virus | Target | Coordinates (hg19) | Disease | Repair | Refs. |
| Gene | |||||
| All exons | chr3:105085753–105295744 | HIV-1 infection | NHEJ | ||
| Enhancer | chr2:60722309–60722472 | β-Hemoglobinopathies | NHEJ | ||
| All exons | chr15:45003686–45010357 | Hypoimmunogenic cells for transplantation | NHEJ | ||
| Exon 1 | chr3:46414395–46415452 | HIV infection | NHEJ | ||
| Intron 26 | chr12:88494861–88495060 | Leber’s congenital amaurosis type 10 | NHEJ | ||
| Exon 2 | chr2:136872440–136873482 | HIV-1 infection | NHEJ | ||
| Exon 3 | chr6:29911046–29911320 | Hypoimmunogenic cells for transplantation | NHEJ | ||
| Exons 1–2 | chr1:55505512–55509707 | Cardiovascular disease | NHEJ | ||
| Exon 1 | chr2:242800916–242800990 | Tumor immunotherapy | NHEJ/HDR | ||
| Exons 2, 12, 14 | chr9:15468629–15510186 | HIV-1 infection | NHEJ | ||
| All exons | chr22:26921712–26992681 | HIV-1 infection | NHEJ | ||
| Exon 1 | chr14:23016448–23016719; chr7:142498738–142499111; chr7:142498726–142499111 | T cell immunotherapy | NHEJ | ||
| All exons | chr6:44221838–44225308 | HIV-1 infection | NHEJ | ||
| Intron 6/exon 7 | chr20:43251649–43251819 | Adenosine deaminase severe combined immunodeficiency (ADA-SCID) | HDR | ||
| Intron 1 | chr4:74270125–74270832 | Lysosomal storage disease, hemophilia A, B | HDR | ||
| Exon 10 | chr7:117199519–117199709 | Cystic fibrosis | HDR | ||
| Exons 2, 3, 14, 15, 54, 117 | chr3:48602217–48631981 | Epidermolysis bullosa | HDR | ||
| Exon 7 | chrX:37658207–37658337 | X-linked chronic granulomatous disease | HDR | ||
| Exons/intron 45–55 | chrX:31533884–32250573 | Duchenne’s muscular dystrophy | HDR | ||
| Intron 4 | chr9:97934216–97934415 | Fanconi anemia | HDR | ||
| Intron 1 | chrX:138613012–138619169 | Hemophilia B | HDR | ||
| Exon 8/intron 8 | chr15:80464492–80464690 | Hereditary tyrosinemia type I | HDR | ||
| Exon 1 | chr11:5248162–5248251 | Sickle cell disease | HDR | ||
| Exon 5 | chrX:70329079–70329240 | X-linked severe combined immunodeficiency (X-SCID) | HDR | ||
| Intron 4/exon 5 | chr14:94844848–94845047 | α-1-Antitrypsin deficiency | HDR | ||
| Virus | |||||
| Cytomegalovirus | Viral genome | — | Congenital defects, disease in immuno-compromised individuals | NHEJ | |
| Epstein bar virus | Viral genome | — | Infectious mononucleosis, malignancies | NHEJ | |
| Hepatitis B virus | Viral genome | — | Hepatitis B | NHEJ | |
| Herpes simplex virus type 1 | Viral genome | — | Cold sores, keratitis | NHEJ | |
| HIV-1 | Viral genome (LTR) | — | HIV-1 infection | NHEJ | |
| Human papilloma virus | E6–E7 oncogenes | — | Cervical carcinoma | NHEJ | |
| JC virus | T antigen | — | Progressive multifocal leukoencephalopathy | NHEJ |
Fig. 1.Off-target scores using the ambiguous genome approach. (A) Distribution of aggregate off-target scores in the reference and ambiguous genomes for human-genome–targeting, viral-genome–targeting, and nontargeting gRNAs. (B) Change in aggregate off-target score between ambiguous and reference genomes. (C) Distribution of off-target sites by number of mismatches. (D) Ratio of the number of off-target sites in ambiguous genomes compared with the reference genome stratified by the number of off-target sites in the reference genome. The y axis shows the ratios for each gRNA, whereas the x axis shows the number of off-target sites in the reference genome.
Fig. 2.Variants can reduce gRNA targeting efficiency. (A) Distribution of on-target scores for human-genome–targeting gRNAs for each possible target haplotype. (B) Distribution of samples/individuals carrying haplotypes predicted to be targeted with a local on-target score of <100%. (C) Distribution of local on-target scores for the gRNA TPST2_gRNA_2070. (D) Distribution of on-target CFDs for human-genome–targeting gRNAs for each possible target haplotype. (E) Distribution of samples/individuals carrying haplotypes predicted to be targeted with a CFD of <100%. (F) Distribution of CFDs for the gRNA TPST2_gRNA_2070. (G) Example of haplotypes at the HLA-A locus. Inset plots with a restricted y-axis range are shown for A, B, D, and E for easier visualization of data.
Fig. 3.Variants can increase the risk of off-target effects. (A) Distribution of aggregate off-target scores for each 1000G haplotype. (B) Distribution of aggregate off-target scores for each FC haplotype. (C) Difference in aggregate off-target scores for each 1000G haploid genome and the reference genome. The x axis corresponds to different gRNAs, and each dot represents the difference in score of each haploid genome in the 1000G dataset. The figure includes 758 gRNAs with at least one match with overlapping variants. (D) Difference in aggregate off-target score for each FC haploid genome and the reference genome. The x axis corresponds to different gRNAs and each dot represents the difference in score of each haploid genome in the FC dataset. The figure includes 710 gRNAs with at least one match with overlapping variants.
Representative example of off-target sites created by variants present in the 1000 Genomes database
Variants included in the chr11:100402414–100402433 (hg19) haplotype are rs566289682, rs555981507, rs181027193, and rs11560892. Variants included in the chr13:33591178–33591197 (hg19) haplotype are rs200611452 and rs116289670. Variants present in the chr14:52120745–52120765 (hg19) haplotype are rs532153306 and rs552139758. Sites displaying mismatches with the gRNA sequence are shown in red, whereas sites where variants rescue the gRNA sequence are highlighted in blue. CFD, cutting frequency determination; Freq., frequency; Hap., haplotype; PAM, protospacer adjacent motif; Seq. pos., sequence position.