| Literature DB >> 31877894 |
Taegeun Bae1, Hanseop Kim2, Jeong Hee Kim3, Yong Jun Kim3,4, Seung Hwan Lee2, Byung-Joo Ham5,6,7, Junho K Hur3,4.
Abstract
In CRISPR genome editing, CRISPR proteins form ribonucleoprotein complexes with guide RNAs to bind and cleave the target DNAs with complete sequence complementarity. CRISPR genome editing has a high potential for use in precision gene therapy for various diseases, including cancer and genetic disorders, which are caused by DNA mutations within the genome. However, several studies have shown that targeting the DNA via sequence complementarity is imperfect and subject to unintended genome editing of other genomic loci with similar sequences. These off-target problems pose critical safety issues in the therapeutic applications of CRISPR technology, with particular concerns in terms of the genome editing of pathogenic point mutations, where non-mutant alleles can become an off-target with only a one-base difference. In this study, we sought to assess a novel CRISPR genome editing technique that has been proposed to achieve a high specificity by positioning the mismatches within the protospacer adjacent motif (PAM) sequence. To this end, we compared the genome editing specificities of the PAM-based and conventional methods on an oncogenic single-base mutation in the endothelial growth factor receptor (EGFR). The results indicated that the PAM-based method provided a significantly increased genome editing specificity for pathogenic mutant alleles with single-base precision.Entities:
Keywords: CRISPR; PAM; off-target; single-base precision; specificity
Mesh:
Substances:
Year: 2019 PMID: 31877894 PMCID: PMC6982904 DOI: 10.3390/molecules25010052
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Schematic of the protospacer adjacent motif (PAM)-based guide RNA design for the targeting of the 2573T > G endothelial growth factor receptor (EGFR) point mutation. (A) Shown are the sequence of wild type EGFR (2573T), in grey, and the ribonucleoprotein complex comprised of CRISPR-Cas9 and guide RNA, in cyan. The PAM sequence is shown in red characters. (B) Schematic of the mutant sequence of EGFR (2573T > G), CRISPR-Cas9, and guide RNA. Lower-case letters indicate the 2573T > G mutation of EGFR. (C) Design of the four sgRNAs with single-base mismatches in PAM or within the guide RNA complementary sequences, indicated in lower-case letters.
Figure 2Schematic of the targeting of the RFP-GFP reporter system by CRISPR-Cas9. (A,B) Shown are the reporter construct with the wild type EGFR (2573T) and mutant EGFR (2573T > G) sequence flanked by red fluorescent protein (RFP) and green fluorescent protein (GFP). The PAM-based sgRNA sequence and its PAM are shown in blue and red, respectively.
Figure 3Comparison of the specificities of the PAM-based and sequence-complementarity methods for genome editing of targets with single-base differences. (A) Flow cytometry analyses of the genome editing specificities of the sgRNAs targeting the RFP-GFP dual reporters; (B) the graphs represent the ratio of the GFP population divided by whole population (RFP+GFP) in the WT and MT reporter systems, respectively. (C) The relative specificities are calculated by dividing the ratio of the green fluorescence protein (GFP) population in the MT reporter by that of the WT reporter. Lower-case letters indicate the mutated sequence (2573T > G). Red letters indicate the PAM sequences.
Figure 4Indel patterns and off-target analyses by T7E1 assay. (A) Shown are the most abundant 10 indel patterns of the on-target site by genome editing with sgRNA-1. The PAM sequence and insertions are indicated by red and blue characters, respectively. (B) Shown are the sequences of potential 15 off-target sites. Mismatches to the on-target sequence and PAM sequences are indicated by lower-case and red characters, respectively. (C) Shown are the indel rates of the on-target and the 15 candidate off-target sites that were quantified by targeted deep sequencing.
DNA Primer Sequence Information for deep sequencing assay of off-target sites. Mis-matched nucleotides are shown in lower-case letters.
| Target Gene | Primer Sequence (5′ to 3′) | Off-Target Sequence |
|---|---|---|
| EGFR_Off_1_F1 | TAAATTGAGTGCAGAGCCTTG | TCAAGATCACAcATaTTGGGAGG |
| EGFR_Off_1_R1 | GCAATGGTACCCTTTTCTTC | |
| EGFR_Off_1_F2 | AAACCCTGTGTTTGGAATTCA | |
| EGFR_Off_1_R2 | TTTGTTGTCCTTATATGTCATTGTG | |
| EGFR_Off_2_F1 | TCAGAGCCCAGTTTATCATAG | TCAAtATgACAtATTTTGGGGGG |
| EGFR_Off_2_R1 | GTAAATTAGCTGGTCTCAGG | |
| EGFR_Off_2_F2 | GGCATCAAGTAAATTACTCAAAACA | |
| EGFR_Off_2_R2 | CATGTAGCCAAAACAAAACAAACA | |
| EGFR_Off_3_F1 | ATTCAACAACGATCTGTTGTG | TgAAGgTCACAGAgTTTGGGAGG |
| EGFR_Off_3_R1 | AAAGCAAAACCCCAAAGCGA | |
| EGFR_Off_3_F2 | TCTAGTGGAGACAGATGTTAAAC | |
| EGFR_Off_3_R2 | GGGCTTTCCTCTCTTCTGT | |
| EGFR_Off_4_F1 | TGCATTTTGACTCTTCTCGA | TgAAGAcCACAGATTTaGGGAGG |
| EGFR_Off_4_R1 | AGAAGAAAGCTTCTGATGCC | |
| EGFR_Off_4_F2 | ATATCCAATAACAACAATACTAGC | |
| EGFR_Off_4_R2 | CCATCATTGAATTGAGGTCAC | |
| EGFR_Off_5_F1 | GGAAATCAAATTTGGGGTTG | TCAAGATgACAGATTTgGGtGGG |
| EGFR_Off_5_R1 | CTGCTGTTCCCCAACTTATT | |
| EGFR_Off_5_F2 | TCTGAGACTGGGTCATTCAT | |
| EGFR_Off_5_R2 | GCTTTTTACAACTATCTTACTAATAAC | |
| EGFR_Off_6_F1 | CCTTGTTGATTTACATTGATGTG | TCAAGATCACAaATaTTGGtAGG |
| EGFR_Off_6_R1 | CATTCCATAACATTCTCAGGT | |
| EGFR_Off_6_F2 | CCAATGTAAATCAACAAGGGT | |
| EGFR_Off_6_R2 | AATCATGAACATATTTATGCTTTCC | |
| EGFR_Off_7_F1 | CAAACTCACAATTGTGAGGG | TCAAcATaAtAGATTTTGGGAGG |
| EGFR_Off_7_R1 | AGCAGAGCATGGAGCTCATA | |
| EGFR_Off_7_F2 | ATGGACCGGGTGGCTTAAA | |
| EGFR_Off_7_R2 | AATGAAGCATGGTGACAGAC | |
| EGFR_Off_8_F1 | ACCAGGATGGTCTTGATCTC | aCAAGgTCAtAGATTTTGGGAGG |
| EGFR_Off_8_R1 | AAGTGTTTAAAACAGTGCCA | |
| EGFR_Off_8_F2 | CTGCCCAGGCTTCATCTTAA | |
| EGFR_Off_8_R2 | CCAGGCTGAAATGATCAAAG | |
| EGFR_Off_9_F1 | ATTTCAAAGGGTGGGGCTTT | aCAAGAaCACAGATTTTGGcAGG |
| EGFR_Off_9_R1 | GAAGTCTCAGATCAAGGTCC | |
| EGFR_Off_9_F2 | TTGGGACTTGTCATCCTTTT | |
| EGFR_Off_9_R2 | GGAGAAGAGCATGAGTGCTA | |
| EGFR_Off_10_F1 | TAGACTAGTCACCAGAATTCC | TCAAGATCAaAaAcTTTGGGAGG |
| EGFR_Off_10_R1 | TCACCATGCAGTTGTACATA | |
| EGFR_Off_10_F2 | CAAGGAAGACAAAAGAGACA | |
| EGFR_Off_10_R2 | TCCCCAGTCTGTTCTCTCTT | |
| EGFR_Off_11_F1 | GTTATGTGAGGTGTTTGTGT | TCAAGATCACAGATTTTtGGTGG |
| EGFR_Off_11_R1 | AGAACACACCATGTTAGAGG | |
| EGFR_Off_11_F2 | TGATGATAATTTCATGTGTGTTACC | |
| EGFR_Off_11_R2 | ACTCTCTTTGGTGAGAAGGA | |
| EGFR_Off_12_F1 | TACAGTTGTGTGGCTTTCGA | TaAAGtTCACAGgTTTTGGGGGG |
| EGFR_Off_12_R1 | GCCGTCTCAATACTTGTGAA | |
| EGFR_Off_12_F2 | AACAACTATGGTATGGGCCA | |
| EGFR_Off_12_R2 | GGTAAAACCCCATCTACTAAAAA | |
| EGFR_Off_13_F1 | GCCTCTTATCTGAACGAGAA | TCAAGgcCACAcATTTTGGGTGG |
| EGFR_Off_13_R1 | ATTAGTTGCAGTTCAAAGCC | |
| EGFR_Off_13_F2 | AAATCTACTTGGAGCAATGC | |
| EGFR_Off_13_R2 | TTTTTGGTAGGAGCCTGCAG | |
| EGFR_Off_14_F1 | AGTGTTTCGATAGATGGAGG | TgAAGAaCACAGtTTTTGGGTGG |
| EGFR_Off_14_R1 | CTACCCAAAACCTTTGTCCC | |
| EGFR_Off_14_F2 | AGAAGAGCACGAGTGGTAAA | |
| EGFR_Off_14_R2 | CACCCTTTTTCTTCCTCCAT | |
| EGFR_Off_15_F1 | GGTGGGGAAAAAAGTTTTTGG | TgAAGAaCACAGtTTTTGGGTGG |
| EGFR_Off_15_R1 | CTACACCTTCTTTTCCCGAC | |
| EGFR_Off_15_F2 | GAGTGACGAGGAGGAGGAAA | |
| EGFR_Off_15_R2 | TCAACACCCTTTTCCCCAT | |
| EGFR_on-target_F | TACTTGAAGCTGTCCTTCCC | TCAAGATCACAGATTTTGGG |
| EGFR_on-target_R | CCGTCGTCCTTGAAGAAGAT |