| Literature DB >> 30936113 |
Rohan Dandage1,2,3,4, Philippe C Després5,2,3, Nozomu Yachie6,7,8,9, Christian R Landry5,2,3,4.
Abstract
CRISPR-mediated base editors have opened unique avenues for scar-free genome-wide mutagenesis. Here, we describe a comprehensive computational workflow called beditor that can be broadly adapted for designing guide RNA libraries with a range of CRISPR-mediated base editors, Protospacer Adjacent Motif (PAM) recognition sequences, and genomes of many species. Additionally, to assist users in selecting the best sets of guide RNAs for their experiments, a priori estimates of editing efficiency, called beditor scores, are calculated. These beditor scores are intended to select guide RNAs that conform to requirements for optimal base editing: the editable base falls within maximum activity window of the CRISPR-mediated base editor and produces nonconfounding mutational effects with minimal predicted off-target effects. We demonstrate the utility of the software by designing guide RNAs for base editing to model or correct thousands of clinically important human disease mutations.Entities:
Keywords: CRISPR; base editing; gRNA design; gene editing; genome-wide mutagenesis
Mesh:
Substances:
Year: 2019 PMID: 30936113 PMCID: PMC6553823 DOI: 10.1534/genetics.119.302089
Source DB: PubMed Journal: Genetics ISSN: 0016-6731 Impact factor: 4.562
Figure 1The computational workflow of beditor allows for the flexible design of gRNA libraries to be used in CRISPR base editing and offers a priori evaluation of mutagenesis potential. (A) Information on the type and location of desired mutations is supplied to the beditor workflow as a tab separated file. gRNAs are designed according to the user-provided sets of BEs and PAM recognition sequences. Among many base editor and Pam sequence-specific requirements, nucleotide windows for maximum activity are considered while designing the gRNAs. Finally, potential off-target effects are estimated. (B) A scoring system specifically designed for a priori evaluation of mutagenesis potential of gRNAs. Penalties are assigned based on (1) the total number of off-target alignments of gRNAs to the reference genome, (2) positions of the mismatches in the off-target alignments relative to the PAM and (3) genomic locations of off-target alignments, and lastly, (4) whether the editable base lies inside the activity window of the BE. Using all the above penalties, a final score is calculated for each gRNA sequence – the beditor score.
Figure 2Demonstrative analysis of gRNAs designed with custom base editors and PAM recognition sequences. To demonstrate the utility of beditor in utilizing custom base editors and PAM recognition sequences, sets of 1000 randomly assigned mutations in S. cerevisiae (see Supplemental methods) were analyzed in two mutation formats (nucleotide or amino acid) and two modes of mutagenesis (“model” and “correct”) (each combination is shown in parts A–D). In each heatmap, number of gRNAs designed by each combination of a base editor (in columns) and PAM recognition sequence (in rows) is shown.
Figure 3Case study analysis of clinically relevant human SNPs. For the case study analysis, two base editors (Target-AID and ABE) and two PAM sequences (NGG and NG) were used. Number of gRNAs designed using each mutation format, i.e., nucleotide (A) and amino acid mutation (B) data are shown. (C) Representative summary visualization of gRNA libraries designed with the Target-AID base editor. Nucleotide composition of the gRNAs is shown along the length of the gRNAs. gRNAs are grouped by the position of the editable nucleotides within the activity window of a BE (shown in the rows).
Figure 4Percentage editability of gRNA libraries designed in case study analysis of clinically relevant human SNPs. Percentage of substitutions that can be edited by gRNA library (% editability) designed for case study analysis of clinically relevant human SNPs, in the format of nucleotide (A and B) and amino acid mutations (C and D) (see Supplemental methods). Also, the gRNA libraries were designed to remove mutations i.e., “correct” mode (A and C) and to introduce mutations i.e., “model” mode (B and D). Mapped on the heatmaps is a ratio between number of substitutions that can be edited with the designed gRNAs and the number of substitutions present in the input data (% editability). Left and right brackets indicate that the substitution is carried out by ABE and Target-AID respectively. +, −, and ± indicate substitutions for which gRNA is designed on +, −, and both strands, respectively. Shown in gray are substitutions that are absent in the input data. *Indicates nonsense mutation.
Figure 5Performance assessment of beditor scores from case study analysis of clinically relevant human SNPs. (A) Relationship between the number of genome-wide off-target alignments and beditor score per gRNA. The color of hexbins are scaled according to the number of gRNAs per bin. (B) Relationship between the distance of a mutation in off-target alignments and corresponding penalty assigned (P). The color of hexbins are scaled according to the number of off-target alignments per bin. (C) Relationship between the CFD score and beditor score for all the gRNAs carrying NGG PAM sequence. The color of hexbins are scaled according to the number of gRNAs per bin. ρ is Spearman’s correlation coefficient.