Literature DB >> 29955892

How to create state-of-the-art genetic model systems: strategies for optimal CRISPR-mediated genome editing.

Yannik Bollen^1,2,3, Jasmin Post^1,2, Bon-Kyoung Koo⁴, Hugo J G Snippert^1,2.

Abstract

Model systems with defined genetic modifications are powerful tools for basic research and translational disease modelling. Fortunately, generating state-of-the-art genetic model systems is becoming more accessible to non-geneticists due to advances in genome editing technologies. As a consequence, solely relying on (transient) overexpression of (mutant) effector proteins is no longer recommended since scientific standards increasingly demand genetic modification of endogenous loci. In this review, we provide up-to-date guidelines with respect to homology-directed repair (HDR)-mediated editing of mammalian model systems, aimed at assisting researchers in designing an efficient genome editing strategy.

Entities: Chemical Disease Gene Species

Mesh：

Substances：

Year: 2018 PMID： 29955892 PMCID： PMC6061873 DOI： 10.1093/nar/gky571

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

Mammalian model systems with defined genetic modifications are powerful tools for basic research and disease modelling. Unfortunately, precise manipulation of the mammalian genome has remained resource extensive and laborious for years, forcing many researchers to prioritize user-friendly techniques such as transgenic overexpression. The recent development of a novel generation of designer nucleases, e.g. Cas9, in combination with a better understanding of DNA repair mechanisms, is greatly improving the generation of new model systems with defined genetic modifications. Indeed, these more accurate model systems will increasingly represent a new standard that researchers have to incorporate into their studies. Optimal design of precise genome editing strategies is subject to many considerations that depend to a large extent on the nature of the desired modification and the cellular context in which it is pursued. While double-strand breaks (DSBs) generated by designer nucleases are sufficient to introduce deletions and rearrangements at defined genomic loci (1–3), accurate replacement or insertion of genetic material generally requires the co-introduction of a donor template that carries the modification. Moreover, the composition of the donor template can be altered to favor a particular DSB repair pathway by which the modification will be introduced into the host genome. DSBs naturally occur during DNA replication or as a consequence of environmental factors. Fortunately, homology directed repair (HDR) pathways, e.g. homologous recombination, accurately repair DSBs by using homologous DNA as a template (4,5). Indeed, the requirement for a homologous template during HDR can be exploited to facilitate the replacement or insertion of genetic material. This mode of genome editing can be stimulated by a designer nuclease-generated DSB at the genomic locus of interest and the on-site presence of an artificial DNA template that contains (i) the new or modified genetic code and (ii) flanking regions that contain sufficient homology to the cleaved genomic strands (Figure 1). In a natural setting however, a homologous sister chromatid is only readily available to serve as a template during and shortly after DNA replication. Outside the late-S, G2 and M-phase of the cell cycle, most cells actively suppress HDR to favour non-homologous end joining (NHEJ)-mediated DSB repair (6–8). As a result, classical HDR-mediated genome editing is largely restricted to proliferative cells (9). In contrast to HDR, NHEJ directly ligates break ends without the need of homology and is active in both proliferating and post-mitotic cells (10). NHEJ-mediated DSB repair has recently been exploited to introduce exogenous DNA into genomes of post-mitotic cells such as mature neurons and cardiac muscle cells (11,12). Along similar lines, the microhomology-mediated end joining (MMEJ) pathway can also be stimulated to facilitate precise genome editing in both proliferative and post-mitotic cells (13–15).

Figure 1.

Exploiting HDR for insertion or replacement of DNA. Left panel: DNA Insertion strategy. A nuclease induced DSB near the stop codon triggers HDR mediated insertion of sequence in-between the homology regions of the donor template. Right panel: DNA replacement strategy. In this example, replacing an exon for a mutant variant will be accommodated via its excision by dual targeting of the nuclease to both ends of the exon (using two gRNA), followed by HDR between the DNA break extremities and the homology regions of the donor template. HDR via the 5’ and 3’ homology regions is indicated in blue and red respectively. Although NHEJ- and MMEJ-based genome editing protocols are important innovations that enable editing in post-mitotic cells, both strategies are subject to their own set of limitations. NHEJ in particular has a tendency to generate errors during DSB repair, which may lead to inaccurate junctions during integration of the donor template (12,15). More importantly, since NHEJ ligates the cleaved genomic strands, introducing exogenous DNA (e.g. cDNA encoding fluorescent proteins) is only possible at the exact genomic location where the DSB was generated. In similar fashion, the microhomology involved in MMEJ-mediated donor integration does not tolerate significant positional divergence between the nuclease cleavage site and the desired integration site. While mammalian genome editing via HDR remains the least error prone and most flexible strategy, traditional protocols often suffer from low editing efficiency. Fortunately, novel generations of designer nucleases and new insights into the molecular mechanisms of HDR have led to the development of more efficient HDR-mediated genome editing protocols. In this review we discuss the latest research and condense it into ‘best practise guidelines’ for researchers who would like to generate mammalian model systems with precisely defined genetic modifications.

DESIGNER NUCLEASES

Mouse genomes can be edited with bp-resolution using HR-mediated repair of naturally occurring DSBs in cultured ES cells (16). For decades, this procedure has been successfully used to generate new mouse models with specific integrations of exogenous DNA, so-called knock-in mouse models. Since the timing and location of DSBs could not be controlled, efficiency of HR-mediated knock-ins varied significantly between different integration-sites, e.g. genes of interest. In addition to the desired genomic modification, the donor template often included a positive selection cassette to allow selection for donor integration. Furthermore, the homology arms were extended up to 5 kb in length to increase the likelihood that the DNA template would span a naturally occurring DSB (17). The development of designer nucleases that can target virtually all DNA loci of interest significantly enhanced the efficiency of precise genome editing, since integration of the donor template is no longer dependent on the spontaneous occurrence of a DSB near the site of interest (18). The first use-cases of designer nucleases for precise genome editing in mammalian cells involve Zinc-finger nucleases (ZFN), which were succeeded by transcription-activator like effector nucleases (TALENs) (19,20). Specificity of both ZFN and TALENs depends on a sequence-specific DNA binding domain to guide a non-specific DNA cleavage module, frequently the FokI nuclease domain. Generation of a DSB requires dual targeting in a specific spatial bi-orientation since FokI requires dimerization for nuclease activity. While ZFN and TALENs have proven effective, their usability has been limited by the requirement to pre-engineer sequence-specific DNA binding domains for each genomic target site, followed by experimental testing of nuclease activity for each ZFN or TALEN pair. More recently, class II nucleases of the bacterial clustered regularly inter-spaced short palindromic repeats (CRISPR) adaptive immune system have been engineered to facilitate mammalian genome editing (21,22). Class II CRISPR nucleases consist of a single large monomeric nuclease with DNA target-site specificity mediated by an RNA molecule (guide RNA). Double-strand DNA cleavage occurs after sequence alignment (heteroduplex formation) between the variable region within the RNA (guide sequence) and the genomic target site. In contrast to engineering new proteins (e.g. ZFN and TALENs), these nucleases only require modification of the guide sequence to direct nuclease activity toward a specific genomic locus. In addition, the monomeric nature of class II CRISPR nucleases does not impose any orientational restraints on target sites. Potential target sites of class II CRISPR nucleases are only limited by the requirement of a protospacer adjacent motif (PAM), located either up- or downstream of the genomic target site on the strand that is not engaged in heteroduplex formation with the guide sequence (protospacer). However, most class II CRISPR nucleases have relatively permissive PAM requirements and many variants with alternate PAMs have since been validated in mammalian systems. Due to their superior properties, we will focus our discussion on two distinct types of class II CRISPR nucleases that have been adapted for mammalian genome editing to date. CRISPR associated nuclease 9 (Cas9) is a monomeric nuclease first derived from Streptococcus pyogenes (SpCas9) and human codon optimized (23). Cas9 is guided by a synthetic single-guide RNA (sgRNA) of approximately 100-nucleotides (nt) in length (24), containing a 17–20 nt guide sequence that recognizes the target locus. The RuvC-like and HNH nuclease domains independently initiate cleavage on both strands 3 bp upstream of the PAM to generate a blunt-ended DSB (Figure 2A). SpCas9, the most widely used class II CRISPR nuclease, primarily recognizes the relatively permissive NGG PAM with limited activity toward NAG PAMs. Orthologs and variants of SpCas9 with alternative PAM specificities have since been published and provide an opportunity to bypass restrictions imposed by the PAM preference of conventional SpCas9 (25–34).

Figure 2.

Key features of SpCas9 and LbCpf1. Schematic representation of SpCas9 (A) and LbCpf1 (B). Ribonucleoprotein heteroduplexed with target DNA. DNA is indicated with grey lines, unless specified otherwise (PAM and protospacer). Red lines are RNAs. Light grey shape at the back represents protein structure. DNA strand cleavage is indicated using red arrow heads. An interesting alternative to Cas9 is the more recently described CRISPR from Prevotella and Francisella 1 (Cpf1) (35). In addition to Cpf1 from F. Novicida (FnCpf1) (35,36), orthologs have been adapted from Acidaminococcus sp. (AsCpf1) and Lachnospiraceae bacterium (LbCpf1) (37–39). There are major differences between Cas9 and Cpf1 at the molecular level (Figure 2). Cpf1 is guided by a shorter CRISPR RNA (∼40 nt) and contains a guide sequence of up to 24 nt of which the 18 nt proximal to the PAM contribute most to binding and cleavage activity (40). In addition, the Cpf1 PAM is located immediately upstream of the protospacer and is T-rich. Although the exact nick positions have not been defined for all Cpf1 orthologues, DNA cleavage by Cpf1 results in a 5’ staggered cut that is located away from the PAM (Figure 2B). As a consequence, small insertions or deletions (indels) generated by NHEJ-mediated repair are more likely to maintain critical target site residues, in contrast to Cas9 where indels frequently prevent re-cleavage. The additional cleavage cycles of Cpf1 were speculated to increase the probability for HDR (35). However, a direct experimental comparison between LbCpf1 and SpCas9 in mice did not reveal a significant increase in HDR mediated donor integration when initiated by LbCpf1 (41). Whereas a more recent study in zebrafish attributed enhanced HDR by LbCpf1, among other things, to its PAM distal cleavage (42).

Specificity and cleavage efficiency of CRISPR derived nuclease variants

The specificity of designer nuclease-mediated cleavage is an important consideration since off-target cleavage can result in unintended disruption of genomic elements. Genomic cleavage by ZFN or TALEN pairs is inherently specific since it is exceedingly unlikely that two off-target sites are in the required proximity and orientation to support FokI dimerization. By contrast, the monomeric nature of type II CRISPR nucleases has raised concerns regarding off-target cleavage activity. Indeed, initial reports demonstrated substantial off-target indel generation by wild-type SpCas9 (43–46). Algorithms have since been developed that predict cleavage activity at off-target sites for type II CRISPR nucleases (43), which allows the researcher to select highly specific target sites. For particularly sensitive applications an in vitro analysis of off-target cleavage can be obtained via GUIDE-Seq (47). In addition, there have been efforts to improve the intrinsic specificity of wild-type Cas9 by directed engineering of SpCas9 (26,27,48). These engineered variants display single base sensitivity at many target sites, but often sacrifice on-target cleavage efficiency when compared to wild-type SpCas9 using standard expression protocols (49). The most recent engineered SpCas9 variant, xCas9, has the most permissive PAM to date and is reported to be superior to SpCas9 in terms of specificity and on-target cleavage activity (32). However, before this variant is set to replace conventional SpCas9 it needs broader characterization. Alternatively, the inherent specificity of FokI-based nucleases has been emulated by mutagenic inactivation of the RuvC like nuclease domain of SpCas9, thereby creating a nicking variant (25). While generating DNA nick's in close proximity on opposite genomic strands can initiate DSB repair machinery, the efficiency is generally lower compared to a DSB generated by a monomeric nuclease (21,25). Instead, Cas9 nickase variants are now increasingly used to stimulate donor integration using a single genomic DNA nick (50–53), which significantly reduces off and on-target indel generation since single DNA nicks are far less mutagenic compared to DSBs (50–52). In addition to engineering of class II CRISPR nucleases, significant improvements in both specificity and on-target cleavage activity can be achieved by modification of the guide RNA. A truncated sgRNA of 17nt significantly enhances cleavage specificity of Cas9 (43,46,54), often without reducing on-target activity (54,55). Chemical modifications or even DNA substitutions of select sgRNA residues enhances specificity (56–59), while terminal modifications that prevent RNA degradation lead to enhanced cleavage activity (60). A combination of extensive chemical modifications throughout the sgRNA sequence further enhances Cas9 cleavage dynamics (61). Similar modifications of the Cpf1 guide RNA have also been shown to be effective (62,63). Chemically modified guide RNAs are widely available from commercial suppliers and are especially effective in combination with mRNA or ribonucleoprotein delivery of Cas9 or Cpf1, which we will discuss in a later section. In general, most nuclease targeting approaches deal with a trade-off between on-target editing efficiency versus on-target specificity. In the interest of maximizing the efficiency of generating a new model system, we recommend a preference for established monomeric type II CRISPR nucleases (Table 1), which display the highest on-target cleavage activity. The increased off-target proclivity of monomeric nucleases is mainly a concern in the context of therapeutic in vivo gene correction. For research applications, careful selection of target sites will often provide sufficient specificity.

Table 1.

List of widely available monomeric type II CRISPR nucleases

Selecting a genomic target site

While HDR mediated donor integration is maximally stimulated by a DSB at the intended integration site, additional considerations should be taken into account when selecting a genomic target site. As for target site specificity, algorithms have been developed that predict site specific cleavage activity of SpCas9 based on the nucleotide composition of the protospacer and PAM (40,64,65). Many tools are now available that implement these algorithms to score potential nuclease target sites in a selected stretch of DNA. For conventional monomeric cleavage we recommend the free CRISPR design tool offered by the Benching platform. Among others, it predicts site specific activity of SpCas9 based on algorithms by Doench et al. (2016); allows selection of various genomes to predict specificity scores; supports a variety of PAMs including Cpf1 and allows the export of DNA oligo's suitable for ligation into commonly used expression plasmids. For a more complete overview of CRISPR design tools we suggest a review by Cui et al. (66). In practice, the researcher will want to use a CRISPR design tool to select a nuclease target site based on the following criteria; (1) cleavage proximity to the intended integration site; (ii) predicted on-target activity; (iii) absence of exonic off-targets with high cleavage probability and (iv) overlap with the intended integration site. The latter is preferable since a target site that overlaps with the intended integration site will be disrupted during template integration, which prevents re-cleavage without having to introduce additional point modifications in the template. In addition, although cleavage near the intended integration site is preferable, a distal target with a high on-target score may be preferred over a proximal target site with a poor on-target score. Furthermore, while a general off-target score is helpful as an overall indicator of target site specificity, predicted off-targets should be examined on an individual basis to identify off-targets that are particularly detrimental in the context of the research application. These critical sites should be screened in selected clones to confirm the absence of indels. In Figure 3, we summarize how the above criteria can be used to interpret the quality of nuclease target sites in proximity to the genomic site of integration.

Figure 3.

Nuclease target site choice. Schematic representation of a dsDNA with 5 candidate gRNA (coloured arrows). To ensure optimal HDR of a donor template at the hypothetical integration site, the possible gRNA are ranked with respect to their cleavage dynamics (on and off-target scores), as well as in relation to their location and orientation towards the intended integration site. In general, once CRISPR machinery becomes active within a cell, both alleles will be cleaved. Since NHEJ is dominant over HDR this often leads to the generation of indels within alleles that are not modified by HDR. Generated DSBs within an exonic region therefore often result in a heterozygous null allele in addition to the correctly modified allele. If a heterozygous null allele is detrimental to the application of the modified lineage, the modification can be introduced using a DNA nick instead of a DSB (50,52), which we will discuss in a later section. Alternatively, a DSB can be induced within the nearest intron or 3’ UTR, where indels are less likely to interfere with expression or protein function. However, positional divergence between the cleavage site and the intended integration site creates an internal region of homology between the genome and donor. This promotes undesired recombination outcomes and reduces the effective probability of generating a correctly modified allele (Figure 4A) (67). Internal homology also occurs when two modifications are simultaneously introduced that have intervening sequences that are unmodified, for instance when generating floxed alleles with two LoxP sites. The probability that the modification located distal to the DSB is not incorporated increases with the extent of internal homology (68). There are two ways in which internal homology can be prevented from participating in HDR. One strategy disrupts internal homology between the donor and genome by recoding in the corresponding region of the donor (Figure 4B) (69). Alternatively, the internal homology region can be excised from the genome by introducing two DSBs (Figure 4C) (70), which is a proven strategy when replacing a genomic sequence (71,72). However, this does increase the incidence of heterozygous null modifications by promoting the excision and inversion of alleles (67). In summary, we advocate a preference for recoding as a strategy to minimize small regions of internal homology, whereas excision might be best when dealing with extensive internal homology during genomic sequence replacement.

Figure 4.

Internal homology increases the risk of undesired recombination events. (A) Positional divergence between DNA cleavage (DSB, indicated with red arrow heads) and the intended integration site (green arrow heads) creates the possibility for an alternative mode of recombination wherein an internal homology region participates. In the green panel, the internal homology region is indicated (dashed grey lines), but HDR is mediated via the intended homology arms at the extremities of the donor template (indicated in blue and red). Alternatively, the internal homology region (red) participates with the upstream homology arm (blue) in HDR, thereby failing to integrate the insert from the donor template (red panel). Since the size of internal homology is proportional to the probability of undesired recombination events, there are two widely used preventive strategies that minimize the extent of internal homology. (B) Internal recombination can be prevented by introducing (silent) mutations (red stars) in the internal homology region of the donor template (recoding). (C) Alternatively, in case of extensive internal homology, the region can be excised from the genome using dual targeting of the nuclease to introduce two DSBs that flank the internal homology sequence. The positional divergence between the genomic integration site and the nuclease target site has to be compatible with the type of donor template that is used, since the donor needs to be capable of bridging the gap between those sites. In addition, short homology arms demand cleavage in close proximity to the intended integration site (53,73), while larger homology arms are more tolerant to distal cleavage (67). As a rule of thumb, we suggest to limit nuclease positional divergence to less than 10% of homology arm length before taking steps to counteract internal homology.

AT A GLANCE

Monomeric type II CRISPR nucleases display the highest on-target activity and provide sufficient specificity for research applications. Exploring target sites for multiple monomeric Cas9 as well as Cpf1 variants increases the probability of identifying a high-quality target site. Cleavage proximity to the integration site and predicted on-target activity are the main parameters that researchers should use when selecting a nuclease target site to initiate HDR. We recommend DNA cleavage to be initiated within a distance corresponding to 10% of homology arm length with respect to the integration site. Internal homology between the genome and donor should be minimized, and if extensive, excised from the genome.

SELECTION STRATEGIES

Due to the relatively low success rate of accurate HDR-mediated donor integration, the end result is often a mono-allelic modification, particularly when the locus is inactive and at closed chromatin conformation (74,75). Therefore, it is important to decide in advance whether a homozygous modification is necessary. Furthermore, if a heterozygous modification is sufficient, the researcher has to consider whether perturbation of the ‘secondary’ non-recombined allele is detrimental and thus if ‘functional loss-of-heterozygosity’ needs to be prevented at all costs. Ultimately, the selection method used to identify desired clones will largely dictate the genome editing strategy as a whole. In the following paragraphs, we will discuss a variety of selection approaches and the context in which they are useful.

Direct selection for precise genetic modifications

In some cases, the desired modification conveys a phenotype that can be directly selected for. A clear example is the genomic integration of a fluorescent protein sequence that allows enrichment of correctly targeted cells via fluorescence activated cell sorting (FACS) (76). Providing that the fluorescent protein, e.g. fused to a protein of interest, will be expressed at detectable levels in the cell type of choice and that these cells are compatible with FACS. Alternatively, modifications that allow immunogenic detection, such as small epitope tags, could offer similar opportunities for FACS-based enrichment. Another class of modifications that allow direct selection are those that convey a selective advantage over the parental lineage by means of modifying culture conditions. This strategy is frequently used in the generation of oncogenic model systems since many oncogenic driver mutations activate signalling pathways that promote growth factor independency. As a result, omitting growth factors from the culture conditions enables enrichment of correctly modified cells (22,77). However, this mode of phenotypic selection risks co-selection for orthogonal oncogenic mutations that arise instead or in addition to the desired modification. Although direct selection for the desired modification allows donor integration without additional modification of the genome, experimental settings are not always compatible with this mode of selection and may therefore require the co-integration of a genetic selection element.

Genetic selection elements

Genetic selection elements commonly drive the expression of a protein that conveys drug resistance or allows fluorescent detection to support subsequent enrichment strategies. The protein that is expressed by the selection element should be non-invasive and able to provide a selectable phenotype in the targeted cells. Expression of the selection element can either be controlled by its own independent transcriptional regulatory elements or alternatively by endogenous regulatory elements. Independent genetic selection elements are under transcriptional control of a dedicated promoter with ubiquitous activity so that it is expressed in virtually all cell types. In addition, the cDNA that encodes the selection marker is followed by a strong PolyA transcription terminator. As a result, the cassette will function as an independent transcriptional entity. In cases where the intended genomic modification is located within an exon, the selection cassette is frequently integrated within the nearest intron (78). However, in case the last exon is targeted, e.g. to generate C-terminal fusion proteins, the element is commonly integrated within the endogenous 3’UTR of the gene of interest (79). The independent selection cassette should be integrated in close proximity to the intended genetic modification in order to minimize internal homology between the genome and donor. However, caution should be taken with respect to disruption or relocation of regulatory elements. An RNA motif prediction tool such as RegRNA 2.0 (80) can be used to determine the least invasive integration site. In addition, DNA sequences of candidate integration sites can be compared between mammalian species in order to avoid conserved regions. The integration of an exogenous polyA transcriptional terminator upstream of the last coding exon can prematurely terminate transcription of the targeted allele (81). To prevent truncated transcription, the independent selection cassette can be integrated in the opposite transcriptional orientation with respect to the gene of interest, thereby escaping polyadenylation signal recognition by the polymerase that transcribes the targeted allele. However, integration of a strong exogenous promoter in the vicinity of endogenous regulatory elements may influence the expression level and pattern of the modified allele (16,82,83). In addition, some selection cassettes contain cryptic splice sites which may interfere with splicing of the targeted allele. (83,84). In order to minimize potential interference with endogenous expression it is good practice to remove the cassette once clones have been selected. Often this is achieved by flanking the selection cassette with either LoxP or FRT recombinase sites, allowing removal of the intervening sequence upon transient expression of the appropriate recombinase (85). This process leaves only a single recombinase site of about 30 bp in length behind, in contrast to the average size of selection cassettes of ∼2 kb. Unwanted DNA sequences that are practical leftovers from the editing procedure are often referred to as a scar sequence. However, although minimal in size and widely considered intrinsically inert, the localization of the scar sequence might still interfere with expression of the modified allele (86). Scarless removal of a selection cassette can be achieved via nuclease mediated excision followed by MMEJ-based repair of appropriately designed microhomology (87) or by using the piggyBac transposase (88). For genomic modification strategies that include a selection cassette, we recommend insertion of the cassette within a non-conserved region in-between the 5’ splice donor site and the downstream branch point (Figure 5). Insertion in-between the branch point and the 3’ splice acceptor site is not advisable since it will extend the branch sequence beyond the consensus length. When a selection cassette will be included within the endogenous 3’UTR of the locus of interest, a less invasive location can usually be found ∼50–100nt downstream of the endogenous stop codon.

Figure 5.

Intronic integration and removal of a selection cassette. Schematic representation of HDR via a large DNA donor template that includes an autonomous selection cassette (in the opposite transcriptional orientation) to provide neomycin resistance. The selection cassette is integrated in the intron downstream of the integration site. Ideally the positioning of the cassette is in close proximity to the intended modification (mutant exon) to minimize internal homology. However, it is essential that sequences and relative locations of important regulatory elements for correct splicing remain intact (such as the 5’ splice donor site and the 3’ branch point). After clonal selection, the cassette is ideally removed from the genome in order to exclude undesired influence of the selection cassette on the expression levels of the endogenous gene. Multiple strategies exist for removal, for instance via Cre/Flp mediated recombination of LoxP or Frt sites that flank the cassette (red triangles). The minimal left-over sequence, in this case a single recombination site of ∼30nt, is often referred to as a scar sequence. Scar-free removal strategies are available (see text). As an alternative to independent expression of a selection cassette, a genetic selection element may be positioned under the transcriptional control of the endogenous allele of interest. This type of selection element does not include a promoter and polyA transcription terminator, and instead expression reflects the level and pattern of the gene of interest. The selection element is either co-integrated with an internal ribosome entry site (IRES) into the endogenous 3’UTR to become expressed as a bicistronic message (79,89), or as a C-terminal fusion protein interspaced by a ribosome skipping 2A peptide (90). It is important to realize that the stringency for selection depends in both scenarios on the activity of the endogenous promoter. Both methodologies have been successfully used in multiple studies. However, insertion of an IRES sequence into the endogenous 3’UTR may influence expression levels (91), presumably by altering mRNA stability, while C-terminal 2A peptide fusion leaves the endogenous 3’UTR intact. On the other hand, the relative positioning of an IRES within the 3’UTR is flexible while C-terminal 2A peptide fusion requires integration immediately upstream of the endogenous stop codon. Furthermore, while the translational efficiency of an IRES might substantially deviate from the expression levels of the upstream gene, the newest generation of 2A fusion peptides generate near equimolar protein ratios. Therefore, 2A peptide reporters are more suitable as a readout of expression levels (90). On the other hand, a downside of the 2A peptides is the addition of a 19–22 amino acid peptide at the C-terminus of the upstream protein, which can potentially interfere with normal protein function.

Sampling-based selection

Direct selection for correct template integration is convenient via the simultaneous integration of a genomic selection element. However, this is not always necessary or even preferable. Foremost, using a genomic selection element requires construction of a larger donor even if the desired modification is only a single base substitution. Moreover, scarless removal of a selection cassette remains laborious to engineer, often requiring negative selection strategies. Sampling-based selection is an alternative strategy to identify desired clones. Although broadly applicable, this approach is especially helpful when generating delicate disease models that require solitary integration of their respective mutation. Sampling-based selection generally involves enrichment for cells that obtained transient expression of the designer nuclease. A common strategy relies on FACS-based enrichment of transfected cells via co-expression of 2A-GFP (92) or the use of fluorescently labelled tracrRNA (93). Alternatively, if the cell line is not compatible with FACS, transient puromycin selection can be used instead (94). When integrating epitope tags and other small modifications, the frequency of correctly modified clones can be estimated within the bulk population using TIDER, an adaptation of the popular TIDE algorithm which decomposes Sanger sequencing data generated from multiple alleles (95,96). This will provide the researcher with an impression of the number of clones that need to be screened in order to obtain a correctly modified clone. In the next section we will discuss the use of TIDER or PCR-based strategies to determine the zygosity state of the modified alleles within individual clones. We recommend sampling-based selection for modifications that are compatible with ssDNA donor design since these can often be integrated with relatively high efficiency. We advise against sampling-based selection when: (i) integrating large DNA donor templates that often suffer from low integration efficiencies; (ii) when working with model systems that are relatively laborious to maintain and scale and (iii) when the projected editing efficiency is low.

Screening clones using PCR-based strategies.

Once clones are obtained they have to be screened to confirm correct editing and determine the zygosity of the modified alleles. A straightforward strategy involves the design of primer pairs that are able to discriminate between non-integrated (wild-type) and correctly modified alleles. For insertion strategies, this requires a PCR that spans the genomic integration site with at least one primer annealing to endogenous DNA sequences outside the homology arm regions (Figure 6A). This approach prevents false positive amplicons in cases where the donor template randomly integrated into the genome. The generated PCR amplicon allows discrimination of wild-type and insertion alleles based on product size. If a clone generates amplicons corresponding to both wild-type and insertion alleles this indicates heterozygosity, whereas the absence of a wild-type amplicon indicates homozygosity. If the integrated sequence is so large that a corresponding amplicon would preclude efficient genomic PCR amplification, a secondary primer can be designed which anneals only to the integrated DNA sequence.

Figure 6.

PCR-based identification of correctly modified alleles. PCR-based assays can be employed to expedite screening of clones for correctly modified alleles (A). For insertion strategies, zygosity can be determined by discrimination between PCR amplicon sizes spanning the integration site. Subsequently, sequence integrity can be confirmed by sequencing of the amplicons. (B) Discrimination based on size is not always possible when replacing DNA sequences with modified versions. Sequence specific primers can be used to generate amplicons that are specific for WT or modified alleles. If a genomic sequence is to be replaced, a PCR spanning the corresponding region often allows discrimination between wild-type and replacement alleles based on product size. In cases where both amplicons are of similar size (wild-type and mutant), primer sets should be designed that amplify either wild-type or mutant specific amplicons (Figure 6B). While a single primer set is often able to identify zygosity, we recommend the design of redundant primer sets since genomic PCR’s can be challenging. In addition, PCR amplicons should span the entire modified sequence so that the integrity of correctly modified alleles can be confirmed via DNA sequencing of the amplicons. The absence of indels within the non-recombined wild-type allele can be analysed by decomposing the Sanger sequencing data using TIDE, or alternatively by sub-cloning the amplicon into cloning vectors prior to individual Sanger sequencing of multiple clones representing single alleles. Genomic modifications that do not allow direct selection may require co-introduction of a genetic selection element, expressed either as an independent transcriptional entity or under control of an endogenous promoter. Removal of a genetic selection element is crucial to minimize potential artefacts. Sampling-based selection is a broadly applicable alternative to genomic selection cassettes and is recommended for modifications that are compatible with ssDNA donor design. A single genomic PCR is often able to discriminate between wild-type and modified alleles based on amplicon size.

DONOR COMPOSITION

DNA donor templates can consist of synthesized single-stranded oligodeoxyribonucleotide (ssODN) donors, larger ssDNA fragments, plasmid- or viral-based donor vehicles, and PCR amplified double-stranded (ds)DNA donors. Each donor type offers distinct advantages and has a unique demand for the extent and distribution of homology. In general, the donor type that is best suited for the introduction of a particular modification depends on the size of the modification, but also on the selection strategy that can be used.

ssDNA donors

Efficient ssDNA donor integration via HDR pathways can be stimulated by a DSB or DNA nick(s) (53,71). The compact nature of ssDNA donors results in a relatively high concentration of donor molecules within each cell, which is thought to enhance the probability of alignment between the donor and target locus (52). Indeed, precise editing efficiency is generally higher when mediated by ssDNA donors as compared to plasmid-based donor vectors (21). Moreover, editing efficiency increases proportionally with donor concentration (97). Commercially synthesized ssODN donors allow efficient integration of modifications up to ∼60 nt during synthesis-dependent strand annealing (SDSA) mediated repair of Cas9 induced DSBs (53). The proposed mechanism of SDSA is relevant for this discussion. It involves: (i) 5’ end resection of the cleaved genomic strands and subsequent base pairing with the 3’ arm of the ssODN; (ii) extension of the genomic sequence using the ssODN as a template and (iii) capture of the opposite genomic strand by the newly synthesised homology (69). Many studies have contributed to optimal design parameters for ssODN donors in conjunction with blunt-end DNA cleavage, which we summarized in Figure 7A. Work by Richardson et al. initially suggested that ssODN polarity (i.e. ssODN in sense or antisense orientation) should be complementary to the strand that is not heteroduplexed with Cas9 (52). However, this finding has not been consistent with subsequent studies (42,98,99). Paix et al. propose an alternative model where ssODN polarity should be adjusted such that base pairing between the 3’ arm of the ssODN and the genomic strand is not interrupted by the intended modifications (69). As a consequence, optimal ssODN polarity depends on the relative position of the generated DSB in relation to the integration site. If the DSB is generated downstream of the intended integration site then a sense ssODN should be preferred, whereas cleavage upstream of the integration site favours an antisense ssODN. The polarity rule by Paix et al. is in agreement with the performance of ssODNs in the study by Richardson et al. and with other published work (52,98,99). In addition to polarity, the length and distribution of homology should be considered. Richardson et al. proposed that the 5’ arm has a greater demand for homology (52). This asymmetric distribution of homology has been confirmed in an independent study by Liang et al. (98). In addition, while in a study by Guo et al. an asymmetric ssODN underperformed, we note that the asymmetric ssODN had an unfavourable polarity according to Paix et al. while the symmetric ssODN had the correct polarity, which may explain the underperformance of the asymmetric ssODN (99). We therefore recommend ssODN polarity according to Paix et al. with an asymmetric distribution of homology of 30–36 nt at the 3’ and 67–91 nt at the 5’ of the ssODN according to Richardson et al. Finally, the stability of the ssODN, and thus the editing efficiency, is significantly enhanced by phosphorothioate (PS) modifications of the last two nucleotide bonds at both the 3’ and 5’ end (98), which can be included during commercial synthesis. Collectively, these design rules have been established using Cas9 mediated cleavage although they will likely translate to Cpf1 mediated cleavage as well.

Figure 7.

Parameters for donor template design. (A) Schematic representation of ssODN template integration stimulated by a DSB (red arrow heads). Due to 5’ DNA end resection by DNA repair pathway associated proteins, an ssODN can be designed to either hybridize in the sense or antisense orientation. Complete hybridization from the 3’ end of the ssODN towards the DNA strand extremity (DSB site) is advised. In this example, ssODN hybridization with the antisense genomic resected overhang (red) would lead to mismatches, whereas hybridization with the sense overhang (blue) does not. Furthermore, homology should be distributed in an asymmetric fashion in favour of the 5’ homology arm. Phosphorothioate bonds prevent ssODN degradation. (B) Schematic representation of HDR using long ssDNA templates. (C) Schematic representation of HDR using PCR amplified dsDNA donor. Homology arms up to 80nt can be added to conventional PCR primers as 3’ overhangs. (D) Schematic representation of HDR using a donor vector. Vectors can be assembled as complex donor templates from multiple sources. Flanking of the template with nuclease target sites identical to the genomic target site allows excision and linearization of the template concurrent with genomic cleavage. We recommend the use of ssODNs for the precise generation of modifications ranging from single nucleotide insertions or deletions to small epitope tags and multi-codon deletions. When a single ssODN is used to introduce two or more modifications that are spaced apart, the intervening region should be recoded using silent mutations (69). As a consequence the entire region between the proximal and distal modifications is likely treated as a single region of heterology (53). Although this might negatively impact overall integration efficiency, it will favour simultaneous incorporation of all modifications since internal homology is prevented (53,69). If a single point mutation is desired we strongly encourage the researcher to explore whether a base-editor is applicable. Base-editors trigger nucleotide-conversion after being targeted to specific genomic loci based on their fusion to catalytically-dead or nickase Cas9. As a result, base-editors can efficiently mediate any single nucleotide substitution without genomic cleavage (100–102). Since base-editors do not rely on cleavage, no indel mutations are introduced within the secondary allele, keeping its coding sequence intact and preventing ‘functional loss-of-heterozygosity’. As such, base-editors are optimal for introducing heterozygous point mutations, like oncogenic mutations. Similarly, two or more nucleotide substitutions can be integrated without compromising the secondary allele by stimulating ssODN mediated editing via a DNA nick (52,53). Although polarity of the ssODN with respect to the generated nick determines the preferred HDR pathway, it has little influence on editing efficiency. Moreover, while optimal ssODN composition for this mode of nick editing has not been determined, an ssODN with 77 nt homology arms was integrated at roughly half the efficiency when stimulated by a DNA nick as compared to a DSB (53). Commercial ssDNA is now available up to 2 kb, which allows highly efficient generation of modifications that previously required construction of a dedicated donor vector (72,103,104). Precise genome editing by long ssDNA donors is likely mediated by the single-strand annealing pathway instead of SDSA. As such, the optimal design parameters for long ssDNA donors are likely to be different from ssODN design and have yet to be fully determined. Nevertheless, efficient editing is achieved with homology arms of 50–300 nt with no clear preference for a particular polarity of the ssDNA (Figure 7B). We highly recommend long ssDNA donors for modifications that are too large for ssODN synthesis. The only exceptions are current size restrictions and sequence complexities that are rejected by commercial suppliers, such as high GC content.

PCR generated dsDNA donors

Commercial synthesis of long ssDNA can be expensive and when performed ‘in-house’ requires construction of a vector with an appropriately located T7 promoter (104). A PCR generated dsDNA donor is a cheap alternative which is especially useful for medium sized insertions that allow direct selection, e.g. fluorescent knock-in alleles. Homology arms up to 80 nt (or even longer via a nested PCR) can be appended to the desired insertion as overhangs in the PCR primers (Figure 7C) (105–107). The integration efficiency observed for PCR-generated donors is similar to that of traditional donor vectors (108).

Plasmid or viral donor vectors

Modifications that are too large for ssDNA synthesis require construction of a viral or plasmid-based donor vector using molecular cloning techniques. Since a substantial part of these vectors consists of backbone elements such as bacterial selection cassettes or viral packaging sequences, the effective donor template concentration per cell is disproportionally reduced compared to ssDNA donors. On the other hand, donor vectors have sufficient capacity to include a genomic selection cassette, making absolute editing efficiency less important. Traditionally, donor vectors required relatively large homology arms of up to 2 kb in order facilitate efficient HR mediated donor integration by a designer nuclease-generated DSB (67,108). Homology arm length of donor vectors can be reduced to 0.6–0.9 kb by flanking the donor with nuclease target sites that are identical to the genomic target site (Figure 7D). As a consequence, a linear donor supply is liberated concurrent with genomic cleavage (108,109). The linear nature and relatively long homology arms of the excised donor fragment is thought to stimulate a novel HDR pathway, themed homology-mediated end joining (109). Currently, no direct negative consequences of in-vivo donor excision have been reported, and since nuclease target sites are easily included as overhangs in the PCR amplification of the homology arms, the construction of this type of dual-cut donor vector does not require extra labor. Moreover, a donor vector that is flanked with nuclease target sites is compatible with a novel strategy called in trans paired nicking (50), where concurrent nicking of the genome and donor triggers efficient integration without genomic cleavage. However, it remains unclear whether this strategy can trigger replacement of genomic sequence. We expect donor vectors flanked with nuclease target sites to develop into the preferred strategy for the integration or substitution of large transgenic elements. Since the homology arms of these vectors are relatively short, vector construction and subsequent genotyping is far more convenient when compared to the use of traditional donor vectors with longer homology arms. Due to the absence of backbone sequences ssDNA donors result in a high effective donor concentration which improves editing efficiency. Modifications up to ∼60 nt can be integrated using ssODN donors. Optimized ssODNs are constructed with an asymmetric distribution of homology and a polarity determined by the position of the DSB relative to the intended integration site. Modifications up to 1.9 kb can be integrated using commercially available long ssDNA donors. PCR generated donors with short homology arms allow cost efficient generation of fluorescent knock-in alleles. Modifications too large for ssDNA donor templates require vector construction. By flanking the donor with nuclease target sites the homology demand can be reduced to 0.6–0.9 kb.

DELIVERY OF HDR COMPONENTS

Efficient delivery of genome editing components into target cells is a crucial step in the process towards generation of genetically modified model systems. The delivery method should be matched to the model system of choice and to the manner in which the designer nuclease and donor are presented to the cell.

In vitro delivery

To date, most publications involving precise CRISPR-mediated genome editing have used established transfection methods including liposomal transfection, electroporation and peptide-mediated cell penetration, to deliver nuclease expression constructs and donor templates into target cells (23,110–113). Although generally robust and broadly applicable, plasmid delivery has particular drawbacks that should be considered. Foremost, the introduction of high concentrations of foreign DNA into cells can trigger an immune response in certain cell types, which can eventually lead to programmed cell death (114). In addition, prolonged expression of nucleases from plasmid DNA increases the frequency of indel mutations at off-target sites (115,116) and along similar lines, prolonged stability of plasmid DNA increases the probability of random plasmid integration into the genome. To circumvent these issues, nucleases can be delivered as pre-assembled ribonucleoprotein complexes (RNPs) or as mRNA molecules along with in vitro transcribed or commercially synthesized gRNA (115,117,118). Messenger RNA is significantly less stable compared to plasmid DNA, resulting in a relatively short but robust spike of nuclease activity. Similarly, RNP delivery circumvents translation, resulting in an immediate spike of nuclease activity followed by a rapid decline due to protein degradation. In each case, nuclease activity is short lived, which reduces the probability of generating off-target indels. Furthermore, the immediate spike in nuclease activity ensures co-presence with initial high concentrations of DNA donor templates prior to their degradation. Indeed, RNP delivery in particular is associated with a significant increase in precise genome editing efficiency (98,119). In addition, since assembly of RNPs or delivery of mRNA is compatible with chemically modified gRNA, these protocols open up new avenues of maximizing editing efficiency. Some cell lineages, such as primary cells, remain difficult to transfect with classical transfection methods. Instead, these cells are often virally transduced (120). However, the cargo restrictions of many viral vectors often prohibit CRISPR mediated genome editing applications. BacMam technology, which employs baculoviral vectors that have the capacity to carry large DNA cargo up to 38 kb in length (121), has recently been used to address viral cargo issues. All the components required to support CRISPR-mediated precise integration of large DNA constructs, including the donor template itself, can be integrated within a single baculoviral genome (122,123). In addition, transduction in mammalian cells occurs in a transient manner by default, thereby minimizing the risk of viral integration into the host genome. However, since viral transduction is associated with general safety risks, alternative transfection methods are under continuous development (124,125). While common in vitro plasmid delivery methods have been widely applicable in mono-layer cell cultures, transfection of 3D organoid cultures is more challenging. Although both transfection and electroporation can successfully deliver plasmids in organoid structures, the resulting transfection efficiency is generally low (77,110,111). An optimized electroporation protocol has demonstrated significantly improved transfection efficiency in comparison to liposomal transfection (111). Nevertheless, cell viability post transfection generally remains low and consequently large quantities of organoid-derived cells are required in order to obtain successfully edited clones. Alternatively, human intestinal organoids have been virally transduced to deliver CRISPR machinery, however this method is not compatible with ssDNA donor delivery (123). To conclude, delivery of nuclease expression constructs following conventional protocols is convenient for less demanding applications such as heterozygous modifications that allow direct selection. RNP delivery, in particular when assembled with enhanced chemically modified gRNA, supports highly efficient precise editing in conjunction with ssDNA donors (71,98), and more recently in combination with AAV donor transduction (126). Therefore, we encourage the use of RNP delivery protocols whenever applicable, and especially when generating modifications that depend on a sampling-based selection approach. In this regard, enrichment of an RNP transfected population can be achieved by assembling RNPs with fluorescent tracrRNA (93). Finally, cell lines that are particularly difficult to transfect can usually be virally transduced. In this regard, viral-based delivery of genome editing components may be useful as a platform for the precise incorporation of a particular modification across many different cell lines of the same organism.

In vivo delivery

Although significant progress has been made in the development of in vitro delivery protocols that target a cellular population in its completeness, in vivo delivery is far more challenging. Initial publications demonstrated proof-of-principle in vivo HDR-mediated genome editing in adult mice using non-viral hydrodynamic tail vein injections, co-delivering Cas9-sgRNA expression constructs and ssODN donor template into the liver (127,128). However, among other things due to inferior delivery methods, HDR-mediated gene editing efficiency remained very low. The sporadic introduction of cancer mutations in vivo for the rapid development of human cancer models in mice has mainly been supported by locally injected lentiviral transduction (129,130). Further development of these protocols may benefit from the latest generation of high-capacity adenoviral vectors that are able to carry both the nuclease and gRNA scaffolds in one viral particle (131). The ultimate clinically-related goal of highly efficient genome editing is to correct disease mutations and phenotypes in living patients in terms of personalized medicine. In contrast to in vitro culture systems that allow clonal selection and outgrowth of successfully modified cells, most disease phenotypes for which in vivo genome editing is considered a potential clinical break-through require mutational correction in a large fraction of cells that manifest the diseased phenotype. Delivery methods to accommodate this level of precise nuclease mediated editing are currently out of reach. In addition, since many genetic conditions are caused by single point mutations, base editors are a far more likely candidate for clinical translation. Since the scope of our review is to facilitate guidelines for researchers that would like to genetically engineer their preferred model system, we refer to a number of excellent reviews with respect to in vivo genome editing for clinical applications (132–134). Plasmid based nuclease delivery is convenient for less demanding applications such as heterozygous modifications that allow direct selection. We encourage the use of RNP delivery protocols in conjunction with chemically modified gRNA, especially when depending on a sampling-based selection approach. BacMam technology is recommended for the generation of large genomic modifications in difficult to transfect cell lines, since it allows the delivery of all HDR components in a single construct. Clinical applications of CRISPR-mediated precise gene correction are currently out of reach.

COMPLEMENTARY STRATEGIES TO ENHANCE PRECISE EDITING EFFICIENCY

In addition to optimized donor template design, nuclease choice, genomic target site selection and delivery, there are additional complementary strategies that may further enhance CRISPR-mediated HDR efficiency. A major focus has been the development of tools to suppress the competing NHEJ repair pathway. Strategies include depletion or inhibition of the NHEJ pathway proteins KU70, KU80 and DNA ligase IV using either shRNAs (107,135), Adenovirus 4 (Ad4) proteins (135), or molecular inhibition of DNA ligase IV via small molecule inhibitors such as SCR7 (107,135–138). Similar to observations in DNA ligase IV-deficient flies, depletion or inhibition of DNA ligase IV reduced NHEJ activity, while increasing HDR in both mouse and mammalian cell lines (135,136,139). A similar effect was observed upon the depletion of the KU complex (137). NHEJ pathway suppression may be of particular interest when generating homozygous mutations, as the Ad4 protein-induced degradation of DNA ligase IV enhanced the net yield of homozygous clones when used in combination with selection markers (135). However, significant care should be taken when using the SCR7 compound, as it can enhance the number of off-target integrations and induces cell toxicity when used at high concentrations. Also, the sensitivity to NHEJ inhibition seems to be cell type specific, as improvements in HDR efficiency varied significantly between cell lines and often does not result in a notable beneficial effect (107,108,136). The search for compounds that enhance HDR continues. One study demonstrated resveratrol to be an even more potent enhancer of HDR-mediated genome editing efficiency when compared to SCR7 (138), albeit the molecular mechanism governing its therapeutic properties remain elusive (140). In addition, two new compounds that enhance Cpf1-mediated HDR have recently been identified (141). Another interesting approach is cell cycle synchronization in combination with timed Cas9 RNP delivery to focus nuclease activity to the G2/M-phase of the cell cycle when HDR is dominant. Indeed, cell cycle synchronization prior to Cas9 RNP delivery resulted in a significant increase in HDR efficiency in a variety of cell types (8,97,108). As expected, this also reduced the frequency of NHEJ events (108). In addition, the minimal concentration of Cas9 RNPs and donor DNA for sufficient HDR was substantially lower (8,97). However, cell cycle inhibitors by themselves may significantly affect cell viability, thereby decreasing the effective number of targetable cells in the population (8). In addition, several reports demonstrated that combining NHEJ inhibition and cell cycle synchronization did not further improve HDR efficiency, suggesting that HDR is already the predominant repair pathway during the G2/M-phase (8). Several reports have investigated the effect of temperature on nuclease mediated HDR. For instance, cold shock treatment at 32°C for 24–48 h post transfection was shown to enhance Cas9 mediated HDR in human induced pluripotent stem cells (99). However, a similar protocol turned out to be detrimental to Cas9 induced HDR in many other human cell types (142). A relative heat shock to 34°C in zebrafish significantly enhanced Cpf1 mediated HDR but had no effect on Cas9 mediated HDR (42). Collectively these results suggest that the effect of temperature on HDR rates requires further investigation before it should be generally applied. Although complementary strategies are useful in the context of maximizing precise editing efficiency we advise against using these strategies by default. Rather, they should be used in parallel or as a back-up plan when initial genome editing strategies yielded an insufficient number of clones. Inhibition of NHEJ, either via co-expression of Adenovirus 4 proteins or via small molecule inhibitors of DNA ligase IV, can enhance HDR-mediated genome editing. Cell cycle synchronization in the G2/M-phase combined with timed RNP delivery induces nuclease activity in the HDR dominant phase of the cell cycle. Complementary strategies should not be used by default but rather in parallel or as a backup strategy.

DISCUSSION

In the last couple of years, CRISPR-mediated genome editing has evolved at a very rapid pace. The expansion of the CRISPR-associated toolkit and our increased understanding of the molecular mechanisms that govern HDR have improved our ability to accurately edit mammalian genomes. Whereas many reviews have shed light on the historical and molecular background of CRISPR technology, up-to-date guidelines with respect to the design of HDR-mediated genome editing strategies were lacking. This review aims to function as a decision-making guide to assist researchers in using state-of-the-art genetics to generate mutant variants of their model system. It should be of special interest to classical cell biologists and biochemists without extensive genetic backgrounds. Especially in 2D cell cultures, introducing disease-related point mutations or protein fusions at endogenous loci is highly efficient. Indeed, solely relying on transient overexpression of (mutant) effector proteins is no longer recommended since scientific standards increasingly demand genetic modifications at endogenous loci. However, we stress the importance of a well thought out genome editing strategy in advance, since the entire process from design to a validated model system may still require a couple of months work. To summarize the current knowledge, opportunities and strategic options available to researchers, we will discuss three different design examples where many aspects discussed in this review will be placed into a real context. Example 1 (Figure 8A): Homozygous loss of phenylalanine at position 508 of the Cystic Fibrosis transmembrane conductance regulator (CFTR) is the most frequent genetic variant that causes Cystic Fibrosis (143). An accurate human model system will require homozygous deletion of ΔF508 without additional genetic scarring. The small size and homozygous nature of the deletion strongly favours an ssODN donor. A SpCas9 target site is available that cleaves just one nucleotide upstream of the deletion site, favouring an antisense ssODN designed with an asymmetric distribution of homology. Since deletion of ΔF508 will destroy the Cas9 target site, no additional silent mutations are required within the donor. Although the selected target site has a poor predicted on-target efficiency, better performing target sites cleave further away from the deletion site, likely leading to an overall decrease in performance when working with an ssODN. In addition, no well positioned Cpf1 target sites are available. RNP transfection will maximize editing efficiency, which enhances the isolation of homozygous clones. Since direct selection for ΔF508 CFTR is not available, RNPs will be assembled with chemically stabilized fluorescent tracrRNA in combination with commercially synthesized and chemically stabilized CRISPR-RNA containing the guide sequence. Fluorescent tracrRNA allows FACS-based single cell sorting of transfected cells. In addition, isolating a bulk population for TIDER analysis will give an estimate for the frequency of homozygous clones.

Figure 8.

Designing genome editing strategies: 3 real examples. (A) Schematic workflow of the practical steps and the sequence information for the process of generating ΔF508 CFTR mutant cell lines. Top: The dsDNA sequence of the CFTR gene is presented around the intended modification site. In between the corresponding amino acid sequence is depicted. As a donor template, an asymmetric antisense ssODN is advised (see text). (B) Schematic workflow of the practical steps and the sequence information for the process of generating an mNeongreen knock-in at the C-terminus of hACTB in cell lines. Top: The dsDNA sequence of the hACTB gene is presented around its endogenous stop codon. The corresponding amino acid sequence is depicted in between the dsDNA. An ssDNA donor template is depicted below the schematic representation of the hACBT locus. Rationale for strategy design is described in the text. (C) Schematic workflow of the practical steps and the sequence information for the process of generating a CreERT2 knock-in in the hKRT20 locus via a P2A fusion at its C-terminus in human colon organoids. Top: A stretch of dsDNA sequence of the 3’UTR of the hKRT20 gene is presented. Below the locus is a schematic representation of the donor plasmid. Rationale for strategy design is described in the text. Yellow arrow indicates gRNA. Cleavage sites (DSB) are indicated with red arrow heads. PAM sequences are underscored. Example 2 (Figure 8B): Overexpression of fluorescent fusion proteins illuminates cell biology at the costs of altering protein homeostasis and potentially protein function. In contrast, endogenous tagging of the respective genes is potentially less invasive and probably truer to nature. Fluorescent beta-actin (ACTB) fusions are popular in cell biology (144). In order to insert mNeongreen (a bright monomeric green fluorophore (145)) at its C-terminus in a human cell line, we selected a highly active SpCas9 target site that cleaves within the stop codon of ACTB. The mNeongreen coding sequence will be fused in frame to the C-terminus of hACTB via a 3xGSS flexible linker (144), thereby also disrupting the selected SpCas9 target site. Considering the size of the modification, integration efficiency is maximized by using an ssDNA donor. About 100 nt downstream of the stop codon, the 3’ UTR runs into a stretch of sequence complexity which commercial suppliers currently reject. We therefore limited the length of the 3’ homology arm of the sense ssDNA donor to 100 nt. By contrast, 5’ homology is more flexible which we extended to 200 nt, providing an extra buffer against 5’ degradation. The nature of the modifications allows direct selection by FACS and may even permit enrichment for homozygous clones if desired. Correct integration of the template, as well as its zygosity, can easily be determined using a single PCR that spans the integration site. In addition, an additional primer set can be designed with one primer placed on top of the nuclease cleavage site to facilitate identification of heterozygous clones where the secondary allele remained intact. Example 3 (Figure 8C): Patient-derived organoids recapitulate the stem cell driven differentiation hierarchy of self-renewing tissues, among others enabling the study of lineage differentiation (146). Unfortunately, genetic modification of organoids via sampling-based selection is often prohibited by poor transfection efficiency and laborious clonal expansion. In addition, when modifying a gene that is exclusively expressed in a terminally differentiated cell state, many direct selection strategies are problematic. A C-terminal P2A-CreERT2 fusion to Keratin 20 (KRT20) allows genetic lineage tracing within the enterocyte lineage of human colon organoids (147). In order to select for this C-terminal fusion, a selection cassette is integrated within the 3’ UTR at an unconserved position. The orientation of the selection cassette is not important in this instance since the cassette will be removed using Flp-mediated recombination in an undifferentiated population of cells. Due to the size of the modification (∼4 kb) and sequence complexity of the selection cassette, donor vector construction is required. During assembly of the donor vector, both homology arms should by flanked by a copy of the genomic target sequence. This approach will enable both in trans paired nicking as well as linear donor excision concurrent with genomic cleavage. While the relative position of the selection cassette with respect to CreERT2 creates a limited stretch of internal homology between the endogenous 3’UTR and the donor template, we recommend against introducing point mutations within the start of the 3’ UTR in order to avoid disruption of potential regulatory motifs. Instead, we minimize homology by placing the cassette in close proximity to the last exon. Innovative genome editing technologies will continue to be developed, thereby constantly modifying and changing existing editing procedures and methods. A recent innovation is RNP-donor conjugates that aim to deliver the DNA donor template directly at the site of nuclease activity (148,149). In addition, assembly of donor vectors by molecular cloning may soon be unnecessary as better ssDNA synthesis protocols are being developed (150). Ultimately, considering the progression toward nick-mediated editing and base-editors, we anticipate that the future of complex genome editing might not even involve nuclease activity at all.

150 in total

Review 1. Cre recombinase: the universal reagent for genome tailoring.

Authors: A Nagy
Journal: Genesis Date: 2000-02 Impact factor: 2.487

2. Chemically Modified Cpf1-CRISPR RNAs Mediate Efficient Genome Editing in Mammalian Cells.

Authors: Moira A McMahon; Thazha P Prakash; Don W Cleveland; C Frank Bennett; Meghdad Rahdar
Journal: Mol Ther Date: 2018-03-06 Impact factor: 11.454

Review 3. Intracellular DNA recognition.

Authors: Veit Hornung; Eicke Latz
Journal: Nat Rev Immunol Date: 2010-02 Impact factor: 53.106

4. Structure and Engineering of Francisella novicida Cas9.

Authors: Hisato Hirano; Jonathan S Gootenberg; Takuro Horii; Omar O Abudayyeh; Mika Kimura; Patrick D Hsu; Takanori Nakane; Ryuichiro Ishitani; Izuho Hatada; Feng Zhang; Hiroshi Nishimasu; Osamu Nureki
Journal: Cell Date: 2016-02-11 Impact factor: 41.582

5. Efficient intracellular delivery of native proteins.

Authors: Diego S D'Astolfo; Romina J Pagliero; Anita Pras; Wouter R Karthaus; Hans Clevers; Vikram Prasad; Robert Jan Lebbink; Holger Rehmann; Niels Geijsen
Journal: Cell Date: 2015-04-23 Impact factor: 41.582

6. Obligate ligation-gated recombination (ObLiGaRe): custom-designed nuclease-mediated targeted integration through nonhomologous end joining.

Authors: Marcello Maresca; Victor Guosheng Lin; Ning Guo; Yi Yang
Journal: Genome Res Date: 2012-11-14 Impact factor: 9.043

7. Microhomology-mediated end-joining-dependent integration of donor DNA in cells and animals using TALENs and CRISPR/Cas9.

Authors: Shota Nakade; Takuya Tsubota; Yuto Sakane; Satoshi Kume; Naoaki Sakamoto; Masanobu Obara; Takaaki Daimon; Hideki Sezutsu; Takashi Yamamoto; Tetsushi Sakuma; Ken-ichi T Suzuki
Journal: Nat Commun Date: 2014-11-20 Impact factor: 14.919

8. Efficient precise knockin with a double cut HDR donor after CRISPR/Cas9-mediated double-stranded DNA cleavage.

Authors: Jian-Ping Zhang; Xiao-Lan Li; Guo-Hua Li; Wanqiu Chen; Cameron Arakaki; Gary D Botimer; David Baylink; Lu Zhang; Wei Wen; Ya-Wen Fu; Jing Xu; Noah Chun; Weiping Yuan; Tao Cheng; Xiao-Bing Zhang
Journal: Genome Biol Date: 2017-02-20 Impact factor: 13.583

9. Scarless Genome Editing of Human Pluripotent Stem Cells via Transient Puromycin Selection.

Authors: Benjamin Steyer; Qian Bu; Evan Cory; Keer Jiang; Stella Duong; Divya Sinha; Stephanie Steltzer; David Gamm; Qiang Chang; Krishanu Saha
Journal: Stem Cell Reports Date: 2018-01-04 Impact factor: 7.765

10. Evolved Cas9 variants with broad PAM compatibility and high DNA specificity.

Authors: Johnny H Hu; Shannon M Miller; Maarten H Geurts; Weixin Tang; Liwei Chen; Ning Sun; Christina M Zeina; Xue Gao; Holly A Rees; Zhi Lin; David R Liu
Journal: Nature Date: 2018-02-28 Impact factor: 49.962

15 in total

Review 1. A Broad Application of CRISPR Cas9 in Infectious Diseases of Central Nervous System.

Authors: Anna Bellizzi; Nicholas Ahye; Gauthami Jalagadugula; Hassen S Wollebo
Journal: J Neuroimmune Pharmacol Date: 2019-09-11 Impact factor: 4.147

2. Modeling Splicing Variants Amenable to Antisense Therapy by Use of CRISPR-Cas9-Based Gene Editing in HepG2 Cells.

Authors: Arístides López-Márquez; Ainhoa Martínez-Pizarro; Belén Pérez; Eva Richard; Lourdes R Desviat
Journal: Methods Mol Biol Date: 2022

Review 3. Critical roles of macrophages in pressure overload-induced cardiac remodeling.

Authors: Dan Yang; Han-Qing Liu; Fang-Yuan Liu; Nan Tang; Zhen Guo; Shu-Qing Ma; Peng An; Ming-Yu Wang; Hai-Ming Wu; Zheng Yang; Di Fan; Qi-Zhu Tang
Journal: J Mol Med (Berl) Date: 2020-10-31 Impact factor: 4.599

4. An Optimized Preparation Method for Long ssDNA Donors to Facilitate Quick Knock-In Mouse Generation.

Authors: Yukiko U Inoue; Yuki Morimoto; Mayumi Yamada; Ryosuke Kaneko; Kazumi Shimaoka; Shinji Oki; Mayuko Hotta; Junko Asami; Eriko Koike; Kei Hori; Mikio Hoshino; Itaru Imayoshi; Takayoshi Inoue
Journal: Cells Date: 2021-04-30 Impact factor: 6.600

5. Gene-edited babies: What went wrong and what could go wrong.

Authors: Haoyi Wang; Hui Yang
Journal: PLoS Biol Date: 2019-04-30 Impact factor: 8.029

6. CRISPR-induced RASGAP deficiencies in colorectal cancer organoids reveal that only loss of NF1 promotes resistance to EGFR inhibition.

Authors: Jasmin B Post; Nizar Hami; Alexander E E Mertens; Suraya Elfrink; Johannes L Bos; Hugo J G Snippert
Journal: Oncotarget Date: 2019-02-15

7. Understanding off-target effects through hybridization kinetics and thermodynamics.

Authors: Nafisa N Nazipova; Svetlana A Shabalina
Journal: Cell Biol Toxicol Date: 2019-12-10 Impact factor: 6.691

8. Rapid Tagging of Human Proteins with Fluorescent Reporters by Genome Engineering using Double-Stranded DNA Donors.

Authors: Alexandre Paix; Dominique Rasoloson; Andrew Folkmann; Geraldine Seydoux
Journal: Curr Protoc Mol Biol Date: 2019-12

9. Precise Correction of Heterozygous SHOX2 Mutations in hiPSCs Derived from Patients with Atrial Fibrillation via Genome Editing and Sib Selection.

Authors: Simon Alexander Sumer; Sandra Hoffmann; Svenja Laue; Birgit Campbell; Kristin Raedecke; Viktoria Frajs; Sebastian Clauss; Stefan Kääb; Johannes W G Janssen; Anna Jauch; Karl-Ludwig Laugwitz; Tatjana Dorn; Alessandra Moretti; Gudrun A Rappold
Journal: Stem Cell Reports Date: 2020-09-24 Impact factor: 7.765

Review 10. CRISPR-Cas9 DNA Base-Editing and Prime-Editing.

Authors: Ariel Kantor; Michelle E McClements; Robert E MacLaren
Journal: Int J Mol Sci Date: 2020-08-28 Impact factor: 5.923