| Literature DB >> 28179977 |
Tuhin Kumar Guha1, Alvan Wai1, Georg Hausner1.
Abstract
Targeted genome editing has become a powerful genetic tool for studying gene function or for modifying genomes by correcting defective genes or introducing genes. A variety of reagents have been developed in recent years that can generate targeted double-stranded DNA cuts which can be repaired by the error-prone, non-homologous end joining repair system or via the homologous recombination-based double-strand break repair pathway provided a suitable template is available. These genome editing reagents require components for recognizing a specific DNA target site and for DNA-cleavage that generates the double-stranded break. In order to reduce potential toxic effects of genome editing reagents, it might be desirable to control the in vitro or in vivo activity of these reagents by incorporating regulatory switches that can reduce off-target activities and/or allow for these reagents to be turned on or off. This review will outline the various genome editing tools that are currently available and describe the strategies that have so far been employed for regulating these editing reagents. In addition, this review will examine potential regulatory switches/strategies that can be employed in the future in order to provide temporal control for these reagents.Entities:
Keywords: CRISPR/Cas9; Hammerhead ribozyme; Meganuclease; Regulatory switch; TALEN; Zinc finger nuclease
Year: 2017 PMID: 28179977 PMCID: PMC5279741 DOI: 10.1016/j.csbj.2016.12.006
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Examples of programmable genome editing tools. (a) Single-motif LAGLIDADG homing endonucleases, (b) double-motif LAGLIDADG homing endonucleases, (c) megaTAL, (d) MegaTev, (e) zinc-finger nucleases (ZFN), (f) transcription activator-like effector nucleases (TALENs), clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated proteins (Cas) systems using (g) Cas9 or (h) Cpf1, (i) targetrons, (j) triplex-forming oligonucleotide (TFO) nucleases, and (k) structure-guided nucleases (SGNs). EBS = exon-binding site; IEP = intron-encoded protein. The nuclease domain of FokI is used to engineer ZNFs, TALENs, and SGNs. Elements of this figure have been adapted from Hafez et al. [44] NRC Research Press License number: 3981970186164.
Generalized comparison of various genome engineering tools.
| Nuclease platform | MN | ZFN | TALEN | Targetron | CRISPR/Cas |
|---|---|---|---|---|---|
| Source | Organellar DNA, Bacteria, Phages | Bacteria, Eukaryotes | Bacteria ( | Organellar DNA, Bacteria, Phages | Bacteria ( |
| Number of component(s) | 1 | 2 | 2 | 2 | 1–2 (depends) |
| Availability of core components | Restricted | Available | Available | Restricted | Available |
| Type of recognition | Protein-DNA | Protein-DNA | Protein-DNA | RNA-DNA | RNA-DNA |
| Recognition site (bp) | 18–44 | 18–36 | 24–40 | 14–15 | 17–23 |
| Double strand break pattern | Staggered cut (4 nt, 3′ overhang) | Staggered cut (4–5 nt, 5′ overhang) | Staggered cut (Heterogeneous overhangs) | Staggered cut | SpCas9 creates blunt ends; Cpf1 creates staggered cut (5′ overhang) |
| Function | Nuclease, Nickase | Nuclease, Nickase | Nuclease, Nickase | Site-specific bacterial gene disruption | Nuclease, Nickase |
| Best suited for | Gene editing | Gene knockout, Transcriptional regulation | Gene knockout, Transcriptional regulation | Gene knockout | Gene knockout, Transcriptional regulation, Base editing |
| Ease of design | Difficult | Difficult; Design of new ZFNs is much easier than MNs | Moderate | Moderate | Easy |
| Dimerization required | No | Yes | Yes | No | No |
| Ease of generating large scale libraries | Laborious | Laborious | Moderately laborious | Unknown | Easy |
| Specificity | High | Low–Moderate | Moderate | Moderate | Low–Moderate |
| Multiplexing | Low | Low | Moderately high | Low | High |
| Gene drive | Possible | Unknown | Unknown | Unknown | Possible |
| Improved/other versions | MegaTEV, MegaTAL | AZP-SNase | Tev-mTALEN | Thermotargetron | Cpf1, eSpCas9 |
| Cost (USD) | 4000–5000 | 5–10,000 | < 1000 | 450–1500 | < 100 |
| Targeting constraints | Chromatin compaction | Non-guanosine rich sequence hard to target | 5′ targeted base must be thymine for each TALEN monomer | Entry of RNP complex in nucleus difficult | PAM sequence must follow target site |
| Efficiency/Inefficiency | Small size of MN allows use in a variety of viral vectors | Small size of ZFN expression cassettes allows use in a variety of viral vectors | Large size of each TALEN makes it difficult to pack in viral vectors | Large size of ribonucleoprotein complex makes it difficult for entry into nucleus | Commonly used Cas9 from |
| Methylation sensitive | Yes | Yes | Yes | Unknown | No |
| First use in human cells | 1994 | 2003 | 2011 | 2015 | 2013 |
| Immunogenicity | Unknown | Low | Unknown | Unknown | Unknown |
| Vector packaging | Multiple | Multiple | Few | Multiple | Multiple |
| Size of mRNA transcripts | Short | Short | Long | Short | Long |
| Mode of ex vivo delivery in animal cells | Electroporation, Viral transduction, Direct injection into zygotes | Electroporation, Lipofection, Viral transduction, Direct injection into zygotes | Electroporation, Lipofection, Viral transduction, Direct injection into zygotes | Electroporation, Lipofection | Electroporation, Lipofection, Viral transduction, Direct injection into zygotes |
Source [13], [21], [45], Number of component(s) [80], Availability of core components [80], Type of recognition [81], Recognition site (bp) [42], [49], [51], [55], [57], Double strand break pattern [42], [79], Function [45], [76], [77], [78], [79], [80], Best suited for [13], [45], [162], Ease of design [77], Dimerization required [76], Ease of generating large scale libraries [77], Specificity [86], Multiplexing [77], Gene drive [37], [38], [39], [40], [41], Improved/other versions [59], [103], [108], [117], [124], [155], [161], Cost (USD) [86], Targeting constraints [77], Efficiency/Inefficiency [77], Methylation sensitive [76], [101], First use in human cells [80], Immunogenicity [77], Vector packaging [86], Size of mRNA transcripts [80], Mode of ex vivo delivery in animal cells [77], [87].
Most widely used Cas9 is from Streptococcus pyogenes. However, Cas9 orthologs, such as the smaller Cas9 proteins from Streptococcus thermophilus CRISPR1 (ST1), N. meningitidis (NM) and the large Cas9 protein from Treponema denticola (TD), have shown promising results in genome editing [154].
1 (if using a complex guide RNA with Cas9 protein) or 2 (if guide RNA and Cas9 delivered separately) [77].
Availability of core components refers whether the building blocks are restricted to industry, available through and academic collaboration/purchase, or readily and freely available from not for profit agencies or commercial DNA synthesis [86].
The range used here, encompasses a number of different MNs and not only LAGs. The largest recognition site of a LAG is ~ 31 bp [94].
The 3′ hydroxyl group of the group II intron serves as a nucleophile and cleaves just one strand of the DNA homing site. The RNA lariat is reverse spliced into the target site and the endonuclease domain of the assisted protein partner cleaves the complementary DNA strand [57].
Although compromised activity is observed in eukaryotes and mammalian system due to the suboptimal codon usage, translational repression of the RT, nonsense-mediated decay (NMD) of group II intron-containing RNAs and suboptimal magnesium ion (Mg+ 2) concentrations, this RNA-guided endonuclease (RGEN) has shown potential for high site-specific retargeting in prokaryotes by reprogramming the intron EBS [194].
Recently improved specificity has been reported for eSpCas9 enzyme [161].
Approximate cost required to generate a single, gene specific candidate reagent [86].
Short oligonucleotides may be packaged along with guide RNAs into a single adeno-associated virus [154].
Vector packaging refers to the reagents ability to be packaged and delivered in multiple delivery vehicles. However, the size of TALENs makes them the most restrictive in this regard. To date, only one version (derived from S. aureus) of CRISPR/Cas9 can be packaged in an adeno-associated viral vector [86].
Fig. 2Strategies used to modulate Cas9 activity. (a) Group II intron (GII)-based switch, (b) separating Cas9 into two peptides, termed split-Cas9, (c) Tetracycline-inducible and reversible expression system, and (d) ligand-dependent dimerization of split-Cas9. Note: the strategy illustrated in (a) is based on the original study conducted by Guha and Hausner [185] on modulating expression of a meganuclease, not Cas9. A similar case is observed in (c), where Mandegar et al. [141] modulated the expression of dCas9, not Cas9. In both cases, a similar approach might also be possible with Cas9.
(e) Light-dependent dimerization of split-Cas9, termed photoactivatable Cas9 (paCas9), (f) intein-Cas9, which are activated by splicing of a ligand-dependent intein, (g) and unstable destabilizing domain-Cas9 (DD-Cas9) fusions, which are degraded unless provided with the ligand, Shield1.
Abbreviations: CAG = cytomegalovirus early enhancer/chicken β-actin promoter; Cas9 = clustered regularly interspaced short palindromic repeat (CRISPR)-associated protein 9; Cas9′ = partial Cas9; dCas9 = dead Cas9; FKBP = FK506 binding protein; FRB = FKBP-rapamycin binding; IPTG = isopropyl β-D-1-thiogalactopyranoside; KRAB = Krüppel-associated box; MN = meganuclease; mRNA = messenger RNA; rtTA = reverse tetracycline-controlled transcriptional activator; sgRNA = single-guide RNA; TRE = tetracycline response element; T7 = T7 RNA polymerase promoter; 4-HT = 4-hydroxytamoxifen; DD = destabilizing domain; nMag = negative Magnet; pMag = positive Magnet; sgRNA = single-guide ribonucleic acid. See text for more details.