Zexiang Chen1, Gitali Devi1, Amena Arif1, Phillip D Zamore1,2, Erik J Sontheimer1,3, Jonathan K Watts1,4. 1. RNA Therapeutics Institute, University of Massachusetts Chan Medical School, Worcester, Massachusetts 01605, United States. 2. Howard Hughes Medical Institute, University of Massachusetts Chan Medical School, Worcester, Massachusetts 01605, United States. 3. Program in Molecular Medicine, University of Massachusetts Chan Medical School, Worcester, Massachusetts 01605, United States. 4. Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, Massachusetts 01605, United States.
Abstract
CRISPR-Cas technology has revolutionized genome editing. Its broad and fast-growing application in biomedical research and therapeutics has led to increased demand for guide RNAs. The synthesis of chemically modified single-guide RNAs (sgRNAs) containing >100 nucleotides remains a bottleneck. Here we report the development of a tetrazine ligation method for the preparation of sgRNAs. A tetrazine moiety on the 3'-end of the crRNA and a norbornene moiety on the 5'-end of the tracrRNA enable successful ligation between crRNA and tracrRNA to form sgRNA under mild conditions. Tetrazine-ligated sgRNAs allow efficient genome editing of reporter and endogenous loci in human cells. High-efficiency editing requires structural optimization of the linker.
CRISPR-Cas technology has revolutionized genome editing. Its broad and fast-growing application in biomedical research and therapeutics has led to increased demand for guide RNAs. The synthesis of chemically modified single-guide RNAs (sgRNAs) containing >100 nucleotides remains a bottleneck. Here we report the development of a tetrazine ligation method for the preparation of sgRNAs. A tetrazine moiety on the 3'-end of the crRNA and a norbornene moiety on the 5'-end of the tracrRNA enable successful ligation between crRNA and tracrRNA to form sgRNA under mild conditions. Tetrazine-ligated sgRNAs allow efficient genome editing of reporter and endogenous loci in human cells. High-efficiency editing requires structural optimization of the linker.
CRISPR-Cas genome editing has
profoundly advanced biomedical research and holds promise as a therapeutic
modality.[1] CRISPR-Cas systems use programmable
guide RNAs that direct sequence-specific DNA cleavage by Cas nucleases.[2−5] CRISPR-mediated editing can be performed in cells or organisms by
DNA-, RNA-, or ribonucleoprotein (RNP)-based delivery of the effector
and guide RNA.[6] Direct chemical synthesis
can be used to generate chemically modified gRNAs with improved efficiency,
enhanced stability, reduced off-target editing, and improved delivery
and cellular uptake relative to unmodified guides.[7−16]Most CRISPR-Cas systems, including Streptococcus
pyogenes Cas9 (SpyCas9), use single-guide RNAs (sgRNAs)
containing >100 nucleotides.[2] Oligonucleotides
this long are expensive to synthesize, and yields tend to be low.
A dual-guide approach can be used, consisting of two short RNA pieces
(crRNA and tracrRNA) assembled by hybridization.[2,7] However,
sgRNAs are more effective than dual-guide RNAs for genome editing
in many cases. For example, after lipid nanoparticle delivery of Cas9
mRNA and guide RNA to the mouse liver, sgRNA provided substantially
higher in vivo editing than dual-guide RNA.[13] Therefore, efficient, scalable production of safe and effective
sgRNAs remains a key challenge.Ligation of short synthetic
RNAs offers an alternative to the synthesis
of a long RNA. However, enzymatic ligation is time-consuming and difficult
to scale.[17] Instead, chemical ligation
is relatively easy to implement and scale. Indeed, copper-catalyzed
azide–alkyne cycloaddition (CuAAC) chemistry has recently been
used to ligate two RNA components into a sgRNA.[18] However, the ligated sgRNAs were significantly less effective
than in vitro-transcribed sgRNAs, although their efficiency can be
aided by installing chemical modifications on the ligated sgRNAs.[18] Moreover, the copper catalyst is toxic, and
therefore, the products must be carefully purified before in vivo
use.[19−21] We have also observed that copper can have side reactions
with phosphorothioate (PS) linkages, which are desired for stability
and uptake of sgRNAs in vivo. Therefore, we sought to develop a metal-free
chemical ligation method to prepare sgRNAs.[22]The tetrazine-based inverse-electron-demand Diels–Alder
(IEDDA) reaction (Figure a) has emerged as a promising bioorthogonal ligation chemistry
with rapid kinetics that does not depend on a metal catalyst.[23−27] Despite many examples of nucleic acid applications,[27−36] the tetrazine-based IEDDA reaction has not been used to produce
long RNA molecules. Here we show that tetrazine-based ligation can
be used to form sgRNAs that support efficient genome editing.
Figure 1
(a) General
mechanism of tetrazine-based ligation and (b) proposed
tetrazine ligation strategy for sgRNA.
(a) General
mechanism of tetrazine-based ligation and (b) proposed
tetrazine ligation strategy for sgRNA.Standard sgRNA designs comprise a crRNA fused to a tracrRNA by
a four-nucleotide linker (GAAA).[2] This
linker forms a tetraloop that protrudes from the nuclease in CRISPR-Cas9
structures,[37−39] suggesting that SpyCas9 can accommodate structural
changes in the loop. We therefore chose the junction between the 3′-end
of the crRNA and the 5′-end of the tracrRNA as the ligation
point (Figure b).
We selected norbornene as the alkene in the tetrazine ligation because
it can be readily incorporated into solid-phase RNA synthesis.[28] We successfully synthesized norbornene phosphoramidites
(Figure S2) and installed the norbornene
moiety at the 5′-end of the tracrRNA during solid-phase synthesis.
To generate a tetrazine moiety on the crRNA, we synthesized the 3′-amino-modified
crRNA and then installed the tetrazine moiety via postsynthetic conjugation
with tetrazine N-hydroxysuccinimide (NHS) ester.
Because the structure of the linker may affect binding of the tetrazine-ligated
sgRNA to Cas9, we used two different tetrazine NHS esters to generate
sgRNAs with either a short linker 1 or a long linker 2 (Figure a, Figure S3). Linker 2 includes an extra octaethylene glycol (PEG8)
segment. To perform the tetrazine ligation, we combined the 3′-tetrazine-modified
crRNA and the 5′-norbornene-modified tracrRNA and incubated
the molecules under mild conditions (20 mM Tris-HCl, 200 mM NaCl,
pH 7.4) for ∼20 h at room temperature (Figure S4). We verified the success of the ligation reactions
by PAGE analysis (Figure S5) and confirmed
the identities of the purified ligation products by HPLC-MS (Table S1 and Figures S7 and S8).
Figure 2
Initial testing of tetrazine-ligated
sgRNAs with linkers 1 and
2. (a) Illustration of tetrazine-ligated sgRNAs with linkers 1 and
2. (b) Dose titration of sgRNAs with linkers 1 and 2 in the HEK-293T
TLR1 assay with the Cas9:sgRNA ratio fixed at 1:3. (c) Cas9:sgRNA
ratio titration of sgRNAs with linkers 1 and 2 in the HEK-293T TLR1
assay with the RNP dosage fixed at 2.5 pmol. Data are reported as
mean ± SD of three independent biological replicates.
Initial testing of tetrazine-ligated
sgRNAs with linkers 1 and
2. (a) Illustration of tetrazine-ligated sgRNAs with linkers 1 and
2. (b) Dose titration of sgRNAs with linkers 1 and 2 in the HEK-293T
TLR1 assay with the Cas9:sgRNA ratio fixed at 1:3. (c) Cas9:sgRNA
ratio titration of sgRNAs with linkers 1 and 2 in the HEK-293T TLR1
assay with the RNP dosage fixed at 2.5 pmol. Data are reported as
mean ± SD of three independent biological replicates.To facilitate testing in cells, we designed the ligated sgRNAs
to target a traffic light reporter construct, TLR1.[40] This reporter has a validated guide RNA sequence with good
editing efficiency and allows the use of flow cytometry for easy quantification.
Its nature as a gain-of-function assay facilitates low background
when editing is scored. In this assay, CRISPR-mediated cleavage of
TLR1 is repaired by mutagenic end-joining pathways, which shifts an
out-of-frame mCherry coding region into the correct reading frame
in a subset of edited cells, resulting in mCherry expression. The
percentage of mCherry-positive cells measured by flow cytometry therefore
provides a lower-limit measure of the editing efficiency.To
explore how the tetrazine-ligated sgRNAs affect the editing
efficiency, we assembled TLR1 sgRNA–SpyCas9 RNP complexes and
electroporated the complexes into HEK-293T TLR1 reporter cells. We
compared the functions of linker 1 and linker 2 sgRNAs at various
RNP dosages with a fixed Cas9:sgRNA ratio of 1:3 (Figure b). We also analyzed various
Cas9:sgRNA ratios with a fixed RNP dosage of 2.5 pmol (Figure c). The sgRNA with long linker
2 performed better than that with the short linker 1 under all of
the conditions tested, especially at low RNP dosages. At the higher
RNP dosages, the assay may be near saturation. These results indicate
that tetrazine-ligated sgRNAs support CRISPR-mediated editing and
confirm the importance of the linker structure in the sgRNA activity.We next applied the linker 2 design to sgRNAs that target endogenous
loci in human cells. To benchmark the activity of the tetrazine-ligated
sgRNAs, we compared them to standard synthetic sgRNAs with a GAAA
tetraloop linking the crRNA and tracrRNA. At each end of the sgRNAs
(standard and tetrazine-ligated), the last three nucleotides were
chemically modified with PS linkages and 2′-O-methyl (2′-OMe) groups to protect them from degradation.
Tetrazine-ligated sgRNAs targeting TLR1 and four endogenous loci (CCR5, HEK3, TRAC, and HPRT) were generated, and their identities were confirmed
by HPLC-MS (Table S1 and Figures S9–S13). The ability to generate these tetrazine-ligated sgRNAs indicates
that the tetrazine ligation chemistry is generally compatible with
chemical modifications (PS, 2′-OMe) that are important for
sgRNA stability and activity in cells.We first electroporated
TLR1 RNPs into HEK-293T TLR1 reporter cells
to compare tetrazine-ligated and standard sgRNAs at various RNP dosages
with a fixed Cas9:sgRNA ratio of 1:3 (Figure a) or with various Cas9:sgRNA ratios at a
fixed RNP dosage of 2.5 pmol (Figure b). The TLR1 tetrazine-ligated sgRNA exhibited activity
comparable to that of standard sgRNA at RNP dosages of 10 pmol (27%
vs 30%) and 15 pmol (28% vs 30%), but it was less active than standard
sgRNA at lower RNP dosages (e.g., 17% vs 30% editing at 2.5 pmol)
(Figure a). The reduced
editing activity of tetrazine-ligated TLR1 sgRNA was exacerbated at
lower sgRNA:Cas9 ratios (Figure b).
Figure 3
Comparison of the genome editing efficiencies of tetrazine-ligated
(linker 2) and standard sgRNAs. (a) Titration of tetrazine-ligated
and standard sgRNAs in the HEK-293T TLR1 assay with the Cas9:sgRNA
ratio fixed at 1:3. (b) Cas9:sgRNA ratio titration of tetrazine-ligated
and standard sgRNAs in the HEK-293T TLR1 assay with the RNP dosage
fixed at 2.5 pmol. (c) Dose titration of tetrazine-ligated and standard
sgRNAs for endogenous loci (CCR5, HEK3, TRAC, and HPRT) in HEK-293T cells
with the Cas9:sgRNA ratio fixed at 1:3. (d) Cas9:sgRNA ratio titration
of tetrazine-ligated and standard sgRNAs for endogenous loci (CCR5, HEK3, TRAC, and HPRT) in HEK-293T cells with the RNP dose fixed at 2.5 pmol.
Data are reported as mean ± SD of three independent biological
replicates.
Comparison of the genome editing efficiencies of tetrazine-ligated
(linker 2) and standard sgRNAs. (a) Titration of tetrazine-ligated
and standard sgRNAs in the HEK-293T TLR1 assay with the Cas9:sgRNA
ratio fixed at 1:3. (b) Cas9:sgRNA ratio titration of tetrazine-ligated
and standard sgRNAs in the HEK-293T TLR1 assay with the RNP dosage
fixed at 2.5 pmol. (c) Dose titration of tetrazine-ligated and standard
sgRNAs for endogenous loci (CCR5, HEK3, TRAC, and HPRT) in HEK-293T cells
with the Cas9:sgRNA ratio fixed at 1:3. (d) Cas9:sgRNA ratio titration
of tetrazine-ligated and standard sgRNAs for endogenous loci (CCR5, HEK3, TRAC, and HPRT) in HEK-293T cells with the RNP dose fixed at 2.5 pmol.
Data are reported as mean ± SD of three independent biological
replicates.To compare the editing activities
of tetrazine-ligated and standard
sgRNAs targeting endogenous loci, we sequenced the intended editing
regions and calculated the editing efficiencies using the ICE algorithm.[41] Consistent with the TLR assay, tetrazine-ligated
and standard sgRNAs targeting endogenous loci were similarly active
at high RNP dosages, but at low RNP dosages, the tetrazine-ligated
sgRNAs were less active than the standard sgRNAs (Figure c). The editing efficiency
of tetrazine-ligated sgRNA was further reduced when the Cas9:sgRNA
ratio decreased (Figure d). Overall, the tetrazine-ligated sgRNA with linker 2 consistently
performed well at high RNP dosages but weakly at low RNP dosages.The results of our experiments shown in Figure made it clear that linker 2 is not optimal.
We considered several hypotheses: (i) Linker 2 is still too short
or inflexible to allow optimal conformation of the RNP; (ii) the PEG8
segment in linker 2 may not be optimally configured; or (iii) the
pyridazine-based linkage in linker 2 may have certain unwanted effects
with Cas9 or the rest of the guide. To test and address these possibilities,
we designed several new linkers (Figure a). Linker 3 has the same length as linker
2, but instead of a single PEG8 segment, we incorporated a PEG4 segment
on each side of the loop. Linker 4 includes the PEG8 segment from
linker 2 and both PEG4 segments from linker 3, adding length and flexibility
relative to linker 2. Finally, linkers 5 and 6 are analogues of linkers
1 and 2 in which the stem formed by the crRNA and tracrRNA is extended
by three base pairs, on the basis of the hypothesis that the extended
(rigid) duplex structure might minimize any unwanted effects of the
pyridazine-based linkage with Cas9 that might occur.
Figure 4
Comparison of standard
and different linker-ligated sgRNAs. (a)
Structures of the standard and modified linkers. (b) Comparison of HPRT sgRNA genome editing efficiencies of standard and modified
linkers at RNP dosages of 2.5 pmol (top) and 5 pmol (bottom) in HEK-293T
cells. Data are reported as mean ± SD of three independent biological
replicates.
Comparison of standard
and different linker-ligated sgRNAs. (a)
Structures of the standard and modified linkers. (b) Comparison of HPRT sgRNA genome editing efficiencies of standard and modified
linkers at RNP dosages of 2.5 pmol (top) and 5 pmol (bottom) in HEK-293T
cells. Data are reported as mean ± SD of three independent biological
replicates.To test our new linkers (3–6)
under the most demanding conditions,
we applied them to sgRNAs targeting HPRT, for which
the editing efficiency difference between the tetrazine-ligated (linker
2) sgRNA and standard sgRNA was greatest (Figure c,d). The tetrazine-ligated sgRNAs with linkers
3–6 were generated and confirmed by HPLC-MS (Table S1 and Figures S14–S17) and assembled into RNPs.
We then compared the editing efficiencies of the tetrazine-ligated
(linkers 2–6) and standard GAAA-linked sgRNAs at RNP dosages
of 2.5 and 5 pmol, which showed the largest differences in activity
between the linker 2- and GAAA-linked sgRNA. As shown in Figure b, we observed that
sgRNAs with linkers 3–6 performed better at both RNP dosages
than did the sgRNA with linker 2. Linker 5 sgRNA was the most effective:
at the RNP dosage of 2.5 pmol, the linker 5 sgRNA (68%) was ∼4-fold
more effective than linker 2 sgRNA (17%) and was only 22% less active
than standard sgRNA (87%). Importantly, at the RNP dosage of 5 pmol,
the activity of linker 5 sgRNA (87%) was comparable to that of standard
sgRNA (90%). The linker 6 sgRNA was the next most effective tetrazine-ligated
sgRNA. The improved performance of linkers 5 and 6 suggests that extending
the crRNA:tracrRNA stem structure improves the activity of tetrazine-ligated
sgRNAs.We considered the possibility that the increment of
editing efficiency
from linker 2 to linker 5 might simply be caused by the extension
of the stem itself, unrelated to the linker structure. Previous literature
has shown that stem extension can improve the activity of RNP complexes,
possibly as a result of increases in gRNA stability and gRNA–Cas9
assembly.[2,3,42−45] To better characterize this, we expanded our experiments and analysis
with additional controls. Because linker 2 sgRNA has four more base
pairs than standard tetraloop-linked sgRNA in the upper stem region,
we included a GAAA-linked sgRNA control and a dual-guide (dgRNA) control
with the same four additional base pairs. Similarly, for linker 5
sgRNA, we added the GAAA-linked sgRNA and dgRNA controls with seven
more base pairs in the upper stem region. We compared these seven
guide designs across four loci: TLR1, HPRT, CCR5, and TRAC.Linker 5 sgRNAs provided
higher editing efficiencies than linker
2 sgRNAs at all four loci, though the magnitude of the improvement
varied (Figure ).
For instance, linker 5 sgRNAs were almost as good as GAAA-linked sgRNAs
at the TLR1, HPRT, and CCR5 loci.
Moreover, both linker 2 and linker 5 sgRNAs were more active than
their corresponding dgRNA controls at all four loci. This confirms
the importance of sgRNAs for efficient genome editing. Finally, the
observation that the editing efficiencies of GAAA sgRNA controls with
different stem lengths were similar suggests that the improvement
in going from linker 2 to linker 5 is not simply due to the longer
stem of linker 5.
Figure 5
Comparison of linker 2- and linker 5-ligated sgRNAs with
stem-length-matched
sgRNA and dgRNA controls at TLR1, HPRT, CCR5, and TRAC loci in HEK-293T cells. Data are reported
as mean ± SD of three biological replicates.
Comparison of linker 2- and linker 5-ligated sgRNAs with
stem-length-matched
sgRNA and dgRNA controls at TLR1, HPRT, CCR5, and TRAC loci in HEK-293T cells. Data are reported
as mean ± SD of three biological replicates.It is intriguing that the effect of the linker design on the sgRNA
activity depends on the context of the crRNA:tracrRNA stem length.
With a shorter stem, the long linker 2 sgRNA was more active than
the short linker 1 sgRNA (Figure b-c), but conversely, when the crRNA:tracrRNA stem
was extended by three base pairs, the short linker 5 sgRNA was more
active than the long linker 6 sgRNA (Figure ).One possible explanation for our
results is that pyridazine-based
linkages, particularly in the context of short stem lengths, may impair
optimal interactions with Cas9. We thus performed an electrophoretic
mobility shift assay (EMSA) for the seven TLR1 gRNAs used in Figure . The results (Figure S6) suggested that tetrazine-ligated sgRNAs
had somewhat lower affinities for Cas9 than the corresponding GAAA-linked
sgRNAs. Future studies should address the magnitude of this decrease
more quantitatively and develop guides and linkages that increase
rather than decrease Cas9 affinity.In summary, we have developed
tetrazine-linked sgRNAs that support
efficient editing activity in human cells, rivaling that of standard
sgRNA. Moreover, our tetrazine ligation strategy is easy to implement
and scale because the long, invariant tracrRNA can be made in bulk
quantity, ready to be ligated to the shorter, sequence-variable crRNA
according to the desired target. Therefore, tetrazine ligation provides
a promising route to CRISPR sgRNAs for efficient genome editing. We
expect that this approach will also be useful for the production of
guide RNAs for other Cas nucleases and other editing platforms that
require even longer guide RNAs,[46] thus
expanding the utility of CRISPR genome editing in biomedical research
and therapeutics development.
Authors: Martin Jinek; Krzysztof Chylinski; Ines Fonfara; Michael Hauer; Jennifer A Doudna; Emmanuelle Charpentier Journal: Science Date: 2012-06-28 Impact factor: 47.728
Authors: Hao Yin; Chun-Qing Song; Sneha Suresh; Suet-Yan Kwan; Qiongqiong Wu; Stephen Walsh; Junmei Ding; Roman L Bogorad; Lihua Julie Zhu; Scot A Wolfe; Victor Koteliansky; Wen Xue; Robert Langer; Daniel G Anderson Journal: Nat Chem Biol Date: 2018-01-29 Impact factor: 15.040
Authors: Martin Jinek; Fuguo Jiang; David W Taylor; Samuel H Sternberg; Emine Kaya; Enbo Ma; Carolin Anders; Michael Hauer; Kaihong Zhou; Steven Lin; Matias Kaplan; Anthony T Iavarone; Emmanuelle Charpentier; Eva Nogales; Jennifer A Doudna Journal: Science Date: 2014-02-06 Impact factor: 47.728
Authors: Daniel E Ryan; David Taussig; Israel Steinfeld; Smruti M Phadnis; Benjamin D Lunstad; Madhurima Singh; Xuan Vuong; Kenji D Okochi; Ryan McCaffrey; Magdalena Olesiak; Subhadeep Roy; Chong Wing Yung; Bo Curry; Jeffrey R Sampson; Laurakay Bruhn; Douglas J Dellinger Journal: Nucleic Acids Res Date: 2018-01-25 Impact factor: 16.971
Authors: Fernando Orden Rueda; Michal Bista; Matthew D Newton; Anne U Goeppert; M Emanuela Cuomo; Euan Gordon; Felix Kröner; Jon A Read; Jonathan D Wrigley; David Rueda; Benjamin J M Taylor Journal: Nat Commun Date: 2017-11-20 Impact factor: 14.919