Ho Joung Lee1, Hyun Ju Kim1, Young-Jun Park2, Sang Jun Lee1. 1. Department of Systems Biotechnology and Institute of Microbiomics, Chung-Ang University, Anseong 17546, Republic of Korea. 2. Environmental Disease Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon 34141, Republic of Korea.
Abstract
Mismatch tolerance, a cause of the off-target effect, impedes accurate genome editing with the CRISPR/Cas system. Herein, we observed that oligonucleotide-directed single-base substitutions could be rarely introduced in the microbial genome using CRISPR/Cpf1-mediated negative selection. Because crRNAs have the ability to recognize and discriminate among specific target DNA sequences, we systematically compared the effects of modified crRNAs with 3'-end nucleotide truncations and a single mismatch on the genomic cleavage activity of FnCpf1 inEscherichia coli. Five nucleotides could be maximally truncated at the crRNA 3'-end for the efficient cleavage of the DNA targets of galK and xylB in the cells. However, target cleavage in the genome was inefficient when a single mismatch was simultaneously introduced in the maximally 3'-end-truncated crRNA. Based on these results, we assumed that the maximally truncated crRNA-Cpf1 complex can distinguish between single-base-edited and unedited targets in vivo. Compared to other crRNAs with shorter truncations, maximally 3'-end-truncated crRNAs showed highly efficient single-base substitutions (>80%) in the DNA targets of galK and xylB. Furthermore, the editing efficiency for the 24 bases in both galK and xylB showed success rates of 79 and 50%, respectively. We successfully introduced single-nucleotide indels in galK and xylB with editing efficiencies of 79 and 62%, respectively. Collectively, the maximally truncated crRNA-Cpf1 complex could perform efficient base and nucleotide editing regardless of the target base location or mutation type; this system is a simple and efficient tool for microbial genome editing, including indel correction, at the single-nucleotide resolution.
Mismatch tolerance, a cause of the off-target effect, impedes accurate genome editing with the CRISPR/Cas system. Herein, we observed that oligonucleotide-directed single-base substitutions could be rarely introduced in the microbial genome using CRISPR/Cpf1-mediated negative selection. Because crRNAs have the ability to recognize and discriminate among specific target DNA sequences, we systematically compared the effects of modified crRNAs with 3'-end nucleotide truncations and a single mismatch on the genomic cleavage activity of FnCpf1 inEscherichia coli. Five nucleotides could be maximally truncated at the crRNA 3'-end for the efficient cleavage of the DNA targets of galK and xylB in the cells. However, target cleavage in the genome was inefficient when a single mismatch was simultaneously introduced in the maximally 3'-end-truncated crRNA. Based on these results, we assumed that the maximally truncated crRNA-Cpf1 complex can distinguish between single-base-edited and unedited targets in vivo. Compared to other crRNAs with shorter truncations, maximally 3'-end-truncated crRNAs showed highly efficient single-base substitutions (>80%) in the DNA targets of galK and xylB. Furthermore, the editing efficiency for the 24 bases in both galK and xylB showed success rates of 79 and 50%, respectively. We successfully introduced single-nucleotide indels in galK and xylB with editing efficiencies of 79 and 62%, respectively. Collectively, the maximally truncated crRNA-Cpf1 complex could perform efficient base and nucleotide editing regardless of the target base location or mutation type; this system is a simple and efficient tool for microbial genome editing, including indel correction, at the single-nucleotide resolution.
The
potential of the CRISPR/Cas system as a genome editing tool
was first reported a decade ago.[1] Various
types of Cas nucleases have been discovered and applied in genome
editing.[2] Among them, the CRISPR/Cpf1 system
is a genome editing system functional in bacteria,[3] plants,[4] and mammals,[5] including humans.[6] Because Cpf1 exhibits nonspecific ssDNA trans-cleavage
activity,[7] it is used in a wide range of
fields such as diagnostics.[8,9] For a genome with a
low average GC content, such as the human genome, Cpf1 with the 5′-TTN
protospacer adjacent motif (PAM) helps edit more diverse targets than
Cas9 with 5′-NGG PAM.[10] Cas9 has
been used to make point mutations in many bacterial cells,[11,12] but in some microorganisms, increasing the intracellular concentration
of Cas9 protein inhibits cell growth;[13,14] therefore,
the use of Cpf1 is absolutely necessary in certain cases of microbial
genome editing. However, because Cpf1 and Cas9 exhibit mismatch tolerance[6] and induce the off-target effect,[15] accurate genome editing using Cpf1 has certain
limitations.To avoid the double-stranded cleavage of nontarget
genomic regions,
a base editor that induces chemical modifications of the base without
introducing double-stranded cleavage has been developed.[16,17] With the development of the glycosylase base editor, the introduction
of transversion mutations has become feasible,[18] and the precise editing of only the target base using a
sequence-specific deaminase has also become viable.[19] In addition, all types of mutations can be introduced using
a prime editor that synthesizes a new DNA strand to a desired target
site using an engineered reverse transcriptase.[20] For highly efficient mutagenesis, including indel introduction,
the use of gene scissors such as Cas9 and Cpf1 that can cut double
strands is still needed.Several studies have been performed
to modify crRNA, a molecule
that determines the target specificity of the CRISPR/Cas system to
improve on-target specificity and editing efficiency. The gene knock-out
efficiency was increased by the conjugation of a tRNA-like structure
to the 5′-end of the crRNA.[21] 5′-end
extension in crRNA increased the editing efficiency, such as in gene
knock-out and homology-directed repair,[22] and 3′-end uridylylation increased the indel efficiency.[23] Chemical base modification in crRNA, such as
phosphorothioate backbone substitution in the target recognition sequence
or methylation at the 3′-end carbon 2, improved the gene disruption
activity.[24] When the 5′- and 3′-ends
of crRNA were substituted with DNA, the target cleavage efficiency
was improved and the off-target effect was reduced.[25] To overcome the mismatch tolerance of Cas9, we previously
designed a sgRNA with a mismatch with the target in advance, which
considerably enhanced the microbial single-base editing efficiency.[26] Target-mismatched crRNAs can also be used in
the CRISPR/Cpf1 system, which aids accurate bacterial genome editing.[3]Moreover, reportedly, the truncation of
guide RNA reduces the off-target
effect of Cas9.[27] It was recently reported
that 2-nucleotide (nt) truncations at the 5′-end of a single
molecular guide RNA (sgRNA) enhanced Cas9-mediated single-base substitution
in the microbial genome.[28] As in the case
of Cas9, the 3′-end-truncated crRNA-Cpf1 complex can cause
double-strand breaks in the target DNA even when 4–6 nt are
truncated at the 3′-end of the canonical 23-nt spacer of the
CRISPR/Cpf1 crRNA.[29,30] In this study, we systematically
investigated the relationship between single-base mismatch tolerance
and 3′-end nucleotide truncation in the CRISPR/Cpf1 crRNA.
We report a simple, accurate, and efficient method for single-nucleotide
editing, including indel, using maximally 3′-end-truncated
crRNAs in the CRISPR/FnCpf1 system, and also discuss the shorter target
recognition sequences of crRNAs for better target specificity of the
CRISPR/FnCpf1 system.
Results
Failure of Single-Base
Genomic Editing by crRNA-Cpf1 Negative
Selection
We electroporated mutagenic oligonucleotides and
crRNA plasmids into both λ Bet protein- and Cpf1-overexpressing Escherichia coli MG1655 cells. Intracellular λ
Bet proteins in pHK463 are necessary for the stabilization of mutagenic
oligonucleotides.[26,31] When the target sequence in galK is properly altered by mutagenic oligonucleotides (via
the introduction of stop codons), the crRNA-Cpf1 complex cannot digest
the target genomic DNA, and the surviving cells form white colonies
in D-galactose-containing MacConkey agar. When the target was unedited,
most cells could not survive owing to double-strand cleavages at the
genomic DNA targets, that is, negative selection (Figure A). We designed three different
nucleotides to introduce stop codons in the middle of the galK gene sequence. Edited targets that did not match the
crRNA sequence were designed not to be recognized and cleaved by the
crRNA-Cpf1 complex (Figure S1).
Figure 1
Comparison
of single-base and multi-base genome editing using the
CRISPR/Cpf1 system. (A) Negative selection of cells with genome editing
using the crRNA-Cpf1 complex. The crRNA-Cpf1 complex can cleave the
unedited DNA target, thereby leaving the edited DNA target uncleaved.
The generation of stop codons in the galK gene in
the cells with genome editing led to the formation of white colonies
on D-galactose-containing MacConkey agar because of the premature
termination of GalK protein translation. (B) Editing efficiencies
in CRISPR/Cpf1-mediated one to three base(s) substitutions in galK. The genome editing efficiency was assessed as the
ratio of the number of white colonies to the total number of colonies
on MacConkey D-galactose agar. The parentheses indicate the number
of colonies containing cells with correct editing among white colonies
from which the cells were subjected to Sanger sequencing. (C) The
target mismatch effect of CRISPR/Cpf1. crRNA plasmids carrying perfectly
matched crRNAs or crRNAs with one to four bases mismatched were transformed
into Cpf1-overexpressing cells. The number of transformants on LB
agar containing spectinomycin indicates the genome-target-cleaving
activity of the crRNA-Cpf1 complex.
Comparison
of single-base and multi-base genome editing using the
CRISPR/Cpf1 system. (A) Negative selection of cells with genome editing
using the crRNA-Cpf1 complex. The crRNA-Cpf1 complex can cleave the
unedited DNA target, thereby leaving the edited DNA target uncleaved.
The generation of stop codons in the galK gene in
the cells with genome editing led to the formation of white colonies
on D-galactose-containing MacConkey agar because of the premature
termination of GalK protein translation. (B) Editing efficiencies
in CRISPR/Cpf1-mediated one to three base(s) substitutions in galK. The genome editing efficiency was assessed as the
ratio of the number of white colonies to the total number of colonies
on MacConkey D-galactose agar. The parentheses indicate the number
of colonies containing cells with correct editing among white colonies
from which the cells were subjected to Sanger sequencing. (C) The
target mismatch effect of CRISPR/Cpf1. crRNA plasmids carrying perfectly
matched crRNAs or crRNAs with one to four bases mismatched were transformed
into Cpf1-overexpressing cells. The number of transformants on LB
agar containing spectinomycin indicates the genome-target-cleaving
activity of the crRNA-Cpf1 complex.In the absence of oligonucleotides or crRNA plasmids during electroporation,
the white colonies did not form on D-galactose-containing MacConkey
agar (Figure B). When
oligonucleotides were electroporated with a crRNA-deleted plasmid
(pHL259), white colonies (Gal–) could not be obtained
on the MacConkey agar containing D-galactose and spectinomycin, and
the number of surviving transformants was found to increase (≈107/μg of DNA) (Figure S2).
These results indicated that crRNA-Cpf1 or mutagenic oligonucleotides
cannot edit the galK target independently.When we electroporated both mutagenic oligonucleotides and crRNA
plasmids, white colonies with one or two bases edited were rarely
obtained (5–7%; Figure B). Conversely, when three bases were edited, the proportion
of white colonies among the total number of colonies increased to
up to 67%. Moreover, white colonies were randomly selected, and the
nucleotide sequences of the edited galK targets were
amplified and analyzed using Sanger sequencing (Figure S3). In case of single-base editing (504T → A), only one among 10 sequences was correctly altered.
In cases of double and triple base editing, cells from five among
five colonies showed correct sequence editing. These data indicate
that the actual accuracy of single-base editing was very low (1/10),
even among the phenotypically selected 5% cells (1/10 in 5% = 0.5%).We designed crRNAs carrying one to four base mismatch(es) against
the galK DNA target and transformed the crRNA plasmids
into MG1655 cells overexpressing Cpf1 nucleases (Figure C). The number of viable transformants
indicates how each modified crRNA-Cpf1 complex can efficiently recognize
and cleave the DNA target. When we used crRNAs with one or two bases
mismatched, the number of transformants obtained (less than 104) was approximately equal to the number of transformants obtained
with perfectly matched crRNAs. However, when we used crRNAs with three
or four bases mismatched, the number of surviving transformants increased
remarkably. These data indicate that one or two mismatch(es) tolerate
recognition and are cleaved as a target, whereas more than two mismatches
are not cleaved in the cells. Therefore, we demonstrated that the
editing of single or double bases was not successful owing to mismatch
tolerance.
Intolerance of Single-Base Mismatch between
the DNA Target and
Maximally 3′-End-Truncated crRNA
Reportedly, truncated
sgRNAs reduce the off-target effect of Cas9,[27] and the truncated crRNA-Cpf1 complex can cleave DNA targets.[29,30] Therefore, we investigated how accurately a truncated crRNA could
help Cpf1 recognize and cleave DNA targets in vivo. We constructed
various plasmids carrying crRNAs with 1 to 6 nt truncated that recognized
20 to 15 nt in the galK (497–517) gene as
a target. After the transformation of crRNA plasmids, the number of
viable transformants (less than 104) containing crRNAs
with 1 to 5 nt truncated was approximately equal to the number of
transformants containing perfectly matched crRNAs. This implies that
even crRNAs with 1 to 5 nt truncated helped retain the in vivo target
cleavage activity of Cpf1 nuclease. However, the genomic DNA cleavage
activity of the Cpf1 nuclease was not retained when crRNA with 6 nt
truncated was used. These data showed that the truncation of 5 nt
is the upper limit for FnCpf1 nuclease activity retention in the cells
(Figure A).
Figure 2
Effect of single-base
mismatch in 3′-serially truncated
crRNAs on Cpf1-mediated genomic DNA cleavage. (A) Genomic DNA cleavage
efficiency of Cpf1 with galK target-mismatched and
3′-truncated crRNAs. The number of surviving colonies indicates
the reversal of cleavage on the galK target by crRNA-Cpf1
complex. The large number (>106) of surviving cells
represents
the failure of target recognition and the cleavage of modified crRNAs
by Cpf1. Each bar represents the mean value obtained from three independent
experiments. (B) Genomic DNA cleavage efficiency of Cpf1 with xylB target-mismatched and 3′-truncated crRNAs. (C)
Single-base mismatch intolerance of Cpf1 with 3′-5-nt-truncated
crRNAs. Either a single-base mismatch or 5-nt truncation of crRNA
against the target DNA is allowed for cleavage by the crRNA-Cpf1 complex.
Effect of single-base
mismatch in 3′-serially truncated
crRNAs on Cpf1-mediated genomic DNA cleavage. (A) Genomic DNA cleavage
efficiency of Cpf1 with galK target-mismatched and
3′-truncated crRNAs. The number of surviving colonies indicates
the reversal of cleavage on the galK target by crRNA-Cpf1
complex. The large number (>106) of surviving cells
represents
the failure of target recognition and the cleavage of modified crRNAs
by Cpf1. Each bar represents the mean value obtained from three independent
experiments. (B) Genomic DNA cleavage efficiency of Cpf1 with xylB target-mismatched and 3′-truncated crRNAs. (C)
Single-base mismatch intolerance of Cpf1 with 3′-5-nt-truncated
crRNAs. Either a single-base mismatch or 5-nt truncation of crRNA
against the target DNA is allowed for cleavage by the crRNA-Cpf1 complex.Next, we attempted to determine the number of base
truncations
that led to the loss of single-base mismatch tolerance. 3′-end
truncations of up to 4 nt can ensure tolerance of single-base mismatch
in the genomic DNA target cleavage. However, when single-base mismatch
and 5-nt truncation occurred simultaneously in the crRNA, the number
of surviving transformants increased to up to 106/μg
of DNA. These data indicate that a single-base mismatch was not tolerated
in the recognition and cleavage of the galK target
DNA when 5-nt-truncated crRNA was combined with Cpf1 nuclease in the
cells.In case of the xylB target, as in the
case of galK, we observed the same limit of 5-nt
truncation of crRNA
for functional Cpf1 nuclease activity. However, serially 3′-end-truncated
crRNAs with single-base mismatches progressively increased the number
of viable transformed cells (Figure B). Of note, a single-base mismatch is almost never
allowed in 4- or 5-nt-truncated crRNAs. These data showed that 3′-end
truncation of the crRNA affected the in vivo cleavage activity of
Cpf1, and a single-base mismatch was incompatible with substantial
truncation at the 3′-end of the crRNA for the target cleavage
by Cpf1 (i.e., mismatch intolerance; Figure C).
Single-Base Genome Editing with Maximally
3′-End-Truncated
crRNAs
Based on the results of single-base mismatch intolerance
of 3′-5-nt-truncated crRNAs (Figure C), it was hypothesized that a single-base-edited
target could not be cleaved by 3′-5-nt-truncated crRNAs, whereas
an unedited target could be cleaved for efficient negative selection
(Figure A). We tested
various lengths of truncated crRNAs for single-base genome editing.
We electroporated single-mutagenic oligonucleotides (510C → A at galK) with crRNA plasmids into both
Cpf1- and Bet-overexpressing cells and spread the transformants on
MacConkey agar containing D-galactose and spectinomycin. The proportions
of white colonies obtained using 1- to 4-nt-truncated crRNAs (3–12%)
and using untruncated crRNAs (5%) were similar (Figure B). When 3′-5-nt-truncated crRNAs
were used, 88% of the transformants formed white colonies on the MacConkey
(D-galactose) agar. We randomly selected the white colonies and analyzed
the nucleotide sequences using Sanger sequencing (Figure S4). However, in the case of 3′-6-nt-truncated
crRNAs, the number of viable cells increased to up to 107/μg of DNA, and no white colonies were observed. This implies
that Cpf1 was not functional with 3′-6-nt-truncated crRNAs.
Figure 3
Efficiency
of Cpf1-mediated single-base substitutions along the
3′-nt truncation lengths of crRNA. (A) Proposed method for
the negative selection of the single-base-edited target facilitated
by the mismatch intolerance of the truncated crRNA-Cpf1 complex. The
unedited target could be cleaved by the 3′-5-nttruncated crRNA-Cpf1
complex (left). The single-base-edited target could not be recognized
or cleaved by the truncated crRNA-Cpf1 complex (right). (B) and (C)
Optimization of the 3′-end truncation length of crRNAs for
Cpf1-mediated single-base alteration in the galK and xylB gene sequences, respectively. We tested six different
3′-end-truncated crRNAs for negative selection of single-base
editing. Δ0 to Δ6 indicate the number of truncated nucleotides
at the 3′-end of the crRNAs. Each bar represents the mean from
three independent experiments.
Efficiency
of Cpf1-mediated single-base substitutions along the
3′-nt truncation lengths of crRNA. (A) Proposed method for
the negative selection of the single-base-edited target facilitated
by the mismatch intolerance of the truncated crRNA-Cpf1 complex. The
unedited target could be cleaved by the 3′-5-nttruncated crRNA-Cpf1
complex (left). The single-base-edited target could not be recognized
or cleaved by the truncated crRNA-Cpf1 complex (right). (B) and (C)
Optimization of the 3′-end truncation length of crRNAs for
Cpf1-mediated single-base alteration in the galK and xylB gene sequences, respectively. We tested six different
3′-end-truncated crRNAs for negative selection of single-base
editing. Δ0 to Δ6 indicate the number of truncated nucleotides
at the 3′-end of the crRNAs. Each bar represents the mean from
three independent experiments.Another single-base alteration at a different genomic target was
assessed using the same approach. We electroporated single-mutagenic
oligonucleotides (643G → T at xylB) with crRNA plasmids and spread the transformants on MacConkey agar
containing D-xylose and spectinomycin. The proportion of white colonies
gradually increased as the number of truncated bases increased (from
Δ0 to Δ5; Figure C). We obtained the highest proportion (87%) of white colonies
on the MacConkey (D-xylose) agar using 3′-5-nt-truncated crRNAs.
However, when we used 3′-6-nt-truncated crRNAs, the number
of viable cells increased to up to 107/μg of DNA,
and we did not observe white colonies. These data indicated that Cpf1-mediated
single-base genome editing could be performed efficiently using maximally
3′-end-truncated crRNAs.
Various Single-Base Editing
Events in the Target Range Using
Maximally 3′-End-Truncated crRNAs
We investigated
whether 3′-end-truncated crRNAs could be used for the single-base
editing of four types of nucleotides at various positions in the target
(Figure ). In the galK (497–512) target region, we selected two of
each of G, A, T, and C (eight targets in all) as targets. Different
oligonucleotides mutagenic at each of the eight positions were synthesized
and electroporated with the untruncated crRNA plasmid (pHK461) and
3′-5-nt crRNA plasmid (pHL190), respectively. The transformed
cells were spread on Luria–Bertani (LB) agar supplemented
with spectinomycin instead of MacConkey agar, and four colonies were
randomly selected in each case. Subsequently, the edited nucleotide
sequences were verified using Sanger sequencing (Figure S5). The success of base editing was indicated by at
least one correct base editing among cells from the four selected
colonies. Only one correct base editing was successful among 24 cases
when untruncated crRNA (from pHK461) was used (Figure A). However, base editing was successful
in 19 out of 24 cases (79.1%) when the 3′-5-nt crRNA (pHL190)
was used. At least one base editing was successful in all eight bases,
and editing of all three different bases was successful for five bases
(G1, A1, T1, A2, and T2).
Figure 4
Single-base editing at various positions in the target DNA sequence
achieved using the maximally 3′-end-truncated crRNA-Cpf1 complex.
(A) Comparison of the single-base editing efficiency of the galK target using untruncated (left) and 3′-5-nt-truncated
(right) crRNAs. The colored grids indicate successful single-base
editing in at least one of the four cells selected randomly from the
colonies on LB agar containing spectinomycin. The boxes with the X
symbol were not applicable for base editing because they represent
cases of base alterations to the same base. (B) Comparison of single-base
editing efficiency in the xylB target using untruncated
(left) and 3′-5-nt-truncated (right) crRNAs. (C) Comparison
of the efficiencies of successful single-base editing using untruncated
and 3′-5-nt-truncated crRNAs. Two types of base editing—transitions
(Ts) and transversions (Tv)—were separately analyzed in the
DNA target.
Single-base editing at various positions in the target DNA sequence
achieved using the maximally 3′-end-truncated crRNA-Cpf1 complex.
(A) Comparison of the single-base editing efficiency of the galK target using untruncated (left) and 3′-5-nt-truncated
(right) crRNAs. The colored grids indicate successful single-base
editing in at least one of the four cells selected randomly from the
colonies on LB agar containing spectinomycin. The boxes with the X
symbol were not applicable for base editing because they represent
cases of base alterations to the same base. (B) Comparison of single-base
editing efficiency in the xylB target using untruncated
(left) and 3′-5-nt-truncated (right) crRNAs. (C) Comparison
of the efficiencies of successful single-base editing using untruncated
and 3′-5-nt-truncated crRNAs. Two types of base editing—transitions
(Ts) and transversions (Tv)—were separately analyzed in the
DNA target.In the xylB (637–652)
target region, editing
was successful in 12 out of 24 cases (50%) using 3′-5-nt-truncated
crRNA (pHL219) (Figure B), the efficiency of which was slightly lower than that in the galK gene (79.1%). Based on the Sanger sequencing results
(Figure S6), we observed at least one successful
base editing at all sites, except at the A2 position of
the xylB gene. However, only one base editing out
of the 24 cases was successful when the untruncated crRNA (from pHL210)
was used. Therefore, for single-base editing, the use of maximally
3′-end-truncated crRNAs was more efficient than the use of
untruncated crRNAs for both galK and xylB genes.The success rate of base editing was analyzed for the
mutation
types (i.e., transition and transversion; Figure C). The success rate of transversion (pyrimidine
to purine or vice versa) was slightly higher than that of transition
(purine to purine or pyrimidine to pyrimidine). These data indicate
that the truncated crRNA-Cpf1 complex can be used for both transitions
and transversions in the DNA targets, regardless of the position of
the target nucleotides.
Insertion and Deletion of Single Nucleotides
by the 3′-End-Truncated
crRNAs
For further investigating the 3′-end-truncated
crRNA method, we evaluated how efficiently it can be applied to indels
of single nucleotides during genome editing. Mutagenic oligonucleotides
are designed to generate stop codons that disrupt the galK and xylB genes when the mutations are properly
introduced. We tested an intact crRNA (Δ0) and three different
truncated crRNAs (Δ4, Δ5, and Δ6) (Figure ). For the 4-nt-truncated crRNA
(Δ4), the proportion of white colonies formed upon the introduction
of single indels was slightly higher than that achieved using the
untruncated crRNA (Δ0). When the 5-nt-truncated crRNA (Δ5)
was used, the proportions of white colonies formed upon the insertion
(510G) and deletion (509ΔG) in the galK target
increased markedly to 79 and 76%, respectively (Figure A). Similarly, when the 5-nt-truncated crRNA
(Δ5) was used, the proportions of white colonies formed upon
the insertion (643A) and deletion (643ΔG) in the xylB target increased considerably to 62 and 58%, respectively (Figure B). The single-nucleotide
indel was verified using Sanger sequencing (Figure S7). When the 6-nt-truncated crRNA (Δ6) was used, we
did not observe white colonies, and the number of transformants increased
drastically (∼107/μg of DNA), as observed
in the abovementioned base substitution experiments.
Figure 5
Cpf1-mediated insertion
and deletion of nucleotides using 3′-end-truncated
crRNAs. The efficiencies of insertion and deletion of single nucleotides
in the galK (A) and xylB (B) genes
were tested using 3′-end-truncated crRNAs. The editing efficiency
was assessed by counting the white colonies on MacConkey agar containing
D-galactose or D-xylose. The number of surviving colonies per unit
crRNA plasmid added represents the genomic DNA breakage activity of
the crRNA-Cpf1 complex. Δ0, Δ4, Δ5, and Δ6
represent the number of 3′-end-truncated nucleotides in each
crRNA. Each bar represents the mean from three independent experiments.
Cpf1-mediated insertion
and deletion of nucleotides using 3′-end-truncated
crRNAs. The efficiencies of insertion and deletion of single nucleotides
in the galK (A) and xylB (B) genes
were tested using 3′-end-truncated crRNAs. The editing efficiency
was assessed by counting the white colonies on MacConkey agar containing
D-galactose or D-xylose. The number of surviving colonies per unit
crRNA plasmid added represents the genomic DNA breakage activity of
the crRNA-Cpf1 complex. Δ0, Δ4, Δ5, and Δ6
represent the number of 3′-end-truncated nucleotides in each
crRNA. Each bar represents the mean from three independent experiments.In addition, we also tested the insertion and deletion
of 2 nt
in galK and xylB using 5-nt-truncated
crRNAs. The efficiencies of the insertion (510AG) and deletion (509ΔGC)
of 2 nt in the galK gene were observed to be 54,
and 64%, respectively (Figure S8). In the
case of xylB, the efficiencies of the insertion (643AG)
and deletion (643ΔGA) of the 2 nt were observed to be 37 and
43%, respectively. These data indicate that the maximally 3′-end-truncated
crRNA method also yields efficient results for the indel of single
or double nucleotide(s) in the genomic DNA target.
Discussion
The off-target effect, in which a target similar to the target
of CRISPR/Cas is affected, is a major roadblock in accurate genome
editing and has been widely reported in higher organisms. Mismatch
tolerance is considered to be one of the causes of off-target effects.[32] As a defense system of the bacterial host, the
CRISPR/Cas system is thought to effectively cleave foreign genes even
when there is a minor change (∼1 or 2 nt) in the recognition
sequence caused by natural mutation. The mismatch tolerance of CRISPR/Cas
may not pose a problem in bacterial defense because the size of the
bacterial genome is considerably smaller than the number of possible
permutations, with repetitions of approximately 20 nt (∼420) in the target recognition sequence.[26] Mismatch tolerance of the CRISPR/Cpf1 system for one or
two base(s) was also observed in our study (Figure C).A Cpf1-mediated negative selection
system was constructed, in which
microbial cells with nonmutated targets were removed, and only cells
with mutated targets survived (Figure A). When one or two bases in the galK gene were edited, the formation of white colonies (the phenotype
of cells with successful editing) on MacConkey agar containing D-galactose,
a result of negative selection by Cpf1, was barely observed (Figure B). The results of
the nucleotide sequence analysis of the target of cells from the white
colonies was noteworthy. We observed various point mutations around
the single-base-edited targets from the white colonies. However, in
case of double-base editing, all white colony cells showed proper
editing (Figure S3). This presumably resulted
from the difference in the degree of mismatch tolerance for single-base-edited
targets rather than from the impurity of synthesized oligonucleotides.
It is speculated that additional mutations may be introduced into
the edited target owing to errors in DNA repair or religation during
repetitive double-strand break introduction in single-base-edited
targets by CRISPR/Cpf1, which allows mismatches.When only the
crRNA plasmid was introduced into cells without using
mutagenic oligonucleotides, it was anticipated that the cells transformed
by crRNA plasmids would be killed by double-strand breakage caused
by the crRNA-Cpf1 complex in the genome. However, the transformed
cells consistently formed 103–104 colonies/μg
of DNA on the agar (Figure B). When a crRNA-deleted plasmid was used for transformation,
the number of transformants was approximately 106–107/μg of DNA. Therefore, it appeared that negative selection
by CRISPR/Cpf1 was unsuccessful in 1 cell per 103–104 cells. This was probably owing to subpopulation heterogeneity,
in which case, Cpf1 would not work, or owing to occasional religation
following double-strand breakage.For assessing the effect of
crRNA truncation on-target recognition
and cleavage, we introduced single-base mismatches against the target
into 3′-end-truncated crRNAs and compared the number of surviving
transformants (Figure ). It was found that even if 5 nt were truncated at the 3′-end,
the DNA targets (galK and xylB)
could be cleaved efficiently in vivo. However, 6-nt-truncated crRNAs
(Δ6) could not recognize the targets (Figure A,B). Perhaps, because of the insufficient
number of DNA-crRNA base pairings, the active nuclease form of the
DNA-crRNA-Cpf1 ternary complex was not properly formed. In case of
maximally 5-nt-truncated functional crRNAs (Δ5), a single-base
mismatch could be clearly distinguished. As a crRNA with both single-base
mismatch and maximally tolerated truncation (Δ5) cannot recognize
the target, it can be considered that maximally 3′-end-truncated
crRNAs exhibit mismatch intolerance (Figure C).Therefore, it was expected that
the maximally 3′-end-truncated
crRNA could distinguish between the single-base-edited and unedited
targets in the genome (Figure A). Single-base editing was performed using various lengths
of 3′-end-truncated crRNAs. It was shown that the galK target with C510A single-base editing was successfully negatively
selected with a high efficiency when the 5-nt-truncated crRNA was
used (Figure B). In
case of xylB, as the number of truncated nucleotides
increased, the proportion of edited cells increased (Figure C). These results showed that
the in vivo DNA cleavage activity of Cpf1 is affected by the length
of the target recognition sequence of the crRNA, and maximally truncated
crRNAs facilitate highly efficient single-nucleotide editing.Single-base editing at various positions in the target DNA sequence
was attempted in galK and xylB genes
using the maximally 3′-end-truncated crRNA-Cpf1 complex (Figure A,B). The success
rates of 24 single-base editing events in the galK and xylB genes were 79.1 and 50%, respectively.
As observed in the case of using target-mismatched sgRNA in Cas9,[26] transition was slightly less efficient than
transversion in this study (Figure C). This was probably because the pairing of a pyrimidine
to a pyrimidine and a purine to a purine between the target DNA and
crRNA by transversion is thought to exert a more profound effect on-target
recognition, which makes it easier to distinguish between edited and
unedited sequences. Although the target-mismatched crRNA method requires
a new crRNA construct for the editing of each base by Cpf1,[3] the maximally 3′-end-truncated crRNA method
can be applied for editing at multiple positions of the target using
a single construct of maximally 3′-end-truncated crRNA.The efficiency of nucleotide indel mutation showed an overall trend
similar to that of base substitution (Figure ). When a 4-nt-truncated crRNA (Δ4)
was used, the white colony ratio increased considerably to 76% in
single-base substitution in the xylB643G target (Figure C), whereas the percentage of white colonies was as low as 17–22%
in a single-nucleotide indel in xylB643G (Figure B). In
addition, the efficiency of the double nucleotide indel was lower
than that of single-nucleotide indel. These results might be attributed
to the differences in the action of the DNA repair systems on the
mismatched bases and DNA bulge[33] during
base substitution and indel mutagenesis, respectively.In case
of xylB, as the length of the target recognition
sequence of 3′-end-truncated crRNAs reduced, the in vivo target
cleavage activity progressively decreased, but the target specificity
clearly increased (Figure B). The result was similar to that in vitro target cleavage
activity was decreased gradually by the increase of 3′-end
truncations.[34] It is assumed that the 3′-end
of untruncated crRNAs is necessary for mismatch tolerance to improve
the target recognition flexibility or Cpf1 activity for the cleavage
of DNA targets modified by natural mutation (containing one or two
base mutations, and 1 or 2 nt indels). The shortened target recognition
sequence of crRNAs increases the mathematical probability of encountering
the same sequence in the genome, which does not appear to be advantageous
for increasing target specificity in the editing of the genomes of
higher organisms. Because the number of cases with 16 nt target recognition
sequences in 5-nt-truncated crRNA is 416 (∼4 ×
109), the maximally 3′-end-truncated crRNA method
can be applied to genome editing in higher organisms with large genome
sizes as well as to microorganisms. As maximally 3′-end-truncated
crRNAs can strictly recognize the target and effectively discriminate
among single-base changes in the target, it can serve as a simple
and efficient CRISPR/Cas-based method for accurate genome editing
at a single-base/-nucleotide resolution, such as in indel correction.
Methods
Strains
and Culture Conditions
The E.
coli strains used in this study are listed in Table S1; we cultured the strains in LB broth
at 30 or 37 °C, depending on the ori sequences
in the plasmids. E. coli DH5α
and MG1655 were used as cloning hosts and for genomic integration
of the cpf1-KmR cassette, respectively. We cultured E. coli MG1655 cells in LB broth at 30 °C until
the optical density at 600 nm (OD600) approached 0.4. Subsequently,
electrocompetent cells were harvested, washed twice, resuspended in
10% glycerol solution, aliquoted, and stored at −80 °C.
The construction of an E. coli strain
carrying the cpf1 gene in its chromosome is described
below. As required, ampicillin, kanamycin, or spectinomycin at 50,
25, or 75 μg mL–1, respectively, was added
to the culture medium. To overexpress Cpf1 in the chromosome or λ
Bet in pHK463, L-arabinose (final concentration: 1 mM) was added to
the culture flasks. After the OD600 reached 0.4, the cells
were further cultured for 3 h.
Genomic Integration of
the cpf1 Gene
The primers used for the DNA
amplification of the strains are listed
in Table S2. The cpf1 gene
was PCR-amplified using the pJYS1Ptac plasmid (provided generously
by Sheng Yang; Addgene plasmid # 85545) as a template and was fused
with a kanamycin resistance marker using overlap PCR to generate a cpf1-KmR cassette. The cpf1-KmR cassette
was amplified using primer pairs harboring homologous DNA sequences
for recombineering in the arabinose operon. Subsequently, the purified cpf1-KmR PCR products were electroporated into E. coli MG1655 cells carrying the pKD46 plasmid after
λ recombinases were fully induced by treatment with L-arabinose.
After the pKD46 plasmid was cured at 42 °C, the strain was designated
as E. coli HK1061, in which the cpf1 gene was located downstream of the L-arabinose-inducible
P promoter in the chromosome.
Plasmid
Construction
The crRNA plasmids used are listed
in Table S1, and the primers used are listed
in Table S2. All crRNA plasmids were constructed
as follows: the galK (497–517) and xylB (637–657) sequences in the E.
coli genome were chosen as DNA target sequences recognized
by crRNAs for Cpf1-mediated genome editing. Two overlapping DNA fragments
containing the spectinomycin resistance gene and the crRNA gene were
amplified using pJYS2_crtYf (generously provided by Sheng Yang; Addgene
plasmid # 85544) as a template. Two fragments were assembled using
Gibson Assembly Master Mix (NEB, Ipswich, MA, USA) to generate pHK461.
All other crRNA plasmids and a crRNA-deleted plasmid were amplified
using pHK461 as the template and subjected to Gibson assembly.
In Vivo
Target Cleavage Assay Using Modified crRNAs
Various crRNA
plasmids that recognized the galK or xylB gene sequences as the DNA target were constructed.
Each crRNA was modified to introduce 1 to 4 bp mismatches and 1 to
6 nt truncations at the 3′-end, or 1-bp mismatch and 1 to 6
nt truncations at the 3′-end. The modified crRNA plasmids (200
ng) were electroporated into Cpf1-overexpressing HK1061 cells. Electroporation
was performed under the same conditions described above. The number
of surviving colonies was determined to evaluate target recognition
and cleavage by the modified crRNA-Cpf1 complex.
Oligonucleotide-Directed
Genome Editing
The mutagenic
oligonucleotides used for genome editing are listed in Table S3. Mutagenic oligonucleotides (100 pmol)
and crRNA plasmids (200 ng) were electroporated simultaneously into
the HK1061 cells carrying the pHK463 plasmid, in which both the λ
Bet protein and Cpf1 were overexpressed in response to the addition
of L-arabinose. Electroporation was performed at 25 μF, 200
Ω, and 1.8 kV, and a 0.1-cm electroporation cuvette was used.
Afterward, the cells were transferred to 950 mL of SOC immediately
and incubated for 1 h at 37 °C and 180 rpm for recovery. Thereafter,
the recovered cells were spread on MacConkey agar containing D-galactose
or D-xylose (0.5%) and spectinomycin (75 μg mL–1) and incubated for 16 h at 37 °C. The numbers of surviving
colonies and white colonies were counted for assessing the genome
editing efficiencies. The white colonies were randomly selected for
Sanger sequencing of edited genomic DNA targets.
Base Substitution
and Indel Editing
For single-base
substitution experiments in galK and xylB targets, mutagenic oligonucleotides generating stop codons and each
of the two types of crRNA plasmids (i.e., untruncated and 5-nt-truncated
crRNA plasmids) were electroporated into the HK1061 cells carrying
the pHK463 plasmid, as mentioned above. The recovered cells were spread
on LB agar plates containing spectinomycin (75 μg mL–1). After incubation for 16 h at 37 °C, four colonies (per electroporation
experiment) were randomly selected, and Sanger sequencing was carried
out to verify the desired single-base editing in the galK and xylB targets. The primers used for PCR amplification
and Sanger sequencing are listed in Table S2.The indel mutagenic oligonucleotides could also generate
stop codons that caused the premature termination of GalK or XylB
protein translation. Electroporation was performed under the same
conditions described above. The recovered cells were spread on MacConkey
agar containing D-galactose or D-xylose (0.5%) and spectinomycin (75
μg mL–1). The cells obtained from four white
colonies from the MacConkey agar plate were subjected to Sanger sequencing
for the confirmation of accurate single-base editing. The indel mutation
efficiencies were calculated based on the ratio of the number of white
colonies to the total number of colonies on the MacConkey agar plates.
Authors: Jonathan S Gootenberg; Omar O Abudayyeh; Max J Kellner; Julia Joung; James J Collins; Feng Zhang Journal: Science Date: 2018-02-15 Impact factor: 47.728
Authors: Andrew V Anzalone; Peyton B Randolph; Jessie R Davis; Alexander A Sousa; Luke W Koblan; Jonathan M Levy; Peter J Chen; Christopher Wilson; Gregory A Newby; Aditya Raguram; David R Liu Journal: Nature Date: 2019-10-21 Impact factor: 69.504