Vinod Ranganathan1, Karl Wahlin2, Julien Maruotti2, Donald J Zack3. 1. Department of Ophthalmology, Wilmer Eye Institute, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21287, USA. 2. 1] Department of Ophthalmology, Wilmer Eye Institute, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21287, USA [2]. 3. 1] Department of Ophthalmology, Wilmer Eye Institute, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21287, USA [2] Department of Molecular Biology and Genetics, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA [3] Solomon H. Snyder Department of Neuroscience, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA [4] Institute of Genetic Medicine, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA [5] Department of Genetics, Institut de la Vision, 17 rue Moreau, 75012 Paris, France.
Abstract
The repurposed CRISPR-Cas9 system has recently emerged as a revolutionary genome-editing tool. Here we report a modification in the expression of the guide RNA (gRNA) required for targeting that greatly expands the targetable genome. gRNA expression through the commonly used U6 promoter requires a guanosine nucleotide to initiate transcription, thus constraining genomic-targeting sites to GN19NGG. We demonstrate the ability to modify endogenous genes using H1 promoter-expressed gRNAs, which can be used to target both AN19NGG and GN19NGG genomic sites. AN19NGG sites occur ~15% more frequently than GN19NGG sites in the human genome and the increase in targeting space is also enriched at human genes and disease loci. Together, our results enhance the versatility of the CRISPR technology by more than doubling the number of targetable sites within the human genome and other eukaryotic species.
The repurposed CRISPR-Cas9 system has recently emerged as a revolutionary genome-editing tool. Here we report a modification in the expression of the guide RNA (gRNA) required for targeting that greatly expands the targetable genome. gRNA expression through the commonly used U6 promoter requires a guanosine nucleotide to initiate transcription, thus constraining genomic-targeting sites to GN19NGG. We demonstrate the ability to modify endogenous genes using H1 promoter-expressed gRNAs, which can be used to target both AN19NGG and GN19NGG genomic sites. AN19NGG sites occur ~15% more frequently than GN19NGG sites in the human genome and the increase in targeting space is also enriched at human genes and disease loci. Together, our results enhance the versatility of the CRISPR technology by more than doubling the number of targetable sites within the human genome and other eukaryotic species.
Genome-editing technologies such as zinc fingers nucleases (ZFN) [1-4]
and transcription activator–like effectors nucleases (TALEN) [4-10]
have empowered the ability to generate targeted genome modifications and offer the potential
to correct disease mutations with precision. While effective, these technologies are
encumbered by practical limitations as both ZFN and TALEN pairs require synthesizing large and
unique recognition proteins for a given DNA target site. Several groups have recently reported
high-efficiency genome editing through the use of an engineered type II CRISPR-Cas9 system
that circumvents these key limitations[11-15]. Unlike ZFNs and TALENs
which are relatively time consuming and arduous to make, the CRISPR constructs, which rely
upon the nuclease activity of the Cas9 protein coupled with a synthetic guide RNA (gRNA), are
simple and fast to synthesize and can be multiplexed. However, despite the relative ease of
their synthesis, CRISPRs have technological restrictions related to their access to targetable
genome space, which is a function of both the properties of Cas9 itself and the synthesis of
its gRNA.Cleavage by the CRISPR system requires complementary base pairing of the gRNA to a
20-nucleotide DNA sequence and the requisite protospacer-adjacent motif (PAM), a short
nucleotide motif found 3’ to the target site[16]. One can, theoretically, target any unique N20-PAM sequence in
the genome using CRISPR technology. The DNA binding specificity of the PAM sequence, which
varies depending upon the species of origin of the specific Cas9 employed, provides one
constraint. Currently, the least restrictive and most commonly used Cas9 protein is from
S. pyogenes, which recognizes the sequence NGG, and thus, any unique
21-nucleotide sequence in the genome followed by two guanosine nucleotides (N20NGG)
can be targeted. Consequently, expansion of the available targeting space imposed by the
protein component is limited to the discovery and use of novel Cas9 proteins with altered PAM
requirements[11,17] or pending the generation of novel Cas9 variants via mutagenesis or
directed evolution. The second technological constraint of the CRISPR system arises from gRNA
expression initiating at a 5’ guanosine nucleotide. Use of the type III class of RNA
polymerase III promoters have been particularly amenable for gRNA expression because these
short non-coding transcripts have well-defined ends, and all the necessary elements for
transcription, with the exclusion of the 1+ nucleotide, are contained in the upstream promoter
region. However, since the commonly used U6 promoter requires a guanosine nucleotide to
initiate transcription, use of the U6 promoter has further constrained genomic targeting sites
to GN19NGG[13,18]. Alternative approaches, such as in vitro
transcription by T7, T3, or SP6 promoters, would also require initiating guanosine
nucleotide(s)[19-21].In order to expand the current limitations of CRISPR-Cas9 targeting, we tested
whether, instead of U6, we could utilize H1 pol III as an alternative promoter[22]. Because H1 can express transcripts with either
purine (nucleotide R) located at the +1 position, we hypothesized that along with the
S. pyogenes Cas9, we could expand the CRISPR targeting space by allowing
for cleavage at both AN19NGG and GN19NGG sites (Fig. 1a). To demonstrate site-specific cleavage by H1 expressed gRNAs, we
developed a reporter assay to measure CRISPR-mediated cleavage of a GFP target gene integrated
at the AAVS-1 locus in the H7human embryonic stem cell line (hESC)[23] (Fig. 1b). We measured the
loss of GFP fluorescence, due to coding sequence disruption, as a proxy for error-prone
non-homologous end joining (NHEJ) frequency; notably, our assay would underestimate NHEJ, as
in-frame mutations or indels that do not disrupt GFP fluorescence would not be detected (Fig. 1b and c). H7 cells were electroporated with equimolar
ratios of Cas9 and gRNA expression plasmids and cells were visualized for GFP fluorescence
after colony formation. In contrast to the negative control electroporation, all gRNA
constructs from the U6 and H1 promoters we tested showed a mosaic loss of GFP signals in cells
undergoing targeted mutation (Fig. 1c and data not
shown). Quantitation of total cell number with a nuclear stain enabled cell-based analysis of
GFP fluorescence by flow cytometry. Although 100% of constructs resulted in NHEJ, as
demonstrated by loss of GFP fluorescence, the range of efficiencies varied for both U6 and H1
constructs (Fig. 1c, right and data not shown). By
expressing gRNAs from either the U6 or H1 promoters, this demonstrates that mutagenesis of the
GFP gene can occur at GN19NGG or AN19NGG sites, respectively.
Figure 1
Evaluating the ability to direct CRISPR targeting via gRNA synthesis from the H1
promoter. (a) Schematic illustration depicting the gRNA expression
constructs. Above, the U6 promoter only expresses gRNAs with a +1 guanosine nucleotide;
below, the H1 promoter can drive expression of gRNAs initiating at either purine
(adenosine or guanosine) nucleotide. On the right, a cartoon depiction of the Cas9 protein
with gRNA targeting genomic sequence AN19NGG. The location of the +1 A is
indicated. (b) Schematic overview of the eGFP targeted disruption assay. eGFP
fluorescence is disrupted by CRISPR targeting followed by error-prone NHEJ-mediated repair
resulting in frameshift mutations that disrupt the coding sequence, resulting in loss of
fluorescence. (c) Microscope images demonstrating successful CRISPR targeting
by U6 or H1 promoter expressed gRNAs. H7 ES cells were stained and colonies were
visualized to show nuclei (left, magenta), eGFP fluorescence (middle, green), and merged
images (right) indicating areas of GFP fluorescence mosaicism in the colony. To the right
is shown the quantification of eGFP fluorescence loss by flow cytometry for the respective
constructs. Below is a higher magnification of an H7 colony targeted by an H1 expressed
gRNA showing expression mosaicism. Scale bar, 50 µM. (d) Surveyor
assay-based quantitation of the frequency of NHEJ. Bioanalyzer gel image depicting control
(first lane), U6 expressed gRNA (second lane), H1 expressed gRNA (third lane), and marker
(fourth lane). The % indel (as calculated by the fraction of uncut (u) to cut (c)
bands) is indicated below.
To confirm and broaden these results with another cell line, we targeted a GFP
expressing HEK-293 cell line expressing GFP at the same locus with the same gRNA constructs as
above. By Surveyor analysis, we detected a range of efficiencies varying by promoter type and
targeting location (Fig. 1d, and Supplementary Fig. 1). By using
unmodified IMR90.4 induced pluripotent cells (hiPSC), we also confirmed the ability to modify
an endogenous gene by targeting the AAVS-1 locus within the intronic region of the PPP1R12C
gene. Targeted cleavage from H1 and U6 driven gRNAs were observed with comparable efficiencies
as measured by the Surveyor Assay (Supplementary Fig. 2).In order to determine the potential increase in targeting space, we performed
bioinformatic analysis to determine the available CRISPR sites in the human genome. While
AN19NGG sites might be predicted to occur roughly at the same frequently as
GN19NGG sites, we found that they are actually 15% more common (Fig. 2 and Supplementary Fig. 3); thus changing specificity from GN19NGG to
RN19NGG more than doubles the number of available sites. With a few exceptions,
(chr16, chr17, chr19, chr20, and chr22) AN19NGG sites are present at higher
frequencies than GN19NGG sites on each chromosome. To compare the average
genome-wide targeting densities we calculated the mean distances between adjacent CRISPR sites
in the genome for GN19NGG (59bp), AN19NGG (47bp), and RN19NGG
sites (26bp) (Fig. 2b). Additionally, AN19NGG
sites were even more enriched at relevant regions of targeting in the human genome. We found a
20% increase in AN19NGG sites in human genes, and a 21% increase at
disease loci obtained from the OMIM database (Fig. 2c).
We also examined 1165 miRNA genes from the human genome and found that 221 of these genes
could be targeted through one or more AN19NGG sites, but not through a
GN19NGG site (data not shown). Given that the efficiency of homologous
recombination negatively correlates with increasing distance from cut sites, the increase in
CRISPR targeting sites by use of the H1 promoter should facilitate more precise genomic
targeting and mutation correction[24].
Figure 2
Bioinformatics analysis of GN19NGG and AN19NGG sites in
the genome. (a) Circos plot depicting the frequency of CRISPR sites in the
human genome. The outside circle depicts the human chromosome ideograms. Moving inwards,
GN19NGG (orange), AN19NGG (blue), and RN19NGG (purple)
CRISPR sites frequency is indicated along the chromosomes. Plotted inside the circle is
the human exon density (black), and OMIM disease loci (blue). (b) Frequency
and distance between of CRISPR sites in the genome. Barplot of the frequency and distance
of adjacent GN19NGG (orange), AN19NGG (blue) sites in the genome.
The mean and median values are inset within the plot including RN19NGG sites.
(c) Barplot quantification of GN19NGG vs AN19NGG site
frequency at human genes (left) or OMIM disease loci (right). (d) Barplot
quantifying the GN19NGG vs. AN19NGG frequency in six genomes: human,
cow, mouse, rat, chicken, and zebrafish.
As CRISPR technology is increasingly utilized for genomic engineering across a wide
array of model organisms, we sought to determine the potential impact of the use of the H1
promoter in other genomes. We carried out this analysis on 5 other vertebrate genomes that had
high genomic conservation at the H1 promoter (Mouse; Rat; Chicken; Cow; and Zebrafish). In all
cases, we found a higher number of AN19NGG compared to GN19NGG sites:
+9% Cow; +14% Chicken; +19% Rat; + 21% Mouse; and +
32% Zebrafish (Fig. 2c). One explanation for this
prevalence could be due to the higher AT content (Supplementary Fig. 4). In the human genome, normalizing the
GN19NGG and AN19NGG site occurrences to AT content brings the
frequencies closer to parity, although this does not hold true for all genomes (Supplementary Fig. 4a and 4f).
Nevertheless, this demonstrates the utility of using the H1 promoter, which more than doubles
the currently available CRISPR targeting space in the human genome, and similarly in all other
genomes tested.We next sought to demonstrate the ability to target an AN19NGG site in an
endogenous gene with the H1 promoter construct. Using H7 cells, we targeted the second exon of
the MERTK locus, a gene involved with phagocytosis in the retinal pigment epithelium and
macrophages and that when mutated causes retinal degeneration[25] (Fig. 3a and 3b). To
estimate the overall targeting efficiency, we harvested genomic DNA from a population of cells
that were electroporated, and performed the Surveyor Assay. We amplified the region
surrounding the target sites with two independent PCR reactions and calculated a 9.5%
and 9.7% indel frequency (Fig. 3b). Next, 42
randomly chosen clones were isolated and tested for mutation by Surveyor analysis (data not
shown). Sequencing revealed that 7/42 (16.7%) harbored mutations clustering within 3-4
nucleotides upstream of the target PAM site. 6/7 clones had unique mutations (1 clone was
redundant) and 3 of these were bi-allelic frame-shift mutations resulting in a predicted null
MERTK allele that was confirmed by Western Blot analysis (Fig. 3c and 3d). Taken together, these results demonstrate
the ability to effectively target an AN19NGG site located at an endogenous
locus.
Figure 3
CRISPR targeting of AN19NGG at an endogenous gene (MERTK) in H7 ES
cells. (a) Schematic diagram of the MERTK locus and various protein domains.
Target site in exon 2 is shown below in larger scale, indicating the CRISPR
AN19NGG target site. (b) Quantification of CRISPR targeting at
exon2 by the Surveyor assay. The CRISPR site in exon 2 is depicted above, with the various
primers (arrows) used in the Surveyor assay; both F1:R1 and F2:R2 span the target site,
while the control PCR product, F3:R3, is just outside the target site. The gel from the
Surveyor assay is shown below with the three control products shown on the left, and
targeting is shown on the right. Below the % indel frequency is indicated.
(c) Sanger sequencing of mutant lines. Clonal lines were isolated and
sequenced indicating that CRISPR targeting at the AN19NGG sites resulted in
mutagenesis at this region. The aligned chromatograms show the 6 unique mutations that
were cloned. (d) Western Blot analysis for Mertk expression in H7-derrived
RPE cells. Lanes 1, 3, and 4 indicate knockout lines and lane 2 indicates expression from
heterozygous line.
In order to quantitatively determine the extent of off-targeting that occurred from
the GFP gRNA constructs, we used Surveyor Analysis to examine three genomic loci that were
bioinformatically predicted to be off-target sites (GFP_11-33, GFP_219-197, and GFP_315-293).
Two of these constructs (GFP_219-197, and GFP_315-293) were GN19NGG target sites,
allowing for expression with both promoters. One (GFP_11-33), an AN19NGG site, was
expressed from the U6 promoter by appending a 5’-G nucleotide. In all three off-target
loci we examined, we were unable to detect any off-target cleavage (data not shown). However,
the lack of detectable off-targets could result from our initial selection of the GFP gRNA
targets, in which sites were selected based upon low homology to other genomic loci. Thus, we
reasoned that a more stringent challenge would be to compare gRNA expression from H1 and U6
promoters at targeting sites specifically known to elicit high levels of off-target
hits[26-28]. Furthermore, the 5’ nucleotide flexibility of the H1 promoter
allowed for a direct comparison of identical gRNAs targeting GN19NGG sites between
U6 and H1 promoters, and we tested two sites previously reported from Fu et al. (2013): VEGFA
site 1 (T1) and VEGFA site 3 (T3) (Table 1 and Supplementary Fig. 5)[26,28]. An
additional benefit of the H1 promoter over the U6 promoter may be in increasing specificity by
reducing spurious cleavage. Because increased gRNA and Cas9 concentrations have been shown to
result in increased off-target hits[26,27,29], we
reasoned that the lower gRNA expression level from the H1 promoter[30-32] might also
reduce off-target effects. Using qRT-PCR, we tested the levels of the VEGFA T1 gRNA from
either the H1 and U6 promoter, confirming the reduced level of expression of the gRNA (Supplementary Fig. 5a). For the VEGFA T1
site, we tested the efficiency of cutting at the on-target loci, as well as four off-target
loci. In comparison with the U6 promoter, cutting at the on-target loci was comparable or
slightly reduced; however, the H1 promoter expressed gRNAs were notable more stringent at the
examined off-target loci indicating greater specificity (Off-target 1: 8% vs.
25%; Off-target 2: undetectable vs. 20%; and Off-target 4: 9% vs.
26%) (Table 1 and Supplementary Fig. 5). We detected equal
targeting between the two promoter constructs at the VEGFA T3 site (26%), but again,
lower levels of off-target cutting with the H1 promoter (Table
1 and Supplementary Fig. 5).
While further studies on H1 and U6 promoters expressed gRNAs needs to be performed, our data
suggests greater specificity from H1 expressed gRNAs.
Table 1
Frequency of indels induced at on-target and off-target sites by U6 or H1
expressed gRNAs.
Target
Promoter
Full-length Target
Indel mutation Frequency
VEGFA-T1
U6
GGGTGGGGGGAGTTTGCTCCtGG
24%
VEGFA-T1
H1
GGGTGGGGGGAGTTTGCTCCtGG
16%
OT1-3
U6
GGATGGAGGGAGTTTGCTCCtGG
25%
OT1-3
H1
GGATGGAGGGAGTTTGCTCCtGG
8%
OT1-4
U6
GGGAGGGTGGAGTTTGCTCCtGG
20%
OT1-4
H1
GGGAGGGTGGAGTTTGCTCCtGG
Not Detected
OT1-6
U6
CGGGGGAGGGAGTTTGCTCCtGG
Not Detected
OT1-6
H1
CGGGGGAGGGAGTTTGCTCCtGG
Not Detected
OT1-11
U6
GGGGAGGGGAAGTTTGCTCCtGG
26%
OT1-11
H1
GGGGAGGGGAAGTTTGCTCCtGG
9%
VEGFA-T3
U6
GGTGAGTGAGTGTGTGCGTGtGG
26%
VEGFA-T3
H1
GGTGAGTGAGTGTGTGCGTGtGG
26%
OT3-1
U6
GGTGAGTGAGTGTGTGTGTGaGG
20%
OT3-1
H1
GGTGAGTGAGTGTGTGTGTGaGG
13%
OT3-4
U6
GCTGAGTGAGTGTATGCGTGtGG
16%
OT3-4
H1
GCTGAGTGAGTGTATGCGTGtGG
11%
OT3-18
U6
TGTGGGTGAGTGTGTGCGTGaGG
Not Detected
OT3-18
H1
TGTGGGTGAGTGTGTGCGTGaGG
Not Detected
Accumulating evidence for S. pyogenes Cas9 targeting in
vitro and in vivo, indicates that the Cas9:gRNA recognition
extends throughout the entire 20 base pair targeting site. First, in testing
>1012 distinct variants for gRNA specificity in vitro,
one study found that the +1 nucleotide plays a role in target recognition. Furthermore,
positional specificity calculations from this data show that the 5’ nucleotide
contributes a greater role in target recognition than its 3’ neighbor, indicating that
the “seed” model for CRISPR specificity might overly simplify the contribution
of PAM-proximal nucleotides[27]. Secondly,
alternative uses such as CRISPR interference (CRISPRi), which repurposes the CRISPR system for
transcriptional repression, found that 5’ truncations in the gRNA severely compromised
repression, and 5’ extensions with mismatched nucleotides – such as mismatched
G bases for U6 expression – also reduce the repression efficiency, suggesting that
both length (20 nt) and 5’ nucleotide context are important for proper Cas9
targeting[24,33-36]. Finally, crystal
structure data further supports the experimental data and importance of the 5’
nucleotide in Cas9, as significant contacts are made with the 5’ nucleotide of the
gRNA and 3’ end of the target DNA[37,38].For increased targeting space, the use of alternate Cas9 proteins has been shown to
be effective, as in N. meningitides and S. thermophilus, yet
PAM restrictions from other type II systems reported, so far have more stringent requirements
and therefore reduce the sequence space available for targeting when used alone (data not
shown and [11,17]). In contrast, modified gRNA expression by use of the H1 promoter would
be expected to greatly expand the targeting repertoire with any Cas9 protein irrespective of
PAM differences. When we quantitated the respective gRNAs targets for orthologous Cas9
proteins (AN23NNNNGATT vs. GN23NNNNGATT for N.
meningitides and AN17NNAGAAW vs. GN17NNAGAAW for S. thermophilus),
we found a 64% and 69% increase in the gRNA sites with a 5’-A
nucleotide, indicating an even greater expansion of targeting space through use of the H1
promoter with alternate Cas9 proteins (Supplementary Table 1). As suggested in plants, use of different promoters can
expand the frequency of CRISPR sites. While the U6 promoter is restricted to a 5’
guanosine nucleotide, the U3 promoter from rice is constrained to a 5’ adenosine
nucleotide further highlighting the need for different promoters in different systems to
increase targeting space[36]. Conveniently,
sole use of the H1 promoter can be leveraged to target AN19NGG and GN19NGG sites (and possibly
CN19NGG or TN19NGG sites[39]) via a single
promoter system (Supplementary Fig.
6). This in turn can be employed to expand targeting space of both current and future
Cas9 variants with altered sites restrictions.Similarly with ZFN or TALEN technologies, one approach to mitigate potential
off-target effects might be to employ cooperative offset nicking with the Cas9 mutant
(D10A)[24,35]. This requires identification of two flanking CRISPR sites, oriented on
opposing strands, and within ~20bp of the cut site[24], and thus the additional targeting density provided by AN19NGG
sites would be expected to augment this approach. An added benefit over the U6 promoter may
also be to reduce spurious cleavage; as several groups have reported that increased gRNA and
Cas9 concentrations correlate with an increase in the propensity for off-target
mutations[26,27,29], the lower level of expression
provided by the H1 promoter may result in reduced off-target cutting.With enhanced CRISPR targeting through judicious site selection, improved Cas9
variants, optimized gRNA architecture, or additional cofactors, an increase in specificity
throughout the targeting sequence will likely result, placing greater importance on the
identity of the 5’ nucleotide. As a research tool, this will allow for greater
manipulation of the genome while minimizing confounding mutations, and for future clinical
applications, high targeting densities coupled with high-fidelity target recognition will be
paramount to delivering safe and effective therapeutics.
Methods
Plasmid construction
To generate the H1 gRNA-expressing construct, overlapping oligos were assembled
to create the H1 promoter fused to the 76bp gRNA scaffold and pol III termination signal.
In between the H1 promoter and the gRNA scaffold, a BamHI site was incorporated to allow
for the insertion of targeting sequence. The H1::gRNA scaffold::pol III terminator
sequence was then TOPO cloned into pCR4-Blunt (Invitrogen), and sequenced verified; the
resulting vector is in the reverse orientation (see below). To generate the various gRNAs
used in this study, overlapping oligos were annealed and amplified by PCR using two-step
amplification Phusion Flash DNA polymerase (Thermo Scientific), and subsequently purified
using Carboxylate-Modified Sera-Mag Magnetic Beads (Thermo Scientific) mixed with 2X
volume 25%PEG and 1.5M NaCl. The purified PCR products were then resuspended in
H2O and quantitated using a NanoDrop 1000. The gRNA-expressing constructs were generated
using the Gibson Assembly[40] (NEB) with
slight modifications for either the AflII digested plasmid (Addgene #41824) for U6
expression, or BamHI digestion of plasmid just described for H1 expression. The total
reaction volume was reduced from 20µl to 2µl.
Cell culture
The hESC line H7 and IMR-90 iPS cells (WiCell) were maintained by clonal
propagation on growth factor reduced Matrigel (BD Biosciences) in mTeSR1 medium (Stem Cell
Technologies), in a 10% CO2/5% O2 incubator
according to previously described protocols [41,42]. For passaging, hESC
colonies were first incubated with 5µM blebbistatin (Sigma) in mTesR1, and then
collected after 5–10 min treatment with Accutase (Sigma). Cell clumps were gently
dissociated into a single cell suspension and pelleted by centrifugation. Thereafter,
hPSCs were re-suspended in mTeSR1 with blebbistatin and plated at approximately
1,000–1,500 cells/cm[2]. Two days
after passage, medium was replaced with mTeSR1 (without blebbistatin) and changed
daily.Humanembryonic kidney (HEK) cell line 293T (Life Technologies) was maintained
at 37°C with 5% CO2 / 20% O2 in
Dulbecco’s modified Eagle’s Medium (DMEM) (Invitrogen) Supplemented with
10% fetal bovine serum (Gibco) and 2mM GlutaMAX (Invitrogen).
Gene targeting of H7 cells
hESC cells were cultured in 10µM Rho Kinase inhibitor (DDD00033325 EMD
Millipore) 24h prior to electroporation. Electroporation were performed using the Neon kit
(Invitrogen), according to the manufacturer instruction. Briefly, on the day of
electroporation, hESC were digested with Accutase (Sigma) for 1–2 minutes until
colonies lifted. Importantly, colonies were not dissociated into a single cell suspension.
After colonies were harvested, wet pellets were kept on ice for 15 min, and then
resuspended in electroporation buffer containing gene targeting plasmids. Electroporation
parameters were as following: voltage: 1400 ms; interval: 30 ms; 1 pulse. Following
electroporation, cell colonies were slowly transferred to mTeSR1 medium containing
10µM Rho Kinase inhibitor, and then kept at room temperature for 20 min before
plating on Matrigel-coated dishes and further cultured.For analysis of clonally derived colonies, electroporated hESC were grown to
sub-confluence, passaged as described in the previous paragraph and plated at a density of
500 cells per 35mm dish. Subsequently, single colonies were isolated by manual picking and
further cultured.For 293T cell transfection, ~100,000 cells/well were seeded in 24-well
plates (Falcon) 24 hours prior to transfection. Cells were transfected in quadruplicates
using Lipofectamine LTX Plus Reagent (Invitrogen) according to manufacturer’s
recommended protocol. For each well of a 24-well plate, 400ng of the Cas9 plasmid and
200ng of the gRNA plasmid were mixed with 0.5µl of Plus Reagent and 1.5µl
of Lipofectamine LTX reagent.
Generation of constitutively expressed GFP ESC lines
The H7human ESC line (WiCell) was maintained in mTeSR1 (Stem Cell Technologies)
media on Matrigel substrate. Prior to cell passaging, cells were subjected to a brief
pre-treatment with blebbistatin (>5 minutes) to increase cell viability, treated
with Accutase for 7 minutes, triturated to a single cell suspension, quenched with an
equal volume of mTesR, pelleted at 80xg for 5 minutes and resuspended in mTesR containing
blebbistatin. 1×106 cells were pelleted, media carefully removed and
cells placed on ice for 10–15 minutes. 10µg of AAV-CAGGS-EGFP donor vector
(Addgene; #22212) containing homology to the AAVS1 safe-harbor locus, plus
5µg each of hAAVS1 1R + L TALENs Addgene # 35431 and 35432 [23,43])
in R-buffer were electroporated with a 100µl tip-type using the Neon Transfection
System (Life Technologies, Grand Island, NY) with the following parameters: 1500V, 20ms
pulse and 1 pulse. Cells were then added gently to 1 ml of medium and incubated at room
temperature for 15 minutes and then plated onto Matrigel-coated 35mm dishes containing
mTeSR and 5µM blebbistatin. After 2 days cells were seeded at a density of
1×104 after which time stable clonal sublines were manually selected
with a fluorescence equipped Nikon TS100 epifluorescence microscope.
Surveyor assay and sequencing analysis for genome modification
For Surveyor analysis, genomic DNA was extracted by resuspending cells in
QuickExtract solution (Epicentre), incubating at 65°C for 15 minutes, and then at
98°C for 10 minutes. The extract solution was cleaned using DNA Clean and
Concentrator (Zymo Research) and quantitated by NanoDrop. The genomic region surrounding
the CRISPR target sites was amplified from 100ng of genomic DNA using Phusion DNA
polymerase (NEB). Multiple independent PCR reactions were pooled and purified using Qiagen
MinElute Spin Column following the manufacturer’s protocol. An 8µl volume
containing 400ng of the PCR product in 12.5mM Tris-HCl (pH 8.8), 62.5mM KCl and 1.875mM
MgCl2 was denatured and slowly re-annealed to allow for the formation of
heteroduplexes: 95°C for 10 minutes, 95°C to 85°C ramped at
−1.0°C/sec, 85°C for 1 sec, 85°C to 75°C ramped at
−1.0°C/sec, 75°C for 1 sec, 75°C to 65°C ramped at
−1.0°C/sec, 65°C for 1 sec, 65°C to 55°C ramped at
−1.0°C/sec, 55°C for 1 sec, 55°C to 45°C ramped at
−1.0°C/sec, 45°C for 1 sec, 45°C to 35°C ramped at
−1.0°C/sec, 35°C for 1 sec, 35°C to 25°C ramped at
−1.0°C/sec, and then held at 4°C. 1µl of Surveyor Enhancer
and 1µl of Surveyor Nuclease (Transgenomic) were added to each reaction, incubated
at 42°C for 60 min, after which, 1µl of the Stop Solution was added to the
reaction. 1µl of the reaction was quantitated on the 2100 Bioanalyzer using the
DNA 1000 chip (Agilent). For gel analysis, 2µl of 6X loading buffer (NEB) was
added to the remaining reaction and loaded onto a 3% agarose gel containing
ethidium bromide. Gels were visualized on a Gel Logic 200 Imaging System (Kodak), and
quantitated using ImageJ v. 1.46. NHEJ frequencies were calculated using the
binomial-derived equation: ; where the values of “a” and “b” are
equal to the integrated area of the cleaved fragments after background subtraction and
“c” is equal to the integrated area of the un-cleaved PCR product after
background subtraction [44].
Flow Cytometry
Following blebbistatin treatment, sub-confluent hESC colonies were harvested by
Accutase treatment, dissociated into a single cell suspension and pelleted. Cells were
then resuspended in Live Cell Solution (Invitrogen) containing Vybrant DyeCycle ruby stain
(Invitrogen) and analyzed on an Accuri C6 flow cytometer.
Quantitative real-time qPCR
293T cells were seeded at 250,000 cells/well in 12-well plates (Falcon) 24 hours
prior to transfection. Cells were transfected in triplicate using Lipofectamine LTX with
Plus Reagent (Invitrogen) according to manufacturer’s recommended protocol with a
6-dose titration of the gRNA plasmid: 0 ng, 31.25ng, 62.5ng, 125ng, 250ng, or 500ng in
each well. 48 hours post-transfection, total RNA was isolated using RNAzol RT (Molecular
Research Center), and purified using Direct-zol RNA MiniPrep (Zymo). 500ng of total RNA
was dsDNase (ArticZymes; Plymouth Meeting, PA USA) treated to remove residual genomic DNA
contamination and reverse transcribed in a 20 µl reaction using Superscript III
reverse transcriptase (Invitrogen) following the manufacturer’s recommendations.
For each reaction, 0.1µM of the following oligonucleotides were used to prime each
reaction; gRNA scaffold-CTTCGATGTCGACTCGAGTCAAAAAGCACCGACTCGGTGCCAC
, U6 snRNA-AAAATATGGAACGCTTCACGAATTTG. The underlined scaffold sequence denotes an anchor
sequence added for transcript stability. Each qPCR reaction was carried out in a Biorad
CFX 96 real-time PCR machine in a 10 µl volume using the SsoAdvanced™
Universal SYBR® Green Supermix (Biorad) containing 250nM of oligonucleotide
primers and 1 microliter of a 1:15 dilution of the RT reaction product from above.
Reactions were carried out for 40 cycles with 95°C denaturation, 54°C
annealing temperature and 60°C extension steps. The following primers were used
for detecting the guide RNA and reference gene respectively:
F1for-GTTTTAGAGCTAGAAATAGCAAGTTAA and guideRNAscaffrev-AAGCACCGACTCGGTGCCAC and
U6snRNAF-CTCGCTTCGGCAGCACATATACT and U6snRNARev-ACGCTTCACGAATTTGCGTGTC. Relative
normalized expression for each guide RNA sample and the s.e.m was calculated using the
Biorad’s integrated CFX manager software.
Bioinformatics
To determine all the potential CRISPR sites in the human genome, we used a
custom Perl script to search both strands and overlapping occurrences of the 23-mer CRISPR
sequence sites GN19NGG or AN19NGG. To calculate the mean and median
distance values, we first defined the predicted CRISPR cut site as occurring between the
third and fourth bases upstream of the PAM sequence. After sorting the sequences, we then
calculated the distances between all adjacent gRNAs in the genome. This data was imported
into R to calculate the mean and median statistical values, and to plot the data. To
calculate the mean density, the gRNA cut sites were binned across the genome and
calculated for the frequency of occurrences. This data was plotted in R using the ggplot2
package, or used Circos to generate a circular plot[45]. To calculate the occurrences in human genes or at disease loci, we
used BEDTools utility IntersectBED[46] to
find the occurrence of overlaps with either a RefSeq BED file retrieved from the UCSC
Genome Browser or a BED file from OMIM (Online Mendelian Inheritance in Man, OMIM.
McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University (Baltimore, MD),
2013. World Wide Web URL: http://omim.org/). The genomes used in this study were human (hg19), mouse
(mm10), rat (rn5), cow (bosTau7), chicken (galGal4), zebrafish (dr7), drosophila (dm3), C.
elegans (ce10), and S. cerevisiae (sacCer3).
Authors: Daniel Boden; Oliver Pusch; Fredrick Lee; Lynne Tucker; Peter R Shank; Bharat Ramratnam Journal: Nucleic Acids Res Date: 2003-09-01 Impact factor: 16.971
Authors: Petri I Mäkinen; Jonna K Koponen; Anna-Mari Kärkkäinen; Tarja M Malm; Kati H Pulkkinen; Jari Koistinaho; Mikko P Turunen; Seppo Ylä-Herttuala Journal: J Gene Med Date: 2006-04 Impact factor: 4.565
Authors: Dong Sung An; F Xiao-Feng Qin; Vincent C Auyeung; Si Hua Mao; Sam K P Kung; David Baltimore; Irvin S Y Chen Journal: Mol Ther Date: 2006-07-17 Impact factor: 11.454
Authors: P M D'Cruz; D Yasumura; J Weir; M T Matthes; H Abderrahim; M M LaVail; D Vollrath Journal: Hum Mol Genet Date: 2000-03-01 Impact factor: 6.150
Authors: C Ellis Wisely; Javed A Sayed; Heather Tamez; Chris Zelinka; Mohamed H Abdel-Rahman; Andy J Fischer; Colleen M Cebulla Journal: Prog Retin Eye Res Date: 2017-06-28 Impact factor: 21.198
Authors: Haley D Axelrod; Kenneth C Valkenburg; Sarah R Amend; Jessica L Hicks; Princy Parsana; Gonzalo Torga; Angelo M DeMarzo; Kenneth J Pienta Journal: Mol Cancer Res Date: 2018-10-05 Impact factor: 5.852