Ju-Chan Park1, Hyeon-Ki Jang2, Jumee Kim1, Jun Hee Han3, Youngri Jung3, Keuntae Kim1, Sangsu Bae2,3, Hyuk-Jin Cha1. 1. College of Pharmacy, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul 08826, Republic of Korea. 2. Institute for Convergence of Basic Sciences, Hanyang University, Seoul 04763, Republic of Korea. 3. Department of Chemistry, Hanyang University, 222, Wangsimni-ro, Seongdong-gu, Seoul 04763, Republic of Korea.
Abstract
Precise genome editing of human pluripotent stem cells (hPSCs) is crucial not only for basic science but also for biomedical applications such as ex vivo stem cell therapy and genetic disease modeling. However, hPSCs have unique cellular properties compared to somatic cells. For instance, hPSCs are extremely susceptible to DNA damage, and therefore Cas9-mediated DNA double-strand breaks (DSB) induce p53-dependent cell death, resulting in low Cas9 editing efficiency. Unlike Cas9 nucleases, base editors including cytosine base editor (CBE) and adenine base editor (ABE) can efficiently substitute single nucleotides without generating DSBs at target sites. Here, we found that the editing efficiency of CBE was significantly lower than that of ABE in human embryonic stem cells (hESCs), which are associated with high expression of DNA glycosylases, the key component of the base excision repair pathway. Sequential depletion of DNA glycosylases revealed that high expression of uracil DNA glycosylase (UNG) not only resulted in low editing efficiency but also affected CBE product purity (i.e., C to T) in hESCs. Therefore, additional suppression of UNG via transient knockdown would also improve C to T base substitutions in hESCs. These data suggest that the unique cellular characteristics of hPSCs could determine the efficiency of precise genome editing.
Precise genome editing of human pluripotent stem cells (hPSCs) is crucial not only for basic science but also for biomedical applications such as ex vivo stem cell therapy and genetic disease modeling. However, hPSCs have unique cellular properties compared to somatic cells. For instance, hPSCs are extremely susceptible to DNA damage, and therefore Cas9-mediated DNA double-strand breaks (DSB) induce p53-dependent cell death, resulting in low Cas9 editing efficiency. Unlike Cas9 nucleases, base editors including cytosine base editor (CBE) and adenine base editor (ABE) can efficiently substitute single nucleotides without generating DSBs at target sites. Here, we found that the editing efficiency of CBE was significantly lower than that of ABE in human embryonic stem cells (hESCs), which are associated with high expression of DNA glycosylases, the key component of the base excision repair pathway. Sequential depletion of DNA glycosylases revealed that high expression of uracil DNA glycosylase (UNG) not only resulted in low editing efficiency but also affected CBE product purity (i.e., C to T) in hESCs. Therefore, additional suppression of UNG via transient knockdown would also improve C to T base substitutions in hESCs. These data suggest that the unique cellular characteristics of hPSCs could determine the efficiency of precise genome editing.
With the development of genome editing technologies, human pluripotent stem cells (hPSCs) have become a critical cell resource not only for autologous stem cell therapy but also for disease modeling, thus providing the means to create pathogenic cell models harboring genetic mutations (i.e., an approach referred to as “disease-in-a-dish”).1, 2, 3 However, hPSCs are extremely susceptible to double-strand breaks (DSBs) and subsequent p53-dependent cell death through mitochondrial translocation of p53 (referred to as mitochondrial priming to apoptosis),, which serves as a safeguard to maintain the genome integrity of these cells, all of which affects the editing efficiency of Cas9 nucleases in hPSCs. Several strategies have been tested to enhance Cas9-mediated gene editing activity in hPSCs, including the introduction of survival genes, delivery with Cas9 ribonucleoproteins (RNP), and co-targeting with drug- and/or toxin-resistant genes., However, all of these approaches are fundamentally based on the induction of DNA DSBs. Alternatively, DNA base editors (BEs), including cytosine base editor (CBE) and adenine base editor (ABE), can convert single nucleotides without generating DSBs, thus decreasing DNA damage. Therefore, despite the limitations of BEs, such as incompetence of transversion, restricted target sequence accessibility due to the requirement of the protospacer adjacent motif sequence, and undesired mutations of bystander sequences within the target window, BEs have recently garnered much attention and have been applied in hPSCs to produce isogenic disease models with pathogenic mutations, or gene corrections.Although no DSB induction occurs by BEs, BEs can induce target DNA damage through base deamination and single-strand breaks, owing to the activities of a Cas9 nickase (nCas9) and conjugated cytidine deaminase or adenosine deaminase., Mismatch bases formed by deamination of target bases by BEs are efficiently repaired by the base excision repair (BER) and mismatch repair (MMR) pathways. BER is initiated by the recognition and excision of deaminated bases with base-specific DNA glycosylases (e.g., uracil DNA glycosylase [UNG], thymidine DNA glycosylase [TDG], methyl-CpG binding domain 4 DNA glycosylase [MBD4], and 3-methyladenine-DNA glycosylase [MPG])., MMR, an evolutionarily conserved DNA repair process, can be initiated by the recognition of small base mismatches by MutSα, a heterodimer of the MSH2 and MSH6 proteins. Additionally, the MutSα-DNA complex further recruits repair machineries such as endonucleases, DNA polymerase, ligase, and others.Due to the critical roles of embryonic stem cells (ESCs) in embryo development, a process from which all somatic cells in the body originate, ESCs have evolved molecular mechanisms to safeguard genome integrity and minimize spontaneous mutations., Thus, DNA repair systems such as BER, MMR, and others are highly active in hPSCs, which renders unique cellular characteristics to hPSCs but also affects the outcome of genome editing. For example, hPSCs are distinctively susceptible to p53-dependent cell death, which results in low Cas9-mediated gene editing efficacy in these cells. However, to the best of our knowledge, our study is the first to comprehensively analyze gene editing outcomes with CBE and ABE in hPSCs against somatic cell lines.Here, we conducted high-throughput sequencing analysis for tens of endogenous targets in hPSCs as well as in somatic cancer cell lines. Interestingly, the editing efficiency of CBE was lower than that of ABE only in hPSCs, whereas CBEs and ABEs exhibited similar levels of editing efficiencies in several somatic cancer cell lines. Further, high expression levels of DNA glycosylases including UNG, which are responsible for elevated BER activity, prevailed in undifferentiated hPSCs. The genetic perturbation of DNA glycosylases indicated that the relatively low editing activity of CBE compared to ABE in human ESCs (hESCs) was likely attributable to a high UNG expression. Therefore, simple transient depletion of UNG overall increased the editing activity and product purity of CBEs, thus providing a precise means to modulate C to T transition for future disease modeling and ex vivo gene correction in hPSCs.
Results
ABE-mediated editing efficiency is distinctively higher than that of CBE in hESCs
We first examined gene editing efficiencies mediated by Cas9 nucleases, CBEs, and ABEs in H9-hESCs, as well as in three different somatic cancer cell lines (HeLa, U2OS, and K562). To this end, we selected ten endogenous target sites for each Cas9 nuclease, CBE, and ABE, where gene editing efficiencies between ABE and CBE were comparable in somatic cell lines,,, and their perturbation gave minimal effect on pluripotency of hESCs. Accordingly, three different plasmids encoding Cas9 from Streptococcus pyogenes (SpCas9), as well as AncBE4max (with two UNG inhibitor [UGI] motifs, hereafter BE4) for CBE and ABEmax for ABE were prepared. Three days after the transfection of each genome editing tool, the genomic DNAs were subjected to high-throughput sequencing to evaluate the editing outcomes from bulk populations. For Cas9 nucleases, our results indicated that H9-hESCs exhibited relatively lower insertion and deletion (indel) rates (average 20%) than other cell lines (50% for HeLa, 50% for U2OS, and 50% for K562) (Figure 1A). Consistent with the previous study reporting that Cas9 showed low activity in hPSCs due to p53-dependent cell death during gene editing, Cas9 induced mRNA expression of PPM1D, encoding wild-type p53-induced phosphatase1 (WIP1), a well-characterized phosphatase for p53, as well as H2AX, that was markedly induced by Nutlin3 treatment, the MDM2 inhibitor to stabilize p53 (Figure S1A). Intriguingly, both PPM1D and MDM2, p53 downstream genes, were drastically induced by inhibition of apoptosis with pan-caspase inhibitor (z-VAD) (Figures S1A and S1B), implying that hESCs underwent Cas9-induced cell death. Considering the extremely high susceptibility of hESCs to p53-dependent cell death,, these results may account for the lower Cas9 efficiency of hESCs compared to the other somatic cancer cell lines, consistent with the previous report (Figure 1A).
Figure 1
ABE-mediated editing efficiency is distinctively higher than that of CBE in hESCs
(A) Comparison of indel frequencies in hESC, U2OS, K562, and HeLa cells after Cas9-encoding plasmid delivery at 10 genomic sites. Dots represent the mean value of independent biological replicates for each target. Bars represent mean value, and error bars represent the SEM of the dots. ∗p < 0.05, ∗∗p < 0.01, comparing with the H9-hESCs by paired t test. (B) Comparison of base editing efficiencies in hESC, U2OS, K562, and HeLa cells after CBE- and ABE-encoding plasmid delivery at 10 genomic sites. Dots represent the mean value of two independent biological replicates for each target. Bars represent mean value, and error bars represent the SEM of the dots. ∗p < 0.05, comparing efficiency CBE and ABE by paired t test.
ABE-mediated editing efficiency is distinctively higher than that of CBE in hESCs(A) Comparison of indel frequencies in hESC, U2OS, K562, and HeLa cells after Cas9-encoding plasmid delivery at 10 genomic sites. Dots represent the mean value of independent biological replicates for each target. Bars represent mean value, and error bars represent the SEM of the dots. ∗p < 0.05, ∗∗p < 0.01, comparing with the H9-hESCs by paired t test. (B) Comparison of base editing efficiencies in hESC, U2OS, K562, and HeLa cells after CBE- and ABE-encoding plasmid delivery at 10 genomic sites. Dots represent the mean value of two independent biological replicates for each target. Bars represent mean value, and error bars represent the SEM of the dots. ∗p < 0.05, comparing efficiency CBE and ABE by paired t test.Particularly, for CBEs and ABE, we found that the editing activity of ABE was substantially higher than that of CBEs in H9-hESCs (Figures 1B, S1C, and S1D) under a comparable mRNA level of ABE and CBE (Figure S1E), whereas ABEs and CBEs exhibited comparable editing efficiencies in all other cell types. Although the gene editing efficiencies of CBEs and ABEs varied depending on the target sequences, the average activities between CBEs and ABEs were similar in the somatic cancer cell lines. To confirm our findings (i.e., the skewed editing efficiency of CBE in hESCs), we further tested another hESC line, hCHA3, and observed similar distinctive characteristics in these cells (Figure S1F).Similar to that of Cas9, relevance of the skewed editing efficiencies of CBE compared with those of ABE in hESCs was next examined, in that p53-dependent cell death would be critical for determining editing efficiency in hPSCs. For establishment of TP53 knockout (TP53 KO) hESCs, sgRNA targeting exon 4 of TP53 (gTP53) was designed (Figure S2A). TP53KO hESCs were readily established by simply selecting a surviving colony after introduction of Cas9 and gTP53, followed by Nutlin3 treatment (Figures S2B and S2C), with approximately 100% indel (Figure S2D). As predicted, p53 response, determined by MDM2 expression, disappeared in TP53KO hESCs by either Nutlin3 or Cas9, unlike that of wild-type hESCs (Figure S2E). While cell death by introduction of Cas9, ABE, and CBE was evidently attenuated (Figure S2F), editing efficiencies of Cas9, ABE, and CBE were significantly improved in TP53 KO hESCs (Figure S2G). These results imply that p53 response would be less relevant to skewed editing efficiency of CBE in hESCs, as shown in Figure 1B.
Distinct expression pattern of DNA glycosylases in hPSCs
We next sought to determine why CBE exhibited lower editing efficiency than ABE in hESCs other than p53. Given the high activity of DNA repair mechanisms in hPSCs due to the high expression of repair genes in BER or MMR compared to differentiated cells, base mismatches formed by BEs would be readily repaired by either BER or MMR. However, MMR is initiated by the recognition of mismatch bases by the MSH protein complex (i.e., MutSα). Of note, it has been reported that base mutations in purine-pyrimidine and purine-purine are equivalently well-repaired by MMR, unlike those in pyrimidine-pyrimidine. As base mutations (e.g., I:T and G:U) that occurred by both ABE and CBE, respectively, belong to purine-pyrimidine, it is likely that MMR similarly affects both ABE and CBE (Figure 2A). Therefore, the MMR did not account for the skewed editing efficiency of CBE compared to ABE in hPSCs and was thus ruled out as the mechanism that mediated the aforementioned discrepancy. Instead, our downstream experiments focused on BER, which is initiated by base-specific DNA glycosylases (Figure 2A).
Figure 2
Distinct expression pattern of DNA glycosylases in hPSCs
(A) Brief scheme of mismatch repair and base excision repair pathway. Deaminated bases are colored in red, and newly synthesized DNA strand through DNA repair pathway is colored in green. (B) Gene set enrichment analysis for gene ontology of base excision repair pathway from indicated gene sets. (C) Expression of UNG, TDG, MBD4, and MPG in several hESCs cell lines from NextBio portal. Normal cells were analyzed as controls. Bars represent mean values, and error bars represent the SD. ∗∗∗∗p < 0.0001, comparing with the hESCs by unpaired t test. (D) mRNA expression of POU5F1, NANOG, UNG, TDG, MBD4, and MPG of hESC-derived MSCs (H9-MSCs), hESCs (H9-ESCs), human fibroblast (BJ fibroblast), BJ-fibroblast-induced iPSCs (BJ-iPSCs), and human dermal fibroblasts (hDF). Bars represent mean values, and error bars represent the SD (n = 2). (E) mRNA expression of POU5F1, UNG, TDG, MBD4, and MPG from H9-hESCs at indicative days after differentiation. Dots represent mean values, and error bars represent the SD (n = 2).
Distinct expression pattern of DNA glycosylases in hPSCs(A) Brief scheme of mismatch repair and base excision repair pathway. Deaminated bases are colored in red, and newly synthesized DNA strand through DNA repair pathway is colored in green. (B) Gene set enrichment analysis for gene ontology of base excision repair pathway from indicated gene sets. (C) Expression of UNG, TDG, MBD4, and MPG in several hESCs cell lines from NextBio portal. Normal cells were analyzed as controls. Bars represent mean values, and error bars represent the SD. ∗∗∗∗p < 0.0001, comparing with the hESCs by unpaired t test. (D) mRNA expression of POU5F1, NANOG, UNG, TDG, MBD4, and MPG of hESC-derived MSCs (H9-MSCs), hESCs (H9-ESCs), human fibroblast (BJ fibroblast), BJ-fibroblast-induced iPSCs (BJ-iPSCs), and human dermal fibroblasts (hDF). Bars represent mean values, and error bars represent the SD (n = 2). (E) mRNA expression of POU5F1, UNG, TDG, MBD4, and MPG from H9-hESCs at indicative days after differentiation. Dots represent mean values, and error bars represent the SD (n = 2).Consistent with previous reports,,, several gene ontology terms associated not only with DNA repair but also base excision repair were highly enriched in undifferentiated hPSCs (Figures 2B and S3A) based on multiple datasets (Figure S3B). The common differentially expressed genes from these datasets were then visualized in KEGG pathway maps (https://www.genome.jp/). This analysis indicated that a large number of BER-associated genes (Figure S3C) were highly expressed in undifferentiated hPSCs. Interestingly, the KEGG pathway analysis of this dataset indicated that the DNA glycosylases that are responsible for the removal of base lesions by CBE (e.g., UNG, TDG, and MBD4) were upregulated, whereas MPG, which is known to remove inosine (I) from DNA, was downregulated in hPSCs compared with their differentiated counterparts (Figure S3C). To further generalize the unique expressions of DNA glycosylases, we took advantage of a transcriptome database of cell lines (http://nextbio.com) and compared the expressions of these DNA glycosylases between 25 hESCs and 15 normal cell lines (Table S1) and identified distinct expression patterns in hESCs (Figure 2C). Similar to the expression profile from the dataset, DNA glycosylases such as UNG, TDG, and MBD4 were highly expressed along with POU5F1 in two independent hPSC models (H9-hESCs and BJ-iPSCs) compared with their differentiated counterparts (mesenchymal stem cells derived from H9-hESCs [hESC-MSCs], and BJ fibroblast, a parent fibroblast of BJ-iPSCs) (Figure 2D). In contrast, MPG expression was higher in the differentiated cells than in hPSCs (Figure 2D). As predicted, the expression of DNA glycosylases for uracil (e.g., UNG, TDG, and MBD4) was markedly reduced during spontaneous differentiation, whereas MPG exhibited a distinct expression pattern (Figure 2E). Consistent with these observations, the catalytic activity of UNG was significantly higher in hESCs compared to hESC-MSCs (Figure S3D).
Marginal effect of DNA glycosylases on ABE outcomes in hESCs
Due to the high expression of TDG and MBD4 instead of MPG in hPSCs, we first hypothesized that the I:T mismatch caused by ABE would be readily recognized by TDG and MBD4, which preferably recognize G:T mismatches (Figure 3A) due to the structural similarity between inosine and guanine (Figure 3A, inserted panel). However, contrary to our expectation that the prompt excision of T from I:T mismatches by MBD4 and/or TDG may cause higher A to G mutation rates in hESCs, perturbation of MBD4 and/or TDG expression failed to significantly affect ABE outcomes in multiple targets (Figures 3B and 3C). Next, we hypothesized that the relatively low expression of MPG (Figure 2C) might affect ABE efficiency. Unexpectedly, ectopic expression of MPG (Figure S4) had only a marginal effect on A to G editing efficiency in multiple targets (Figure 3D), and no significant differences were observed between conditions (Figure 3E). Therefore, ABE outcome was less affected by the expression level of DNA glycosylases in hESCs.
Figure 3
Marginal effect of DNA glycosylases on ABE outcomes in hESCs
(A) Scheme of ABE-mediated base editing process. (B) Base editing in H9-hESCs after ABE-encoding plasmid delivery with siRNA targeting MBD4 (siMBD4), TDG (siTDG) and both siRNA at the same time (siM + T) at four genomic sites. Bars represent mean values, and error bars represent the SD of two independent biological replicates. (C) Comparison of ABE efficiencies in H9-hESCs after ABE-encoding plasmid delivery with siRNA targeting DNA glycosylases at four genomic sites. The editing efficiency of ABE is normalized to non-targeting siRNA (siNC) delivered efficiency. Error bars represent the SEM of two independent biological replicates. (D) Base editing in H9-hESCs after ABE-encoding plasmid delivery with MPG expression (pMPG) or control plasmid (pCont) plasmid at seven genomic sites. Bars represent mean values, and error bars represent the SD of two independent biological replicates. (E) Comparison of ABE efficiencies in H9-hESCs after ABE-encoding plasmid delivery with MPG overexpression (pMPG) or control plasmid (pCont) at seven genomic sites. The editing efficiency was normalized to pCont. Bars represent mean values, and error bars represent the SD of three independent biological replicates.
Marginal effect of DNA glycosylases on ABE outcomes in hESCs(A) Scheme of ABE-mediated base editing process. (B) Base editing in H9-hESCs after ABE-encoding plasmid delivery with siRNA targeting MBD4 (siMBD4), TDG (siTDG) and both siRNA at the same time (siM + T) at four genomic sites. Bars represent mean values, and error bars represent the SD of two independent biological replicates. (C) Comparison of ABE efficiencies in H9-hESCs after ABE-encoding plasmid delivery with siRNA targeting DNA glycosylases at four genomic sites. The editing efficiency of ABE is normalized to non-targeting siRNA (siNC) delivered efficiency. Error bars represent the SEM of two independent biological replicates. (D) Base editing in H9-hESCs after ABE-encoding plasmid delivery with MPG expression (pMPG) or control plasmid (pCont) plasmid at seven genomic sites. Bars represent mean values, and error bars represent the SD of two independent biological replicates. (E) Comparison of ABE efficiencies in H9-hESCs after ABE-encoding plasmid delivery with MPG overexpression (pMPG) or control plasmid (pCont) at seven genomic sites. The editing efficiency was normalized to pCont. Bars represent mean values, and error bars represent the SD of three independent biological replicates.
UNG expression for efficiency and product purity of CBE in hESCs
Given that the expression level of DNA glycosylases was less associated with ABE (Figure 3), we next sought to determine whether this factor accounted for the skewed efficiency of CBE in hESCs. Given that uracil results from the deamination of cytosine by CBE, after which it is recognized and excised by UNG, TDG, or MBD4 (Figure 4A), depletion of one of these DNA glycosylases would alter the editing outcome of CBE. Unlike in ABE, knock down of UNG but not the other DNA glycosylases (Figure S5A) was likely to improve C to T editing in most of the tested targets in hESCs (Figure 4B). The overall improvement of C to T editing efficiency by depletion of UNG was statistically significant (Figure 4C). These results suggest that high UNG expression in hESCs likely explained the low CBE editing efficiency, resulting in skewed BE editing. Next, we examined the effect of transient depletion of DNA glycosylases in off-target editing of CBE by analyzing 12 off-target sites from CCR5-3, HEK2, and HEK3 (Table S2). Of note, no significant alterations of off-target editing by transient knockdown of DNA glycosylase were observed (Figures S5B–S5D). We also observed that depletion of UNG not only improved C to T efficiency but also product purity. The undesirable C to G and C to A conversion was reduced in multiple targets with different degrees by transient depletion of UNG in hESCs (Figures 4B and 4C). Therefore, the normalized product impurity (e.g., C to G [Figure 4D], C to A [Figure S5E], and indel [Figure S5F]) was significantly diminished by the simple knockdown of UNG but not any other DNA glycosylase. Furthermore, different expression level of UGI (by BE4 without UGI, BE4, and BE4 with additional ectopic expression of UGI) in CBE editing determined both editing efficiency and product purity (Figures S5G and S5H). We thus concluded that the distinct base editing pattern (e.g., lower CBE efficiency) in hESCs resulted from the high expression of UNG. Therefore, transient siRNA-mediated depletion of UNG would be beneficial for efficient C to T substitution of hESCs with CBE to further establish disease models or correct genetic mutations in hPSCs.
Figure 4
UNG expression for efficiency and product purity of CBE in hESCs
(A) Scheme of CBE-mediated base editing process. (B) Percentage of sequencing reads with specific edited nucleotides that have been converted from target Cs in hESCs after CBE-encoding plasmid delivery with siRNA targeting UNG (siUNG), TDG (siTDG), and MBD4 (siMBD4). Non-targeting siRNA (siNC) was used as control. Blue, C to T; red, C to G; yellow, C to A. Error bars represent the SEM of three independent biological replicates. (C and D) Comparison of C to T (C) and C to G (D) conversion efficiencies in hESC after CBE-encoding plasmid delivery with siRNA targeting DNA glycosylase at seven genomic sites. The conversion efficiency after CBE-encoding plasmid delivery with each DNA glycosylase-targeting siRNA delivery is normalized to the conversion efficiency after CBE-encoding plasmid delivery with siNC. (E) Table of targetability and main preference of UNG, TDG, and MBD4.
UNG expression for efficiency and product purity of CBE in hESCs(A) Scheme of CBE-mediated base editing process. (B) Percentage of sequencing reads with specific edited nucleotides that have been converted from target Cs in hESCs after CBE-encoding plasmid delivery with siRNA targeting UNG (siUNG), TDG (siTDG), and MBD4 (siMBD4). Non-targeting siRNA (siNC) was used as control. Blue, C to T; red, C to G; yellow, C to A. Error bars represent the SEM of three independent biological replicates. (C and D) Comparison of C to T (C) and C to G (D) conversion efficiencies in hESC after CBE-encoding plasmid delivery with siRNA targeting DNA glycosylase at seven genomic sites. The conversion efficiency after CBE-encoding plasmid delivery with each DNA glycosylase-targeting siRNA delivery is normalized to the conversion efficiency after CBE-encoding plasmid delivery with siNC. (E) Table of targetability and main preference of UNG, TDG, and MBD4.
Discussion
BER is an evolutionally conserved DNA damage repair mechanism that safeguards genome integrity even immediately after fertilization. BER is the primary repair system through which DNA damage is repaired in the developing zygote and is also highly activated in ESCs compared to differentiated cells, serving as an important genome safeguard.,Unlike in somatic cancer cell lines, we demonstrated that the editing efficiency of CBE was significantly lower than that of ABE in hESCs (Figure 1). Base-specific DNA glycosylases were distinctively expressed in hPSCs (Figure 2), which can account for the skewed efficiency of CBE compared to ABE in hPSCs. The temporary genetic perturbation of DNA glycosylases (UNG, TDG, MPG, and MBD4) indicated that the efficiency of CBE but not ABE (Figure 3) was determined by the high UNG expression in hPSCs (Figure 4). The fact that UNG depletion only affected CBE efficiency (Figure 4) suggests that the U:G mismatch produced by CBE was mostly repaired by UNG but not TDG and MBD4 in hESCs. UNG mostly recognizes uracil produced from the deamination of cytosine, whereas TDG preferentially recognizes 5-hydromethyl uracil (5hmU) or thymidine produced by 5hmC and 5-methyl cytosine (5mC) deamination (Figure 4E). Likewise, MBD4 preferentially recognizes T:G and U:G mismatches in CpG regions (Figure 4E). Given the preference of the rat APOBEC1 deaminase of CBE (BE4max) toward unmodified cytosine,, it is highly plausible that high UNG activity rather than TDG or MBD4 in hESCs competes with the activity of APOBEC1 toward unmodified cytosine despite the presence of UGI in BE4max.In contrast, the genetic perturbations in DNA glycosylase expression observed herein had a marginal effect on ABE editing efficiency (Figure 3). Despite the structural similarity of inosine with guanine, temporary ectopic expression of MPG, which is responsible for removing G from G:T mismatch, failed to enhance ABE editing efficiency in hPSCs (Figure 3D). These findings were consistent with previous results demonstrating that MPG knockout had a marginal effect on ABE editing efficiency and product purity. Although several cytosine deaminases act on DNA in mammalian cells, adenosine deaminases in mammalian cells are only involved in nucleotide metabolism and RNA. This is why adenosine deaminases in ABE are genetically engineered from the pre-existing adenosine deaminases acting from RNA, which might explain why editing efficiency is less affected by natural MPG than UNG. We also examined the depletion of TDG and/or MBD4 during ABE treatment under the assumption that TDG and MBD4 would similarly recognize I:T and G:T mismatches. However, transition depletion of TDG and/or MBD4 showed marginal effects on ABE efficiency (Figure 3B), suggesting that TDG and MBD4 can discriminate between I:T and G:T mismatches.The distinct DNA damage responses of hESCs may not only affect the editing outcome of CBE but also that of other types of DNA editing tools. For example, the recently developed glycosylase base editor (GBE) enables C to G transversion in eukaryotes by inducing apurinic (AP) sites via cytidine deaminase and UNG conjugated onto nCas9. We speculate that hPSCs would likely exhibit distinct GBE editing outcomes compared with other somatic cell lines given that the AP sites produced by GBE can be recognized by MBD4, which is highly expressed in hPSCs. Thus, for efficient genome editing in hPSCs either for disease modeling or ex vivo stem cell therapy, the unique characteristics of specific cell lines must be considered.
Materials and methods
Statistical analysis
The quantitative data are expressed as the mean values ± standard deviation (SD). Paired t tests and Student's unpaired t test were performed to analyze the statistical significance of gene editing efficiency and gene expression comparison, respectively, using the PRISM. For comparison of gene editing efficiency between cell lines or editing conditions (e.g., ABE versus CBE), paired t test was performed. For comparison of mRNA expression and live cell ratio, Student's unpaired t test was performed. Values less than 0.05 were considered statistically significant (∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001 and ∗∗∗∗p < 0.0001).
Plasmid construction
The plasmids in this study were provided from Addgene, including p3s-Cas9-HN (addgene no.104171), pCMV_ABEmax (addgene no.112095), pCMV-AncBE4max (addgene no. 112094), pUGI-NLS (addgene no.101091), and pRG2 (Addgene no. 104174). Sequences corresponding to sgRNAs were cloned into BsaI-digested pRG2 vector (Addgene no. 104174). For this step, oligos containing the spacer sequence were annealed to form double-stranded DNA fragments with compatible overhangs and ligated using T4 ligase (Enzynomics). To construct pCMV-AncBE4max without UGI, the c-terminal part of BE4max contacting 2× UGI was digested by Cas9 and AgeI endonuclease (NEB) and Gibson cloned using NEBuilder HiFi DNA Assembly master mix (NEB). All plasmids used for transfection experiments were prepared using a NucleoBond Xtra Midi Plus EF kit (MN).
Cell culture and transfection
HeLa (ATCC CCL-2) and U-2OS (ATCC HTB-96) cells were cultured in DMEM supplemented with 10% fetal bovine serum (FBS) and 1% penicillin-streptomycin (WELGENE). K562 cells (ATCC CRL-3343) were cultured in RPMI-1640 supplemented with 10% FBS and 1% penicillin-streptomycin (WELGENE). For Cas9-, ABE-, or CBE-mediated genome editing, Cas9-, ABE-, or CBE-encoding plasmids (0.5 μg) and sgRNA-encoding plasmids (0.17 μg) were mixed with cells (1.5 × 105) and electroporated via Neon Transfection System. H9 (WA09, WiCell Research Institute) and CHA3 hESCs were cultured on Matrigel (BD Biosciences) coated dishes fed with StemMACS media (Miltenyi-Biotec) added with 50 μg/mL Gentamicin (Gibco). For Matrigel coating, 200 μL of Matrigel was diluted in 16 mL of cold DMEM/F-12 media (Gibco). Diluted Matrigel was distributed to culture plate and incubated in a cell culture incubator for 1 h. For transfer, hESCs were rinsed with Dulbecco's phosphate buffered saline (DPBS) and detached with Accutase solution (561527, BD Biosciences). Detached cells were washed with DMEM/F-12 media for three times. Washed cells were resuspended with 1 mL of StemMACS media and plated on Matrigel-coated plate with StemMACS media added with 10 μM of Y27632 (Gibco). For transfection, hESCs were rinsed with DPBS and detached with Accutase solution (561527, BD Biosciences). Cells were washed with Opti-MEM (31985070, Gibco) three times and diluted to a concentration of 1 × 106 cells in 100 μL of Opti-MEM (31985070, Gibco). 2 μg of Cas9 or BE vectors (Cas9, BE4max, and ABEmax cloned in pCMV vector) and 2 μg of sgRNA vector were added to the cell mixture. For siRNA or overexpression vector, an additional 2 μg of siRNA or overexpression vector was additionally added to the cell mixture. Electroporation was performed by NEPA-21. Poring pulse was 175 V and transfer pulse was 2.5 mV.
Targeted deep sequencing
For analysis of editing efficiency, genomic DNA were extracted from Cas9-, ABE-, or CBE-transfected cells using a NucleoSpin Tissue kit (MN) at 3 days after transfection. Target sites were amplified using a KOD Multi & Epi PCR kit (TOYOBO) for sequencing library generation. These libraries were sequenced using MiniSeq with a TruSeq HT Dual Index system (Illumina) as previously described. Briefly, equal amounts of the PCR amplicons were subjected to paired-end read sequencing using an Illumina MiniSeq platform. After MiniSeq, paired-end reads were analyzed by comparing wild-type and mutant sequences using BE-analyzer. High-throughput sequencing data have been deposited in the NCBI Sequence Read Archive database (https://www.ncbi.nlm.nih.gov/sra) under accession number PRJNA787724.
Editing efficiency averaging and normalization
Average value of editing efficiency in each target was calculated via arithmetic mean of individual experiments. Average value of editing efficiency in each cell line was calculated via arithmetic mean of individual targets. Normalization of editing efficiency was conducted by dividing the average value of editing efficiency of each target from perturbation (e.g., siUNG or pMPG) by that of the control (e.g., siNC or pCont).
RT-qPCR analysis
For total RNA extraction, Easy-BLUETM RNA isolation kit (iNtRON Biotechnology) was used. RNA was extracted form cell pellets via Easy-BLUETM RNA isolation kit, following the supplier's instructions. cDNA was synthesized by PrimeScriptTM RT reagent kit (TaKaRa). 2 μL of PrimeScriptTM RT reagent kit was added to 500 μg of RNA samples in 8 μL of distilled water (DW) and reacted for 15 min at 37°C. Light Cycler-480®II (Loche) and SYBR® Green PCR reagents (Life Technologies) were used for quantitative real-time PCR analysis, following the supplier's instructions.
Generation of TP53 knockout hPSCs
Cells were transfected with Cas9 and gTP53 vector with the same conditions as in the "cell culture and transfection" section. To enrich TP53 KO cells, 10 μM of nutlin3 was treated for 48 h after 5 days from transfection.
Data availability
Source data are available from the corresponding authors upon request.
Authors: Luke W Koblan; Jordan L Doman; Christopher Wilson; Jonathan M Levy; Tristan Tay; Gregory A Newby; Juan Pablo Maianti; Aditya Raguram; David R Liu Journal: Nat Biotechnol Date: 2018-05-29 Impact factor: 54.908