Rasmus O Bak1, Daniel P Dever1, Andreas Reinisch2,3,4, David Cruz Hernandez2,3,4, Ravindra Majeti2,3,4, Matthew H Porteus1. 1. Department of Pediatrics, Stanford University, Stanford, United States. 2. Department of Medicine, Division of Hematology, Stanford University, Stanford, United States. 3. Department of Medicine, Institute for Stem Cell Biology and Regenerative Medicine, Stanford University, Stanford, United States. 4. Department of Medicine, Cancer Institute, Stanford University, Stanford, United States.
Abstract
Precise and efficient manipulation of genes is crucial for understanding the molecular mechanisms that govern human hematopoiesis and for developing novel therapies for diseases of the blood and immune system. Current methods do not enable precise engineering of complex genotypes that can be easily tracked in a mixed population of cells. We describe a method to multiplex homologous recombination (HR) in human hematopoietic stem and progenitor cells and primary human T cells by combining rAAV6 donor delivery and the CRISPR/Cas9 system delivered as ribonucleoproteins (RNPs). In addition, the use of reporter genes allows FACS-purification and tracking of cells that have had multiple alleles or loci modified by HR. We believe this method will enable broad applications not only to the study of human hematopoietic gene function and networks, but also to perform sophisticated synthetic biology to develop innovative engineered stem cell-based therapeutics.
Precise and efficient manipulation of genes is crucial for understanding the molecular mechanisms that govern humanhematopoiesis and for developing novel therapies for diseases of the blood and immune system. Current methods do not enable precise engineering of complex genotypes that can be easily tracked in a mixed population of cells. We describe a method to multiplex homologous recombination (HR) in human hematopoietic stem and progenitor cells and primary human T cells by combining rAAV6donor delivery and the CRISPR/Cas9 system delivered as ribonucleoproteins (RNPs). In addition, the use of reporter genes allows FACS-purification and tracking of cells that have had multiple alleles or loci modified by HR. We believe this method will enable broad applications not only to the study of human hematopoietic gene function and networks, but also to perform sophisticated synthetic biology to develop innovative engineered stem cell-based therapeutics.
The current gold standard method for studying human hematopoietic stem and progenitor cell (HSPC) gene function has been either overexpression or RNAi-mediated knockdown of genes using lentiviral vectors (Doulatov et al., 2012; Chan et al., 2015). While these methods have provided great insights into HSPC biology, they come with several confounders, such as random integration of the vector into the host genome, unregulated transgene expression, and incomplete gene knockdown (Woods et al., 2006; Naldini, 2015). More recently, programmable nucleases such as zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and CRISPR/Cas9 have been utilized to disrupt genes by the introduction of site-specific DNA double strand breaks (DSBs) that are corrected through non-homologous end-joining (NHEJ) (Hendel et al., 2015; Holt et al., 2010; Saydaminova et al., 2015; Mandal et al., 2014; Schumann et al., 2015; Kim et al., 2014; Lin et al., 2014). This error-prone system creates a heterogeneous mixture of cells with various genotypes of SNPs and small insertions or deletions (INDELs); moreover, not all of the genetic changes from INDELs cause functional gene disruption as they may preserve the open reading frame and may not change amino acids essential for protein functions (Shi et al., 2015; Hultquist et al., 2016). In a prior study, defined gene deletions were created in HSPCs using a dual sgRNA approach, however, more than half of the alleles were not modified leading to residual gene expression (Mandal et al., 2014). Another limitation of this prior study is that successfully modified cells were not distinguishable from unmodified wild type (WT) cells, and therefore could not be tracked or isolated as an enriched population. Although the versatility of the CRISPR/Cas9 system allows for simultaneous manipulation at multiple genetic loci in a single cell, multiplexing of NHEJ-based gene editing has mainly been performed in immortalized humancancer cell lines and mouse cells (Hultquist et al., 2016; Cong et al., 2013; Heckl et al., 2014; Platt et al., 2014; Brown et al., 2016). Finally, these interesting multiplexed proof-of-concept studies, only used NHEJ-mediated editing and did not harness the power of homologous recombination (HR) to create more sophisticated alterations to the genome at multiple alleles and/or loci.Here, we report an HR-mediated genome engineering method in human HSPCs and T cells that overcomes these limitations and enables the generation and enrichment of HSPC or T cell populations with complete gene knockout or gene replacement at multiple genetic loci. This method has the power to reveal functional gene networks during hematopoiesis and immune system disease pathogenesis and could be combined with the concepts of synthetic biology to create novel stem cell based therapeutics.
Results
Enriching HSPCs with targeted integration
We and others have previously shown that HR in human HSPCs can be efficiently induced by site-specific nucleases in combination with homologous donor DNA delivered as single-stranded oligonucleotides (ssODNs), integration-defective lentiviral vectors (ÍDLVs), or by recombinant adeno-associated virus serotype 6 (rAAV6) vectors (Dever et al., 2016; DeWitt et al., 2016; De Ravin et al., 2017; Wang et al., 2015; Hoban et al., 2016). We previously showed targeted integration in the beta-globin gene (HBB) by combining delivery of Cas9 protein pre-complexed with chemically modified sgRNAs (RNP) and delivery of an AAV6 donor. After successful on-target integration of a reporter transgene, FACS-based sorting of transgene reporterhigh-expressing HSPCs was used to purify an HSPC population with >90% targeted integration that displayed long-term repopulation capacity in NSG mice (Dever et al., 2016). To extend this method beyond the HBB locus for therapeutic genome editing approaches of hemoglobinopathies, we tested six additional loci for their potential to be modified through HR by CRISPR/Cas9 in combination with AAV6-derived donor delivery. These genes are associated with hematopoiesis, hematopoietic malignancies, or safe harbor sites and include: interleukin-2 receptor gamma chain (IL2RG), chemokine (C-C motif) receptor 5 (CCR5), runt-related transcription factor one isoform c (RUNX1c), additional sex combs like 1 (ASXL1), stromal antigen 2 (STAG2), and adeno-associated virus integration site 1 (AAVS1) (Tebas et al., 2014; Genovese et al., 2014; Patel et al., 2012; Mazumdar et al., 2015; Kotin et al., 1992). Following electroporation with Cas9 RNP, containing a chemically-modified sgRNA targeting a single site in the selected locus, and transduction with an rAAV6donor vector carrying homology arms for the targeted site and an expression cassette encoding a fluorescent reporter gene (Figure 1—figure supplement 1a), we observed at early time points (day 4) a cell population with increased fluorescence intensity detectable by flow cytometry (reporterhigh cells) compared to cells receiving only the rAAV6donor without electroporation of Cas9 RNP (reporterlow) (Figure 1a and Supplementary file 1a). For cells targeted at either CCR5 or IL2RG, reporterhigh, reporterlow, and reporterneg populations were sorted at day four post-electroporation and cultured up to 22 days. Reporterhigh populations remained 99.2 ± 0.7% reporter positive (Figure 1b) while sorted reporterlow and reporterneg populations were 29.3 ± 5.4% and 0.6 ± 0.2% reporter positive, respectively. Dividing the reporterlow cells into three sub fractions based on fluorescence intensity revealed that GFP intensity at day four post-electroporation positively correlated with the propensity for maintaining GFP expression at day 20 (Figure 1—figure supplement 1b–c). In addition, single reporterhigh cells were plated in methylcellulose to assess integration events at the clonal level. Targeted HSPCs formed a mix of myeloid (CFU-M/GM) and erythroid colonies (BFU-E, CFU-E) indicating that they retained HSPC function. ‘In-Out PCR’ (one donor-specific primer and one locus-specific primer outside of the respective homology arms) on genomic DNA (gDNA) from single cell-derived methylcellulose colonies confirmed that 99%, 92%, and 100% of reporterhigh HSPCs targeted at CCR5 (338 clones analyzed), IL2RG (117 clones analyzed), and RUNX1 (36 clones analyzed), respectively, had at least a monoallelic targeted integration (Figure 1c and Figure 1—figure supplement 2). Analyses of clones with only mono-allelic integration showed gene-specific differences in the modification of the non-integrated alleles ranging from 38% INDELs for IL2RG to 89% INDELs for CCR5% and 88% INDELs for RUNX1, among which the majority was gene-disrupting (Figure 1—figure supplement 2 and Supplementary file 1b). Collectively, these data indicate that the observed log-fold transgene expression shift following rAAV6 and RNP delivery is due to HR at the intended locus and that reporter expression can be used to enrich gene-targeted HSPCs.
Figure 1—figure supplement 1.
Analysis of cell fractions with different fluorescence intensity.
(a) Schematic showing the general layout of the AAV6 donors employed. ITR: inverted terminal repeat; SFFV promoter: spleen focus forming virus promoter; GFP: green fluorescent protein; polyA: bovine growth hormone polyadenylation signal; RHA: right homology arm. Approximate sizes are shown below each component. (b) Cells were targeted at the HBB locus by electroporation of Cas9 RNP followed by transduction of a homologous rAAV6 donor carrying a GFP expression cassette. At 4 days post electroporation and transduction, cells with different GFP intensities (GFPhigh, GFPLowHigh, GFPLowMed, GFPLowLow) were FACS-sorted and cultured for an additional 16 days. At day 20 post targeting, cells were analyzed for GFP expression by flow cytometry and the red gates show the GFPhigh population at this time point. (c) The cells from b) were analyzed at different time points after sorting, and data points show the percentage of cells within the GFPhigh gate for the different populations as well as a population receiving only the rAAV6 donor and not Cas9 RNP.
Figure 1.
FACS-based identification and enrichment of monogenic genome-edited CD34+ human hematopoietic stem and progenitor cells (HSPCs).
(a) HSPCs were electroporated with CCR5-RNP and transduced with CCR5-tNGFR rAAV6 HR donor. Representative FACS plots from day four post-electroporation highlight the CCR5 tNGFRhigh population (red gate) generated by the addition of Cas9 RNP compared to cells with low reporter expression (green gate) and reporternegative cells (black gate). Numbers reflect percentage of cells within gates. (b) Day four post-electroporation, CCR5 (tNGFR or GFP) and IL2RG (GFP)-targeted HSPCs from reporterhigh (red), reporterlow (green), and reporterneg (blue) fractions were sorted and cultured for 20-22 days while monitoring the percentage of cells that remained GFP+. Error bars represent S.E.M. N = 6 for CCR5, N = 3 for IL2RG, all from different CD34+ donors. (c) HSPCs were targeted at CCR5 (with GFP or tNGFR donor) or at IL2RG (GFP donor; only female cells for IL2RG). At day four post-electroporation, reporterhigh cells were single-cell sorted into methylcellulose for colony formation. PCR was performed on colony-derived gDNA to detect targeted integrations. 338 CCR5 and 177 IL2RG myeloid and erythroid methylcellulose colonies were screened from at least two different CD34+ HSPC donors. (d) HSPCs were targeted at the STAG2 gene or the AAVS1 locus with a GFP reporter cassette. Cells that only received the STAG2-GFP AAV6 donor and not Cas9 RNP were included as an additional control. At day four post-electroporation and transduction, reporterhigh cells from the STAG2 and AAVS1 targeting experiments and bulk cells from the STAG2 AAV6 only population were plated in methylcellulose for colony formation. After 14 days, colonies were scored as either erythroid or myeloid based on morphology. Error bars represent S.E.M, N = 3, ***p<0.001, n.s. = p≥0.05, unpaired t-test.
(a) Schematic showing the general layout of the AAV6 donors employed. ITR: inverted terminal repeat; SFFV promoter: spleen focus forming virus promoter; GFP: green fluorescent protein; polyA: bovine growth hormone polyadenylation signal; RHA: right homology arm. Approximate sizes are shown below each component. (b) Cells were targeted at the HBB locus by electroporation of Cas9 RNP followed by transduction of a homologous rAAV6 donor carrying a GFP expression cassette. At 4 days post electroporation and transduction, cells with different GFP intensities (GFPhigh, GFPLowHigh, GFPLowMed, GFPLowLow) were FACS-sorted and cultured for an additional 16 days. At day 20 post targeting, cells were analyzed for GFP expression by flow cytometry and the red gates show the GFPhigh population at this time point. (c) The cells from b) were analyzed at different time points after sorting, and data points show the percentage of cells within the GFPhigh gate for the different populations as well as a population receiving only the rAAV6 donor and not Cas9 RNP.
(a) Left, schematic representation of the three-primer PCR used to genotype CCR5 alleles for integrated (green PCR product) and non-integrated (red PCR product) alleles. One forward primer is located in the left homology arm (LHA), one forward primer is located in the poly A, and a common reverse primer is located outside the region of the right homology arm (RHA). Right, gel image of representative genotyped clones from Figure 1c (CCR5) showing colonies with biallelic and monoallelic integrations. (b) A subset of the CCR5 and IL2RG clones (only female cells for IL2RG) from Figure 1c with monoallelic integration had the genotype on the non-integrated allele analyzed by Sanger sequencing of purified PCR products. Note that in-frame INDELs can be gene-disrupting depending on the location and size of the INDEL. (c) As in Figure 1c, HSPCs were targeted at RUNX1 and at day four post-electroporation, reporterhigh cells were single cell-sorted into methylcellulose-containing 96-well plates to establish colonies. After 14 days, PCR was performed on colony-derived gDNA to detect targeted integrations. A total of 36 myeloid and erythroid methylcellulose colonies were screened. (d) The monoallelically targeted clones from c) had the genotype assessed on the non-integrated allele by Sanger sequencing of purified PCR products. See Supplementary file 1b for complete list of genotypes.
Figure 1—figure supplement 2.
Genotypes of clones with mono-genic targeting.
(a) Left, schematic representation of the three-primer PCR used to genotype CCR5 alleles for integrated (green PCR product) and non-integrated (red PCR product) alleles. One forward primer is located in the left homology arm (LHA), one forward primer is located in the poly A, and a common reverse primer is located outside the region of the right homology arm (RHA). Right, gel image of representative genotyped clones from Figure 1c (CCR5) showing colonies with biallelic and monoallelic integrations. (b) A subset of the CCR5 and IL2RG clones (only female cells for IL2RG) from Figure 1c with monoallelic integration had the genotype on the non-integrated allele analyzed by Sanger sequencing of purified PCR products. Note that in-frame INDELs can be gene-disrupting depending on the location and size of the INDEL. (c) As in Figure 1c, HSPCs were targeted at RUNX1 and at day four post-electroporation, reporterhigh cells were single cell-sorted into methylcellulose-containing 96-well plates to establish colonies. After 14 days, PCR was performed on colony-derived gDNA to detect targeted integrations. A total of 36 myeloid and erythroid methylcellulose colonies were screened. (d) The monoallelically targeted clones from c) had the genotype assessed on the non-integrated allele by Sanger sequencing of purified PCR products. See Supplementary file 1b for complete list of genotypes.
FACS-based identification and enrichment of monogenic genome-edited CD34+ human hematopoietic stem and progenitor cells (HSPCs).
(a) HSPCs were electroporated with CCR5-RNP and transduced with CCR5-tNGFR rAAV6 HR donor. Representative FACS plots from day four post-electroporation highlight the CCR5 tNGFRhigh population (red gate) generated by the addition of Cas9 RNP compared to cells with low reporter expression (green gate) and reporternegative cells (black gate). Numbers reflect percentage of cells within gates. (b) Day four post-electroporation, CCR5 (tNGFR or GFP) and IL2RG (GFP)-targeted HSPCs from reporterhigh (red), reporterlow (green), and reporterneg (blue) fractions were sorted and cultured for 20-22 days while monitoring the percentage of cells that remained GFP+. Error bars represent S.E.M. N = 6 for CCR5, N = 3 for IL2RG, all from different CD34+ donors. (c) HSPCs were targeted at CCR5 (with GFP or tNGFR donor) or at IL2RG (GFP donor; only female cells for IL2RG). At day four post-electroporation, reporterhigh cells were single-cell sorted into methylcellulose for colony formation. PCR was performed on colony-derived gDNA to detect targeted integrations. 338 CCR5 and 177 IL2RG myeloid and erythroid methylcellulose colonies were screened from at least two different CD34+ HSPC donors. (d) HSPCs were targeted at the STAG2 gene or the AAVS1 locus with a GFP reporter cassette. Cells that only received the STAG2-GFP AAV6 donor and not Cas9 RNP were included as an additional control. At day four post-electroporation and transduction, reporterhigh cells from the STAG2 and AAVS1 targeting experiments and bulk cells from the STAG2 AAV6 only population were plated in methylcellulose for colony formation. After 14 days, colonies were scored as either erythroid or myeloid based on morphology. Error bars represent S.E.M, N = 3, ***p<0.001, n.s. = p≥0.05, unpaired t-test.
Analysis of cell fractions with different fluorescence intensity.
(a) Schematic showing the general layout of the AAV6 donors employed. ITR: inverted terminal repeat; SFFV promoter: spleen focus forming virus promoter; GFP: green fluorescent protein; polyA: bovine growth hormone polyadenylation signal; RHA: right homology arm. Approximate sizes are shown below each component. (b) Cells were targeted at the HBB locus by electroporation of Cas9 RNP followed by transduction of a homologous rAAV6donor carrying a GFP expression cassette. At 4 days post electroporation and transduction, cells with different GFP intensities (GFPhigh, GFPLowHigh, GFPLowMed, GFPLowLow) were FACS-sorted and cultured for an additional 16 days. At day 20 post targeting, cells were analyzed for GFP expression by flow cytometry and the red gates show the GFPhigh population at this time point. (c) The cells from b) were analyzed at different time points after sorting, and data points show the percentage of cells within the GFPhigh gate for the different populations as well as a population receiving only the rAAV6donor and not Cas9 RNP.
Genotypes of clones with mono-genic targeting.
(a) Left, schematic representation of the three-primer PCR used to genotype CCR5 alleles for integrated (green PCR product) and non-integrated (red PCR product) alleles. One forward primer is located in the left homology arm (LHA), one forward primer is located in the poly A, and a common reverse primer is located outside the region of the right homology arm (RHA). Right, gel image of representative genotyped clones from Figure 1c (CCR5) showing colonies with biallelic and monoallelic integrations. (b) A subset of the CCR5 and IL2RG clones (only female cells for IL2RG) from Figure 1c with monoallelic integration had the genotype on the non-integrated allele analyzed by Sanger sequencing of purified PCR products. Note that in-frame INDELs can be gene-disrupting depending on the location and size of the INDEL. (c) As in Figure 1c, HSPCs were targeted at RUNX1 and at day four post-electroporation, reporterhigh cells were single cell-sorted into methylcellulose-containing 96-well plates to establish colonies. After 14 days, PCR was performed on colony-derived gDNA to detect targeted integrations. A total of 36 myeloid and erythroid methylcellulose colonies were screened. (d) The monoallelically targeted clones from c) had the genotype assessed on the non-integrated allele by Sanger sequencing of purified PCR products. See Supplementary file 1b for complete list of genotypes.To evaluate the applicability of this technology in a biologically relevant setting we decided to modify the cohesin complex member, STAG2, in primary CD34+ HSPCs. The cohesin complex has previously been shown to play an essential part in maintaining normal erythroid differentiation potential of hematopoietic stem and progenitor cells (Mazumdar et al., 2015; Viny et al., 2015; Mullenders et al., 2015). Since the STAG2 gene is located on the human X chromosome, single-allele integration of a fluorescent reporter in male cells would be sufficient to fully knock out the gene. As expected, Cas9 RNP combined with rAAV6donor transduction resulted in the generation of a reporterhigh population that could be sorted for subsequent differentiation experiments. Single cell methylcellulose assays of reporterhigh cells revealed an almost complete loss in the capacity to form erythroid colonies compared to cells that had only been exposed to rAAV6 and not Cas9 RNP, and also compared to cells with targeted integration at the AAVS1 locus (Figure 1d). These proof-of-concept studies provide evidence that gene-specific enrichment of reporterhigh cells can be used to study HSPC gene function.
Biallelic targeted integration in HSPCs
To determine if this method could be used to enrich HSPCs with biallelic gene disruption, necessary for complete functional gene knockout, we targeted the ASXL1 gene and simultaneously provided GFP and BFP-encoding rAAV6 donors. Four days after electroporation and transduction, 10.4% of cells were double positive for GFPhigh and BFPhigh compared to 0.2% for the AAV only sample (Figure 2a). Similarly, double-positive populations were apparent when targeting three other genes (RUNX1, HBB, and CCR5) with two rAAV6 donors with various color combinations (Figure 2—figure supplement 1 and Supplementary file 1c). Double-positive cells sorted at day four after electroporation remained 94% double-positive for more than two weeks in culture (Figure 2b). ‘In-out PCR’ on gDNA from single cell-derived methylcellulose clones confirmed on-target integration of one transgene into one allele and the other transgene into the second allele (Figure 2c). We next tested if the biallelic targeting approach could be extended to another blood cell type and therefore targeted primary human T cells for biallelic HR at CCR5. After electroporation with CCR5-targeting Cas9 RNP followed by transduction with GFP and mCherry CCR5 rAAV6 donors, a GFPhigh/mCherryhigh double-positive population was observed, indicative of biallelic integration at the CCR5 gene (Figure 2d). No significant toxicity was associated with biallelic targeting in T cells (Figure 2—figure supplement 2). Overall, these results demonstrate the utility of using rAAV6, Cas9 RNP, and FACS to enrich for primary human HSPCs and T cells that have undergone biallelic homologous recombination, which may have applications for studying hematological and immunological diseases or generating HSPC or T cell therapeutics that require gene modifications or gene knockout at both alleles.
Figure 2.
Identification and enrichment of biallelic genome-edited CD34+ human hematopoietic stem and progenitor cells (HSPCs).
(a) Left, Schematic showing biallelic targeting strategy for ASXL1 using GFP and BFP-encoding rAAV6 donors for integration into each allele of ASXL1. The SFFV promoter drives reporter expression. Middle, FACS plot from an ‘AAV only’ sample day four post electroporation, showing low episomal reporter expression (BFP and GFP) in cells without the CRISPR system. Right, FACS plot of CD34+ HSPCs treated with both Cas9 RNP and the two rAAV6 donors highlighting the generation of BFPhigh/GFPhigh double positive cells that have undergone ASXL1 dual-allelic targeting. (b) HSPCs were targeted at both alleles of HBB (Cas9 RNP with GFP and tdTomato rAAV6 donors) and at day four post electroporation, dual positive cells were sorted and cultured for 16 days while analyzing reporter expression. Error bars representing S.E.M. are present, but too small to be visible (N = 3 different HSPC donors). (c) Gel images showing PCR genotyping of six methylcellulose-derived clones from (e) confirming integration into each of the HBB alleles. (d) Human primary T cells were CD3/CD28 stimulated for three days and then electroporated with CCR5-targeting Cas9 RNP and transduced with two CCR5-specific rAAV6 donors encoding GFP and mCherry, respectively. FACS plots show GFPhigh/mCherryhigh biallelic targeting frequencies at day four post-electroporation.
Top, Representative FACS plots from HSPCs transduced with two rAAV6 (two fluorescent reporters for each gene), that have homology for the genes listed on the bottom panel, show low episomal expression and very few dual reporterhigh-expressing HSPCs. Bottom, HSPCs were electroporated with gene-specific Cas9 RNPs and then transduced with rAAV6 targeting each allele of a gene with two different indicated fluorescent reporters. FACS plots at Day 4-post electroporation highlight the dual reporterhigh cells that have undergone HR at both alleles of the intended gene.
Human primary T cells were isolated from buffy coats and stimulated for three days using anti-CD3 and anti-CD28 antibodies. Cells were then electroporated with CCR5-targeting Cas9 RNP and transduced with two CCR5-specific rAAV6 donors encoding GFP and mCherry, respectively, either alone or in combination. Cell viabilities were measured at Day two post-electroporation by Trypan Blue exclusion assay (N = 2 different buffy coat-derived T cells).
Figure 2—figure supplement 1.
Cas9 and rAAV6-mediated biallelic homologous recombination (HR) in human CD34+HSPCs.
Top, Representative FACS plots from HSPCs transduced with two rAAV6 (two fluorescent reporters for each gene), that have homology for the genes listed on the bottom panel, show low episomal expression and very few dual reporterhigh-expressing HSPCs. Bottom, HSPCs were electroporated with gene-specific Cas9 RNPs and then transduced with rAAV6 targeting each allele of a gene with two different indicated fluorescent reporters. FACS plots at Day 4-post electroporation highlight the dual reporterhigh cells that have undergone HR at both alleles of the intended gene.
Figure 2—figure supplement 2.
Toxicity assessment of biallelic integration at the CCR5 locus in primary human T cells.
Human primary T cells were isolated from buffy coats and stimulated for three days using anti-CD3 and anti-CD28 antibodies. Cells were then electroporated with CCR5-targeting Cas9 RNP and transduced with two CCR5-specific rAAV6 donors encoding GFP and mCherry, respectively, either alone or in combination. Cell viabilities were measured at Day two post-electroporation by Trypan Blue exclusion assay (N = 2 different buffy coat-derived T cells).
Identification and enrichment of biallelic genome-edited CD34+ human hematopoietic stem and progenitor cells (HSPCs).
(a) Left, Schematic showing biallelic targeting strategy for ASXL1 using GFP and BFP-encoding rAAV6 donors for integration into each allele of ASXL1. The SFFV promoter drives reporter expression. Middle, FACS plot from an ‘AAV only’ sample day four post electroporation, showing low episomal reporter expression (BFP and GFP) in cells without the CRISPR system. Right, FACS plot of CD34+ HSPCs treated with both Cas9 RNP and the two rAAV6 donors highlighting the generation of BFPhigh/GFPhigh double positive cells that have undergone ASXL1 dual-allelic targeting. (b) HSPCs were targeted at both alleles of HBB (Cas9 RNP with GFP and tdTomato rAAV6 donors) and at day four post electroporation, dual positive cells were sorted and cultured for 16 days while analyzing reporter expression. Error bars representing S.E.M. are present, but too small to be visible (N = 3 different HSPC donors). (c) Gel images showing PCR genotyping of six methylcellulose-derived clones from (e) confirming integration into each of the HBB alleles. (d) Human primary T cells were CD3/CD28 stimulated for three days and then electroporated with CCR5-targeting Cas9 RNP and transduced with two CCR5-specific rAAV6 donors encoding GFP and mCherry, respectively. FACS plots show GFPhigh/mCherryhigh biallelic targeting frequencies at day four post-electroporation.
Cas9 and rAAV6-mediated biallelic homologous recombination (HR) in human CD34+HSPCs.
Top, Representative FACS plots from HSPCs transduced with two rAAV6 (two fluorescent reporters for each gene), that have homology for the genes listed on the bottom panel, show low episomal expression and very few dual reporterhigh-expressing HSPCs. Bottom, HSPCs were electroporated with gene-specific Cas9 RNPs and then transduced with rAAV6 targeting each allele of a gene with two different indicated fluorescent reporters. FACS plots at Day 4-post electroporation highlight the dual reporterhigh cells that have undergone HR at both alleles of the intended gene.
Toxicity assessment of biallelic integration at the CCR5 locus in primary human T cells.
Human primary T cells were isolated from buffy coats and stimulated for three days using anti-CD3 and anti-CD28 antibodies. Cells were then electroporated with CCR5-targeting Cas9 RNP and transduced with two CCR5-specific rAAV6 donors encoding GFP and mCherry, respectively, either alone or in combination. Cell viabilities were measured at Day two post-electroporation by Trypan Blue exclusion assay (N = 2 different buffy coat-derived T cells).
Simultaneous HR-mediated targeting of two genes (Di-Genic) in HSPCs
The vast majority of hematopoietic functions and immune diseases are governed by complex, polygenic networks (Seita and Weissman, 2010). To potentially study gene-gene interactions and/or generate cell therapeutics with HR modifications at two separate genes, we tested whether our methodology could facilitate simultaneous di-genic (two different genes) HR in HSPCs. We therefore co-delivered HBB-tdTomato and IL2RG-GFP rAAV6 donors with Cas9 RNP targeting both genes. This strategy produced 10.2% double positive GFPhigh/tdTomatohigh HSPCs compared to 0.1% for the AAV only control sample (Figure 3a). We also generated double reporterhigh positive populations when testing other combinations of di-genic HR (IL2RG/CCR5, RUNX1/ASXL1, and HBB/CCR5) (Figure 3—figure supplement 1 and Supplementary file 1c). Again, double reporterhigh positive cells sorted at day four post-electroporation remained 94% double positive for 15 days in culture (Figure 3b). ‘In-Out PCR’ on double positive methylcellulose myeloid and erythroid clones showed on-target integration at both loci in 88% of clones (57 clones analyzed) (Figure 3c and d).
Figure 3.
Identification, enrichment, and long-term engraftment in NSG mice of di-genic genome-edited CD34+ human hematopoietic stem and progenitor cells (HSPCs).
(a) Left, Schematic depicting HBB and IL2RG di-genic targeting. Middle, FACS plot of an ‘AAV only’ sample at day four post electroporation, showing low episomal reporter expression (HBB-tdTomato and IL2RG-GFP) in cells without the CRISPR system. Right, FACS plot at day four post-electroporation of HSPCs electroporated with Cas9 RNP targeting both HBB and IL2RG followed by transduction with HBB-tdTomato and IL2RG-GFP rAAV6 donors showing the generation of tdTomatohigh/GFPhigh cells with di-genic targeting at HBB and IL2RG. (b) Double-positive HSPCs targeted at HBB (GFP) and CCR5 (mCherry) were sorted at day four post-electroporation and cultured for 15 days while analyzing reporter expression. Error bars represent S.E.M. (N = 3 different HSPC donors). (c) Representative gel images showing PCR genotyping of six (out of 57 total) HBB-GFPhigh (gene reporter 1)/CCR5-mCherryhigh (gene reporter 2) methylcellulose-derived clones confirming integration at each locus (d) Representative fluorescence microscopy images of methylcellulose-derived clones with di-genic targeting at HBB and CCR5 show myeloid and erythroid progenitors with both GFP and mCherry expression. (e) HSPCs were targeted at the HBB and AAVS1 loci with a GFP and BFP expression cassette, respectively. Representative FACS plot (left panel) shows analysis seven days after targeting. All four gated populations were sorted and genomic DNA was subject to TIDE analysis for determining INDEL frequencies at the two loci (middle panel), and subject to ddPCR quantification of one of the two possible monocentric translocations between HBB and AAVS1 (right panel) (see also Figure 3—figure supplement 2). (f) Representative FACS plots from cells targeted at the HBB and AAVS1 loci with a GFP and BFP expression cassette, respectively. Representative FACS plot shows analysis four days after targeting at which point the four populations were sorted and transplanted intrafemorally into NSG mice that were irradiated 24 hr before transplantation. (g) Bone marrow from the injected femurs from the mice transplanted as described in (f) was analyzed 12 weeks after transplantation. Representative FACS plots are from a mouse from each of the four groups depicted in (f) as well as a mouse transplanted with mock-electroporated cells. The middle row depicts human engraftment gated as positive for the human leukocyte antigen complex (HLA-ABC). The upper and lower rows depict FACS plots gated from the human populations and show myeloid (CD33+) and lymphoid (CD19+) engraftment (upper row) as well as reporter gene expression (lower row) (see also Figure 3—figure supplement 3 for all transplantation data).
Top, Representative FACS plots of HSPCs transduced with two rAAV6 donors targeting two genes with two distinct fluorescent reporters (listed in FACS plots in lower panel) show low episomal expression and few dual reporterhigh-expressing HSPCs. Bottom, HSPCs were electroporated with two different gene-specific Cas9 RNPs and then transduced with homologous rAAV6 donors (each gene targeted with a different fluorescent reporter). Representative FACS plots from Day 4 post electroporation show the generation of dual reporterhigh positive HSPCs targeted at both genes.
(a) Schematic showing the HBB gene on chromosome 11 and the AAVS1 locus on chromosome 19. The Cas9 cut sites are shown in red. One of the two possible monocentric translocations is shown. (b) The reference sequence of the HBB-AAVS1 translocation is shown in the top. Below are representative translocation sequences from GFP-BFP- HSPCs sorted seven days after targeting (see Figure 3e, left panel). (c) Representative ddPCR analyses quantifying translocations in NTC (non-template control), mock-electroporated, and GFP-BFP- cells (see Figure 3e, right panel). The reference assay quantifies TERT gene copies used to normalize for DNA input. The translocation assay probe binds 50 bp away from the junction and none of the identified translocations would therefore exclude probe binding.
The table shows an overview of the 11 NSG mice that were transplanted intrafemorally with either mock-electroporated cells or sorted cells from the four populations displayed in Figure 3f. 12 weeks after transplant, the transplanted femurs were flushed and the cells analyzed for human engraftment based on HLA-ABC expression, B cell or myeloid phenotype (CD19 and CD33, respectively), and expression of the two reporter genes.
Figure 3—figure supplement 1.
Cas9 and rAAV6-mediated di-genic homologous recombination (HR) in human CD34+ HSPCs.
Top, Representative FACS plots of HSPCs transduced with two rAAV6 donors targeting two genes with two distinct fluorescent reporters (listed in FACS plots in lower panel) show low episomal expression and few dual reporterhigh-expressing HSPCs. Bottom, HSPCs were electroporated with two different gene-specific Cas9 RNPs and then transduced with homologous rAAV6 donors (each gene targeted with a different fluorescent reporter). Representative FACS plots from Day 4 post electroporation show the generation of dual reporterhigh positive HSPCs targeted at both genes.
Identification, enrichment, and long-term engraftment in NSG mice of di-genic genome-edited CD34+ human hematopoietic stem and progenitor cells (HSPCs).
(a) Left, Schematic depicting HBB and IL2RG di-genic targeting. Middle, FACS plot of an ‘AAV only’ sample at day four post electroporation, showing low episomal reporter expression (HBB-tdTomato and IL2RG-GFP) in cells without the CRISPR system. Right, FACS plot at day four post-electroporation of HSPCs electroporated with Cas9 RNP targeting both HBB and IL2RG followed by transduction with HBB-tdTomato and IL2RG-GFP rAAV6 donors showing the generation of tdTomatohigh/GFPhigh cells with di-genic targeting at HBB and IL2RG. (b) Double-positive HSPCs targeted at HBB (GFP) and CCR5 (mCherry) were sorted at day four post-electroporation and cultured for 15 days while analyzing reporter expression. Error bars represent S.E.M. (N = 3 different HSPC donors). (c) Representative gel images showing PCR genotyping of six (out of 57 total) HBB-GFPhigh (gene reporter 1)/CCR5-mCherryhigh (gene reporter 2) methylcellulose-derived clones confirming integration at each locus (d) Representative fluorescence microscopy images of methylcellulose-derived clones with di-genic targeting at HBB and CCR5 show myeloid and erythroid progenitors with both GFP and mCherry expression. (e) HSPCs were targeted at the HBB and AAVS1 loci with a GFP and BFP expression cassette, respectively. Representative FACS plot (left panel) shows analysis seven days after targeting. All four gated populations were sorted and genomic DNA was subject to TIDE analysis for determining INDEL frequencies at the two loci (middle panel), and subject to ddPCR quantification of one of the two possible monocentric translocations between HBB and AAVS1 (right panel) (see also Figure 3—figure supplement 2). (f) Representative FACS plots from cells targeted at the HBB and AAVS1 loci with a GFP and BFP expression cassette, respectively. Representative FACS plot shows analysis four days after targeting at which point the four populations were sorted and transplanted intrafemorally into NSG mice that were irradiated 24 hr before transplantation. (g) Bone marrow from the injected femurs from the mice transplanted as described in (f) was analyzed 12 weeks after transplantation. Representative FACS plots are from a mouse from each of the four groups depicted in (f) as well as a mouse transplanted with mock-electroporated cells. The middle row depicts human engraftment gated as positive for the human leukocyte antigen complex (HLA-ABC). The upper and lower rows depict FACS plots gated from the human populations and show myeloid (CD33+) and lymphoid (CD19+) engraftment (upper row) as well as reporter gene expression (lower row) (see also Figure 3—figure supplement 3 for all transplantation data).
Figure 3—figure supplement 2.
Measuring translocations after HBB and AAVS1 di-genic targeting.
(a) Schematic showing the HBB gene on chromosome 11 and the AAVS1 locus on chromosome 19. The Cas9 cut sites are shown in red. One of the two possible monocentric translocations is shown. (b) The reference sequence of the HBB-AAVS1 translocation is shown in the top. Below are representative translocation sequences from GFP-BFP- HSPCs sorted seven days after targeting (see Figure 3e, left panel). (c) Representative ddPCR analyses quantifying translocations in NTC (non-template control), mock-electroporated, and GFP-BFP- cells (see Figure 3e, right panel). The reference assay quantifies TERT gene copies used to normalize for DNA input. The translocation assay probe binds 50 bp away from the junction and none of the identified translocations would therefore exclude probe binding.
Figure 3—figure supplement 3.
Analysis of mice transplanted with different sorted populations of cells targeted at the HBB and AAVS1 locus.
The table shows an overview of the 11 NSG mice that were transplanted intrafemorally with either mock-electroporated cells or sorted cells from the four populations displayed in Figure 3f. 12 weeks after transplant, the transplanted femurs were flushed and the cells analyzed for human engraftment based on HLA-ABC expression, B cell or myeloid phenotype (CD19 and CD33, respectively), and expression of the two reporter genes.
Cas9 and rAAV6-mediated di-genic homologous recombination (HR) in human CD34+ HSPCs.
Top, Representative FACS plots of HSPCs transduced with two rAAV6 donors targeting two genes with two distinct fluorescent reporters (listed in FACS plots in lower panel) show low episomal expression and few dual reporterhigh-expressing HSPCs. Bottom, HSPCs were electroporated with two different gene-specific Cas9 RNPs and then transduced with homologous rAAV6 donors (each gene targeted with a different fluorescent reporter). Representative FACS plots from Day 4 post electroporation show the generation of dual reporterhigh positive HSPCs targeted at both genes.
Measuring translocations after HBB and AAVS1 di-genic targeting.
(a) Schematic showing the HBB gene on chromosome 11 and the AAVS1 locus on chromosome 19. The Cas9 cut sites are shown in red. One of the two possible monocentric translocations is shown. (b) The reference sequence of the HBB-AAVS1 translocation is shown in the top. Below are representative translocation sequences from GFP-BFP- HSPCs sorted seven days after targeting (see Figure 3e, left panel). (c) Representative ddPCR analyses quantifying translocations in NTC (non-template control), mock-electroporated, and GFP-BFP- cells (see Figure 3e, right panel). The reference assay quantifies TERT gene copies used to normalize for DNA input. The translocation assay probe binds 50 bp away from the junction and none of the identified translocations would therefore exclude probe binding.
Analysis of mice transplanted with different sorted populations of cells targeted at the HBB and AAVS1 locus.
The table shows an overview of the 11 NSG mice that were transplanted intrafemorally with either mock-electroporated cells or sorted cells from the four populations displayed in Figure 3f. 12 weeks after transplant, the transplanted femurs were flushed and the cells analyzed for human engraftment based on HLA-ABC expression, B cell or myeloid phenotype (CD19 and CD33, respectively), and expression of the two reporter genes.Since the combination of two sgRNAs has previously been used to create and study oncogenic translocations (Maddalo et al., 2014), and multiplexed TALEN-mediated gene editing in primary human T cells led to translocation frequencies between the two targeted genes of 0.01–1% with monocentric translocations occurring most frequently (Poirot et al., 2015), we assessed if our di-genic targeting scheme would enrich for translocations after purification of dual-reporter positive cells. Therefore, we analyzed one of the monocentric translocations between HBB and AAVS1 (Figure 3—figure supplement 2a). We targeted HBB and AAVS1 with a GFP and BFP reporter, respectively, and sorted the four different populations (double negative, single positives (each gene), and double positive) seven days after targeting (Figure 3e, left panel). INDEL rates at HBB and AAVS1 were comparable among all four sorted populations, with a small enrichment of INDELs in the three populations positive for the reporter (Figure 3e, middle panel). Droplet digital PCR (ddPCR) quantification of the translocation showed frequencies ranging from 0.14–0.28%, and importantly, no evidence of enrichment of the translocation was observed in the population sorted for di-genic targeting (Figure 3e, right panel and Figure 3—figure supplement 2c). Cloning and sequencing of PCR products spanning the translocation showed a wide variety of translocation junctions derived from different DNA end-processing products (Figure 3—figure supplement 2b).To confirm that HSPCs with long-term and multi-lineage engraftment potential were targeted, we again targeted HBB and AAVS1 with a GFP and BFP reporter, respectively, and transplanted the four different sorted populations into immune-compromised NSG mice (Figure 3f). 12 weeks after transplantation, human multi-lineage engraftment was evident in the bone marrow of the transplanted mice of all four groups (Figure 3g and Figure 3—figure supplement 3).Collectively, these data show that human HSPCs that have undergone di-genic HR are not enriched for translocations, and maintain their multi-lineage colony forming capacity and long-term engraftment potential.
Multiplexed homologous recombination in HSPCs
We next tested if we could combine the di-genic and biallelic targeting approach to simultaneously target both alleles of ASXL1 (GFP and mCherry) as well as both alleles of RUNX1c (BFP and E2-Crimson) (tetra-allelic) (for schematic see Figure 4—figure supplement 1a). Delivery of Cas9 RNPs targeting both genes followed by transduction of four rAAV6 donors gave rise to 1.1% GFPhigh/mCherryhigh/BFPhigh/E2Crimsonhigh quadruple-positive cells (Figure 4a and Figure 4—figure supplement 1b–c). A similar quadruple-positive population was evident when targeting all four combined alleles of HBB and RUNX1c (Figure 4—figure supplement 1e–h and Supplementary file 1e). Mixed, myeloid, and erythroid colonies were formed at frequency and ratio comparable to AAV only controls (Figure 4b). Genotyping of colonies revealed on-target integration at both alleles at both loci in 78% of clones (73 clones analyzed) (Figure 4c). Flow-cytometric analysis of individual colonies confirmed expression of all four reporters (BFP/GFP/mCherry/E2Crimson) at high levels (Figure 4—figure supplement 1d). The total number of genetic changes in this enriched population, which could be used for synthetic biology purposes is six: two endogenous genes inactivated (both alleles of each gene) plus the addition of four different transgenes (represented in our experiment by four genes encoding different fluorescent proteins). Thus, this methodology could be used for studying interaction of genes that need both copies disrupted to lose function, such as tumor suppressor genes.
Figure 4—figure supplement 1.
Targeting two genes for biallelic homologous recombination (HR) in primary CD34+ HSPCs.
(a) Schematic showing experimental strategy for Figure 4a for targeting both alleles of RUNX1 and ASXL1. (b) FACS plots, gating scheme, and frequencies of HR at each allele for the experiment shown in Figure 4a. (c) FACS plot showing very low frequency of tetra-reporterhigh cells without Cas9. (d) FACS plots of cells from single methylcellulose colonies derived from tetra-reporterhigh cells from Figure 4a. (e) Schematic showing targeting both alleles of RUNX1 and HBB for HR with four distinct reporters. (f) Top, FACS plots of HSPCs transduced with four rAAV6s (no Cas9 RNPs) showing the gating scheme and low episomal reporter expression without a nuclease. Bottom, HSPCs were electroporated with RNPs targeting HBB and RUNX1 and then transduced with four rAAV6s. FACS plots from day four post electroporation show MFI shift for each reporter alone. HBB-tNGFR rAAV6 has reproducibly shown lower episomal expression than all other rAAV6 we have used. (g) Images from fluorescence microscopy showing an mCherry/BFP/GFP positive CFU-GM clone that has undergone tetra-allelic HR. The colony was not stained for HBB-tNGFR. (h) Left, FACS plots show very low frequency of tetra-reporterhigh cells without Cas9. Right, Nuclease addition increases the frequency of bi and tetra-reporterhigh HSPCs.
Figure 4.
Multiplexing homologous recombination in CD34+ human hematopoietic stem and progenitor cells (HSPCs).
(a) HSPCs were electroporated with Cas9 RNP targeting ASXL1 and RUNX1 followed by rAAV6 transduction with two donors for ASXL1 (mCherry and GFP) and two donors for RUNX1 (E2Crimson and BFP). Tetra-allelically targeted HSPCs were identified as mCherryhigh/GFPhigh/BFPhigh/E2Crimsonhigh (N = 3 see Supplementary file 1e) (b) Cells modified at both alleles for RUNX1 and ASXL1 (as in (a)) were subjected to a methylcellulose assay (triplicates) and scored as BFU-E, CFU-M, CFU-GM or CFU-GEMM based on morphology 14 days after sorting. (c) PCR was performed on colony-derived gDNA to detect targeted integrations at both genes. 73 individual colonies were analyzed. Color coding for colonies with triple-allelic integration are as follows: grey: RUNX1 biallelic/ASXL monoallelic; white: RUNX1 monoallelic/ASXL1 biallelic. (d) For tri-genic targeting of HSPCs, cells were electroporated with Cas9 RNP targeting IL2RG, HBB, and CCR5 followed by transduction of three rAAV6 donors homologous to each of the three genes (IL2RG-GFP, HBB-tdTomato, and CCR5-tNGFR). Tri-genic-targeted cells were identified as reporterhigh for all three reporters (N = 5 see Supplementary file 1e). (e) Methylcellulose clones from the triple-positive cells in (d) were subjected to genotyping PCR and gel images show colonies with targeted integration at all three genes in 9/11 colonies (note that GFP shows a faint band in colony 6). (f) Left, Schematic showing strategy for targeting four different genes (HBB, RUNX1, ASXL1, and CCR5) simultaneously (tetra-genic). Four different genes are targeted by electroporation of four different Cas9 RNPs followed by transduction with four different rAAV6 donors that each targets a gene with a different reporter. Right, Tetra-genic targeting at the above-mentioned four genes was identified as reporterhigh for all four reporters (N = 3 see Supplementary file 1e).
(a) Schematic showing experimental strategy for Figure 4a for targeting both alleles of RUNX1 and ASXL1. (b) FACS plots, gating scheme, and frequencies of HR at each allele for the experiment shown in Figure 4a. (c) FACS plot showing very low frequency of tetra-reporterhigh cells without Cas9. (d) FACS plots of cells from single methylcellulose colonies derived from tetra-reporterhigh cells from Figure 4a. (e) Schematic showing targeting both alleles of RUNX1 and HBB for HR with four distinct reporters. (f) Top, FACS plots of HSPCs transduced with four rAAV6s (no Cas9 RNPs) showing the gating scheme and low episomal reporter expression without a nuclease. Bottom, HSPCs were electroporated with RNPs targeting HBB and RUNX1 and then transduced with four rAAV6s. FACS plots from day four post electroporation show MFI shift for each reporter alone. HBB-tNGFR rAAV6 has reproducibly shown lower episomal expression than all other rAAV6 we have used. (g) Images from fluorescence microscopy showing an mCherry/BFP/GFP positive CFU-GM clone that has undergone tetra-allelic HR. The colony was not stained for HBB-tNGFR. (h) Left, FACS plots show very low frequency of tetra-reporterhigh cells without Cas9. Right, Nuclease addition increases the frequency of bi and tetra-reporterhigh HSPCs.
(a) Schematic showing experimental strategy for Figure 4d targeting three genes, IL2RG, CCR5, and HBB. (b) FACS plots show gating scheme and HR frequencies at each locus for the experiment shown in Figure 4d. (c) Schematic outlining another tri-genic targeting experiment for RUNX1, ASXL1, and HBB. (d) Top, FACS plots of HSPCs transduced with three rAAV6 donors (no RNPs). Bottom, HSPCs were electroporated with gene-specific RNPs and then transduced with three rAAV6 donors. FACS plots at Day 4-post electroporation show MFI shift for each reporter alone. (e) FACS plots from same sample as in (d), but showing different combinations of di-genic reporterhigh populations that contain the same frequency of tri-genic reporterhigh cells.
CD34+ cells from mobilized peripheral blood were targeted at one, two, or three genes with Cas9 RNP and rAAV6 donors. Viabilities were measured by flow cytometry 72 hr post-electroporation using Live/Dead and Annexin V stains. Viable cells are defined as live, non-apoptotic (Annexin V−) and plotted as percentage of a single AAV6 donor alone. Error bars represent SD, ns = not statistically significant, Mann-Whitney test, N = 2 different HSPC donors.
Since capture of rAAV6 donors at the site of a DSB via NHEJ has been reported, we measured the false-positive rate of multiplexing HR via flow cytometry. (a) False-positive frequencies of di-genic targeting in HSPCs was determined by electroporating cells with an HBB-targeting Cas9 RNP followed by transduction with HBB-GFP (homologous) and CCR5-mCherry (non-homologous) rAAV6 donors. FACS plots show a false-positive rate of 0.24% dual reporterhigh cells. Note that 4% dual reporterhigh cells was reported in Figure 3—figure supplement 1 when performing di-genic targeting at CCR5 and HBB, giving a false positive rate of 6% of targeting. (b) Left, To determine false-positive frequencies of tri-genic targeting in HSPCs, we electroporated IL2RG-RNP and HBB-RNP into HSPCs followed by transduction with the rAAV6 donors IL2RG-GFP (homologous), HBB-tdTomato (homologous), and CCR5-tNGFR (non-homologous). FACS plots show a false-positive frequency of 0.47%. Note that Figure 4d shows a tri-genic targeting frequency of 4.1% (a false-positive rate of 11% of targeting). Right, We employed a similar strategy to determine false-positive frequencies of tri-genic targeting, but this time used different combinations of on-target nucleases. The false-positive rate detected here was 0.1% (2.4% of targeting). (c) To determine tetra-genic false-positive frequencies, we electroporated HSPCs with three on-target nucleases (IL2RG, HBB, and CCR5) and then transduced with three homologous rAAV6 donors (IL2RG, HBB, and CCR5) and one non-homologous donor (CXCL12). FACS plots show a frequency of 0.09% that are reporterhigh for all four reporters with Figure 4f showing a tetra-genic targeting frequency of 1.0% (a false-positive rate of 9% of targeting).
(a) A heterozygous knockout population can be generated with two HR donors. The first donor is designed to knock-in a wild-type (WT) cDNA cassette into the start codon (ATG) of the gene of interest followed by a cassette encoding a reporter gene (here GFP). WT cDNA is expressed from the endogenous promoter as reported by Voit et al. (2014), Hubbard et al. (2016), and Dever et al. (2016), which maintains endogenous regulatory control over gene expression. The other donor encodes another reporter (here BFP), which disrupts the targeted gene. Double positive cells (GFP+/BFP+) are heterozygous for the knockout allele. (b) A population heterozygous for a particular SNP can be generated using two donors that knock in cDNA expression cassettes followed by different reporter genes. One cDNA is WT while the other carries the SNP of interest. Double positive cells (GFP+/BFP+) are heterozygous for the SNP allele. Endogenous 3’ UTRs may be incorporated to preserve posttranscriptional regulation. Heterozygous SNP cDNA knock-in may be expanded to two or more genes, which may be of particular interest in studies of leukemia-mutated genes such as DNMT3A, IDH1/2, JAK2, and KRAS, which often occur in various combinations as heterozygous gain-of-function or dominant negative mutations. In addition, reporter knock-in combined with WT cDNA knock-in (as depicted in a) as well as SNP cDNA knock-in (SNP that disrupts gene or gene function) combined with WT cDNA knock-in (as depicted in b) could be used to study haploinsufficiencies. Though not depicted, all genes in the schematic are followed by polyadenylation signals.
Multiplexing homologous recombination in CD34+ human hematopoietic stem and progenitor cells (HSPCs).
(a) HSPCs were electroporated with Cas9 RNP targeting ASXL1 and RUNX1 followed by rAAV6 transduction with two donors for ASXL1 (mCherry and GFP) and two donors for RUNX1 (E2Crimson and BFP). Tetra-allelically targeted HSPCs were identified as mCherryhigh/GFPhigh/BFPhigh/E2Crimsonhigh (N = 3 see Supplementary file 1e) (b) Cells modified at both alleles for RUNX1 and ASXL1 (as in (a)) were subjected to a methylcellulose assay (triplicates) and scored as BFU-E, CFU-M, CFU-GM or CFU-GEMM based on morphology 14 days after sorting. (c) PCR was performed on colony-derived gDNA to detect targeted integrations at both genes. 73 individual colonies were analyzed. Color coding for colonies with triple-allelic integration are as follows: grey: RUNX1 biallelic/ASXL monoallelic; white: RUNX1 monoallelic/ASXL1 biallelic. (d) For tri-genic targeting of HSPCs, cells were electroporated with Cas9 RNP targeting IL2RG, HBB, and CCR5 followed by transduction of three rAAV6 donors homologous to each of the three genes (IL2RG-GFP, HBB-tdTomato, and CCR5-tNGFR). Tri-genic-targeted cells were identified as reporterhigh for all three reporters (N = 5 see Supplementary file 1e). (e) Methylcellulose clones from the triple-positive cells in (d) were subjected to genotyping PCR and gel images show colonies with targeted integration at all three genes in 9/11 colonies (note that GFP shows a faint band in colony 6). (f) Left, Schematic showing strategy for targeting four different genes (HBB, RUNX1, ASXL1, and CCR5) simultaneously (tetra-genic). Four different genes are targeted by electroporation of four different Cas9 RNPs followed by transduction with four different rAAV6 donors that each targets a gene with a different reporter. Right, Tetra-genic targeting at the above-mentioned four genes was identified as reporterhigh for all four reporters (N = 3 see Supplementary file 1e).
Targeting two genes for biallelic homologous recombination (HR) in primary CD34+ HSPCs.
(a) Schematic showing experimental strategy for Figure 4a for targeting both alleles of RUNX1 and ASXL1. (b) FACS plots, gating scheme, and frequencies of HR at each allele for the experiment shown in Figure 4a. (c) FACS plot showing very low frequency of tetra-reporterhigh cells without Cas9. (d) FACS plots of cells from single methylcellulose colonies derived from tetra-reporterhigh cells from Figure 4a. (e) Schematic showing targeting both alleles of RUNX1 and HBB for HR with four distinct reporters. (f) Top, FACS plots of HSPCs transduced with four rAAV6s (no Cas9 RNPs) showing the gating scheme and low episomal reporter expression without a nuclease. Bottom, HSPCs were electroporated with RNPs targeting HBB and RUNX1 and then transduced with four rAAV6s. FACS plots from day four post electroporation show MFI shift for each reporter alone. HBB-tNGFR rAAV6 has reproducibly shown lower episomal expression than all other rAAV6 we have used. (g) Images from fluorescence microscopy showing an mCherry/BFP/GFP positive CFU-GM clone that has undergone tetra-allelic HR. The colony was not stained for HBB-tNGFR. (h) Left, FACS plots show very low frequency of tetra-reporterhigh cells without Cas9. Right, Nuclease addition increases the frequency of bi and tetra-reporterhigh HSPCs.
Multiplexing homologous recombination at three genes simultaneously in HSPCs.
(a) Schematic showing experimental strategy for Figure 4d targeting three genes, IL2RG, CCR5, and HBB. (b) FACS plots show gating scheme and HR frequencies at each locus for the experiment shown in Figure 4d. (c) Schematic outlining another tri-genic targeting experiment for RUNX1, ASXL1, and HBB. (d) Top, FACS plots of HSPCs transduced with three rAAV6 donors (no RNPs). Bottom, HSPCs were electroporated with gene-specific RNPs and then transduced with three rAAV6 donors. FACS plots at Day 4-post electroporation show MFI shift for each reporter alone. (e) FACS plots from same sample as in (d), but showing different combinations of di-genic reporterhigh populations that contain the same frequency of tri-genic reporterhigh cells.
Toxicity assessment of multiplexed HR.
CD34+ cells from mobilized peripheral blood were targeted at one, two, or three genes with Cas9 RNP and rAAV6 donors. Viabilities were measured by flow cytometry 72 hr post-electroporation using Live/Dead and Annexin V stains. Viable cells are defined as live, non-apoptotic (Annexin V−) and plotted as percentage of a single AAV6 donor alone. Error bars represent SD, ns = not statistically significant, Mann-Whitney test, N = 2 different HSPC donors.
Assessment of false-positive frequencies of FACS-based identification of multiplexed HR in HSPCs.
Since capture of rAAV6 donors at the site of a DSB via NHEJ has been reported, we measured the false-positive rate of multiplexing HR via flow cytometry. (a) False-positive frequencies of di-genic targeting in HSPCs was determined by electroporating cells with an HBB-targeting Cas9 RNP followed by transduction with HBB-GFP (homologous) and CCR5-mCherry (non-homologous) rAAV6 donors. FACS plots show a false-positive rate of 0.24% dual reporterhigh cells. Note that 4% dual reporterhigh cells was reported in Figure 3—figure supplement 1 when performing di-genic targeting at CCR5 and HBB, giving a false positive rate of 6% of targeting. (b) Left, To determine false-positive frequencies of tri-genic targeting in HSPCs, we electroporated IL2RG-RNP and HBB-RNP into HSPCs followed by transduction with the rAAV6 donors IL2RG-GFP (homologous), HBB-tdTomato (homologous), and CCR5-tNGFR (non-homologous). FACS plots show a false-positive frequency of 0.47%. Note that Figure 4d shows a tri-genic targeting frequency of 4.1% (a false-positive rate of 11% of targeting). Right, We employed a similar strategy to determine false-positive frequencies of tri-genic targeting, but this time used different combinations of on-target nucleases. The false-positive rate detected here was 0.1% (2.4% of targeting). (c) To determine tetra-genic false-positive frequencies, we electroporated HSPCs with three on-target nucleases (IL2RG, HBB, and CCR5) and then transduced with three homologous rAAV6 donors (IL2RG, HBB, and CCR5) and one non-homologous donor (CXCL12). FACS plots show a frequency of 0.09% that are reporterhigh for all four reporters with Figure 4f showing a tetra-genic targeting frequency of 1.0% (a false-positive rate of 9% of targeting).
Controlling genotype with cDNA knock-in.
(a) A heterozygous knockout population can be generated with two HR donors. The first donor is designed to knock-in a wild-type (WT) cDNA cassette into the start codon (ATG) of the gene of interest followed by a cassette encoding a reporter gene (here GFP). WT cDNA is expressed from the endogenous promoter as reported by Voit et al. (2014), Hubbard et al. (2016), and Dever et al. (2016), which maintains endogenous regulatory control over gene expression. The other donor encodes another reporter (here BFP), which disrupts the targeted gene. Double positive cells (GFP+/BFP+) are heterozygous for the knockout allele. (b) A population heterozygous for a particular SNP can be generated using two donors that knock in cDNA expression cassettes followed by different reporter genes. One cDNA is WT while the other carries the SNP of interest. Double positive cells (GFP+/BFP+) are heterozygous for the SNP allele. Endogenous 3’ UTRs may be incorporated to preserve posttranscriptional regulation. Heterozygous SNP cDNA knock-in may be expanded to two or more genes, which may be of particular interest in studies of leukemia-mutated genes such as DNMT3A, IDH1/2, JAK2, and KRAS, which often occur in various combinations as heterozygous gain-of-function or dominant negative mutations. In addition, reporter knock-in combined with WT cDNA knock-in (as depicted in a) as well as SNP cDNA knock-in (SNP that disrupts gene or gene function) combined with WT cDNA knock-in (as depicted in b) could be used to study haploinsufficiencies. Though not depicted, all genes in the schematic are followed by polyadenylation signals.Multi-genic HR in HSPCs would allow for the characterization of functional gene networks during humanhematopoiesis (Bystrykh et al., 2005). To validate that our methodology could multiplex HR in HSPCs in more than two genes simultaneously, we electroporated HSPCs with RNPs targeting HBB, CCR5, and IL2RG, and then transduced them with gene-specific rAAV6 donors (HBB-tdTomato, CCR5-tNGFR, IL2RG-GFP) (for schematic see Figure 4—figure supplement 2a). At day four post-electroporation, 4.1% of HSPCs were triple-positive (Figure 4d and Figure 4—figure supplement 2b). ‘In-Out PCR’ on gDNA from myeloid and erythroid colonies derived from this population showed that 78% (27 clones analyzed) had an integration event at all 3 loci, indicating at least mono-allelic integrations at each targeted locus (Figure 4e). Further analyses showed that 85% of these clones with tri-genic integrations were modified on all alleles either by biallelic integration or INDELs on the non-integrated allele that were mostly disruptive (Supplementary file 1d). These data confirm that the methodology can efficiently enrich for HSPCs with multiplexed HR. Targeting at another combination of three genes (RUNX1/HBB/ASXL1) showed 2.9% triple-positive cells (Figure 4—figure supplement 2c–e), and collectively, tri-genic targeting experiments yielded an average of 4.5% triple-positive cells, with the highest frequency of 14% (N = 5) (Supplementary file 1e). To test if multiplexing HR caused cellular senescence or more cell death than mono or di-genic targeting in HSPCs, we evaluated cell death and apoptosis rates at day three post-targeting and proliferation for up to 10 days post-targeting (corresponding to 7 days post-sorting). We observed similar proliferation rates comparing modified and unmodified cells (data not shown) and only a minor, non-statistically significant decrease in cell viability (p=0.333) when targeting three genes compared to one (Figure 4—figure supplement 3). Finally, we targeted HSPCs for tetra-genic HR (HBB, CCR5, ASXL1, RUNX1) and found after four days in culture that 1% of cells were reporterhigh positive for all four reporters (Figure 4f). Targeting the same four genes with other combinations of reporter genes gave 0.41% and 0.78% tetra-genic targeting frequencies in the total cell population (Supplementary file 1e). Strikingly, 41–71% of HSPCs with tri-genic HR had undergone tetra-genic HR, suggesting that HR events at different genes may not be independent of each other, in contrast to recent findings for multiplexed NHEJ (Hultquist et al., 2016). Because rAAV vectors can be captured at DSBs via NHEJ (Miller et al., 2004), we performed experiments that aimed to detect the frequency of capture events by including a non-homologous rAAV donor in targeting experiments. We found that 89–98% of reporterhigh cells were derived from on-target homologous recombination, confirming a relatively low rate of AAV capture (Figure 4—figure supplement 4).
Figure 4—figure supplement 2.
Multiplexing homologous recombination at three genes simultaneously in HSPCs.
(a) Schematic showing experimental strategy for Figure 4d targeting three genes, IL2RG, CCR5, and HBB. (b) FACS plots show gating scheme and HR frequencies at each locus for the experiment shown in Figure 4d. (c) Schematic outlining another tri-genic targeting experiment for RUNX1, ASXL1, and HBB. (d) Top, FACS plots of HSPCs transduced with three rAAV6 donors (no RNPs). Bottom, HSPCs were electroporated with gene-specific RNPs and then transduced with three rAAV6 donors. FACS plots at Day 4-post electroporation show MFI shift for each reporter alone. (e) FACS plots from same sample as in (d), but showing different combinations of di-genic reporterhigh populations that contain the same frequency of tri-genic reporterhigh cells.
Figure 4—figure supplement 3.
Toxicity assessment of multiplexed HR.
CD34+ cells from mobilized peripheral blood were targeted at one, two, or three genes with Cas9 RNP and rAAV6 donors. Viabilities were measured by flow cytometry 72 hr post-electroporation using Live/Dead and Annexin V stains. Viable cells are defined as live, non-apoptotic (Annexin V−) and plotted as percentage of a single AAV6 donor alone. Error bars represent SD, ns = not statistically significant, Mann-Whitney test, N = 2 different HSPC donors.
Figure 4—figure supplement 4.
Assessment of false-positive frequencies of FACS-based identification of multiplexed HR in HSPCs.
Since capture of rAAV6 donors at the site of a DSB via NHEJ has been reported, we measured the false-positive rate of multiplexing HR via flow cytometry. (a) False-positive frequencies of di-genic targeting in HSPCs was determined by electroporating cells with an HBB-targeting Cas9 RNP followed by transduction with HBB-GFP (homologous) and CCR5-mCherry (non-homologous) rAAV6 donors. FACS plots show a false-positive rate of 0.24% dual reporterhigh cells. Note that 4% dual reporterhigh cells was reported in Figure 3—figure supplement 1 when performing di-genic targeting at CCR5 and HBB, giving a false positive rate of 6% of targeting. (b) Left, To determine false-positive frequencies of tri-genic targeting in HSPCs, we electroporated IL2RG-RNP and HBB-RNP into HSPCs followed by transduction with the rAAV6 donors IL2RG-GFP (homologous), HBB-tdTomato (homologous), and CCR5-tNGFR (non-homologous). FACS plots show a false-positive frequency of 0.47%. Note that Figure 4d shows a tri-genic targeting frequency of 4.1% (a false-positive rate of 11% of targeting). Right, We employed a similar strategy to determine false-positive frequencies of tri-genic targeting, but this time used different combinations of on-target nucleases. The false-positive rate detected here was 0.1% (2.4% of targeting). (c) To determine tetra-genic false-positive frequencies, we electroporated HSPCs with three on-target nucleases (IL2RG, HBB, and CCR5) and then transduced with three homologous rAAV6 donors (IL2RG, HBB, and CCR5) and one non-homologous donor (CXCL12). FACS plots show a frequency of 0.09% that are reporterhigh for all four reporters with Figure 4f showing a tetra-genic targeting frequency of 1.0% (a false-positive rate of 9% of targeting).
Discussion
Table 1 summarizes the HR multiplex experiments (seven total genes targeted) and shows that by using Cas9 RNP, rAAV6, and flow cytometry-based sorting, we can reproducibly generate HSPC populations that have undergone HR events at multiple loci. For synthetic biology purposes, the tetra-genic targeting method, for example, can generate an enriched population of cells with eight genetic modifications: the knockout of at least a single allele of four different genes while introducing four different transgenes (in this proof-of-concept we used three fluorescent protein reporter genes and one biologically inert cell surface marker (tNGFR) that has been previously used in human clinical trials to track genetically modified hematopoietic stem cells over the course of decades). Our approach to studying gene function in human HSPCs has several advantages over lentiviral-based approaches because it enables: (1) multigenic targeted integration (at least four genes), (2) enrichment of highly pure edited populations, (3) the ability to trace cells with a specific genotype, (4) enrichment of a population with biallelic targeting of at least two genes, and (5) fluorescent protein-based hematopoietic cell lineage tracing. Our methodology has the potential to advance the biological understanding of gene functions in canonical HSC processes, including self-renewal, differentiation, and engraftment, all of which are critical aspects of fundamental stem cell biology and may augment the efficacy of stem cell based therapeutics.
Table 1.
Overview of targeting experiments in hematopoietic stem and progenitor cells (HSPCs).
Overview of all HSPC targeting experiments performed in this study with the number of independent experiments (N) for each experiment type, and the mean targeting efficiency (±SD). See also Supplementary file 1a, c, and e.
Experiment
N
% efficiency ± SD
Monogenic
47
21.7 ± 13.4
Biallelic
16
5.5 ± 4.2
Di-genic
17
8.1 ± 8.1
Tetra-allelic
3
0.9 ± 0.3
Tri-genic
6
4.5 ± 4.8
Tetra-genic
3
0.7 ± 0.3
Overview of targeting experiments in hematopoietic stem and progenitor cells (HSPCs).
Overview of all HSPC targeting experiments performed in this study with the number of independent experiments (N) for each experiment type, and the mean targeting efficiency (±SD). See also Supplementary file 1a, c, and e.By knocking in four different transgenes into four different genes, the method generates four gene disruptions and four gene additions. However, the use of multiple sgRNAs also increases the chances for off-target effects and chromosomal translocations. By looking for monocentric translocations between two genes (HBB and AAVS1), we observed low levels of translocation events similar to previously published studies (Poirot et al., 2015). Such effects are likely sgRNA and target gene-specific and need to be assessed on a case-by-case basis. The observed tetra-genic targeting efficiencies at >0.5% are high enough to be experimentally useful, and though some applications may be restricted by HSPC source and starting cell numbers, our targeting methodology may be combined with recent advances in HSPC expansion protocols (Fares et al., 2014; Cutler et al., 2013; de Lima et al., 2012; Popat et al., 2015) or with transplantation into a humanized bone marrow ossicle xenotransplantation model, which supports higher engraftment levels compared to a standard NSG model (Reinisch et al., 2016). By using reporters as transgenes, one can both enrich and track the modified cells, and by using a transgene cassette in which a potentially biologically active transgene is linked through a 2A peptide or IRES to a reporter gene, one can enrich and track cells that could have up to four different new potentially bioactive genes expressed. Additionally, we and others have recently demonstrated the feasibility of knocking in a cDNA immediately after the start codon of the gene, thereby maintaining endogenous regulatory control over gene expression (Dever et al., 2016; Hubbard et al., 2016; Voit et al., 2014). This provides a genetic engineering toolbox where different types of alleles (WT, knockout, mutant cDNA forms) are fluorescently tagged and can be enriched or tracked in a population with mixed allele combinations. One potential caveat is the requirement for reporter gene expression and the fact that cells must be cultured for 2–3 days until reporter gene expression is detectable and cells can be sorted. Even though we have not detected any obvious negative impact in this or previous studies (Dever et al., 2016; Bak and Porteus, 2017), future studies may further investigate and optimize ex vivo culturing conditions, as well as promoter and reporter choice for minimal impact on biology and repopulation potential of edited HSPCs.Our methodology could be used for the characterization of gene interactions during blood and immune system disease pathogenesis. For example, functional knockouts can be created at one gene (e.g. reporter knock-in into tumor suppressor gene), while introducing disease-causing polymorphisms at another gene (cDNA expression cassette knock-in into proto-oncogene) (see Figure 4—figure supplement 5 for schematic). For example, Zhao et al., showed that the loss of p53 cooperates with the Kras mutation to promote acute myeloid leukemia (AML) in mouse HSPCs using a retroviral methodology (Zhao et al., 2010). Our system could be used to address whether these findings can be translated to human HSPCs by achieving site specific HR that would simultaneously knock out a tumor suppressor (e.g. TP53) and drive mutant KRAS under endogenous regulatory conditions, instead of using strong constitutive exogenous viral promoters with little control over proviral copy number and heterogeneity of transgene expression. However, in cDNA knock-in experiments, proper expression should always be validated since elements in the adjacent reporter expression cassette or the lack of UTRs and introns could influence cDNA expression (Sweeney et al., 2017). We also show biallelic integration in primary human T cells at CCR5, which could be therapeutically applicable for engineering HIV-resistance, where biallelic knockout of CCR5 could be combined with expression of different HIV restriction factors (Voit et al., 2013). Additionally, this approach could be useful to extend recently published studies showing high potency of chimeric antigen receptors (CARs) that were site-specifically integrated into the TRAC gene using CRISPR and AAV6 in primary human T cells (Eyquem et al., 2017). Multiplexed gene editing may be used to knock-in different CARs or co-stimulatory ligands into genes that are desirable to knock-out in CAR T cell therapy. We anticipate in the future that multiplexed HR mediated cell engineering will facilitate even more sophisticated uses of synthetic biology-based stem cell therapeutics than the examples we have given. Our methodology should also be widely applicable to other cell types of the hematopoietic system besides HSPCs and T cells, and even to cells of non-hematopoietic origin.
Figure 4—figure supplement 5.
Controlling genotype with cDNA knock-in.
(a) A heterozygous knockout population can be generated with two HR donors. The first donor is designed to knock-in a wild-type (WT) cDNA cassette into the start codon (ATG) of the gene of interest followed by a cassette encoding a reporter gene (here GFP). WT cDNA is expressed from the endogenous promoter as reported by Voit et al. (2014), Hubbard et al. (2016), and Dever et al. (2016), which maintains endogenous regulatory control over gene expression. The other donor encodes another reporter (here BFP), which disrupts the targeted gene. Double positive cells (GFP+/BFP+) are heterozygous for the knockout allele. (b) A population heterozygous for a particular SNP can be generated using two donors that knock in cDNA expression cassettes followed by different reporter genes. One cDNA is WT while the other carries the SNP of interest. Double positive cells (GFP+/BFP+) are heterozygous for the SNP allele. Endogenous 3’ UTRs may be incorporated to preserve posttranscriptional regulation. Heterozygous SNP cDNA knock-in may be expanded to two or more genes, which may be of particular interest in studies of leukemia-mutated genes such as DNMT3A, IDH1/2, JAK2, and KRAS, which often occur in various combinations as heterozygous gain-of-function or dominant negative mutations. In addition, reporter knock-in combined with WT cDNA knock-in (as depicted in a) as well as SNP cDNA knock-in (SNP that disrupts gene or gene function) combined with WT cDNA knock-in (as depicted in b) could be used to study haploinsufficiencies. Though not depicted, all genes in the schematic are followed by polyadenylation signals.
In conclusion, we anticipate that this method will be applicable to studying humanhematopoiesis and immune system disease pathogenesis through multiplexed, site-specific genome engineering by HR, which has the potential to lead to new discoveries in human hematopoietic stem cell biology.
Materials and methods
AAV vector production
AAV vector plasmids were cloned in the pAAV-MCS plasmid (Agilent Technologies, Santa Clara, CA) containing ITRs from AAV serotype 2 (AAV2). CCR5, IL2RG, HBB, RUNX1, ASXL1, and CXCL12 vectors contained an SFFV promoter, a reporter gene such as tNGFR, MaxGFP (or Citrine), BFP, mCherry, tdTomato or E2Crimson and BGH polyA. MaxGFP and Citrine are referred to as GFP throughout. For translocation and NSG transplantation experiments, a UbC promoter (approx. 1200 bp) was used in the HBBdonor instead of an SFFV promoter. For the T cell experiments, donors carried an EF1α promoter (approx. 1200 bp). The homology arms for IL2RG, ASXL1, and CCR5 were 800 bp, whereas left and right homology arms for HBB were 540 bp and 420 bp, respectively. The homology arms for RUNX1, STAG2, and AAVS1 were 400 bp. CCR5 donors used in T cell experiments expressed Citrine or mCherry from the PGK promoter and contained 400 bp homology arms. rAAV6 vectors were produced as described with a few modifications (Khan et al., 2011). Briefly, 293FT cells (Life Technologies, Carlsbad, CA, USA) were seeded at 13 × 106 cells per dish in ten 15 cm dishes one day before transfection. Each 15 cm dish was transfected using standard PEI transfection with 6 μg ITR-containing plasmid and 22 μg pDGM6 (gift from David Russell, University of Washington, Seattle, WA, USA), which contains the AAV6 cap genes, AAV2 rep genes, and adenovirus five helper genes. Cells were incubated for 72 hr until rAAV6 was harvested from cells by three freeze-thaw cycles followed by a 45 min incubation with TurboNuclease (Abnova, Heidelberg, Germany) or Benzonase (Thermo Fisher) at 250 U/mL. AAV vectors were purified on an iodixanol density gradient by ultracentrifugation at 48,000 rpm for 2.25 hr at 18°C. AAV vectors were extracted at the 58–40% iodixanol interface and dialyzed three times in PBS with 5% sorbitol in the last dialysis using a 10K MWCO Slide-A-Lyzer G2 Dialysis Cassette (Thermo Fisher Scientific, Santa Clara, CA, USA). Vectors were added pluronic acid to a final concentration of 0.001%, aliquoted, and then stored at −80°C until further use. rAAV6 vectors were titered using quantitative PCR to measure number of vector genomes as described before (Aurnhammer et al., 2012).
CD34+ hematopoietic stem and progenitor cells
Frozen CD34+ HSPCs derived from mobilized peripheral blood or cord blood were purchased from AllCells (Alameda, CA, USA) and thawed according to manufacturer’s instructions. Fresh CD34+ HSPCs from cord blood were acquired from donors under informed consent via the Binns Program for Cord Blood Research at Stanford University and used without freezing. Fresh CD34+ HSPCs from bone marrow were obtained from Stanford BMT Cell-Therapy Facility after informed consent. CD34+ cells were isolated using a humanCD34 MicroBead Kit (Miltenyi Biotec, San Diego, CA, USA). Generally, CB-derived HSPCs perform better in HR experiments. CD34+ HSPCs were cultured in stem cell retention media consisting of StemSpan SFEM II (Stemcell Technologies, Vancouver, Canada) supplemented with SCF (100 ng/ml), TPO (100 ng/ml), Flt3-Ligand (100 ng/ml), IL-6 (100 ng/ml), UM171 (Stemcell Technologies) (35 nM) and StemRegenin1 (0.75 mM). Mycoplasma contamination testing was not performed. Cells were cultured at 37°C, 5% CO2, and 5% O2.
T cell isolation and culturing
Primary human CD3+ T cells were isolated from buffy coats obtained from the Stanford School of Medicine Blood Center using a human T Cell Isolation Kit (Miltenyi) according to manufacturer’s instructions. Cells were cultured in X-VIVO 15 (Lonza, Walkersville, MD, USA) containing 5% human serum (Sigma-Aldrich, St. Louis, MO, USA), 100 IU/ml humanrIL-2 (Peprotech, Rocky Hill, NJ, USA) and 10 ng/ml humanrIL-7 (BD Biosciences, San Jose, CA, USA). T cells were activated directly after isolation with immobilized anti-CD3 antibody (clone: OKT3, Tonbo Biosciences, San Diego, CA, USA) and soluble anti-CD28 antibody (clone: CD28.2, Tonbo Biosciences) for 72 hr. Mycoplasma contamination testing was not performed. T cells were cultured at 37°C, 5% CO2, and ambient oxygen levels.
Electroporation and transduction of cells
All synthetic sgRNAs were purchased from TriLink BioTechnologies (San Diego, CA, USA). sgRNAs were chemically modified with three terminal nucleotides at both the 5′ and 3′ ends containing 2′ O-Methyl 3′ phosphorothioate and HPLC-purified. The genomic sgRNA target sequences with PAM in bold) were: HBB: 5’-CTTGCCCCACAGGGCAGTAACGG-3’, CCR5: 5’-GCAGCATAGTGAGCCCAGAAGGG-3’, IL2RG: 5’-TGGTAATGATGGCTTCAACATGG-3’, RUNX1c: 5’-TACCCACAGTGCTTCATGAGAGG-3’ ASXL1: 5’-ACAGATTCTGCAGGTCATAGAGG-3’, STAG2: 5’-AGTCCCACATGCTATCCACAAGG-3’, AAVS1: 5’-GGGGCCACTAGGGACAGGATTGG-3’. Cas9 protein was purchased from Life Technologies and Integrated DNA Technologies. Cas9 RNP was made by incubating protein with sgRNA at a molar ratio of 1:2.5 at 25°C for 10 min immediately prior to electroporation into CD34+ HSPCs or T cells. CD34+ HSPCs were electroporated 1–2 days after thawing or isolation. T cells were electroporated three days following activation. Both CD34+ HSPCs and T cells were electroporated using the Lonza Nucleofector 2b (program U-014) or 4D (program EO-100) (we have not detected any device-specific differences in electroporation efficiencies) and the Human T Cell Nucleofection Kit (VPA-1002, Lonza) with the following conditions: 5 × 106 cells/ml, 150–300 µg/ml Cas9 protein complexed with sgRNA at 1:2.5 molar ratio. Following electroporation, cells were incubated for 15 min at 37°C after which they were added rAAV6donor vectors (generally at an MOI (vector genomes/cell) of 50,000–100,000 for each gene). A mock-electroporated control was included in most experiments where cells were handled the same and was electroporated in the same electroporation buffer, but without Cas9 RNP. For experiments targeting multiple loci, electroporation volume and cell numbers were kept the same as stated above, and 150–300 µg/ml Cas9 RNP and MOIs of 50,000–100,000 were used for each targeted locus, but with no more than a total of 60 ug Cas9 per electroporation and 200,000 vector genomes/cell. All AAV vectors were added simultaneously and directly to the cell culture after which the cells were transferred to the incubator without further manipulation. AAV volume was kept less than 20% of the total culturing volume and medium was either supplemented or replaced with fresh medium after overnight culture.
Measuring multiplexed targeted integration of fluorescent and tNGFR donors
Reporterhigh expression was measured by flow cytometric analyses after 3–4 days post-electroporation and transduction using gates for multiplexed targeted integration set so that ‘AAV only’ samples (no nuclease) were less than 1% since previous data (not presented) have shown that after ~14 days in culture the frequency of reporter+ cells (from persistent episomal expression, random integration, and/or non-nuclease mediated HR) is generally less than 1%. The truncated NGFR receptor (tNGFR) where the cytoplasmic intracellular signaling domain is removed and is signaling incompetent, solely served the purpose of a reporter for targeted CD34+ HSPCs in indicated experiments (Bonini et al., 2003). Targeted integration of a tNGFR expression cassette was measured by flow cytometry of cells stained with APC-conjugated anti-humanCD271 (NGFR) antibody (clone: ME20.4, BioLegend, San Diego, CA). For enriching of reporterhigh populations, cells were sorted on a FACS Aria II SORP using DAPI, PI (both Thermo Fisher, 1 µg/ml) or LIVE/DEAD Fixable Cell Stain Kit (Life Technologies) to discriminate live and dead cells according to manufacturer's instructions.
Scoring, FACS-analysis, and genotyping of methylcellulose colonies
Single reporterhigh cells were either single-cell sorted into 96-well plates (Corning) pre-filled with 100 µl of methylcellulose and water in the outer wells or plated at 500 cells per 6 cm dish with methylcellulose (Methocult, StemCell Technologies). After 14 days, colonies were counted and scored as BFU-E, CFU-M, CFU-GM and CFU-GEMM according to the manual for ‘Human Colony-forming Unit (CFU) Assays Using MethoCult’ from StemCell Technologies and prior expertise (Majeti et al., 2007). For DNA extraction from 96-well plates, PBS was added to wells with colonies, and the contents were mixed and transferred to a U-bottomed 96-well plate. From 6 cm dishes, colonies were picked and transferred to PBS. Cells were pelleted by centrifugation at 300xg for 5 min followed by a wash with PBS. Finally, cells were resuspended in 25 µl QuickExtract DNA Extraction Solution (Epicentre, Madison, WI, USA) and transferred to PCR plates, which were incubated at 65°C for 10 min followed by 100°C for 2 min. For CCR5, a 3-primer PCR was set up with a forward primer binding in the left homology arm, a forward primer binding in the insert, and a reverse primer binding in CCR5 outside the right homology arm CCR5_inside_LHA: 5’-GCACAGGGTGGAACAAGATGG-3’, CCR5_insert: 5’-AAGGGGGAGGATTGGGAAGAC-3’, CCR5_outside_RHA: 5’-TCAAGAATCAGCAATTCTCTGAGGC-3’. For all other genes, gene-specific integration was detected by ‘In-Out’ PCR using a primer that binds outside the homology arm (HA) and a primer specific for the transgene cassette (insert). HBB_outside_LHA: GAAGATATGCTTAGAACCGAGG, HBB_insert: ACCGCAGATATCCTGTTTGG IL2RG_insert: 5’-GTACCAGCACGCCTTCAAGACC-3’, IL2RG_outside_RHA: 5’-CAGATATCCAGAGCCTAGCCTCATC-3’, RUNX1_outside_RHA: 5’- GAAGGGCATTGCTCAGAAAA-3’, RUNX1_insert: 5’- AAGGGGGAGGATTGGGAAGAC-3’, ASXL1_outside_RHA: 5’- AAGGGGGAGGATTGGGAAGAC-3’, ASXL1_insert: 5’- CCTCCCAAGCTGGAACTACA-3’. For detecting IL2RG non-integrated (non_int) alleles the following primers were used: IL2RG_non_int_fw: 5’-TCACACAGCACATATTTGCCACACCCTCTG-3′, IL2RG_non_int_rv: 5′-TGCCCACATGATTGTAATGGCCAGTGG-3’. For detecting dual integration of GFP and tdTomato into two HBB alleles, a primer in HBB outside the right homology arm was used together with either a GFP or tdTomato-specific primer: HBB_outside_RHA: 5’-GATCCTGAGACTTCCACACTGATGC-3’, GFP: 5’-GTACCAGCACGCCTTCAAGACC-3’, tdTomato: 5’-CGGCATGGACGAGCTGTACAAG-3’. Clones with di-genic GFP (HBB)/mCherry (CCR5) and tri-genic GFP (IL2RG)/tdTomato (HBB)/tNGFR (CCR5) integrations were screened for integrations using the same primers as above. All integrated PCR bands were subjected to Sanger sequencing to confirm perfect HR at the intended locus. For flow-cytometric analysis of colonies generated from cells with quadruple-allelic HR, individual colonies were picked and directly resuspended in FACS buffer containing LIVE/DEAD staining solution (LIVE/DEAD Fixable Near-IR Dead Cell Stain, Thermo). After 30 min incubation (4°C, dark) cells were washed in FACS buffer and subjected to analysis. Dead cells were excluded from analysis based on APC-Cy7 positivity.
Transplantation of CD34+ HSPCs into NSG mice
6 to 8 week-old NOD scid gamma (NSG) mice were used (Jackson laboratory, Bar Harbor, ME USA). The experimental protocol was approved by Stanford University’s Administrative Panel on Lab Animal Care (IACUC 25065). Four days after electroporation/transduction, different populations of live (DAPI-negative) targeted cells were sorted. Mock-treated cells were also sorted to control for the effect of the sorting procedure. Directly after sorting, cells were transplanted into one femur of sub-lethally irradiated mice (200 rad, 24 hr before transplant). Mice were randomly assigned to each experimental group and analyzed in a blinded fashion.
Assessment of human engraftment
12 weeks after transplantation, mice were sacrificed, mouse bone marrow (BM) was harvested from the transplanted femur by flushing. Non-specific antibody binding was blocked (10% vol/vol, TruStain FcX, BioLegend) and cells were stained (30 min, 4°C, dark) with monoclonal anti-human HLA-ABC APC-Cy7 (W6/32, BioLegend), anti-mouseCD45.1 PE-Cy7 (A20, eBioScience, San Diego, CA, USA), CD19APC (HIB19, BD511 Biosciences), CD33 PE (WM53, BD Biosciences), and anti-mouse mTer119 PE-Cy5 (TER-119, BD Biosciences) antibodies, and Propidium Iodide to detect dead cells. Human engraftment was defined as HLA-ABC+ cells.
Analysis of HBB-AAVS1 translocations
Genomic DNA was extracted from sorted populations using QuickExtract DNA Extraction Solution. For ddPCR quantification of translocations, ddPCR droplets were generated on a QX200 Droplet Generator (Bio-Rad) according to manufacturer’s protocol. Briefly, PCR reactions were set up in a 25 µL total volume per reaction with the ddPCR Supermix for Probes (No dUTP) (Bio-Rad). A HEX reference assay detecting copy number input of the TERT gene was used to normalize for genomic DNA input (Bio-Rad: saCP1000100). A custom assay designed to detect the translocations between HBB and AAVS1 consisted of: Forward primer: 5’-TCAGGGCAGAGCCATCTATTGC-3’, Reverse primer: 5’-CCAGATAAGGAATCTGCCTAACAGG-3', 5'−6FAM/ZEN/3'-IBFQ-labeled Probe (IDT): 5’-CTTCTGACACAACTGTGTTCACTAGCAACC-3’. The translocation assay was used at a final concentration of 900 nM for each of the primers and a final concentration of 250 nM for the probe. 20 µL of the PCR reaction was used for droplet generation, and 40 µL of the droplets was used in the following PCR conditions: 95° - 10 min, 50 cycles of 94° - 30 s, 57°C – 30 s, and 72° - 2 min, finalize with 98° - 10 min and 4°C until droplet analysis. Droplets were analyzed on a QX200 Droplet Reader (Bio-Rad) detecting FAM and HEX positive droplets. Control samples with non-template control (H2O) or genomic DNA from mock-electroporated samples were included in the entire process. Translocation frequencies were calculated as the translocation copy number per µL divided by the TERT copy number per µL. For sequencing of translocations, PCR products were generated using Phusion polymerase (Fisher Scientific) with the forward and reverse primers listed above for the translocation ddPCR assay. PCR amplicons were gel-purified and cloned into the pMiniT 2.0 plasmid using the NEB PCR Cloning Kit (NEB) according to manufacturer’s recommendations. Ligated plasmid reactions were transformed into XL-1 Blue competent cells, plated on ampicillin-containing agar plates, and single colonies were sequenced by MCLAB (South San Francisco, CA, USA) using rolling circle amplification followed by sequencing using the following primer: 5’-ACCTGCCAACCAAAGCGAGAAC-3’.
Analysis of cell viability and proliferation
Modified cells were FACS-sorted into individual wells of a 96-well U bottom plate and expanded in HSPC retention media (see above) at a density of <100,000 cells per mL. To check viability and proliferation after multiplexed HR, cells from a single well were recovered and a known number of absolute counting beads (CountBright beads, Invitrogen) was added. Cells were stained with Ghost Dye Red 780 (Tonbo Biosciences) for 30 min at 4°C in the dark and analyzed on a FACS-Aria II without further manipulation to reduce potential cells loss. Viable cells were determined as GhostDye Red 780 negative and exact cell counts were assessed through concomitant acquisition of 10,000 beads. Cell counts were calculated based on ratio of beads to cells within the suspension.In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.Thank you for submitting your article "Multiplexed Genetic Engineering of Hematopoietic Stem and Progenitor Cells using CRISPR and AAV6" for consideration by eLife. Your article has been favorably evaluated by Sean Morrison (Senior Editor) and three reviewers, one of whom, Ross L Levine (Reviewer #1), is a member of our Board of Reviewing Editors.The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.Although all three reviewers found merit in this work, some specific issues were raised which resonated with all of the reviewers and which need to be addressed. One is the relatively low level of engraftment, which is more striking than with the mock controls, raising concerns as to whether the multiplex gene editing is toxic to cells. This is an important question which needs to be thoughtfully addressed. The other major concern relates to potential off-target effects, and the need to see additional genomic data which addresses this question more comprehensively. Third, the authors should discuss how their method can be applied to biologic studies and what are the novel strengths and limitations to their approach.Reviewer #1:The authors describe a novel genome editing tool using HR mediated genome editing by combining delivery of Cas9/sgRNA complexes by electrocorporation and delivery of the homologous donor DNA and reporter genes by transduction of cells with a recombinant adeno associated vector.They use this technology to introduce different modifications such as monoallelic and biallelic modification of a particular gene, but also simultaneous targeting of 4 different genes in human HSPCs – which might have important applications for functional and therapeutic studies in hematological diseases.The paper does push the envelope further by showing additional technical improvements, which are substantive, above and beyond other reports. However, little biological insight is provided and it would be helpful to show novel biology to make this report of wide interest to the field. Nonetheless, as a Tools and Resources submission, this condition is relaxed a bit as long as the technique provides a rigorous proof of principle.1) Analysis of clones derived from single cell methylcellulose assays harboring monoallelic integration showed gene specific differences in the modification of non-integrated alleles. What do the authors think was the cause of these gene specific differences? And how could these be influenced?2) For the biallelic targeted integration in HPSCs, why was the rate of double positivity different in cells that have received RNP+AAV targeting e.g. Runx1 (14%) vs. CCR5 (1.2%)? Is this due to a lower transduction efficiency for the CCR5? Or other mechanisms interfering with HR? And how could this be optimized for CCR5? In Figure 1E, the authors should include the data for RUNX1, HBB and CCR5 as well.3) How did the authors design the sgRNA for introducing biallelic modifications? Were sgRNA on the both strands "reciprocal"? or were the sgRNA on both strands located at different nucleotide sequences?4) For the transplantation studies, 15 weeks after sorting double positive cells and transplanting them into sublethally irradiated mice, mice were sacrificed in engraftment (CDHLA-ABC+/CD45+) was assessed. After 15 weeks 1.5% of cells showed engraftment. Do the authors have sequential data, how this population changes over time – increases/ decreases?5) When multigenic HR was performed, was there evidence of NHEJ besides the HR directed repair? And what was its frequency? Was there evidence of gene fusions/ translocations?6) Cell viability 3 days post targeting 3 genes (RNP+AAV) was <70% compared to 100% in AAV only cells. Since the fraction of triple pos. cells is low (e.g. Runx1/HBB/ASXL1 2.9%), was flow analysis repeated at a later time point to ensure the triple-positive cells were still alive?7) The authors postulate that multiplex genome editing using their method will enable and facilitate functional studies of gene networks. Nevertheless, simultaneous gene editing of 3 genes showed an efficiency of 2.6% and of 4 genes and efficiency of 1% only. These fractions seem very small. Particularly in the setting of in vivo studies such as a BMT experiment: the authors were able to transplant ~30 000 double positive cells modifying 2 genes simultaneously with a double pos. rate of 10.2.% (di genic approach). If the triple or quadruple positive rate drops to 1%, this would mean to transplant 3 000 cells for a BMT, which seems very low. Please comment on this.Reviewer #2:This is a nicely designed and executed study using CRISPR tools to introduce reporter genes in humanCD34+ HSPCs allowing sorting and/or tracking of cells with inactivation of specific genes.The authors present an elegant approach that would be useful to model disease relevant inactivating mutations in human HSPCs, and potentially also point mutations (the authors discuss how this might be possible by modifying the CRISPR tools they present but do not actually show data on these strategies).The data are of high quality, well presented and clearly described. A minor criticism is that the results are presented in a repetitive and a bit monotonous style, but nevertheless clear and with enough detail for others to reproduce.1) Figure 2E shows low engraftment (in the 0.1-1% range) of the edited cells compared to mock, indicating a potential engraftment impairment (although the mouse numbers are low). Can the authors comment on this?2) Related to my comment above, the authors could discuss a bit more the potential limitations of their approach. For example, an important point to discuss is the time it would take from the beginning of culture to cell sorting and how this might impact HSPC function.3) Trypan blue and Annexin staining (Figure 1—figure supplement 3 and Figure 3—figure supplement 3) may not be adequate ways to assess toxicity, as dead cells may be continuously removed from the cultures and not detected with these methods. I think that cell counts depicting the expansion of the edited cells vs. mock or unmodified would be more convincing way to evaluate toxicity of the method.4) Figure 1—figure supplement 1B: the number of untargeted IL2RG alleles without indels is much higher than that of untargeted alleles of CCR5 or other genes. Is this due to lower gRNA activity (reflected by overall lower targeting efficiency in the IL2RG locus?) Can the authors comment on this?Reviewer #3:In this manuscript, Bak et al. describes a method to perform multiplex homology-directed genome editing of human HSPCs. Using Cas9-sgRNA ribonucleoprotein complex and rAAV as a donor for the homology-directed repair template, they remarkably achieve up to four (potentially more) HDR genome editing events, with little if any modification to a published protocol. Some weaknesses include lack of rigorous assessment of potential negative effects of multiplexed editing on HSPCs function and the genetic consequences. The work also lacks evidence that the functional consequences of multiplex genome editing can be read out even with the potential negative effects the editing protocol may have. Overall, this work could potentially be a highly valuable technique for the hematology field upon clarifying these issues.1) The chimerism achieved after transplantation of dual-edited cells are substantially lower than mock edited cells with no indication as to whether the cells had multi-lineage reconstitution, raising a concern that the multiplexed editing impaired the function of HSPCs. At least the contribution of dual-edited cells to each hematopoietic lineage should be shown. A concern is that this method uses rAAV6 at a high MOI (50,000-100,000 per gene. With tetra-editing, cells are exposed to 200,000 units of AAV). Is this high level of AAV exposure impairing the reconstitution potential of HSPCs (inflammation response or immune reaction?)? Do cells electroporated with Cas9-sgRNA RNP but not exposed to AAV reconstitute better?2) The authors should analyze the off-target mutagenesis events. There seems to be a substantial fraction of cells that undergo random integration of rAAV6 reporter, perhaps due to the high MOI of rAAV6 used. For example, Figure 1A shows that 7-8% of cells become reporterlow, some of which are maintained for long term (Figure 1B) indicating that the template DNA integrated into the HSPC genome. I agree that sorting reporterhigh cells will enrich for cells with HDR editing, but for a technology driven study it will be important to elucidate the nature of these reporterlow cells. This will be particularly important for some cell like T-cells (Figure 1G) that somehow have lower expression of the reporter genes. It also suggests that a fraction of reporterhigh cells also have substantial non-targeted integration, which may affect their function. Are these cells enriched for randomly integrated cells, or do they have specific integration into specific loci? Are these cells diluting the episome (so that the frequency of reporterlow cells decreases in Figure 1B) or these cells dying due to the mutagenic events of random integration?3) There is little effort to demonstrate the precision of the HDR events other than the "in-out PCR", which only tells that the reporter was integrated into the correct loci but does not tell whether the HDR was precise, or whether off-target cutting occurred, or whether translocation occurred at a level that impedes the usefulness of this protocol. The first point is important given that the authors envision their method could be used for targeted SNP knock-in. Off-targeting editing due to the use of multiple sgRNA is cautioned in the discussion but no data provided to assess whether this is a significant concern or not. The authors should perform sequencing analyses to provide quantitative assessment of precise vs. imprecise HDR editing at the targeted loci, and the extent to which off-target editing/translocation occurs.4) Although I understand that this is a method paper describing a toolbox to edit the genomes of HSPCs in a multiplexed manner, it is difficult to fully appreciate the potential of this method without a real example of how this could be used. In another word, there is no proof that it is possible to assess the functional consequences of multiplex editing other than analyzing the expression of reporter genes. Any of the concerns raised above (1-3) can impede the potential of multiplex HSPC editing. Can the authors provide evidence that multiplex editing can be used to examine the combinatorial effects of gene editing?Although all three reviewers found merit in this work, some specific issues were raised which resonated with all of the reviewers and which need to be addressed. One is the relatively low level of engraftment, which is more striking than with the mock controls, raising concerns as to whether the multiplex gene editing is toxic to cells. This is an important question which needs to be thoughtfully addressed. The other major concern relates to potential off-target effects, and the need to see additional genomic data which addresses this question more comprehensively. Third, the authors should discuss how their method can be applied to biologic studies and what are the novel strengths and limitations to their approach.Reviewer #1: […] The paper does push the envelope further by showing additional technical improvements, which are substantive, above and beyond other reports. However, little biological insight is provided and it would be helpful to show novel biology to make this report of wide interest to the field. Nonetheless, as a Tools and Resources submission, this condition is relaxed a bit as long as the technique provides a rigorous proof of principle.We thank the reviewer for the supportive words. In the revised manuscript, we now provide an example of how the enrichment technology may be used. We now show that knockout of the cohesin complex member STAG2 significantly decreases erythroid colony formation in humanCD34+ HSPCs (new Figure 1D). Since the sgRNA used to manipulate STAG2 has moderate efficiency (compared to other genes presented in our manuscript), the magnitude of the observed erythroid differentiation deficit in knockout cells could only be observed because our presented methodology allowed for the purification of modified cells. This further emphasizes the potential of our presented technology. Knockdown strategies using “INDEL-based” CRISPR/Cas9 approaches without the ability to purify modified cells would most likely miss such an important phenotype.1) Analysis of clones derived from single cell methylcellulose assays harboring monoallelic integration showed gene specific differences in the modification of non-integrated alleles. What do the authors think was the cause of these gene specific differences? And how could these be influenced?We believe that the differences observed in the modification of the non-integrated allele in clones with mono-allelic integration could be due to differences in the accessibility of the chromatin, which was recently examined in Chen et al., Nature Comm. 2017, Apr 7;8:14958. This could influence the efficiencies of the individual sgRNAs and dynamics of INDELs vs. HR.2) For the biallelic targeted integration in HPSCs, why was the rate of double positivity different in cells that have received RNP+AAV targeting e.g. Runx1 (14%) vs. CCR5 (1.2%)? Is this due to a lower transduction efficiency for the CCR5? Or other mechanisms interfering with HR? And how could this be optimized for CCR5? In Figure 1E, the authors should include the data for RUNX1, HBB and CCR5 as well.We have observed that CCR5 targeting rates, both mono and bi-allelic, are generally lower than at other loci and believe this may be a sgRNA-specific effect that changes the dynamics and spectrum of INDEL creation which may impact HR frequencies. The lack of CCR5 biallelic targeting is also corroborated in Figure 1C where we analyzed allelic frequencies in methylcellulose colonies. Increasing biallelic targeting at CCR5 may be achieved using a different sgRNA, increasing Cas9 concentrations and/or increasing the amount of AAV6 used. However, we believe that optimizing bi-allelic integration into the CCR5 gene, while interesting from the perspective of generating an HIV-resistant immune system, goes beyond the scope of the work here. As for the data for Figure 1E, we have only performed this experiment for HBB.3) How did the authors design the sgRNA for introducing biallelic modifications? Were sgRNA on the both strands "reciprocal"? or were the sgRNA on both strands located at different nucleotide sequences?For each targeted gene, only one sgRNA is used. This sgRNA targets the same target sites on the two alleles. We have made textual changes to clarify this. Thus, it is important to target sequences that are not polymorphic in the human population when using a single sgRNA and trying to create bi-allelic modifications.4) For the transplantation studies, 15 weeks after sorting double positive cells and transplanting them into sublethally irradiated mice, mice were sacrificed in engraftment (CDHLA-ABC+/CD45+) was assessed. After 15 weeks 1.5% of cells showed engraftment. Do the authors have sequential data, how this population changes over time – increases/ decreases?We have now extended the transplantation experiment and show new data in Figure 3f, Figure 3g, and Figure 3—figure supplement 3. These transplants were performed intra-femorally and at end point upon euthanasia, engraftment was assessed in the transplanted femur. To allow robust assessment of engraftment efficiencies, we did not perform serial aspirates from the transplanted femur. The suggestion to examine engraftment over time is an excellent point, and future studies using significantly larger numbers of mice focused on the engraftment kinetics of genome edited cells will be necessary to investigate this point.5) When multigenic HR was performed, was there evidence of NHEJ besides the HR directed repair? And what was its frequency? Was there evidence of gene fusions/ translocations?If the reviewer refers to NHEJ-mediated capture of the AAVdonor at on or off target sites, those are events that we have observed. This data is presented in Figure 4—figure supplement 4 where we mix-match nucleases and donors. The frequencies are below 10% of the total targeting frequencies (i.e. 1 out of every 10 targeted cells may have a capture). If the reviewer refers to NHEJ on non-targeted alleles, we have included that data for tri-genic targeting in Supplementary file 4. These data show that 85% of the clones with tri-genic integrations were modified on all alleles either by biallelic integration or INDELs on the non-integrated allele that were mostly disruptive. As suggested by the reviewer, we have now included data quantifying translocations when targeting two genes simultaneously. As expected, we do observe translocations, although at low frequencies (around 0.3%) for a monocentric translocation, which has been reported to be the most frequent translocation type (Poirot et al., Cancer Res. 2015). This new data is presented in Figure 3E and Figure 3—figure supplement 2.6) Cell viability 3 days post targeting 3 genes (RNP+AAV) was <70% compared to 100% in AAV only cells. Since the fraction of triple pos. cells is low (e.g. Runx1/HBB/ASXL1 2.9%), was flow analysis repeated at a later time point to ensure the triple-positive cells were still alive?We show survival as well as proliferation of cells targeted with multiplexed HR at time points beyond 4 days post electroporation in methylcellulose and liquid culture experiments. See for example Figure 1B, 1C, 2B, C, 3B, C, D, and 3E. Though this data is not comparative, the CFU assay in Figure 4B compares total colony formation of sorted reporter triple-positive cells to cells only transduced with AAV. Finally, long-term survival of modified cells is confirmed in the transplantation data displayed in Figure 3F and 3G.To further underscore this point, we have performed viability as well as a cell proliferation assay over a one-week culture period after sorting di-genically targeted cells (day of sort in this specific experiment is defined as Day 0 even though it is actually day 4 after editing), which confirms viability and proliferation capacity of targeted cells equivalent to non-targeted or mock-treated controls. We have added this data from a single cord blood donor to Author response image 1.
Author response image 1.
7) The authors postulate that multiplex genome editing using their method will enable and facilitate functional studies of gene networks. Nevertheless, simultaneous gene editing of 3 genes showed an efficiency of 2.6% and of 4 genes and efficiency of 1% only. These fractions seem very small. Particularly in the setting of in vivo studies such as a BMT experiment: the authors were able to transplant ~30 000 double positive cells modifying 2 genes simultaneously with a double pos. rate of 10.2.% (di genic approach). If the triple or quadruple positive rate drops to 1%, this would mean to transplant 3 000 cells for a BMT, which seems very low. Please comment on this.The reviewer brings up an excellent consideration. We have now included a revised transplantation experiment using higher numbers of cells with di-genic targeted integration (new Figure 3F and 3G). This data includes a mouse with human chimerism of 19.4%. However, we do agree with the reviewer on this important point and have made textual changes to mention ways to overcome this challenge, such as starting with higher cell doses, expansion of modified cells, or transplantation into human bone ossicles, which require very few transplanted cells (Reinisch et al., Nature Med. 2016 Jul;22(7):812-21.). While we believe that we have offered a multiplexing platform to build around, future studies will be needed to address these issues. In addition, while 1% can seem like a small percentage, humanleukemias develop from a single cell and yet end up taking over the hematopoietic system. In collaborative work published with the Cleary lab, for example, leukemias developed after transplantation into NSG mice with very low genome editing frequencies (perhaps as low as 0.1%) (Buechele et al., Blood 2015). In those studies, only a single editing event was created (but in a powerful oncogene (MLL)), and the current approach allows the generation of multiple editing events in genes that might have weaker transformative effects than the MLL-AF9 fusion.Reviewer #2:[…] The data are of high quality, well presented and clearly described. A minor criticism is that the results are presented in a repetitive and a bit monotonous style, but nevertheless clear and with enough detail for others to reproduce.We appreciate the reviewer’s high praise of our study, as we also believe that it will advance the field for modeling hematological malignancies. We are sorry that we have erred on being monotonous in our presentation, but we felt it was more appropriate to attempt to give a balanced view rather than to overstate our data. In fact, we are tremendously excited about the data and both of our labs have fully embraced the discoveries and approach for studies of leukemogenesis and for developing novel cell based therapeutics.1) Figure 2E shows low engraftment (in the 0.1-1% range) of the edited cells compared to mock, indicating a potential engraftment impairment (although the mouse numbers are low). Can the authors comment on this?We have previously described that HSCs are more refractory to HR compared to short-lived hematopoietic progenitors (Dever and Bak et al., Nature 2016). Therefore, higher numbers of modified cells must be transplanted to achieve engraftment frequencies similar to that of unmodified cells. Since we are targeting multiple genes rather than just one, it is not surprising that within a population of cells purified based on HR-mediated marker expression the number of true HSCs is relatively low and that long-term engraftment consequently will be low. To show that higher engraftment is possible by using more cells, we have now updated the transplantation results with data from a new experiment transplanting higher cell numbers (new Figure 3F-G and Figure 3—figure supplement 3). For one mouse transplanted with enriched di-genically targeted cells we observed 19.4% multilineage chimerism showing that the limiting factor is HSC cell number within the targeted population. While we believe that we have provided enough evidence that multiplexed targeting occurs in HSPCs with long-term and multilineage potential, further studies are necessary to address the caveats of achieving high targeting rates in the most immature HSC compartment with subsequent high engraftment of multiplexed-HSPCs. We have also made text revisions with suggestions on how to overcome low engraftment (higher cell dose, pre-transplant in vitro expansion, and the use of a human bone ossicle transplantation model).2) Related to my comment above, the authors could discuss a bit more the potential limitations of their approach. For example, an important point to discuss is the time it would take from the beginning of culture to cell sorting and how this might impact HSPC function.We completely agree with the reviewer and have added this relevant information to the second paragraph of the Discussion.3) Trypan blue and Annexin staining (Figure 1—figure supplement 3 and Figure 3—figure supplement 3) may not be adequate ways to assess toxicity, as dead cells may be continuously removed from the cultures and not detected with these methods. I think that cell counts depicting the expansion of the edited cells vs. mock or unmodified would be more convincing way to evaluate toxicity of the method.We agree with the reviewer. Expansion of edited cells is shown in the CFU assay in Figure 4B, which compares the total colony formation of sorted reporter triple-positive cells to cells only transduced with AAV.However, to further support this finding we have performed a cell proliferation assay as suggested by the reviewer using cell counts over a one-week culture period after sorting di-genically targeted cells. This assay confirms viability and proliferation capacity of targeted cells comparable to non-targeted or mock-treated controls. We have added this data from a single cord blood donor to Author response image 2 (day of sort is Day 0).
Author response image 2.
4) Figure 1—figure supplement 1B: the number of untargeted IL2RG alleles without indels is much higher than that of untargeted alleles of CCR5 or other genes. Is this due to lower gRNA activity (reflected by overall lower targeting efficiency in the IL2RG locus?) Can the authors comment on this?As the reviewer suggests, this difference in INDEL frequency is likely gRNA and locus specific, and generally must be determined empirically. The CCR5 guide used in this study creates a very high percentage of INDELs, specifically with a 1bp insertion (Bak and Porteus, Cell Reports, 2017), compared to the IL2RG sgRNA.Reviewer #3:In this manuscript, Bak et al. describes a method to perform multiplex homology-directed genome editing of human HSPCs. Using Cas9-sgRNA ribonucleoprotein complex and rAAV as a donor for the homology-directed repair template, they remarkably achieve up to four (potentially more) HDR genome editing events, with little if any modification to a published protocol. Some weaknesses include lack of rigorous assessment of potential negative effects of multiplexed editing on HSPCs function and the genetic consequences. The work also lacks evidence that the functional consequences of multiplex genome editing can be read out even with the potential negative effects the editing protocol may have. Overall, this work could potentially be a highly valuable technique for the hematology field upon clarifying these issues.We very much appreciate the reviewer acknowledging that it is remarkable to achieve HR at 4 genes simultaneously and that this work could be a highly valuable technique for the hematology field. We have addressed the reviewer’s concerns below.1) The chimerism achieved after transplantation of dual-edited cells are substantially lower than mock edited cells with no indication as to whether the cells had multi-lineage reconstitution, raising a concern that the multiplexed editing impaired the function of HSPCs. At least the contribution of dual-edited cells to each hematopoietic lineage should be shown. A concern is that this method uses rAAV6 at a high MOI (50,000-100,000 per gene. With tetra-editing, cells are exposed to 200,000 units of AAV). Is this high level of AAV exposure impairing the reconstitution potential of HSPCs (inflammation response or immune reaction?)? Do cells electroporated with Cas9-sgRNA RNP but not exposed to AAV reconstitute better?We appreciate the reviewer’s concerns and agree that maintaining HSPC function is of highest importance for this methodology to be useful. We note that in the revised manuscript we have now updated the engraftment data with a new experiment transplanting more cells (new Figure 3F-G and Figure 3—figure supplement 3). Notably, we can now report a mouse transplanted with di-genically targeted cells that displays 19.4% chimerism and multilineage reconstitution (B cells and myeloid cells). However, we agree that the engraftment capacity of HSPCs edited at multiple loci might be a concern. Lower engraftment of edited and enriched cells compared to unedited cells is something we have observed before (Dever & Bak et al., Nature 2016). This phenomenon is caused by lower targeting rates in HSCs with long-term repopulation capacity (LT-HSCs). Therefore, an edited population contains less LT-HSCs than a non-edited, and it is therefore expected to observe less engraftment with edited populations. With that said, we are currently investigating expansion protocols, injection of a higher numbers of cells, and transplantation into a human bone ossicle xenograft model, but believe those studies are beyond the scope of this manuscript. We do not believe high levels of AAV exposure is influencing HSPC function, as we see equivalent number of progenitor-derived colonies compared to controls (Figure 4B).2) The authors should analyze the off-target mutagenesis events. There seems to be a substantial fraction of cells that undergo random integration of rAAV6 reporter, perhaps due to the high MOI of rAAV6 used. For example, Figure 1A shows that 7-8% of cells become reporterlow, some of which are maintained for long term (Figure 1B) indicating that the template DNA integrated into the HSPC genome. I agree that sorting reporterhigh cells will enrich for cells with HDR editing, but for a technology driven study it will be important to elucidate the nature of these reporterlow cells. This will be particularly important for some cell like T-cells (Figure 1G) that somehow have lower expression of the reporter genes. It also suggests that a fraction of reporterhigh cells also have substantial non-targeted integration, which may affect their function. Are these cells enriched for randomly integrated cells, or do they have specific integration into specific loci? Are these cells diluting the episome (so that the frequency of reporterlow cells decreases in Figure 1B) or these cells dying due to the mutagenic events of random integration?The reviewer brings up an excellent point. The reporterlow cells contain mostly cells that are not targeted, but also some cells that are targeted (based on consistent reporter expression over time as shown in Figure 1B). We have now included data that presents a more in-depth analysis of the reporterlow fraction (new Figure 1—figure supplement 1). This data shows that most of the reporterlow cells are diluting the episome and becoming negative over time, but some remain positive and even become reporterhigh because they are in fact targeted. The intensity of the reporter strongly correlates with the propensity to shift to reporterhigh expression either due to delayed HR or delayed expression. The T cells shown in Figure 2D have low expression because in this particular experiment a weaker promoter (EF1α promoter which is weaker than SFFV) drives the genes encoding fluorescent proteins. We know from our previous studies (Dever & Bak et al., Nature 2016) that the reporterhigh monogenic targeted HSPCs can engraft long-term in secondary transplants, so it is unlikely that most of the reporterhigh cells have impaired function and more likely that they have specific integration into the on-target loci (as based on our genotyping data in colonies after 14 days in culture).While we believe that off-target effects should always be considered when evaluating the results of a genome editing experiment, since we studied 8 different genes in this work to establish a proof of concept, we do not believe that a comprehensive analysis of potential off-target INDELs is within the scope of this manuscript. We did, however, add data on translocation frequencies (Figure 3E and Figure 3—figure supplement 2) to give readers a sense of the relative frequency of such events (rare, accounting for <0.3%). Finally, we are not aware of any study using genome editing in which an off-target INDEL has confounded a biologic finding (in striking contrast to the RNAi methodology) and remain encouraged for the use of this approach. Nonetheless, we agree that unplanned genomic changes (from the Cas9/gRNA, from the AAV, and from simply culturing cells ex vivo) should always be carefully considered when interpreting the phenotypic effect of a genome editing experiment.3) There is little effort to demonstrate the precision of the HDR events other than the "in-out PCR", which only tells that the reporter was integrated into the correct loci but does not tell whether the HDR was precise, or whether off-target cutting occurred, or whether translocation occurred at a level that impedes the usefulness of this protocol. The first point is important given that the authors envision their method could be used for targeted SNP knock-in. Off-targeting editing due to the use of multiple sgRNA is cautioned in the discussion but no data provided to assess whether this is a significant concern or not. The authors should perform sequencing analyses to provide quantitative assessment of precise vs. imprecise HDR editing at the targeted loci, and the extent to which off-target editing/translocation occurs.We appreciate the reviewer’s concerns. We and others have previously confirmed that the majority of HR events are perfect/seamless (for example Sather et al., STM, 2015; Dever & Bak et al., Nature, 2016; Bak and Porteus, Cell Reports, 2017). We believe that a full off-target integration analysis is beyond the scope of this study, but as suggested by the reviewer, we have now included data quantifying translocations when targeting two genes simultaneously in HSPCs (new Figure 3E). As expected we do observe translocations, although at low frequencies (around 0.3%) for a monocentric translocation, which has been reported to be the most frequent translocation type.4) Although I understand that this is a method paper describing a toolbox to edit the genomes of HSPCs in a multiplexed manner, it is difficult to fully appreciate the potential of this method without a real example of how this could be used. In another word, there is no proof that it is possible to assess the functional consequences of multiplex editing other than analyzing the expression of reporter genes. Any of the concerns raised above (1-3) can impede the potential of multiplex HSPC editing. Can the authors provide evidence that multiplex editing can be used to examine the combinatorial effects of gene editing?We developed this method to be able to study genes involved in hematopoiesis that eliminates the need for random integration of lentiviral vectors to reduce gene expression in HSPCs. To exemplify the applicability of studying HSPC biology using targeted integration of a reporter gene combined with the enrichment methodology, we have now included new data (Figure 1D) showing that knockout of the cohesion member gene STAG2 leads to a drastic loss of erythroid colony formation. This highlights that sorting HR-targeted HSPCs can be used to study gene function.
Authors: Leonid Bystrykh; Ellen Weersing; Bert Dontje; Sue Sutton; Mathew T Pletcher; Tim Wiltshire; Andrew I Su; Edo Vellenga; Jintao Wang; Kenneth F Manly; Lu Lu; Elissa J Chesler; Rudi Alberts; Ritsert C Jansen; Robert W Williams; Michael P Cooke; Gerald de Haan Journal: Nat Genet Date: 2005-02-13 Impact factor: 38.330
Authors: Zhen Zhao; Johannes Zuber; Ernesto Diaz-Flores; Laura Lintault; Scott C Kogan; Kevin Shannon; Scott W Lowe Journal: Genes Dev Date: 2010-07-01 Impact factor: 11.361
Authors: Pablo Tebas; David Stein; Winson W Tang; Ian Frank; Shelley Q Wang; Gary Lee; S Kaye Spratt; Richard T Surosky; Martin A Giedlin; Geoff Nichol; Michael C Holmes; Philip D Gregory; Dale G Ando; Michael Kalos; Ronald G Collman; Gwendolyn Binder-Scholl; Gabriela Plesa; Wei-Ting Hwang; Bruce L Levine; Carl H June Journal: N Engl J Med Date: 2014-03-06 Impact factor: 91.245
Authors: Claire Mazumdar; Ying Shen; Seethu Xavy; Feifei Zhao; Andreas Reinisch; Rui Li; M Ryan Corces; Ryan A Flynn; Jason D Buenrostro; Steven M Chan; Daniel Thomas; Julie L Koenig; Wan-Jen Hong; Howard Y Chang; Ravindra Majeti Journal: Cell Stem Cell Date: 2015-10-22 Impact factor: 24.633
Authors: Corey Cutler; Pratik Multani; David Robbins; Haesook T Kim; Thuy Le; Jonathan Hoggatt; Louis M Pelus; Caroline Desponts; Yi-Bin Chen; Betsy Rezner; Philippe Armand; John Koreth; Brett Glotzbecker; Vincent T Ho; Edwin Alyea; Marlisa Isom; Grace Kao; Myriam Armant; Leslie Silberstein; Peirong Hu; Robert J Soiffer; David T Scadden; Jerome Ritz; Wolfram Goessling; Trista E North; John Mendlein; Karen Ballen; Leonard I Zon; Joseph H Antin; Daniel D Shoemaker Journal: Blood Date: 2013-08-30 Impact factor: 22.113
Authors: Dirk Heckl; Monika S Kowalczyk; David Yudovich; Roger Belizaire; Rishi V Puram; Marie E McConkey; Anne Thielke; Jon C Aster; Aviv Regev; Benjamin L Ebert Journal: Nat Biotechnol Date: 2014-06-22 Impact factor: 54.908
Authors: Jianbin Wang; Colin M Exline; Joshua J DeClercq; G Nicholas Llewellyn; Samuel B Hayward; Patrick Wai-Lun Li; David A Shivak; Richard T Surosky; Philip D Gregory; Michael C Holmes; Paula M Cannon Journal: Nat Biotechnol Date: 2015-11-09 Impact factor: 54.908
Authors: Volker Wiebking; James O Patterson; Renata Martin; Monica K Chanda; Ciaran M Lee; Waracharee Srifa; Gang Bao; Matthew H Porteus Journal: Nat Biotechnol Date: 2020-07-13 Impact factor: 54.908
Authors: Johan Jeong; Astraea Jager; Pablo Domizi; Mara Pavel-Dinu; Linda Gojenola; Masayuki Iwasaki; Michael C Wei; Feng Pan; James L Zehnder; Matthew H Porteus; Kara L Davis; Michael L Cleary Journal: Blood Adv Date: 2019-10-08
Authors: Annalisa Lattanzi; Joab Camarena; Premanjali Lahiri; Helen Segal; Waracharee Srifa; Christopher A Vakulskas; Richard L Frock; Josefin Kenrick; Ciaran Lee; Narae Talbott; Jason Skowronski; M Kyle Cromer; Carsten T Charlesworth; Rasmus O Bak; Sruthi Mantri; Gang Bao; David DiGiusto; John Tisdale; J Fraser Wright; Neehar Bhatia; Maria Grazia Roncarolo; Daniel P Dever; Matthew H Porteus Journal: Sci Transl Med Date: 2021-06-16 Impact factor: 17.956