| Literature DB >> 22467209 |
Fayza Daboussi1, Mikhail Zaslavskiy, Laurent Poirot, Mariana Loperfido, Agnès Gouble, Valerie Guyot, Sophie Leduc, Roman Galetto, Sylvestre Grizot, Danusia Oficjalska, Christophe Perez, Fabien Delacôte, Aurélie Dupuy, Isabelle Chion-Sotinel, Diane Le Clerre, Céline Lebuhotel, Olivier Danos, Frédéric Lemaire, Kahina Oussedik, Frédéric Cédrone, Jean-Charles Epinat, Julianne Smith, Rafael J Yáñez-Muñoz, George Dickson, Linda Popplewell, Taeyoung Koo, Thierry VandenDriessche, Marinee K Chuah, Aymeric Duclert, Philippe Duchateau, Frédéric Pâques.
Abstract
The ability to specifically engineer the genome of living cells at precise locations using rare-cutting designer endonucleases has broad implications for biotechnology and medicine, particularly for functional genomics, transgenics and gene therapy. However, the potential impact of chromosomal context and epigenetics on designer endonuclease-mediated genome editing is poorly understood. To address this question, we conducted a comprehensive analysis on the efficacy of 37 endonucleases derived from the quintessential I-CreI meganuclease that were specifically designed to cleave 39 different genomic targets. The analysis revealed that the efficiency of targeted mutagenesis at a given chromosomal locus is predictive of that of homologous gene targeting. Consequently, a strong genome-wide correlation was apparent between the efficiency of targeted mutagenesis (≤ 0.1% to ≈ 6%) with that of homologous gene targeting (≤ 0.1% to ≈ 15%). In contrast, the efficiency of targeted mutagenesis or homologous gene targeting at a given chromosomal locus does not correlate with the activity of individual endonucleases on transiently transfected substrates. Finally, we demonstrate that chromatin accessibility modulates the efficacy of rare-cutting endonucleases, accounting for strong position effects. Thus, chromosomal context and epigenetic mechanisms may play a major role in the efficiency rare-cutting endonuclease-induced genome engineering.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22467209 PMCID: PMC3401453 DOI: 10.1093/nar/gks268
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 2.Characterization of MNs properties using cell-based assays. (a) An extrachromosomal assay for the characterization of MN activity in mammalian cells. Briefly, MN expressing vectors and reporter vector are co-transfected into CHO-KI or 293-H cells. Upon cleavage of the target site, tandem repeat recombination by SSA between two truncated copies of the LacZ gene restores a functional β-galactosidase gene, which can be monitored by standard assays. For each MN, a dose response is performed and AUC is used as a quantitative measure of MN activity. (b) Example of read out of the extrachromosomal assay described in panel. The activity of the CLS4076m, I-SceIm and Rag1m MNs is featured as an example. AUC is used as a quantitative measure of MN activity in this type of extrachromosomal assay. For CLS4076, AUC is the area in grey. (c) Monitoring of MN expression by western blotting. 293-H cells and CHO-K1 were transfected with a plasmid-encoding MN. The MN expression level was monitored by western blot. (d) Quantification of MN expression level. The MN expression level from western blot (c) was quantified and normalized to β-tubulin signal. These ratios were plotted for both cell lines and statistical analysis was performed. r, Pearson coefficient (linear correlation coefficient); ρ, Spearman coefficient (non-linear correlation coefficient); N, sample size; P, probability of finding a given correlation when the underlying variables are not correlated. (e) Comparison between the extra-chromosomal SSA assays in CHO-KI and HEK293 cells. Sixteen MNs were tested in both assays (Table 1). Dotted grey curves represent 95% confidence interval for the regression line. r, Pearson coefficient (linear correlation coefficient); ρ, Spearman coefficient (non-linear correlation coefficient); N, sample size; P, probability of finding a given correlation when the underlying variables are not correlated.
Figure 1.Genome-wide distribution of the targets of the MNs used in this study. (a) Schematic representation of the distribution per chromosome. Thirty-seven MNs cleaving 39 targets were used in this study, including 36 MNs cleaving each single target (represented by filled-in triangles) and 1 MN (CLS3902m), cleaving 3 (represented by open triangles). Target sequences and MN properties are described in Table 1. (b) Distribution per type of genomic sequence, per GC content and per CpG number. Distributions for target locations (top row) are compared with the distribution found for the whole genome (bottom row, on assembly GRCh37/hg19). For the type of genomic sequences, two MN target sites are overlapping with an intron/exon boundary and a special class was made for them. For GC content and CpG number, windows of 1 kb are taken around the recognition site (top row) or successively on the whole genome (bottom row) and the percentage of GC or the number of CpG dinucleotides are computed. Percentages in each class are displayed. The estimation of the percentage of sequences in exon, intron and intergenic regions on the whole genome is based on available annotations. The human genome version we used was GRCh37/hg19.
Meganucleases and targets
| Target characterization | Extra- chromosomal test (AUC) | Tox test | Chromatin accessibility (AUC) | Targeted mutagenesis in 293 cells (% indels) | Homologous gene targeting in 293 cells (% KI) | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MN | Gene | Target sequence | Chromosome | Seq. type | CHO | 293 | CHO AUC | act. dose | Tox | 293 | CD34 | T cell | d7 w/ MN | d2 w/ MN | d7 w/o MN | w/ MN | w/o MN |
| CLS3906m | TTTTCCTGTCTTACCAGGTTTTGT | 1p34.3 | Intergenic | 6.63 | 5.04 | 6 | 70 | 0.10 | 0.04 | ||||||||
| CLS3903m | TTAAAACTTCTTAAAGGATCCCAG | 1q31.2 | Intergenic | 8.98 | 3.14 | 2.5 | 60 | 0.80 | 0.10 | 2.00 | 0.00 | ||||||
| CLS3898m | CTAGGCTGGATTACAGCGGCTTGA | 2q14.1 | Intergenic | 8.36 | 5.37 | 2.5 | 80 | 0.00 | 0.00 | ||||||||
| CLS4633m | CCR5 | TCGAAATGAGAAGAAGAGGCACAG | 3p21 | Exon | 3.70 | 1.78 | 1.2 | 40 | 0.04 | 0.00 | |||||||
| XPC4m | XPC | TCGAGATGTCACACAGAGGTACGA | 3p25 | Exon | 10.80 | 3.52 | 5.84 | 2.5 | 80 | 188 | 0.40 | 0.36 | 0.00 | 1.05 | 0.00 | ||
| CLS3719m | ATTTCATCACCTAACTAATCTCAC | 3p25.3 | Intergenic | 2.57 | 0.00 | 0.02 | |||||||||||
| CLS3191m | Rho | ACTTCCTCACGCTCTACGTCACCG | 3q21-q24 | Exon | 4.82 | 0.40 | 5.37 | 6 | 95 | 0.10 | 0.00 | ||||||
| CLS3894m | ACAATCTGAGGTAAGTAATACTGA | 5p14.3 | Intergenic | 8.09 | 1.45 | 3.87 | 6 | 45 | 0.50 | 0.02 | 0.60 | 0.00 | |||||
| CLS3902m | TCAAACCATTGTACTCCAGCCTGG | 14q24 | Intergenic | 9.99 | 3.03 | 5.36 | 2.5 | 90 | 247 | 0.19 | 0.70 | 0.02 | |||||
| TCAAACCATTGTACTCCAGCCTGG | 5q31 | Intergenic | 337 | 0.11 | 0.39 | 0.02 | |||||||||||
| HECW1 | TCAAACCATTGTACTCCAGCCTGG | 7p13 | Intron | 163 | 0.53 | 1.17 | 0.02 | ||||||||||
| CLS2697m | CCAATACAAGGTACAAAGTCCTGA | 6p25.1 | Intergenic | 9.09 | 4.79 | 2.5 | 80 | 0.70 | 0.00 | ||||||||
| CLS2705m | TTAAAACACTGTACACCATTTTGA | 7q31.2 | Intergenic | 6.41 | 4.49 | 2.5 | 75 | 0.20 | 0.02 | 0.90 | 0.00 | ||||||
| CLS2760m | SMC5L1 | CAACTCCGTCTTACAGAGGTAAAA | 9q21.11 | Intron/Exon | 8.09 | 4.96 | 6 | 60 | 0.10 | 0.00 | 0.00 | 0.00 | |||||
| Rag1m | RAG1 | TTGTTCTCAGGTACCTCAGCCAGC | 11p13 | Intron/Exon | 9.19 | 0.82 | 5.16 | 2.5 | 95 | 106 | 78 | 70 | 1.30 | 1.28 | 0.03 | 8.40 | 0.00 |
| CLS2607m | HBB | TCAGACTTCTCCTCAGGAGTCAGA | 11p15.5 | Exon | 2.04 | 5.80 | 6 | 95 | 0.01 | 0.04 | |||||||
| CLS3769m | HBB | AGGGGATAAGTAACAGGGTACAGT | 11p15.5 | Intron | 7.08 | 4.66 | 2.5 | 45 | 0.06 | 0.00 | |||||||
| CLS4422m | NANOG | ACTGAACGCTGTAAAATAGCTTAA | 12p13.31 | Intron | 11.10 | 5.04 | 2.5 | 50 | 0.10 | 0.03 | |||||||
| CLS3884m | NACA | GGGAACTGCTTCACAGGATCCAGA | 12q23-q24.1 | Exon | 5.88 | 2.5 | 0.02 | 0.06 | |||||||||
| CLS3896m | ATAAAACAAGTCACGTTATTTTGG | 13q34 | Intergenic | 4.24 | 5.76 | 6 | 85 | 0.10 | 0.05 | ||||||||
| CLS4076m | FUT8 | TTGTAATGTCTTACAAGGTTTTAA | 14q23.3 | Promoteur | 10.91 | 1.29 | 3.21 | 2.5 | 40 | 55 | 43 | 41 | 0.70 | 5.09 | 0.06 | 0.37 | 0.00 |
| ADCY9m | ADCY9 | CCCAGATGTCGTACAGCAGCTTGG | 16p13.3 | Exon | 8.24 | 2.87 | 5.97 | 2.5 | 95 | 410 | 201 | 179 | 0.20 | 0.17 | 0.06 | 0.70 | 0.00 |
| CLS2276m | p53.7 | TCTGGCCCCTCCTCAGCATCTTAA | 17p13.1 | Exon | 9.19 | 0.10 | 0.03 | 0.99 | 0.00 | ||||||||
| CAPNS1m | CAPNS1 | CAGGGCCGCGGTGCAGTGTCCGAC | 19q13.12 | 5′UTR | 14.08 | 3.49 | 5.15 | 24.23 | 21 | 52 | 2.40 | 6.10 | 0.01 | 12.00 | 0.26 | ||
| scCAPNS1m | 13.96 | 13.96 | 3.98 | 1.2 | 70 | ND | 6.20 | 0.01 | 15.00 | 0.26 | |||||||
| CLS2752m | CTSZ | ATTACCCCCTTCACAGGAGCCAGC | 20q13 | Intron | 9.99 | 3.89 | 2.5 | 40 | 0.30 | 0.20 | 0.00 | 0.00 | |||||
| CLS3895m | AGAAGCCCAGGTAAAACAGCCTGG | 21q21 | Intergenic | 3.74 | 4.07 | 6 | 50 | 0.10 | 0.10 | ||||||||
| CLS3690m | TTAATACCCCGTACCTAATATTGC | 21q21.1 | Intergenic | 5.71 | 2.21 | 4.94 | 2.5 | 80 | 0.50 | 0.06 | 1.50 | 0.00 | |||||
| CLS3899m | GCAAAACATTGTAAGACATGCCAA | 22q13.33 | Intergenic | 10.59 | 2.73 | 1.74 | 2.5 | 40 | 0.10 | 0.01 | |||||||
| CLS3702m | WAS | TCCAAACCTCTCACAAAAGTGTAT | Xp11.23- p11.22 | Intron | 7.70 | 2.40 | 2.5 | 50 | 0.10 | 0.04 | |||||||
| CLS3759m | WAS | CCGGGCCCTCGTGCAGGAGAAGAT | Xp11.23- p11.22 | Exon | 10.52 | 5.54 | 2.5 | 95 | 286 | 0.50 | 0.48 | 0.00 | 4.50 | 0.40 | |||
| DMD21m | DMD | GAAACCTCAAGTACCAAATGTAAA | Xp21.2 | Intron | 4.10 | 0.35 | 5.62 | 6 | 95 | 61 | 25 | 35 | 1.40 | 1.20 | 0.05 | 8.40 | 0.00 |
| CLS3326m | DMD | AAATCCTGCCTTAAAGTATCTCAT | Xp21.2 | Intron | 6.76 | 5.59 | 2.5 | 100 | 0.10 | 0.07 | 0.09 | 0.00 | |||||
| CLS3402m | DMD | TTTACCTATTTTAAGTCAGATACA | Xp21.2 | Intron | 2.95 | 0.36 | 3.75 | 2.5 | 65 | 0.00 | 0.03 | ||||||
| CLS3631m | DMD | AATGTCTGATGTTCAATGTGTTGA | Xp21.2 | Intron | 7.03 | 1.65 | 3.54 | 6 | 50 | 0.20 | 0.01 | 0.60 | 0.00 | ||||
| CLS4607m | DMD | GAATCCTGTTGTTCATCATCCTAG | Xp21.2 | Intron | 9.19 | 0.90 | 0.08 | 4.00 | 0.00 | ||||||||
| CLS4418m | CCAACATCCTGAACCTCAGCTAC | Xq11 | Pseudogene | 5.61 | 3.61 | 1.2 | 85 | 0.30 | ND | ||||||||
| CLS3368m | IL2RG | CCAGGCTTGAGTGCAGTGGGGCGA | Xq13.1 | Intron | 10.17 | 4.35 | 1.2 | 85 | 0.02 | 0.02 | |||||||
| CLS3640m | IL2RG | ATTGCACTTGGAACAGCAGCTCTG | Xq13.1 | Exon | 6.25 | 1.57 | 5.13 | 2,5 | 80 | 0.40 | 0.00 | ||||||
| CLS3676m | IL2RG | CCTAAACTTGGTGCCCCATCTCTC | Xq13.1 | Intron | 11.53 | 2.09 | 3.98 | 6 | 50 | 0.20 | 0.04 | 1.08 | 0.00 | ||||
| I-CreI | NA | TAGGGATAACAGGGTAAT | NA | NA | 12.39 | 2.20 | 1 | 95 | NA | NA | NA | NA | NA | NA | NA | NA | |
| I-SceI | NA | TCAAAACGTCGTGAGACAGTTTGG | NA | NA | 5.56 | 0.61 | 5.98 | 6 | 95 | NA | NA | NA | NA | NA | NA | NA | NA |
aFor statistics of Figures 3 and 4, results for CAPNS1 target are from CAPNS1m.
bWhen in a gene, or in the promoter.
cMeasured as shown in Figure 2A and B, in CHO-KI cells.
dMeasured as shown in Figure 2A and B, in 293-H cells.
eMeasured as shown in Figure 2G and 2H, in CHO-KI cells.
fMeasured in nanograms, amount of transfected DNA corresponding to MN maximal activity.
gMeasured in percentage of survived cells at the active dose.
hMeasured as shown in Figure 5, see also ‘Materials and Methods’ section.
iMeasured in percentage of chromatids, as shown in Figure 3A, 7 days after transfection with MN expressing vector in 293-H cells.
jMeasured in percentage of chromatids, as shown in Figure 3A, 2 days after transfection with MN expressing vector in 293-H cells.
kMeasured in percentage of chromatids, as shown in Figure 3A, 7 days after mock transfection in 293-H cells.
lMeasured in percentage of cells, as shown in Figure 3B, 21 days after transfection with MN expressing vector vector in 293-H cells. HGT frequencies were normalized to plating efficiencies, e.g. 0.33.
mMeasured in percentage of cells, as shown in Figure 3B, 21 days after mock transfection in 293-H cells HGT frequencies were normalized to plating efficiencies, e.g. 0.33.
Figure 3.Characterization of MNs efficacy on their chromosomal cognate target. (a) A chromosomal assay for the characterization of MN-induced TM. MN expressing vectors are transfected into 293-H cells; the targeted locus is amplified by PCR and 10 000 molecules (on average) are sequenced to detect indels. (b) A chromosomal test for the detection of MN-induced HGT. MN-expressing vectors are co-transfected into 293-H cells together with a repair matrix; cells are grown in 96-well plates, and targeted alleles are detected by PCR. (c) Comparison between the TM and HGT assays in 293-H cells. Eighteen MNs were characterized in both assays (Table 1). Dotted grey curves represent 95% confidence interval for the regression line. Linear regression fits the formula y = 5.46x − 0.66. r, Pearson coefficient (linear correlation coefficient); ρ, Spearman coefficient (non-linear correlation coefficient); N, sample size; P: probability of finding a given correlation when the underlying variables are not correlated. (d) Comparison between the TM and HGT frequencies measured simultaneously by deep sequencing. Sixteen independent experiments were conducted, using different MNs (Rag1m, ADCY9m, CLS3676m, CLS3759m, CLS3894m, CLS4076m and CLS4607m), promoters (SV40 or CMV) and vector concentrations, all described in Supplementary Table S2. Dotted grey curves represent 95% confidence interval for the regression line. Linear regression fits the formula y = 1.71x + 0.20. r, ρ, P and N have the same meaning as on the previous panel.
Figure 4.The efficacy of MNs on their chromosomal target is poorly correlated with the activity of the nucleases in extrachromosomal assays. Throughout the figure, r, Pearson coefficient (linear correlation coefficient); ρ, Spearman coefficient (non-linear correlation coefficient); N, sample size; P, probability of finding a given correlation when the underlying variables are not correlated. (a) Comparison between the extrachromosomal SSA assay in CHO-KI and the TM assay in HEK293 cells. Thirty-seven MNs cleaving 39 targets were tested in both assays (Table 1). Correlation was also made with a smaller sample of 18 MNs (black circles), which are the same as the 18 MNs characterized in the HGT assay (Figure 3b), and displayed on Figure 3c. Vertical grey lines represent the activity levels of I-SceIm (left line) and Rag1m (right line). (b) Correlation between the extrachromosomal SSA assays in 293-H cells and the TM assay in 293-H cells. (c) Correlation between the extrachromosomal SSA assays in CHO cells and the HGT assay in 293-H cells. (d) Correlation between the extrachromosomal SSA assays in 293-H cells and the HGT assay in 293-H cells.
Figure 5.Impact of position effect on MNs efficacy. (a) The Rag1m MNs as well as Iso1Rag1m and Iso2Rag1m, two isoschizomers were tested for activity in our extrachromosomal assay in CHO-KI cells, for toxicity, and for their ability to induce targeted insertion at the endogenous Rag1 locus. Activity, efficacy and toxicity assays were conducted as described in Figures 2 and 3. (b) Similar studies were conducted with the DMD21m MN and Iso1DMDm and Iso2DMDm cleaving the DMD21t target. (c) The CLS3902m MN cleaves a sequence found in four copies in the human. CLS3902m was tested for activity, toxicity and efficacy of targeted mutagenesis at three of its cognate targets, CLS3902t_5, CLS3902t_7 and CLS3902t_14. In A, B and C, each value is the average of two independent experiments but for Rag1m (n = 7) and DMD21 (n = 3). (d) CHART-PCR assay for micrococcal nuclease accessibility of the Rag1t, DMD21t, CLS3902t_5, CLS3902t_7, CLS3902t_14, CLS4076t, CAPNS1t, ADCY9t, CLS3759t and XPC4t loci. The GAPDH and PAX7 loci are used as controls for accessible and non accessible chromatin, respectively (37). Nuclei were isolated and treated with various amounts of micrococcal nuclease, and for each locus, the amount of non digested DNA was monitored by Q-PCR as described in materials and methods. Data are normalized to the value obtained in absence of nuclease, and each point is the average of four measurements from two independent experiments. (e) Comparison between the micrococcal nuclease accessibility assay and the TM assay at Day 2. The values of panel d were used to construct a curve, and a nuclease resistance index (AUC) was calculated as the AUC, based on the same principle as activity and cell survival index calculations (Figure 2b and h). Efficacy of TM at Day 2 was plotted against this resistance index, using a semi-logarithmic scale. Here, r represents the linear correlation coefficient between log(TM) and AUC. r, Pearson coefficient (linear correlation coefficient); ρ, Spearman coefficient (non-linear correlation coefficient); N, sample size; P, probability of finding a given correlation when the underlying variables are not correlated. (f) Comparison between the micrococcal nuclease accessibility assay and the TM assay at Day 7. Here, r represents the linear correlation coefficient between log(TM) and AUC.