| Literature DB >> 25004980 |
Stefano Stella1, Rafael Molina1, Blanca López-Méndez2, Alexandre Juillerat3, Claudia Bertonati3, Fayza Daboussi3, Ramon Campos-Olivas2, Phillippe Duchateau3, Guillermo Montoya1.
Abstract
DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein-DNA interactions in protein scaffolds is key to providing `toolkits' for precise genome modification or regulation of gene expression. In a search for putative DNA-binding domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix-loop-helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing.Entities:
Keywords: gene targeting; genetics; protein–DNA interaction
Mesh:
Substances:
Year: 2014 PMID: 25004980 PMCID: PMC4089491 DOI: 10.1107/S1399004714011183
Source DB: PubMed Journal: Acta Crystallogr D Biol Crystallogr ISSN: 0907-4449
Figure 1BurrH recognizes its target DNA with high affinity and specificity in an endothermic reaction. (a) Scheme of the BurrH domain structure. The central DNA-binding domain contains the BuD repeats with the residues involved in DNA recognition (BSRs; Supplementary Figs. S1 and S2). The sequence of the coding (the strand defined by the single amino acid-to-nucleotide correspondence) and noncoding (the complement of the coding strand) strands of the oligonucleotide used in the biophysical characterization and crystallization is depicted below. The BurrH target sequence is shown in bold. (b) ITC binding curves of BurrH. The protein specifically recognizes its double-strand (ds) DNA target. BurrH is not able to bind DNA duplexes with other sequences or single-strand (ss) DNAs containing its target sequence. (c) ITC binding curves of BurrH using DNA-RNA hybrids and RNA duplexes as targets. (d) ITC binding curves of BurrH-based variants display the same thermodynamic behaviour as the wild-type protein (see Supporting Information and Supplementary Fig. S4). (e) Table summarizing the K d values of the ITC analysis. The affinities of the redesigned variants are similar to the wild-type protein except for Var2. (f) SPR analysis of BurrH target binding compared with AvrBs3 TALE. The BuD array presents a fast association and low dissociation behaviour (see Supplementary Fig. S5). In both cases 12.5 nM protein was flowed over the chip for 95 s. Mono exponential fits are shown in black for the curves (Supplementary Fig. S5). (g) On–off rate map showing the values of the association and dissociation rate constants and the resulting affinity as obtained from SPR. Dashed diagonals represent different K d values (indicated on the upper and right axes). Positions along the same diagonal have the same K d values but different k on and k off values.
Data-collection, phasing and refinement statistics
Values in parentheses are for the highest resolution shell. One crystal was used to solve each structure.
| Apo BurrH | BurrHDNA | |
|---|---|---|
| Data collection | ||
| Space group |
|
|
| Unit-cell parameters (, ) |
|
|
| Wavelength () | 0.98 | 1.00 |
| Resolution () | 46.082.21 (2.332.21) | 47.922.65 (2.792.65) |
|
| 0.11 (0.42) | 0.07 (0.61) |
|
| 0.13 (0.49) | 0.08 (0.72) |
| No. of reflections | 79917 | 28134 |
| Mean | 7.0 (2.4) | 12.1 (1.7) |
| Completeness (%) | 98.7 (99.8) | 97.7 (99.9) |
| Multiplicity | 3.5 (3.4) | 3.4 (3.4) |
| SAD phasing | ||
| No. of Se sites found | 12/12 | |
| FOM | 0.45 | |
| Phasing power | 1.8 | |
| Refinement | ||
| Resolution () | 63.502.21 | 38.662.65 |
| No. of reflections | 79812 | 27719 |
|
| 0.18/0.23 | 0.20/0.27 |
| No. of molecules in asymmetric unit | 2 | 1 |
| No. of atoms | ||
| Protein | 10970 | 5491 |
| Ligand/ion | 0 | 936 |
| Water | 953 | 129 |
| R.m.s. deviations | ||
| Bond lengths () | 0.003 | 0.008 |
| Bond angles () | 0.712 | 1.366 |
| Average | 39.02 | 65.54 |
| Ramachandran plot | ||
| Favoured (%) | 99.73 | 93.48 |
| Allowed (%) | 0.27 | 6.13 |
| Outliers (%) | 0.00 | 0.40 |
R merge is defined according to Kabsch (2010 ▶).
Figure 2Crystal structures of BurrH and the BurrH–DNA complex. (a) Crystal structures of apo and DNA-bound BurrH (2.21 and 2.65 Å resolution, respectively). Cartoon representation of the crystal structures perpendicular to the longitudinal DNA axis (left panel) and along the DNA helix (right panel). The helical elements of BurrH are shown as cylinders and the duplex oligonucleotide is represented in stick mode. (b) Ribbon diagram of a BuD repeat. The side chains of the key residues (Supplementary Fig. S2) are shown in stick mode, including their positions in the repeat. Hydrophobic amino acids (Phe, Ile, Val and Leu) are coloured light blue, Gln magenta, Lys orange and the invariant Asn green.
Figure 3Detailed view of BurrH–DNA binding and the new BSR interactions. (a) Inter-repeat hydrophobic cluster built by four of the strictly conserved amino acids upon DNA binding. (b) General view of the protein–DNA association depicting the arrangement of the conserved polar stripes (composed of Lys/Arg and Gln at positions 8 and 17 of the BuD repeats, respectively) stabilizing the phosphate backbone of the noncoding and coding DNA strands. (c) Recognition of A+4 by Thr193 in the fourth BuD repeat. (d) Detailed view of the interaction of Arg490 with the duplex DNA establishing key interactions with both DNA stands. The electron-density map for all of the figures is a 2F o − F c σA-weighted map contoured at 1.2σ.
Figure 4Engineered BuDNs can target a DNA sequence in a cellular scenario. (a) Nuclease activity of BuDN towards its homodimeric target in yeast. Upon mating, the BuDNs generate a double-strand break at the site of interest, allowing the restoration of a functional lacZ gene by single-strand annealing (SSA), enabling the generation of a blue colour in the presence of X-Gal. The colour was quantified and scored as an Afilter value, a parameter correlated to the nuclease activity. (b) Sketch of the BuDN design (see Supporting Information). A BuD array (cyan) targeting the desired DNA sequence was fused to FokI similarly to an AvrBs3-based TALEN (purple). (c) A pair of BuDNs targeting the AvrBs3 sequence (Bs3) was built to compare its activity with AvrBs3-based TALEN. The different DNA targets used in the assay are shown. The Bs3 DNA contains two identical Bs3 binding sites in opposite orientations separated by a 15 bp DNA spacer. Bs3 A11′G C17′T and C15′A T18′C DNAs contain two base-pair substitutions each in only one of the Bs3 binding sites. (d) Nuclease activity of the BuDNs and TALEN towards the DNA targets. The grey dashed line indicates the experimental background level. (e) Comparison of the nuclease activity of both scaffolds towards the same target at different temperatures. BuDNs are sensitive to variations in the target sequence, while TALEN seem to ignore the mutations in the DNA. The background level has been subtracted from the histograms. The obtained values are an average of three independent experiments. See Supporting Information for a detailed description of the nucleases.
Figure 5BuDNs targeting the HBB gene are accurate and highly active. (a) Two BuD arrays targeting a DNA region within the HBB gene near a locus known to be responsible for sickle-cell anaemia were generated. The arrays were fused to a FokI domain and transfected into HEK293 human cells (see Methods ). (b) The efficiency of the double-strand breaks induced by the BuDNs was monitored using the T7 endonuclease assay. (c) The genomic DNA was also analyzed by deep sequencing. The most representative indels identified are reported in the table. Insertions are depicted in red and deletions by dashes. (d) Table quantifying the targeted mutagenesis events at the HBB locus by deep sequencing (see Supporting Information). (e) Targeted gene insertion (TGI) frequency determined at the HBB locus in the presence of the donor DNA transfected with or without BuDNs. See Supplementary Fig. S11 and Supporting Information for a detailed description.