| Literature DB >> 24452803 |
Shannon J Sirk1, Thomas Gaj, Andreas Jonsson, Andrew C Mercer, Carlos F Barbas.
Abstract
The serine recombinases are a diverse family of modular enzymes that promote high-fidelity DNA rearrangements between specific target sites. Replacement of their native DNA-binding domains with custom-designed Cys₂-His₂ zinc-finger proteins results in the creation of engineered zinc-finger recombinases (ZFRs) capable of achieving targeted genetic modifications. The flexibility afforded by zinc-finger domains enables the design of hybrid recombinases that recognize a wide variety of potential target sites; however, this technology remains constrained by the strict recognition specificities imposed by the ZFR catalytic domains. In particular, the ability to fully reprogram serine recombinase catalytic specificity has been impeded by conserved base requirements within each recombinase target site and an incomplete understanding of the factors governing DNA recognition. Here we describe an approach to complement the targeting capacity of ZFRs. Using directed evolution, we isolated mutants of the β and Sin recombinases that specifically recognize target sites previously outside the scope of ZFRs. Additionally, we developed a genetic screen to determine the specific base requirements for site-specific recombination and showed that specificity profiling enables the discovery of unique genomic ZFR substrates. Finally, we conducted an extensive and family-wide mutational analysis of the serine recombinase DNA-binding arm region and uncovered a diverse network of residues that confer target specificity. These results demonstrate that the ZFR repertoire is extensible and highlights the potential of ZFRs as a class of flexible tools for targeted genome engineering.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24452803 PMCID: PMC3985619 DOI: 10.1093/nar/gkt1389
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of the small serine recombinases. (A) (Top) Crystal structure of the γδ resolvase dimer bound to target DNA (PDB ID: 1GDT) (20). ‘Left’ and ‘right’ recombinase monomers are colored light and dark teal, respectively. DBD indicates native DNA-binding domain. Linker and arm region are labeled for the ‘right’ recombinase monomer only. (Bottom) Core sequence recognized by the γδ resolvase catalytic domain. Base positions are indicated. (B) Sequence alignment of six of the most comprehensively characterized serine recombinase catalytic domains. Conserved residues are highlighted light teal. The α-helical and β-sheet secondary structural elements are denoted above the alignment as cylinders and arrows, respectively.
The prototypical serine recombinases and their incorporation into ZFRs
| Recombinase | Organism | Native function | Target site | Core sequence | Activating mutation(s) | Used as ZFR |
|---|---|---|---|---|---|---|
| γδ | Resolvase | D102Y, E124Q | N/A | |||
| Tn3 | Resolvase | G70S, D102Y, E124Q | Ref. | |||
| Gin | Invertase | H106Y | Ref. | |||
| Hin | Phage Mu | Invertase | H107Y | Ref. | ||
| β | Resolvase/invertase | N95D | Present work | |||
| Sin | Resolvase | Q87R, Q115R | Present work |
Dinucleotide cores (e.g. crossover regions) are underlined. Core sequence half-site positions 10-7, 6-4, 3-2, and the dinucleotide core are separated by spaces.
Figure 2.Directed evolution of enhanced β and Sin catalytic domains. (A) Schematic representation illustrating the split gene reassembly selection strategy. ZFR variants are shown in various colors; β-lactamase gene is in orange and GFPuv gene is in white. (B) Selection of β and Sin variants that recombine minimal core sites from the six and resH recombination sites, respectively. (C, D) Frequency and position of the mutations that activate the (C) β and (D) Sin catalytic domains. Highly recurrent mutations are indicated. (E, F) Crystal structure of the activated Sin-Q115R tetramer; view of dimer interface from above the N-terminus of the E helix (PDB ID: 3PKZ) (51). Highly recurrent (E) β and (F) Sin mutations shown as sticks and mapped onto the rotated Sin dimer, residues labeled on upper monomer only. Sulfate ion shown as spheres. (G) Recombination activity of β-N95D and Sin-Q87R/Q115R on the 20B, 20S, 20G and 20T core sequences. Recombination was determined by split gene reassembly. Error bars indicate standard deviation (n = 3).
Recombination by selected β and Sin catalytic domains
| Recombinase | Mutations | Core sequence | |||
|---|---|---|---|---|---|
| 20B | 20S | 20G | 20T | ||
| β | None | −− | −− | −− | |
| M94V | +++ | −− | −− | ||
| N95D | ++++ | −− | −− | ||
| M94T, R104H | + | −− | −− | ||
| M94V, N107S | ++++ | −− | − | ||
| V58A, N95D | + | −− | −− | ||
| M94T, N95D | +++ | −− | −− | ||
| E71G, M94V, N95D | ++ | −− | −− | ||
| N68S, E71G, V88A, N95D | +++ | −− | −− | ||
| K33R, N49S, E71G, N95D | +++ | −− | −− | ||
| R18P, R41P, D55G, R67G, E71G, M94I, N107K, Y114N | +++ | −− | −− | ||
| Sin | None | −− | −− | −− | |
| I2V | ++ | −− | −− | ||
| Q115R | + | −− | −− | ||
| Q87R, Q115R | ++++ | −− | −− | ||
| T32A, N97D, Q115R | + | −− | − | ||
| I11V, D12N, V78A, Q115R | ++++ | −− | − | ||
| I11V, D12N, V78A, Q115R, L150P | + | −− | −− | ||
| T77I, D85G, K110R, I133V, V138I | + | −− | −− | ||
| I11T, V61A, D85G, K110R, I113T, Q115R | ++ | −− | −− | ||
| I64A, V78A, K84R, I90V, I113T, Q115R, Q137R | ++++ | −− | − | ||
| V53A, E76G, E83G, D85G, V99A, N102S, I113S, Q115R | ++++ | −− | − | ||
Symbols indicate recombination efficiency. ++++, >35% recombination; +++, 20–35%; ++, 6–19%; +, 1–5%; −, <0.1%; −−, <0.01%. The limit of detection of recombination by split gene reassembly is ∼10−5%. All Sin variants are derived from the I100T background strain.
Figure 3.Specificity of the β-N95D catalytic domain. (A) Schematic representation illustrating the genetic screen used to profile recombinase specificity. Recombinase substrate library shown in various colors; ZFR gene is in purple, β-lactamase gene is in orange and GFPuv gene is in white. (B) Randomization strategy used for specificity profiling. Randomized bases are boxed. Note that only ‘left’ half-site of the upstream ZFR target site contained base substitutions. (C and D) Recombination by (C) β-N95D and (D) Sin-Q87R/Q115R for each 20B and 20S core site library, respectively, at 6 and 16 h. (E) Number of selected base sequences (out of 30) at each position within the 20B half-site. Thirty clones were sequenced from each 6-h library output. Recombination was determined by split gene reassembly. Error bars indicate standard deviation (n = 3).
β-Mediated recombination of core sequences derived from the human genome
| Target site | Gene | Core sequence | Recombination (%) |
|---|---|---|---|
| β wild-type | 44 ± 9 | ||
| β consensus | ND | ||
| β degenerate | ND | ||
| β-AT 1 | CNGB3 | 92 ± 63 | |
| β-AT 2 | CNGB3 | 37 ± 34 | |
| β-AT 3 | Factor VIII | 36 ± 9 | |
| β-AT 4 | CNGB3 | 32 ± 10 | |
| β-AT 5 | Factor VIII | 27 ± 4 | |
| β-AT 6 | BCR | 3.2 ± 1 | |
| β-TT 1 | RPE | 0.07 ± 0.008 | |
| β-TT 2 | CNGA3 | 0.03 ± 0.02 | |
| β-TT 3 | CNGB3 | 0.03 ± 0.01 | |
| β-TT 4 | CNGB3 | 0.02 ± 0.01 | |
| β-TT 5 | CNGB3 | 0.01 ± 0.003 | |
| β-AA 1 | CNGB3 | 0.01 ± 0.004 | |
| β-AA 2 | Factor IX | <0.01 | |
| β-TA 1 | Factor VIII | <0.01 | |
| β-TA 2 | CNGB3 | <0.01 | |
| β-TA 3 | Factor VIII | <0.01 | |
| β-TA 4 | CNGB3 | <0.01 |
Dinucleotide core composition is denoted in target site (e.g. β-AT 1 contains an AT dinucleotide core). Positions 10-7, 6-4, 3-2, and the dinucleotide core are separated by spaces. Base mismatches between genomic and wild-type core sequences are underlined. Recombination was measured by split gene reassembly. ND indicates not determined. Error values indicate standard deviation (n = 3). Abbreviations for nucleotide substitutions are as follows: N = A, T, C, or G; V = A, C, or G; B = T, C, or G; R = A or G; Y = T or C; W = A or T.
Figure 4.Alanine-scanning mutagenesis of the serine recombinase arm region. (A–C) Recombination activity of mutant (A) Tn3, (B) Gin and (C) β catalytic domains on their native and minimal DNA targets. Asterisk indicates <0.0001% recombination. Dotted lines indicate threshold below which mutants were considered non-functional. (D) Crystal structure of the γδ resolvase arm region (sticks) in contact with substrate DNA (gray surface). Conserved and variable residues important for recombination are shown in red and purple, respectively. Inert residues are shown in yellow (PDB ID: 1GDT) (20). (E) Recombination by a Gin chimera substituted with residues predicted to impart specificity onto the 20T core site. Recombination was determined by split gene reassembly. Error bars indicate standard deviation (n = 3).