| Literature DB >> 16971456 |
Laura E Rosen1, Holly A Morrison, Selma Masri, Michael J Brown, Brendan Springstubb, Django Sussman, Barry L Stoddard, Lenny M Seligman.
Abstract
Homing endonucleases are highly specific enzymes, capable of recognizing and cleaving unique DNA sequences in complex genomes. Since such DNA cleavage events can result in targeted allele-inactivation and/or allele-replacement in vivo, the ability to engineer homing endonucleases matched to specific DNA sequences of interest would enable powerful and precise genome manipulations. We have taken a step-wise genetic approach in analyzing individual homing endonuclease I-CreI protein/DNA contacts, and describe here novel interactions at four distinct target site positions. Crystal structures of two mutant endonucleases reveal the molecular interactions responsible for their altered DNA target specificities. We also combine novel contacts to create an endonuclease with the predicted target specificity. These studies provide important insights into engineering homing endonucleases with novel target specificities, as well as into the evolution of DNA recognition by this fascinating family of proteins.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16971456 PMCID: PMC1635285 DOI: 10.1093/nar/gkl645
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1I-CreI/DNA contacts. Direct protein/DNA hydrogen bonds are red; water molecules and water-mediated hydrogen bonds are blue. Palindromic bases are green; non-palindromic are white. Scissile phosphate groups are indicated with black dots.
CreI mutants analyzed in vivo
| Amino acid substitution(s) | Sites assayed |
|---|---|
| Q44X | ±4 sites |
| R68X | ±5 sites |
| K28X | ±7 sites |
| Q26C/Y66R/T42 to R, S, Q, C, K, N, E, H, F; Q26C/T42E | G:C ±6 |
| T42E/Y66R | WT and C:G and T:A ±6 |
| Q26A/T42E/Y66R, Q26A/T42E | A:T ±6 |
| N30 to A, Q, C, G; Q38 to A, E, G, R; N30A/Q38 to A, C, I, K, L, M, N, P, R, S, T; N30R/Q38A, N30S/Q38A, N30G/Q38A, N30G/Q38Y, N30S/Q38R, N30T/Q38R, N30C/Q38R, N30L/Q38R, N30E/Q38K, N30S/Q38K, N30R/Q38T, N30R/S32G/Q38A, N30R/S32G/Q38Y, N30G/S32Q/Q38A, N30G/S32Q/Q38K, N30R/S32Q/Q38A, N30P/S32L/Q38A | ±9 sites |
aIndicates all 19 mutants with single amino acid substitutions at that I-CreI position.
bThe wild-type I-CreI site (WT), as well as the three symmetrically altered mutant target sites at that position.
cThe wild-type I-CreI site, as well as the four symmetrically altered mutant target sites at ±7.
Data processing and refinement statistics
| Protein | K28R | Q44V |
|---|---|---|
| Space group | P2(1) | P2(1) |
| Cell parameters (Å) | a = 43.1 | a = 43.1 |
| b = 68.2 | b = 68.0 | |
| c = 87.5 | c = 87.5 | |
| γ = 91.4 | γ = 91.5 | |
| Resolution | 2.3 Å | 2.3 Å |
| Redundancy | 3.3 | 2.2 |
| Completeness (%) | 96.0 (93.9) | 85.4 (80.5) |
| Average I/σ(I) | 32.7 (14.7) | 25.8 (10.3) |
| 3.9 (9.7) | 3.9 (9.1) | |
| 25.0 | 23.5 | |
| 30.2 | 27.9 | |
| Ramachandran plot (% of modeled residues) | ||
| Most favored | 87.4 | 87.4 |
| Additionally allowed | 12.6 | 12.6 |
| Disallowed | 0.0 | 0.0 |
| Average B(Å2) (protein, DNA) | 22.3 | 25.2 |
aOuter resolution bin 2.34–2.30.
Figure 2In vivo endonuclease assays. Colony phenotypes resulting from transforming indicated I-CreI derivatives into cells harboring the indicated target site mutants (13). Increased proportion of white cells within colonies indicates increased site affinity. The Q26C/Y66R mutant has been described previously (14).
Active endonuclease derivatives identified
| Target site | Active I-CreI derivatives |
|---|---|
| A:T ±4 | Q44A, Q44C, Q44T, Q44W, Q44V |
| A:T, C:G and T:A ±7 | K28R |
| C:G ±9 | Q38G, N30R/Q38A, N30G/Q38A, N30G/Q38Y, N30R/Q38T, N30R/S32G/Q38A, N30R/S32G/Q38Y |
| G:C ±9 | N30A/Q38R, N30S/Q38R, N30T/Q38R, N30E/Q38K, N30G/S32Q/Q38A, N30G/S32Q/Q38K |
| Wild-type | Q44A, Q44C, Q44I, Q44L, Q44M, Q44S, Q44T, Q44V, Q44W, R68K, K28R, N30G, N30A, Q38G, Q38A, N30G/Q38A, N30R/S32G/Q38A |
aAll novel interactions identified displayed some level of in vivo sectoring; examples are shown in Figure 2.
Cognate versus non-cognate cleavage efficiencies
| Target site | I-CreI mutant | Wild-type cognate: non | Mutant cognate: non | Specificity shift | Specificity broadening |
|---|---|---|---|---|---|
| A:T ±4 | Q44V | 3.4 | 2.7 | 9.4 | 0.78 |
| G:C ±6 | Q26C/T42E/Y66R | 8.0 | 2.9 | 23.5 | 0.37 |
| T:A ±7 | K28R | 5.3 | 0.8 | 4.2 | 0.15 |
| C:G ±9 | N30R/S32G/Q38Y | 2.6 | 3.2 | 8.3 | 1.26 |
| G:C ±9 | N30S/Q38R | 2.6 | 2.4 | 6.3 | 0.90 |
| G:C ±9 | N30G/S32Q/Q38K | 2.6 | 2.1 | 5.6 | 0.81 |
| G:C ±10 | Y33R | 4.0 | 1.5 | 6.1 | 0.38 |
| A:T ±4 G:C ±10 | Y33R/Q44V | >450 | 3.2 | >1440 | <0.007 |
aIndicates the ratio of enzyme concentrations yielding 50% cleavage of cognate to non-cognate target sites for the enzyme/site combination indicated. Standard deviations were all <20%, with the exception of the wild-type I-CreI versus A:T ±4 G:C ±10 assay which yielded values of 456, 1622 and 2488; the larger variation in this series is presumably due to the large enzyme dilutions required to achieve 50% cleavage of both wild-type and mutant sites in these assays.
bProduct of wild-type and mutant cognate to non-cognate cleavage efficiencies, as described previously (14).
cRatio of mutant cognate to non-cognate/wild-type cognate to non-cognate cleavage efficiencies, as described previously (14). Values greater than 1 indicate higher specificity, while values of less than 1 indicate broader specificity.
dFrom (14).
Figure 3Relative cleavage of cognate and non-cognate targets. Wild-type (left) and Q44V (right) endonucleases were exposed to linearized plasmids containing wild-type and A:T ±4 homing sites. The left lanes in each gel (upper) indicate the enzyme concentration required for complete digestion of each substrate. Serial 2-fold dilutions were performed, and the relative enzyme concentrations required for 50% cleavage of each substrate determined (lower).
Figure 4Structures and interactions of cognate pairs at residue 44 and base pair ±4 and residue 28 and base pair ±7. Protein/DNA contacts between wild-type I-CreI and its binding site are shown in (A and C). Contacts between Q44V and K28R and their cognate sites are shown in (B and D), respectively. The sequences of wild-type I-CreI target sequence and the alternate target sequences are shown below their corresponding structures. Dashed green lines represent hydrogen bonding and dashed magenta lines represent van der Waals interactions.