| Literature DB >> 21288879 |
Umut Y Ulge1, David A Baker, Raymond J Monnat.
Abstract
Homing endonucleases (HEs) cleave long (∼ 20 bp) DNA target sites with high site specificity to catalyze the lateral transfer of parasitic DNA elements. In order to determine whether comprehensive computational design could be used as a general strategy to engineer new HE target site specificities, we used RosettaDesign (RD) to generate 3200 different variants of the mCreI LAGLIDADG HE towards 16 different base pair positions in the 22 bp mCreI target site. Experimental verification of a range of these designs demonstrated that over 2/3 (24 of 35 designs, 69%) had the intended new site specificity, and that 14 of the 15 attempted specificity shifts (93%) were achieved. These results demonstrate the feasibility of using structure-based computational design to engineer HE variants with novel target site specificities to facilitate genome engineering.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21288879 PMCID: PMC3105429 DOI: 10.1093/nar/gkr022
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.mCreI and DNA target site structure. (A) mCreI is a monomerized version of the native I-CreI homodimeric LAGLIDADG homing endonuclease protein shown here (30). (B) Sequence of the native I-CreI/mCreI target site with positions of phosphodiester backbone cleavages in the top (downward filled triangle) and bottom (upward filled triangle) strands indicated on the sequence of the top strand of native target site DNA. Target site cleavage across the minor groove leaves 4-base, 3′-OH extended cohesive ends. Note the partially palindromic nature of the native target site sequence. (C) mCreI makes both direct and water-mediated contacts with target site DNA predominantly via amino acid residues located predominantly in β-sheets that lie in the major groove of target site DNA. The contact map is an updated and redrawn version of the original contact map shown in (34).
Figure 2.In vitro cleavage assay. (A) In vitro ‘bar code’ cleavage assay developed to determine simultaneously cleavage specificity and activity for all four base pair possibilities at single target site positions in a single tube/single gel lane assay. Four primer pairs were used to amplify cleavage substrates from target site plasmids with the target site at the center of the resulting PCR fragment (left panel). Fragment lengths specify the base pair at the target site query position (ranging from 2200 base pairs for ‘A’ sites to 1320 base pairs for ‘T’ sites; right panel). (B) Pools of four substrates were cleaved in a single tube digest prior to separating substrates and cleavage products in a single lane of an agarose gel. Cleavage of substrate molecules at the centrally located mCreI target site generates two equal length cleavage products and a ‘bar code’ linking substrate and cleavage band intensities that reports cleavage activity and specificity simultaneously for all four base pair possibilities at single target site base pair positions. Examples of base-specific (central panel) and non-specific or degenerate cleavage patterns (right panel) are depicted.
Figure 3.Graphical representation of mCreI computational design output. A library of 3200 mCreI designs was generated using RD against all 4 base pair possibilities at each target site position from ±3 to ±11 (see text). The RD-predicted specificities and energies of 117 designs are plotted that represent the most energetically stable or the most specific of the 50 designs generated for each design target. Only a single design is plotted for instances in which the most stable and most specific design were the same. Experimentally validated design specificities are represented by squares labeled with the design base pair and position. Useful designs for target site positions ±11 did not emerge and are not represented, nor is a design that cleaved −8C that was an unanticipated—albeit a sequence-specific—outcome of an attempt to design for −8G (Table 1, Design 12).
Summary of experimentally validated mCreI computational designs
Cleavage activity and specificity are shown as ‘−’ for no activity/specificity; ‘+’ for reduced activity/specificity; ‘++’ for levels comparable to native mCreI; and ‘+++’ for greater than native levels. Activity and specificity assessments were made at 20 nM protein concentrations with the exception of designs against native base pairs (e.g. Designs 1, 2, 20 and 21) where specificity and activity were assessed using data across all protein concentrations tested (20–66 nM). The most successful designs for each novel specificity are shown in bold, with Class designations given to the right (see text). Cleavage and specificity data used to prepare Table 1 are summarized in Supplementary Figure S1. The ‘Design specificity’ column lists the native base pair specificity of mCreI followed by the target base pair position number and the design specificity to the right. Color shading of design substitutions indicates residue substitutions previously identified by Seligman and colleagues (yellow-boxed substitutions) (7,12), or by Pâques and colleagues (magenta-boxed substitutions) (9,10,15). Both of these groups used structure-guided random mutagenesis at selected positions in the native I-CreI DNA interface. The blue-boxed substitutions of Gao and colleagues (17) were generated after visual inspection of the I-CreI structure. Of note, our Design 4 32R and 33H substitutions were not combined by Seligman and colleagues and 30D and 28D in Designs 10 and 15 both appeared as glutamates in previous work in contrast to our computationally predicted aspartates. Color shading of design substitutions indicates residue substitutions previously identified by Seligman and colleagues (yellow-boxed substitutions) (7,12), or by Pâques and colleagues (magenta-boxed substitutions) (9,10,15). Both of these groups used structure-guided random mutagenesis at selected positions in the native I-CreI DNA interface. The blue-boxed substitutions of Gao and colleagues (17) were generated after visual inspection of the I-CreI co-crystal structure.
aDesign specificity is indicated by the native base at the numbered design position followed by the design base at that position.
bCleavage activity of native mCreI is defined as ‘++’, whereas cleavage specificity equivalent to native mCreI at a given position is defined as ‘++’. Class designations encompass both cleavage activity and specificity, with Class I designs being more specific, Class II as specific albeit altered, and Class III designs as less specific/more relaxed than native mCreI.
cDesign 12 was originally directed at −8C but was found to have a different specificity, for −8G, when characterized.
Figure 7.Predicted and experimentally determined cleavage specificities of successful novel mCreI variants. Fourteen designs are shown color coded by Class of outcome and as a function of their RD-predicted and experimentally determined cleavage specificities (see text). Designs predicted to be more specific typically had higher observed specificities, although the relationship between predicted and observed specificities was modest (R2 value of ∼0.5). Observed specificities were calculated by quantifying the cleavage product intensity in gel images (Supplementary Figure S1), then dividing the intensity of the intended cleavage band by the sum of intensities of all cleavage products at the most specific enzyme concentration (usually 20 nM) (see Supplementary Table S1 for quantified cleavage band intensities). Again, as in Figure 3, a design with novel, but unintended, specificity for −8C has been excluded from the plot.
Figure 4.Designs with enhanced cleavage specificity at a degenerate target site position. (A) Native mCreI cleaves all four base pair possibilities at the −8 target site position. (B) This lack of specificity reflects the presence of a single water-mediated bond from 28K to −8A. (C) Design 11, in contrast, cleaves only −8G even at high enzyme concentrations. This and comparable designs with enhanced specificities are referred to as Class I designs (see text). (D) The enhanced specificity of Design 11 appears to reflect the ability of residue substitutions to specify a G:C base pair at this position: 40T→R donates two hydrogen bonds to guanine, and 28K→D accepts a hydrogen bond from the complementary −8C. Neither of these interactions was possible with the native target site A:T base pair. Native amino acid residues and the native target site base pair are shown in yellow, design residue substitutions and variant target site base pairs in green, and water molecules as blue spheres (not to scale). The structure of the native enzyme bound to native target site DNA is from the co-crystal structure of I-CreI determined by Chevalier and colleagues (PDB ID 1G9Y). The corresponding structures for designs were computationally generated molecular models.
Figure 5.Designs with altered cleavage specificity at a target site position. (A) Native mCreI preferentially cleaves target sites with −5G, and to a lesser extent −5A followed by −5C or T. (B) Recognition of the −5 position by native mCreI is mediated by 2 contacts made by residue 68R to −5G, and non-polar contact of 24I with the complementary C. (C) Design 28 cleaves −5C to near-completion even at 20 nM, with minor activity on −5T. (D) Recognition of −5C in Design 28 is mediated by 24I→K that contacts to the complementary G at −5, and by 68R→T that prevents potentially deleterious contacts with −5C. Designs with comparably altered specificities are referred to as Class II designs.
Figure 6.Designs that selectively broaden mCreI cleavage specificity. (A and D) Native mCreI cleaves −6C and to a lesser extent −6T at low protein concentrations, whereas Design 22 permits nearly equal cleavage of the −6G design target base pair as well as −6C or T. (B and C) The 26Q sidechain in native mCreI contacts the guanine complementary to −6C (B), but cannot make productive contacts with −6G either from 77I or 26Q (C). (E) In Design 22, 77R contacts both the −6G design target base as well as the adjacent −7A. (F) In the presence of the native −6C:G base pair, 77R pivots to make less favorable contacts to −7A and −8A that permit cleavage but with little base selectivity. Designs with comparably broadened specificities are referred to as Class III designs.