| Literature DB >> 24808958 |
Christian Bach1, William Sherman2, Jani Pallis3, Prabir Patra1, Hassan Bajwa4.
Abstract
Zinc finger nucleases (ZFNs) are associated with cell death and apoptosis by binding at countless undesired locations. This cytotoxicity is associated with the binding ability of engineered zinc finger domains to bind dissimilar DNA sequences with high affinity. In general, binding preferences of transcription factors are associated with significant degenerated diversity and complexity which convolutes the design and engineering of precise DNA binding domains. Evolutionary success of natural zinc finger proteins, however, evinces that nature created specific evolutionary traits and strategies, such as modularity and rank-specific recognition to cope with binding complexity that are critical for creating clinical viable tools to precisely modify the human genome. Our findings indicate preservation of general modularity and significant alteration of the rank-specific binding preferences of the three-finger binding domain of transcription factor SP1 when exchanging amino acids in the 2nd finger.Entities:
Year: 2014 PMID: 24808958 PMCID: PMC3997970 DOI: 10.1155/2014/970595
Source DB: PubMed Journal: Biotechnol Res Int ISSN: 2090-3146
Figure 1Normal HBB gene retrieved from NCBI website.
Figure 2Mutated HBB diseased gene. Normal HbA target sequence versus single point mutation of diseased HbS gene and target sequence of a three-finger binding domain.
The 64 triplets of the 2nd finger are divided into four 16-triplet series starting with A, C, G, and T.
|
| |
|
| |
|
| |
|
|
P32-labeled double-stranded oligonucleotide; 58-base pairs.
| 5′GTCGGATCCTGTCTGAGGTGAGTTGGG |
Figure 4(a) Band shifts (SP1 protein-DNA P32 oligonucleotide complex). (b) Results from Phosphor Imager Screening (Molecular Dynamics). Results of complete recognition code of the 64 binding sites of the 2nd finger of SP1 (nonstandardized P32 count).
Figure 3Amino acid sequence and structure of the SP1 binding domain.
List of exchanged amino acids in 2nd finger of SP1.
| 2nd finger | Amino acids in alpha helical region |
|---|---|
| SP1 (wild type) | R S D E L K R H K |
| Exchanged Amino Acids | |
| CB1 |
|
| MR14 | R S |
| MQ91 |
|
| MQ135 |
|
| MQ151 |
|
Listed rank-specific recognition codes for SP1, CB1, MR14, MQ91, MQ135, and MQ151.
| SP1 | CB1 | MR14 | MQ91 | MQ135 | MQ151 | |
|---|---|---|---|---|---|---|
| (1) | CGG | CCC | GAG | AGG | GGA | AGG |
| (2) | GCG | GAG | GGA | CGG | GAC | GGA |
| (3) | CGT | GGT | GTG | GGA | GAG | AAG |
| (4) | GGC | AGG | GAA | GAG | TAA | GAG |
| (5) | TTG | GGA | GGG | AAG | GGG | GTA |
| (6) | TGG | TAA | GGT | GGC | GGT | AGT |
| (7) | GAG | CAG | GTA | GTG | AGA | TGA |
| (8) | CCC | GTG | AGG | AGA | GAT | GTG |
| (9) | TAG | GGG | GGC | ATG | GTG | TTG |
| (10) | CCG | TGG | GTT | AAT | AGT | GCA |
| (11) | AGG | TGT | TGG | AAA | GGC | GGG |
| (12) | GGT | TAT | CCC | GGT | GTA | GGT |
| (13) | GAC | CAT | AGT | CAG | AGG | CGT |
| (14) | GTG | ATG | CGG | GGG | TGA | TTT |
| (15) | GGA | GTT | GAT | GAC | CGA | GCG |
| (16) | AGT | CGT | TGA | GAA | GCT | GCT |
| (17) | TAA | CGG | AAG | AGT | GTT | GAT |
| (18) | CAG | GTC | GAC | ACA | GCG | TGG |
| (19) | GTC | TAG | GCG | ATT | ATG | GTC |
| (20) | GCT | TCG | TGC | ACC | GCA | AAT |
| (21) | TAT | CAC | TTG | GCG | AGC | ATT |
| (22) | GTA | GAA | TAG | GCT | TGT | TCT |
| (23) | GAT | GAC | GCA | TAG | CGT | CGG |
| (24) | GCA | GTA | TTT | ATA | GAA | TGT |
| (25) | AAT | GCG | CGC | GTA | ATT | GGC |
| (26) | CGC | AGA | CGT | CAA | ACT | TAG |
| (27) | TCT | CGC | GTC | CGT | TCA | TAT |
| (28) | TCG | GGC | ATG | TAA | ACC | ACT |
| (29) | TAC | AAG | AGA | GTC | CGG | AGA |
| (30) | CCT | ACA | CAG | TGG | ACA | TTC |
| (31) | CAA | CCG | CCT | GCA | TGG | CCC |
| (32) | GTT | AAT | GCT | GCC | CAA | GTT |
| (33) | GGG | GAT | ATA | CCC | CCT | AGC |
| (34) | CAT | AGT | CAA | CGA | GCC | ATA |
| (35) | TGT | CAA | CTG | GAT | AAT | TGC |
| (36) | TCA | GCT | TGT | ACT | AAG | TTA |
| (37) | GCC | TGA | CGA | TGA | ATA | TCC |
| (38) | TGA | TCC | CAT | CAT | TTT | AAA |
| (39) | TTT | CCT | CTA | CCG | CGC | ACA |
| (40) | CGA | CGA | AAT | ACG | ACG | CGA |
| (41) | ATG | AAA | CCG | GTT | AAA | GAC |
| (42) | AAG | TTG | CTT | AGC | TTA | AAC |
| (43) | ACT | CCA | TAA | CCT | AAC | ATG |
| (44) | CTG | CTG | AGC | CAC | TAG | ACC |
| (45) | ATA | TCA | TAT | CGC | TTC | CGC |
| (46) | AGA | TAC | ATT | CTA | GTC | GCC |
| (47) | TTA | CTA | ACA | TAC | CAT | TCA |
| (48) | TCC | ATT | CAC | AAC | CCC | CTT |
| (49) | CCA | CTT | ACT | TCG | TAC | CTG |
| (50) | TGC | AGC | CTC | TAT | TGC | CCG |
| (51) | TTC | ACT | ACG | CCA | CAG | GAA |
| (52) | AGC | ATA | TTA | TGC | CAC | TAA |
| (53) | ACA | TTA | CCA | CTT | TTG | TAC |
| (54) | ACG | GCC | TCA | TCT | TAT | CAA |
| (55) | CAC | TTT | ACC | CTG | CTT | CAG |
| (56) | ATT | CTC | GCC | TCA | TCG | CTA |
| (57) | CTA | TCT | AAA | TCC | CCA | ACG |
| (58) | ATC | AAC | AAC | CTC | CCG | CAT |
| (59) | CTT | ACC | TAC | TGT | CTG | CTC |
| (60) | GAA | ACG | TCG | TTG | TCT | CCA |
| (61) | CTC | ATC | TCT | ATC | ATC | CCT |
| (62) | ACC | GCA | ATC | TTT | TCC | CAC |
| (63) | AAC | TGC | TCC | TTA | CTC | TCG |
| (64) | AAA | TTC | TTC | TTC | CTA | ATC |
Figure 5Band shifts of the SP1 protein-DNA P32 oligonucleotide complex.
Figure 6Results from Phosphor Imager Screening (Molecular Dynamics) of the complete binding spectrum of the 64 binding triplets of the 2nd finger of SP1 and number of exact GGGNNNGGG (Table 3) matches in the human genome.
Evolutionary success of C2H2 binding proteins. Relevant observations concerning evolutionary success of C2H2 binding proteins.
| Observations | References and comments |
|---|---|
| Degeneracy | (i) Engineered ZFAs typically yielded degenerate motifs, binding dozens to hundreds of related individual sequences [ |
| (ii) Observed clear secondary DNA binding preferences and the secondary motifs were bound nearly as well as the primary motifs [ | |
| (iii) The secondary motif can recruit genomic loci independently of the primary motif [ | |
| (iv) Beyond simply providing a DNA binding site motif, these data provide rank-ordered listing of the preference of a protein [ | |
| (v) Observed “secondary motif” phenomenon had not been described before, and it has important implications for understanding how proteins interact with their DNA binding sites [ | |
|
| |
| High failure rates | The modular assembly method of engineering zinc finger arrays has an unexpectedly higher failure rate [ |
|
| |
| Evolutionary plasticity | (i) The dramatic expansion of the number of C2H2-ZFs in mammals appears to be a recent evolutionary event [ |
| (ii) Evolutionary plasticity [ | |
| (iii) Conserved expression without conserved regulatory sequence: the more things change, the more they stay the same [ | |
|
| |
| Complexity | (i) Half of the proteins: each recognized multiple distinctly different sequence motifs [ |
| (ii) 10605 combinations for a 1000 bp long gene [ | |
| (iii) The dramatic expansion of the number of C2H2-ZFs in mammals appears to be a recent evolutionary event [ | |
|
| |
| Simplicity | (i) Origami structure: [ |
| (ii) Fractal organization: [ | |
|
| |
| Directional evolution | (i) Expression of ftz changed at least three times during arthropod evolution: [ |
| (ii) The complexity, robustness, and evolvability of regulatory systems [ | |
|
| |
| Evolutionary traits | (i) “The contribution of finger 1 to the DNA binding affinity of SP1 is smaller than that of fingers 2 and 3, but the presence of finger 1 is still essential for the high DNA binding affinity. These unique features have never been detected in other zinc fingers [ |
|
| |
| Cytotoxicity | Cell death and apoptosis associated with ZFN expression are most likely the result of excessive cleavage at off-target sites, which, in turn, suggests imperfect target-site recognition by the ZF DNA-binding domains. [ |
Evolutionary traits to be considered or reused in the design of zinc finger domains.
| Evolutionary trait | References and comments |
|---|---|
| Binding spectra | A single zinc finger binds many triplets[ |
| Condition dependency | Binding of dissimilar sequence in various conditions [ |
| Context dependency | The interdependence between all fingers in a domain through the overlap loci [ |
| Sequence dependency | Form of DNA determines the binding sequence a domain can recognize [ |
| Dynamic biological 3D malleability | Of the three-dimensional structure of single fingers and entire domains and the form and deformability of DNA structure [ |
| Reversibility | Formation and dissolution of DNA-protein complex is an indispensable property of a functional regulatory network system |
| Evolutionary dualisms | Duality of conserving and changing gene and protein sequences [ |
| 4th base overlap loci | Complex stabilization and dissolution loci to control the reversibility process [ |
| Binding initiation: 1st finger of SP1 | The 1st finger has unique evolutionary traits never detected in other fingers as well as relaxed binding specificity and affinity and therefore is likely to initiate complex formation [ |
Figure 7Evolutionary traits of SP1.
Sensitizing the SP1 framework. Potential design strategies for sensitizing versions of the SP1 framework.
| 1 × 1st finger | 321 | |
| 2 × 1st finger | 3211 | |
| 2 × 3 design | 321321 | |
| 3f + 4f design | 3213211 | |
| 2 × 4f design | 32113211 | (to lower affinity and higher specificity) |
Figure 8Sensitizing the SP1 framework.Potential design strategies for sensitizing versions of the SP1 framework for HBB gene target.
Box 1
Box 2(a)
| 9-mers | #P32 | #loc | 9-mers | #P32 | #loc | 9-mers | #P32 | #loc | 9-mers | #P32 | #loc | 9-mers | #P32 | #loc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (1)CGG | 4570 | 16697 | (2) GCG | 3450 | 22741 | (4) GGC | 2850 | 16087 | (8) CCC | 1900 | 6153 | (19) GTC | 950 | 6246 |
| (3) CGT | 3125 | 6508 | (5) TTG | 2320 | 28372 | (9) TAG | 1850 | 14922 | (20) GCT | 800 | 42489 | |||
| (6) TGG | 2300 | 55636 | (10) CCG | 1850 | 8662 | (21) TAT | 660 | 7319 | ||||||
| (7) GAG | 2180 | 58721 | (11) AGG | 1750 | 41257 | (22) GTA | 630 | 15656 | ||||||
| (12) GGT | 1440 | 38721 | (23) GAT | 560 | 23908 | |||||||||
| (13) GAC | 1215 | 5109 | (24) GCA | 555 | 40076 | |||||||||
| (14) GTG | 1200 | 69427 | (25) AAT | 550 | 17749 | |||||||||
| (15) GGA | 1175 | 50364 | (26) CGC | 500 | 4098 | |||||||||
| (16) AGT | 1150 | 26799 | ||||||||||||
| (17) TAA | 1100 | 9037 | ||||||||||||
| (18) CAG | 1010 | 34437 |
(b)
| 9-mers | #P32 | #loc | 9-mers | #P32 | #loc | 9-mers | #P32 | #loc |
|---|---|---|---|---|---|---|---|---|
| (27) TCT | 475 | 16364 | (40) CGA | 225 | 5420 | (53) ACA | 63 | 26050 |
| (28) TCG | 450 | 7161 | (41) ATG | 188 | 28958 | (54) ACG | 62 | 6423 |
| (29) TAC | 380 | 1172 | (42) AAG | 188 | 36175 | (55) CAC | 50 | 3769 |
| (30) CCT | 300 | 27004 | (43) ACT | 187 | 16928 | (56) ATT | 50 | 11132 |
| (31) CAA | 300 | 15503 | (44) CTG | 150 | 38542 | (57) CTA | 40 | 9049 |
| (32) GTT | 280 | 24314 | (45) ATA | 124 | 7850 | (58) ATC | 30 | 1919 |
| (33) GGG | 260 | 18460 | (46) AGA | 123 | 38448 | (59) CTT | 20 | 15023 |
| (34) CAT | 250 | 14458 | (47) TTA | 110 | 10717 | (60) GAA | 10 | 36693 |
| (35) TGT | 245 | 20901 | (48) TCC | 110 | 2733 | (61) CTC | 10 | 4500 |
| (36) TCA | 238 | 23485 | (49) CCA | 80 | 22765 | (62) ACC | 0 | 3170 |
| (37) GCC | 235 | 12890 | (50) TGC | 75 | 4411 | (63) AAC | 0 | 2925 |
| (38) TGA | 230 | 24010 | (51) TTC | 73 | 1854 | (64) AAA | 0 | 29730 |
| (39) TTT | 225 | 16500 | (52) AGC | 65 | 6704 | Total: 1,261,301 | ||