| Literature DB >> 20047959 |
Alpay N Temiz1, Panayiotis V Benos, Carlos J Camacho.
Abstract
A major obstacle towards elucidating the molecular basis of transcriptional regulation is the lack of a detailed understanding of the interplay between non-specific and specific protein-DNA interactions. Based on molecular dynamics simulations of C(2)H(2) zinc fingers (ZFs) and engrailed homeodomain transcription factors (TFs), we show that each of the studied DNA-binding domains has a set of highly constrained side chains in preset configurations ready to form hydrogen bonds with the DNA backbone. Interestingly, those domains that bury their recognition helix into the major groove are found to have an electrostatic hot spot for Cl(-) ions located on the same binding cavity as the most buried DNA phosphate. The spot is characterized by three protein hydrogen bond donors, often including two basic side chains. If bound, Cl(-) ions, likely mimicking phosphates, steer side chains that end up forming specific contacts with bases into bound-like conformations. These findings are consistent with a multi-step DNA-binding mechanism in which a pre-organized set of TF side chains assist in the desolvation of phosphates into well defined sites, prompting the re-organization of specificity determining side chains into conformations suitable for the recognition of their cognate sequence.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20047959 PMCID: PMC2853105 DOI: 10.1093/nar/gkp1132
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.EGR–DNA complex and protein–DNA interactions. (A) Cartoon of EGR bound to DNA [PDB code: 1AAY (39)]. Fingers I, II and III are color red, green and blue, respectively. (B) Schematics diagram showing key side chain–base interactions between EGR and its target DNA site at positions −1, +3 and +6 of binding helix.
Analysis of buried surface area and bound-like behavior of specificity determinant side chains that contact DNA bases
Highlighted rows indicate residues that are mostly free in the unbound state, but are >80% buried in the complex, burying the largest amount of solvent area upon complexation. Also included in the Table are Positions +3 and +6 for EGR.
aChange in solvent accessible surface areas (SASA) upon complexation. SASA are calculated using the program NACCESS (50).
bFraction of buried area in the free protein with respect to the tri-peptide Ala-X-Ala.
cFraction of buried area in the protein–DNA complex with respect to the tri-peptide Ala-X-Ala
dFraction of bound-like conformations (RMSD <2 Å) for each side chain in the presence of counter ions at physiological concentrations (150–160 mM corresponding to 8–20 ions in a simulation box). See ‘Materials and Methods’ section.
eFraction of bound-like conformations for each side chain in the absence of counter ions (RMSD <2 Å). See ‘Materials and methods’ section.
fThe simulation results for the RMSD of R62 using the X-ray crystal is 12±5% with ions and 11±8% with no ions.
gR5 is not considered as specificity determinant and therefore not highlighted. See text for details.
Figure 2.Increasing bound-like behavior of EGR specificity determinant residue Arg+6 in FI as a function of ion concentration. Change in RMSD of Arg+6 with respect to its bound conformation as a function of time and increasing number of counter ions in the simulation box: (A) 20 ions; (B) 16 ions; (C) 12 ions; (D) 8 ions; (E) 4 ions; (F) No ions. The histograms show the distribution of the RMSDs in each simulation, reflecting the increasing amount of bound-like behavior of Arg+6 as a function of ion concentration.
Figure 3.Differential ion dependence of bound-like behavior of key side chains in EGR fingers I and III. The effective ionic strength in millimolar corresponds to the number of ions × 11.5. The conformations are considered bound-like if the RMSD from the crystal structure conformation is under 2 Å. (A) Fraction of bound-like conformations for buried Arg+6 in finger I. (B) Fraction of bound-like conformations for buried Arg−1 in finger III. (C) Fraction of bound-like conformations for exposed Arg−1 in finger I. (D) Fraction of bound-like conformations for exposed Arg+6 in finger III. Error bars are the direct standard deviation from three or more independent four nanosecond MD simulations. Note that in the co-crystal both Arg−1s and Arg+6s have identical structures. However, side chain dynamics is more bound-like if side chain is found buried in the complex relative to if it is found exposed to solvent.
SASA buried by phosphates and the corresponding residence time of Cl− ions on their binding site during MD
High residence times are observed on the loci of phosphates that bury the largest amount of SASA (highlighted rows).
aChange in SASA upon complexation. Surface areas of phosphate groups are calculated in the absence and presence of the protein.
bThe fraction of ion residence time corresponds to the fraction of the simulation time a counter ion is observed within a 3 Å radius sphere from the position of the phosphorus atom at physiological concentrations (150–160 mM corresponding to 8–20 ions in a simulation box).
cFII from the solution NMR complex structure (51).
Figure 4.Ions occupy the phosphate binding site in the protein–DNA complex. Distribution of counter ion positions in fingers I–III of EGR and in the engrailed homeodomain relative to the phosphorus atom position in their corresponding co-crystals. The cartoon representations of each domain are on the right: DNA phosphorus atom shown in orange spheres; Cys, His residues and binding motif residues are shown as sticks; CPK colored sticks show the crystal conformation; and, colored sticks show a snapshot of the position of key side chains. A blue sphere shows for each domain the position of the Cl− in the simulation that had the largest residence time using a 3 Å clustering radius. (A) Finger I (red), for comparison we also show the distribution with respect to a second phosphate binding site; (B) Finger II (green); (C) Finger III (blue); (D) Engrailed homeodomain, phosphorus atom is shown in yellow sphere. DNA is shown as yellow sticks. The two phosphate backbone contacting residues are Leu26, Arg31 and Arg53 are shown in light blue sticks. The contacts are depicted as pink dashed lines. For comparison we also show the distribution with respect to a second phosphate binding site. We find only one ion hot spot per domain, corresponding to the locus of the phosphate that buries the largest amount of SASA upon binding.
Residues contacting the phosphate backbone in EGR, TFIIIA, GLI and engrailed homeodomain
| DNA contact | Residue | ΔSASA | Buried free | Buried complex | Native-like(ions) | Native-like (no ion) | ||
|---|---|---|---|---|---|---|---|---|
| EGR | F1 | G8 | R103 | 40 | 36 | 56 | 52 ± 8 | 42 ± 8 |
| G7 | R114 | 43 | 35 | 57 | 58 ± 9 | 30 ± 9 | ||
| G7 | H125 (+7) | 19 | 84 | 97 | 100 | 100 | ||
| FII | T5 | R142 | 53 | 23 | 48 | 0 | 0 | |
| G4 | H153 (+7) | 24 | 77 | 93 | 100 | 100 | ||
| FIII | G2 | R170 | 53 | 32 | 59 | 96 ± 2 | 98 ± 0.5 | |
| – | H181 (+7) | 0 | 84 | 84 | 100 | 100 | ||
| TFIIIA | FI | G34 | R12 | 65 | 34 | 66 | 0 | 0 |
| G34 | Y13 | 44 | 48 | 73 | 100 | 100 | ||
| G26 | Y24 | 44 | 68 | 92 | 100 | 100 | ||
| A27 | K29 (+3) | 54 | 63 | 96 | 0 | 0 | ||
| FII | T23 | H63 (+7) | 14 | 23 | 95 | 100 | 100 | |
| A22 | T66 (+10) | 43 | 43 | 86 | 29 ± 7 | 27 ± 7 | ||
| FIII | A22 | T85 | 88 | 8 | 94 | 46 ± 19 | 52 ± 5 | |
| C40 | K87 (+1) | 36 | 28 | 50 | 0 | 0 | ||
| FV | T8 | K144 | 45 | 30 | 58 | 30 ± 6 | 0 | |
| T8 | H155 (+7) | 17 | 82 | 93 | 100 | 100 | ||
| C7 | V158 (+10) | 51 | 20 | 65 | 21 ± 4 | 0 | ||
| GLI | FII | G66 | R146 | 37 | 66 | 85 | 0 | 0 |
| G66 | K152 | 56 | 20 | 54 | 0 | 0 | ||
| A65 | H160 (+7) | 16 | 89 | 100 | 100 | 100 | ||
| FIII | C62 | Y181 | 57 | 64 | 96 | 100 | 100 | |
| A64 | R183 | 142 | 12 | 83 | 0 | 0 | ||
| T6 | K188 (+5) | 29 | 51 | 68 | 97 | 100 | ||
| FIV | C7 | Y200 | 78 | 39 | 83 | 77 ± 4 | 80 ± 8 | |
| T8 | R217 (+4) | 24 | 68 | 80 | 71 ± 4 | 88 ± 9 | ||
| C59 | H220 (+7) | 24 | 79 | 95 | 100 | 100 | ||
| C58 | T224 (+11) | 45 | 37 | 81 | 84 ± 0.2 | 85 ± 9 | ||
| FV | G56 | Y242 | 51 | 69 | 98 | 100 | 98 | |
| A57 | T243 | 108 | 0 | 96 | 94 ± 3 | 92 ± 9 | ||
| Homeodomain | A212 | W48 | 18 | 85 | 93 | 100 | 100 | |
| A213 | T6 | 5 | 47 | 42 | 45 | 30 | ||
| G326 | R31 | 27 | 70 | 84 | 33 | 52 | ||
| G327 | K46 | 31 | 49 | 68 | 59 | 72 | ||
| G327 | R53 | 42 | 71 | 91 | 82 | 10 | ||
| T328 | Y25 | 38 | 36 | 57 | 53 | 45 | ||
| T328 | R53 | 42 | 71 | 91 | 82 | 10 |
aChange in solvent accessible surface areas (SASA) upon complexation. SASA are calculated using the program NACCESS(50).
bFraction of buried area in the free protein with respect to the tri-peptide Ala-X-Ala.
cFraction of buried area in the protein–DNA complex with respect to the tri-peptide Ala-X-Ala.
dFraction of native-like conformations for each side chain in the presence of counter ions. See ‘Materials and methods’ section.
eFraction of native-like conformations for each side chain in the absence of counter ions. See ‘Materials and methods’ section.
fFII from the solution NMR complex structure (51).