| Literature DB >> 17214883 |
Francesca Spyrakis1, Pietro Cozzini, Chiara Bertoli, Anna Marabotti, Glen E Kellogg, Andrea Mozzarelli.
Abstract
BACKGROUND: To understand the energetics of the interaction between protein and DNA we analyzed 39 crystallographically characterized complexes with the HINT (Hydropathic INTeractions) computational model. HINT is an empirical free energy force field based on solvent partitioning of small molecules between water and 1-octanol. Our previous studies on protein-ligand complexes demonstrated that free energy predictions were significantly improved by taking into account the energetic contribution of water molecules that form at least one hydrogen bond with each interacting species.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17214883 PMCID: PMC1781455 DOI: 10.1186/1472-6807-7-4
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Figure 1Three-dimensional structure representation of Homing endonuclease I-crei complex (1g9y), using display features of the Lithium software package [75]. a) Overall view of the complex where the protein is displayed in ribbon/tube style and the DNA is represented in color-coded ribbons: red for adenine (A), blue for cytosine (C), green for guanine (G), and yellow for thymine (T). b) Water molecule hydrating a negatively-charged amino acid side-chain. c) Water molecule hydrating a DNA phosphate group. d) Water molecule screening the repulsive interaction between an Asp side-chain and a DNA phosphate. e) Water molecule located at the complex interface mediating specific amino acid-base interactions.
Classification data for the 39 protein-DNA complexes used in this study.
| 1a3q | NF kB p52 | Transcr. regulator | Rel homology region | other | Mixed b | [79] |
| 1aay | Zif268 zinc finger | Transcr. regulator | ββα-zinc finger | Zinc coordinating group | 3.30.160.60 | [80] |
| 1azp | Sac7D | DNA binding | Hyperthermophyle DNA-BP | β-sheet group | 2.40.50.40 | [81] |
| 1bl0 | MarA | Transcr. regulator | AraC transcriptional activator | Helix-turn-helix | 1.10.10.60 | [82] |
| 1by4 | Retinoic acid receptor rxr-alpha | Transcr. regulator | Nuclear receptor | Zinc coordinating group | 3.30.50.10 | [83] |
| 1c9b | TF IIB | Transcr. initiator | TF IIB | Helix-turn-helix | 1.10.472.10 | [84] |
| 1cez | T7 RNA polymerase | Enzyme | T7 RNA polymerase | DNA/RNA polymerases | Mixed b | [85] |
| 1cit | Orphan nuclear receptor NGFI-B | Transcr. regulator | Nuclear receptor | Zinc coordinating group | 3.30.50.10 | [86] |
| 1du0 | Engrailed homeodomain (q50a) | Transcr. regulator | Homeodomain | Helix-turn-helix | 1.10.10.60 | [87] |
| 1dux | Elk-1 | Transcr. regulator | Ets domain | Winged HTH | 1.10.10.10 | [88] |
| 1f4k | Replication terminator protein | DNA binding | Replication terminator protein | Winged HTH | 1.10.10.10 | [89] |
| 1g9y a | Homing endonuclease I-CreI | Enzyme | Homing endonuclease | Homing endonuclease-like | 3.10.28.10 | [90] |
| 1g9z a | Homing endonuclease I-CreI | Enzyme | Homing endonuclease | Homing endonuclease-like | 3.10.28.10 | [90] |
| 1h88 | Myb proto-oncogene protein | Transcr. regulator | Homeodomain | Helix-turn-helix | 1.10.10.60 | [91] |
| 1hcr | Hin-recombinase | Enzyme | Hin-recombinase | Helix-turn-helix | 1.10.10.60 | [92] |
| 1i3j | Homing endonuclease I-TevI | Enzyme | DNA-binding domain of intro endonuclease | DNA-binding domain of intro endonuclease | 3.30.60.40 | [93] |
| 1ig7 | Homeotic protein Msx-1 | Transcr. regulator | Homeodomain | Helix-turn-helix | 1.10.10.60 | [94] |
| 1jk1 | Zif268 d20a mutant | Transcr. factor | ββα-zinc finger | Zinc coordinating group | 3.30.160.60 | [95] |
| 1jk2 | Zif268 d20a mutant | Transcr. factor | ββα-zinc finger | Zinc coordinating group | 3.30.160.60 | [95] |
| 1jko | Hin recombinase | Enzyme | Homeodomain | Helix-turn-helix | 1.10.10.60 | [96] |
| 1jkr | Hin recombinase | Enzyme | Homeodomain | Helix-turn-helix | 1.10.10.60 | [96] |
| 1le8 | 3A Mating-type protein a-1 | Transcr. regulator | Homeodomain | Helix-turn-helix | 1.10.10.60 | [97] |
| 1lmb | Lambda repressor | Transcr. regulator | Repressor | Helix-turn-helix | 1.10.260.40 | [98] |
| 1mhd | Smad3 | Transcr. regulator | Smad Mh1 Domain | Smad Mh1 Domain | 3.90.520.10 | [99] |
| 1pnr | Purine repressor | Transcr. regulator | LacI repressor | Helix-turn-helix | 1.10.260.40 | [100] |
| 1pue | TF PU.1 | Transcr. factor | Ets domain | Winged HTH | 1.10.10.10 | [101] |
| 1qpz | Purine repressor | Transcr. regulator | LacI repressor | Helix-turn-helix | 1.10.260.40 | [102] |
| 1skn | Transcription factor skn-1 | Transcr. factor | Transcription factor skn-1 | Other α-helix group | 1.10.880.10 | [103] |
| 1t2t | Homing endonuclease I-TevI | Enzyme | DNA-binding domain of intro endonuclease | DNA-binding domain of intro endonuclease | 3.30.60.40 | [104] |
| 1t9i a | Homing endonuclease I-CreI (d20n) | Enzyme | DNA-binding domain of intro endonuclease | DNA-binding domain of intro endonuclease | 3.10.28.10 | [105] |
| 1t9j a | Homing endonuclease I-CreI (q47e) | Enzyme | DNA-binding domain of intro endonuclease | DNA-binding domain of intro endonuclease | 3.10.28.10 | [105] |
| 1tc3 | Transposase Tc3a1-65 | Enzyme | Transposase | Helix-turn-helix | 1.10.10.60 | [106] |
| 1tro | Tryptophane repressor | Transcr. regulator | Tryptophane repressor | Helix-turn-helix | 1.10.1270.10 | [107] |
| 1u0c a | Homing endonuclease I-CreI (y33c) | Enzyme | DNA-binding domain of intro endonuclease | DNA-binding domain of intro endonuclease | 3.10.28.10 | [108] |
| 1yrn | Mating-type protein a-1 | Transcr. regulator | Homeodomain | Helix-turn-helix | 1.10.10.60 | [109] |
| 1ytb | TATA box binding protein | Transcr. initiator | TATA box binding protein | β-sheet group | 3.30.310.10 | [110] |
| 2bop | Bovine papillomavirus-I E2 | Transcr. regulator | bovine papillomavirus-I E2 | Other α-helix group | 3.30.70.330 | [111] |
| 2hdd | Engrailed homeodomain (q50k) | Transcr. regulator | Homeodomain | Helix-turn-helix | 1.10.10.60 | [112] |
| 9ant | Antennapedia protein | Transcr. regulator | Homeodomain | Helix-turn-helix | 1.10.10.60 | [113] |
a Homing endonuclease I-CreI-DNA complexes (see text for more information).
b The domain arrangements of the protein portions interacting with the DNA are identified by multiple C.A.T.H. codes.
Structural, experimental dissociation and calculated HINT score data for the 39 protein-DNA complexes.
| 1a3q (2.10) | 785 | 10.82 | [79] | -14.71 | 11257 | 22843 (39) | 19954 (28) | 12461 (2) |
| 1aay (1.60) | 148 | 9.90 | [114] | -13.46 | 14166 | 36884 (47) | 36300 (43) | 20454 (11) |
| 1azp (1.60) | 132 | 6.81 | [115] | -9.26 | 4045 | 9182 (16) | 8400 (12) | 5225 (4) |
| 1bl0 (2.30) | 144 | 7.70 | [116] | -10.46 | 4742 | 6513 (15) | 6294 (12) | 5143 (4) |
| 1by4 (2.10) | 120 | 6.46 | [83] | -8.78 | 6328 | 17213 (24) | 14606 (19) | 10407 (7) |
| 1c9b (2.65) | 101 | 7.35 | [117] | -9.99 | 7122 | 14402 (20) | 13101 (14) | 7355 (1) |
| 1cez (2.40) | 471 | 6.33 | [118] | -8.60 | 2697 | 8969 (17) | 8125 (12) | 4807 (4) |
| 1cit (2.70) | 38 | 9.00 | [119] | -12.23 | 8239 | 12266 (9) | 12347 (8) | 9973 (4) |
| 1du0 (2.00) | 103 | 9.72 | [120] | -13.21 | 9793 | 15802 (37) | 16295 (24) | 12380 (5) |
| 1dux (2.10) | 129 | 10.07 | [121] | -13.69 | 8837 | 18746 (24) | 18768 (20) | 13227 (7) |
| 1f4k (2.50) | 96 | 10.70 | [89] | -14.54 | 10836 | 18458 (20) | 17928 (18) | 14753 (8) |
| 1g9y (2.05) | 435 | 10.00 | [105] | -13.59 | 2991 | 37385 (102) | 33546 (83) | 16470 (30) |
| 1g9z (1.80) | 857 | 8.70 | [122] | -11.83 | -747 | 32408 (117) | 31561 (93) | 10064 (25) |
| 1h88 (2.80) | 25 | 7.45 | [91] | -10.13 | 7335 | 8677 (3) | 8436 (2) | 7335 (0) |
| 1hcr (1.80) | 16 | 7.47 | [96] | -10.15 | 5553 | 6173 (4) | 6173 (4) | 5553 (0) |
| 1i3j (2.20) | 185 | 8.02 | [123] | -10.90 | 11217 | 21035 (37) | 21381 (30) | 13263 (6) |
| 1ig7 (2.20) | 153 | 8.70 | [124] | -11.83 | 7656 | 19284 (34) | 10839 (27) | 10839 (6) |
| 1jk1 (1.90) | 136 | 10.59 | [95] | -14.39 | 14098 | 38199 (53) | 35286 (40) | 18768 (7) |
| 1jk2 (1.65) | 145 | 10.37 | [95] | -14.09 | 14283 | 35196 (51) | 33716 (42) | 19239 (7) |
| 1jko (2.24) | 13 | 7.03 | [96] | -9.55 | 7571 | 7731 (1) | 7731 (1) | 7732 (1) |
| 1jkr (2.28) | 2 | 6.08 | [96] | -8.27 | 6787 | 6787 (0) | 6787 (0) | 6787 (0) |
| 1le8 (2.30) | 102 | 6.66 | [97] | -9.05 | 8571 | 12904 (26) | 12679 (18) | 8990 (2) |
| 1lmb (1.80) | 140 | 9.00 | [98] | -12.23 | 8191 | 19046 (37) | 16445 (31) | 10154 (6) |
| 1mhd (2.80) | 24 | 6.93 | [99] | -9.42 | 1607 | 1660 (2) | 1660 (2) | 1629 (1) |
| 1pnr (2.70) | 92 | 8.47 | [125] | -11.51 | 11760 | 14090 (6) | 14090 (6) | 11760 (0) |
| 1pue (2.10) | 61 | 6.85 | [126] | -9.30 | 5823 | 11403(18) | 10461 (14) | 6659 (1) |
| 1qpz (2.50) | 184 | 8.59 | [102] | -11.67 | 12015 | 16625 (12) | 16877 (12) | 13527 (4) |
| 1skn (2.50) | 28 | 9.00 | [103] | -12.23 | 9158 | 10976 (6) | 10976 (6) | 9618 (2) |
| 1t2t (2.50) | 72 | 8.28 | [104] | -11.25 | 12893 | 16500 (13) | 15693 (7) | 14360 (4) |
| 1t9i (1.60) | 656 | 8.74 | [105] | -11.89 | 6895 | 49015 (122) | 43140 (98) | 18371 (31) |
| 1t9j (2.00) | 189 | 9.22 | [105] | -12.54 | -449 | 27873 (81) | 24965 (70) | 8215 (22) |
| 1tc3 (2.45) | 49 | 7.10 | [127] | -9.65 | 10246 | 13475 (9) | 13390 (6) | 10720 (1) |
| 1tro (1.90) | 572 | 9.30 | [128] | -12.64 | 5739 | 22606 (60) | 22434 (46) | 10568 (12) |
| 1u0c (2.50) | 90 | 8.23 | [108] | -11.19 | -2591 | 12569 (37) | 7561 (30) | 3622 (14) |
| 1yrn (2.50) | 58 | 10.00 | [129] | -13.59 | 14429 | 22584 (25) | 22846 (24) | 17871 (9) |
| 1ytb (1.80) | 513 | 8.40 | [130] | -10.06 | 9035 | 16607 (32) | 17529 (23) | 9996 (1) |
| 2bop (1.70) | 121 | 9.40 | [131] | -12.77 | 13400 | 22452 (33) | 22254 (24) | 14692 (2) |
| 2hdd (1.90) | 183 | 11.06 | [120] | -15.03 | 14613 | 31840 (46) | 30951 (39) | 18688 (9) |
| 9ant (2.40) | 26 | 8.80 | [132] | -11.96 | 9192 | 12971 (9) | 13049 (8) | 11711 (5) |
HP-D-W = Hprotein-DNA + Hprotein-water + HDNA-water. Only the contributions of water molecules in a 4 Å range at the protein-DNA interface are considered.
Only waters with nonzero Ranks with respect to both protein and DNA are included in HP-D-W.
Only waters with nonzero Ranks with respect to both protein and DNA and total Rank ≥ 4 are included in HP-D-W.
The PDB files report the structures as two equivalent protein-DNA complexes. The second complexes were removed and only waters surrounding the first in a 8 Å sphere were considered.
Figure 2Correlation between the experimental binding free energies and the calculated HINT score values for the 39 analyzed protein-DNA complexes. Solid line is correlation for all 39 complexes; dashed line is correlation after five endonuclease I-CreI-DNA native and mutant complexes (1g9z, 1g9y, 1t9i, 1t9j, 1u0c; open symbols) are removed from dataset.
Number, mean HINT scores and mean Ranks of waters found at the protein-DNA interface.
| 1244 | -22 | 355 | 333 | 1.6 | 1.4 | 3.0 | |
| 996 | -10 | 388 | 378 | 1.8 | 1.6 | 3.4 | |
| 261 | 21 | 432 | 452 | 3.0 | 2.0 | 5.0 | |
Figure 3Correlation between the experimental binding free energies and the calculated HINT score values taking into account the score contributions of water molecules. a) Including all water molecules within in a 4 Å range of both atoms of the protein and atoms of the DNA. b) Including waters in a 4 Å range with total Rank ≥ 4, and non-zero partial Ranks with both protein and DNA.
Figure 4a) Contributions of various structural components of the protein-DNA association. Shown are protein-DNAphosphate, protein-DNAribose and protein-DNAbase-edge contributions, colored as indicated in the legend. b) HINT score contributions calculated for the interaction between water molecules (having total Rank ≥ 4 and non-zero Rank with both macromolecules) and the protein-DNA interface. Shown are the water-DNAphosphate, water-DNAribose, water-DNAbase and water-protein contributions, colored as indicated in the legend. Bars corresponding to 1h88, 1hcr, 1jkr and 1pnr complexes are missing because no significant crystallographic water molecules were found in the crystal structures.
Number, mean HINT scores and mean Ranks of waters initially classified by role.
| 535 | -36 | 426 | 390 | 1.9 | 1.3 | 3.2 | ||
| 461 | 32 | 324 | 356 | 1.7 | 1.6 | 3.3 | ||
| 212 | -17 | 281 | 264 | 1.6 | 1.2 | 2.8 | ||
| 249 | 71 | 361 | 432 | 1.9 | 1.8 | 3.7 | ||
| 218 | 94 | 360 | 453 | 1.9 | 1.8 | 3.7 | ||
| 20 | -120 | 400 | 280 | 1.6 | 1.9 | 3.5 | ||
| 11 | 36 | 310 | 346 | 2.8 | 1.7 | 4.6 | ||
Figure 5Contributions of the different interaction types participating in the direct protein-DNA association process, as estimated by the HINT force field. The bars are color-coded as indicated in the legend.