| Literature DB >> 19389725 |
Justin Ashworth1, David Baker.
Abstract
The biological functions of DNA-binding proteins often require that they interact with their targets with high affinity and/or high specificity. Here, we describe a computational method that estimates the extent of optimization for affinity and specificity of amino acids at a protein-DNA interface based on the crystal structure of the complex, by modeling the changes in binding-free energy associated with all individual amino acid and base substitutions at the interface. The extent to which residues are predicted to be optimal for specificity versus affinity varies within a given protein-DNA interface and between different complexes, and in many cases recapitulates previous experimental observations. The approach provides a complement to traditional methods of mutational analysis, and should be useful for rapidly formulating hypotheses about the roles of amino acid residues in protein-DNA interfaces.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19389725 PMCID: PMC2691843 DOI: 10.1093/nar/gkp242
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Optimality of affinity and specificity at individual positions in representative complexes. Protein identities and pdb codes are indicated at the top of each panel. In the left panels, the extent optimal for affinity [Equation (1)] is plotted against the extent optimal for specificity [Equation (3)] for each residue in a complex. The color of each data point is proportional to the change in the energy of binding calculated for mutation to glycine at that position, where bright cyan indicates the highest relative loss in binding energy, and black indicates the lowest loss in binding energy. All interface residues in each crystal structure are represented; large clusters near (0, 0) correspond to residues in the interface that are not predicted to be optimal for affinity or specificity. At right are representations of the crystal structures for each indicated interface. The amino acids are colored by a dual color gradient in which red indicates the optimality for affinity, blue indicates the optimality for specificity and pink/magenta indicates positions that are optimal for both affinity and specificity. A) DNAse I (pdb code: 2DNJ), B) C2H2 zinc finger Zif268 (1ZAA), C) [β]-Zip GCN4 (2DGC), D) EcoRV (1B94), E) I-MsoI (1M5X). In frame (D), symmetrically equivalent residues in the homodimer EcoRV are labeled on only one chain. Molecular images were rendered using PyMOL (11).
Figure 2.Distributions of optimality for affinity [Equation (1)] and specificity [Equation (3)] in four catagories of protein–DNA interfaces. Red: helical transcription factors; green: restriction endonucleases; blue: homing endonucleases; black: nonspecific enzymes. Histogram bin centers are indicated on the horizontal axes. Only positions at which mutation to glycine is predicted to result in the loss of >3 kcal/mol of binding energy were included.
Optimization of specificity and affinity in the C2H2 zinc finger family
| C2H2 zinc finger family | |||
|---|---|---|---|
| (pdb codes: 1zaa, 1aay, 1mey, 1ubd, 1g2f, 1a1f, 1a1h, 1a1j) | |||
| Position | Counts | 〈 | 〈 |
| −5 | 20 | 0.29 | 0.05 |
| −1 | 24 | 0.83 | 0.43 |
| 2 | 23 | 0.34 | 0.21 |
| 3 | 24 | 0.48 | 0.25 |
| 6 | 24 | 0.56 | 0.28 |
| All other | 97 | 0.07 | 0.02 |
| All int pos | 212 | 0.31 | 0.14 |
All quantities are averaged over the eight complexes noted above. 〈opt.ΔG〉, optimality for binding [Equation (1)]; 〈opt. Spec.〉, optimality for wild-type DNA specificity [Equation (3)]. ‘All other’ refers to all positions excluding −5, −1, 2, 3, and 6. ‘All int pos’ refers to all interface positions