| Literature DB >> 28375143 |
Heping Zheng1, Karol M Langner1, Gregory P Shields2, Jing Hou1, Marcin Kowiel1, Frank H Allen2, Garib Murshudov3, Wladek Minor1.
Abstract
The bond-valence model is a reliable way to validate assumed oxidation states based on structural data. It has successfully been employed for analyzing metal-binding sites in macromolecule structures. However, inconsistent results for heme-based structures suggest that some widely used bond-valence R0 parameters may need to be adjusted in certain cases. Given the large number of experimental crystal structures gathered since these initial parameters were determined and the similarity of binding sites in organic compounds and macromolecules, the Cambridge Structural Database (CSD) is a valuable resource for refining metal-organic bond-valence parameters. R0 bond-valence parameters for iron(II), iron(III) and other metals have been optimized based on an automated processing of all CSD crystal structures. Almost all R0 bond-valence parameters were reproduced, except for iron-nitrogen bonds, for which distinct R0 parameters were defined for two observed subpopulations, corresponding to low-spin and high-spin states, of iron in both oxidation states. The significance of this data-driven method for parameter discovery, and how the spin state affects the interpretation of heme-containing proteins and iron-binding sites in macromolecular structures, are discussed.Entities:
Keywords: Cambridge Structural Database; bond-valence model; metal–organics; nonlinear conjugate gradients; oxidation state
Mesh:
Substances:
Year: 2017 PMID: 28375143 PMCID: PMC5503122 DOI: 10.1107/S2059798317000584
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Figure 1(a) Bimodal BVS distribution for iron(II) sites in the CSD using literature R 0 values, with oxidation state assigned by the ligand-template method. (b) An example of a crystal structure with two iron(II) sites in different spin states (CSD Refcode ABEQUV). Although both Fe atoms were assigned an oxidation state of 2 and have a coordination number of 6, Fe1II—N distances span 1.911–2.067 Å, while Fe2II—N distances span 2.146–2.299 Å.
Optimized R 0 parameters (in Å) derived for iron(II) and iron(III), compared with reference literature values (Brese & O’Keeffe, 1991 ▸; Liu & Thorp, 1993 ▸; Kanowitz & Palenik, 1998 ▸)
For each metal–ligand pair, R 0 is the mean of parameters calculated separately from all validated homoleptic sites (with the standard deviation in parentheses) and R 0 (CSD) denotes the value obtained from our optimization procedure (with the average of |ΔR 0|2 as an upper bound on the uncertainty in parentheses) calculated over all validated homoleptic and heteroleptic sites. CN is the coordination number of the first coordination sphere.
| Ligands to iron | N | O | F | S (CN = 4) | Cl | Br |
|---|---|---|---|---|---|---|
| Iron(II) | ||||||
| No. of homoleptic sites | 497; 751 | 378 | 0 | 159 | 68 | 11 |
|
| 1.57 (2); 1.76 (2) | 1.71 (4) | — | 2.08 (9) | 2.04 (4) | 2.20 (1) |
| Total No. of sites | — | 1144 | 34 | — | 518 | 65 |
|
| — | 1.70 (4) | 1.67 (4) | — | 2.05 (3) | 2.21 (2) |
| Brese | 1.86 | 1.734 | 1.65 | 2.16 | 2.06 | 2.26 |
| Liu | 1.769 | 1.700 | — | 2.125 | — | — |
| Kanowiz | 1.713 | — | — | — | — | |
| Iron(III) | ||||||
| No. of homoleptic sites | 114; 62 | 665 | 14 | 68 | 280 | 59 |
|
| 1.70 (2); 1.83 (3) | 1.75 (4) | 1.68 (1) | 2.10 (11) | 2.08 (2) | 2.22 (1) |
| Total No. of sites | — | 2663 | 60 | — | 1034 | 104 |
|
| — | 1.76 (3) | 1.67 (4) | — | 2.09 (2) | 2.23 (4) |
| Brese | 1.86 | 1.759 | 1.67 | 2.16 | 2.09 | 2.26 |
| Liu | 1.815 | 1.765 | — | 2.134 | — | — |
| Kanowitz | — | 1.751 | — | — | — | — |
Optimized R 0 parameters (in Å) for bonds to Na, Mg, K, Ca and Zn using first coordination sphere ligand atoms
For each metal–ligand pair, three values are reported: R 0 is the mean of parameters calculated separately from all validated homoleptic sites (with the standard deviation in parentheses), R 0 (CSD) denotes the value obtained from our optimization procedure (with the average of |ΔR 0|2 as the upper bound on the uncertainty in parentheses) calculated over all validated homoleptic and heteroleptic sites, and R 0 (Brese) is the value previously reported in the literature (Brese & O’Keeffe, 1991 ▸).
| Ligands | N | O | F | S | Cl | Br |
|---|---|---|---|---|---|---|
| Na | ||||||
| No. of homoleptic sites | 60 | 1480 | 2 | 1 | 2 | 0 |
|
| 1.91 (9) | 1.77 (9) | 1.69 (2) | 2.27 | 2.16 (1) | — |
| Total No. of sites | 702 | 2404 | 75 | 126 | 87 | 15 |
|
| 1.88 (8) | 1.75 (9) | 1.67 (5) | 2.23 (8) | 2.16 (4) | 2.32 (6) |
|
| 1.93 | 1.80 | 1.677 | 2.28 | 2.15 | 2.33 |
| Mg | ||||||
| No. of homoleptic sites | 137 | 613 | 0 | 0 | 4 | 3 |
|
| 1.81 (6) | 1.67 (4) | — | — | 2.07 (1) | 2.23 (2) |
| Total No. of sites | 509 | 1075 | 9 | 28 | 90 | 66 |
|
| 1.78 (6) | 1.67 (4) | 1.64 (1) | 2.20 (4) | 2.11 (3) | 2.26 (2) |
|
| 1.85 | 1.69 | 1.58 | 2.18 | 2.08 | 2.28 |
| K | ||||||
| No. of homoleptic sites | 69 | 876 | 3 | 9 | 7 | 1 |
|
| 2.31 (15) | 2.10 (8) | 2.18 (6) | 2.74 (11) | 2.48 (6) | 2.69 |
| Total No. of sites | 950 | 2123 | 101 | 170 | 107 | 17 |
|
| 2.22 (8) | 2.07 (7) | 2.03 (5) | 2.62 (7) | 2.49 (4) | 2.63 (5) |
|
| 2.26 | 2.13 | 1.99 | 2.59 | 2.52 | 2.66 |
| Ca | ||||||
| No. of homoleptic sites | 53 | 475 | 0 | 0 | 0 | 0 |
|
| 2.08 (6) | 1.94 (5) | — | — | — | — |
| Total No. of sites | 316 | 801 | 2 | 15 | 31 | 14 |
|
| 2.07 (5) | 1.93 (4) | 1.89 (1) | 2.42 (4) | 2.33 (1) | 2.51 (2) |
|
| 2.14 | 1.96 | 1.84 | 2.45 | 2.37 | 2.49 |
| Zn | ||||||
| No. of homoleptic sites | 1393 | 2328 | 0 | 251 | 439 | 50 |
|
| 1.75 (3) | 1.69 (3) | — | 2.09 (2) | 2.01 (1) | 2.15 (2) |
| Total No. of sites | 9275 | 8811 | 18 | 1330 | 20549 | 403 |
|
| 1.75 (3) | 1.69 (2) | 1.64 (2) | 2.09 (2) | 2.01 (1) | 2.14 (1) |
|
| 1.77 | 1.74 | 1.62 | 2.09 | 2.01 | 2.15 |
Figure 2Distribution of distances from iron(II) to nitrogen in six-coordinated iron(II) sites with the iron reported to be either in a low-spin state (shown in blue) or in a high-spin state (shown in orange). For each iron(II) site identified by CSD Refcode and iron(II) atom label, the distances to all six nitrogen ligands are shown. Typical FeII—N distances range between 1.9 and 2.1 Å for sites with a reported low-spin iron, while typical FeII—N distances range between 2.1 and 2.3 Å for sites with a reported high-spin iron.
Figure 3Examples of Fe—N sites in the PDB, with Fe atoms shown in brown, O atoms shown in red and N atoms shown in blue. C atoms are shown in different colors for the different iron-binding sites. (a) Heme with high-spin iron(II), C atoms shown in green; (b) heme with low-spin iron(III), C atoms shown in cyan.
Typical first coordination sphere distances for Na, Mg, K, Ca and Zn derived from the converged R 0 values in Table 3 ▸
CN is the coordination number of the first coordination sphere. Octahedral geometry for Na, Mg, K and Ca and tetrahedral geometry for Zn are assumed, with equal bond-valence contributions from each atom. Distances are calculated using the R 0 values derived in this study (when sufficient validated data from the CSD were available).
| Distance (Å) | N | O | F | S | Cl | Br |
|---|---|---|---|---|---|---|
| Na (CN = 6) | 2.54 | 2.41 | 2.33 | 2.89 | 2.82 | 2.98 |
| Mg (CN = 6) | 2.19 | 2.08 | 2.05 | 2.61 | 2.52 | 2.67 |
| K (CN = 6) | 2.88 | 2.73 | 2.69 | 3.28 | 3.15 | 3.29 |
| Ca (CN = 6) | 2.48 | 2.34 | 2.30 | 2.83 | 2.74 | 2.92 |
| Zn (CN = 4) | 2.01 | 1.95 | 1.90 | 2.35 | 2.27 | 2.40 |
Typical first coordination sphere distances for Fe derived from the converged R 0 values in Table 1 ▸
CN is the coordination number of the first coordination sphere. CN = 4 is assumed for Fe—S sites and CN = 6 is assumed for iron sites interacting with non-sulfur ligands, with equal bond-valence contributions from each atom. Distances are calculated using the R 0 values derived in this study (when sufficient validated data from the CSD were available).
| Distance (Å) | N (CN = 6) | O (CN = 6) | F (CN = 6) | S (CN = 4) | Cl (CN = 6) | Br (CN = 6) |
|---|---|---|---|---|---|---|
| LS iron(II) | 1.98 | 2.11 | 2.08 | 2.33 | 2.46 | 2.62 |
| HS iron(II) | 2.17 | |||||
| LS iron(III) | 2.11 | 2.17 | 2.08 | 2.35 | 2.50 | 2.64 |
| HS iron(III) | 2.24 |
(a) PDB entry 2hhb (1.74 Å), deoxyhemoglobin, HS iron(II).
| Ligands | N1 | N2 | N3 | N4 | N5 | O6 | BVS |
|---|---|---|---|---|---|---|---|
| Distance (Å) | 2.03 | 2.06 | 2.15 | 2.15 | 2.16 | 3.39 | |
| v.u.( | 0.29 | 0.27 | 0.21 | 0.21 | 0.20 | 0.01 | 1.19 |
| v.u.( | 0.48 | 0.44 | 0.35 | 0.35 | 0.34 | 0.01 | 1.98 |
| v.u.( | 0.63 | 0.58 | 0.46 | 0.46 | 0.44 | 0.02 | 2.59 |
(b) PDB entry 1buw (1.90 Å), oxyhemoglobin, LS iron(III).
| Ligands | N1 | N2 | N3 | N4 | N5 | N6 | BVS |
|---|---|---|---|---|---|---|---|
| Distance (Å) | 1.75 | 1.99 | 2.00 | 2.01 | 2.01 | 2.26 | |
| v.u.( | 0.87 | 0.46 | 0.44 | 0.43 | 0.43 | 0.22 | 2.86 |
| v.u.( | 1.24 | 0.65 | 0.63 | 0.61 | 0.61 | 0.31 | 4.06 |
| v.u.( | 1.35 | 0.70 | 0.68 | 0.67 | 0.67 | 0.34 | 4.41 |
| v.u.( | 1.19 | 0.62 | 0.61 | 0.59 | 0.59 | 0.30 | 3.90 |