| Literature DB >> 28295691 |
Birger Dittrich1, Jens Lübben1, Stefan Mebs2, Armin Wagner3, Peter Luger2, Ralf Flaig3.
Abstract
Amino acid structures are an ideal test set for method-development studies in crystallography. High-resolution X-ray diffraction data for eight previously studied genetically encoding amino acids are provided, complemented by a non-standard amino acid. Structures were re-investigated to study a widely applicable treatment that permits accurate X-H bond lengths to hydrogen atoms to be obtained: this treatment combines refinement of positional hydrogen-atom parameters with aspherical scattering factors with constrained "TLS+INV" estimated hydrogen anisotropic displacement parameters (H-ADPs). Tabulated invariom scattering factors allow rapid modeling without further computations, and unconstrained Hirshfeld atom refinement provides a computationally demanding alternative when database entries are missing. Both should incorporate estimated H-ADPs, as free refinement frequently leads to over-parameterization and non-positive definite H-ADPs irrespective of the aspherical scattering model used. Using estimated H-ADPs, both methods yield accurate and precise X-H distances in best quantitative agreement with neutron diffraction data (available for five of the test-set molecules). This work thus solves the last remaining problem to obtain such results more frequently. Density functional theoretical QM/MM computations are able to play the role of an alternative benchmark to neutron diffraction.Entities:
Keywords: X-ray diffraction; amino acids; density functional theory; neutron diffraction; structure elucidation
Year: 2017 PMID: 28295691 PMCID: PMC5434951 DOI: 10.1002/chem.201604705
Source DB: PubMed Journal: Chemistry ISSN: 0947-6539 Impact factor: 5.236
Figure 1Static scattering for hydrogen (red) in comparison to oxygen (blue). The drop‐off with resolution is more pronounced for hydrogen as there are no core electrons. Thermal motion causes additional decay with resolution (light blue and orange lines); hydrogen scattering ceases to contribute to scattering at ca. 0.4 Å−1 with an isotropic temperature factor T=exp [−8π2⟨u 2⟩(sin2 θ/λ 2)] with u 2=0.025 Å2.
Figure 2The ONIOM high‐layer (usually the asymmetric unit content except for l‐phenylalanine on the top right) of the nine structures investigated. Depictions show experimental ortep plots after invariom refinement, including TLS+INV estimated H‐ADPs as well as refined hydrogen positions. Depictions were generated with the program molecoolQt.45 ortep plots from HAR refinements with TLS+INV estimated H‐ADPs are visually indistinguishable.
Selected crystallographic details of the nine structures studied.[a]
| Structure | Space group |
|
| Source type | Reference |
|---|---|---|---|---|---|
|
|
| 4,2 | 100 | synchrotron | [37] |
| RT | neutron | [38] | |||
|
|
| 8,2 | 100 | synchrotron | [37] |
|
|
| 4,1 | 100 | Mo Kα | [39] |
| RT | neutron | [40] | |||
|
|
| 2,3 | 25 | Mo Kα | [41] |
|
|
| 8,2 | 100 | synchrotron | [42] |
|
|
| 4,1 | 100 | synchrotron | [37] |
| RT | neutron | [12] | |||
|
|
| 8,1 | 19 | Ag Kα | [37] |
| RT | neutron | [43] | |||
|
|
| 2,1 | 100 | synchrotron | [44] |
|
|
| 4,2 | 100 | MoKα | [34] |
| 150 | neutron | [34] |
[a] The radiation used is also given. RT=room temperature. [b] The structure of l‐phenylalanine crystallizes as l‐phenylalanine‐l‐phenylalaninium formic acid.
Selected dataset characteristics for the nine structures studied.[a]
| Structure | Resolution (sin |
|
|
|---|---|---|---|
|
| 1.46 | 4.64 | 4.58 |
|
| 1.30 | 9.43 | 9.35 |
|
| 1.08 | 2.16 | 2.17 |
|
| 1.17 | 3.46 | – |
|
| 1.12 | 5.05 | 5.10 |
|
| 1.54/1.18 | 8.70/2.69 | –/3.23 |
|
| 1.35 | 4.29 | 4.29 |
|
| 1.54 | 6.25 | 6.01 |
|
| 1.13 | 6.16 | 5.82 |
[a] The weighting scheme was of 1/σ 2 in all cases. [b] INV=invariom. [c] HAR=Hirshfeld atom refinement with H‐ADPs estimated.
Statistical analysis of X−H bond lengths of the five structures where neutron data are available.[a]
| MD, | |||||
|---|---|---|---|---|---|
| Structure |
|
|
|
| No. of data |
| ASN | 0.0151, 0.0268 | – | −0.0070, 0.0103 | −0.0019, 0.0116 | 10 |
| GLN | 0.0157, 0.0136 | 0.0060, 0.0138 | 0.0060, 0.0106 | 0.0040, 0.0066 | 10 |
| SER | 0.0134, 0.0141 | 0.0260, 0.0240 | 0.0194, 0.0178 | 0.0054, 0.0074 | 7 |
| THR | 0.0426, 0.0376 | – | 0.0324, 0.0445 | 0.0052, 0.0116 | 9 |
| HYPRO | 0.0554, 0.0274 | 0.0465, 0.0255 | 0.0442, 0.0248 | 0.0006, 0.0133 | 13 |
[a] Analysis is through the mean difference MD ⟨ΔP⟩ and its population standard deviation σ pop. These values are reported for data pairs (N,X INV), (N,X HARfree), (N,X HAR_TLS+INV), and (N,O) when available. The number of data in each case is also given.
Statistical analysis of X−H bond lengths for the four remaining structures.[a]
| MD, | ||||
|---|---|---|---|---|
| Structure |
|
|
| No. of data |
| GLU | 0.0233, 0.0172 | 0.0083, 0.0138 | 0.0071, 0.0102 | 11 |
| PHE | 0.0275, 0.0345 | – | – | 11 |
| PRO | 0.0098, 0.0117 | – | 0.0088, 0.0286 | 11 |
| VAL | 0.0061, 0.0082 | −0.0034, 0.0119 | −0.0049, 0.0118 | 11 |
[a] Analysis is analogous to Table 3. Values are reported for data pairs (O,X INV), (O,X HARfree), and (O,X HAR_TLS+INV), if available; that is, ONIOM results (O) serve as reference. The number of data in each case is also given. HAR convergence failed for cases where values are not provided.
Values of the square root of the wRMSD, taking into account only X−H bonds (X=C,N,O).[a]
|
| ||||
|---|---|---|---|---|
| Structure |
|
|
|
|
| ASN | 2.21 | – | 1.62 | 5.45 |
| GLN | 2.50 | 3.48 | 2.73 | 1.69 |
| SER | 2.24 | 4.20 | 3.29 | 8.92 |
| THR | 3.99 | – | 5.92 | 1.66 |
| HYPRO | 6.13 | 5.67 | 6.23 | 2.22 |
| GLU | 3.21 | 1.88 | 1.65 | – |
| PHE | 2.42 | – | – | – |
| PRO | 2.85 | – | 5.16 | – |
| VAL | 1.30 | 2.27 | 2.29 | – |
[a] Values are again provided for data pairs (N,X INV), (N,X HARfree), (N,X HAR_TLS+INV), and (N,O), for the structures where neutron data (N) are available. For the remaining structures benchmark values (B) are taken from ONIOM, leading to (O,X INV), (O,X HARfree), and (O,X HAR_TLS+INV) comparisons. The number of data points is as in Tables 3 and 4.
X−H bond lengths of individual invarioms characterized with the mean difference MD ⟨ΔP⟩ to and its population standard deviation σ pop.[a]
| Invariom | MD, | ||
|---|---|---|---|
|
|
|
| |
| H1c[1o1c1c] | 0.0430 (0.0090) {2} | 0.0230 {1} | 0.0200 (0.0080) {2} |
| H1n[1.5c1h] | 0.0015 (0.0216) {4} | ‐0.0020 (0.0160) {2} | 0.0030 (0.0050) {4} |
| H1n[1c1h1h] | 0.0332 (0.0315) {12} | 0.0117 (0.0113) {6} | 0.0160 (0.0378) {12} |
| H1c[1c1h1h] | 0.0503 (0.0315) {6} | 0.0380 (0.0014) {3} | 0.0272 (0.0344) {6} |
| H1c[1c1c1h] | 0.0147 (0.0219) {12} | 0.0184 (0.0160) {10} | 0.0107 (0.0191) {12} |
| H1c[1c1c1c] | 0.0230 (0.0109) {5} | 0.0197 (0.0150) {3} | 0.0142 (0.0131) {5} |
| H1°[1c] | 0.0587 (0.0354) {4} | 0.0853 (0.0122) {3} | 0.0625 (0.0244) {4} |
| H1°[1 h] | 0.0417 (0.0356) {4} | 0.0725 (0.0045) {2} | 0.0337 (0.0371) {4} |
[a] Values are provided for data pairs N,X INV, N,X HARfree, and N,X HAR_TLS+INV in the top part of the table and O,X INV, O,X HARfree, and O,X HAR_TLS+INV in the bottom. The number of data contributing in each case is also given in square brackets. When only one value is available, σ pop is not given.