| Literature DB >> 26523828 |
Fang Li1, Alexander Grishaev2, Jinfa Ying1, Ad Bax1.
Abstract
Accurate quantitative measurement of structural dispersion in proteins remains a prime challenge to both X-ray crystallography and NMR spectroscopy. Here we use a model-free approach based on measurement of many residual dipolar couplings (RDCs) in differentially orienting aqueous liquid crystalline solutions to obtain the side chain χ1 distribution sampled by each residue in solution. Applied to the small well-ordered model protein GB3, our approach reveals that the RDC data are compatible with a single narrow distribution of side chain χ1 angles for only about 40% of the residues. For more than half of the residues, populations greater than 10% for a second rotamer are observed, and four residues require sampling of three rotameric states to fit the RDC data. In virtually all cases, sampled χ1 values are found to center closely around ideal g(-), g(+) and t rotameric angles, even though no rotamer restraint is used when deriving the sampled angles. The root-mean-square difference between experimental (3)JHαHβ couplings and those predicted by the Haasnoot-parametrized, motion-adjusted Karplus equation reduces from 2.05 to 0.75 Hz when using the new rotamer analysis instead of the 1.1-Å X-ray structure as input for the dihedral angles. A comparison between observed and predicted (3)JHαHβ values suggests that the root-mean-square amplitude of χ1 angle fluctuations within a given rotamer well is ca. 20°. The quantitatively defined side chain rotamer equilibria obtained from our study set new benchmarks for evaluating improved molecular dynamics force fields, and also will enable further development of quantitative relations between side chain chemical shift and structure.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26523828 PMCID: PMC4665082 DOI: 10.1021/jacs.5b10072
Source DB: PubMed Journal: J Am Chem Soc ISSN: 0002-7863 Impact factor: 15.419
Figure 1Enhanced accuracy of 1DCβHβ couplings through spectral editing of the 1H–13C CT-HSQC spectrum of randomly fractionally (75%) deuterated GB3. (A) Small region of the regular 1H–13C HSQC spectrum, 1H-coupled in the 13C dimension, showing the signals of both 13CH2 and 13CHD isotopomers, the latter upfield shifted due to the deuterium isotope effect. (B) Same spectral region, recorded with the DEPT-filtered 1H–13C CT-HSQC experiment (SI Figure S2), which selects 13CH signals and effectively suppresses 13CH2 isotopomers. (C) Comparison of the sum of 13Cβ–1Hβ2(Dβ3) and 13Cβ–Hβ3(Dβ2) couplings derived from the DEPT-filtered HSQC spectra of wild-type GB3 in Pf1 and isotropic solutions, with the corresponding summed couplings derived from CT-HN(COCA)CB spectra. Blue: K4A/K19E/V42E-CHis6 in Pf1; red: K19A/V42E/D47K in Pf1; green: K4A/K19E/V42E in Pf1; pink: wild-type GB3 in bicelle; gray: wild-type GB3 in PEG; black: wild-type GB3 in Pf1. Spectra were recorded at 900 MHz 1H frequency in 99.8% D2O (1H–13C HSQC) and at 600 MHz in 95% H2O for the CT-HN(COCA)CB experiment.
Figure 2Examples of spectral quality used for deriving 1DCβCγ couplings. (A) Small section of the methyl region of the 1H–13C HSQC spectrum of wild-type GB3 in Pf1 medium, recorded with regular (non-CT) 13C evolution (red), superimposed on the corresponding spectrum recorded under isotropic conditions (blue). (B) Small region of the 1Hβ–13Cβ region of the 1H–13C CT-HSQC spectrum, recorded with a REBURP 180° pulse covering only the 13C aliphatic region during the CT 13C evolution of the aliphatic region of wild-type GB3 in Pf1 medium (red) superimposed on the corresponding spectrum recorded under isotropic conditions (blue). The spectra were recorded with a double constant-time duration (56 ms) at 900 MHz. (C) Comparison of 1DCαCβ and the sum of the three other Cβ–related 1D couplings, which all have been scaled to 1DaCH (measurements for wild type GB3: pink, in bicelle; blue, in PEG; green, in Pf1).
Figure 3Structure of GB3, derived from NMR data. The backbone coordinates are presented as a ribbon diagram, with side chains depicted as sticks for the Cα–Cβ and Cβ–Cγ1 (Cβ–Oγ for Thr) bonds. For residues with the RDC-derived χ1 rotamer distributions, shown positions of the side chains (stick models) are obtained by 20 random samplings from the models reported in Table . Spread within each rotamer corresponds to the Monte Carlo-derived rms uncertainty of the best-fitted χ1 values as reported in Table .
Analysis of χ1 Rotamer Distributions in GB3
| residue | model | ρ(1) | σρ(1) | χ̅1(1) | σχ1(1) | ρ(2) | χ̅1(2) | σχ1(2) | ρ(3) | χ̅1(3) | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| M1 | 11 | 2 | <10–4 | 0.53 | 0.04 | 306 | 12 | 0.47 | 203 | 15 | – | – |
| Q2 | 10 | 2 | <10–4 | 0.69 | 0.02 | 176 | 5 | 0.31 | 265 | 11 | – | – |
| Y3 | 15 | 1 | 0.59 | 1.0 | – | 298 | 1 | – | – | – | – | – |
| K4 | 8 | 2 | <10–4 | 0.67 | 0.02 | 183 | 3 | 0.33 | 293 | 5 | – | – |
| L5 | 7 | 1 | 0.98 | 1.0 | – | 173 | 1 | – | – | – | – | – |
| V6 | 11 | 1 | 0.89 | 1.0 | – | 178 | 1 | – | – | – | – | – |
| I7 | 10 | 1 | 0.39 | 1.0 | – | 299 | 1 | – | – | – | – | – |
| N8 | 12 | 2 | <10–4 | 0.58 | 0.01 | 190 | 2 | 0.42 | 296 | 3 | – | – |
| K10 | 5 | 3 | 0.58 | 0.04 | 293 | 3 | 0.23 | 187 | 3 | 0.19 | 57 | |
| T11 | 10 | 3 | 0.09 | 0.63 | 0.03 | 54 | 3 | 0.28 | 282 | 7 | 0.09 | 162 |
| K13 | 10 | 3 | 0.005 | 0.67 | 0.02 | 302 | 2 | 0.16 | 62 | 12 | 0.16 | 160 |
| E15 | 8 | 3 | 0.004 | 0.44 | 0.01 | 285 | 3 | 0.34 | 40 | 4 | 0.22 | 163 |
| T16 | 10 | 2 | <10–4 | 0.74 | 0.01 | 186 | 2 | 0.26 | 37 | 8 | – | – |
| T17 | 10 | 2 | <10–4 | 0.76 | 0.01 | 65 | 1 | 0.24 | 195 | 7 | – | – |
| T18 | 10 | 1 | 0.10 | 1.0 | – | 197 | 1 | – | – | – | – | – |
| V21 | 13 | 2 | 0.0002 | 0.60 | 0.05 | 302 | 3 | 0.40 | 68 | 4 | – | – |
| D22 | 15 | 1 | 1.0 | 1.0 | – | 61 | 1 | – | – | – | – | – |
| E24 | 10 | 2 | <10–4 | 0.53 | 0.02 | 284 | 6 | 0.47 | 182 | 5 | – | – |
| E27 | 10 | 2 | 0.07 | 0.93 | 0.03 | 185 | 3 | 0.07 | 286 | 34 | – | – |
| K28 | 5 | 1 | 1.0 | 1.0 | – | 305 | 3 | – | – | – | – | – |
| F30 | 15 | 1 | 0.35 | 1.0 | – | 289 | 1 | – | – | – | – | – |
| K31 | 10 | 2 | <10–4 | 0.84 | 0.01 | 285 | 2 | 0.16 | 181 | 9 | – | – |
| Q32 | 5 | 2 | 0.05 | 0.72 | 0.04 | 174 | 6 | 0.28 | 297 | 15 | – | – |
| Y33 | 14 | 1 | 0.009 | 1.0 | – | 179 | 1 | – | – | – | – | – |
| N35 | 15 | 2 | <10–4 | 0.52 | 0.01 | 184 | 2 | 0.48 | 288 | 3 | – | – |
| D36 | 15 | 2 | 0.04 | 0.95 | 0.11 | 290 | 4 | 0.05 | 188 | 40 | – | – |
| N37 | 15 | 1 | 1.0 | 1.0 | – | 285 | 1 | – | – | – | – | – |
| V39 | 15 | 2 | <10–4 | 0.83 | 0.02 | 178 | 2 | 0.17 | 324 | 9 | – | – |
| D40 | 15 | 2 | <10–4 | 0.69 | 0.13 | 179 | 11 | 0.31 | 237 | 15 | – | – |
| V42 | 9 | 2 | 0.0005 | 0.83 | 0.02 | 171 | 1 | 0.17 | 35 | 6 | – | – |
| W43 | 11 | 1 | 1.0 | 1.0 | – | 287 | 1 | – | – | – | – | – |
| T44 | 10 | 2 | 0.001 | 0.84 | 0.02 | 48 | 1 | 0.16 | 270 | 6 | – | – |
| Y45 | 15 | 1 | 1.0 | 1.0 | – | 178 | 2 | – | – | – | – | – |
| D46 | 15 | 2 | 0.0007 | 0.90 | 0.03 | 179 | 1 | 0.10 | 30 | 17 | – | – |
| D47 | 15 | 2 | <10–4 | 0.82 | 0.02 | 189 | 3 | 0.18 | 98 | 12 | – | – |
| T49 | 10 | 1 | 1.0 | 1.0 | – | 63 | 2 | – | – | – | – | – |
| K50 | 10 | 1 | 1.0 | 1.0 | – | 299 | 1 | – | – | – | – | – |
| T51 | 5 | 1 | 0.92 | 1.0 | – | 298 | 1 | – | – | – | – | – |
| F52 | 15 | 1 | 1.0 | 1.0 | – | 287 | 1 | – | – | – | – | – |
| T53 | 10 | 2 | 0.06 | 0.90 | 0.03 | 301 | 2 | 0.10 | 86 | 25 | – | – |
| V54 | 15 | 2 | 0.0003 | 0.81 | 0.04 | 57 | 2 | 0.19 | 205 | 7 | – | – |
| T55 | 6 | 1 | 0.06 | 1.0 | – | 303 | 1 | – | – | – | – | – |
| E56 | 9 | 2 | <10–4 | 0.56 | 0.02 | 286 | 7 | 0.44 | 184 | 8 | – | – |
NRDC is the total number of experimental RDCs available for side chain rotamer analysis, with a maximum of 5 per bond vector, as the 6th RDC is a linear combination of the other 5.
Model refers to the number of discrete χ1 rotamers required to fit the data.
P refers to the probability, extracted from F statistics. For cases where model 1 is selected, P refers to the probability that model 1 is more applicable than model 2; for cases where model 2 is selected, P refers to the probability that model 2 is not better than model 1; for cases where model 3 is selected, P refers to the probability that model 3 is not better than model 2.
Fractional population of rotamer (n).
Uncertainty in fractional population of rotamer 1.
χ1 angle of rotamer (n), degrees.
Uncertainty in χ1 angle of rotamer (n), degrees.
With Hβ and Hβ having the same chemical shifts, only 6 RDCs for the sum of the Cβ–Hβ couplings were available. Because these RDCs were incompatible with 1- or 2-rotamer distributions with energetically favorable χ1 angles, a 3-rotamer fit was carried out while restricting the χ1 angles to fall within 10° from 60, 180 and 300°.
Although F statistics suggests model 3, model 2 is in better agreement with 3JHαHβ than model 3.
Although F statistics suggests the presence of a 7% populated g+ rotamer, g+ would result in severe steric clashing and is therefore excluded.
Values obtained for this residue may be less accurate as they were derived under the assumption of a static Cα–Cβ bond orientation, whereas the backbone of this residue is known to undergo substantial dynamics.[24,53]
Figure 4Distributions of χ1 torsion angles for the three Asn residues in GB3, derived from RDC data, (A) Asn8, (B) Asn35, (C) Asn37 obtained with the program VW-Fit from an ensemble of 36 conformers, with identical backbone coordinates but the side chain χ1 angles ranging from 0° to 350° in 10° increments. Corresponding distributions for all residues are presented in SI Figure S6. The positive weight of the entropy term, θ, was adjusted such as to increase the total RDC component of the error function (eq ) by 2% over not using this term, resulting in a smoothing of the distribution profile (maximum entropy) and improved convergence of the simulated annealing protocol. The correlations between observed and predicted RDCs for these residues are shown in panel (D) Asn8, (E) Asn35, (F) Asn37.
Figure 5Best fits between measured and predicted RDCs values for K13 under six different protein orientations for (A) single-rotamer model, (B) 2-rotamer model, (C) 3-rotamer model. All RDCs are normalized relative to the 13C–1H dipolar couplings, i.e., 1DCβCγ values are scaled up 10-fold, and all alignment strengths are scaled to DaNH = 10 Hz for the backbone RDCs. Colors correspond to the differently aligned samples (red: K19A/V42E/D47K GB3 in Pf1; green: K4A/K19E/V42E GB3 in Pf1; pink: wild-type GB3 in bicelles; gray: wild-type GB3 in PEG; black: wild-type GB3 in Pf1). Circles correspond to 1DCβHβ couplings. (D–F) The best-fitted K13 χ1 rotamers for the 1-, 2-, and 3-rotamer models.
Figure 6Effect of uncertainty in the structure and side chain RDC data on the χ1 distributions, obtained by Monte Carlo rotamer analysis, of (A) K4, (B) K13 and (C) K50. The heavy line corresponds to the results of the fits to the experimental data, using the newly refined backbone structure and 13Cα–13Cβ orientations. The thin lines (obscured by the black line for K50) correspond to 10 separate fits obtained when perturbing the 13Cα–13Cβ orientations by 3.2° structural noise, and using the predicted RDCs of the initial fit to which Gaussian noise has been added that results in the same quality of fit as obtained for the experimental data (see text), presenting a visualization of the uncertainty in the derived rotamer populations. A full set is presented in SI Figure S9.
Figure 7Comparison of observed 3JHαHβ couplings with those predicted by the substituent-specific Karplus equations of Haasnoot et al.[67] The solid line corresponds to x = y; the dashed line corresponds to values predicted by these same equations after adding Gaussian fluctuations (cf. eq ) with a standard deviation of σ = 20° to the χ1 angles reported in Table . Note that to a very good approximation, the effect of σ = 20° simply scales the difference in the predicted 3JHαHβ relative to a value of ca. 6 Hz (Figure S11). The rmsd between observed and predicted 3JHαHβ values (using the σ = 20° motional model) is 0.75 Hz. 3JHαHβ values (Table S16) for residues with a single χ1 rotamer are in red, two χ1 rotamers in blue, and three rotamers in pink.