| Literature DB >> 36207470 |
Daisuke Fujinami1,2, Seiichiro Hayashi1, Daisuke Kohda3.
Abstract
Multiprobe measurements, such as NMR and hydrogen exchange studies, can provide the equilibrium constant, K, and rate constants for forward and backward processes, k and k', of the two-state structural changes of a polypeptide on a per-residue basis. We previously found a linear relationship between log K and log k and between log K and log k' for the topological exchange of a 27-residue bioactive peptide. To test the general applicability of the residue-based linear free energy relationship (rbLEFR), we performed a literature search to collect residue-specific K, k, and k' values in various exchange processes, including folding-unfolding equilibrium, coupled folding and binding of intrinsically disordered peptides, and structural fluctuations of folded proteins. The good linearity in a substantial number of the log-log plots proved that the rbLFER holds for the structural changes in a wide variety of protein-related phenomena. Among the successful cases, the hydrogen exchange study of apomyoglobin folding intermediates is particularly interesting. We found that the residues that deviated from the linear relationship corresponded to the α-helix, for which transient translocation had been identified by other experiments. Thus, the rbLFER is useful for studying the structures and energetics of the dynamic states of protein molecules.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36207470 PMCID: PMC9546931 DOI: 10.1038/s41598-022-21226-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Proteins used for the generation of REFER plots.
| Protein/domain | Conditions of measurement | Methoda | Assumption | Ref | Source |
|---|---|---|---|---|---|
| Nukacin ISK-1 | 27 res, 3 mono S bonds, 308 K, pH 3.5 | 15N-EXSY | [ | Figure 3b | |
| Drk SH3N | 59 res, 287 K, pH 6.0 | 15N-EXSY | [ | Figures 3 and 6 | |
| 293 K, pH 6.0 | [ | Table 2 | |||
| Myb32-KIX:MLL28b | 32 res Myb and 87 res KIX + 28 res MLL, 303 K, pH 7.0 | 15N-RD (CPMG) + titration | Two states | [ | Figure 5 |
| pKID-KIX | 34 res pKID and 87 res KIX, 303 K, pH 7.0 | 15N-RD (CPMG) + titration | Two statesc | [ | Table 1 |
| Fyn SH3 G48M | 59 res, 288 K, pH 7.0 | 15N-RD (CPMG) | Two statesd | [ | Figure 1b |
| Fyn SH3 G48V | 59 res, 283 K, pH 7.0 | 15N-RD (CPMG) | Two statesd | [ | Figure 1c |
| Fyn SH3 mutant | 59 res, A39V/N53P/V55L, 308 K, pH 7.0 | 15N-RD (CPMG) | Two statesd | [ | Figure 1b |
| Abl1 SH3 mutant | 59 res, 313 K, pH 7.0 E7L/V21K/N23G/G48V | 15N-RD (CPMG) | Two statesd | [ | Figure 1a |
| STARD6 | 220 res, apo form, 298 K, pH 7.4 | 15N-RD (CPMG) | [ | Figure 4 | |
| Tiam2 PDZ mutant | 90 res, 298 K, pH 6.8 M978L/E979K/F982L/V987L | 15N-RD (CPMG) | [ | Table 2 | |
| Fyn SH3 G48M | 59 res, 298 K, pH 7.0 | 15N-RD (R1ρ) | Two states | [ | Table 1 |
| 2P-ERK2 kinase | 356 res, dual-phosphorylated form, 298 K, pH 7.4 | Methyl 13C-RD (CPMG) | [ | Table 2 | |
| Ubiquitin | 76 res, 1.5 and 1.8 M guanidine deuterium chloride (GdnDCl), 288 K, pH 7–9 | HX/NMR | kop and kcl are independent of pH, EX1 mechanism | [ | Table 1 |
| CspA | 70 res, 311 K, pH 6–11 | HX/NMR + NMR (saturation transfer) | kop and kcl are independent of pH, EX1 + EX2 mechanism | [ | Table 2 |
| OMTKY3 | 56 res, 303 K, pH 6–10 | HX/NMR&ESI–MS | kop and kcl are independent of pH, EX2 mechanism | [ | Table 1 |
| OMTKY3 | 56 res, 303 K, pH 10–12 | Quenched-flow HX/NMR | kop and kcl are independent of pH, EX1 mechanism | [ | Table 1 |
| Apomyoglobin | 153 res, apo form, ambient temperature, pH 7–11 | Quenched-flow HX/NMR | kop and kcl are independent of pH and have identical values at the refolding times of 0.4 and 6 ms. EX1 + EX2 mechanism | [ | Table 1 |
aRelaxation dispersion (RD) experiments using either the CPMG (Carr–Purcell–Meiboom–Gill) pulse train or a constant radiofrequency spin lock field (R1ρ).
bThe minor binding site for the pKID peptide on the KIX domain was masked by the second IDP peptide derived from the MLL (mixed-lineage leukemia) protein.
cIn the original report, the authors concluded that the three-state model was appropriate for the coupled binding and folding of the pKID peptide, Free ⇄ Intermediate (encounter complex) ⇄ Bound. Here, the exchange between the free state and the encounter complex was analyzed.
dIn the original reports, the authors concluded that the three-state model was appropriate for the exchange processes. However, we used the kinetic constants calculated assuming a two-state model to generate the REFER plots.
Figure 1Residue-based REFER plots based on the literature data listed in Table 1. (a) Nukacin ISK-1. (b) N-terminal SH3 domain from Drosophila drk protein (drk SH3N). The drk SH3N is in an exchange between unstructured state U and native folded state N. There are different data sets from the two reports. (c) Myb32 and pKID are intrinsically disordered polypeptides that bind to the KIX domain. (d) Structural fluctuations of native states were investigated by the relaxation dispersion (RD) NMR method. The SH3 domains are derived from the Fyn and Abp1 proteins. The mutations in the SH3 domains markedly increased the folding rate despite their destabilization of the folding state, and have suitable properties for the RD studies[34]. The STARD (steroidogenic acute regulatory-related lipid transfer domain) from the STARD6 protein. The PDZ (PSD-95/Dlg/ZO-1) domain from the Tiam2 (T cell lymphoma invasion and metastasis 2) protein. A dual-phosphorylated (2P-ERK2) form of the ERK2 (extracellular signal-regulated kinase 2) protein. (e) Structural fluctuations of native states investigated by the HX method. There are two measurement conditions for ubiquitin and OMTKY3 (turkey ovomucoid third domain 3). In (a)–(e), the least-square lines and data points associated with the forward direction are colored blue, and those associated with the backward direction are orange. The least-square lines with interpretable slopes between 0 and 1 are depicted as solid lines, whereas those with uninterpretable slopes less than 0 or greater than 1 are depicted as dashed lines. No least-square lines are drawn if the correlations are considered insignificant. The concentric circle represents a basin-shaped energy landscape of the structural fluctuations around the native state, N, in equilibrium with the open state, Nop.
Figure 2Relationships between log k and log k′ as the basis of rbLFER. Several types are defined according to the data point distributions. Type N shows a negative correlation, and types V and H show flattened distributions of data points with zero correlations. The vertically flattened distribution provides two least-square lines with the slopes ρ of 0 (blue) and − 1 (orange), and the horizontally flattened distribution provides those with the slopes ρ of 0 (orange) and 1 (blue) in the REFER plots. Types P and P′ are positive correlations between log k and log k′. Type nr has no relation between log k and log k′, and the REFER plot may show a weak, artificial correlation trend.
Figure 3Datapoint distributions in the log k vs. log k′ plots. In each panel, the category type defined in Fig. 2, a 95% confidence ellipse, and the flatness of distribution are shown. The flatness of the distribution is defined by f = 1-b/a, where a is the long axis and b is the short axis of the confidence ellipse. The axis ranges are set equally for the proper interpretation of the flatness of the confidence ellipses.
Figure 4Risk evaluation of linearity overestimation in the REFER plots. (a) REFER plot and log k vs. log k′ plot of nukacin ISK-1[15] as a reference. The error bars represent one standard error. (b) REFER plot and log k vs. log k′ plot artificially generated using a synthetic dataset. Assuming a 15-residue protein, 15 k and k′ values are generated as uniformly distributed random numbers between 3 and 7 and between 8 and 12, respectively. The numerical ranges used are shown as the rectangular shape in the log k vs. log k′ plot. (c) The random number generation per residue was repeated 30 times (N = 30). The error bars represent one standard error of the mean. Note that one standard deviation is 5.5 (the square root of 30) times larger than one standard error of the mean.
Figure 5REFER plot and log k vs. log k′ plot of apomyoglobin folding intermediates. (a) REFER plot using all observed residues with estimated fitting uncertainties[16]. The data points and least square lines of the log kop vs. log K plot and the log kcl vs. log K plot are blue and orange, respectively. (b) Replot. Outlier residues (green and magenta) were removed to redraw the least-square lines (blue and orange). See Supplementary Fig. S4 for details. (c) Log k vs. log k′ plot. The yellow dots are the residues contributing to the least-square lines, and the purple dots are the outlier residues in (b). The yellow group belongs to type N. The yellow dots are enclosed by a 95% confidence ellipse.
Figure 6Mapping of the outlier residues in the REFER plot on the N-state structure of myoglobin. The native holo-structure (PDB ID 2JHO) is used as the best alternative to the intermediate structures. The residues on the least-square lines in Fig. 5b are colored cyan and the outlier residues are magenta. The other residues without information are colored white. These unprobed residues include parts of helices A, G, and H due to the full protection of the amide protons, and helix F and the loops due to the lack of local secondary structures in the intermediate states. The inset shows the helical topology of the kinetic intermediate Ib. The major portions of helices A, B, G, and H are formed. The half-translucent cylinders represent partially formed helices C, D, and E. The state of helix F is unclear in the folding intermediate states because helix F is not stably packed in the apo state of apomyoglobin. The white arrow indicates the translocation of helix H in the intermediate states, Ib.