Samuel Lenton1, Marco Grimaldo2, Felix Roosen-Runge3, Frank Schreiber4, Tommy Nylander5, Roger Clegg6, Carl Holt7, Michael Härtlein8, Victoria García Sakai9, Tilo Seydel8, Susana C Marujo Teixeira10. 1. Environment, Physical Sciences and Applied Mathematics (EPSAM), Keele University, Staffordshire, United Kingdom; Institut Laue-Langevin, Grenoble, France; Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom. 2. Institut Laue-Langevin, Grenoble, France; Institut für Angewandte Physik, University of Tübingen, Tübingen, Germany. 3. Institut Laue-Langevin, Grenoble, France; Department of Chemistry, Division of Physical Chemistry, Lund University, Lund, Sweden. 4. Institut für Angewandte Physik, University of Tübingen, Tübingen, Germany. 5. Department of Chemistry, Division of Physical Chemistry, Lund University, Lund, Sweden. 6. Hannah Research Institute, Ayr, Scotland, United Kingdom. 7. Institute of Molecular, Cell and Systems Biology, University of Glasgow, Glasgow, United Kingdom. 8. Institut Laue-Langevin, Grenoble, France. 9. ISIS Facility, Rutherford Appleton Laboratory, Chilton, Didcot, United Kingdom. 10. Department of Chemical and Biomolecular Engineering, University of Delaware, Newark, Delaware; The NIST Centre for Neutron Research, National Institute of Standards and Technology, Gaithersburg, Maryland. Electronic address: scm5@nist.gov.
Abstract
The last decade established that the dynamic properties of the phosphoproteome are central to function and its modulation. The temporal dimension of phosphorylation effects remains nonetheless poorly understood, particularly for intrinsically disordered proteins. Osteopontin, selected for this study due to its key role in biomineralization, is expressed in many species and tissues to play a range of distinct roles. A notable property of highly phosphorylated isoforms of osteopontin is their ability to sequester nanoclusters of calcium phosphate to form a core-shell structure, in a fluid that is supersaturated but stable. In Biology, this process enables soft and hard tissues to coexist in the same organism with relative ease. Here, we extend our understanding of the effect of phosphorylation on a disordered protein, the recombinant human-like osteopontin rOPN. The solution structures of the phosphorylated and unphosphorylated rOPN were investigated by small-angle x-ray scattering and no significant changes were detected on the radius of gyration or maximum interatomic distance. The picosecond-to-nanosecond dynamics of the hydrated powders of the two rOPN forms were further compared by elastic and quasi-elastic incoherent neutron scattering. Phosphorylation was found to block some nanosecond side-chain motions while increasing the flexibility of other side chains on the faster timescale. Phosphorylation can thus selectively change the dynamic behavior of even a highly disordered protein such as osteopontin. Through such an effect on rOPN, phosphorylation can direct allosteric mechanisms, interactions with substrates, cofactors and, in this case, amorphous or crystalline biominerals.
The last decade established that the dynamic properties of the phosphoproteome are central to function and its modulation. The temporal dimension of phosphorylation effects remains nonetheless poorly understood, particularly for intrinsically disordered proteins. Osteopontin, selected for this study due to its key role in biomineralization, is expressed in many species and tissues to play a range of distinct roles. A notable property of highly phosphorylated isoforms of osteopontin is their ability to sequester nanoclusters of calcium phosphate to form a core-shell structure, in a fluid that is supersaturated but stable. In Biology, this process enables soft and hard tissues to coexist in the same organism with relative ease. Here, we extend our understanding of the effect of phosphorylation on a disordered protein, the recombinant human-like osteopontinrOPN. The solution structures of the phosphorylated and unphosphorylated rOPN were investigated by small-angle x-ray scattering and no significant changes were detected on the radius of gyration or maximum interatomic distance. The picosecond-to-nanosecond dynamics of the hydrated powders of the two rOPN forms were further compared by elastic and quasi-elastic incoherent neutron scattering. Phosphorylation was found to block some nanosecond side-chain motions while increasing the flexibility of other side chains on the faster timescale. Phosphorylation can thus selectively change the dynamic behavior of even a highly disordered protein such as osteopontin. Through such an effect on rOPN, phosphorylation can direct allosteric mechanisms, interactions with substrates, cofactors and, in this case, amorphous or crystalline biominerals.
Osteopontin (OPN) is one of the main regulators of mineralization in hard and soft tissues, as well as biofluids (1, 2, 3, 4, 5). In bovine milk, the most abundant phosphoform is OPN1-149, formed by cleavage at or around residue 149 (6). In human urine, a much less phosphorylated peptide known as “uropontin” appears to be the most common form (7, 8). In mineralized tissues multiple short OPN fragments are found, as a result of the action of the protease PHEX (9). Phosphorylated OPN has been shown to mediate the intrafibrillar mineralization of collagen and activation of osteoclasts (10, 11), as well as nucleation, growth, and precipitation of calcium phosphate (12, 13, 14). The function of OPN is highly reliant on tissue-specific posttranslational processing (see for example (15, 16)), including phosphorylation by the Golgi FAM20C (17, 18) and other kinases, N- and O-glycosylation (4), and proteolysis (19). A variety of structural methods, including nuclear magnetic resonance (NMR) spectroscopy, synchrotron radiation circular dichroism, and small-angle x-ray scattering (SAXS), have been deployed to study various osteopontinpeptides (20, 21, 22, 23, 24).A critical point of regulation in physiological biomineralization is the initial formation of an amorphous hydrated phase of calcium phosphate, susceptible to control by interactions with phosphoproteins (25, 26, 27). OPN contains multiple sites of phosphorylation, which can cluster in the so-called phosphate centers (PCs): a sequence of 10 or fewer consecutive residues containing at least three sites of phosphorylation, no cysteines and fewer than three hydrophobic residues, as per the definition used in Holt et al. (22). Bovine OPN1-149 can sequester amorphous calcium phosphate through interactions involving its three PCs, forming nanostructures in which the core is surrounded by a shell of the phosphopeptides (28). Similar structures can be formed by casein phosphopeptides containing a single PC (see for example (29)).
Phosphorylation of intrinsically disordered proteins
The majority of mammalian proteins are phosphorylated (30) and phosphorylation sites are frequently found in disordered regions of protein structure (31). Phosphorylation serves dynamic cellular processes well (32, 33, 34), due to its reversibility and fast kinetics. Intrinsically disordered proteins (IDPs) can gain regular secondary structure or fold into a more compact state upon phosphorylation, with implications for their properties and function (35). Osteopontin generally shows random coil-like behavior, but quailOPN contains distinct local secondary structure elements with reduced conformational flexibility (36). Hyperphosphorylation of Tau, a known substrate of the protein kinase CK2 (37), has been associated with amyloid fibril formation in Alzheimer’s disease (38) and even subtle changes in conformational dynamics can dictate amyloidogenicity (39). In some Tau fragments, phosphorylation promotes the formation of, or stabilizes, poly-L-proline type II α-helical structures, while elsewhere in the sequence phosphorylation has no apparent structural effect (40). Furthermore, phosphorylation can also promote disorder (41): studies of kinase substrates (42) highlighted that phosphorylation by CK2 can stabilize helix unfolding.Phosphorylation and disordered conformations are common among proteins controlling biomineralization (22, 43, 44). The modulating role of phosphorylation on the biological functions of OPN and other proteins is widely recognized, allowing for OPN to interact with prenucleation clusters (45), step-specific interactions with crystal surfaces, and by altering mineral interfacial energies (46, 47) where the extent of phosphorylation has also been shown to have an impact (48). The mapping of specific effects of phosphorylation are nevertheless poorly understood, partly due to difficulties in determining phosphorylation stoichiometry (see, for example, (49)), but mostly due to the challenges of producing phosphoproteins in high yields with quantitative and complete phosphorylation at specific sites. With significant progress in both fields—for a discussion, see, for example, Oza et al. (50)—the time is ripe for studies of phosphorylation effects at different time and length scales (51, 52, 53). Here we compare the SAXS solution structures of phosphorylated and unphosphorylated rOPN. Information on the internal dynamics of rOPN hydrated powders was collected on two neutron spectrometers to cover a broad frequency range. In the hydrated powder state, the center-of-mass diffusion of rOPN is suppressed, thus providing unambiguous access to the internal molecular fluctuations probed by the hydrogen atoms, uniformly distributed throughout the rOPN sequence. Data were obtained from elastic (EINS) and quasi-elastic (QENS) incoherent neutron scattering (54). EINS provides mean square displacements, a geometrical signature of the dynamical confinement, while QENS allows vibrational and diffusive dynamics to be distinguished.
Materials and Methods
Protein expression, purification, and preliminary characterization
Phosphorylated and unphosphorylated rOPNpeptides were produced. A comparison to the bovine OPN1–149 peptide guided the design of a human-like rOPN (Fig. S1). Albeit with limited success in terms of controlling the phosphorylation sites, the rOPN sequence was chosen with the aim of decreasing the number of PCs, to produce a homogeneous functional sample. Disorder predictions and a charge-hydropathy analysis of rOPN was carried out using the PONDR VLX2 algorithm (PONDR.com), which allowed comparisons to be made with sets of folded and unfolded proteins (see Supporting Material for more details).Unphosphorylated rOPN was produced by overexpression of a construct in Escherichia coli as described in Clegg and Holt (55). Phosphorylated rOPN was coexpressed with a constitutive serine/threonine kinase—mostly serine (42)—with increased expression and activity in numerous cancers (56). A second plasmid contained an IPTG-inducible α-subunit of casein kinase on a pACYC Duet vector. The latter includes a chloramphenicol resistance gene, allowing for dual selection for OPN and kinase clones, in the presence of their respective antibiotics. For both rOPN forms, a final purification step was carried out by size exclusion chromatography, using a 24 mL column packed with Superdex S75 resin (GE Healthcare, Little Chalfont, UK), preequilibrated with two column-volumes of buffer (50 mM phosphate pH 7.5, 300 mM NaCl and 0.02% NaN3: where M denotes molar concentrations in mol/L). Electron spray ionization mass spectrometry measurements were carried out on the two rOPN forms, at the Grenoble platform service of the Partnership for Structural Biology (Grenoble, France).The ability of the phosphorylated rOPN peptide to stabilize calcium phosphate nanoclusters, like the native OPN, was tested using dynamic light scattering (DLS): samples of 10 mg/mL were prepared in phosphate buffer, as per the method of Holt et al. (22), and allowed to stabilize. DLS of the corresponding solution was measured on a Zetasizer unit (Malvern Instruments, Malvern, UK), with an incident wavelength of 633 nm. At least 10 consecutive measurements of the correlation time of the scattered light intensity were performed and averaged for each sample. The data were fitted using the Malvern analysis software package (Malvern Instruments).
Limited proteolysis by proteinase K
Limited proteolysis was carried out on the rOPNpeptides with proteinase K (Sigma-Aldrich, St. Louis, MO), according to the method of Denning et al. (57). Equine skeletal muscle myoglobin (Sigma-Aldrich) was also lysed for comparison with a globular protein of similar molecular weight (17 kDa) to the unphosphorylated rOPN. A 5 μg sample of each protein was incubated with 100 ng of proteinase K at 37°C. Aliquots were removed after 1, 2, 5, 10, 15, 30, and 60 min and the proteolysis was stopped by boiling the aliquots dispersed in sodium dodecylsulfate (SDS) loading buffer. They were then loaded onto an SDS-polyacrylamide gel electrophoresis (PAGE) (12% polyacrylamide) gel and stained with Coomassie Blue.
1D 1H HET-SOFAST NMR
NMR spectra of phosphorylated and unphosphorylated rOPN were measured on a Varian 600 MHz spectrometer (Agilent Technologies, Santa Clara, CA) at the Institut de Biologie Structurale (Grenoble, France). The 1D1HHET-SOFAST technique allows for a semiquantitative characterization of the polypeptide disorder in solution, by comparison to empirical thresholds defined as benchmarks for protein compactness (58) through a measurement of the 1H-1H spin diffusion or NOE effect (λNOE). All measurements were performed at room temperature, with 150 μM protein in a solution containing 200 mM NaCl and 50 mM phosphate buffer, pH 7.
SAXS
SAXS measurements were performed on the rOPNpeptides, using an online HPLC system (Viscotek GPCmax; Malvern Instruments) at the BM29 Bio-SAXS beamline of the European Synchrotron Radiation Facility (Grenoble, France). A 2.4 mL Superdex 200 Increase 5/150 GL column (GE Healthcare) was used and calibrated with protein standards (Bio-Rad, Hercules, CA). Before each run, the column was equilibrated with two column-volumes of buffer (50 mM phosphate pH 7.5, 300 mM NaCl, 0.02% NaN3). A sample of 100 μL was loaded onto the column and eluted at a flow rate of 0.1 mL/min. The elution was monitored by absorption at 280 nm.X-ray scattering data were collected on the eluate as a function of the scattered wave vector (q = 4πsinθ/λ), where 2θ is the scattering angle and the q range covered was 0.08–4 nm−1. Data were collected at a wavelength λ of 0.99 Å and at 1 frame every 2 s. Radially integrated, calibrated and normalized one-dimensional scattering profiles were obtained with the EDNA software pipeline (59). All repeat frames were compared to monitor radiation damage and exclude outliers from the averaging. Buffer frames were also averaged for background subtraction. Radii of gyration (R) were extracted from the x-ray data for the rOPNpeptides. The Guinier equation was used to fit the data close to the zero scattering angle:where I(0) is the forward scattering intensity. A Kratky plot of q2I(q)/I(0) as a function of q was also used as a qualitative estimation of protein compactness (60). The real-space pair distribution function P(r) was calculated by indirect Fourier transform using GNOM from the ATSAS package (61). The ensemble optimization method was used to produce an ensemble of structures, taking into account protein flexibility by allowing the coexistence of several different conformational isomers in the scattering solution (62). A pool of 10,000 random models was generated, based on the primary structure of the rOPNpeptides. A genetic algorithm was then employed to select the 50 models that best fit the scattering curve.
Incoherent neutron scattering
Neutron spectroscopy provides the possibility to explore the hierarchy of geometrically confined, superimposed motions as a function of (reciprocal) observation length and time, and a direct verification of molecular dynamics simulations that provide space-time trajectories. Data were collected on two spectrometers: IRIS (63), at the ISIS pulsed Neutron and Muon source (Chilton, UK) and IN16B (64), at the Institut Laue Langevin (ILL, Grenoble, France). IRIS and IN16B cover similar momentum transfer q ranges of 0.3–1.85 and 0.1–1.8 Å−1, respectively, but differ in energy resolution: 17 μeV (150 ps, IRIS) and 0.8 μeV (4 ns, IN16B). Monitor normalization was performed for all data, and the detectors were calibrated by using the sample signal from the lowest-temperature data.The rOPN samples were in the form of D2O-hydrated powders, obtained through three cycles of drying under vacuum and resuspension in D2O. Each sample contained 100 mg of protein powder, with 0.44 g of D2O per gram of protein (determined by interrupting the drying and weighting of the sealed sample to monitor when the desired weight was obtained). Fixed window elastic scans were measured while the temperature was continuously increased from 10 to 310 K at a rate of 1 K/min. The sample was sealed in a flat aluminum cell with an internal spacing of 0.3 mm, at an angle of 135° with respect to the incident beam. The so-called apparent mean square displacements (MSD; 〈u2〉) were obtained from analysis of the integrated intensity over the frequency window corresponding to the instrumental resolution around the elastic peak. The incoherent scattering from the rOPN nonlabile hydrogen atoms largely dominates the EINS signal: average MSD of the hydrogen atoms were extracted from the q-dependence of the scattered intensity (energy transfer ω), using a q4 model correction (65) to the Gaussian approximation:The distinctly nonvanishing a2
q4 term illustrates the deviation of the observed motion from the Gaussian approximation, significantly improving the fit to the data and allowing for a maximum of the measured q2 information to be used in the fits: 0.19–3.4 Å−2 for IRIS data, and 0.19–1.6 Å−2 for IN16B data.Quasi-elastic spectra at selected temperatures were also recorded at both the IRIS and IN16B spectrometers. The choppers and the Doppler drive were set to allow for an energy transfer range of ±30 μeV on IN16B, whereas the chopper settings on IRIS allowed for energy transfers of ±500 μeV. For IRIS data, the empty cell signal introduced errors (most likely due to nonmatching geometrical orientations) if subtracted. As the empty cell Debye-Waller factor of the aluminum in the q-dependence is a very small contribution, this data was not used. It is, however, represented by the elastic signal—a free parameter in the QENS fits. It may slightly affect the result for the fraction of immobile atoms (Fig. S11), but again its contribution can be assumed to be nearly constant in q.
Results and Discussion
Recombinant protein expression and levels of phosphorylation
Phosphorylated and unphosphorylated rOPN were purified to homogeneity, as determined by SDS-PAGE and Western blots. The deconvoluted electrospray ionization time-of-flight mass spectrum of the unphosphorylated rOPN produced one clear peak at 18.115 kDa, consistent with the theoretical molecular weight. This excludes the possibility of prokaryotic phosphorylation—for a review, see Macek et al. (66)—and validates the use of E. coli as a heterologous expression host for studies of CK2 phosphorylation of rOPN. The equivalent spectrum for the phosphorylated rOPN sample (Fig. S8) showed a distribution of peaks, differing by multiples of the mass of a phosphate group. With the information currently available on CK2 recognition frequencies or patterns, often biased by kinase priming of phosphorylation by another kinase, inferring a trend in the results would be speculative. The rOPN samples were designed assuming a distinct preference by a consensus sequence (see also the Supporting Material) that, by itself, is not consistent with the higher incidence of 4–9 phosphorylation sites. For the purposes of comparing phosphorylation effects on the structure and dynamics of the two rOPN isoforms, however, this is not a central issue.The size of calcium phosphate nanoclusters formed by phosphorylated peptides is not only dependent upon phosphorylation but also the sequence and length of the chain surrounding the phosphorylated region. We observed that phosphorylated rOPN forms calcium phosphate nanoclusters (CPN) in a solution that is supersaturated with respect to the bone and tooth mineral hydroxyapatite. Notwithstanding this, the solution is stable because it is undersaturated with respect to the obligate precursor amorphous phase (22, 26). The solutions of the nanocluster complexes remained clear after storage for prolonged periods and the absence of aggregation was confirmed by the DLS results (Fig. S9). The size of the nanoclusters agrees with the 22-nm radius of gyration of CPN previously determined for bovine OPN1-149 (22), consistent with the mode of calcium phosphate binding being similar for rOPN.
Flexibility of rOPN structure in solution
Phosphorylation of rOPN did not show a significant effect on the rate of protein degradation by proteinase K, consistent with no change in the extent of ordered structure. The susceptibility of rOPN to proteolysis is in clear contrast to that of the globular protein myoglobin (Fig. S6). Analysis of the 1H NMR chemical shift dispersion of both forms of rOPN further supports highly flexible unfolded protein structures (Fig. S7). These results are also reflected in the hydrodynamic behavior, computational predictions, and migration behavior on SDS-PAGE gels.SAXS data of the phosphorylated and unphosphorylated rOPN are shown in Fig. 1, A and B, respectively. In Fig. 1
C, the radii obtained from the Guinier plots are found consistent with those expected for IDPs, according to the Flory equation (67), which postulates a power-law relationship between the radius of gyration R and the number N of residues in a polymer:where R0 is a constant that varies with the persistence length of the peptide and ν is an exponential scaling factor constant. For chemically unfolded proteins (the R of bovine native OPN 1–149 was previously found (68) to match the predicted radius using these parameters), ν and R0 were previously determined (69) to be 0.6 ± 0.03 and 1.93 ± 0.28, respectively. Similarly, for globular proteins, accurate predictions were also made using values of ν and R0 equal to 0.39 and 0.35, respectively (used in the comparison shown in Fig. 1
C).
Figure 1
Guinier plots for phosphorylated (A) and unphosphorylated (B) rOPN. R and I0 were recovered from the fits of the straight lines where (q × R) ≤ 1.3. (C) Expected R values for chemically denatured proteins were determined using Eq. 3, where the region between dashed lines represents the confidence interval of the Flory equation. The diamonds show experimental R obtained for phosphorylated (darker) and unphosphorylated rOPN (lighter). The lower continuous gray line shows the expected R value for globular proteins. (D) P(r) distributions of phosphorylated (black) and unphosphorylated (gray) rOPN, from which the R and Dmax were obtained. (E) Kratky plots are shown (same color scheme as D).
The P(r) distributions gave broadly similar results for both forms of rOPN; the asymmetrical shape of the curves is characteristic of IDPs (see Fig. 1
D; Table 1). I0 was used to produce Kratky plots (Fig. 1
E), which display a plateau at high q from Gaussian chain behavior, characteristic of highly unfolded or flexible proteins (70). Phosphorylation appears to have minimal effect on the solution scattering of rOPN, reflecting an overall unchanged conformation at the SAXS resolution. Given the distribution of phosphorylation sites, a certain degree of averaging of the structural parameters is plausible. To fully assess this effect, further studies on phosphorylated peptides are required. The ensemble-optimization method data for phosphorylated and unphosphorylated rOPN both display highly extended and flexible structures. The corresponding R and maximum particle size, Dmax, for the two forms of rOPN, are shown in Table 1. Fig. 2 shows the respective ensemble fits.
Table 1
Parameters from Analysis of the SAXS Data on rOPN
Phosphorylated
Unphosphorylated
Guinier Rg (Å)
39.0 ± 0.6
38.6 ± 0.6
I0
13.4 ± 0.1
11.6 ± 0.1
Rg (Å) from P(r)
39.8 ± 0.2
39.4 ± 0.3
Dmax (Å) from P(r)
129 ± 12
131 ± 13
The confidence level for the uncertainties is 95%.
Figure 2
Ensemble optimization analysis of the HPLC-SAXS profile measured for rOPN peptides. (A) R and (B) Dmax distribution for the random ensemble (solid black line), phosphorylated rOPN (dashed), and unphosphorylated rOPN (dots) is shown. The fits to the corresponding scattering curves are shown in (C) for phosphorylated rOPN and (D) unphosphorylated rOPN.
Internal protein dynamics observed by neutron scattering show a change only above the nanosecond timescale upon phosphorylation
The apparent MSDs obtained for rOPN are shown in Fig. 3: the linear increase with temperature reflects the onset of motions faster than the resolution of the spectrometers used. A steeper increase at ∼200 K is typical of a dynamical transition associated with the onset of an anharmonic regime, often observed for hydrated protein powders. Within noise level, the MSDs are mostly similar for both forms of rOPN over nearly the entire temperature range measured. Above ∼280 K, a small difference in the apparent MSDs was detected for the IN16B data, and prompted the collection of full QENS spectra. These were recorded on IRIS at 300 K, and on IN16B at temperatures of 280 K and lower (Fig. 4). No differences can be observed between the two forms of rOPN at 200 K, within experimental error. Minor differences are only apparent at 280 K and for |ħω| < 10 μeV, at timescales of ∼300 ps (beyond what could be observable on the 150-ps scale of the IRIS data). At 300 K (Fig. S10), broader quasi-elastic wings are noticeable for phosphorylated rOPN. The following model was used to fit both IRIS and IN16B spectra:where R denotes the instrument energy resolution function, determined by fitting the spectra of a vanadium foil (IN16B), or the sample itself at the lowest temperature (10 K on IRIS), with a sum of Gaussian functions. L(Γ,ω) is a Lorentzian function with half-width at half-maximum Γ accounting for the rOPN internal molecular dynamics. Center-of-mass diffusion was ignored, given that the samples used are hydrated powders. A0(q)δ(ω) models the elastic contribution in the accessible time window, where A0(q) is the elastic incoherent structure factor (EISF) that yields information on the geometry of confinement of the atoms within the protein. The widths Γ are plotted as a function of q2 in Fig. 5. The fits to the IRIS spectra at 300 K show a trend of larger linewidths for the phosphorylated sample.
Figure 3
Apparent MSDs of the two forms of rOPN, obtained from polynomial non-Gaussian fits to elastic fixed window temperature scans performed: (A) on the IRIS and (B) on the IN16B spectrometers. (Both insets) Elastic scattering intensities as a function of q2, at 305 K, are given. The point symbols show data binned along the temperature-axis to 5 K-intervals. The error bars correspond to 1 SD of uncertainty. Note that on the (B) inset, the error bars are smaller than the point symbols.
Figure 4
Example QENS spectra measured on IN16B (q = 1.7 Å−1), and q-wise fit (Eq. 4) at two temperatures, for unphosphorylated (uP) and phosphorylated (P) rOPN, normalized to their maximum for a better comparison. For clarity, the data are rebinned for |ħω| > 1 μeV. The wings of the spectra—|ħω| > 15 μeV—are the same for both rOPN forms at both temperatures, within the error bars (1 SD of uncertainty).
Figure 5
Comparison of the widths for the two rOPN forms. (A) From fits to IN16B spectra: for 200 and 250 K data, the fits (Eq. 4) are straight lines. For the 280 K data, a jump diffusion model was used (Eq. 5). The points at the two highest q values are not shown (signal-to-noise ratio too high to be included in a reliable data analysis). (B) IRIS spectra and corresponding fits (Eq. 4) at 300 K are given. The lines are guides to the eye. The error bars correspond to 1 SD of uncertainty.
For the IRIS data at 300 K (Fig. S10), as well as for the IN16B data at 200 and 250 K (Fig. 5), no particular q dependence was observed on the range. A constant can be fitted to the QENS data with an average relaxation time τ. The observation of a q-independent width may indicate that side-chain motions are restricted to localized rotations at low temperature, at the accessible timescale. At 300 K, we obtain relaxation times of 17.1 ± 3.8 ps and 29.6 ± 1.7 ps for phosphorylated and unphosphorylated rOPN, respectively. This confirms that the phosphorylation state influences the internal dynamics of rOPN on the subnanosecond timescale (see also Fig. S11) and that, on average, a shorter time elapses between two side-chain jumps for phosphorylated rOPN, when compared to the unphosphorylated form.At 280 K, a more marked q dependence is observed, which can be fitted by a jump-diffusion model (71):where D is the jump-diffusion coefficient. Such a model was used in previous studies (72, 73) to describe the dynamics of protein side chains on the nanosecond and picosecond timescales. For 200 and 250 K data, the linewidth was fitted as a global parameter. After the observations on IN16B at 280 K, a fit was performed for all q simultaneously to keep the error on the parameters as low as possible, with the scattering function:where D and τ are global parameters for all scattered wave vectors. Jump-diffusion coefficients of (4.3 ± 2.9) × 10−6 and (2.3 ± 0.9) × 10−6 cm2/s were obtained from the fits for the unphosphorylated and phosphorylated rOPN, respectively. The confidence of the fit is insufficient for quantitative conclusions on a possible variation of D with phosphorylation. It is, however, noteworthy that both D coefficients do show observed internal dynamics approximately one order-of-magnitude faster than the center-of-mass motion of typical proteins (74), and one order-of-magnitude slower than the diffusion of water (75).The EISF from Eq. 5 is plotted in Fig. 6 for both rOPN forms. It describes an atom in a Gaussian radial energy landscape (76) with effective radius a:where p is the fraction of immobile atoms (immobile will be used, hereon, not in an absolute sense but in the context of the instrumental timescales used). We obtain an average value of 3.5 Å for the radius a (see Fig. S11), regardless of temperature and phosphorylation state. The fraction of immobile atoms can be easily deduced from the high-q value of A0, and is shown in Fig. 6. For both rOPN forms, p is closer to 1 at the lowest temperature and decreases toward higher temperatures at the nanosecond resolution. At 280 K, where the difference between EISFs of the two rOPN forms becomes clear, the phosphorylated protein has a larger number of immobile atoms on the resolution of IN16B. On the much shorter IRIS resolution timescale (at 300 K), upon phosphorylation, the number of immobile rOPN side chains decreases. The difference in absolute values between the apparent MSD data from IRIS and IN16B is then due to the different populations of motions sampled.
Figure 6
EISF, A0, for the two forms of rOPN from fits (lines) by Eq. 7. The fraction of immobile atoms, at the resolution times of each spectrometer, is shown for each plateau: (A) IN16B QENS spectra (∼4 ns); see also Fig. S11 for more information on the fit parameters. (B) IRIS spectra (∼15 ps). The effective radius a is 2.63 ± 0.86 Å and 2.71 ± 0.36 Å for unphosphorylated and phosphorylated rOPN, respectively. The data point error bars correspond to 1 SD of uncertainty.
Overall, for phosphorylated rOPN, before applying any model we observe a larger broadening of the QENS than for the unphosphorylated rOPN, recorded on both IRIS at 300 K and IN16B at 280 K (Figs. S4 and S10). This observation is consistent with the fits of the Lorentzian Γ in Eq. 4, which show a larger Γ for the phosphorylated samples at the aforementioned temperatures. The increased flexibility is further supported by the shorter residence times at 250 and 280 K for the phosphorylated rOPN (Fig. S12). The results obtained provide further insights into the motional landscape of rOPN upon phosphorylation:The geometrical confinement (MSDs and p values) and relaxation of dynamics are unchanged at lower temperatures upon phosphorylation. At 280 K, phosphorylation appears to induce slightly more extensive confinement of dynamics. This could be an effect of the larger immobile fraction (with effectively very small confinement).We see a dynamically heterogeneous sample, as reflected by the q4 term from EINS fits and the separation into immobile and mobile atoms at 75- and 4-ns timescales.Slower dynamics: for immobile atoms, a large part of the hydrogen atoms is not moving considerably even on nanosecond timescales (>67% at 280 K), and it increases by ∼7% (0.73–0.67) at 280 K upon phosphorylation. The difference likely comes from the serine side chains directly affected. For mobile atoms, phosphorylation speeds up the dynamics of mobile atoms, presumably to compensate for the entropic cost of immobilizing atoms. The q signature identifies these motions as jump-like below 280 K, and jump-diffusion-like for 280 K.Faster dynamics: at 300 K, phosphorylation unblocks some immobile atoms (11% = 0.87–0.76). Mobile atoms appear to have faster dynamics on the picosecond scale, and the q signature identifies the motions as mainly jump-like. Possible explanations for this are only speculative without further data, but at 300 K—closer to physiological conditions—the entropic cost of immobilizing residues upon phosphorylation may become too high for a range of rOPN conformations. Specific, preferred conformations—not sufficiently different from the ensemble for the change to be reflected on the R—may become more populated, where faster, albeit more confined movements, are allowed to further compensate for the rigidity of the PCs. The faster picosecond motions can plausibly enhance ability to interact with crystal surfaces and provide a potential advantage in interactions with prenucleation clusters—picosecond-lived species present in supersaturated solutions of calcium phosphates and other salts (45).
Conclusions
Phosphorylation effects at various length- and time-scales are paramount to IDPs, which often use allosteric mechanisms requiring somewhat long-range intramolecular communication (77). Proteins can use fast picosecond kinetics to drive allosteric changes by strategically increasing some and decreasing other side-chain mobilities (78, 79). The relationship between function and the hierarchy of timescales in protein dynamics is not well understood and there are still few studies in the literature investigating this in sufficient level of detail.In this study, no conformational changes were detected by small angle scattering on the solution structure of rOPN upon phosphorylation. This was previously observed for OPNpeptides (48), as well as other phosphorylated proteins (80). Our studies here show that, overall, phosphorylation makes side-chain motions faster, but blocks some of the slower moving residues and releases some of the faster moving ones. Similar phosphorylation effects were previously found by NMR relaxation studies (80), where dynamic changes at the active site were seemingly opposite to those observed elsewhere in the sequence.Phosphorylation of rOPN caused a rigidification of some residues on the accessible timescale. By selectively slowing nanosecond dynamics of PCs, phosphorylation affords sufficient free energy for sequestration of calcium phosphate and formation of CPN. Motions of side chains outside the PCs, on the other hand, can provide an entropic compensation for the arrest of the phosphorylated residues. The core-shell structure of CPN is highly dependent on the sequestering power of the phosphopeptide (caseins, for example, are known to form nanoclusters with a calcium phosphate core structure four times smaller than OPN).Our results are also consistent with higher (closer to physiological) temperatures being a trigger to an allosteric effect of phosphorylation, but further evidence is needed to unequivocally establish this. We have shown here that rOPN displays heterogeneous, complex internal molecular dynamics, and these are known to modulate tertiary and quaternary contacts. Effects from the extent and mapping of phosphorylation events, as well as the corresponding motional landscape, will require further studies.
Author Contributions
All authors contributed to the writing of the manuscript. S.L. prepared the recombinant peptides and characterized their structure, under the supervision of M.H., T.N., and S.C.M.T. Data were collected by S.L., T.N., and S.C.M.T., with T.S. and V.G.S. assisting. M.G., F.R.-R., T.S., V.G.S., and F.S. analyzed the incoherent neutron scattering data. R.C., C.H., T.N., M.H., and S.C.M.T. designed the samples. S.C.M.T. coordinated the research.
Authors: Jonathan E Kohn; Ian S Millett; Jaby Jacob; Bojan Zagrovic; Thomas M Dillon; Nikolina Cingel; Robin S Dothager; Soenke Seifert; P Thiyagarajan; Tobin R Sosnick; M Zahid Hasan; Vijay S Pande; Ingo Ruczinski; Sebastian Doniach; Kevin W Plaxco Journal: Proc Natl Acad Sci U S A Date: 2004-08-16 Impact factor: 11.205
Authors: Pau Bernadó; Efstratios Mylonas; Maxim V Petoukhov; Martin Blackledge; Dmitri I Svergun Journal: J Am Chem Soc Date: 2007-04-06 Impact factor: 15.419
Authors: Gerald Platzer; Andreas Schedlbauer; Angela Chemelli; Przemyslaw Ozdowy; Nicolas Coudevylle; Renate Auer; Georg Kontaxis; Markus Hartl; Andrew J Miles; B A Wallace; Otto Glatter; Klaus Bister; Robert Konrat Journal: Biochemistry Date: 2011-06-16 Impact factor: 3.162
Authors: Theodoros K Karamanos; Matthew P Jackson; Antonio N Calabrese; Sophia C Goodchild; Emma E Cawood; Gary S Thompson; Arnout P Kalverda; Eric W Hewitt; Sheena E Radford Journal: Elife Date: 2019-09-25 Impact factor: 8.140
Authors: Yafei Liu; Guanchao Ye; Bo Dong; Lan Huang; Chunyang Zhang; Yinliang Sheng; Bin Wu; Lu Han; Chunli Wu; Yu Qi Journal: Ann Transl Med Date: 2022-03
Authors: Tushar Raskar; Stephan Niebling; Juliette M Devos; Briony A Yorke; Michael Härtlein; Nils Huse; V Trevor Forsyth; Tilo Seydel; Arwen R Pearson Journal: Phys Chem Chem Phys Date: 2022-08-31 Impact factor: 3.945