Literature DB >> 25688181

Deciphering solution scattering data with experimentally guided molecular dynamics simulations.

Alexander Björling¹, Stephan Niebling, Moreno Marcellini, David van der Spoel, Sebastian Westenhoff.

Abstract

Time-resolved X-ray solution scattering is an increasingly popular method to measure conformational changes in proteins. Extracting structural information from the resulting difference X-ray scattering data is a daunting task. We present a method in which the limited but precious information encoded in such scattering curves is combined with the chemical knowledge of molecular force fields. The molecule of interest is then refined toward experimental data using molecular dynamics simulation. Therefore, the energy landscape is biased toward conformations that agree with experimental data. We describe and verify the method, and we provide an implementation in GROMACS.

Entities: Chemical

Mesh：

Substances：
Proteins
Solutions

Year: 2015 PMID： 25688181 PMCID： PMC4325560 DOI： 10.1021/ct5009735

Source DB: PubMed Journal: J Chem Theory Comput ISSN： 1549-9618 Impact factor: 6.006

Introduction

Solution X-ray scattering techniques are important for studying the conformations of molecules in solution.[1] Time-resolved X-ray scattering has been developed to study the structural dynamics of molecular reactions. In the experiment, a reaction is triggered and scattering is recorded as a function of reaction time. Time-resolved data are displayed and interpreted as difference scattering patterns, where the scattered intensity of a reference state is subtracted from that of a certain time point during the reaction. Initially, time-resolved scattering studies focused on photoreactions of small molecules.[2−5] A few years later, the technique was also applied to proteins. First, time-resolved scattering during carboxy-hemoglobin photolysis[6] and the photoreaction of bacteriorhodopsin and proteorhodopsin[7] were reported. Several other investigations were conducted on the structural dynamics of, for example, myoglobin,[8] photoactive yellow protein,[9] and a phytochrome.[10] These studies were carried out at synchrotron sources and covered time scales from 100 ps to seconds. The availability of ultrashort X-ray bursts at free electron lasers has enabled time-resolved scattering studies with femtosecond time resolution.[11] Today, time-resolved X-ray scattering experiments on proteins can be carried out reliably, but they are usually limited by data interpretation. Difference scattering reflects a difference in pair distribution functions and thus encodes the structural change of the molecules.[5] However, the information content in the one-dimensional spherically averaged (isotropic) scattering curves is inherently low. As in conventional small-angle X-ray scattering (SAXS), a curve typically contains only tens of independent data points.[12,13] Unlike SAXS, difference X-ray scattering has the advantage that the protein hydration shell typically only gives a minor contribution to the scattering signal.[14] A drawback is that a resting state structure, the starting point of structural change, is required for structural interpretation. Structural rearrangements can be uniquely identified from difference scattering data, when only a few atoms participate in the reaction.[2−5] For proteins with many hundreds, often thousands, of atoms, fitting atomistic models without additional structural information is impossible. Earlier attempts to surmount this challenge used rigid body refinement,[7,15] dynamic annealing of pseudo atoms,[9] and selection of suitable frames from molecular simulations.[10,11] In all cases, many candidate structures are generated, the scattering of each is computed, and the best fits to the experimental scattering data are identified. Thus, sampling and comparison with experimental data are done in sequence. Since the sampling in conformational space is usually limited, uniqueness of the solutions is not guaranteed. The ideal method to identify structural change from difference scattering data would be to harvest its limited information content while simultaneously taking chemical information, such as bonded and nonbonded interactions, into account. Biomolecular force fields encode such chemical information, and are used in molecular dynamics (MD) or Monte Carlo (MC) simulations that sample thermal ensembles of proteins in solution. In structural biology, many experimental techniques are increasingly aided by molecular simulation for data interpretation. Refinement by simulation of protein structure based on X-ray diffraction (XRD) data is now commonplace.[16−21] In NMR spectroscopy, the use of MD with experimental restraints has been even more instrumental,[18,20,22−24] since such experiments have historically not supplied a sufficient amount of data to determine a protein structure from scratch. The use of data from small-molecule crystal structures,[25] restraints found from sequence and chemical shift homology,[26] and molecular mechanics force fields[27] supplied the missing information. More recently, orientational restraints from residual dipolar couplings have been used to refine protein structures from NMR data[28,29] and to evaluate the performance of unrestrained simulations.[30] Conventional SAXS data are usually interpreted by ab initio reconstruction of low-resolution envelopes or by assembly of rigid high-resolution elements.[31,32] These strategies do not benefit from the sampling power of molecular simulation. They also do not use chemical knowledge beyond rudimentary considerations to identify likely conformations. A notable exception is the attempt by Grishaev et al. to improve NMR refinement by also taking SAXS intensities into account.[33] As with NMR data, absolute SAXS intensities have also been used for after-the-fact validation of MD trajectories.[34] To our knowledge, no systematic attempts to interpret difference X-ray scattering data with atomistic force fields have been published to date. Here, we describe and implement X-ray scattering-guided molecular dynamics simulations (XS-guided MD), which is a technique that refines structures against difference scattering data. The method can be used to guide proteins in MD simulations from one structure to another, and therefore is well-suited for elucidating the structural dynamics of biomolecules. In what follows, we shall only be concerned with difference X-ray scattering data, and as explained and justified below, neglect the scattering contribution from the protein’s solvation layer. The method uses an additional pseudo-energy term, based on the Debye scattering equation, which guides the simulation toward states that reproduce target data. The energy landscape of the simulation is biased toward the experimentally observed conformation and distinctions can be made between alternative conformations with similar free energies. As a result, conformers, which are not normally seen in simulations—either because they are transient species observed in dynamic experiments, because they are kinetically inaccessible or because they are incorrectly disfavored by the force field—can potentially be observed and characterized in MD simulations. The algorithm reduces the computational time to determine the structures that agree with experiment, and retains the chemical knowledge contained in MD force fields. The method is implemented in the popular and freely available GROMACS simulations package and is open to further development.

Theory and Methods

X-ray Scattering

Identical but randomly oriented collections of spherical scatterers give rise to a scattered intensity described by the Debye formula.[35]Here, f is the scattering factor for scatterer i,[36]q is the magnitude of the scattering vector (q = 4π sin θ/λ, with 2θ the scattering angle), and r = |r| = |r – r|, with r the coordinate vector for particle i. For molecules, the sums typically run over all atoms. For an actual protein solution, one typically measures the excess scattering of the solution compared to the solvent. In such cases, eq 1 must be corrected for the solvent displaced by the protein. This can be approximately done by using appropriately modified scattering factors f(q).[37] Equation 1 ignores the fact that the solvent and electrolyte densities around a macromolecule are generally different from their bulk values, which, in principle, leads to additional scattering terms. To our knowledge, taking this into account involves either (i) making extensive and explicit simulations for each scattering evaluation[34,38,39] or (ii) making ad hoc assumptions about excess surface densities.[40,41] We ignore this effect, which is an approximation that appears to be reasonable when the data are recorded as differences between two states, ΔS(q) = S(q) – Sref(q), as is the case with time-resolved data. Then, solvation contributions largely cancel.[14]

Force Calculation

For biasing MD simulations toward configurations that agree with experimental scattering curves, following Ahn et al.,[15] an additional term (UXS) is added to the usual MD potential energy (UMD):Here, kχ and σ are weighting factors for the scattering bias and for every q-point, respectively. Scalc(q) is the scattering computed from the current state of the simulation. For refinement against difference data, ΔSexp(q) is the experimental difference curve, Sref(q) is the scattering from the starting structure and α is the fraction of the observed sample that undergoes conformational change. For refinement against absolute data, ΔSexp(q) = 0 and α = 1 while Sref(q) is the experimental target scattering. The total force on a particle k isThe first term on the right is given by the force field, so the present problem reduces to finding ∇  χ2. The initial structure and experimental data are the same throughout a simulation, so Sref(q) and ΔSexp(q) are constant and ∇Sref(q) = ∇ΔSexp(q) = 0. Thus,The last step is to determine the vector ∇Scalc(q), which, by taking the appropriate derivatives of eq 1, comes out asFinally, the force contribution from the scattering energy term acting on particle k is obtained by combining eqs 5, 6, and 7:Calculating these forces amounts to evaluating two double sums over all scatterers, first for {Scalc(q) – Sref(q)}, as in eq 1, then for the forces, as in eq 8. For convenience, we definewhich gives

Scaling of Experimental Data

When evaluating the Debye equation, the resulting intensity carries units given by the atomic scattering factors. These are often given in “electron units” (e.u.), that is, as relative scattering amplitudes referenced to a classical unit point charge.[35] If concentrations, path lengths, incoming X-ray intensities, and detector efficiencies are precisely known, experimentally measured intensities can be converted to such units. However, this is often not the case in practice, and experimental data are instead scaled to predicted curves. For absolute data, this is straightforward. Either the experimental curve is scaled to each calculated profile, or it is rescaled such that the experimental extrapolation of S(q = 0) matches the calculated value. The latter is independent of conformation (see eq 1). For time-resolved difference measurements scaling is more problematic and involves a parameter α (eq 4). If the experimental system has pure initial and final states with I(q) = S and S, and is initially distributed so that the relative populations over these states are α0 and 1 – α0, then with a relative yield α,provided that the input ΔSexp(q) has meaningful units (this can be approximately achieved by scaling the detector reading to Scalc(q) and then scaling ΔSexp(q) by the same number), the parameter α in eq 4 can be identified with the relative yield of the difference experiment.

Implementation

XS-guided MD is implemented in GROMACS 5 and, therefore, profits from the efficient core of GROMACS. Moreover, it can, in principle, be used in combination with any other featured algorithm, for example, with the Simulated Annealing Method or Replica Exchange Method, to improve sampling. The current implementation supports parallelization using OpenMP. The Debye scattering terms are represented as bonded interactions of a new type (pairs type 3). Each such interaction is explicitly specified in the topology. Any atom or virtual interaction site can act as a scatterer, leaving it up to the user to design an appropriate simulation. The Debye summation is an N2 problem that becomes computationally demanding for large biomolecules. In some cases, coarse-graining the scattering calculation[14,42] or even the molecular representation[43] is appropriate. The code allows for combinations of these choices, and we provide a tool to expedite topology construction. To speed up simulations further, the slowly varying scattering forces can be calculated less often than the other forces. We have implemented a dual time-step algorithm, as described in detail in the GROMACS 5.0 manual. We provide implementations of atomic scattering factors with and without a displaced-solvent correction,[36,37] commonly used in SAXS intensity prediction tools.[40,41] For flexibility, we also implement the library-averaged and coarse-grained scattering factors calculated by Niebling et al.[14] for amino acid[42] and MARTINI-bead[43] scatterers. In addition to structural refinement, the user can calculate Debye sums from single frames or of entire trajectories, using a tool shipped with the code.

Computational Details

All simulations were performed using GROMACS 5.0, with the XS-guided MD implementation added. The Verlet cutoff scheme was used throughout. A constant temperature of 300 K was achieved in all simulations using a modified Berendsen thermostat[44] with τ = 0.5. All bonds were constrained using the LINCS algorithm.

Dibutyl Ether

Dibutyl ether was simulated in vacuum, using the OPLS/AA force field.[45] A periodic box was used, with electrostatic cutoffs and a time step of 2 fs. The 10 ns equilibrium simulation was run directly from the initial model with velocities drawn from a Maxwell distribution corresponding to 300 K. The first 500 ps were considered equilibration and removed from the analysis. XS-guided MD simulations were run with atomistic vacuum form factors[36] against scattering data predicted using the Debye equation (eq 1).

LAO Lysine Ligation

Simulations started from crystal structures of the apo and holo forms of LAO, with PDB codes 2LAO and 1LST,[46] respectively. Residues 1–238 were included. Missing atoms were filled in using the WHAT IF web server (http://swift.cmbi.ru.nl/whatif/). The CHARMM27 force field[47,48] was used for all LAO simulations. A periodic box was used, and electrostatics were treated with PME, using fourth-order interpolation and a grid spacing of 0.12 nm. The initial models were converted and protonated with the GROMACS tool pdb2gmx. Using cubic simulation boxes initially 2 nm larger than the protein in each direction, the starting structures were first energy-minimized to a maximum force of 2000 kJ/mol/nm. The systems were energy-minimized again after the addition of TIP3P water and ions corresponding to ∼50 mM NaCl (in addition to two or three cations to neutralize the protein). The simulation boxes were equilibrated under NVT conditions for 50 ps, then subsequently under NPT conditions for 500 ps using the Parrinello–Rahman barostat (τ = 2 ps, P = 1 bar). All non-hydrogen atoms were position-restrained to their initial positions during equilibration (force constants = 1000 kJ/mol/nm2). Equilibrium simulations covering 100 ns were carried out for the apo and holo forms, the latter both with and without bound lysine. The XS-guided MD simulations were carried out in a a larger cubic box, with dimensions of 13.5 nm, separately minimized and equilibrated as above. The starting structure was the same as for the lysine-free holo simulation. Amino acid–based scattering factors[14,42] were used, centered on virtual interaction sites representing the center of mass of each residue. The target scattering was calculated from the crystal structures using the Debye equation (eq 1).

Phytochrome Photoconversion

XS-guided MD simulations of phytochrome photoconversion started from the crystal structure with PDB code 4O01.[10] The CHARMM27 force field[47,48] was used, with the chromophore co-factor added by manually adapting the parameters published by Kaminski et al.[49] to the GROMACS format. A time step of 2 fs was used for these simulations. The initial structure was energy-minimized, solvated, and equilibrated as described for LAO above, in a 25 nm cubic box, with ions corresponding to 100 mM NaCl in addition to the 48 cations necessary to neutralize the protein. XS-guided MD simulations used amino-acid scattering factors centered on the C-beta atoms, and were run against previously published experimental difference scattering data.[10]

Results

The implementation of XS-guided MD described above was validated against three test systems. The first is an illustrative theoretical experiment which explores how the conformational distribution of a small organic molecule can be biased toward reproducing theoretical scattering curves. The second is more complex, as a real protein movement is reproduced by providing theoretical scattering data corresponding to two observed crystal structures. The third case involves actual difference scattering data, and shows that XS-guided MD reproduces earlier results that were based on a knowledge-based ad hoc approach. In the following, we review the results and discuss the limitations and possibilities of the method.

Dibutyl Ether

The linear dibutyl ether molecule was first simulated without experimental constraints under vacuum for 10 ns, and the degree of openness of the carbon–oxygen chain considered. Figure 1A shows examples of the conformations encountered, and the distribution of end-to-end distances (RC1–C8) is shown in Figures 1B and 1C (black curve).

Figure 1

Dibutyl ether guided toward an open or a closed conformation based on calculated X-ray scattering. (A) Example conformations from an unrestrained simulation, with various end-to-end distances in Å, chosen at random for each distance. (B, C) Distance distributions of an unrestrained simulation (black) and of XS-guided MD simulations aiming at an open conformation (panel (B), 9.8 Å) and a closed conformation (panel (C), 5.0 Å) with kχ values as indicated (blue). Insets show average calculated scattering profiles for each kχ, as a difference relative to the open conformation. There, red curves correspond to different kχ values, while target curves are shown in black. After calculating the theoretical scattering curves of an “open” dibutyl ether conformation (RC1–C8 = 9.8 Å) and of a “closed” (RC1–C8 = 5.0 Å) dibutyl ether conformation, these curves were used as input for XS-guided MD. Figures 1B and 1C show that the artificial scattering energy (UXS) has a dramatic effect on the end-to-end distance distribution. For both simulations, increasing the coupling coefficient kχ skews the distance distribution toward the target structure. At high coupling, all conformations that do not display the degree of openness of the target conformation are avoided. Meanwhile, the calculated scattering curves also approach their targets, as shown in the insets of Figures 1B and 1C. This establishes that the method and implementation work for this simple test system, where one degree of freedom essentially determines the scattering profile.

LAO Lysine Ligation

The lysine/arginine/ornithine-binding protein (LAO) from Salmonella typhimurium undergoes large-scale domain movements when binding ligands.[46,50] Specifically, lysine binding causes one of its domains to rotate 52° about an axis formed by two hinge points located on adjacent beta strand termini. The movement is shown in Figure 2D. Reproducing this structural change constitutes a more complex, albeit hypothetical, test case for the XS-guided MD method.

Figure 2

Method validation against the lysine/arginine/ornithine-binding protein (LAO). (A) Unrestrained simulations of the apo, holo, and lysine-free holo states. (B) XS-guided MD trajectories aiming at the apo state, starting from the lysine-free holo state, with various coupling strengths kχ. The plot shows RMSD:s compared to initial and target structures, as well as the evolution of the scattering energy. (C) 25 scattering curves, extracted from the second half of the 30 kJ/mol run, together with the target curve (thick line). (D) Graphical representation of the apo (red) and holo (orange) conformations. The right-hand model also shows a trajectory view over the second half of the 30 kJ/mol run. The hinge axis is indicated.

Unrestrained MD simulations starting from equilibrated crystal structures show that both the lysine-bound holo and the unbound apo forms are stable on the 100 ns time scale (Figure 2A). In contrast, with the lysine ligand removed, the holo form becomes ill-defined and deviates from its initial conformation without reaching the apo state (Figure 2A, bottom panel). We ran simulations guided by theoretical scattering profiles toward the apo state, starting from the lysine-free holo structure. The target scattering data was computed as the scattering of the apo minus the holo state. Figure 2B shows how the structural similarity to both states, as well as the energy term U, develop over time at several coupling strengths kχ. The figure shows that guiding the simulation toward the theoretical target scattering data causes the holo-to-apo transition to readily occur. As expected, increasing the coupling parameter kχ causes the transition to happen earlier in the simulation. The final root-mean-square deviation (RMSD), with respect to the apo state, is consistent with the equilibrium apo simulation (Figure 2A), at ∼2 Å. Figure 2C confirms that typical difference X-ray scattering patterns from the simulations agree well with the target data. Figure 2D gives a structural view of the XS-guided MD simulation and, again, confirms the holo-to-apo transition suggested by the RMSD traces. Method validation against the lysine/arginine/ornithine-binding protein (LAO). (A) Unrestrained simulations of the apo, holo, and lysine-free holo states. (B) XS-guided MD trajectories aiming at the apo state, starting from the lysine-free holo state, with various coupling strengths kχ. The plot shows RMSD:s compared to initial and target structures, as well as the evolution of the scattering energy. (C) 25 scattering curves, extracted from the second half of the 30 kJ/mol run, together with the target curve (thick line). (D) Graphical representation of the apo (red) and holo (orange) conformations. The right-hand model also shows a trajectory view over the second half of the 30 kJ/mol run. The hinge axis is indicated. Interestingly, if even stronger coupling to scattering data is imposed, the simulation fails to find the apo state (data not shown). This indicates that the scattering energy term contains barriers that become insurmountable on the time scale simulated here, if kχ is set too high. The ability of XS-guided MD to quickly reproduce the apo state of LAO, based solely on a difference scattering curve, shows that the curve contains enough information to bias the macromolecular force field toward the correct conformation. It also shows that XS-guided MD provides sufficient sampling to find the correct structure in a relatively short simulation, at least for this particular system.

Phytochrome Photoconversion

As a final test system, we applied the XS-guided MD algorithm to the bacterial phytochrome from Deinococcus radiodurans. The dimeric photosensory module of this light-sensing protein was recently shown to undergo dramatic structural change during photoconversion.[10] Upon illumination with red light, the dark-adapted form, labeled the Pr state, undergoes a series of Ångström-scale structural transformations that amplify and ultimately cause the homodimer to partially open in a nanometer-scale movement, forming the so-called Pfr state. As a result, the distance between the globular PHY (phytochrome-specific) domains on opposing monomers increases. In ref (10), we reported dark and illuminated crystal and solution structures, which showed that the opening motion is larger in solution than in the crystals.[10] Simulations guided by X-ray scattering were run starting from the illuminated crystal structure of the photosensory core from the D. radiodurans phytochrome.[10] The simulations were biased toward an experimental difference scattering curve corresponding to the Pr-to-Pfr transition, by comparing the calculated scattering profiles of the simulation to that of the proposed Pr solution structure. This data was previously used to propose the solution structure for the illuminated Pfr state.[10] Figure 3A shows that guiding the simulation toward experimental scattering data causes the distance between the phytochrome dimer’s opposing PHY domains to grow.[51] As the PHY–PHY distance approaches the value previously found,[10] the global conformation of the dimer reproduces the Pfr solution structure. Indeed, the structural overlap shown in Figure 3C illustrates that these structures are identical at the low-resolution level. Thus, XS-guided MD simulations are capable of refining solution structures based on an initial model and experimental X-ray difference scattering data.

Figure 3

Method validation against the D. radiodurans bacterial phytochrome. (A) The center-of-mass distances between the globular PHY domains (residues 330–445 and 480–503) on opposing monomers, as well as the evolution of the scattering energy term; the initial scattering energy was 300 kJ/mol. (B) Theoretical difference scattering during the second half of the 4 ns run. (C) Structural views of the Pr and Pfr solution structures,[10] and a trajectory view over the second half of the run, superimposed on the Pfr target structure. We note that, while the application to phytochrome photoconversion validates the XS-guided MD method as such, and it lends confidence to the nature of the previously proposed structural change,[10] the magnitude of the opening between the PHY domains cannot be rigorously defined at very large openings. This was already recognized in ref (10) and does not affect the conlusions drawn in that paper; however, it does illustrate a potential problem in refinement against difference X-ray solution scattering data. The parameter α, which describes the experimental conversion efficiency and is needed for the correct scaling of calculated to experimental data, was estimated based on the original analysis, where α was arbitrarily scaled for best fit to data.[10] In fact, the absolute size of the difference signal, and therefore the precise value of α, affects the degree to which the phytochrome dimer opens up in XS-guided MD refinement. This is illustrated in Figure 4, which shows predicted peak positions for difference scattering curves based on the previously published trajectories.[10] As the dimer opening increases, the peak first shifts along the q-axis, and then, after an initial opening to ∼5.5 nm, instead grows in magnitude. Thus, experimental determination of the yield parameter α is important for successful structural refinement.

Figure 4

Magnitude and position of the ΔS peak at q ≅ 1/nm for various PHY−PHY distances. The data are taken from the trajectories presented by Takala et al.[10]

Discussion

Using three test cases, ranging from small and theoretical to large and experimental, we have shown that XS-guided MD simulations may serve as a tool to structurally interpret difference solution X-ray scattering data. We have focused on the use of difference scattering data, as encountered in time-resolved X-ray scattering experiments of proteins in solution. However, the method could, in principle, be equally well-applied to absolute data, provided that attention is paid to the limitations of the Debye equation (eq 1) in predicting absolute X-ray scattering. In contrast to macromolecular crystallography and SAXS, which have seen the development of rigorous methodologies in the last decades,[31,52−55] there is no established way of structurally interpreting difference scattering data. The algorithm presented here may provide a starting point for such development. The applicability of the method relies on its ability (i) to calculate X-ray scattering with adequate accuracy, (ii) to distinguish between the true target conformation and other conformations using the combined knowledge of the chemical force field and the experimental data, and (iii) to sample enough conformations to find the target structure within a reasonable simulation time. We found that requirement (iii) is satisfied for all test systems tried here, usually after simulating ∼10 ns. Regarding requirement (i), it is noted that a shortcoming of the Debye equation (eq 1), where the sums run only over the protein atoms, is that it does not account for the solvation layer scattering. However, when computing difference X-ray scattering data, this effect can often be neglected.[14] This is the case for the phytochrome photoconversion investigated here. The second requirement, uniqueness of the target structure, is more critical. First, we note that a certain feature in a difference scattering pattern is more likely to reflect a unique structural rearrangement when it occurs at q values that describe the molecular envelope (for proteins, this region is typically described by q ≲ 2 nm–1). Difference scattering in this q-range reflects large-scale protein motions. All the test cases used here can be described by collective motions along a small number of degrees of freedom. For example, the phytochrome rearrangement can be thought of as an increase in distance between the PHY domains, while the structural change of LAO can be seen as rotation around an axis. These structural processes affect the overall shape of the protein and the difference scattering pattern of LAO is dominated by peaks at q ≤ 2 nm–1, while the phytochrome difference scattering has a distinct feature at q = 1 nm–1. Consequently, unique structural fits can be obtained. At higher q values, multiple candidate structures are more likely to produce overlapping features in difference scattering patterns and they must be discriminated using geometrical or chemical constraints. In the intermediate q range (2 nm–1 < q < 8 nm–1), typical structural dynamics would involve the rearrangement of secondary or tertiary structural elements. Geometric constraints, such as used in crystallography for dihedrals, angles, and bond lengths, are not likely to be effective in discriminating candidate structures. Instead, the full range of covalent, dispersion, and electrostatic interactions described by molecular force fields must be considered, and the resulting energy landscape sampled in the appropriate ensemble. In previous reports, these considerations were made ad hoc. For example, helix movements, hand-picked based on previous knowledge, were refined against difference scattering features at 2 nm–1 < q < 6 nm–1.[7] In the approach presented here, the required interactions and realistic sampling are inherent in the MD simulations. We consider this to be an important step toward unbiased structural interpretation of time-resolved X-ray scattering data. Finally, we note that some protein structural changes may be difficult to observe with X-ray scattering. For example, the conformational change upon photolysis of the hemoglobin–carbon monoxide bond corresponds to an RMSD value of 4 Å, but has a relatively small effect on the molecular envelope. Therefore, it affects the X-ray scattering signal at low q only weakly. We have tested a refinement of the hemoglobin structural change against the difference scattering patterns published in ref (6); however, even though the scattering energy decreased and stabilized rapidly, the RMSD observables did not reach the target values. This indicates that structures exist, which describe the difference scattering data of hemoglobin well, but which do not agree with structural changes derived from crystallographic structures. The situation arises because the experimental data contain too little information, and the demands on the force field and sampling become too high. In principle, molecular force fields should be able to distinguish between artificial and true target structures; however, in practice, this would require long and repeated simulations, beyond what is practically possible.

Conclusion

We expect that our implementation of XS-guided MD constitutes a founding step in the development of a systematic methodology for the structural interpretation of solution scattering data. The program is integrated in the GROMACS software package, which implies that users will have access to the features from the next release.

43 in total

1. Structural principles governing domain motions in proteins.

Authors: S Hayward
Journal: Proteins Date: 1999-09-01

2. The Xplor-NIH NMR molecular structure determination package.

Authors: Charles D Schwieters; John J Kuszewski; Nico Tjandra; G Marius Clore
Journal: J Magn Reson Date: 2003-01 Impact factor: 2.229

Review 3. Structural characterization of proteins and complexes using small-angle X-ray solution scattering.

Authors: Haydyn D T Mertens; Dmitri I Svergun
Journal: J Struct Biol Date: 2010-06-15 Impact factor: 2.867

4. Refinement of multidomain protein structures by combination of solution small-angle X-ray scattering and NMR data.

Authors: Alexander Grishaev; Justin Wu; Jill Trewhella; Ad Bax
Journal: J Am Chem Soc Date: 2005-11-30 Impact factor: 15.419

5. Protein structure determination from NMR chemical shifts.

Authors: Andrea Cavalli; Xavier Salvatella; Christopher M Dobson; Michele Vendruscolo
Journal: Proc Natl Acad Sci U S A Date: 2007-05-29 Impact factor: 11.205

6. Protein backbone angle restraints from searching a database for chemical shift and sequence homology.

Authors: G Cornilescu; F Delaglio; A Bax
Journal: J Biomol NMR Date: 1999-03 Impact factor: 2.835

7. Three-dimensional structures of the periplasmic lysine/arginine/ornithine-binding protein with and without a ligand.

Authors: B H Oh; J Pandit; C H Kang; K Nikaido; S Gokcen; G F Ames; S H Kim
Journal: J Biol Chem Date: 1993-05-25 Impact factor: 5.157

8. Torsion angle dynamics for NMR structure calculation with the new program DYANA.

Authors: P Güntert; C Mumenthaler; K Wüthrich
Journal: J Mol Biol Date: 1997-10-17 Impact factor: 5.469

9. Implementation of the CHARMM Force Field in GROMACS: Analysis of Protein Stability Effects from Correction Maps, Virtual Interaction Sites, and Water Models.

Authors: Pär Bjelkmar; Per Larsson; Michel A Cuendet; Berk Hess; Erik Lindahl
Journal: J Chem Theory Comput Date: 2010-01-25 Impact factor: 6.006

10. MARTINI bead form factors for the analysis of time-resolved X-ray scattering of proteins.

Authors: Stephan Niebling; Alexander Björling; Sebastian Westenhoff
Journal: J Appl Crystallogr Date: 2014-06-14 Impact factor: 3.304

17 in total

Review 1. The Structural and Functional Diversity of Intrinsically Disordered Regions in Transmembrane Proteins.

Authors: Rajeswari Appadurai; Vladimir N Uversky; Anand Srivastava
Journal: J Membr Biol Date: 2019-05-28 Impact factor: 1.843

2. SAXS-Oriented Ensemble Refinement of Flexible Biomolecules.

Authors: Peng Cheng; Junhui Peng; Zhiyong Zhang
Journal: Biophys J Date: 2017-04-11 Impact factor: 4.033

Review 3. The role of small-angle scattering in structure-based screening applications.

Authors: Po-Chia Chen; Janosch Hennig
Journal: Biophys Rev Date: 2018-10-10

4. Integrating solvation shell structure in experimentally driven molecular dynamics using x-ray solution scattering data.

Authors: Darren J Hsu; Denis Leshchev; Irina Kosheleva; Kevin L Kohlstedt; Lin X Chen
Journal: J Chem Phys Date: 2020-05-29 Impact factor: 3.488

5. Resolving Dynamics in the Ensemble: Finding Paths through Intermediate States and Disordered Protein Structures.

Authors: Adam K Nijhawan; Arnold M Chan; Darren J Hsu; Lin X Chen; Kevin L Kohlstedt
Journal: J Phys Chem B Date: 2021-11-08 Impact factor: 3.466

6. Deconvoluting Protein (Un)folding Structural Ensembles Using X-Ray Scattering, Nuclear Magnetic Resonance Spectroscopy and Molecular Dynamics Simulation.

Authors: Alexandr Nasedkin; Moreno Marcellini; Tomasz L Religa; Stefan M Freund; Andreas Menzel; Alan R Fersht; Per Jemth; David van der Spoel; Jan Davidsson
Journal: PLoS One Date: 2015-05-06 Impact factor: 3.240

7. Unfolding bovine α-lactalbumin with T-jump: Characterizing disordered intermediates via time-resolved x-ray solution scattering and molecular dynamics simulations.

Authors: Darren J Hsu; Denis Leshchev; Irina Kosheleva; Kevin L Kohlstedt; Lin X Chen
Journal: J Chem Phys Date: 2021-03-14 Impact factor: 3.488

8. Integrating an Enhanced Sampling Method and Small-Angle X-Ray Scattering to Study Intrinsically Disordered Proteins.

Authors: Chengtao Ding; Sheng Wang; Zhiyong Zhang
Journal: Front Mol Biosci Date: 2021-04-15

9. Sampling Enrichment toward Target Structures Using Hybrid Molecular Dynamics-Monte Carlo Simulations.

Authors: Kecheng Yang; Bartosz Różycki; Fengchao Cui; Ce Shi; Wenduo Chen; Yunqi Li
Journal: PLoS One Date: 2016-05-26 Impact factor: 3.240

10. Unraveling low-resolution structural data of large biomolecules by constructing atomic models with experiment-targeted parallel cascade selection simulations.

Authors: Junhui Peng; Zhiyong Zhang
Journal: Sci Rep Date: 2016-07-05 Impact factor: 4.379