Literature DB >> 25645890

Simulating the distance distribution between spin-labels attached to proteins.

Abstract

EPR/DEER spectroscopy is playing an increasingly important role in the characterization of the conformational states of proteins. In this study, force field parameters for the bifunctional spin-label (RX) used in EPR/DEER are parametrized and tested with molecular dynamics (MD) simulations. The dihedral angles connecting the Cα atom of the backbone to the nitroxide ring moiety of the RX spin-label attached to i and i + 4 positions in a polyalanine α-helix agree very well with those observed in the X-ray crystallography. Both RXi,i+4 and RXi,i+3 are more rigid than the monofunctional spin-label (R1) commonly used in EPR/DEER, while RXi,i+4 is more rigid and causes less distortion in a protein backbone than RXi,i+3. Simplified dummy spin-label models with a single effective particle representing the RXi,i+3 and RXi,i+4 are also developed and parametrized from the all-atom simulations. MD simulations with dummy spin-labels (MDDS) provide distance distributions that can be directly compared to distance distributions obtained from EPR/DEER to rapidly assess if a hypothetical three-dimensional (3D) structural model is consistent with experiment. The dummy spin-labels can also be used in the restrained-ensemble MD (re-MD) simulations to carry out structural refinement of 3D models. Applications of this methodology to T4 lysozyme, KCNE1, and LeuT are shown to provide important insights about their conformational dynamics.

Entities: Chemical Gene Species

Mesh：

Substances：

Year: 2015 PMID： 25645890 PMCID： PMC4509421 DOI： 10.1021/jp510745d

Source DB: PubMed Journal: J Phys Chem B ISSN： 1520-5207 Impact factor: 2.991

Introduction

Accurate structural information about the accessible conformations of proteins is key to understanding their function. This information is typically best obtained from high resolution X-ray crystallography or nuclear magnetic resonance (NMR) spectroscopy. X-ray crystallography relies on the successful crystallization of proteins, while NMR yields structural information for small proteins only. As an alternative, experimental biophysicists have been increasingly relying on electron paramagnetic resonance (EPR) spectroscopy, which provides structural information for large and complex protein systems in their native like environment. EPR spectroscopy requires the introduction of spectroscopic probes into the system via site-directed spin-labeling (SDSL)[1] techniques. The EPR technique along with spin-labeling probes has proved to be a very useful technique in characterizing the structure–function relationship of membrane proteins, such as ion channels and transporter proteins, as well as enzymes and receptors. Double electron–electron resonance (DEER) is a powerful-pulsed EPR technique that reports the distance distribution between a pair of spin-labels. DEER can detect spin-labels that are separated by distances as large as 80 Å. Since the spin-pair distance distribution may change with the change in conformational states, information about the three-dimensional (3D) structure and function of a protein in its native environment can be obtained from EPR/DEER spectroscopy. Although EPR/DEER is a very powerful technique, interpretation of the spin-label distributions is complicated by several factors. One of the main problems is the extremely flexible nature of the spin-label probes used in most EPR/DEER experiments. The commonly used nitroxide spin-label is MTSSL (1-oxyl-2,2,5,5-tetramethylpyrroline-3-methylmethanethiosulfonate) which is typically linked to a cysteine residue in the protein through a disulfide bond (R1 in Figure 1). The R1 spin-label possesses five dihedral angles denoted by χ1, χ2, χ3, χ4, and χ5 along the flexible bonds, Cα–Cβ–Sγ–Sδ−Cη–Cζ, and each of these dihedral angles has multiple rotameric states. The dihedral angles χ1 and χ2 can adopt 3-fold conformations, +60° (or gauche+), 180° (or trans), and −60° (or gauche), which are denoted by p, t, and m, respectively, and the dihedral angle χ3 can adopt two stable conformations, p (+90°) and m (−90°). Computational analysis and spectroscopic measurements indicate that χ4 and χ5 are very flexible, which is also consistent with the fact that no reliable X-ray crystallographic information is available for these dihedral angles. One advantage of the flexibility of R1, however, is that it can be introduced at any site within a protein including buried regions. Often the EPR/DEER histograms are broad and bimodal with multiple peaks, which also make it difficult to interpret the experimental histograms in terms of protein structure refinement. Hubble and co-workers recently introduced MTSSL linked through two disulfide bonds (RX in Figure 1).[2] The bifunctional RX spin-label can be introduced at pairs of cysteine residues at i, i + 3 and i, i + 4 positions in an α-helix and i, i + 1 and i, i + 2 positions in a β-strand. The RX spin-label side chain has a total of 10 dihedral angles with five dihedrals, denoted by χ1, χ2, χ3, χ4, and χ5, in one of the cysteine linkers and the remaining five dihedrals, denoted by χ1′, χ2′, χ3′, χ4′, and χ5′, belonging to the second cysteine linker. Only one crystal structure of the RX side chain attached to positions 115 and 119 in an α-helix of T4 lysozyme is available, which displayed all 10 dihedral angle values suggesting the RX is more rigid than the R1.[2] The EPR/DEER spin-pair distance distributions obtained from RX are also found to be narrower than R1, even in various reaction mediums such as the micelles, proteo-liposomes, and lipodisq.[3] Therefore, the analysis of the spin-pair distributions from the RX spin-label is expected to be easier than those obtained from the R1, suggesting the RX could be an important alternative to the R1 in the EPR/DEER spectroscopy. However, RX may introduce unwanted perturbations in the system due to the necessity to introduce two nearby cysteines. There remains not only a scarcity of reliable experimental data, but there also has been no computational study to understand the accessible rotameric states and the distance distributions of the RX spin-pairs in various sites in protein. Interpreting the EPR/DEER distance histogram data obtained from the RX inserted at various positions in proteins requires careful characterization of the dynamical properties of this spin-label. Computational methodologies can provide valuable insights about the dynamical properties of these spin-labels. In the case of R1, quantum mechanical ab initio methods offered very accurate energetics for various conformational states of R1.[4−6] However, these methods are generally computationally too demanding for large protein systems and they ignore thermal fluctuations. Molecular dynamics (MD) simulations based on classical force fields offered a realistic alternative strategy to understand the conformational dynamics of the R1 spin-labels.[7,8] The results obtained from the MD simulations were consistent with the available information from X-ray crystallography.[9] It is expected that MD simulations will also provide valuable information about the conformational dynamics of RX spin-labels inserted at various positions in a protein.

Figure 1

Spin-label side chains, R1 and RX, resulting from linking MTSSL to cysteine through a disulfide bond, and dummy spin-labels, OND and ONDX, which mimics the dynamics of the R1 and RX, respectively. In the case of ONDX, X = 3 or 4 depending on the position of the second cysteine residue to which it is attached. The dihedral angles connecting the Cα atom of the protein backbone to the nitroxide ring are shown for both R1 and RX. OND is parametrized by using the Cα–ON distance, Cβ–Cα–ON angle, and N–Cα–Cβ–ON dihedral angle, while one more variable, the ON–N distance from the second residue, is used to parametrize ONDX. Accurate structural information on a protein is obtained from its 3D structure, which is not possible to obtain directly from EPR/DEER observations. However, computational tools can provide a “virtual route” to link the atomic 3D structures of proteins to the experimental EPR observations. Recently, a novel computational method, the restrained-ensemble (re-MD) method,[9,10] was developed following a maximum entropy principle[11] to help refine the 3D structural model on the basis of DEER histrograms. The re-MD simulations were used to refine the outer vestibule of the KcsA ion channel protein[12] and the LeuT transporter protein.[13] The elastic network model in combination with MD simulation was also used to obtain structural models that satisfy EPR/DEER distance data.[14,15] There are also computational modeling methods, such as the multiscale modeling of macromolecular systems (MMM) software package of Yevhen Polyhach and Gunnar Jeschke[16,17] and the PRONOX algorithm of Hatmal et al.[18] The MtsslWizard computational program of Hagelueken et al.[19] provides interlabel distance distributions based on the analysis of spin-label rotamers inserted in a model protein structure. In the present study, MD simulations were performed to characterize the conformational dynamics of spin-labels RX and RX in a polyalanine α-helix. Force field parameters for the RX spin-labels have been developed, and the results from the simulations were compared to available X-ray crystallographic structures. Using the vast amount of information obtained from the simulations of RX and RX, simplified nitroxide dummy spin-labels, OND3 and OND4, respectively, are parametrized for the purpose of structural refinement. A simplified computational approach based on MD simulation of the dummy spin-labels (MDDS) has been presented which is demonstrated to provide better spin-pair distance distributions than the existing computational methods.[16−19] The MDDS simulations have been conducted on spin-labeled T4 lysozyme,[20] KCNE1,[21] and LeuT[22,23] protein systems. The results of the simulation suggest that MDDS simulations offer an effective strategy for obtaining important insights about the structure and function of various protein systems.

Methods

All molecular dynamic simulations of the spin-labeled polyalanine α-helix, T4 lysozyme, KCNE1, and LeuT were carried out with the CHARMM[24] and program package using the all-atom CHARMM36 protein force field[25] with the CMAP corrections and the force field parameters of R1 developed by Sezer et al.,[7] the OND developed by Islam et al.,[9] and the RX and ONDX developed in this study.

Molecular Dynamics Simulations of the Polyalanine α-Helix Labeled with RX

Two different systems of the polyalanine α-helix, which has a total of 18 alanine residues, is labeled at positions 8, 12 and 8, 11 with the RX spin-label, where 8, 12 and 8, 11 represent residues at positions i, i + 4 and i, i + 3, respectively. Both of the systems have a total of 232 atoms which are solvated by 7155 TIP3P water molecules within a 40 × 40 × 40 Å3 cubic box, and the salt concentration was maintained at 0.15 mM/mol by adding 7 potassium and 7 chloride ions. A weak positional harmonic restraint with a force constant of 0.5 (kcal/mol)/Å2 was used on residues 1–5 and 15–18 for the system labeled with 8-12RX and on residues 1–5 and 14–18 for the system labeled with 8-11RX to avoid any large displacement. A 1 ns equilibration simulation and a 10 ns production MD simulation were performed with an integration time step of 1 fs. Both equilibration and production simulations were performed under NPT conditions where the temperature was kept at 300 K and the pressure at 1 atm. The simulations were performed with a Langevin thermostat to control the temperature of the simulation box. A collision frequency, γ, of 5.0 ps–1 was used for the Langevin thermostat. A dielectric constant of 1.0 was used during the simulation. Bonds involving hydrogen atoms in water were constrained to their equilibrium values using the SHAKE algorithm. Periodic boundary conditions (PBCs) were imposed, and the nonbonded interactions were smoothly switched off from 10 to 12 Å using an atom-based cutoff. Long-range electrostatic behavior was controlled with the particle mesh Ewald (PME) method. A spherical harmonic restraint with a force constant of 1 kcal/mol was used to keep the center of mass of the backbone atoms of the systems near the origin of the box. From the two all-atom simulations, the force field parameters for the OND3 and OND4 are developed which are subsequently attached at positions 8, 12 and 8, 11 in the polyalanine α-helix. 500 ps equilibration and 5 ns MD production simulations were then carried out on these two systems using the same simulation parameters used in the all-atom simulations, and the statistics for various degrees of freedom involving the dummy spin-labels were compared with those obtained from the all-atom simulation.

Molecular Dynamics with Dummy Spin-Labels (MDDS)

To predict the spin-pair distance distributions of various labeled sites in T4 lysozyme, KCNE1, and LeuT, MD simulation with dummy spin-labels (MDDS) were carried out by attaching the mono- and bifunctional spin-labels, OND and OND4, respectively, in various sites in these proteins for which EPR/DEER data is available. Crystal structures of T4 lysozyme (2LZM),[20] KCNE1 (2K21),[21] and LeuT (2A65,[22]3TT1,[23] and 3TT3(23)) were used to construct the geometries of the T4 lysozyme, KCNE1, and LeuT systems for simulation. For all the simulations, the dummy spin-labels were attached directly with the Cα atom of the protein backbone and long side chain residues were truncated after the Cβ atom to avoid steric clashes with the ON labels. Truncating long side chains will not cause any drastic change in the distance distribution, since the dummy spin-label force field has a nonbond term that accounts for the influence of nearby side chains. First, two systems of T4 lysozyme are constructed each labeled with OND and OND4 at positions 109/131 and 109–113/127–131, respectively, and two systems of KCNE1 are constructed each labeled with OND and OND4 at positions 47/66 and 46–50/66–70, respectively. Since the dummy spin-labels do not interact with each other, multiple dummy atoms can be introduced to a single protein structure. Since 37 sites in total were experimentally labeled in T4 lysozyme, all of these sites were labeled with OND into a single T4 lysozyme. Similarly, the dummy OND spin-labels were linked directly to the Cα atoms of the three PDB structures of LeuT at all residues included in the EPR/DEER mutant data set. All of the simulations were performed in a vacuum under NVT at 300 K using the Langevin thermostat with a collision frequency of 10.0 ps–1. To begin, an adopted basis Newton–Raphson (ABNR) energy minimization (100 steps) and a short (10 ps) molecular dynamics simulation of the dummy ON spin-labels were performed with a time step of 0.5 fs by fixing the coordinates of all other atoms of the protein to its X-ray crystallographic structure. Finally, a 1 ns equilibration simulation and a 4 ns production MD simulation were performed by fixing the protein and using a time step of 1 fs from which the spin-pair distance distributions were calculated.

Restrained-Ensemble (re-MD) Simulation

The restrained-ensemble (re-MD) simulation[9,10] is used to match the distance distribution obtained from multiple copy spin-labels to match with those obtained from EPR/DEER distance distribution data. In this study, the dummy bifunctional spin-label OND4 was attached to positions 109–113 and 127–131 in T4 lysozyme. An ensemble of 25 replicas of OND4 was created for each of 109–113OND4 and 127–131OND4 (Figure S7, Supporting Information), which yield a total of 625 distances. An energy restraint was imposed via a large force constant, 10000 (kcal/mol)/Å2, at every step of the MD simulation so that the histogram obtained from the 625 spin–spin distances would match the experimental distance histogram obtained from the EPR/DEER. Large displacements of the protein backbone atoms were prevented by applying positional restraints with a harmonic force constant of 1 (kcal/mol)/Å2 relative to the X-ray structure. To reduce the size of the simulated system, only the water molecules within 35 Å from the center of mass of the T4L system were kept, for a total of 19410. A spherical half-harmonic containing restraint with a force constant of 0.5 (kcal/mol)/Å2 was used to keep the waters near the proteins. Both equilibration and production simulations were performed under NVT conditions where the temperatures for both replica OND4 atoms and normal atoms were kept at 300 K. The rest of the simulation parameters were kept the same as those used in the MDDS simulations. The dummy spin-labels were minimized using steepest decent and subsequently conjugate gradient algorithms. Finally, 1 ns equilibration and 5 ns re-MD simulation was performed.

Results and Discussion

Force Field Parameterization of the RX Spin-Label Side Chain

Sezer et al.[7] previously developed force field parameters for the nitroxide ring moiety and the spin-label linker of the R1 spin-label side chain (Figure 1, R1). Force field parameters for the RX spin-label side chain were developed as an extension of the previous work. RX is linked to the protein with an additional linker, which has the same atom types as the first linker. Therefore, modification and duplication of several atom types and a patch command, which links the second linker of RX with the protein, were only required to develop the force field parameters of the RX spin-label side chain. All the force field parameters for RX and the corresponding patch command are uploaded in CHARMM-GUI (http://www.charmm-gui.org),[26] which can readily build a system with the RX in i, i + 2, i, i + 3, and i, i + 4 or custom positions in a protein for a molecular dynamics simulation. The performance of the RX force field parameters was then evaluated by comparing the X-ray crystal structures of the RX with those obtained from a long MD simulation.

Comparison with X-ray Structure

The dihedral angles connecting the Cα atom of the backbone with the nitroxide ring of the RX and RX spin-labels were calculated from the MD simulation of the labeled polyalanine α-helix. The time dependence of all the dihedral angles is provided in Figures S1 and S2 (Supporting Information). The dihedral angles in RX obtained from the simulation are found to be very similar to those observed in the X-ray crystal structure of 115–119RX attached with T4 lysozyme (Table 1). The information reported in Table 1 for RX remains unchanged when the trajectory is extended to 100 ns (see Figure S9 in the Supporting Information).

Table 1

Comparison of the Dihedral Angles (in Degrees) Connecting the Cα Atom of the Backbone with the Nitroxide Spin of the RX and RX Obtained from the X-ray Crystallography and the MD Simulation

	χ₁	χ₂	χ₃	χ₄	χ₅	χ₁′	χ₂′	χ₃′	χ₄′	χ₅′
X-raya (RX_i,i+4)	–94	–61	–81	–160	105	–70	–56	106	122	–90
MD (RX_i,i+4)	–80	–65	–85	177	97	–60	–50	92	142	–97
MD (RX_i,i+3)	–60	–70	90	170	–110	–70, −170	144	80	165	72

Fleissner et al.[27]

Fleissner et al.[27] To be consistent with the convention used to represent rotamers in R1 (see R1 and RX in Figure 1), the conformations of the dihedral angles χ1, χ1′, χ2, χ2′, χ4, and χ4′ are denoted with p, t, and m to represent +60° (or gauche+), 180° (or trans), and −60° (or gauche–), respectively, and the conformations of dihedral angles χ3, χ3′, χ5, and χ5′ are denoted with p and m around +90 and −90° (=270°), respectively. Both X-ray[27] and MD simulations show that spin-label linker i has the mmmtp rotamer along the dihedral angles χ1, χ2, χ3, χ4, and χ5, respectively, while the dihedral angles χ1′, χ2′, χ3′, χ4′, and χ5′ accommodate the mmptm rotamer at linker i + 4 of RX. There is no crystal structure for RX. MD simulation of the RX reveals that the linker i has the mmptm rotamer along the χ1, χ2, χ3, χ4, and χ5, respectively, while the linker i + 3 has mtptm and ttptm rotamers along the χ1′, χ2′, χ3′, χ4′, and χ5′, respectively. χ1′ has two rotameric states, m and t, with percent rotamer populations of 74 and 26, respectively. Comparison of the rotameric states of RX and RX shows a difference in rotameric states at χ3, χ5, χ1′, and χ5′. The helical shape of the backbone to which the RX is attached shows very little change compared to the wild type; however, deviation from the helical geometry is observed for the RX near residue i. The greatest difference is found along the N–Cα–C–N dihedral angle (ψ), with the average value of ψ differing by about 37° from that of the wild type (see Figure S3, Supporting Information). These observations suggest that the structure of protein will be less disrupted by labeling with RX than RX and therefore RX is a better choice than RX. Polyalanine α-helix with (A) RX and (B) RX (left panel) and the corresponding dynamics of the nitroxide oxygen of the respective RX with respect to their Cα atoms obtained from MD simulation (right panel).

Comparison of the Dynamics of R1 and RX

In reality, the EPR/DEER signal originates from the unpaired electrons of the nitroxide atoms (ON) of the R1 and RX. Only a detailed understanding of the position and dynamics of the ON atom would be able to provide important structural insights and function of various conformational states of the protein. Due to the existence of five dihedral angles connecting the nitroxide ring moiety with the Cα atom of the protein backbone, the exact position of the ON with respect to the backbone of the protein is difficult to predict. Small changes in the rotamer population could also cause the position of the ON to change substantially. Long MD and restrained-ensemble (re-MD) simulations of R1 labeled at various sites in T4 lysozyme displayed that the ON atoms with respect to the N, Cα, and Cβ atoms of the labeled residues are distributed within a half-sphere around the Cα atom of the backbone (see Figure S4, Supporting Information).[9] The configurations of the ON atoms of the RX and RX obtained from the MD simulation of the polyalanine α-helix show that the distribution of ON atoms for both of the RX spin-labels is very constrained (Figure 2 A and B). Interestingly, the configuration of ON atoms of the RX is more distributed in the phase space than that of the RX. This again illustrates that the labeling with RX would be a better choice than RX to interpret the EPR/DEER experimental data.

Figure 2

Polyalanine α-helix with (A) RX and (B) RX (left panel) and the corresponding dynamics of the nitroxide oxygen of the respective RX with respect to their Cα atoms obtained from MD simulation (right panel).

Simplified Representation of the Spin-Labels for Structural Refinement

While the detailed atomic models of the spin-labels R1 and RX provide valuable information, their utilization in the context of structural refinement of proteins is somewhat difficult. Fairly long MD simulations may be required to allow for the conformational transitions to adequately sample all the accessible rotameric states of the spin-labels attached to a protein. This can become a cumbersome task that detracts from the central goal of computations carried out in the context where one tries to make the best use of EPR/DEER data. To circumvent this issue, a simplified spin-label side chain, OND (OND in Figure 1), was designed to reproduce the 3D spatial distribution of the nitroxide oxygen of R1 relative to the protein backbone in all-atom RE-MD simulations of T4 lysozyme based on experimental EPR/DEER data.[9] Optimal Lennard-Jones parameters of Rmin = 4 Å and Emin = −0.05 kcal/mol were determined for R1 on the basis of the observed radial distribution of the ON atom relative to any protein atom.[9] It should be emphasized that, although force field parameters are involved in defining the simplified dummy spin-labels, the latter are not true atomic models. Conceptually, the dummy particle attached to the backbone is only a statistical construct. Its purpose is to represent the average 3D spatial distribution of the nitroxide atom as it was extracted from the all-atom simulations restrained by 51 EPR/DEER distance histogram data from spin-labels inserted at 37 different positions in T4 lysozyme.[9] Thus, the resulting simplified model incorporates a great amount of detailed information about atomic spin-labels, though in a statistical average way. For example, the notion of rotameric states is no longer relevant, and specific interactions with nearby side chains are not taken into account directly (apart from the single Lennard-Jones center ascribed to the ON atom). While various subtle effects may have some importance for the configuration of the nitroxide spin-labels at specific sites in a given protein, comparison with EPR/DEER data demonstrates that the simplified spin-label models are generally able to capture the dominant features needed for the purpose of structural refinement based on EPR/DEER data. In the present study, force field parameters for simplified dummy spin-labels, OND3 and OND4 (ONDX in Figure 1), representing the all-atom RX attached to i, i + 3 and i, i + 4, respectively, were developed from five potential energy functionswhere V, V, Vθ, Vϕ, and Vnonbonded are the Cα–ON bond (r1), the ON–N bond (r2), the Cβ–Cα–ON angle (θ), the N–Cα–Cβ–ON dihedral angle (ϕ), and the Lennard-Jones 6-12 potential representing the interactions between the ON particle and the rest of the protein, respectively. The force constants for the r1, r2, θ, and ϕ are calculated from the all-atom simulations of RX and RX attached to the polyalanine α-helix. Comparison of distribution and the potential of mean force of the distances, angle, and dihedral angle obtained from MD simulations with RX and dummy nitroxide atoms at positions (A) i, i + 3 and (B) i, i + 4 of the polyalanine α-helix. The distributions over r1, r2, θ, and ϕ for both RX and RX extracted from the MD simulation are shown in Figure 3 (red line). For the RX and RX, the probability of finding the ON is maximum at a distance of about 8.5 and 8.8 Å, respectively, from the Cα and the widths for both of the distributions are about 1.5 Å. The probability of finding the ON is maximum at a distance of about 10.5 Å from N in RX, while it is about 7.9 Å from N in RX. The widths for both of the distributions are about 1 Å. The distributions for the θ for both the RX and RX range from 50 to 120°, but they peak at around 91 and 72°, respectively. There is one predominant rotameric state for the dihedral angle ϕ, positioned around 240–300 and 180–260° for both RX and RX, respectively, which peak around 267 and 222°, respectively.

Figure 3

Comparison of distribution and the potential of mean force of the distances, angle, and dihedral angle obtained from MD simulations with RX and dummy nitroxide atoms at positions (A) i, i + 3 and (B) i, i + 4 of the polyalanine α-helix.

The probability distribution functions are then modeled on the basis of four simple energy terms, V = k(r1 – r0)2, V = k(r2 – r0)2, Vθ = kθ(θ – θ0)2, and Vϕ = kϕ(1 + cos(nϕ – ϕ0)), where k, k, kθ, and kϕ are the force constants for the r1 distance, the r2 distance, the θ angle, and ϕ dihedral potentials and n represents the dihedral multiplicity. In addition, a Lennard-Jones 6-12 potential was used to account for the excluded-volume interactions between the ON particle and the rest of the protein. The nonbond parameters previously determined for R1 are used for the OND3 and OND4 particles in the simplified representation of the double-link RX spin-label.[9] As for R1, the models can only account for interactions with neighboring protein groups in a highly simplified manner. Nevertheless, this is not a concern here, since the double-linked spin-label is highly constrained by the two disulfide (S–S) bonds. Nonbonded interactions between the ON particles and the water molecules are switched off by using the keyword NBFIX in the parameter file, which can be read by both the NAMD[28] and CHARMM[24] program packages. The probability distributions obtained from the MD simulation of the dummy and all-atom spin-labels agree very well (Figure 3). The optimal force constants for k, k, kθ, and kϕ are 4.8 (kcal/mol)/Å2, 3.78 (kcal/mol)/Å2, 13.9 (kcal/mol)/rad2, and 29.6 (kcal/mol)/rad2, respectively, for the OND3 and 3.35 (kcal/mol)/Å2, 3.5 (kcal/mol)/Å2, 15.21 (kcal/mol)/rad2, and 21.0 (kcal/mol)/rad2, respectively, for the OND4. Both OND3 and OND4 force field parameters could be attached directly to the Cα atom of any wild-type residue except glycine, since glycine does not have the Cβ atom. To tackle this problem, the concept of the dummy Cβ atom (CBD) is used only in the case of glycine. The CBD atom only connects to the Cα atom and does not interact with any other atom of the protein. The CBD is defined with three variables—the Cα–CBD bond, the N–Cα–CBD angle, and the H–N–Cα–CBD dihedral angle (see Figure S5 in the Supporting Information). The parameters for these three variables were taken from the parameters that define the Cβ atom in the CHARMM36 protein force field.[25] All of the force field parameters for the OND3, OND4, and CBD are available in the CHARMM-GUI webpage. The simplified spin-labels can be inserted in all of the sites in a protein for the calculation of spin-pair distance distributions. The simplified representation of the all-atom spin-labels avoids the burdensome task of accounting for a large ensemble of Boltzmann-weighted spin-label rotamers of the all-atom spin-labels.

Molecular Dynamics of Dummy Spin-Labels (MDDS)

Having parametrized the OND and ONDX dummy spin-labels, it was necessary to check their performance in predicting the spin-pair distance distributions. EPR/DEER spin-pair distance distributions involving both the R1 and RX are available only for two protein systems, the T4 lysozyme[9,27] and the integral membrane protein KCNE1.[3] In the case of T4 lysozyme, spin-pair distance distributions are available between 109R1 and 131R1, represented here with 109/131R1, and between 109 and 113RX and 127–131RX, represented here with 109–113/127–131RX. T4 lysozyme has been labeled with OND at positions 109 and 131 and with OND4 at positions 109–113 and 127–131. MDDS simulations were carried out for 5 ns by keeping the protein fixed. The calculated spin-pair distance distributions obtained from the MDDS simulations are compared with those obtained from the EPR/DEER (Figure 4A). The 109/131OND distance distribution obtained from the MDDS is found to agree very well with those obtained from the EPR/DEER. This agreement suggests that the force field parameters for OND produce very reliable spin-pair distance distributions. The 109–113/127–131OND4 distance distribution obtained from the MDDS is found to be slightly narrower, with the width of the distribution ranging from 29 to 35 Å, than the EPR/DEER distance distribution, which ranges from 26 to 33 Å. However, there is a difference of only 1.5 Å between the maximum peaks of the MDDS and EPR/DEER distance distributions.

Figure 4

(A) Cartoon representation of T4 lysozyme with the OND at positions 109 and 131 and the OND4 at positions 109–113 and 127–131 and corresponding distance distributions obtained from MDDS and EPR/DEER. (B) Cartoon representation of KCNE1 with the OND at positions 47 and 66 and the OND4 at positions 46–50 and 66–70 and corresponding distance distributions obtained from MDDS and EPR/DEER. The structure of the integral membrane protein KCNE1 (PDB 2K21) was obtained from NMR.[21] Recently, the EPR/DEER distance distribution data has been reported for the 47/66R1 and 46–50/66–70RX spin-labels in this protein.[3] The 47/66OND and 46–50/66–70OND4 distance distributions obtained from the simulation agree very well with those obtained from the EPR/DEER (Figure 4B), suggesting the structure of KCNE1 obtained from NMR is quite accurate. The spin-pair distance distribution obtained from the OND4 is again found narrower than the EPR/DEER, but the maximum peaks of the two distributions are almost the same. The broader distance distributions in the EPR/DEER may be due to the dynamics of the protein and/or due to the influence from the membrane environment, since RX attached to membrane proteins could provide broader distance distributions, poorer signal-to-noise, and poor DEER modulation for longer distances as compared to water-soluble proteins.[3] Overall, OND and OND4 in conjunction with the MDDS simulations have been proved to provide very reliable distance distributions, which could be used to check the reliability or correctness of an already existing 3D structure of protein.

Application of MDDS to T4 Lysozyme

T4 lysozyme is the protein of known 3D structure for which the largest number of DEER histograms have been determined from EPR/DEER spectroscopy: 51 pairs from the insertion of the spin-label R1 at 37 positions.[9] Here, the simplified R1 spin-label (OND) is used to examine the performance of MDDS simulations. The OND is attached to 37 positions in T4 lysozyme (Figure 5A). Overall, the simulated distributions are found to agree very well with those obtained from the EPR/DEER, although the calculated distance distributions for some spin-pairs are found to be slightly broader (see Figure S6 in the Supporting Information). It is possible to quantify the similarity between the calculated and experimental distance distributions by finding the probability of their overlap, which is defined aswhere Pexp(n) and Pcalc(n) are the histograms obtained from experiment and calculation, respectively. The value of the overlap factor Q ranges from 0 to 1 by definition. A value of 1 implies a complete overlap between the two distributions. In the present case, Pexp(n) are the histograms obtained from the EPR/DEER and Pcalc(n) are the histograms obtained either from MDDS or MMM.[16] MMM is a very commonly used computational technique to predict spin-pair distance distributions. We find that the overlap factor Q is higher than 0.5 for all but 3 spin-label pairs using MDDS (60/94, 61/135, and 93/123). In contrast, Q is lower than 0.5 for 14 spin-label pairs with MMM (Figure 5B). This comparison suggests that the distance histograms from MDDS are in better agreement with the experimental distributions than those from MMM. The average distances extracted from MDDS are also in better agreement with the average distances extracted from the EPR/DEER distributions than those from MMM (Figure 5 C and D) (see Table S1 in the Supporting Information for the average distances). A linear regression analysis of the average spin-pair distances obtained from the EPR/DEER data and the simulations yields a correlation coefficient of about 0.91 for MDDS and a correlation coefficient of 0.85 for MMM. Using this system, it is also possible to assess the accuracy of a common and seductive simplification that uses Cα–Cα distances as surrogate spin-labels to interpret the structural changes reported by the EPR/DEER spin-pair distributions. The correlation coefficient for the average spin-pair distances obtained from EPR/DEER and the Cα–Cα distances is only 0.80. As shown in Figure 5C and E, MDDS is clearly a better representation than simply using Cα–Cα distances. This analysis shows that modeling the spin-label is important to accurately represent the distance distribution from EPR/DEER data and correctly interpret the structural changes in proteins.

Figure 5

Cartoon representation of (A) T4 lysozyme with 37 OND dummy spin-labels at positions 59, 60, 61, 62, 64, 65, 72, 75, 76, 79, 82, 83, 85, 86, 89, 90, 93, 94, 108, 109, 112, 115, 116, 119, 122, 123, 127, 128, 131, 132, 134, 135, 140, 151, 154, 155, and 159. OND dummy atoms and the Cα atoms are shown in red and cyan colors, respectively. (B) Comparison of the extent of spin-pair histogram overlap, Q, between the MDDS and MMM in T4 lysozyme. (C) 51 average interlabel experimental (RDEER) vs MDDS (RMDDS) distances in T4 lysozyme; the correlation coefficient is found to be 0.91. (D) 51 average interlabel experimental (RDEER) vs MMM (RMMM) distances in T4 lysozyme; the correlation coefficient is 0.85. (E) 51 average interlabel experimental (RDEER) vs inter-residue Cα–Cα distances in the crystallographic structure of T4 lysozyme; the correlation coefficient is 0.80.

Application of MDDS to Understand Conformational States of LeuT

The leucine transporter (LeuT) is a bacterial homologue of the mammalian neurotransmitter:sodium symporter (NSS). The NSS includes biogenic amine transporters that terminate synaptic signaling through selective reuptake of neurotransmitter molecules.[22,29] These NSSs are targets of widely prescribed therapeutic drugs (e.g., selective serotonin reuptake inhibitors (SSRIs) and tricyclic antidepressants (TCAs)) and drugs of abuse (e.g., cocaine, amphetamine).[30] LeuT has emerged as a model for NSS transporters due to its sequence, structural, and functional similarities. There have been intensive efforts to understand the dynamics of ion-coupled substrate translocation in LeuT by means of both computational and experimental techniques including the X-ray crystallography and EPR spectroscopy.[13,23,31−38] Three crystal structures of LeuT, PDB 2A65,[22] PDB 3TT1,[23] and PDB 3TT3,[23] have been classified as being in the outward-facing, inward-facing, and substrate-occluded states, respectively. The inward- and outward-facing states of LeuT were crystallized by mutation of some highly conserved residues and subsequent conformational selection with antibodies.[23] Recently, Karmier et al.[13] proposed an alternating access mechanism of LeuT through the use of EPR/DEER spin-pair distance distribution and subsequent structural refinement with the re-MD simulations. 3D structures obtained from the re-MD simulations were found very different in several sites of the inward-facing and substrate-occluded states when compared with the X-ray crystal structures, 3TT3 and 2A65, respectively. The study showed movements at the TMs 1b, 7b, 6a, and EL4 in the extracellular site and TMs 6b and 7a, N terminus in the intracellular ends during the alternating access of LeuT. Here, MDDS simulations are performed to cross-validate the three X-ray crystallographic structures in representing the outward-facing, inward-facing, and substrate-occluded states of LeuT. Three sets of EPR/DEER distance distribution data from 22 labeled sites are available that correspond to the three states of LeuT.[13] Overall, the agreement between the distance distributions obtained from the MDDS and EPR/DEER decreases with the use of 3TT1, 2A65, and 3TT3 in the MDDS simulation (see Figure S7 in the Supporting Information). Figure 6 represents the extent of MDDS and DEER histogram overlap (Q) vs the spin-pair numbers for all three states of LeuT. All the spin-pairs have been found to have high Q (>0.5) for the 3TT1, except for spin-pairs 12/371, 37/208, 71/425, 71/455, 193/277, and 208/306, suggesting the 3TT1 most likely corresponds to the outward-facing conformation of LeuT. Eleven spin-pairs have Q lower than 0.5 in the 2A65, suggesting the crystal structure initially classified as a substrate-occluded state does not correspond to the structure of the actual substrate-occluded state in solution. Spin-pairs 37/123 and 37/208 have spin-label 37 in TM1b, 123/240 and 208/240 have spin-label 240 in TM6a, 37/208, 208/240, and 208/306 have spin-label 208 in TM5, and 208/306 and 123/306 have spin-label 306 in TM7. All of these spin-pairs have very low Q (<0.4), suggesting the TMs 1b, 6a, 5, and 7 are different in the actual substrate-occluded state in LeuT. Fourteen out of 22 spin-pairs have Q values less than 0.5 in 3TT3, implying that the structure is very wrong and it does not represent the inward-facing conformation of LeuT. Spin-pairs 7/86, 12/86, 12/338, 12/371, and 12/377, which have spin-labels 7 and 12 in TM1a, have very low Q (<0.2), suggesting that the actual conformation of the TM1 in the inward-facing apo state is very different than that observed in the crystallographic structure 3TT3. 123/240 and 208/240 have spin-label 240 in TM6 and 79/277, 184/277, 193/277, 208/306, 277/455, and 309/480 have spin-labels 277 and 309 in TM7. All of these spin-pairs have very low Q (<0.4) in the apo state, suggesting the actual conformations of the TM6 and TM7 are different in the apo state than those in the 3TT3. These results are in agreement with the structures obtained from the re-MD simulations which show that the actual substrate-occluded state of LeuT is more inward-facing than that reported in the 2A65 and the actual inward-facing conformation is very different in the TMs 6b, 7a, N terminus in the intracellular ends of the LeuT.[13]

Figure 6

(A) Cartoon representation of the PDB structures of LeuT, 3TT1 (blue), 2A65 (green), and 3TT3 (red) which shows changes in some important TMs. (B) Comparison of the extent of overlap, Q, of the spin-pair distance distributions obtained from a MDDS simulation and EPR/DEER for three states of LeuT. Twenty-two distance distributions are available from labeled positions in 7, 12, 37, 71, 79, 86, 123, 184, 185, 193, 208, 240, 271, 277, 306, 309, 338, 371, 377, 425, 455, and 480 in LeuT. (C) The TMs and the corresponding residue numbers in LeuT are also shown for the sake of discussion.

Conclusion

In this study, force field parameters for atomic models of the bifunctional spin-labels (RX) have been developed and were tested in explicit-solvent MD simulations to understand their conformational dynamics. It was observed that RX induces less distortion in a protein backbone than RX, and is thus a better choice to study protein conformations. For the purpose of aiding structural refinement, simplified dummy spin-labels were also developed to match the 3D spatial distribution of the nitroxide oxygen atom relative to the protein backbone observed in all-atom simulations of the RX and RX spin-labels. The dummy spin-labels called OND3 and OND4 are meant to simulate the RX and RX, respectively. These models are designed to carry out MDDS and restrained-ensemble (re-MD) simulations. The results presented here highlight the usefulness of simplified spin-labels and MDDS simulations in predicting the EPR/DEER spin-pair distance distribution data. The distance distributions between spin-labels from MDDS simulations are in very good agreement with those obtained experimentally from EPR/DEER when the conformation of the protein is already known. The MDDS simulations perform better than the widely used MMM computational method. This suggests that MDDS simulation is a useful method to rapidly assess if a putative 3D model structure of the system is consistent with available EPR/DEER data. As the dummy spin-labels do not interact with one another, they can be introduced simultaneously to all the sites in a single MDDS simulation of the model structure. A system with dummy labels on all sites can be built by the web based CHARMM-GUI webpage. In the near future, one will be able to conduct the MDDS simulations using the spin-label descriptor module in charm-gui.org. A refinement of protein model structure is possible via the re-MD simulation by introducing noninteracting multiple copies of the dummy spin-labels inserted in various positions of the protein for which EPR/DEER data is available. The re-MD simulations of T4 lysozyme labeled with multiple copies (25) of 109–113OND4 and 127–131OND4 were able to match the calculated 109–113/127–131OND4 distance distribution with that obtained from the EPR/DEER (see Figure S8 in the Supporting Information). This result suggests that, if a sufficient number of EPR/DEER distance distributions are available, the re-MD simulation along with the simplified spin-labels can drive the conformation of a protein toward an accurate refined structure.

37 in total

1. State-dependent conformations of the translocation pathway in the tyrosine transporter Tyt1, a novel neurotransmitter:sodium symporter from Fusobacterium nucleatum.

Authors: Matthias Quick; Hideaki Yano; Naomi R Goldberg; Lihua Duan; Thijs Beuming; Lei Shi; Harel Weinstein; Jonathan A Javitch
Journal: J Biol Chem Date: 2006-06-23 Impact factor: 5.157

2. CHARMM-GUI: a web-based graphical user interface for CHARMM.

Authors: Sunhwan Jo; Taehoon Kim; Vidyashankara G Iyer; Wonpil Im
Journal: J Comput Chem Date: 2008-08 Impact factor: 3.376

Review 3. CHARMM: the biomolecular simulation program.

Authors: B R Brooks; C L Brooks; A D Mackerell; L Nilsson; R J Petrella; B Roux; Y Won; G Archontis; C Bartels; S Boresch; A Caflisch; L Caves; Q Cui; A R Dinner; M Feig; S Fischer; J Gao; M Hodoscek; W Im; K Kuczera; T Lazaridis; J Ma; V Ovchinnikov; E Paci; R W Pastor; C B Post; J Z Pu; M Schaefer; B Tidor; R M Venable; H L Woodcock; X Wu; W Yang; D M York; M Karplus
Journal: J Comput Chem Date: 2009-07-30 Impact factor: 3.376

4. X-ray structures of LeuT in substrate-free outward-open and apo inward-open states.

Authors: Harini Krishnamurthy; Eric Gouaux
Journal: Nature Date: 2012-01-09 Impact factor: 49.962

5. A competitive inhibitor traps LeuT in an open-to-out conformation.

Authors: Satinder K Singh; Chayne L Piscitelli; Atsuko Yamashita; Eric Gouaux
Journal: Science Date: 2008-12-12 Impact factor: 47.728

6. Using Markov models to simulate electron spin resonance spectra from molecular dynamics trajectories.

Authors: Deniz Sezer; Jack H Freed; Benoit Roux
Journal: J Phys Chem B Date: 2008-08-12 Impact factor: 2.991

7. Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ(1) and χ(2) dihedral angles.

Authors: Robert B Best; Xiao Zhu; Jihyun Shim; Pedro E M Lopes; Jeetain Mittal; Michael Feig; Alexander D Mackerell
Journal: J Chem Theory Comput Date: 2012-07-18 Impact factor: 6.006

Review 8. Unlocking the molecular secrets of sodium-coupled transporters.

Authors: Harini Krishnamurthy; Chayne L Piscitelli; Eric Gouaux
Journal: Nature Date: 2009-05-21 Impact factor: 49.962

9. Structure of KCNE1 and implications for how it modulates the KCNQ1 potassium channel.

Authors: Congbao Kang; Changlin Tian; Frank D Sönnichsen; Jarrod A Smith; Jens Meiler; Alfred L George; Carlos G Vanoye; Hak Jun Kim; Charles R Sanders
Journal: Biochemistry Date: 2008-07-09 Impact factor: 3.162

10. DEER EPR measurements for membrane protein structures via bifunctional spin labels and lipodisq nanoparticles.

Authors: Indra D Sahu; Robert M McCarrick; Kaylee R Troxel; Rongfu Zhang; Hubbell J Smith; Megan M Dunagan; Max S Swartz; Prashant V Rajan; Brett M Kroncke; Charles R Sanders; Gary A Lorigan
Journal: Biochemistry Date: 2013-09-09 Impact factor: 3.162

14 in total

1. Atomistic Models from Orientation and Distance Constraints Using EPR of a Bifunctional Spin Label.

Authors: Benjamin P Binder; Andrew R Thompson; David D Thomas
Journal: Biophys J Date: 2019-06-20 Impact factor: 4.033

2. MMM: A toolbox for integrative structure modeling.

Authors: Gunnar Jeschke
Journal: Protein Sci Date: 2017-09-04 Impact factor: 6.725

3. Refinement of Highly Flexible Protein Structures using Simulation-Guided Spectroscopy.

Authors: Jennifer M Hays; Marissa K Kieber; Jason Z Li; Ji In Han; Linda Columbus; Peter M Kasson
Journal: Angew Chem Int Ed Engl Date: 2018-11-27 Impact factor: 15.336

4. Sodium and proton coupling in the conformational cycle of a MATE antiporter from Vibrio cholerae.

Authors: Derek P Claxton; Kevin L Jagessar; P Ryan Steed; Richard A Stein; Hassane S Mchaourab
Journal: Proc Natl Acad Sci U S A Date: 2018-06-18 Impact factor: 11.205

5. Rotameric preferences of a protein spin label at edge-strand β-sheet sites.

Authors: Timothy F Cunningham; Soraya Pornsuwan; W Seth Horne; Sunil Saxena
Journal: Protein Sci Date: 2016-03-21 Impact factor: 6.725

Review 6. CHARMM-GUI 10 years for biomolecular modeling and simulation.

Authors: Sunhwan Jo; Xi Cheng; Jumin Lee; Seonghoon Kim; Sang-Jun Park; Dhilon S Patel; Andrew H Beaven; Kyu Il Lee; Huan Rui; Soohyung Park; Hui Sun Lee; Benoît Roux; Alexander D MacKerell; Jeffrey B Klauda; Yifei Qi; Wonpil Im
Journal: J Comput Chem Date: 2016-11-14 Impact factor: 3.376

7. Comparative evaluation of spin-label modeling methods for protein structural studies.

Authors: Maxx H Tessmer; Elizabeth R Canarie; Stefan Stoll
Journal: Biophys J Date: 2022-08-10 Impact factor: 3.699

8. Modeling of protein conformational changes with Rosetta guided by limited experimental data.

Authors: Davide Sala; Diego Del Alamo; Hassane S Mchaourab; Jens Meiler
Journal: Structure Date: 2022-05-20 Impact factor: 5.871

9. Structural Refinement of Proteins by Restrained Molecular Dynamics Simulations with Non-interacting Molecular Fragments.

Authors: Rong Shen; Wei Han; Giacomo Fiorin; Shahidul M Islam; Klaus Schulten; Benoît Roux
Journal: PLoS Comput Biol Date: 2015-10-27 Impact factor: 4.475

10. Structural basis for KCNE3 modulation of potassium recycling in epithelia.

Authors: Brett M Kroncke; Wade D Van Horn; Jarrod Smith; CongBao Kang; Richard C Welch; Yuanli Song; David P Nannemann; Keenan C Taylor; Nicholas J Sisco; Alfred L George; Jens Meiler; Carlos G Vanoye; Charles R Sanders
Journal: Sci Adv Date: 2016-09-09 Impact factor: 14.136