Literature DB >> 24874104

Balancing the interactions of ions, water, and DNA in the Drude polarizable force field.

Alexey Savelyev¹, Alexander D MacKerell.

Abstract

Recently we presented a first-generation all-atom Drude polarizable force field for DNA based on the classical Drude oscillator model, focusing on optimization of key dihedral angles followed by extensive validation of the force field parameters. Presently, we describe the procedure for balancing the electrostatic interactions between ions, water, and DNA as required for development of the Drude force field for DNA. The proper balance of these interactions is shown to impact DNA stability and subtler conformational properties, including the conformational equilibrium between the BI and BII states, and the A and B forms of DNA. The parametrization efforts were simultaneously guided by gas-phase quantum mechanics (QM) data on small model compounds and condensed-phase experimental data on the hydration and osmotic properties of biologically relevant ions and their solutions, as well as theoretical predictions for ionic distribution around DNA oligomer. In addition, fine-tuning of the internal base parameters was performed to obtain the final DNA model. Notably, the Drude model is shown to more accurately reproduce counterion condensation theory predictions of DNA charge neutralization by the condensed ions as compared to the CHARMM36 additive DNA force field, indicating an improved physical description of the forces dictating the ionic solvation of DNA due to the explicit treatment of electronic polarizability. In combination with the polarizable DNA force field, the availability of Drude polarizable parameters for proteins, lipids, and carbohydrates will allow for simulation studies of heterogeneous biological systems.

Entities: Chemical Disease Species

Mesh：

Substances：

Year: 2014 PMID： 24874104 PMCID： PMC4064693 DOI： 10.1021/jp503469s

Source DB: PubMed Journal: J Phys Chem B ISSN： 1520-5207 Impact factor: 2.991

Introduction

The conformational behavior of DNA in solution is driven by a subtle interplay among various intramolecular degrees of freedom and interactions with the environment.[1] For example, when in a B form, correlated motions of the backbone, sugar and glycosidic DNA torsions lead to the dynamical transitions between BI and BII states, as well as the extent of local sampling of geometries characteristic of the A form DNA as manifested in sugar puckering transitions between the south and north conformations.[2−6] On a global scale, the balance between bonded and nonbonded intramolecular parameters along with interactions with the environment influence the conformational equilibrium between the A, B, and Z forms of DNA. Moreover, interactions with mobile ions and hydration effects are known to contribute to the overall DNA stability and regulate a number of critical polymeric properties, such as persistence length and stiffness.[7] Over the past decade, computational modeling of DNA had some successes in predicting selected properties of DNA and helped to rationalize many DNA related experiments. In particular, modeling allowed for atomistic investigations of DNA energetics, mechanisms of structural transitions and the impact of environmental effects on DNA[8−12]—information that is not readily accessible to experimental techniques, such as X-ray crystallography and NMR.[13] These studies were based on additive force fields (FF) for DNA, including CHARMM,[11,14] AMBER,[15,16] Bristol-Myers Squibb,[2] and GROMOS.[17] While successful in many respects,[18−21] these models show a number of limitations, potentially associated with the lack of explicit electronic polarization in the model, including limited treatment in the equilibrium between the different conformations of DNA[22−24] as well as relative energies of different conformations.[25] While continued developments in the additive FFs have partially overcome these limitations,[11,26,27] it is anticipated that extending the form of the potential energy function to include the explicit treatment of electronic polarization may lead to improvements in DNA FFs, including yielding a more accurate description of the underlying physical forces dictating the structure and dynamics of DNA. Recently, we have presented the first-generation CHARMM polarizable FF for DNA based on the classical Drude oscillator formalism.[28] Briefly, electronic induction in the model is represented by introducing a massless charged particle (i.e., the Drude oscillator) attached to each polarizable atom by a harmonic spring.[29] In practice, charge redistribution in response to changes in the local electric field is approximated by self-consistently updating the positions of the auxiliary particles to their local energy minima for any given configuration of the atoms in the system.[30−33] This self-consistent field (SCF) approach takes into account the permanent electric field due to the fixed charges and the contribution of the induced dipoles to the electric field. In the context of molecular dynamics (MD) simulations, the SCF calculation is substituted with an Extended Lagrangian Integrator[34−36] that treats the Drude oscillators as dynamic variables, allowing for the computational efficiency required for long-time MD simulations. To date, the Drude model has been implemented in CHARMM,[37] NAMD[38] and ChemShell quantum mechanics/molecular mechanics (QM/MM).[39] As discussed elsewhere, due to the need of use of a 1 fs integration time step (versus the 2 fs time step customarily used in additive MD simulations), use of the Drude polarizable FF in simulations yields an approximately 4-fold overhead compared to additive simulations.[40,41] In addition, a module, the “Drude Prepper” has been added to the CHARMM-GUI,[42] facilitating the conversion of CHARMM PSF and coordinate files into the corresponding files required for Drude-based simulations including example inputs for CHARMM and NAMD. In our published study on development of the Drude force field for DNA we focused mainly on details of parametrization of the highly correlated backbone, sugar, and glycosidic torsion angles, and on extensive validation of the final parameters by comparing MD simulation results with a wide range of experimental data on DNA in solution.[28] A notable observation from those simulations was the significant enhancement and variation of base dipole moments as compared to the C36 additive FF,[11] indicating that the underlying physical forces dictating the properties of DNA are significantly different in the polarizable model. However, the discussion in that paper covered only a small part of the optimization process, not addressing such critical aspects of the modeling as fine-tuning of the interactions between DNA and mobile ions and the balance between DNA electrostatic and hydration effects, subtle corrections in the internal base parameters, and the impact of these on the overall DNA stability and subtler conformational phenomena, including BI/BII transitions and A-to-B equilibrium. Presently, we focus on the above aspects of the Drude FF development and additional testing of the optimized DNA parameters against a range of experimental data and theoretical estimations. Our optimization procedure involves approaches ranging from QM calculations on various model compounds and their complexes to investigating the hydration and osmotic properties of the chemically relevant ionic systems to studying the impact of the mobile ions on DNA structural stability. We note that the final results presented here are based on the identical set of parameters developed in our earlier study on the Drude force field for DNA. Some results are also presented for a “default” parameter set, which is that based on direct transfer of parameters optimized targeting small model compounds representative of DNA with only minimal adjustment of selected dihedral parameters.

Computational Methods

Parameter Optimization Strategy

In the present study we used a number of small model systems to probe how electrostatic, hydration and other effects in the DNA duplex impact overall DNA stability, as well as subtler aspects of DNA structural behavior in solution. Those systems are shown in Figure 1.

Figure 1

Model compounds used in the present study. Dihedral angles shown for 1 are defined as follows: α = O3′–P–O5′–C5′, ε = C4′–C3′–O3′–P, ζ = C3′–O3′–P–O5′ and ν4 = C3′–C4′–O4′–C1′. In 4 and 5, R = CH3 or H. Model compound 1, composed of the sugar and a 3′ methylated phosphate group (Figure 1) was previously used to study correlations between ε and ξ torsions in the context of BI/BII transitions in DNA.[11,28] In our recent parametrization study we discussed in detail the optimization of these and other dihedral angles.[28] In the present study, by analyzing individual energetic contributions of 1, we demonstrate that, in addition to the torsion parameters, adjustments of electrostatic parameters (partial charges) was necessary to achieve satisfactory agreement with both QM data on this model compound and experimental data on BI/BII transitions and A-to-B equilibrium in DNA duplexes in solution. QM data for 1, as previously published,[11,28] are represented by a 2D ε versus ξ potential energy surface based on structure optimizations at the MP2/6-31(+)G(d) level with ε and ξ being sampled in increments of 15°, while restraining the overall conformation to that of the canonical B DNA form. Specifically, one of the sugar moiety dihedral angles, ν4, and the backbone torsion, α, were restrained.[14] Analogous MM calculations were carried out using the CHARMM program[43] and the Drude polarizable force field.[28] The model was minimized first using the adopted-basis Newton–Raphson (ABNR) algorithm before performing the 2D dihedral potential energy scan. Dihedral harmonic restraints of 105 kcal/mol/rad2 were used with minimizations performed to a gradient of 10–6 kcal/mol/Å2. Alteration of partial charges in 1 prompted further investigation of hydration properties of a smaller model compound, 2, dimethylphosphate (DMP), which is representative of the phosphate group in the DNA backbone. The alteration of the electrostatic parameters to correct the behavior of 1 leads to changes in the hydration of 2. Therefore, modifications of LJ parameters for selected atoms were necessary. The optimization was performed by targeting, simultaneously, reproduction of the hydration free energy of DMP and the gas-phase heterodimeric interactions with water. Calculation of the hydration free energy is elaborated in the next section. QM calculations on DMP–water monohydrates, using Gaussian 09,[44] were performed with DMP fixed in the geometry obtained from gas phase optimization at the MP2/6-31+G* level of theory and the water molecule fixed in the geometry of the SWM4-NDP model.[45] Three minimum energy conformations of DMP were considered: gg, tt, and gt, where g and t stand for “gauche” and “trans”, respectively. The relative orientations of the DMP and water molecules were also fixed, as shown in Figure 2, with only the interaction distances optimized at the MP2/6-31+G* level of theory. Interaction energies were then determined via single point calculations at the RI-MP2/cc-pVQZ level of theory, performed on the MP2/6-31+G* optimized geometries. The single point calculations were performed using the program Q-Chem,[46,47] with counterpoise correction[48] included to account for basis set superposition error (BSSE).[49] MM polarizable calculations were performed with CHARMM[43] using the SWM4-NDP water model[45] and restraints and orientations identical to those used in the QM calculations.

Figure 2

Schematic representation of the interaction orientation between DMP anion and water molecules. While analysis was performed on the basis of water–DMP monohydrates, in this figure all water orientations are shown simultaneously. The numbering of the water orientations corresponds to that used in Table 2 and Tables S1 and S2 of the Supporting Information.

Table 2

QM and MM Interactions between the gg Conformation of DMP and a Single Water Molecule in Different Orientationsa

	distanceb			energy
dimer conformation	QM	MM	difference: MM–QM	QM	MM	difference: MM–QM
1 o11_lp2				–2.65	–2.20	0.44
2 o11_bis	1.87	1.61	–0.26	–9.35	–9.73	–0.38
3 o11_lp1	1.92	1.65	–0.27	–8.79	–8.43	0.35
4 o13_lp2	1.84	1.73	–0.11	–11.50	–11.37	0.12
5 o13_180	1.81	1.70	–0.11	–12.74	–13.14	–0.40
6 o13_lp1	1.86	1.70	–0.16	–13.63	–13.59	0.03
7 o13_lp_d270				–11.94	–11.74	0.19
8 o13_lp_d90				–13.23	–13.24	–0.01
9 p_hoh	3.34	3.20	–0.14	–14.53	–14.46	0.06
AVE			–0.17			0.04
RMSD			0.06			0.27

Orientations are as shown in Figure 2. All energies are in kcal/mol. Distances for selected conformations are in Å. In QM calculations, the structure of the DMP was first optimized at MP2/6-31+G* level of theory. Next, the distances between water molecule and DMP were optimized at MP2/6-31+G* level of theory with the rest of the dimer fixed. QM interaction energies were calculated at the RI-MP2/cc-pVQZ level of theory using counterpoise correction. See Methods section for details.

Distances for the QM optimized geometries 2–6 are between the water hydrogen and o11/o13 oxygen atom of DMP. For geometry 9, the distance is between water oxygen and the DMP phosphorus.

Another important aspect of DNA modeling is the ability of the FF to properly treat interactions between DNA and mobile ions. Those were tested based on two approaches. The first approach deals with the detailed analysis of the Na+ and Cl– distributions around a DNA duplex, ignoring sequence specific effects. The second approach deals with gas-phase calculations on model systems involving interactions of Na+ with individual nucleic acid bases, as well as with DMP. In particular, Na+ was positioned at distances ranging from 1 to10 Å away from the chemically relevant atoms at various orientations relative to the base. Specifically, nitrogen atoms participating in Watson–Crick base-pairing – N1 of adenine, N3 of cytosine, N1 of guanine and N3 of thymine–were chosen for probing interactions with Na+. In the case of guanine and thymine, whose N atoms are protonated, out-of-plane interactions with Na+ were studied. Additionally, calculations on the Ade-Ade/Na+ trimer model system were performed, with Na+ placed at distances [1-10] Å from the bases in the in-plane orientation and bases fixed in the relative orientation corresponding to B form DNA. Analogous calculations were carried out for DMP–/Na+ dimers, probing interactions among the phosphate group and Na+, with the latter positioned at various orientations and distances from the anionic O1P/O2P and ester O3/O4 oxygen atoms. Visual representations of the model systems are provided in the Results and Discussion section along with the corresponding QM and MM energy profiles. The distance increment in positioning Na+ from the atom of interest was 0.1 Å in the range of [1···4] Å, and 1 Å at larger separations. QM potential energy profiles for all complexes were based on gas phase structures optimized at the MP2/6-31++G(2df,2pd) level of theory using Gaussian 09[44] with counterpoise corrections[48] applied to account for BSSE.[49] Equivalent single point calculations were then performed using the Drude polarizable force field for nucleic acids. Additional analysis of the nucleic acid bases was performed on their geometries and vibrational spectra, motivated by preliminary simulations of the Drew Dickerson (DD or EcoRI) dodecamer,[50−52] indicating that the conformational behavior of the individual nucleic acid bases may contribute to DNA instability. This prompted a partial reoptimization of the internal parameters for the nucleic acid bases. As discussed below, this mainly involved manual adjustment of the base valence angles leading to a better representation of the vibrational frequencies and improvements in the geometries. Target data for bonds and angles involving hydrogen atoms were obtained from the previously published QM geometries optimized at the MP2/6-31+G* level of theory,[53] while all other bond length and angle parameters were optimized to reproduce target crystallographic geometries obtained from the Cambridge Structural Database survey.[54] CHARMM vibrational spectra were calculated using the MOLVIB utility[55] of the CHARMM program,[43] with the target infrared spectra calculated ab initio at the MP2/6-31+G* level of theory. Internal coordinate assignment was performed according to the method of Pulay et al.,[56] and a scale factor of 0.9434 was applied to QM calculated normal modes to account for limitations in the level of theory used.[57]

Calculation of Hydration Free Energies

Hydration free energies, Δhyd, were calculated via the free energy perturbation (FEP) method using the protocol of Deng and Roux.[58] In this approach, the LJ potential is separated into purely repulsive and attractive components using the Weeks–Chandler–Anderson (WCA) decomposition scheme,[59] as has been previously described.[60] Simulation systems involved a periodic box containing a single DMP molecule solvated by 250 SWM4-NDP[45] water molecules. Simulations were performed using CHARMM[43] and the velocity Verlet integrator,[61] which includes treatment of Drude particles via an extended Lagrangian formalism.[37] The SHAKE algorithm[62] was employed to constrain covalent bonds to hydrogen, and an integration time step of 1 fs was used. Simulations were performed in the NPT ensemble at 298.15 K and 1 atm, using a Nosé–Hoover thermostat[63,64] and a modified Andersen–Hoover barostat.[65] Electrostatic interactions were treated using particle mesh Ewald (PME) summation[66] with a coupling parameter of 0.34 and a sixth-order spline for mesh interpolation. The system was equilibrated for 100 ps at each value of the thermodynamic coupling parameter, λ, after which the system was simulated for 200 ps of production MD. For the calculation of dispersive and electrostatic contributions to Δhyd, λ was assigned values ranging from 0.0 to 1.0 in equally spaced increments of 0.1. For the repulsive term, extra windows were required in the “low λ” region, such that λ took additional values of 0.05, 0.15, 0.25, and 0.35. The dispersive and electrostatic contributions to Δhyd were calculated using thermodynamic integration (TI),[67] with the repulsive contribution calculated using a soft-core scheme,[58] and unbiased using the weighted histogram analysis method (WHAM).[68] Because these simulations were performed under periodic boundary conditions using the particle sum (P-sum) convention,[69] the interface potential between the bulk liquid and the external dielectric medium is zero. In reality, solvation of a charged species across a liquid/vacuum interface also depends on a nonzero and water-model-dependent interface potential (φl→v). Therefore, an additional contribution to the free energy, equal to qionφl→v, must be accounted for when performing calculations using the P-sum convention under periodic boundary conditions.[70] When considering a system consisting of a monovalent anion solvated with SWM4-NDP water, this contribution is 12.5 kcal/mol. The gas phase contribution to the hydration free energy was calculated using an identical procedure, but considered only an isolated DMP molecule in the gas phase, rather than a single molecule solvated in a water box.

Osmotic Pressure Calculations for the Bulk Electrolyte Solutions

System setup, MD simulation protocol, and the calculation of osmotic pressures for the bulk electrolyte solutions closely follow the method developed by Roux and co-workers.[71,72] Three aqueous electrolyte solutions of (1) Na+ and Cl– ions, (2) Na+ and DMP– ions, and (3) Na+ and Acetate– ions were simulated at two molal concentrations each, approximately corresponding to solutions having molarities of 1 M and 3M, respectively (see below). Each simulation system contains three compartments separated by two virtual semipermeable membranes aligned with the xy-plane. The electrolyte compartment centered on the absolute origin of coordinates is enveloped by compartments containing pure water. Water can freely pass through the membranes separating electrolyte and water solutions, while the ions are subject to the force of the half-harmonic planar potential on each side of the ionic compartment whose action mimics the ideal behavior of a semipermeable membrane inducing osmosis. A representative snapshot from MD simulation of the Na+/DMP– electrolyte solution at ∼3 M concentration is shown in Figure 3. The instantaneous osmotic pressure, π = Fmemb/A, equals the instantaneous total force on the membranes,divided by the total area of the membranes, A. In this expression, k is the force constant of the membrane potential, 10 kcal/(mol·Å), 48 Å is the linear size of the cubic ionic compartment, and zi is the z-coordinate of the i ion. Values of the instantaneous force were recorded every 0.5 ps, and the total MD simulation run was for 70 ns for each simulated electrolyte system. The convergence of MD simulations was verified by calculating osmotic pressures from the two halves of the MD trajectory. Standard deviations around the averaged values were obtained based on seven 10 ns blocks of the total MD trajectories.

Figure 3

Snapshot from an MD simulation of the ∼3 M aqueous solution of the DMP– and Na+ ions. Sodium ions are yellow, DMP ions are cyan, and water is red/white.

Snapshot from an MD simulation of the ∼3 M aqueous solution of the DMP– and Na+ ions. Sodium ions are yellow, DMP ions are cyan, and water is red/white. Given the values of experimentally measured osmotic pressure coefficient, φ=π/πid, the experimental value of the osmotic pressure at molal concentration, m, is determined as follows:where R is the gas constant, T is temperature in Kelvin, Vm is the molar volume of pure water at 1 bar and room temperature, and ν is the total number of cations and anions per formula unit (in our case of 1/1 electrolyte solutions ν = 2).[73] It is important to note that, following the convention used in experimental measurements, the osmotic pressure of an ideal solution in the above expression is proportional to the molal concentration of the electrolyte, m, which differs from the traditionally used molar concentration M. Therefore, to enable direct comparison with the experiment, the actual molal concentrations need to be computed from MD simulations based on the values of mean densities of the cations, anions, and pure water in the ionic compartment as follows:where overbars indicate ensemble averages, and Msalt is the molar concentration of the salt. The resulting molal concentration is then used to obtain the osmotic pressure of an ideal solution via eqs 2 and 3. An identical approach for computing molal concentrations and osmotic pressures is used in ref (74). MD simulation protocol for studying electrolyte solutions is analogous to that described below for the DNA duplexes. A few differences are represented by the use of ‘useFlexibleCel’ and ‘useConstantArea’ options of NAMD.[75] The first option allows for the three dimensions of the periodic cell to fluctuate independently, while the second one keeps the ratio of the unit cell constant in the xy-plane while allowing fluctuations along the z axis in the course of MD simulation. In addition, the “TclForce” script of NAMD was utilized to impose semiharmonic restraints on ionic species during MD simulations, as elaborated above. Simulations were carried out with the CHARMM Drude polarizable and the additive C36 force fields. Each simulated system included ∼45 000 particles, including atoms, lone pairs, and Drude particles, in the case of the polarizable model and ∼26 000 atoms in the additive model.

MD Simulation Protocol for DNA Duplexes in an Aqueous Salt Environment

Simulated systems included the EcoRI dodecamer[50−52] and Junfos[24,76] 16-mer whose structural behavior was extensively investigated in our original study.[28] Additionally, we studied a shorter 10-base-pair DNA oligomer (1DCV).[77] All DNA molecules were solvated with the polarizable SWM4-NDP[45] water model in cubic boxes with water molecules extending at least ∼15 Å from the DNA surface, with addition of neutralizing Na+ ions and an extra ∼120 mM of NaCl salt.[72,78] Initial configurations were generated by simulating the analogous additive CHARMM36 (C36) systems for several nanoseconds according to the protocol described elsewhere,[11] taking the last snapshot from these runs as inputs for subsequent Drude simulations. CHARMM was used for generating the polarizable Drude system, self-consistent relaxation of the Drude particle positions, and short equilibration of the whole system, as elaborated elsewhere.[28] For the production MD simulation NAMD[75] (version 2.9) was used whose self-consistent treatment of the Drude polarizable model is based on a dual thermostatting scheme and Langevin dynamics.[38] In particular, real atoms and polarizable degrees of freedom (Drude particles) were coupled to the thermostats at 300 and 1 K, respectively. Electrostatic interactions were treated using the PME summation[66] with a coupling parameter of 0.34 and a sixth-order spline for mesh interpolation. Nonbonded pair lists were maintained out to 16 Å, and a real space cutoff of 12 Å was used for the electrostatic and Lennard-Jones (LJ) terms. All covalent bonds involving hydrogen as well as the intramolecular geometries of water molecules were constrained using the SETTLE algorithm.[79] The “HARDWALL” feature enabled the use of a 1 fs time step in MD simulations.[20] As previously described, this feature is associated with a “hard wall” reflective term in the potential energy function that has been added to resolve the polarization catastrophe problem in Drude MD simulations.[40,41] This term was invoked only when Drude particles moved >0.2 Å away from their parent nuclei during MD simulations.

Studying the Ionic Atmosphere Around DNA Duplex Based on Ion/DNA Radial Distribution Functions

To analyze in detail the Na+ and Cl– distributions around a DNA duplex, we calculated their radial distribution functions (RDFs)[80] from MD simulations following the approach developed by Savelyev et al.[81] Each RDF was based on first defining the ion–DNA separation as the closest distance between the DNA molecule and the particular ion. These distances were used to construct DNA–ion distance histograms from each snapshot of the MD simulation. To obtain the RDFs, the histograms were normalized by the volume Jacobian, playing the role of the configurational entropy of ions structured around DNA. The number of neighbors within a distance r from a given object iswhere ρ is the bulk particle concentration, g(r) is the RDF (pair correlation function), and J(r) is the volume Jacobian. The latter is defined as the volume of a shell, equidistant from the DNA surface (see Figure 4a,b), which was numerically calculated as a function of a distance from the DNA surface (Figure 4b). To minimize end effects, only the inner region of the DNA duplex (eight internal base pairs in the case of the 12-bais-pair EcoRI dodecamer) was used for computing the RDFs. This was achieved by “clipping” the periodic boundary cell with two parallel planes that pass through the centers of the upper four and the lower four DNA base pairs and perpendicular to the line connecting these centers. The above procedure was repeated for each snapshot along the MD trajectory.

Figure 4

Side (A) and top (B) views of the DNA and equidistant shell (in red) from its surface. The shell’s thickness is dr, and its volume is J(r) dr. J(r) is the volume Jacobian whose numerical value as a function of the distance from DNA surface is shown in C. It is seen that the function increases monotonically only after ∼4 Å from the DNA, from which it can be approximated by the cylindrical Jacobian often used for calculation of the ionic distributions around DNA. To avoid end effects, ionic distributions were analyzed in the central region of the DNA, as shown in A. It is worth mentioning that the present method of computing J(r) and RDF differs from the standard procedures relying on either spherical or cylindrical symmetries.[82] For example, although DNA is on average cylindrically symmetric and the use of the cylindrical Jacobian is reasonable, such an approach leads to an overestimation of the counterion condensation at small distances from the DNA surface. This is important, in particular, when calculating the absolute number of ions contributing to the various RDF peaks and subsequently determining the fraction of the DNA charge neutralized by condensed counterions—a critical test for evaluating the quality of the force field used to model ion/DNA interactions (see below). The numerical volume Jacobian, taking into account the complex and irregular shape of the DNA segment, is characterized by an unusual nonmonotonic behavior in the vicinity of the DNA oligomer (Figure 4c). It is seen that only at distances more than ∼5 Å from the DNA surface can the monotonically increasing Jacobian be approximated by the cylindrical one. Ion–DNA distance histograms, computed over snapshots saved every 4 ps from the 100 ns simulations, were normalized by the corresponding numerical Jacobian. Three-dimensional grids with lattice spacings of 0.5 Å were used to calculate both the ion-DNA distance histograms and the volume Jacobian. The biochemical algorithm library (BALL)[83] was used to implement the computational analysis subroutines. Additionally, BALL was used for writing code for reimaging/recentering of MD trajectories in cases when DNA strands (treated as independent segments) appeared separated in the course of simulation because of the periodic boundary conditions. This postprocessing was necessary for subsequent calculations of the volume Jacobian and RDFs, and also other overall structural DNA properties, such as root-mean-square (RMS) deviations and Watson–Crick base-pairing.

Results and Discussion

Earlier stages of the Drude force field optimization for DNA were aimed at resolving overall duplex instability, while later published efforts[28] addressed finer details of the DNA structural behavior after the duplex stability was achieved. The resulting Drude DNA parameters were subjected to an extensive validation focusing on the details of the key backbone, sugar and glycosidic dihedral angles, whose correlated motions influence two critical aspects of the DNA structural behavior in solution–the proper treatment of the BI/BII transitions and the equilibrium between A and B forms DNA.[28] In the present work we present a description of earlier parametrization of selected bonded parameters in the bases and nonbonded parameters regulating electrostatic and van der Waals interactions between mobile ions, DNA atoms and water molecules. These interactions are strongly interdependent and must be properly balanced, with the balance extending to the rest of the FF. For example, mitigating electrostatic interactions between selected atoms along the DNA backbone by varying their default partial charges proved necessary for achieving experimentally and computationally predicted BI-to-BII transition frequencies[8,84] and the dynamical puckering transitions of the sugars,[85−87] as well as better consistency with gas-phase QM data.[11] Such alterations, however, disrupted the existing balance of various energetic contributions having an effect on, e.g., DNA hydration properties, requiring adjustments of LJ parameters that impact specific interactions with water molecules. Those adjustments, in turn, prompted calculations of the hydration free energies and osmotic pressure coefficients for a number of biologically relevant ions in aqueous solution and comparison of MD simulation results with available experimental data. Another complication that was encountered involved the interactions between the DNA and mobile ions. While the individual ions[72,78] and base nonbond parameters[53] were carefully optimized, that optimization did not involve direct interactions of the ions with the bases. In the FF, these “off-diagonal” interactions are treated with the electrostatic parameters for the individual moieties along with LJ parameters obtained from combining rules, the Lorentz–Berthelot combining rules[88] in the present case. With the additive force field, this approximation generally worked well, although limitations in the approach have been discussed.[51,60,89,90] However, with the Drude force field, the direct use of the electrostatic and combining rule LJ parameters has been shown to be more problematic, leading to optimization of both terms in specific cases via the use of through-space Thole interactions (NBTHOLE)[91,92] and pair-specific LJ parameters (NBFIX),[43] respectively. For example, NBTHOLE terms have been used to fine-tune the interactions of counterion pairs based on osmotic pressures[72] and the pair-specific LJ parameters have been used to improve free energies of hydration.[60] With DNA, an imbalance between the base and counterions leads to the interactions between them being significantly overestimated, leading to destabilization of the DNA. To correct this, gas phase QM calculations of interactions of the nucleic acid bases and Na+ ion were used to initially optimize the off-diagonal parameters, followed by MD simulations of the fully solvated DNA duplex in a salt buffer with analysis focused on reproduction of predictions from counterion condensation theory.[93] Following several iterations, pair-specific LJ parameters for the ions and nucleic acid base hydrogen bond acceptor atoms were obtained that achieve satisfactory agreement with both the QM and counterion condensation theory target data.

Tuning the Interactions Between Nucleic Acid Bases and Na+ Counterions

Initial nonbond parameters for the DNA FF involved the published base[53] and atomic ion[72] parameters; however, their application lead to instabilities in simulations of duplex DNA. As shown in Figure S1 of the Supporting Information, simulations of the EcoRI dodecamer resulted in RMS deviations from canonical B conformation which increased irreversibly after ∼10 ns and ∼15 ns of MD simulations, respectively, when only neutralizing Na+ ions and Na+ ions together with an extra ∼120 mM NaCl salt were included in the simulation system. The direct impact of the salt concentration on the longevity of the DNA’s stability domain suggested the potential cause for such behavior, which turned out to be associated with the anomalously favorable interactions among Na+ ions and the DNA oligomer. Identification of the problem proceeded as follows. We first computed the DNA/Na+ RDFs and compared the results from polarizable Drude and analogous additive C36 MD simulations (see Figure S2, Supporting Information). Such comparison revealed that Na+ condensed on DNA to a much greater extent in the polarizable model, with that increase largely involving direct contact of Na+ with DNA. Subsequent RDF analysis of specific sites on the DNA (Figure S3, Supporting Information) identified stronger binding of Na+ in the central AATT tract of the EcoRI oligomer. Analysis was then performed on the frequency of Na+ binding to these four internal residues (Figure S4) revealing that while, on average, Na+ ions interact more frequently with the phosphate groups in the additive model (Figure S4a), there is an enhanced binding of the sodium to the interior of the DNA in the Drude model, namely, to the N1/N6 atoms of the adenine and N3/O4 atoms of the thymine (Figure S4b, Supporting Information). Because these atoms participate in the Thy-Ade Watson–Crick base-pairing (N3–H···N1 and O4···H–N6 hydrogen bonds), it became clear that the cause for the DNA structural disruption was Na+ competing for those interactions. The above analysis prompted the calculation of QM and MM interactions between the individual nucleic acid bases and Na+. Additionally, as the AA stacked pair was indicated to be problematic (Figure S4c, Supporting Information), calculations on the Ade-Ade/Na+trimer complex were performed, with the adenine bases fixed in the B DNA conformation. The QM and MM potential energy scans are shown in Figure 5. As is evident, the default Drude parameters largely overestimate the attraction between Na+ and selected atoms of the bases at short distances. The MM and QM energies differ by ∼12–15 kcal/mol for Ade and Cyt bases (Figure 5a, d), and by ∼8 kcal/mol for Thy and Gua bases (Figure 5b, c). The discrepancy is even bigger in the case of the Ade–Ade/Na+ trimer for which MM and QM differ by nearly 40 kcal/mol (Figure 5e). Clearly, these results point to the default Drude parameters leading to overestimation of the base-Na+ interactions ultimately destabilizing duplex DNA.

Figure 5

QM and empirical Drude potential energy surfaces as functions of the distance between Na+ ion and selected atoms of the nucleic acid bases (A–D), or of A–A stacked bases (E).

QM and empirical Drude potential energy surfaces as functions of the distance between Na+ ion and selected atoms of the nucleic acid bases (A–D), or of A–A stacked bases (E). To overcome the overestimation of the base-Na+ interactions, we considered the use of both the NBTHOLE[74] and pair-specific LJ (NBFIX option in CHARMM)[43] approaches. While both approaches could alleviate the problem, as shown in Figure 5e, use of pair-specific LJ terms was considered to be more practical as a number of other pair-specific LJ terms are implemented in the Drude FF. The NBFIX approach is based on assigning small repulsive cores (i.e., LJ radii) to the particles of interest to mitigate Coulomb interaction among them. To resolve the problem, we modified only the values of Rmin and did not alter the LJ depth (ε) from that obtained from the combining rules. Such modifications have been made for LJ interactions between Na+ and the following atoms: (1) N1, N3, N6 of adenine; (2) N1, N2, N3 of guanine; (3) N1, N3, N6 of thymine; and (4) N4, N3 of cytosine. In practice, because of the repeating atom types in all four bases, only three unique pair-specific LJ terms (e.g., NBFIXes) were introduced. As seen from Figure 5, this improved the consistency with the QM data. As anticipated, these adjustments led to improved stability of duplex DNA in solution, as judged from RMS differences versus B-form DNA for the EcoRI dodecamer (see Figure S5).

Adjustment of the Internal Parameters for Nucleic Acid Bases

During preliminary duplex DNA MD simulations, significant out-of-plane distortions of individual bases lead to DNA instability on times exceeding ∼30 ns (Figure S5, Supporting Information). Visual examination of the base geometries during the course of the simulations revealed that excessive out-of-plane distortions were evident for non-hydrogen atoms as well as for the amino groups. The combined effect of these distortions was the formation of hydrogen bond-like interactions between the stacked bases in the duplex DNA, in particular, the amino groups with the carbonyl oxygen atoms. Interestingly, similar deformations were recently observed from gas-phase QM calculations on model systems composed of the two stacked bases, whose overall geometries were fixed in either B or A DNA conformations based on selected sets of internal coordinates, but allowed to relax otherwise.[94] While the effect of these out-of-plane distortions in duplex DNA in solution is not clear,[95] such base deformations clearly need to be limited to achieve a stable DNA structure. To understand the cause of the distortions, analysis of the amino groups in the adenine and cytosine bases demonstrated that the out-of-plane displacement of the NH2 group hydrogen atoms was associated with the sum of the valence angles around nitrogen atoms being smaller than 360° (Figure S6, Supporting Information). A similar analysis was also performed for the base’s 5- and 6-membered rings whose planarity is characterized by the sum of all inner valence angles being 540° and 720°, respectively (not shown). This indicated the valence angles required additional optimization. Adjustments of the internal base parameters were performed targeting crystal survey geometries[5] and QM geometries and vibrational spectra of individual bases with their geometries restrained to be planar. Based on the above observations, optimization focused on the equilibrium values for the valence angles, subjected to the following restraints: (1) the sum of the valence angles around the planar centers, such as amino groups, should be 360°; and (2) the sum of the inner valence angles in the 5- and 6-membered rings should be 540° and 720°, respectively. Although these were just a few restraints, performing such adjustments posed a challenge because of the repeating atom types in chemically different bases, making it difficult for these conditions to be satisfied for all in-plane centers and/or rings. However, the final parameters significantly improved both the agreement of the Drude model with both the experimental geometries and QM frequencies of the bases, as shown in Tables S1–S8 of the Supporting Information. Following these corrections, the DNA duplex was observed to be stable on time scales of 100 ns based on MD simulations of the EcoR1 dodecamer (Figure S5), as well as simulations of other duplexes presented in our previous study.[28]

Addressing Fine Details of the DNA Structural Behavior

Once extended simulations of DNA in solution were possible, more subtle conformational effects were addressed. These included transitions between the BI and BII substates of the canonical B form DNA and pucker transitions of the sugar groups from the dominant south conformation to the occasionally visited north conformation in B form simulations.[28] Details of the optimization process follow. Model compound 1 was designed to study energetics of the correlated motions among the ε and ζ backbone dihedral angles in the context of the BI/BII transitions in duplex DNA.[11,28] QM energetic data on 1 for 2D energy scans of ε versus ζ torsions along with experimentally determined populations of the BI and BII states were used as target data for optimization of the ε and ζ torsions in the context of the initial Drude parameters. As the optimization proceeded, it was necessary to significantly sacrifice the quality of agreement with the QM data to obtain satisfactory sampling of the BI and BII states. The resulting Drude 2D ε versus ζ energy surface showed the model to significantly overstabilize the BII state of 1 as compared to the QM surface, as shown in Figure 6a,b, respectively. In addition to the overall shapes of the QM and MM landscapes differing dramatically, the low energy pathways connecting BI and BII states were in poor agreement. Moreover, while BI-to-BII transitions were occurring, no experimentally observed transitions in sugar pucker from the south to north conformation,[85] an important prerequisite for stabilizing A-form DNA under appropriate conditions,[9,11,12,28,96] occurred. This is important as the A/B and BI/BII conformational equilibrium are strongly coupled, with the BII substate known to be strongly anticorrelated with sampling of the north sugar conformation characteristic of the A form DNA.[3,11,28]

Figure 6

QM and empirical Drude 2D potential energy surfaces of ε versus ζ for model compound 1 (T3PS). Also depicted are structures of the lowest (BII) and highest (240/240) energy conformations (C) of landscape A generated by nonoptimized Drude parameters. Some of the 1D representative ε and ζ energy profiles are provided in the Supporting Information, Figure S7. This involved energetic analysis of conformations of 1 having the smallest and the largest relative energies; those states are indicated by blue circles in Figure 6a with the approximate values of the ε/ζ angles being 240°/240° for the highest and 245°/175° (BII) for the lowest energetic states. Comparison of individual energetic contributions (bonded, LJ and electrostatic) in these conformations revealed that their electrostatic contributions differ by ∼8 kcal/mol in favor of the BII state, a contribution almost entirely responsible for the total energetic difference of ∼7 kcal/mol. Examination of the geometries of these conformations suggested this difference to be attributed to the attraction between the anionic O1P/O2P atoms of the phosphate and the positively charged C3′ carbon of the sugar ring. This distance is shorter by ∼1 Å in the BII conformation, as shown in Figure 6c. Due to this favorable interaction, the C3′–O3′–P valence angle is significantly deformed, being ∼109° instead of ∼120° as observed in a survey of DNA crystal structures.[5] Despite being a local effect, the polymeric nature of DNA would lead to this difference potentially causing an elongated DNA duplex. To overcome this issue, we altered the default electrostatic parameters by making the partial charges for the O1P/O2P atoms less negative by 0.1e, and the C3′ atom less positive by 0.2e. Following this change and additional optimization, it was observed that the agreement between the QM and MM data for 1 was significantly improved (Figure 6d) as was the agreement with experiment for the BI/BII equilibrium and puckering of the sugar moieties in duplex DNA. In addition, the MM model better reproduced experimental data for selected valence and dihedral angles in duplex DNA.[28] In the empirical ε versus ζ potential energy surface, the flattening of the landscape was achieved by mitigating the electrostatic interactions. To facilitate the comparison between QM and MM outcomes, we also included in the Supporting Information several representative 1D potential energy profiles as a function of ε or ζ obtained for model compound 1 being in either BI or BII conformation (Figure S7). Figure 7 shows time series of the ε and ζ torsions sampled in the EcoRI dodecamer using the final parameters, demonstrating frequent transitions in a bimodal fashion corresponding to BI/BII transitions. Additionally, time series for the sugar pseudorotation angles indicate pucker transitions from the north to south conformation occurring on the experimentally observed time scale of ∼200 ps.[85] Figure S8 of Supporting Information also demonstrates BI/BII transitions and sugar pucker transitions to occur on the level of a single nucleotide. Thus, adjustments of the partial atomic charges lead to significant improvements in the conformational properties in the context of the gas phase model compound QM data and the condensed phase duplex DNA data.

Figure 7

Time series and probability distribution functions from the MD simulation of the EcoRI dodecamer for ζ and ε backbone torsions, as well as for the sugar puckering obtained for the Drude parameters optimized based on model compound 2. Transitions between BI (ε ≈ 190°, ζ ≈ 270°) and BII (ε ≈ 270°, ζ ≈ 180°) states, and north (P ≈ 15°) and south (P ≈ 175°) sugar conformations are evident. Based on these time series, both BI/BII transitions and sugar pucker transitions occur on time scales of ∼200 ps, consistent with the QM data for model compound 2 and NMR experimental data, respectively.

Optimization of LJ Parameters for Interactions Between the DNA Phosphate Group and Water

Due to the changes in DNA backbone electrostatics it was necessary to reevaluate the hydration of the phosphodiester backbone. This was performed based on the ability of the force field to reproduce the hydration free energy as well as the QM minimum interactions energies with individual water molecules using DMP as the model compound (Figure 1). The experimental hydration free energy of DMP is not available in the literature, but it can be estimated using the thermodynamic cycle shown in Figure 8. Δ1 is determined to be = −325 ± 4 kcal/mol from gas phase acidity data,[97] and Δ3 = 1.75 kcal/mol is obtained from the pKa of protonated DMP (HDMP).[98] To estimate Δ2, the hydration free energy of HDMP, we start with the known hydration free energy of a closely related compound, trimethylphosphate (TMP), which has a hydration free energy of −8.7 kcal/mol.[99] We then use QM to compute the relative hydration free energy between HDMP and TMP: a QM AMSOL[100,101] calculation reveals it to be −3.4 kcal/mol, giving an estimate of −12.1 kcal/mol for ΔG2. Alternatively, the relative hydration free energy between HDMP and TMP may be determined from the free energies of hydration of acetic acid, −6.7 kcal/mol, and methyl acetate, −3.32 kcal/mol,[99] resulting in the same value for Δ2 = −12.1 kcal/mol. For the hydration free energy of the proton, Δ(H+), a value of −258.8 kcal/mol is used.[78] Putting all the contributions together gives an estimate for the experimental hydration free energy of DMP of −78.4 ± 5 kcal/mol.

Figure 8

Thermodynamic cycle used to determine the experimental hydration free energy of DMP.

Thermodynamic cycle used to determine the experimental hydration free energy of DMP. Table 1 shows the hydration free energy for DMP computed at different stages of the parameter optimization. It is seen that the above-described change in charges for the O1P/O2P oxygen and C3′ carbon atoms resulted in the hydration free energies being less favorable by ∼14 kcal/mol, significantly lower than the experimental estimate. To correct this, we systematically modified LJ parameters of selected atoms in DMP, varying both the well depth (ε) and Rmin terms. The final value of the hydration free energy obtained using the optimized Drude model parameters was −74 kcal/mol, in good agreement with the experimental estimate.

Table 1

Hydration Free Energies (kcal/mol) for DMP at Different Stages of the Optimization Procedure

experiment	default parameters	altered electrostatic parameters	altered electrostatic/LJ parameters
–78.4	–82.1	–64.3	–76.2

To further test the quality of the model, the gas phase QM and MM minimum interaction energies for DMP–water monohydrates in different orientation were determined. This analysis is important as it indicates whether the relative energies of hydrogen bonds with different atoms on DMP are correctly reproduced, information that the hydration free energy cannot provide. A total of nine different water positions were considered, as shown in Figure 2. The minimum interaction energies and distances with DMP in the gg conformation, corresponding to the geometry of the phosphate group encountered in DNA duplexes, are shown in Table 2. Minimum interaction energies for the other two minimum energy conformations of DMP, gt and tt, are provided in Tables S9 and S10, respectively, of the Supporting Information. It is clear from the data that the gas phase interactions with water are uniformly well reproduced. In general, the results indicate that an appropriate balance between the electrostatic and LJ parameters has been achieved. Orientations are as shown in Figure 2. All energies are in kcal/mol. Distances for selected conformations are in Å. In QM calculations, the structure of the DMP was first optimized at MP2/6-31+G* level of theory. Next, the distances between water molecule and DMP were optimized at MP2/6-31+G* level of theory with the rest of the dimer fixed. QM interaction energies were calculated at the RI-MP2/cc-pVQZ level of theory using counterpoise correction. See Methods section for details. Distances for the QM optimized geometries 2–6 are between the water hydrogen and o11/o13 oxygen atom of DMP. For geometry 9, the distance is between water oxygen and the DMP phosphorus.

Optimization of the Na+–Phosphate Interactions Targeting Osmotic Pressure Calculations

As evidenced above for the base-Na+ interactions, the proper balance of the nonbond portion of the force field is essential. To investigate this for the phosphodiester backbone and the NaCl salt, osmotic pressure coefficients were calculated for selected biologically relevant electrolyte solutions. Osmotic pressure coefficient is a measure of the solution “nonideality”, describing how much the behavior of the real electrolyte solution deviates from that of an ideal solution at a particular concentration. Thermodynamically, osmotic pressure is related to the second osmotic coefficient in the osmotic virial series for a nonideal solution, with that coefficient, in turn, related to the effective interaction potential (potential of mean force) between a pair of ions.[102] Thus, reproduction of osmotic pressures is an important requirement directly related to the FF’s ability to properly model interionic interactions and ensure those interactions are properly balanced with the ion–water interactions. Systems studied included aqueous solutions of Na+ and Cl– ions, Na+ and DMP– ions, and Na+ and Acetate– ions, each at low and high molal concentrations (∼1 M and 3M). Experimental values for the osmotic pressures of Na+/Cl– electrolytes are available for a wide range of concentrations, [0···6 M].[12,103] For Na+/DMP– solution, experimental osmotic pressures are known only at low salt concentrations (≤1 M).[104] Therefore, an additional electrolyte system composed of Na+ and Acetate–, due to the acetate oxygen atoms being chemically similar to the anionic oxygens of DMP–, was simulated at ∼1 M and 3M, concentrations for which experimental osmotic pressure coefficients are available. Based on the osmotic pressure calculations, optimization of selected pair-specific LJ parameters between Na+ and DMP was performed. Table 3 shows the osmotic pressures produced by the final Drude parameters along with experimental values and results from the additive C36 FF. Table 4 contains the actual molal concentrations in the ionic slab from the MD simulations, required for direct determination with the experimental data. Comparison of the results in Table 3 shows the level of agreement to be quite good. The Drude model is typically in improved agreement with experiment over C36, although the additive model is in better agreement for the Na+/Acetate– solution. Given the overall good agreement between the Drude and experimental data, the current parametrization provides a plausible description of osmotic properties of biologically relevant ionic solutions.

Table 3

Experimental and MD Simulation Results for the Osmotic Pressures (Bars) for Different Electrolyte Solutions at ∼1 M and ∼3 M Concentrations

	experiment		Drude FF		additive C36 FF
	∼1 M	∼3 M	∼1 M	∼3 M	∼1 M	∼3 M
Na–Cl	42.67	152.01	42.14 ± 3.52	153.36 ± 4.21	44.01 ± 4.24	149.58 ± 6.41
Na–DMP	47.41	226.8 [MD]a	48.66 ± 4.21	281.13 ± 21.66	45.39 ± 5.11	228.30 ± 18.61
Na–Acetate	49.70	195.20	47.42 ± 3.12	174.62 ± 8.15	44.23 ± 2.21	181.88 ± 9.10

This value is from the computational study of Aksimentiev et al. utilizing AMBER force field.[74]

Table 4

Actual Molal Concentrations for All Studied Electrolyte Solutions Computed from MD Simulations According to Eq 4

	Drude FF		additive FF
	∼1 M	∼3 M	∼1 M	∼3 M
Na–Cl	0.85	3.04	0.79	2.82
Na–DMP	0.89	3.37	0.91	3.55
Na–Acetate	0.99	3.24	0.91	3.00

This value is from the computational study of Aksimentiev et al. utilizing AMBER force field.[74] While experimental data for osmotic pressure of the Na+/DMP– system is not available at high concentrations, the value obtained from a MD simulation study of this system at ∼3 M is presented in Table 3. The value was determined in a study aimed at improved LJ parametrization of specific ion pairs in the AMBER force field.[74] As can be seen, the result from that study is in a good agreement with the additive C36 result, while the Drude simulation produced a noticeably higher value. Given the underestimation of the osmotic pressure for the Na+/Acetate– system, this result indicates that the experimental value may be significantly higher than that obtained from the additive FFs. This result serves as a prediction by the Drude FF.

Analysis of the Ionic Distribution Around DNA Duplex

Our final test of the Drude model is focused on details of the ionic structuring around the DNA duplex. Mobile ions impact DNA structure and dynamics through neutralization of its residual charge.[7] As demonstrated above, excessive condensation of counterions to the DNA may lead to disruption of the DNA structure. Numerous prior experimental[105−112] and theoretical[81,82,113−116] studies focused on sequence specific ionic binding, such as determining whether and how long counterions reside in DNA grooves, or competitive ionic binding to specific DNA sites. While these aspects will be addressed in future studies, for validation we will focus on the overall ionic condensation. Ionic distribution around DNA was characterized by computing RDFs, with averaging over the central region of the DNA duplexes.[81] Such an approach allows for a direct comparison of the MD simulation results with the counterion condensation (CC) theory.[93] In their seminal work, Onsager and Manning[117] demonstrated that CC must occur on linear rods that are uniformly charged above some critical charge density. The DNA molecule, being linear and carrying high charge per unit length, is well above this threshold.[118] One can estimate from these simple arguments that 76% of the total DNA charge is expected to be neutralized by condensed counterions within a ∼9 Å Manning radius.[93,119] RDFs for Na+ and Cl– ions shown in Figure 9 were computed from MD simulations of the EcoRI dodecamer in the ∼120 mM NaCl aqueous solution using both the polarizable Drude and additive C36 models. Analogous data for another two DNA systems, a longer Junfos (16-base-pair) and a shorter 1DCV (10-base-pair) duplex are provided in the Supporting Information (Figure S9). In all graphs, Na+ RDFs are characterized by three prominent peaks, indicating the formation of ionic “shells” around polyanionic DNA. The onset of such an ionic structuring when approaching DNA from the bulk is commonly associated with the Manning’s radius of CC theory. As seen from the plots, counterions are structured within ∼9 Å from DNA surface in both FFs, coinciding with the CC predictions. This result is consistent with the definition of Beveridge and co-workers[82] for the condensed counterions as lying within a 9 Å region from the DNA surface, where ions are structured with respect to DNA.

Figure 9

Na+/DNA and Cl–/DNA RDFs from the Drude (black) and additive (red) MD simulations of the EcoRI dodecamer computed based on the closest approach, as elaborated in the Methods section.

Na+/DNA and Cl–/DNA RDFs from the Drude (black) and additive (red) MD simulations of the EcoRI dodecamer computed based on the closest approach, as elaborated in the Methods section. The computed ionic RDFs were used to calculate the fraction of the DNA residual charge neutralized by the ions. To this end, RDFs for Na+ and Cl– were integrated from 0 to 9 Å, according to eq 5, and the resulting number of co-ions and counterions were summed and divided by DNA residual charge. Drude and additive result for DD, Junfos, and 1DCV DNA systems are shown in Table 5. The agreement with the CC estimates is clearly better for the Drude FF simulations, which predict ∼77%, ∼79%, and ∼75% of the DNA residual charge to be neutralized by ions within the Manning radius for EcoRI, 1DCV and Junfos, respectively. These results indicate that the Drude model faithfully captures basic features of the interplay between DNA duplexes and the surrounding ionic atmosphere, offering an improvement over the additive C36 FF, and further indicating the quality of the nonbonded parameters for DNA and mobile ions.

Table 5

Comparison between Additive and Drude MD Simulation Results and Counterion Condensation Theory Based on Analysis of Ion/DNA RDFs

	% DNA Charge Neutralized within 9 Å
	DD	1DCV	JunFos
Drude	77.0%	79.0%	75.4%
C36	69.0%	68.6%	67.2%
CC theory	76.0%	76.0%	76.0%

Conclusions

In the present study we addressed a number of critically important aspects of the development of the Drude polarizable force field for DNA. These issues included treatment of base-Na+ interactions, proper treatment of the base intramolecular parameters, balancing DNA electrostatic and hydration effects, and how they impact the overall DNA stability and regulate subtler aspects of DNA conformational behavior. For example, when Drude parameters developed for the nucleic acid bases were applied to a DNA duplex in solution, the DNA structure became unstable after ∼25 ns of MD simulation time because of significant out-of-plane deformations of individual bases. The problem was overcome by a partial reoptimization of base valence angle parameter based on reproduction of experimental geometries and QM geometries and vibrational spectra. Next, based on the QM model compound calculations, altering the default electrostatic charges of selected backbone DNA atoms was shown to be necessary to improve BI/BII conformational equilibrium and to better reproduce the dynamical sugar pucker transitions observed experimentally.[85] Such changes, however, required readjustment of selected LJ parameters dictating interactions between DNA and water, as well as DNA and mobile ions. These modifications were based on simultaneous targeting a range of experimental and QM data, such as the hydration free energy for DMP and the QM gas-phase interactions of DMP with water molecules, QM data on small model compounds representative of the bases and DMP- interacting with Na+ ion, and experimental data on osmotic pressure measurements. A critical test of the FFs ability to capture basic features of the distribution of mobile ions around DNA involved comparing predictions from counterion condensation theory[93] with those based on computing ion/DNA RDFs obtained from MD simulations. Our results indicate that Drude simulations generate a physically plausible picture of ionic distribution around DNA, reproducing the theoretical estimates of the fraction of neutralized residual DNA charge by the condensed ions. Notably, the Drude results show an improvement over the C36 FF, indicating the improved physical description of the forces dictating the ionic solvation of DNA due to the explicit treatment of electronic polarizability. In summary, we focused on balancing the electrostatics and hydration of the DNA backbone by optimizing selected Drude parameters for interactions between DNA, mobile ions and water molecules. Our initial efforts were aimed at stabilizing an overall DNA structure in solution, which included improvements in the base valence angle parameters, while later parameter modifications addressed finer details of the DNA conformational behavior, such as BI/BII transitions and the equilibrium between north and south states of the sugar moiety.[11,28,96] Force field validation presented in this study, together with the earlier demonstrated ability of the force field to adequately model all aspects of the B form DNA, as well as to stabilize the A form in low water activity environment,[28] indicate that the current Drude polarizable model for DNA is suitable for computational studies of various DNA molecules under varying conditions on the 100 ns time scale. In addition, the availability of Drude polarizable parameters for proteins,[40] lipids,[41] and carbohydrates[53,92,120−128] will allow for simulation studies of heterogeneous biological systems.

73 in total

1. Sequence-specific binding of counterions to B-DNA.

Authors: V P Denisov; B Halle
Journal: Proc Natl Acad Sci U S A Date: 2000-01-18 Impact factor: 11.205

2. Atomic Level Anisotropy in the Electrostatic Modeling of Lone Pairs for a Polarizable Force Field Based on the Classical Drude Oscillator.

Authors: Edward Harder; Victor M Anisimov; Igor V Vorobyov; Pedro E M Lopes; Sergei Y Noskov; Alexander D MacKerell; Benoît Roux
Journal: J Chem Theory Comput Date: 2006-11 Impact factor: 6.006

3. CHARMM-GUI: a web-based graphical user interface for CHARMM.

Authors: Sunhwan Jo; Taehoon Kim; Vidyashankara G Iyer; Wonpil Im
Journal: J Comput Chem Date: 2008-08 Impact factor: 3.376

4. The B-DNA dodecamer at high resolution reveals a spine of water on sodium.

Authors: X Shui; L McFail-Isom; G G Hu; L D Williams
Journal: Biochemistry Date: 1998-06-09 Impact factor: 3.162

5. The anatomy of A-, B-, and Z-DNA.

Authors: R E Dickerson; H R Drew; B N Conner; R M Wing; A V Fratini; M L Kopka
Journal: Science Date: 1982-04-30 Impact factor: 47.728

6. Additive and Classical Drude Polarizable Force Fields for Linear and Cyclic Ethers.

Authors: Igor Vorobyov; Victor M Anisimov; Shannon Greene; Richard M Venable; Adam Moser; Richard W Pastor; Alexander D MacKerell
Journal: J Chem Theory Comput Date: 2007-05 Impact factor: 6.006

7. Relative stability of different DNA guanine quadruplex stem topologies derived using large-scale quantum-chemical computations.

Authors: Jiří Šponer; Arnošt Mládek; Naďa Špačková; Xiaohui Cang; Thomas E Cheatham; Stefan Grimme
Journal: J Am Chem Soc Date: 2013-06-19 Impact factor: 15.419

8. Toward a full characterization of nucleic acid components in aqueous solution: simulations of nucleosides.

Authors: Nicolas Foloppe; Lennart Nilsson
Journal: J Phys Chem B Date: 2005-05-12 Impact factor: 2.991

9. Insight into the stabilization of A-DNA by specific ion association: spontaneous B-DNA to A-DNA transitions observed in molecular dynamics simulations of d[ACCCGCGGGT]2 in the presence of hexaamminecobalt(III).

Authors: T E Cheatham; P A Kollman
Journal: Structure Date: 1997-10-15 Impact factor: 5.006

10. Polarizable empirical force field for nitrogen-containing heteroaromatic compounds based on the classical Drude oscillator.

Authors: Pedro E M Lopes; Guillaume Lamoureux; Alexander D Mackerell
Journal: J Comput Chem Date: 2009-09 Impact factor: 3.376

34 in total

1. Polarizable force field for RNA based on the classical drude oscillator.

Authors: Justin A Lemkul; Alexander D MacKerell
Journal: J Comput Chem Date: 2018-12-15 Impact factor: 3.376

2. An Estimation of Hybrid Quantum Mechanical Molecular Mechanical Polarization Energies for Small Molecules Using Polarizable Force-Field Approaches.

Authors: Jing Huang; Ye Mei; Gerhard König; Andrew C Simmonett; Frank C Pickard; Qin Wu; Lee-Ping Wang; Alexander D MacKerell; Bernard R Brooks; Yihan Shao
Journal: J Chem Theory Comput Date: 2017-01-24 Impact factor: 6.006

3. Ion-Hydroxyl Interactions: From High-Level Quantum Benchmarks to Transferable Polarizable Force Fields.

Authors: Vered Wineman-Fisher; Yasmine Al-Hamdani; Iqbal Addou; Alexandre Tkatchenko; Sameer Varma
Journal: J Chem Theory Comput Date: 2019-03-13 Impact factor: 6.006

Review 4. Close encounters with DNA.

Authors: C Maffeo; J Yoo; J Comer; D B Wells; B Luan; A Aksimentiev
Journal: J Phys Condens Matter Date: 2014-09-19 Impact factor: 2.333

Review 5. CHARMM additive and polarizable force fields for biophysics and computer-aided drug design.

Authors: K Vanommeslaeghe; A D MacKerell
Journal: Biochim Biophys Acta Date: 2014-08-19

6. Extracting water and ion distributions from solution x-ray scattering experiments.

Authors: Hung T Nguyen; Suzette A Pabit; Lois Pollack; David A Case
Journal: J Chem Phys Date: 2016-06-07 Impact factor: 3.488

7. Competition among Li(+), Na(+), K(+), and Rb(+) monovalent ions for DNA in molecular dynamics simulations using the additive CHARMM36 and Drude polarizable force fields.

Authors: Alexey Savelyev; Alexander D MacKerell
Journal: J Phys Chem B Date: 2015-03-18 Impact factor: 2.991

8. Transferable interactions of Li⁺ and Mg²⁺ ions in polarizable models.

Authors: Vered Wineman-Fisher; Julián Meléndez Delgado; Péter R Nagy; Eric Jakobsson; Sagar A Pandit; Sameer Varma
Journal: J Chem Phys Date: 2020-09-14 Impact factor: 3.488

9. Balancing the Interactions of Mg²⁺ in Aqueous Solution and with Nucleic Acid Moieties For a Polarizable Force Field Based on the Classical Drude Oscillator Model.

Authors: Justin A Lemkul; Alexander D MacKerell
Journal: J Phys Chem B Date: 2016-10-27 Impact factor: 2.991

10. Differential Deformability of the DNA Minor Groove and Altered BI/BII Backbone Conformational Equilibrium by the Monovalent Ions Li(+), Na(+), K(+), and Rb(+) via Water-Mediated Hydrogen Bonding.

Authors: Alexey Savelyev; Alexander D MacKerell
Journal: J Chem Theory Comput Date: 2015-08-26 Impact factor: 6.006