Literature DB >> 34279923

A Quantum Chemical Topology Picture of Intermolecular Electrostatic Interactions and Charge Penetration Energy.

Fernando Jiménez-Grávalos¹, Dimas Suárez¹.

Abstract

Based on the Interacting Quantum Atoms approach, we present herein a conceptual and theoretical framework of short-range electrostatic interactions, whose accurate description is still a challenging problem in molecular modeling. For all the noncovalent complexes in the S66 database, the fragment-based and atomic decomposition of the electrostatic binding energies is performed using both the charge density of the dimers and the unrelaxed densities of the monomers. This energy decomposition together with dispersion corrections gives rise to a pairwise approximation to the total binding energy. It also provides energetic descriptors at varying distance that directly address the atomic and molecular electrostatic interactions as described by point-charge or multipole-based potentials. Additionally, we propose a consistent definition of the charge penetration energy within quantum chemical topology, which is mainly characterized in terms of the intramolecular electrostatic energy. Finally, we discuss some practical implications of our results for the design and validation of electrostatic potentials.

Entities: Chemical

Year: 2021 PMID： 34279923 PMCID： PMC8901103 DOI： 10.1021/acs.jctc.1c00263

Source DB: PubMed Journal: J Chem Theory Comput ISSN： 1549-9618 Impact factor: 6.006

Introduction

Electrostatic interactions are central to molecular modeling because of their slow decay and strength. Especially when polar atoms or charged species are involved, they largely determine the stability and activity of biomolecules such as proteins, nucleic acids, or lipids, among others.[1,2] As such, a reliable description in molecular mechanics (MM) potentials is essential both in the short- and in the long-range. Within the framework of MM methods, interactions comprising nonbonded atoms are usually represented by pairwise potentials such as the Lennard-Jones and the Coulomb ones. In the latter case, the use of point charges or higher order multipoles to avoid the integration of interacting charge densities has resulted in accurate electrostatics at long-range, with significant improvements to speed up and facilitate convergence such as the Ewald summation and its variants to perform, for example, molecular simulations in solution under periodic boundary conditions.[3−8] At short-range, however, the approximations taken for long distances become less accurate or invalid,[9] and a correct electrostatic description in this regime stills poses a challenge. Hence, there is a growing interest in improving short-range electrostatics (e.g., for troublesome hydrogen bonds), mainly focused on capturing the effects associated with the non-negligible interpenetration of densities, leading to the so-called charge penetration (CP) energy, which is typically defined as the difference between the electrostatic energy computed from continuous charge density distributions and that from multipolar approximations.[10] Thus, several investigations have been devoted in the last years to incorporate the charge penetration energy into the MM electrostatic potentials.[10−14] The separation of various energy terms as implemented in the MM potentials is somehow paralleled by the energy decomposition analysis (EDA) methods.[15] A major goal of any EDA approach is to ascertain the nature and type of the interactions among molecules as well as to rationalize their stabilizing or destabilizing roles, which may have implications for the design, parametrization, and validation of MM potentials such as the electrostatic ones. However, there is no unique recipe to decompose the energy, and thus many EDAs have been developed rooted in different approaches. Hence, symmetry-adapted perturbation theory (SAPT) makes use of a perturbative approach to differentiate the distinct nature of the intermolecular interactions,[16,17] while orbital-based EDAs exploit a stepped scheme to calculate the different energies according to some reference electronic states,[18−20] and the interacting quantum atoms (IQA) method relies on a real space partition of the quantum mechanical (QM) density matrices,[21,22] being thus classified as a quantum chemical topology (QCT) method. According to recent studies, in spite of their crude approximations, it may be feasible to improve the classical MM potentials by utilizing the information provided by EDAs.[23] More specifically, it has been shown that the SAPT energy components (electrostatics, induction, exchange-repulsion, and dispersion) can be modeled with relatively simple MM functions.[24,25] In particular, it has been demonstrated that the combination of empirical damping functions accounting for the CP energy with point multipoles results in electrostatic energies at short-range that are quite close to the SAPT ones. Actually, the SAPT electrostatic energy provides the required reference to parametrize and validate the CP-augmented potentials. However, different interpretations of short-range energetic effects involving the overlap of the electron densities of two or more fragments may be possible depending on the particular EDA of choice.[15] As such, other schemes such as the absolutely localized molecular orbital (ALMO) EDA, that relies on a different nonperturbative decomposition of energy terms, have also been proposed.[26] In this work, we reexamine the nature of electrostatic interactions under the prism of an orbital-invariant, reference-free technique. The IQA approach fulfills these requirements as it is a QCT, real-space energy decomposition resorting to the partition of the reduced density matrices (RDMs). IQA distinguishes not only between electrostatic or exchange-correlation components of the interaction energy but also between intra- or interatomic (or fragment) contributions. Moreover, since IQA splits the total energy of a system and not only the interaction between selected fragments, it is capable of reconstructing (or dissecting) the energy ascribed to both covalent and noncovalent binding, allowing thus covalent bond energies to be characterized[27] as well as the accuracy of the energy components handled by QM fragment methods to be investigated.[28] Herein, we study in detail the electrostatic interactions involved in noncovalent complexes with a twofold goal. On the one hand, we aim to compare in a consistent and systematic manner the atomic and fragment contributions to the electrostatic energy as evaluated throughout a hierarchy of QM and MM approximations and at varying intermolecular distances. In this way, we seek to identify the best correspondence between the IQA and the MM electrostatic terms. On the other hand, we critically examine the CP concept and propose a novel definition relying on a joint orbital and real-space decomposition scheme, which can give new insight into the CP energy. To help fulfill these goals, the rest of the manuscript is structured as follows. First, we present and describe the theoretical scaffold that holds our work, paying particular attention to the IQA—and its IQF variant—energy decomposition, followed by subsections concerning the zeroth-order approximation, the electrostatic MM potentials, and a final assessment of the CP energy and the alternative definition proposed in this work. Subsequently, we describe some computational settings and the results of our test calculations, which were carried out on the S66 and S66x8 data sets.[29,30] The various levels of description of the electrostatic interactions are then discussed based on their statistical correlation with benchmark data, their dependence with the intermolecular separation, etc. The QM and IQA calculations yield further information, not only about the magnitude of the CP energy, but more importantly, about its different role in the IQA descriptors. Finally, we conclude that the aim of improving the electrostatic description is essentially fulfilled at the expense of accounting for intramolecular effects.

Theory and Methods

IQA Decomposition of QM Energies

The interacting quantum atoms method is a robust and physically sound approach to decompose the total QM energy of a system into chemically meaningful components.[21,22] It is based on partitioning the first- and second-order RDMs, as can be done with the real space partition proposed by Bader and co-workers within their Quantum Theory of Atoms in Molecules (QTAIM).[31] Thus, the three-dimensional space is decomposed into atomic regions (Ω) as the attraction basins of the gradient field of the electron density. Given a global energy E of a system, IQA permits its decomposition into atomic components and pair interaction energies according towhere is called the net atomic energy and, under the Born–Oppenheimer approximation, represents the energy due to the kinetic energy of electrons plus all the interactions involved (i.e., electron–electron and electron–nucleus) inside the atomic basin of each atom I. Similarly, each term comprises the interaction energy between the electrons (e) and nucleus (n) located in atom I with those ascribed to other atoms J, which can be separated into n–e, e–e, and n–n contributions. In order to compute the potential energy, the pair density ρ2(r1, r2) is required. This object can be split according to ρ2(r1, r2) = ρ(r1)ρ(r2) + ρ(r1, r2) in two contributions. On the one hand, ρ(r1)ρ(r2) represents a noncorrelated product of densities, whereas electron correlation is accounted for by the exchange-correlation (xc) density ρ(r1, r2). Accordingly, the total interaction energy between two atoms can be decomposed into a Coulomb or electrostatic term and a quantum mechanical exchange-correlation one :the latter term comprising all the associated QM effects that other (e.g., perturbative) approaches identify separately as dispersion, charge-transfer, polarization, etc. However, such a decomposition of into two terms is particularly relevant when assessing the nature of a given bond or interaction, since the electrostatic term is associated with ionicity and the exchange-correlation contribution with covalency.[22] IQA admits the grouping of atomic terms into fragment contributions (e.g., functional groups, molecules). Thus, a fragment decomposition similar to eq of a molecular aggregate constituted by two moieties A and B involveswhere can be calculated analogously. For practical purposes, we use the IQA acronym to refer to the atomic analysis, whereas for its fragment version the term interacting quantum fragments (IQF) is preferred. In a previous work,[32] it was shown that IQF may be useful to dissect the formation energy of noncovalent complexes. Moreover, the IQA/IQF terms can be augmented with Grimme’s D3 dispersion correction[33] as combined with the Becke-Johnson damping function[34] to complement the DFT and HF descriptions. Using the IQF-D3 protocol, the formation energy of a two-fragment system AB given by the process A + B → A···B is split asThe deformation term () in the above equation corresponds to the net energy variation () of fragment A (B), whereas the interfragment interaction energy comprises the electrostatic (), exchange-correlation ), and dispersion () energies between the two fragments, the latter being thus separated from the whole exchange-correlation one. Overall, the contribution of electrostatics and exchange-correlation to ΔE is split between the intrafragment deformation and the interfragment interactions.

Electrostatic Energy from Continuous Charge Densities

The purely electrostatic energy for a given system with total charge density ρ(r) (ρ(r) ≡ ρ(r) = , including both the electron density ρ(r) and the nuclear charges Z at positions R) is readily computed using the Coulomb law,where, for the sake of simplicity, the electrostatic potential in this and the rest of the equations is expressed in atomic units. Interestingly, the QTAIM real space partition derived from ρ(r) allows us to decompose the electrostatic energy at the atomic level, Similarly, the fragment-based decomposition can be readily accomplished in an analogous way, allowing thus the specific assessment of the electrostatic component of the formation energy ΔE of a two-fragment system AB aswhere ΔE is expressed in terms of two contributions, namely, the intrafragment variations of electrostatic energy in the formation process, and , and the interfragment electrostatic interaction, . At this point, we note that although ΔE is commonly termed as a classical electrostatic interaction energy, we will refer to it as the electrostatic contribution to the formation energy of the A···B complex in order to help avoid confusions with the IQA/IQF interaction energy terms. When the charge density is constructed from the unrelaxed fragment densities as , the electrostatic contribution to the formation energy, which is named here as the zeroth-order energy , equals the Coulomb interaction between the unrelaxed densities:This energetic term corresponds to the so-called first-order polarization energy (or simply electrostatic energy) defined in SAPT,[16] which has been adopted as a benchmark electrostatic energy for the validation of recently developed short-range electrostatic potentials.

Electrostatic Potentials in Molecular Mechanics

To avoid the usage of continuous charge distributions, the MM methods typically invoke the multipolar expansion as detailed in the Supporting Information (SI), which approximates the zeroth-order energy defined in eq . Formally, the multipolar electrostatic energy is affected by two different error sources. On the one hand, the underlying expansion must be truncated at some order (l = 0, 1, 2, ...), resulting thus in a certain truncation error. On the other, when and overlap to a significant extent, the rigorous application of the multipole expansion is impeded so that its usage at short distances implies some charge penetration error, which is normally assumed to be dominant. Nevertheless, the multipole-based potentials are still largely useful in many cases, and they enhance convergence by distributing multipoles throughout the molecule at the atomic sites and/or bond centers.[9,35,36] The MM electrostatic potentials can be classified into two groups. On one side, MM methods such as AMBER,[37] CHARMM,[38] GROMOS,[39] and OPLS[40] adopt simple electrostatic formulas with point charges (i.e., monopoles, with l = 0) that are ultimately the result of a fitting procedure against the molecular electrostatic potential (ESP). On the other side, more sophisticated methods, such as NEMO,[41] AMOEBA,[42] or the QTAIM-based FFLUX,[43,44] include higher order multipoles (frequently up to the quadrupoles, l = 2) in order to capture the anisotropy of the distribution of electrons in space. These potentials are generally built from the QM density matrix of the molecule of interest by means of the distributed multipole analysis (DMA)[36] or similar procedures. In addition, some methods (e.g., AMOEBA or NEMO) also refine the DMA multipoles to better reproduce the ESP values. In this way, the resulting charges/multipoles may include in an effective way both high-order multipolar contributions and CP effects. Actually, the performance of the MM potentials is examined statistically as a whole (i.e., using the full MM potential including all bonded and nonbonded terms) by various energetic and structural validation tests. A quite different approach is followed by the FFLUX force field. It makes use of QTAIM multipoles in contrast to the more widespread DMA methodologies and estimates them by means of a machine learning technique depending on each atom’s environment. In comparison with the atomic/multipolar methods that are massively employed in current simulation packages, the electrostatic MM potentials that go beyond the multipolar approximation are much less consolidated. In this category, we find different methodologies such as SIBFA,[45] EFP,[46] and AMOEBA+[24] that complement the multipolar formulas with other (so-called damping) functions to capture very-short-range electrostatics and to remove the CP error. In this way, these potentials (whose general form is shown in the SI) seek to reproduce as evaluated by SAPT or similar methodologies.[24]

Charge Penetration Energy

The CP energy E between two molecules has been defined[47] as the difference between the exact zeroth-order electrostatic energy and its multipolar analogue , Conceptually, this straightforward definition of E is satisfactory. It also shows that E is not only an interfragment quantity but rather an energy that presents also intramolecular contributions according to the real space partitioning of the whole . In this respect, the energetic definition suggests that the CP energy is not limited to the change in the electrostatic interaction between two atoms due to their electron cloud overlap and the associated loss of nuclear screening.[48] The rigorous evaluation of E for different systems at varying intermolecular separations would allow a deeper analysis of electrostatics and, eventually, the development of more accurate potentials. However, as noticed by Bojarowski et al.,[47]different methods of obtaining multipole moments lead to different radii of (pseudo)convergence, different levels of multipole expansions at which (pseudo)convergence is achieved, and different values of penetration energy. Therefore, the value of the CP energy as evaluated with eq may depend on the particular method used to derive the multipoles. Moreover, the usage of truncated expansions introduces some additional truncation error so that both truncation and penetration effects become somewhat mixed in the resulting E values.[49] An alternative to evaluate E has been proposed by Kairys and Jensen.[50] Having noticed the relationship between the CP energy and the magnitude of the orbital overlap, they attempt to recover such an effect from scratch, with a derivation of E independently from the multipolar model used to estimate electrostatics at first stage. However, the authors find that the inherent dependence on the set of molecular orbitals used may lead to different CP values.

A Novel IQF Definition of the Charge Penetration Energy

By combining both the Bader partitioning scheme ) with a total zeroth-order density decomposition (), the following energy terms are obtained: the intramolecular interaction due to or inside a given molecular basin Ω or Ω, leading to , , , and . the intramolecular interaction between the two monomeric densities inside a given basin: and . the intermolecular electrostatic energy between the same density pieces: and . the intermolecular interaction between and in opposite molecular basins: and . Hence, the total electrostatic energy of a complex AB can be written asIn the notation used above the two interacting densities are encompassed by parentheses, while the basins they are integrated in are identified by the corresponding superscripts in the given order (only one if both are the same). Hence, for instance, the term stands for and corresponds to . When the above double decomposition is applied to the electrostatic energies of the separate fragments, such as A, in the final complex, the electrostatic energy of the original species becomes Note that this partitioning is derived from the AB zeroth-order (i.e., Hartree product) wave function and that geometry relaxation effects are not considered. By subtracting from eq the previous fragment energies, the corresponding electrostatic contribution to the formation energy of the complex is obtained, Among the surviving terms in eq , reveals itself as the ordinary interaction term between the two monomers A and B. It matches at long distances, while the other three terms would present a similar behavior of increasing in magnitude when shortening the intermolecular distances R and canceling out in the opposite situation. Thus, those three terms can be directly related with the interpenetration of molecular densities and grouped in the IQF-like electrostatic charge penetration energy This term fulfills (and so its three components), while . Figure represents the previous four terms between the partitioned and adding up to and compares them to the term between the total densities in each basin.

Figure 1

Graphical scheme of the four contributions giving rise to , where three of them (in dark blue) comprise the IQF electrostatic penetration energy and the remaining one (dark green) accounts for the interaction of and lying in the molecular basins Ω and Ω, respectively. The zeroth-order IQF pairwise term has been also included to highlight its difference with the previous , as it accounts for an interaction between total densities inside each basin (the original or and the tail from the other that has penetrated into another domain).

Computational Details

Molecular Geometries and Reference Interaction Energies

All the QM and classical electrostatic calculations were performed on the molecular geometries retrieved from the S66 database,[29] which contains a set of 66 complexes featuring the most common noncovalent interactions in biomolecules. These can be classified depending on the atoms involved into polar, nonpolar, and mixed. Analogously, the different complexes have been grouped into H-bond, dispersion, and mixed according to the main interactions they experience (see Table S1). For representing both the atomic interactions and the subsets of complexes, a color code has been utilized: magenta for H-bond/polar, yellow for mixed, and blue for dispersion/nonpolar. In addition to the S66 set, a selection of 12 representative complexes from the S66x8 database,[30] which is an extension of the former to eight different fractions of the equilibrium intermolecular distances, were also considered. The benchmark CCSD(T)/CBS interaction energies collected in S66 were employed as the reference values for comparative purposes.

HF-D3 Calculations

HF/cc-pVTZ calculations were carried out on the S66 and the S66x8 geometries using the GAMESS-US package.[51] Grimme’s D3 dispersion potential as implemented in the DFT-D3 code[52] was employed to incorporate the dispersion energy. Additionally, in order to correctly reproduce the asymptotic behavior of the dispersion energy at small distances, the Becke–Johnson damping function was chosen.[53] We selected HF because it lacks entirely dispersion energy and thereby yields a straight physical partitioning of energy in combination with the D3 potential. We also note in passing that HF-D3 has been shown to describe correctly and efficiently the structure and energetics of biomolecules[54] and that a variant of DFT-SAPT has been also developed in which the costly ab initio dispersion calculations are replaced by a reparametrized D3 potential.[55] In addition, the HF-D3/cc-pVTZ energies reproduce quite well the reference CCSD(T)/CBS energies of the S66 structures (see Figure S1).

IQA Energy Decomposition Analysis

The decomposition of the QM and the electrostatic energies derived from continuous charge densities were performed with the PROMOLDEN code.[56] The integration settings comprised β-spheres with radii of 60% of the distance between each nucleus and its closest critical point. Within them, Lebedev angular grids with 974 points were used, along with Euler–McLaurin radial quadratures with 382 radial points. A bipolar expansion of was selected with an l of 6. On the other hand, the outer part of the basins (i.e., outside the β-spheres) employed the same angular and radial quadratures, albeit increasing their respective points up to 5810 and 512, with a maximum radius of 15 au. In this case, was expanded by means of a Laplace expansion with l = 10.

Point-Charge and Multipolar Calculations

Atomic charges were computed for the separate monomers in the S66 structures by means of the restrained electrostatic potential (RESP) method following the General Amber Force Field (GAFF)[57] prescriptions with a HF/6-31G* level of theory. In the case of the atomic multipoles, two different sets were employed. On the one hand, AMOEBA multipoles were derived up to the quadrupoles (l = 2) following its corresponding parametrization protocol.[42,58] On the other, QTAIM multipoles were obtained by means of the PROMOLDEN program with an l = 2. Both the AMOEBA and the QTAIM multipolar energies were obtained with the MPOLINT code.[59] Additionally, a set of 12 S66x8 complexes was tested under the AMOEBA+ CP-corrected potentials.[24] For this, TINKER was used to calculate the respective CP energies as the difference between the CP-corrected multipoles and the multipolar energies previously derived. The parameters of the damping functions were directly taken from the literature.[24]

Graphs and Statistical Analyses

Octave[60] and GNUplot[61] were, in turn, used to perform the statistical analyses and the correlation plots, while Python’s Matplotlib[62] was chosen for the rest of the representations.

Results and Discussion

IQF-D3 Partitioning and Pairwise Approximation

The IQF-D3 decomposition of the HF/cc-pVTZ binding energies for the S66 complexes has been discussed at length in previous work.[32] Herein, we focus on the decomposition of the electrostatic descriptors into intra- and interfragment components. Interestingly, we found that the combination of the interfragment electrostatic interaction energy with the D3 dispersion potential yields pairwise energies that are quite well correlated with the S66 benchmark values, the coefficient of determination being R2 = 0.990 with RMS errors of 5.7 kcal mol–1 (see Figure and Table S2). Thus, the IQF descriptors in conjunction with the D3 potential capture the essential electrostatic and dispersion interactions that determine the relative stability of the noncovalent complexes. When addressing both terms independently (Figure S2), we find that the pure electrostatic term exhibits a satisfactory overall correlation (R2 = 0.943) due to the fundamental role of electrostatics in H-bond complexes. On the other hand, the D3 descriptor has a null global correlation with the S66 reference energies, although it is reasonable (R2 = 0.820) for the dispersion complexes as expected. However, the mixed complexes are not well-described by either the electrostatic or the dispersion energies separately, and their combination becomes critical.

Figure 2

Left: correlation between the dispersion-augmented IQF intermolecular electrostatic energy and the reference binding energies . Right: anticorrelation featured by the intrafragment electrostatic contribution to formation and the total kinetic plus exchange-correlation contributions ΔT + ΔE. The statistical analysis comprises the coefficient of determination R2, Spearman’s rank correlation coefficient ρ, and the root-mean-square error RMS. Data corresponding to the whole set of complexes is depicted in black and that ascribed to the H-bond group is in magenta, while mixed and dispersion complexes are in yellow and blue, respectively. All the energies are in kcal mol–1. In contrast to the ability of the descriptors to capture the main features of noncovalent binding, the combination of ΔE, which includes both the intra- and the intermolecular electrostatic effects, with the D3 potential deteriorates the global correlation (R2 = 0.888) and results in larger RMS errors (17.3 kcal mol–1). The full IQF decomposition (eq ) explains this unbalanced description because the intrafragment electrostatic energies, which contribute to the deformation energies, tend to cancel out with the QM energy terms (electronic kinetic energy and exchange-correlation) that are not required in the simple electrostatic + dispersion picture (see Figure right). Therefore, the pairwise terms arise as the most relevant IQF electrostatic descriptors of noncovalent binding.

Validating and Analyzing the Zeroth-Order Approximation

The electrostatic IQF terms can be readily evaluated under the zeroth-order approximation (i.e., ). Thus, it turns out that the interaction energies can be replaced effectively by their zeroth-order counterparts. Indeed, the pairwise energies have low RMS errors (3.1 kcal mol–1) and maintain a good correlation (R2 = 0.971) with respect to the benchmark data (Table S3). This behavior is also satisfactory within the S66 subsets: R2 = 0.989 and 0.988 for the polar H-bonded systems and the dispersion-dominated complexes, respectively, albeit the correlation is somewhat reduced in the case of the mixed complexes (R2 = 0.755). Further support for the use of the zeroth-order energies comes from the atomic level, where a high degree of coincidence between the diatomic zeroth-order and fully relaxed energies is also found at the equilibrium geometries (R2 = 0.995, see the SI). When addressing the distance dependence of the previous term (see Figure ), both and follow the same trends at varying intermolecular separations R (given as relative to the equilibrium distances R). As expected, they start diverging at short distances due to the strengthening of charge polarization, charge-penetration, and charge-transfer effects that attenuate the pairwise electrostatic forces. The magnitude of these effects is clearly system-dependent, as well as the shape and slope of the and curves, revealing thus further details about the role of electrostatics in these complexes. Thus, the electrostatic stabilization of the four H-bond complexes and others (e.g., the π-complex of the uracil dimer) is continuously reinforced upon shortening the monomer–monomer distance, reflecting the major electrostatic control of these systems. In contrast, the T-shaped benzene complexes with methanol or N-methylacetamide reach an electrostatic minimum at a distance longer than the equilibrium one while the small electrostatic energies of the dispersion dimers (i.e., +1, −1 kcal/mol) change very little along the curves (some small leaps are due to residual errors arising in the numerical integration over the atomic basins).

Figure 3

Intermolecular electrostatic interactions for a subset of the S66x8 complexes as provided by IQF (either exactly or under the zeroth-order approximation ), zeroth-order QTAIM multipoles , AMOEBA multipolar energies , and RESP atomic charges . Additionally, the zeroth-order electrostatic contribution to formation is also included. The complexes are colored and displayed in columns according to the group they belong to, namely, H-bond, mixed, or dispersion, respectively. The energies (Y-axis) are given in kcal mol–1, and the abscissas represent the intermolecular distances relative to the equilibrium ones (R/R). In Figure the deviation between the global energies and the interfragment anticipates the underlying CP effects associated with the density overlap. For the H-bond and some of the mixed complexes, the two curves decrease with lowering separation, but they split gradually for R/R < 1.6. The global stabilization nearly doubles at R, showing thus the large impact of intramolecular electrostatics as defined in the IQF framework. For the π-complexes (benzene–dimer, benzene–methanol, ...) or the weakly interacting neopentane dimer, the inter- vs intramolecular balance is differently modulated because the deviation between the global and the interfragment electrostatics becomes significant only at very close distances (e.g., R/R < 1.1), which are indicative of mutual overlap. In these systems, is thus reinforced by several kcal mol–1, which are ascribed to the intrafragment electrostatic stabilization achieved by the fragment-overlap (i.e., CP) effects. Such effects have a minor influence on the small energies (<1–2 kcal mol–1), which tend to remain nearly constant or become slightly attenuated. As shown below (Section ), the IQF analysis of the CP energy gives further insight about the behavior of and with R/R.

Comparison between and Pairwise MM Energies

The pairwise approximation that emerges from the IQF-D3 decomposition and the validity of the zeroth-order approximation for the electrostatic interactions provide an insightful theoretical support for the construction of noncovalent MM potentials. In this scenario, can be seen as the most suitable IQF descriptor to assess the approximate electrostatic potentials. Hence, we calculated the interfragment electrostatic energies using the RESP atomic charges and the AMOEBA multipoles, as well as the QTAIM multipoles up to the quadrupoles. According to the statistical data in Table , either the RESP atomic charges or the QTAIM/AMOEBA multipoles give interfragment electrostatic energies that correlate considerably well with (R2 > 0.9 and RMS errors ∼ 1 kcal mol–1) for the full S66 set and also for the H-bond/dispersion subsets. These point-charge/multipolar electrostatic energies are less satisfactory for the less abundant mixed complexes, although the multipolar potentials yield a more accurate description (R2 ≃ 0.6–0.8) than the RESP charges (R2 ≃ 0.5). In addition to and the fully relaxed and zeroth-order IQF pairwise terms, Figure also displays the distance dependence of the QTAIM/AMOEBA/RESP energies, that results quite close to that of the interfragment energies. Nevertheless, a closer inspection reveals that the QTAIM/AMOEBA/RESP energies tend to overestimate the stabilizing/destabilizing character of for the H-bond/dispersion dimers, respectively.

Table 1

Statistical Measurements Comprising the Coefficient of Determination R2, Spearman’s Rank Correlation Coefficient ρ, and the Root Mean Square Error RMS for the Correlation between and either the QTAIM or AMOEBA Multipoles (l = 2) or the RESP Point Charges (l = 0)

multipolar approximation	complex type	R²	ρ	RMS
QTAIM	global	0.970	0.958	1.0
	H-bond	0.956	0.904	1.4
	mixed	0.644	0.768	0.9
	dispersion	0.955	0.795	0.5

AOMEBA	global	0.953	0.972	1.3
	H-bond	0.904	0.841	2.0
	mixed	0.800	0.845	0.7
	dispersion	0.939	0.893	0.4

RESP	global	0.974	0.962	0.8
	H-bond	0.981	0.918	0.7
	mixed	0.456	0.687	1.1
	dispersion	0.948	0.831	0.3

The good agreement between the multipolar and the RESP energies in Table and in Figure suggests that the RESP fitting procedure may incorporate in an effective way higher order effects even at short distances. In addition, our results point out that the pure QTAIM multipoles can be employed in the construction of accurate electrostatic potentials, free from the inclusion of other effects that may be present when the DMA multipoles are fitted against the molecular ESP. In fact, the QTAIM multipoles, which are already considered in the FFLUX force field, readily reproduce the ESP without the need of any constraint.[63]

Comparing Diatomic Electrostatic Interactions

IQA permits an unambiguous decomposition of the continuous-density intermolecular interaction energy into a sum of atomic and diatomic terms that enables a thorough analysis of the global molecular properties based on their atomic origins, and a close comparison with the various MM descriptions at this atomic level. As expected, the IQA diatomic terms correlate almost perfectly with the QTAIM multipolar ones (see Figure ). On the contrary, the AMOEBA and RESP energies are significantly less correlated (R2 of 0.7 and 0.4, respectively) and have large RMS errors. For example, the largest discrepancies between and the QTAIM-multipolar in the acetic acid dimer (about 6 kcal mol–1) arise from the atoms involved in the OH·O H-bonds, the rest of pair interactions having much lower differences (<0.5 kcal mol–1; see Tables S7–S9). When comparing and (or ), the largest discrepancies amount to hundreds of kcal mol–1 and involve not only short polar contacts but methyl C atoms too (see Tables S10–S15).

Figure 4

Comparison of the and energies with the term (kcal mol–1). On the left are the correlation plots and, on the right, each difference as a function of the interatomic distance (Å).

Comparison of the and energies with the term (kcal mol–1). On the left are the correlation plots and, on the right, each difference as a function of the interatomic distance (Å). The dissimilarity between the energies and the / values was not entirely unexpected given that the RESP charges are derived from the molecular ESP and the AMOEBA multipoles are obtained by the DMA protocol. In fact, a difference of 1 order of magnitude between the atom–atom electrostatic interactions from IQA and MM potentials has also been noticed previously.[64] The present results show in further detail the actual discrepancies between the various atomic representations and suggest that, although the diverse atomic multipoles employed in classical potentials yield similar molecular electrostatic energies, the atomic decomposition is more questionable, which, in turn, can negatively affect the interpretation of localized electrostatic interactions and/or result in artifacts while dealing with QM and MM short-range electrostatics in hybrid QM/MM methodologies.

Charge Penetration under the QTAIM Scrutiny

Following the prescriptions introduced in Theory and Methods, the zeroth-order electrostatic formation energy of each S66 complex was decomposed by combining its real space partition into nonoverlapping atomic basins with the zeroth-order density approximation (). This strategy leads to the IQF-based charge penetration energies, , resulting from the sum of the intramolecular terms and , as those accounting for the interaction of both densities inside the same basin, and the intermolecular energy between the tails of each molecular density that penetrate into the opposite basin, as described in eq . This constitutes an effective penetration energy in the sense that the molecular identity between two overlapping fragments becomes necessarily blurred so that fragment properties are dependent upon the scheme followed to dissect the global charge density into its constituents. Nevertheless, the topological analysis of ρ0 yields a consistent identification of molecular fragments so that we believe that the associated charge-penetration analysis can give useful insight into the electrostatics of noncovalent complexes. The application of eq to results in the energy contributions shown in Figure . On the one hand, the interfragment energy is formally not affected by charge penetration and plays a stabilizing role in all the H-bond complexes (slightly repulsive in the dispersion complexes). On the other hand, the IQF penetration term turns out to be of equal importance in the H-bond complexes or even more relevant in the dispersion subset for which penetration energy describes the major part of .

Figure 5

Decomposition of into and the three IQF penetration terms , , and . Energies are given in kcal mol–1.

Decomposition of into and the three IQF penetration terms , , and . Energies are given in kcal mol–1. The decomposition of the penetration energy shows that it arises mainly from the stabilizing interactions between and inside the same basin. This is an intramolecular effect as reflected by the magnitude of the and energies. As shown by the integration of or in the corresponding basins, the mutual CP values range, for instance, from 0.035 e in the neopentane dimer to 0.099 e in the case of the acetic acid dimer. These fractional charges involve the e–e repulsion between the fragment electron densities occupying the same space, such as such that r ∈ Ω (or equivalently in region Ω), and the attraction experienced by the nuclei of one fragment (or ) and the fraction of electrons from the other that has penetrated into the former (or similarly ). In light of these results, e–n attraction greatly overcomes e–e repulsion between different zeroth-order densities inside the same basin and gives rise to the significant stabilizing energies observed. There is also a minor repulsive contribution owing to the purely electronic repulsion between the penetrating into Ω and the tail in Ω, which is measured by . Further insight can be gained by analyzing the distance dependence of the various energy terms as shown in Figure . The plots confirm that the three components of tend to zero when R/R > 1.5 and further highlight the role of the intrafragment terms. Interestingly, the energy, formally lacking penetration effects, is modulated by the degree of the interfragment overlap so that the decreasing trend in is damped out or inverted at the shortest distances. This is not entirely unexpected given that, as two initially separated atomic basins (e.g, Ω and Ω) approach one another, their volume, shape, and electron population evolve along the R/R curve in response to the density overlap. We note, however, that the deviation of with respect to the interfragment electrostatic energy may constitute a useful index about the specific impact of penetration effects on the pairwise electrostatics. At this point, an important caveat should be noted. Within the QTAIM framework, the energy includes a fraction of stabilizing penetration energy for R/R < 1.2 given that the loss of some electronic density from the basins of the monomer A is partially compensated by the penetration of into the same basin. The fixed multipoles/charges in the classical potentials somehow mimic this behavior so that they remain closer to the descriptors than to around the equilibrium distance.

Figure 6

Evolution of the energy terms from eq , along with the pair term as a function of the distance for the set of S66x8 systems chosen. The complexes are grouped in three columns as belonging to the H-bond, mixed, or dispersion subsets, respectively. Finally, Figure compares the IQF penetration term and other relevant energetic quantities with the analogue term derived from the AMOEBA+ model as a function of the intermolecular distance. Thus, the combination of the multipolar energies with the CP correction[24] results in the energies that approach to the reference , which is equivalent to the SAPT electrostatic energy. In effect, Figure shows that nearly matches . Concerning the CP energies, it is important to note again that the AMOEBA+ reference for measuring the CP energy is different from that provided by the IQF-QTAIM approach. Nevertheless, the two penetration energies exhibit a similar behavior with R, particularly for the more stable H-bond complexes, which resemble also the variations experienced by the intramolecular CP terms, and . Therefore, we conclude that the AMOEBA+ CP and similar corrections account mainly for intramolecular electrostatics.

Figure 7

Comparison between the AMOEBA+ model and the zeroth-order IQF energies for our model S66x8 complexes. The complexes have been displayed according to the group they belong to (either H-bond, mixed, or dispersion). Distances (X-axis) are relative to the equilibrium ones (R/R) and energies (Y-axis) are in kcal mol–1.

Concluding Remarks

In this work we have analyzed the short-range electrostatic interactions in the S66 and S66x8 data sets through a hierarchy of approximations at both the molecular and the atomic levels. We have shown first that the IQA/IQF decomposition augmented with the D3 dispersion terms gives support to the pairwise approach adopted by many MM potentials. In this respect, the interfragment energies derived from the IQF partitioning suffice to capture the essential electrostatic effects explaining the binding of the weakly interacting complexes. Moreover, the same role can be played by the equivalent values, which are obtained from the unrelaxed densities of the isolated monomers (i.e., the zeroth-order approximation). According to our results, the intermolecular energy turns out to be the most appropriate IQF descriptor to analyze and/or compare with electrostatic MM potentials. In particular, we have considered two widely used potentials relying on the RESP atomic charges or the AMOEBA distributed atomic multipoles, respectively, as well as the multipolar potential up to the quadrupoles derived directly from the QTAIM basins. The three MM pairwise approximations correlate satisfactorily with the zeroth-order IQF term at varying intermolecular distances and exhibit small RMS errors. However, when the values are further decomposed into diatomic contributions, large discrepancies between the RESP or the AMOEBA atom–atom interactions and their zeroth-order IQA counterparts are unveiled. Although this is understandable in terms of the specific details of the RESP/AMOEBA charge/multipole derivations, it contrasts sharply with the nearly perfect match between the QTAIM atomic multipolar energies and the IQA reference values. Hence, MM potentials based on the QTAIM multipoles—such as the QCT-based FFLUX—may provide a more consistent description of electrostatic interactions at both the molecular and the atomic levels. Besides forging links between the IQF/IQA quantities and the MM electrostatic potentials, we have studied the charge penetration effects that arise from the mutual interpenetration of the zeroth-order molecular densities in their opposite QTAIM basins as built from the final ρ0 of the complex. This QTAIM perspective allows us to dissect the CP energy into different contributions that emphasize its intramolecular character, which, in turn, is dominated by the attraction between the nuclei of fragment A (B) and the penetrating tail of density B (A). In this way we may clarify some practical issues related with the CP corrections for MM potentials. For example, adding CP corrections to MM potentials like RESP/AMOEBA, which target the zeroth-order interfragment electrostatic energy, results, necessarily, in an unbalanced description. This aspect, which has been overlooked in previous works,[10,48,65] implies also that the electrostatic energy employed in popular MM force fields (AMBER, CHARMM, ...) cannot be compared with the global energy derived from continuous charge distributions, but with its interfragment component. On the other hand, CP corrections have been derived to improve the description of the QM–MM electrostatic interactions in hybrid QM/MM methodologies.[12] In this case, such corrections should mitigate short-range electrostatic artifacts, particularly those associated with the QM–MM covalent linkages. However, considering the highly dissimilar interatomic electrostatic energies produced by the QM densities and the RESP/AMOEBA potentials, the usage of electrostatic parameters more akin to the QM densities at the atomic level may have a larger impact in improving the QM–MM electrostatics. Finally, concerning the novel MM potentials inspired by the QM SAPT methodology, it is clear that the multipolar electrostatics (interfragment) must be augmented by the CP potentials (intrafragment) if one seeks to reproduce the global electrostatics . Nevertheless, the IQF/IQA approach (and other EDAs) points out that the intramolecular electrostatic energy is closely related with other energy changes induced by fragment overlap (e.g., deformation and interfragment exchange-correlation energy), suggesting thus that the separate treatment of these effects by means of independent potential terms might be inefficient and hamper parameter development and transferability.

34 in total

1. Accurate Intermolecular Potentials Obtained from Molecular Wave Functions: Bridging the Gap between Quantum Chemistry and Molecular Simulations.

Authors: O Engkvist; P O Astrand; G Karlström
Journal: Chem Rev Date: 2000-11-08 Impact factor: 60.622

2. AMOEBA+ Classical Potential for Modeling Molecular Interactions.

Authors: Chengwen Liu; Jean-Philip Piquemal; Pengyu Ren
Journal: J Chem Theory Comput Date: 2019-06-11 Impact factor: 6.006

3. A density-functional model of the dispersion interaction.

Authors: Axel D Becke; Erin R Johnson
Journal: J Chem Phys Date: 2005-10-15 Impact factor: 3.488

4. Recommending Hartree-Fock theory with London-dispersion and basis-set-superposition corrections for the optimization or quantum refinement of protein structures.

Authors: Lars Goerigk; Charles A Collyer; Jeffrey R Reimers
Journal: J Phys Chem B Date: 2014-12-03 Impact factor: 2.991

5. Multipolar electrostatics for proteins: atom-atom electrostatic energies in crambin.

Authors: Yongna Yuan; Matthew J L Mills; Paul L A Popelier
Journal: J Comput Chem Date: 2013-10-29 Impact factor: 3.376

6. Interplay of point multipole moments and charge penetration for intermolecular electrostatic interaction energies from the University at Buffalo pseudoatom databank model of electron density.

Authors: Sławomir A Bojarowski; Prashant Kumar; Paulina M Dominiak
Journal: Acta Crystallogr B Struct Sci Cryst Eng Mater Date: 2017-07-28

7. Assessing Ion-Water Interactions in the AMOEBA Force Field Using Energy Decomposition Analysis of Electronic Structure Calculations.

Authors: Yuezhi Mao; Omar Demerdash; Martin Head-Gordon; Teresa Head-Gordon
Journal: J Chem Theory Comput Date: 2016-10-18 Impact factor: 6.006

8. Polarizable Atomic Multipole-based Molecular Mechanics for Organic Molecules.

Authors: Pengyu Ren; Chuanjie Wu; Jay W Ponder
Journal: J Chem Theory Comput Date: 2011-10-11 Impact factor: 6.006

9. S66: A Well-balanced Database of Benchmark Interaction Energies Relevant to Biomolecular Structures.

Authors: Jan Rezáč; Kevin E Riley; Pavel Hobza
Journal: J Chem Theory Comput Date: 2011-07-01 Impact factor: 6.006

10. A Comparative Study of Transferable Aspherical Pseudoatom Databank and Classical Force Fields for Predicting Electrostatic Interactions in Molecular Dimers.

Authors: Prashant Kumar; Sławomir A Bojarowski; Katarzyna N Jarzembska; Sławomir Domagała; Kenno Vanommeslaeghe; Alexander D Mackerell; Paulina M Dominiak
Journal: J Chem Theory Comput Date: 2014-02-21 Impact factor: 6.006

1 in total

1. QM/MM Energy Decomposition Using the Interacting Quantum Atoms Approach.

Authors: Roberto López; Natalia Díaz; Evelio Francisco; Angel Martín-Pendás; Dimas Suárez
Journal: J Chem Inf Model Date: 2022-02-25 Impact factor: 4.956

1 in total