Literature DB >> 27442443

Enthalpic and Entropic Contributions to Hydrophobicity.

Michael Schauperl¹, Maren Podewitz¹, Birgit J Waldner¹, Klaus R Liedl¹.

Abstract

Hydrophobic hydration plays a key role in a vast variety of biological processes, ranging from the formation of cells to protein folding and ligand binding. Hydrophobicity scales simplify the complex process of hydration by assigning a value describing the averaged hydrophobic character to each amino acid. Previously published scales were not able to calculate the enthalpic and entropic contributions to the hydrophobicity directly. We present a new method, based on Molecular Dynamics simulations and Grid Inhomogeneous Solvation Theory, that calculates hydrophobicity from enthalpic and entropic contributions. Instead of deriving these quantities from the temperature dependence of the free energy of hydration or as residual of the free energy and the enthalpy, we directly obtain these values from the phase space occupied by water molecules. Additionally, our method is able to identify regions with specific enthalpic and entropic properties, allowing to identify so-called "unhappy water" molecules, which are characterized by weak enthalpic interactions and unfavorable entropic constraints.

Entities: Chemical Disease Gene

Mesh：

Substances：
Amino Acids
Water

Year: 2016 PMID： 27442443 PMCID： PMC5024328 DOI： 10.1021/acs.jctc.6b00422

Source DB: PubMed Journal: J Chem Theory Comput ISSN： 1549-9618 Impact factor: 6.006

Introduction

Hydrophobic interactions are among the most important driving forces in nature, e.g. responsible for the separation of water and oil, the function of detergents, distribution of minerals in the earth’s crust, and many more.[1,2] In biology, they determine the structure of proteins and cells as well as the self-assembly of membranes.[3,4] Furthermore, the hydrophobic effect plays a key role in ligand binding processes and needs to be accounted for in drug design.[5,6] In all these systems, apolar groups tend to cluster together in a polar liquid, like water, to minimize the surface between groups of different polarity.[7] Investigation of the hydrophobic character of compounds is a complex research field because size, shape, and positions of chemical groups have to be taken into account for a detailed understanding.[8,9] A common way to simplify the solvation problem for biomolecules is the use of hydrophobicity scales. These scales assign a value to each amino acid, which describes the relative averaged hydrophobic character in comparison to the other amino acids. A vast variety of different scales are discussed in the literature, which are based on various theoretical and experimental methods.[10−13] Hydrophobicity scales have been widely used for the prediction of protein secondary structures, membrane regions, antigenic sites, and interior-exterior regions.[10,14] However, they have two major drawbacks: first, they describe the averaged hydrophobic character over the whole amino acid and a spatial resolution is not available. The hydrophobicity of a binding pocket may not be uniform, and, therefore, the identification of regions which are especially hydrophobic or hydrophilic is of interest. The displacement of water molecules from these areas into the bulk can lead to significant changes in the free energy of binding. Second, none of the reported methods can directly measure entropic contributions to the hydration process. Methods based on the distribution coefficient – or the free energy of transfer – between two different phases cannot evaluate the entropic contribution directly but only estimate it from the temperature dependence of the free energy or via the difference of the free energy and the enthalpy.[15,16] Thus, it is not possible to draw a conclusion about the water dynamics, and a detailed understanding of the entropic effects cannot be reached. This is one of the reasons why – compared to the structural and enthalpic effects of hydration – little is known about the water dynamics and entropic effects around a solute.[17] Although many different models are published,[18−23] trying to explain the entropic contributions to hydration, especially the extent of the deceleration and entropy gain in the proximity of a solute, is still discussed controversially. Dating back to 1945, Frank and Evans[18] were the first to address this issue when they proposed their visionary “Iceberg” model. They suggested that a water cage is formed around a solute in a similar way as it is found in ice or clathrates. This is achieved by orientating the hydrogen bonds tangentially to the solute surface, resulting in a slower reorientation and translation than in bulk.[19,24] These restrictions reduce the phase space, which water molecules can occupy in the hydration shells of the solute, resulting in a decrease of entropy.[9,17] Due to the strong enthalpic interactions between the solvent and the solute the movement of water molecules around hydrophilic groups is even more restricted. This effect is also known as entropy-enthalpy-compensation and has its origin in the strong electrostatic interactions of polar groups and a polar solvent.[25] The increased ordering around solvated molecules can be understood as an entropy penalty for the enthalpic interaction. Although the kinetics of water molecules are slowed down comparable to water in the ice phase, it still remains in a liquidlike structure.[26−28] Galamba proposed that these water molecules behave similarly to water molecules at lower temperatures.[29] Water dynamics in the solvation process were also analyzed by a variety of experimental techniques and simulations, leading to different interpretations of the results.[30−69] The entropic behavior of water molecules in response to different nonpolar, polar, and charged solutes is still elusive, making it one of the remarkable varieties of water properties, which are not completely understood. Therefore, computer simulations can help to obtain a detailed molecular description of solvation and explain the different thermodynamic contributions to this process. An atomistic understanding is crucial to comprehend the process of solvation in detail. Different theoretical strategies have been reported to describe the effect of hydrophobic interactions, especially in biomolecules. Most of the earlier studies are based on simplified solvent models, described by linear response theory, causing errors in calculations of dipolar solvents.[35] More recent reports investigated the size effect of a spherical symmetric solute, the influence of chain lengths on hydrophobicity, and the temperature behavior of hydration properties.[36,37] Huggins and Payne reported the hydration free energies for six small molecules calculated with inhomogeneous solvation theory (IST) and thermodynamic integration (TI). Their study revealed a good agreement between the two methods, but the entropy term (−TΔS) in IST calculations is generally overestimated.[38] Furthermore, the role of the hydrophobic effect for the behavior of proteins and amino acids has been explored by investigating thermodynamic properties around a set of noncharged amino acids side chain analogues using thermodynamic integration and hydrogen bond analysis.[39−41] However, further work showed that the backbone of the amino acid side chain has a significant influence on the thermodynamic properties of the side chain solvation. Therefore, hydrophobic properties of side chain analogues may differ from those of amino acids, including the backbone atoms.[15,42,43] In this study, the hydrophobicity of the 20 canonical amino acids is investigated. Amino acid models include the backbone, which has shown to be essential to obtain reliable results. Molecular Dynamics simulations and grid inhomogeneous solvation theory (GIST) are used to simulate and analyze the solvation of amino acids at an atomistic scale. Based on the interaction free energy of amino acids with water molecules we propose a method to calculate hydrophobicity, which allows a direct distinction between enthalpic and entropic contributions. Entropic contributions are obtained directly from the phase space, which water can occupy. We also investigate the spatial resolution of the solvation process, which our method can provide. Dissection of spatially resolved entropic and enthalpic contributions provides additional insight into the entropic behavior of water molecules around a solute.

Computational Methods

Grid Inhomogeneous Solvation Theory

In this section a very brief introduction to the concept of Grid Inhomogeneous Solvation Theory (GIST) developed by Gilson and co-workers is provided. For a more detailed description of GIST we refer to their papers.[44,45] Most general, the free energy of solvation for a molecule ΔGSolv can be written as It is defined as the integral over the free energy of the molecule ΔGSolv(q) with the solute being constrained to configuration q times the probability p(q) to find the solute in state q. For GIST calculations, the molecule is restricted to one single conformation. The impact of other conformations can be accounted for by using multiple GIST calculations on all relevant conformations. In the following, for the sake of simplicity, it is assumed that only one relevant conformation exists. The theory can simply be extended to multiple configurations using eq . The free energy of solvation ΔGSolv can be written as the difference between the solvation enthalpy ΔESolv and the solvation entropy ΔSSolv multiplied by the temperature T: The underlying idea behind inhomogeneous solvation theory is to transform integrals over molecular coordinates to integrals over distribution functions (one-body, two-body, three-body, correlation functions etc.). Based on these functions it is possible to approximate thermodynamic quantities. GIST uses a grid to discretize the analytical expressions usually used in inhomogeneous solvation theory. Thermodynamic quantities are calculated as discrete values from the stored trajectory frames at every grid point. The solvation enthalpy ΔESolv consists of two terms as shown in eq , the interaction between the solvent and the solute, from now on denoted as solute-water interaction ΔESW, and the changes in the enthalpic interaction between the solvent molecules due to the presence of the solute in contrast to the unperturbed water, summarized in ΔEWW. Both terms are calculated for each voxel of the grid. The solute-water interaction is the interaction of all water molecules in the voxel with the solute molecule, averaged over the complete simulation time. The energy calculation is based on the used simulation force field. In our case this is the AMBER force field[46] and an TIPXP solvent model (X = 3,4,5). Similarly, the water–water enthalpy captures the interaction of all water molecules in the voxel with all other water molecules in the simulated system, again averaged over the whole simulation time. The solvation entropy ΔSSolv is approximated by two computational affordable and intuitive terms: the translational ΔStrans and the orientational part ΔSorient of the entropy. The water–water correlations are neglected, hence, the integrals depend only on the coordinates of one water molecule. The contributions of higher order terms are considered to be small compared to the first order term and slow down the convergence so that longer simulation times would be required.[47] The distribution of the water molecules can be written as a function of the position of the oxygen atom of the water molecule in three-dimensional space r and of the orientation of the water molecule ω. Therefore, the water distribution gSW(r,ω) is dependent on two variables. This function can be rewritten as a function gSW(r) only depending on the position r of the water molecule and an orientational part depending solely on ω, with a fixed r value as condition gSW(ω|r), shown in eq . This spliced function can be used to split the approximated entropy into an orientational and a translational term. The translational part depends only on the position of the water molecule in space and can be calculated according to eq where ρ is the number density of the water model. A uniform distribution with the same density of water molecules in every voxel (gSW(r) = 1 for all voxels) is characteristic for a low ordered system with high entropy. If the system shows differences in the density the entropy gets lower (more ordered water structure, gSW(r) ≠ 1). Pure water without any solute has a uniform distribution and serves as reference state. For this reason, all results can be compared to this state. Presented thermodynamic values denote the difference between the observed state and the unperturbed water reference. The orientational part of the entropy is described by eqs and 8 and describes whether the water molecules in a voxel are found in a similar orientation ωwhere gSW(r) is the translational part of the water density, K is a normalization factor, and gSW(ω|r) is the part of the water density distribution gSW(r,ω) depending only on the orientation ω of the water molecule. It is assumed that the orientation of the water is independent of the water position in the voxel. Again, a uniform distribution accounts for low ordering and a high entropy value. In unperturbed/pure water, the individual molecules do not have a preferred orientation but show random orientation. To summarize, ΔGSolv gives a value for the change in the free energy when the molecule is transferred from the vacuum with no interaction partner to the liquid phase. The free energy consists of the solute-water enthalpy and the water–water enthalpy, as well as the translational and the orientational entropy of the water molecules. In contrast to previously published theoretical studies, the described method calculates the entropy directly from the phase space that water molecules occupy. Other methods like TI usually calculate the entropy as the difference between free energy and enthalpy or use multiple simulations at different temperatures to calculate the entropy due to the temperature dependence of the free energy.

Simulation Details

All 20 canonical amino acids were capped with an N-terminal acetyl and a C-terminal amide group (ACE-X-NME) and solvated in an octahedral box using the AmberTools package.[46] The minimum distance between the box edge and the protein was set to 12 Å, resulting in a system with approximately 2000 water molecules. As the water model is known to heavily influence the obtained thermodynamic properties, several water models were used in this study to investigate their impact on hydrophobicity.[48] We used the three most common water models in the AMBER package TIP3P,[49] TIP4P,[49] and TIP5P.[50] Arginine and lysine were simulated with a positive charge; glutamate and aspartate were assigned a negative charge. Histidine was simulated as positive and neutral species. For the neutral species we took the average of the two possible states. The backbone was constrained in the same conformation for all amino acids to eliminate the effects of backbone conformations. We choose the most abundant backbone conformation of an unrestrained alanine simulation as our reference geometry. The side chains were set up randomly and allowed to move during the equilibration phase and were restrained during the production run as it is required for the GIST analysis. Equilibration of the systems was performed according to the established protocol of Wallnöfer et al.[51] After equilibration every capped amino acid was simulated in a NPT ensemble for 200 ns. Langevin thermostat was used to keep the temperature at 300 K. The pressure of 1 bar was kept by using an isotropic implementation of the Berendsen barostat. The time step was set to 2 fs, and coordinates were saved every 10 ps. Simulation time and duration between two snapshots were chosen after a convergence analysis of the thermodynamic properties obtained with GIST. Even though for alanine a 50 ns simulation with 5000 snapshots is sufficient for the GIST analysis, to ensure the convergence for all amino acids the simulation time was extended to 200 ns with 10 000 simulation snapshots. These values are also far beyond the limits suggested in the Amber14 manual.[46] The grid spacing was set to 0.5 Å. All grid points within 4 Å of the solute were used to determine the thermodynamic properties of the hydrating water molecules (density-weighted integration). A higher cutoff increases the signal-to-noise ratio and is therefore not practicable. To increase the accuracy of the results and estimate the error correctly the calculations were repeated three times with different initial coordinates for the side chains. The backbone was always restrained to the same conformation. The standard deviation of three runs and the mean were calculated.

Results

Molecular Dynamics simulations of amino acids surrounded by water molecules were performed to calculate thermodynamic properties of the solvation process of these amino acids on a grid around the solute. The results for the free energy, the total enthalpy, and the overall entropy are reported in Table for the three different water models TIP3P, TIP4P, and TIP5P. In the Supporting Information, tables including further decomposition of the enthalpy in solute-water and water–water enthalpy are provided. Additionally, the values for the orientational and translational part of the entropy are summarized for all amino acids and water models.

Table 1

Thermodynamic Quantities: Free Energy, Enthalpy, and Entropy for All 20 Amino Acids Calculated with the Amber Force Field and Three Water Modelsa

	water model
	TIP3P			TIP4P			TIP5P
amino acid	ΔH	–TΔS	ΔG	ΔH	–TΔS	ΔG	ΔH	–TΔS	ΔG
ASP(−)	–80.5 ± 0.3	26.4 ± 0.0	–54.2 ± 0.3	–82.8 ± 4.2	29.2 ± 0.1	–53.5 ± 4.1	–67.0 ± 2.8	25.3 ± 0.1	–41.7 ± 2.7
GLU(−)	–78.3 ± 1.9	27.7 ± 0.2	–50.6 ± 1.7	–85.9 ± 1.3	30.6 ± 0.1	–55.2 ± 1.1	–68.6 ± 1.6	26.5 ± 0.1	–42.1 ± 1.6
LYS(+)	–66.1 ± 2.5	22.8 ± 0.7	–43.4 ± 1.9	–63.0 ± 0.9	23.5 ± 0.2	–39.5 ± 0.7	–69.0 ± 0.6	25.6 ± 0.4	–43.3 ± 0.3
ARG(+)	–64.7 ± 2.1	24.1 ± 0.5	–40.7 ± 1.6	–62.8 ± 0.5	24.5 ± 0.3	–38.3 ± 0.4	–72.1 ± 1.9	29.4 ± 0.6	–42.8 ± 1.5
HIS(+)	–57.7 ± 1.6	21.5 ± 0.3	–36.2 ± 1.3	–56.0 ± 0.7	22.5 ± 0.2	–33.5 ± 0.7	–60.2 ± 2.8	24.3 ± 0.5	–35.9 ± 2.3
GLN	–35.8 ± 1.1	20.6 ± 0.4	–15.2 ± 0.7	–34.7 ± 0.8	21.7 ± 0.1	–12.9 ± 0.8	–35.9 ± 0.5	23.2 ± 0.2	–12.7 ± 0.6
HIS	–36.0 ± 1.0	21.1 ± 0.2	–14.9 ± 0.8	–34.5 ± 1.2	22.2 ± 0.4	–12.3 ± 0.8	–32.3 ± 2.1	22.4 ± 0.8	–9.9 ± 1.3
ASN	–33.8 ± 4.0	19.1 ± 0.9	–14.7 ± 3.2	–33.5 ± 3.1	20.4 ± 0.7	–13.1 ± 2.5	–37.1 ± 0.9	22.6 ± 0.3	–14.5 ± 1.0
SER	–28.0 ± 1.2	15.6 ± 0.1	–12.4 ± 1.1	–29.3 ± 2.7	16.7 ± 0.8	–12.7 ± 2.0	–30.0 ± 2.8	17.1 ± 1.2	–12.9 ± 1.5
TYR	–32.4 ± 2.1	20.2 ± 0.3	–12.2 ± 1.9	–31.3 ± 0.8	21.9 ± 0.6	–9.4 ± 0.3	–32.7 ± 1.7	22.0 ± 0.5	–10.7 ± 1.1
TRP	–33.8 ± 0.5	22.3 ± 0.2	–11.5 ± 0.4	–32.3 ± 0.4	23.9 ± 0.2	–8.4 ± 0.3	–29.1 ± 2.5	23.0 ± 0.5	–6.0 ± 2.0
THR	–27.1 ± 1.1	16.7 ± 0.4	–10.3 ± 0.9	–27.1 ± 0.2	18.0 ± 0.2	–9.1 ± 0.2	–27.6 ± 0.4	18.3 ± 0.0	–9.4 ± 0.4
MET	–27.6 ± 0.6	18.1 ± 0.1	–9.5 ± 0.5	–25.7 ± 1.7	19.5 ± 0.3	–6.2 ± 1.4	–23.9 ± 0.7	18.9 ± 0.2	–5.0 ± 0.6
PHE	–29.1 ± 0.1	19.9 ± 0.2	–9.2 ± 0.1	–25.2 ± 2.2	20.9 ± 0.4	–4.3 ± 1.8	–26.6 ± 0.7	21.0 ± 0.5	–5.6 ± 0.2
CYS	–23.9 ± 2.0	15.3 ± 0.4	–8.5 ± 1.7	–22.3 ± 1.0	16.3 ± 0.3	–6.0 ± 0.8	–24.2 ± 1.9	16.1 ± 0.1	–8.1 ± 1.8
GLY	–22.4 ± 0.4	14.0 ± 0.1	–8.4 ± 0.4	–21.6 ± 0.2	14.8 ± 0.1	–6.8 ± 0.2	–20.5 ± 0.0	14.6 ± 0.1	–5.9 ± 0.0
ALA	–22.5 ± 0.8	14.8 ± 0.1	–7.7 ± 0.7	–22.0 ± 0.5	15.8 ± 0.2	–6.3 ± 0.4	–21.3 ± 0.7	15.8 ± 0.2	–5.5 ± 0.5
ILE	–25.4 ± 0.5	18.1 ± 0.1	–7.3 ± 0.6	–25.4 ± 0.2	19.7 ± 0.1	–5.7 ± 0.2	–24.4 ± 0.5	19.7 ± 0.4	–4.7 ± 0.4
VAL	–24.5 ± 0.8	17.2 ± 0.3	–7.3 ± 0.5	–24.2 ± 0.1	18.4 ± 0.1	–5.8 ± 0.1	–22.4 ± 0.3	17.9 ± 0.3	–3.0 ± 2.2
LEU	–24.9 ± 0.7	17.8 ± 0.3	–7.1 ± 0.5	–24.9 ± 0.1	19.4 ± 0.2	–5.5 ± 0.1	–22.9 ± 0.3	18.7 ± 0.2	–4.2 ± 0.1
PRO	–22.9 ± 0.8	16.0 ± 0.2	–6.9 ± 0.6	–21.6 ± 0.7	16.8 ± 0.2	–4.8 ± 0.5	–19.6 ± 0.3	16.1 ± 0.1	–3.5 ± 0.3

Values are given in kcal/mol.

Values are given in kcal/mol. The amino acids in Table are ordered according to the calculated value for the free energy using the TIP3P water model. Table indicates the gain in free energy when the molecule is transferred from the vacuum to the liquid phase. The free energy directly correlates with the hydrophilicity/hydrophobicity of the given molecule. Vacuum is from a theoretical point of view an ideal reference, because no interactions with other molecules have to be considered. Nevertheless, it should be mentioned that in practice other references, for example, apolar solvents, are much more relevant, and this will be discussed in more detail in the Discussion section. Table shows that the free energy and the enthalpy values are three to five times larger for charged residues than for noncharged residues. These high values can be explained as GIST is measuring the solvation free energy, enthalpy, and entropy of the compound with water. The high point charges of charged residues lead to high enthalpic interactions with the solvent, inducing also more negative (low) entropy due to entropy-enthalpy compensation. The reported values for charged residues cannot be directly compared to the values for other amino acids because these compounds are not charged in vacuum or an apolar solvent. To compare these values with experimental measured solvation energies, the formation of the ion pair, the gas phase basicity or acidity, and the solvation of the additional counterion have to be included.[52] The free energies for the formation of the ion pairs are high (>300 kcal/mol) in comparison to the values for the solvation.[53] As the consideration of these high values would also lead to high errors in the calculated values, this analysis is omitted. Therefore, it should be emphasized that the presented values state how large the influence/interaction of the molecules is on/with the surrounding water. This is also an interesting property for charged residues, although it is not a direct measurement of hydrophobicity for these amino acids. For uncharged residues, the calculated values can be compared to the energy gain obtained for transferring a molecule from the gas phase into water – assuming that the interaction energy in the gas phase is neglectable compared to the interaction energy in the liquid phase. For charged residues this assumption is not valid, due to the ion pair formation in the gas phase. Charged residues are therefore analyzed separately. For the positively charged residues arginine, histidine, and lysine all three water models (TIP3P, TIP4P, and TIP5P) yield very similar results. In contrast, substantial differences of around 10 kcal between the enthalpic interaction for the negative charged residues glutamate and aspartate are observed. For these residues a significantly higher enthalpy and free energy value is found for the TIP5P water model in comparison to TIP3P and TIP4P. To investigate the differences of these water models for the uncharged amino acids, the values for two different water models are plotted against each other. The free energy, the enthalpy, and the entropy of all noncharged residues (including histidine) are shown in Figure .

Figure 1

Correlation of the free energy (upper row), enthalpy (middle), and entropy (bottom) for the three water models (TIP3P, TIP4P, TIP5P) are depicted. Values plotted for uncharged amino acids only and labeled with one letter amino acid codes. For the noncharged side chains the free energy is lower (more negative) for the TIP3P water model compared to the TIP4P and TIP5P water models. The TIP4P and TIP5P models show very similar results, and no obvious trend can be obtained from the simulation data. From the splitting of the free energy in enthalpy and entropic contributions we see that the enthalpy is very similar for all three water models. Differences between TIP3P and TIP4P/TIP5P stem from the entropy: TIP3P shows smaller (less positive) values for the entropy term (−TΔS) of the surrounding water molecules. A lower value in the term −TΔS, which is shown in Figure , corresponds to a higher (less negative) entropy value ΔS. Thus, TIP3P shows a lower ordering compared to TIP4P and TIP5P, which are similar to each other. From our point of view it is quite surprising that the choice of the water model has only minor influence on the results for almost all amino acids.

Discussion

The results of Figure demonstrate that the solvation entropy for TIP3P is less negative than for the other investigated water models, and so water molecules show a lower ordering in comparison to TIP4P and TIP5P. A possible explanation for this behavior is that the electrostatics is modeled differently in the TIP3P water model.[49,50] The higher charges of TIP4P compared to TIP3P lead to a decrease in entropy (more negative values for ΔS, more positive values for the term −TΔS) and to an increase of order in the hydration layers, respectively. This is also evident for the entropy values obtained for the charged amino acids in Table . In TIP5P, the negative charge is distributed over two interaction sites, which mimic the lone pairs of the oxygen atom. Therefore, positive charges will always be found next to these lone pairs and will not be distributed around the negatively charged “hemisphere” of the water molecule, when the Coulomb and van der Waals interaction centers are identical. This directional interaction is responsible for the more negative entropy. As the reference state for all water models is the uniform distribution, more negative entropy values lead to a more pronounced change in the solvation entropy. Therefore, the solvation entropy is more negative, and we find higher ordering of the water molecules. Due to the high relevance for various research fields a broad spectra of hydrophobicity scales is known in the literature.[54−58] To evaluate the performance of our hydrophobicity scale, we compare it to a set of well-established hydrophobicity scales. One of the most prominent hydrophobicity scales is published by Kyte and Doolittle.[54] It is based on an amalgam of experimental measurements and is ideally suited to benchmark our method against experimental data (see Figure ). In Kyte and Doolittle’s scale the apolar reference phase is a hypothetical neutral, isotropic, noninteracting solvent. Therefore, they used information on buried side chains in proteins and water-vapor transfer free energies as their reference solvent does not exist.

Figure 2

Free energy of solvation obtained with GIST compared to the hydrophobicity values from Kyte and Doolittle. Equations for the linear regression are given in the top left corner.

Free energy of solvation obtained with GIST compared to the hydrophobicity values from Kyte and Doolittle. Equations for the linear regression are given in the top left corner. Figure shows the free energy calculated with the presented method (y-axis) plotted against the values reported by Kyte and Doolittle (x-axis). The correlation between the two different scales is indicated by linear regression (R2-values > 0.50). Absolute values cannot be compared as the hydration of the backbone and the capping groups are also included in our scale. As the backbone and capping groups are always constraint to the same conformation the contribution to the thermodynamic quantities is nearly constant for all amino acids. This constant contribution to the free energy of hydration is responsible for the relative high value of the intercept. The results for glycine (G) and proline (P) differ from the experimental values. Most likely this arises because the distinction between side chain and backbone cannot be made as easily as for all other amino acids. When we exclude these two amino acids from the linear regression, we obtain R2-values between 0.74 and 0.92. Again, our approach describes the transfer of a molecule from the vacuum to the aqueous phase. Although the distribution coefficients of molecules between the aqueous and the gas phase are an interesting molecular property, most of the frequently used scales describe the transfer of a molecule between an apolar environment, like octanol or a membrane, and a polar solvent like water. Despite the fact that our scale neglects the interactions in the apolar solvent, the results are in a very good agreement with those of Kyte and Doolittle for nearly all amino acids. This finding is little surprising when looking at Kyte and Doolittle’s reference phase in more detail: This is an apolar, neutral, noninteracting solvent and only show van der Waals interactions. When an apolar molecule is solvated by this ideal apolar solvent, the surface area between apolar solvent molecules is reduced by the same amount as the surface area between the solute and the solvent molecules grows. As a first approximation we can assume that van der Waals interactions are similar between solute and solvent and between solvent molecules in an ideal apolar solvent. Therefore, the free energy of solvation in an ideal apolar solvent is negligible. Corrections of the obtained results with a term proportional to the solvent accessible surface area to account for the dispersion energy were found to be very small (<0.2 kcal/mol). As they could not significantly improve the results, we conclude that our assumption is valid: the free energy in an ideal apolar solvent is small compared to the free energy of solvation in a polar solvent. From the free energy values it is not possible to identify which water model performs best. TIP3P has the highest correlation with experimental data, whereas the correct slope of nearly 1.0 is found for TIP5P. Besides the hydrophobicity scale of Kyte and Doolittle a number of other scales exist. These scales are based on different experimental and bioinformatic measurements. Despite the variety of hydrophobicity scales, our proposed scale is the only one that focuses on the interactions between amino acids and their surrounding water molecules. For the sake of comparison Table shows our hydrophobicity scale rank ordered for all 20 canonical amino acids and a selection of other published hydrophilicity scales.[54−59] As all water models yield very similar results, resulting in the same rank order only the TIP3P based scale is depicted here.

Table 2

Rank Ordered Hydrophobicity Scale Proposed in This Work (GIST-TIP3P) in Comparison to Previously Published Hydrophobicity Scales

The hydrophobicity scale of this work is in concordance with previously published scales. For all scales charged residues are found at the top end of the scale, as they are the most hydrophilic ones. The order of positive and negative charged residues are switched compared to all other scales. A possible reason is that the formation of the ion pair is neglected and solvation energies are reported. Also, the result for proline is different in our scale, but this can be explained with the unique property of proline: it has two chemical bonds connecting side chain and backbone. Therefore, it exerts a different influence of the backbone, which is most likely responsible for the deviation. As mentioned above, one of the key advantages of our method is that the entropy can be obtained directly from a single simulation. As the hydrophobicity values from the previous paragraph cannot be split into an entropic and enthalpic part, our entropy and enthalpy results are compared to the experimental values (free energy, enthalpy, and entropy of hydration for the amino acid side chains) published by Privalov et al.[60,61] (see Figure ). Their reported values are based on the measurement of distribution coefficients of molecules between the aqueous phase and the gas phase.

Figure 3

Free energy, enthalpy, and entropy values for all uncharged amino acids obtained with the GIST approach (y-axis) plotted against the experimentally obtained values (x-axis). The experimental values are free energies, enthalpy, and entropy of hydration reported by Privalov et al. Values are given in kcal/mol, and the equations for the linear regression are shown in the top left corner of every graph. For nearly all amino acids free energies from theory and experiment agree well with each other (Pearson correlation coefficient between 0.96 and 0.82). Only for tryptophan differences between experiment and theory are found. For the enthalpic values the agreement is even better (Pearson correlation coefficient >0.94). Thus, we conclude that the different water models treat enthalpic effects similarly and accurately. Comparing experimental and theoretical entropy values reveal interesting differences. Especially for the aromatic amino acids (tryptophan W, tyrosine Y, phenylalanine F, and histidine H) calculated entropies are up to 5 kcal/mol higher than experimental values. The employed force field seems to overestimate the entropic penalty for solvation of aromatic rings. As tryptophan is the largest aromatic amino acid this overestimation is most likely the reason for the deviation between experimental and calculated ΔG. The difference could probably also be an artifact arising from too high point charges in the water models. To compensate for polarization occurring in real water the charges are set to higher values than they would occur in the isolated molecules or when water is surrounded by an apolar environment. With higher charges the entropy change gets underestimated because high charges will order water molecules in a more defined way. The mismatch between experimental and theoretical values is similar for all three tested water models. Polarizable water models could potentially improve this estimation. The most common way to theoretically estimate thermodynamic properties of amino acid side chains is thermodynamic integration (TI). Similar to the described method, TI calculates the free energy gained by introducing/changing a group of the solute in solvent compared to the same transformation in the vacuum phase. From this point of view it is expected that TI results are very similar to the GIST-based results. Figure shows a comparison of our results to the results obtained by Hajari with TI.[15]

Figure 4

Free energy, enthalpy and entropy values for all uncharged amino acids obtained with the GIST approach (y-values) with respect to the values obtained by Hajari[15] (x-values) with thermodynamic integration. The values of the linear regression for every water model are given in the top left corner. The results from Hajari were obtained with the GROMOS 53A6 force field[62] and the SPC[63] water model. Despite the slightly different force field, the results are still in very good agreement with each other. The constant offset can be attributed to the contribution of the backbone atoms or differences due to conformational entropy of the solute. Especially for the TIP3P water model, results are nearly the same as with TI and the SPC water model, which is little surprising because the SPC and the TIP3P water models are very similar. Both water models are three-site water models with very similar properties. The differences in the results for the other two water models very likely arise from the different treatment of electrostatics. The good agreement suggests that both techniques TI and our GIST based hydrophobicity are capable of describing the change in free energy due to the solvation process and are equally correct (Similar results are also obtained for the enthalpy and entropy values.). We once more want to remind the reader that the presented scale includes an offset due to the inclusion of the backbone solvation. Therefore, only the relative values can be compared. Interestingly, we obtain slope values higher than 1.0 for the entropy. We would have expected values lower than 1.0 because GIST is neglecting higher order entropy terms. This may indicate that higher order entropy terms are small compared to the first order term and is in line with the results by Higgins et al., who stated that entropy is overestimated by IST calculations.[38] While both TI and the presented hydrophobicity scale agree well with each other, our approach can obtain the entropy directly from the phase space water can occupy. Therefore, only one simulation is necessary to get the solvation entropy (for a rigid solute). For TI on the other hand, entropic changes are typically derived indirectly from the temperature dependence of the free energy (using multiple simulations at different temperatures) as done by Hajari et al.[15] or via the difference of the free energy and the enthalpy. While Hajari et al. used 26 or more λ-values, 6 different temperatures, simulating every window for 5 ns, resulting in a total simulation time of ≥600 ns, we only needed 200 ns of simulation time (potentially even less without significant loss of accuracy). Being able to measure entropic and enthalpic contributions allows a closer look at frequently discussed entropy-enthalpy compensation, the negative correlation between entropy and enthalpy of hydration. This negative correlation is also found in our theoretical calculations, as one may see in Figure : more negative values for the enthalpy, i.e., favorable enthalpic interactions are accompanied by high values for −TΔS, and thus an entropic penalty.

Figure 5

Entropy-enthalpy compensation for uncharged amino acids obtained by the approach presented in this work: TIP3P (left), TIP4P (middle), TIP5P (right). Entropy is reported as −TΔS.

Entropy-enthalpy compensation for uncharged amino acids obtained by the approach presented in this work: TIP3P (left), TIP4P (middle), TIP5P (right). Entropy is reported as −TΔS. Although a negative correlation of the entropy and enthalpy is found overall, it is conspicuous that for all three water models polar amino acids tend to be found below the regression curve and apolar amino acids are found above. It may indicate that for polar amino acids the entropy-enthalpy compensation is rather small. The polar amino acids experience a smaller entropic penalty compared to apolar amino acids. Either this can be attributed to the simplistic treatment of electrostatics or entropy-enthalpy compensation is more pronounced at apolar surfaces. Alternatively, the shape of the apolar residues may lead to a different offset in the regression. The slope of the regressions for all water models ranges between 0.4 and 0.6 indicating that around half of the enthalpic interaction is compensated by entropy. It indicates that enthalpy and entropy are not perfectly correlated in simulations. Therefore, just looking at enthalpic values and neglecting the entropy can be misleading. Also, the entropy term (−TΔS) is in the same order of magnitude as the enthalpy, resulting in a significant contribution of the entropy to the free energy, which cannot be neglected. Our GIST based hydrophobicity scale provides not only entropy and enthalpy of solvation but also a spatial resolution of these quantities allowing for new insights into the hydrophobic character of amino acids side chains. It is possible to identify groups and/or functionalities that increase or decrease the hydration free energy. The implications of this are illustrated at the example of four amino acids depicted in Figure : it shows the spatial resolution of hydrophobicity obtained by analyses of the water environment around the amino acids.

Figure 6

Representative examples of the spatial analysis of hydrophobicity around the four amino acids, tyrosine, aspartate, leucine, and tryptophan, with the presented method. Red areas show enthalpically favored water regions; blue areas show regions of low entropy (unfavorable). Red spheres represent maximum likelihood water positions with a negative contribution to the free energy (happy water molecules), whereas blue spheres show water positions with a positive (unfavorable) contribution (unhappy water molecule). Black spheres represent water positions with very high (>4 kcal/mol) negative contribution to the free energy (very happy water). In Figure red regions symbolize enthalpically favored positions for water molecules, whereas blue regions show entropic unfavorable positions. Almost all entropic unfavorable regions have their origin in a high enthalpic binding, known as the previous discussed entropy-enthalpy compensation. For tyrosine one position on top of the phenyl ring can be identified with a positive contribution to the free energy of hydration. This position shows entropically unfavorable behavior which is not compensated by additional enthalpic interactions. For aspartate very strong enthalpic interactions are found due to the charged group. Leucine and tryptophan show almost only positive contributions to the solvation free energy (visualized with blue spheres), which is expected for apolar amino acids. The interaction of the backbone is similar for all amino acids and show mostly negative contributions to the free energy. The presented examples show that this method produces reasonable results for all amino acids and can therefore also be used to analyze polypeptides and proteins. This provides a tool to predict the positions of water molecules and allows us to estimate the binding energy of these water molecules at protein surfaces or in pockets. Replacing loosely bound water molecules in proteins by a ligand can result in a significantly gain of binding free energy. Therefore, our methodology can be used in drug design studies to analyze the hydrophobicity of the binding site and to predict the change in free energy when bound waters are released into bulk and/or replaced by an inhibitor.

Conclusion

Enthalpic and entropic contributions to hydrophobicity of amino acid side chains were calculated by inhomogeneous solvation theory using a grid based approach. A hydrophobicity scale is presented, allowing for the determination of the thermodynamic properties for all 20 canonical amino acids. The presented scale based on interaction free energies can be especially helpful in cases, where the interaction energy between solute and solvent and/or the mobility of the solvent molecules is the variable of interest, e.g. for ice nucleation.[64,65] For uncharged polar and apolar amino acids a good agreement with previously published experimental and theoretical hydrophobicity values was found. Therefore, the scale can also be used to determine the free energy for the transfer of a molecule from a polar to an apolar solvent. Our results also agree well with thermodynamic integration studies. However, the advantages of the GIST approach are (i) that it is possible to obtain with only one simulation the thermodynamic properties of interest resulting in shorter simulation time and (ii) that the entropy is calculated directly from the occupied phase space and not derived from multiple simulations. This allows us to analyze the spatial resolution of solvation thermodynamics as well as the individual contributions to it. Comparison with experimental values showed a weakness of state-of-the-art water models and nonpolarizable force field calculations. The entropic effects are not described equally well as the enthalpic contributions. Especially for very apolar environments like aromatic side chains the charges of the water model are too high,[66] and therefore the entropy penalty of solvation is overestimated. Nevertheless, the presented study showed that GIST is a valid approach to determine thermodynamic properties, especially relative differences in free energies and enthalpies, of amino acids. We conclude that our findings could be generalized, and our methodology may also be applied for the description of protein surfaces. An additional advantage of the described method is the possibility to locate the differences in the thermodynamic properties of water molecules around the solute. This is probably only of minor importance when free energies for the transition of amino acids from water to octanol or to the gas phase are investigated but could be of interest when the method is used to investigate biomolecules[70] as possible drug targets. It is very well-known that the hydration of binding sites plays a significant role in the binding processes of biomolecules.[67,68] This study validates the GIST approach to describe thermodynamic changes in the water due to the presence of amino acids. Therefore, it is possible to identify areas of unfavorable bound waters. This information can be used improve the ligand binding by replacing so-called unhappy waters.

55 in total

1. A biomolecular force field based on the free enthalpy of hydration and solvation: the GROMOS force-field parameter sets 53A5 and 53A6.

Authors: Chris Oostenbrink; Alessandra Villa; Alan E Mark; Wilfred F van Gunsteren
Journal: J Comput Chem Date: 2004-10 Impact factor: 3.376

2. Observation of immobilized water molecules around hydrophobic groups.

Authors: Y L A Rezus; H J Bakker
Journal: Phys Rev Lett Date: 2007-10-01 Impact factor: 9.161

3. Long-time correlations and hydrophobe-modified hydrogen-bonding dynamics in hydrophobic hydration.

Authors: John Tatini Titantah; Mikko Karttunen
Journal: J Am Chem Soc Date: 2012-05-29 Impact factor: 15.419

4. Release of high-energy water as an essential driving force for the high-affinity binding of cucurbit[n]urils.

Authors: Frank Biedermann; Vanya D Uzunova; Oren A Scherman; Werner M Nau; Alfonso De Simone
Journal: J Am Chem Soc Date: 2012-09-10 Impact factor: 15.419

5. Grid inhomogeneous solvation theory: hydration structure and thermodynamics of the miniature receptor cucurbit[7]uril.

Authors: Crystal N Nguyen; Tom Kurtzman Young; Michael K Gilson
Journal: J Chem Phys Date: 2012-07-28 Impact factor: 3.488

6. Stabilizing of a globular protein by a highly complex water network: a molecular dynamics simulation study on factor Xa.

Authors: Hannes G Wallnoefer; Sandra Handschuh; Klaus R Liedl; Thomas Fox
Journal: J Phys Chem B Date: 2010-06-03 Impact factor: 2.991

7. A simple method for displaying the hydropathic character of a protein.

Authors: J Kyte; R F Doolittle
Journal: J Mol Biol Date: 1982-05-05 Impact factor: 5.469

8. Hydration of tetraphenylphosphonium and tetraphenylborate ions by dielectric relaxation spectroscopy.

Authors: Wolfgang Wachter; Richard Buchner; Glenn Hefter
Journal: J Phys Chem B Date: 2006-03-16 Impact factor: 2.991

9. Water structural transformation at molecular hydrophobic interfaces.

Authors: Joel G Davis; Kamil P Gierszal; Ping Wang; Dor Ben-Amotz
Journal: Nature Date: 2012-11-22 Impact factor: 49.962

10. Spatial Decomposition of Translational Water-Water Correlation Entropy in Binding Pockets.

Authors: Crystal N Nguyen; Tom Kurtzman; Michael K Gilson
Journal: J Chem Theory Comput Date: 2015-12-04 Impact factor: 6.006

17 in total

1. Mixtures of two self- and mutually-associating liquids: Phase behavior, second virial coefficients, and entropy-enthalpy compensation in the free energy of mixing.

Authors: Jacek Dudowicz; Jack F Douglas; Karl F Freed
Journal: J Chem Phys Date: 2017-08-14 Impact factor: 3.488

2. Prediction of molecular interactions and physicochemical properties relevant for vasopressin V2 receptor antagonism.

Authors: Ania de la Nuez Veulens; Yoanna M Álvarez Ginarte; Rolando E Rodríguez Fernandez; Fabrice Leclerc; Luis A Montero Cabrera
Journal: J Mol Model Date: 2022-01-07 Impact factor: 1.810

3. Target SARS-CoV-2: theoretical exploration on clinical suitability of certain drugs.

Authors: Sk Md Nayeem; E Mohammed Sohail; N V Srihari; P Indira; M Srinivasa Reddy
Journal: J Biomol Struct Dyn Date: 2021-05-14

4. Binding Pose Flip Explained via Enthalpic and Entropic Contributions.

Authors: Michael Schauperl; Paul Czodrowski; Julian E Fuchs; Roland G Huber; Birgit J Waldner; Maren Podewitz; Christian Kramer; Klaus R Liedl
Journal: J Chem Inf Model Date: 2017-02-01 Impact factor: 4.956

5. Balance between hydration enthalpy and entropy is important for ice binding surfaces in Antifreeze Proteins.

Authors: Michael Schauperl; Maren Podewitz; Teresa S Ortner; Franz Waibl; Alexander Thoeny; Thomas Loerting; Klaus R Liedl
Journal: Sci Rep Date: 2017-09-19 Impact factor: 4.379

6. Structural Insights into σ₁ Receptor Interactions with Opioid Ligands by Molecular Dynamics Simulations.

Authors: Mateusz Kurciński; Małgorzata Jarończyk; Piotr F J Lipiński; Jan Cz Dobrowolski; Joanna Sadlej
Journal: Molecules Date: 2018-02-18 Impact factor: 4.411

7. Molecular Dynamics Gives New Insights into the Glucose Tolerance and Inhibition Mechanisms on β-Glucosidases.

Authors: Leon Sulfierry Corrêa Costa; Diego César Batista Mariano; Rafael Eduardo Oliveira Rocha; Johannes Kraml; Carlos Henrique da Silveira; Klaus Roman Liedl; Raquel Cardoso de Melo-Minardi; Leonardo Henrique Franca de Lima
Journal: Molecules Date: 2019-09-04 Impact factor: 4.411

8. Macrocycle Cell Permeability Measured by Solvation Free Energies in Polar and Apolar Environments.

Authors: Anna S Kamenik; Johannes Kraml; Florian Hofer; Franz Waibl; Patrick K Quoika; Ursula Kahler; Michael Schauperl; Klaus R Liedl
Journal: J Chem Inf Model Date: 2020-06-29 Impact factor: 6.162

9. Solvation Thermodynamics in Different Solvents: Water-Chloroform Partition Coefficients from Grid Inhomogeneous Solvation Theory.

Authors: Johannes Kraml; Florian Hofer; Anna S Kamenik; Franz Waibl; Ursula Kahler; Michael Schauperl; Klaus R Liedl
Journal: J Chem Inf Model Date: 2020-07-20 Impact factor: 6.162

10. Solvation Free Energy as a Measure of Hydrophobicity: Application to Serine Protease Binding Interfaces.

Authors: Johannes Kraml; Anna S Kamenik; Franz Waibl; Michael Schauperl; Klaus R Liedl
Journal: J Chem Theory Comput Date: 2019-10-24 Impact factor: 6.006