Literature DB >> 29719674

Cation-π interactions in protein-ligand binding: theory and data-mining reveal different roles for lysine and arginine.

Kiran Kumar1, Shin M Woo1, Thomas Siu1, Wilian A Cortopassi1, Fernanda Duarte2, Robert S Paton1.   

Abstract

We have studied the cation-π interactions of neutral aromatic ligands with the cationic amino acid residues arginine, histidine and lysine using ab initio calculations, symmetry adapted perturbation theory (SAPT), and a systematic meta-analysis of all available Protein Data Bank (PDB) X-ray structures. Quantum chemical potential energy surfaces (PES) for these interactions were obtained at the DLPNO-CCSD(T) level of theory and compared against the empirical distribution of 2012 unique protein-ligand cation-π interactions found in X-ray crystal structures. We created a workflow to extract these structures from the PDB, filtering by interaction type and residue pKa. The gas phase cation-π interaction of lysine is the strongest by more than 10 kcal mol-1, but the empirical distribution of 582 X-ray structures lies away from the minimum on the interaction PES. In contrast, 1381 structures involving arginine match the underlying calculated PES with good agreement. SAPT analysis revealed that underlying differences in the balance of electrostatic and dispersion contributions are responsible for this behavior in the context of the protein environment. The lysine-arene interaction, dominated by electrostatics, is greatly weakened by a surrounding dielectric medium and causes it to become essentially negligible in strength and without a well-defined equilibrium separation. The arginine-arene interaction involves a near equal mix of dispersion and electrostatic attraction, which is weakened to a much smaller degree by the surrounding medium. Our results account for the paucity of cation-π interactions involving lysine, even though this is a more common residue than arginine. Aromatic ligands are most likely to interact with cationic arginine residues as this interaction is stronger than for lysine in higher polarity surroundings.

Entities:  

Year:  2018        PMID: 29719674      PMCID: PMC5903419          DOI: 10.1039/c7sc04905f

Source DB:  PubMed          Journal:  Chem Sci        ISSN: 2041-6520            Impact factor:   9.825


Introduction

Cation–π interactions influence biological structures, molecular recognition and catalysis.1–5 They play an important role in determining protein structure and function. X-ray structural analyses and computational studies show that cation–π interactions are prevalent in protein–protein,6–9 protein–ligand,10–12 and protein–DNA complexes.13 It is estimated that a “typical” protein contains at least one cation–π interaction for every 77 amino acid residues.7 The existence of almost 100 000 X-ray crystal structures currently in the Protein Data Bank (PDB), with an average protein length of 802 residues, suggests more than 1 000 000 cation–π interactions may be present in those structures. Cation–π interactions are vital for several physiological processes. Notable examples include acetylcholine binding,14–17 the recognition of post-translationally modified histones by epigenetic proteins18–21 and ion selectivity in K+ channels.22 These are examples of positively charged ligands interacting favorably with the aromatic sidechains of histidine, phenylalanine, tryptophan and tyrosine residues. On the other hand, the interaction of aromatic ligands with positively charged residues – arginine, histidine, lysine – can also occur. Given the abundance of aromatic and heteroaromatic rings in the structures of drug-like molecules, this type of cation–π interaction is the focus of this paper. For example, sorafenib, approved for the treatment of liver cancer, forms a cation–π interaction with a positively charged lysine residue of the human p38 MAP kinase (Fig. 1A).23 Lapatinib, approved for breast cancer treatment, forms a cation–π interaction with a lysine residue of ErbB4 kinase (Fig. 1B).24 Dihydroquinoxalinone derivatives selectively inhibit the CBP/p300 bromodomain over the closely-related BRD4 receptor, attributed to the formation of a cation–π interaction with a positively charged arginine residue close to the active site (Fig. 1C).25–27 Predictable structure–activity relationships were established according to the electrostatic interaction strength.27
Fig. 1

Aromatic ligands forming cation–π interactions with the target protein: (A) sorafenib complexed with Human p38 MAP kinase, (B) lapatinib in complex with ErbB4 kinase, (C) Conway and co-workers' dihydroquinoxalinone inhibitor of the CREBBP bromodomain.

Cation–π interactions are dominated by the electrostatic attraction between an electron-rich arene and electron-deficient cation.28,29 Houk30 and Wheeler31,32 showed that ring substituents augment this interaction strength with additional electrostatic interactions between the substituents and cation. Differences in binding strength can be explained quantitatively in terms of these additive, through-space electrostatic interactions, more so than by considering any polarizing effect on the π system. Nevertheless, non-electrostatic effects such as induction feature in accurate physical descriptions of cation–π interaction energies.33,34 Analyses of the structures deposited in the PDB provide information about the occurrence and geometric characteristics of cation–π interactions in biological systems, including protein–DNA complexes,35 protein–protein interactions,8,9 metal cation–π interactions36 and cation–π–cation interactions.37 Quantifying the nature and magnitude of these interactions remains a challenge. Combined studies employing both small models and crystal structure analyses have proven useful in understanding these interactions in naturally occurring systems. The aromatic–aromatic interactions of nitroarenes with amino acid side chains of histidine, phenylalanine, tryptophan, and tyrosine have been studied in this way. Wheeler constructed CCSD(T)/aug-cc-pVTZ//B97-D/TZV(2d,2p) theoretical models and compared the computed energies and geometries against nitroarene binding sites from the PDB.38 Chipot studied model systems of intra-residue cation–π interactions between lysine/arginine and phenylalanine/tyrosine/histidine complexes at the MP2/6-311++G**//MP2/6-31G** level of theory and compared them to results obtained with 1718 cation–π protein–protein complexes found in the PDB.8 Over 40% of small molecule drugs launched in clinical trials or under development in 2009 contained a (hetero)-aromatic ring.39 Given the prevalence of aromatic rings in drug-like molecules, we reasoned there should be many crystallographically characterized cation–π interactions in which aromatic rings of ligands interact with positively-charged amino acids of protein active sites. A large collection of structures provides a statistically significant empirical dataset against which to test the relevance of theoretical model complexes. The foci of previous studies exploring the presence of cation–π interactions in biomolecular databases3 have centered on cationic ligands interacting with aromatic residues8 and intra-residue cation–π interactions inside proteins40 or at protein–protein interfaces.9 Comparative studies on aromatic ligands are restricted to nucleic acid bases.40 To the best of our knowledge, a meta-analysis of the interactions between aromatic ligands and cationic residues has not been described previously. Herein, we study the occurrence, geometrical features and magnitude of cation–π interactions between positively charged amino acid sidechains: arginine (Arg), lysine (Lys), and histidine (His), and (non-protein) ligands featuring an aromatic ring. The distance and angular dependence of these interactions has been characterized at the DLPNO-CCSD(T) level of theory and interpreted with symmetry adapted perturbation theory (SAPT). These results are compared to naturally occurring interactions between ligands and proteins found through systematic data-mining of non-redundant protein structures from the PDB.

Methods

Model cation–π complexes

Positively-charged sidechains of lysine, arginine and histidine residues were abbreviated in our quantum chemical studies to ammonium [NH4]+, guanidinium [Gdm]+ and imidazolium [Imi]+ cations. We used benzene as the archetype aromatic ligand. We explicitly considered two conformations of each complex (Fig. 2). For complex [C6H6][NH4]+, these differ by the orientation of the N–H bonds above ring atoms (A) or bonds (B); similarly for [C6H6][Gdm]+ the NH2 groups are either above ring atoms (C) or bonds (D). For the [C6H6][Imi]+ complex, we considered two rotamers with a parallel alignment of the two rings (E and F); and two distinct perpendicular (T-shape) complexes (G and H).
Fig. 2

Cation–π complexes analyzed in this work. Models of (A)/(B) lysine; (C)/(D), arginine; and (E)–(H), histidine sidechains interacting with benzene.

Based on gas phase ab initio calculations, a bidentate orientation of NH4+ above aromatic rings is favored over those with one or three N–H atoms facing the aromatic ring.41 This binding mode is most prevalent in protein inter-residue interactions.42 Guanidiniumbenzene complexes can adopt at least two conformations: perpendicular (T-shaped) or parallel. While T-shaped geometries are preferred in gas-phase, parallel-complexes are preferred in solution and have been observed more frequently in protein structures.7,43 We studied the parallel configuration, based on its biological relevance.7,44 For [C6H6][Imi]+ complexes, both perpendicular and parallel arrangements are found in protein interactions.45 We included both interaction types in our analysis. We studied the interaction energy (Eint) of each model cation–π complex as a function of intermolecular separation and of horizontal displacement parallel to the aromatic plane. Distances were calculated between the center of mass of both species in the complex. Cartesian (x,y) displacements are defined with respect to the benzene ring as shown (Fig. 3).
Fig. 3

(Left) Parameters describing the relative geometry for PEC calculations between the cation and benzene using the distance (R), vertical offset (R, along the normal) and horizontal offsets (R and R, parallel to the plane of benzene). (Right) The side and angle displacements of the cation relative to benzene corresponding to vectors pointing to a C–C/C–H bond by adjusting X and Y coordinates, used to describe the difference in geometry between pairs of complexes, e.g. (E) and (F).

Potential energy curves (PECs) were generated at the domain-based local pair-natural orbital coupled cluster with perturbative triple excitations, DLPNO-CCSD(T),46 level of theory. An augmented, correlation consistent basis set, aug-cc-pVTZ, was used. The convergence of valence DZ, TZ, and QZ quality basis sets was examined for the [benzene][Na]+ complex (Fig. S1†). The aug-cc-pVTZ equilibrium separation closely matches that obtained with a larger aug-cc-pVQZ basis, with an interaction energy within 0.5 kcal mol–1. DLPNO-CCSD(T) energies achieve an accuracy of 1 kcal mol–1 or better compared to CCSD(T),47 while CCSD(T)/CBS values are generally considered benchmark values for intermolecular interaction energies.48 DLPNO-CCSD(T) thermochemistry of small organic molecules is accurate to within 3 kJ mol–1 against experimental data.49 For complexes (A), (C), (E), and (G) CCSD(T) calculations were also carried out using a dielectric constant of 4.2 (diethyl ether) and 78.4 (water). In relation to the D6h symmetry of benzene, two vectors in the plane of the ring represent extreme scenarios of displacement – one towards a C–H bond (angle displacement) and the other towards a C–C bond (side displacement). These vectors are related by a rotation of μ = 30° about the C6 axis (Fig. 3). By plotting a potential energy curve (PEC) with vertical distance against displacement in each vector, behaviour for 12 planes (360°/30°) about the C6 axis of benzene can be achieved for a minimum of 12 geometries per complex. MP2/aug-cc-pVTZ optimized geometries of monomers were obtained imposing Td, D3h, C2v and D6h symmetry restraints for the NH4+, Gdm+, Imi+, and benzene monomers, respectively. E int was then calculated as a difference between the energy of the complex and the sum of the energies of the optimized monomers (eqn (1)): MP2 optimizations were performed with Gaussian50 and DLPNO-CCSD(T) energy calculations with ORCA.51 Computed structures and absolute energies are available as ESI (Tables S2–S9†). The DLPNO-CCSD(T) potential energy curves were interpolated (spline interpolation in 1-d with SciPy) to obtain minimum energy separations and corresponding geometries, on which symmetry adapted perturbation theory computations (SAPT)52 were performed with PSI4.53 The aug-cc-pVDZ-JKFIT basis set was used for these calculations. Briefly, SAPT provides a rigorous partitioning of the intermonomer interaction energy (Eint) into various physical contributions. These include short-range exchange–repulsion (Eee), electrostatics (Eele, e.g. charge–charge, charge–dipole, dipoledipole, etc.), induction polarization (Eind, e.g. dipole/induced-dipole) and London dispersion forces (Edisp, e.g. instantaneous dipole/induced dipole). This approach has been used to analyze non-covalent interactions including π–π and cation–π interactions.54 SAPT (at orders above SAPT2) gives interaction energies close to benchmark CCSD(T)/CBS values.55,56 Our results are consistent with this (Table S1†), with all SAPT2+3 interaction energies lying within ±0.6 kcal mol–1 of corresponding DLPNO-CCSD(T) values (see Fig. S5† for full comparison).

Meta-analysis of cation–π interactions involving aromatic ligands

An empirical distribution of protein–ligand cation–π interactions was sought from the PDB.57 The following workflow was implemented in Python to select a high-resolution, non-redundant dataset: (i) X-ray crystal structures of proteins with non-covalent bound ligands at a resolution of ≤2.0 Å were retained for further analysis. This returned 30 053 structures. (ii) The Protein–Ligand Interaction Profiler (PLIP)58 was used to identify cation–π interactions by imposing geometric criteria. OpenBabel was used to identify rings by Smallest Set of Smallest Rings (SSSR) perception and assign aromaticity.59 Based on previous works,7,8 two thresholds were applied: a 6.0 Å cut-off from the centroid of the aromatic ring and a 2.3 Å horizontal cut-off in the plane of the ring (Fig. S2†). (iii) PROPKA 3.1 (ref. 60) was used to determine the protonation state of all residues at a physiological pH of 7.4. This considers the local environmental perturbation of sidechain intrinsic pKa values. This is most pertinent for histidine residues, whose protonation state is influenced by nearby residues, although instances of neutral lysine and arginine residues were also predicted. We removed all interactions where these sidechains were predicted to be neutral. Equally, we found instances initially characterized as a π–π interaction that were reassigned as cation–π interactions once the sidechain was predicted to be protonated. (iv) Two sources of duplicate records were considered and removed; homologous protein chains within the same protein structure, and polycyclic aromatic ligands that were initially counted as two or more distinct interactions by satisfaction of the geometric cutoffs. To address this, only the first homologous chain containing the cation–π interaction was retained. In the case of polycyclic aromatic ligand interactions, these were further classified as either ‘fused’, ‘sandwich’, or ‘bridge’ complexes (Fig. S3†). For fused complexes the shortest distance was used and described as one cation–π complex, while sandwich and bridge complexes containing n aromatic rings were treated as n occurrences. Following this automated workflow, we obtained a total of 1827 unique protein structures, with 2012 cation–π interactions (Table 1).
Table 1

PDB protein–ligand search filtration results at each stage

StageFilterPDBsHisArgLysTotal
a2.0 ≥ Å resolution, containing ligand30 05330 053
bGeometric thresholds3248495931419309030
cResidue pKa >7.418276829548483870
dNon-redundant chain and polycyclic ligand18274913815822012

Results and discussion

We computed DLPNO-CCSD(T)/aug-cc-pVTZ interaction energies for all eight complexes along an intermolecular axis perpendicular to the aromatic plane, as shown in Fig. 4.
Fig. 4

Top: DLPNO-CCSD(T)/aug-cc-pVTZ interaction energies (kcal mol–1) as a function of intermolecular separation of cation–π complexes. Minimum energies (Emin) and equilibrium separations (R) shown. Bottom: NCI isosurfaces at the minimum energy separations.

The strongest interaction energies occur for complexes in which N–H bonds are oriented towards the aromatic ring: for model complexes (A)/(B) of lysine (–19.2 & –18.7 kcal mol–1) and the T-shaped histidine complex G (–14.0 kcal mol–1). In the other T-shaped complex a C–H bond is oriented towards the ring, giving a smaller interaction energy of –11.5 kcal mol–1. Interaction energies for the stacked complexes, arginine (C)/(D) (–7.5 & –7.4 kcal mol–1) and parallel histidine (E)/(F) (–7.7 & –7.7 kcal mol–1) are weaker still. The more strongly interacting complexes ((A), (B), (G) and (H)) have shorter equilibrium intermolecular separations around 2.9–3.0 Å. Correspondingly, the Non-Covalent Interaction (NCI) isosurfaces61 (Fig. 4) show focused, strongly attractive regions due to the polar X–H–π contacts and more diffuse, weakly attractive regions associated with the stacked systems. Molecular van der Waals surfaces using Bondi radii (Fig. S4†) show that X–H···π contacts occur at distances less than the sum of atomic van der Waals radii, whereas the stacked complexes are separated by interatomic distances slightly longer than the sum of these radii. These interactions depend negligibly on the orientation of each complex. Very similar interaction energy profiles (within 0.5 kcal mol–1) were obtained for the two rotamers of each of the lysine, arginine, and parallel histidine complexes. For the T-shaped [C6H6][Imi]+ complexes, there is a more pronounced difference, depending on whether an N–H or C–H bond is oriented towards the arene. An N–H···π interaction is 2.5 kcal mol–1 (complex (F)) is more stable than C–H···π interaction (complex (G)) and gives a shorter equilibrium separation by 0.25 Å. This is due to the increased polarity of the N–H bond.62 Comparison with the available gas-phase experimental thermochemistry is promising. Our DLPNO-CCSD(T) interaction energy for [C6H6][NH4]+ of –19.2 kcal mol–1 matches the experimental ΔH° (380 K) of –19.3 ± 1.0 kcal mol–1,41 as well as [C6H6][K]+, being –19 kcal mol–1.54 In order to further rationalize these quantitative results, SAPT2+3/aug-cc-pVDZ analysis was used to partition the interaction energy (Eint) into various physical contributions (exchange repulsion Eee, induction Eind, dispersion Edisp, and electrostatic Eele) (Fig. 5) at the equilibrium separation of each complex. In line with previous studies,56 the SAPT2+3 Eint methods gave values within ±0.6 kcal mol–1 of the DLPNO-CCSD(T) energies (Fig. S5†).
Fig. 5

SAPT2+3/aug-cc-pVDZ decomposition of the interaction energy (in kcal mol–1) into exchange repulsion (Eee, red), electrostatics (Eelec, blue), induction-polarization (Eind, green) and London dispersion (Edisp, yellow) at the equilibrium separation.

The SAPT2+3 decomposition shows that [C6H6][NH4]+ complexes (A)/(B) are very different from the others. The dominant favorable terms are electrostatic and induction (polarization). In the remaining complexes the terms are more evenly balanced: for T-shape [C6H6][Imi]+ complexes (G)/(H) the electrostatic term is the most favorable, whereas for [C6H6][Gdm]+ complexes (C)/(D) and parallel [C6H6][Imi]+ complexes (E)/(F) the dispersion term is most favorable. Hobza has proposed that noncovalent complexes can be classified based on the ratio of SAPT Edisp/Eelec terms.63 Empirical observation suggests that dispersion/electrostatics ratio less than 0.59 should be categorized as electrostatic; greater than 1.7 (1/0.59) as dispersion bound; between 0.59 and 1.7 as mixed. Adopting this convention, [C6H6][NH4]+ complexes (A) and (B) are electrostatics dominated (Edisp/Eelec = 0.49–0.50), whereas all others are mixed (Edisp/Eelec = 0.82–0.83 for complexes (G) and (H); Edisp/Eelec = 1.13–1.27 for complexes (C)–(F)). Finally, the large energy difference between (H) and (G), with the former being 2.5 kcal mol–1 more stable, arises from favorable Eee, Eind, which operate cooperatively. In this case, the N–H bond closest to the ring is more polar, and therefore induces both a stronger electrostatic attraction and a greater polarization response from the π system. This brings [Imi]+ closer to the ring of the benzene by 0.18 Å, which increases both Eee and, to a lesser extent, Edisp. The favorable interactions from this closer contact negate the increase in repulsive Eee. In addition to the magnitude of the interaction strengths, SAPT analysis illustrates a continuum of interaction type. The [C6H6][NH4]+ cation–π interactions are dominated by the favorable electrostatic attraction, whereas the other complexes feature a balanced mixture of electrostatic and dispersion interactions. Within this mixed category, the electrostatic term is the most favorable in the [C6H6][Gdm]+ T-shape complexed, and is 3–4 kcal mol–1 larger in magnitude than those experienced by [C6H6][Gdm]+ and [C6H6][Imi]+ stacked complexes. The stacked complexes have the smallest electrostatic terms and dispersion is the most favorable attractive term. The distinct nature of this interaction has been additionally classified as a π+–π interaction.64

Cation–π interaction in protein–ligand complexes

The prevalence of protein–ligand cation–π interactions was evaluated in a non-redundant set of PDB structures. An initial search returned 30 053 X-ray crystal structures with resolution ≤2 Å containing unique ligands (Table 1, stage A). Following an implementation of geometric rule-based criteria, 3248 structures were found to contain 9030 active sites with a Lys/Arg/His residue in proximity to an aromatic ligand (Table 1, stage B). Of the complexes identified, 4954 involved histidine, however, these results also contained π–π interactions involving a neutral residue. The protonation state of each of the residues at physiological pH was computed with PROPKA (Table 1, stage C), drastically reducing the number of interactions involving histidine acting as a cation. In contrast, Arg-ligand and Lys-ligand complexes retained 94% and 92% of their interactions from the previous stage. Finally, 40% of duplicate interactions were removed to yield a final data set of 2012 interactions dominated by Arg (69%), Lys (29%), and to a lesser extent His (2%) residues (Table 1, stage D).

Empirical vs. calculated cation–π potential energy surface

We compared the distribution of cation–π complexes extracted from crystal structures against the DLPNO-CCSD(T)/aug-cc-pVTZ PES. For each interaction the vertical displacement from the aromatic plane (R) is shown against horizontal displacement from the ring centroid (R) (Fig. 6). The underlying color reflects the calculated interaction energy of the model complex for arginine, histidine and lysine.
Fig. 6

Empirical distribution of complexes (points) superimposed on the computed DLPNO-CCSD(T)/aug-cc-pVTZ potential energy surface (in kcal mol–1); (left) Arg-aromatic primary (purple) and secondary bicyclic (yellow), (middle) His-aromatic, and (right) Lys-aromatic complexes.

For arginine, the empirical distribution from X-ray and computed PES agree well. The 1381 geometries are clustered empirically in the minimum energy well of the PES, with a vertical offset of 3.5 Å. Additionally, interplanar angles between the guanidium and aromatic group are predominantly clustered below 30° – i.e. closer to a parallel stacked geometry, as considered by our model calculations, than to a T-shaped conformation (Fig. S6†). The computed PES is relatively flat with respect to horizontal displacement and accordingly the interactions are evenly spread across this range. Additionally, most of the interactions with secondary rings of bicyclic aromatics are found within the potential well. Much fewer (59) X-ray structures were available for cationic His-aromatic complexes, making statistical inference more difficult. 70% of the complexes are found close to the minimum energy well centered at R = 1.25, R = 3.25. The computed well for lysine is deeper and narrower, however, the empirical distribution of these interactions is scattered widely (Fig. 6). Unlike arginine, the empirical distribution of lysine's cation–π interactions is more scattered than predicted by theory. A large cluster of interactions occurs at R = 2.0, R = 4.0, far from the PES minimum at (0.0, 3.0). Why do the X-ray structures of cationic argininearene interactions match theory well, whereas cationic lysinearene interactions do not? Recalling our SAPT results, dispersion interactions prevail for arginine while for lysine, electrostatics interactions dominate. This led us to consider how each interaction type is influenced by the surrounding medium. We compared the empirical distance dependence of each interaction type against the DLPNO-CCSD(T) energy profile calculated in the gas-phase, and with a surrounding conductor-like polarizable continuum model (CPCM)65 with dielectric constant of 4.2 and 78.4 (Fig. 7). These values reflect the average polarity of the relatively hydrophobic protein interior and that of bulk water.66 Solvation corrections were computed at the MP2/cc-pVTZ level of theory.
Fig. 7

Normalized distance dependence of empirical interactions (bars) compared against DLPNO-CCSD(T)/aug-cc-pVTZ computed potential energy curves (kcal mol–1) in the gas phase and with a dielectric constant of 4.2 (diethyl ether) and 78.4 (water). Solvation corrections were computed at the CPCM-MP2/cc-pVTZ level of theory.

Each cation–π interaction is weakened by the presence of a surrounding dielectric medium. For lysine, the interaction strength decreases to 19% of its gas-phase value (to 3.6 kcal mol–1) for ε = 4.2, and 7% (1.3 kcal mol–1) for ε = 78.4. For arginine, the decrease is less pronounced at these values of dielectric constant, to 47% (3.2 kcal mol–1) and 34% (2.3 kcal mol–1) – in water this interaction is stronger than lysine's even though it is less favorable by 11.5 kcal mol–1 in the gas-phase. With the SMD solvation model,66 arginine's cation–π interaction is also stronger than lysine's in water, although the interaction strengths were greater. A previous computational estimate of the lysinebenzene interaction strength in water is larger (5.5 kcal mol–1 with SM5.42R/HF/6-31+G*)7b compared to the values of 1.3/2.8 kcal mol–1 (CPCM/SMD) we have obtained with correlated wavefunction theory and a larger basis set. The scattered distribution of lysinearene interactions found empirically is consistent with a small interaction strength. This also explains the relative paucity of this interaction type, in relation to the abundance of lysine residues in proteins. In contrast, the argininearene interaction strength is less strongly influenced by the presence of a surrounding polar medium and a clear minimum remains on the PEC. The high frequency of this interaction type and the empirical distribution is consistent with the position of this minimum energy separation. The distinct behavior of lysine's and arginine's interactions results from the predominantly electrostatic character of the former interaction and mixed character of the latter, as shown by SAPT analysis. The cation–π interactions of T-shaped histidine are weakened to 21% of the gas-phase value (3.0 kcal mol–1) for ε = 4.2 and 9% (1.3 kcal mol–1) for ε = 78.4. For stacked histidine the reduction is smaller, to 44% (3.4 kcal mol–1) and 32% (2.5 kcal mol–1) at these values of dielectric constant. Again, the interaction with a greater electrostatic character (by SAPT) is diminished more severely by the surrounding dielectric medium. We considered as large a data set possible, but the interactions we found are still biased by the proteins and ligands studied experimentally and crystallized. Few lysinearene interactions were found empirically above the center of the aromatic ring, although the PES minimum is found at R = 0. Fig. 8 illustrates how several intermolecular interactions influence ligand position, such that the cation–π interaction itself between lysinearene ligands is not a determinant of this pose. For example, a high proportion (63%) of the lysinearene interactions found around R, R ≈ (2.0, 4.0) involve GTP/ATP/NAD/FAD related ligands. In such cases, the ligand's binding position is also influenced by a hydrogen bond interaction involving the ribose ester and the interactions between the phosphate group and the surroundings (Fig. 8A). It is interesting to notice that even when the systems involving these cofactors are removed from the analysis (Fig. S7†), the scattered nature of Lys-aromatic complexes is still evident. This is more likely due to additional interactions, such as salt bridges or hydrogen bonds, that Lysine can establish with nearby residues and solvent molecules. In the case of argininearene complexes, some long distance (6 Å) cation–π interactions are observed as a result of a π–π interaction with another aromatic residue (Fig. 8B). Cation–π interactions may also be shortened due to cooperative effects, such as those which occur when an N–H aromatic interaction between Arg101 and His98A inductively enhances the positive charge on arginine, thereby strengthening the electrostatic components of the cation–π interaction and salt bridge (Fig. 8C).
Fig. 8

(A) GDP/GTP Lys-aromatic binding site selected from ammonium-PES RR ≈ (2.0, 4.0) (ligand ID GDP). Examples of Arg-aromatic cation–π (B) long distance (ligand ID FX4) and (C) short distance bond (ligand ID HEM).

Conclusions

We have studied the cation–π interactions of neutral aromatic ligands with the cationic amino acid residues arginine, histidine and lysine. Quantum chemical potential energy surfaces for these interactions were obtained at the DLPNO-CCSD(T) level of theory and compared against the empirical distribution of 2012 unique protein-ligand cation–π interactions found in X-ray crystal structures. We created a workflow to extract these structures from the PDB, filtering by interaction type and residue pKa. The gas phase cation–π interaction of lysine is significantly stronger than arginine by more than 10 kcal mol–1. However, the distribution of 582 empirical structures is scattered away from the minimum on the interaction PES, with a much longer intermolecular separation. In contrast, the 1381 structures involving arginine (69% of the total found) reflect the underlying calculated PES well. This reflects the different electrostatic contributions to each of these residues' interactions with an aromatic ring, which was established by SAPT analysis. The electrostatics dominated lysinearene interaction is greatly diminished by a surrounding dielectric medium, such that it becomes essentially negligible in strength and without a well-defined equilibrium separation. The argininearene interaction involves a near equal mix of dispersion and electrostatic attraction, which is weakened to a much smaller degree by the surrounding medium. Our results account for the relative paucity of cation–π interactions involving lysine, despite its prevalence in protein structures. Particularly in protein active sites of medium to high polarity, such as those which are solvent exposed, the lysinearene interaction is predicted to be weakened to the point that it has a negligible effect on an aromatic ligand binding mode. In contrast, the cation–π interaction made by arginine residues with aromatic ligands is more robust to changes in the surrounding environment. This interaction is the most frequent found empirically and is also computed to be stronger than for lysine in higher polarity surroundings. There are relatively few cation–π interactions involving positively charged histidine residues, although the stacked π+–π interaction is predicted to be of similar magnitude to that of arginine. Systematic analysis of crystal structures showed that other factors, such as competitive hydrogen bonding interactions and solvent accessibility, unconnected to the cation–π interaction may also affect the cation–π interaction geometry. This investigation has characterized the intrinsic properties of biologically relevant cation–π interactions and the effect of the protein environment in an average way using a surrounding polarizable dielectric medium. Future work will consider the inhomogeneous nature of the local environment on these interactions.67

Conflicts of interest

There are no conflicts to declare. Click here for additional data file. Click here for additional data file.
  57 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  The Cationminus signpi Interaction.

Authors:  Jennifer C. Ma; Dennis A. Dougherty
Journal:  Chem Rev       Date:  1997-08-05       Impact factor: 60.622

3.  Development of a fluorescent-tagged kinase assay system for the detection and characterization of allosteric kinase inhibitors.

Authors:  Jeffrey R Simard; Matthäus Getlik; Christian Grütter; Vijaykumar Pawar; Sabine Wulfert; Matthias Rabiller; Daniel Rauh
Journal:  J Am Chem Soc       Date:  2009-09-23       Impact factor: 15.419

Review 4.  Cation-π interaction: its role and relevance in chemistry, biology, and material science.

Authors:  A Subha Mahadevi; G Narahari Sastry
Journal:  Chem Rev       Date:  2012-11-13       Impact factor: 60.622

5.  An efficient and near linear scaling pair natural orbital based local coupled cluster method.

Authors:  Christoph Riplinger; Frank Neese
Journal:  J Chem Phys       Date:  2013-01-21       Impact factor: 3.488

6.  Enzymatic catalysis of anti-Baldwin ring closure in polyether biosynthesis.

Authors:  Kinya Hotta; Xi Chen; Robert S Paton; Atsushi Minami; Hao Li; Kunchithapadam Swaminathan; Irimpan I Mathews; Kenji Watanabe; Hideaki Oikawa; Kendall N Houk; Chu-Young Kim
Journal:  Nature       Date:  2012-03-04       Impact factor: 49.962

7.  Revealing noncovalent interactions.

Authors:  Erin R Johnson; Shahar Keinan; Paula Mori-Sánchez; Julia Contreras-García; Aron J Cohen; Weitao Yang
Journal:  J Am Chem Soc       Date:  2010-05-12       Impact factor: 15.419

8.  Amino/aromatic interactions in proteins: is the evidence stacked against hydrogen bonding?

Authors:  J B Mitchell; C L Nandi; I K McDonald; J M Thornton; S L Price
Journal:  J Mol Biol       Date:  1994-06-03       Impact factor: 5.469

9.  Catalytic enantioselective synthesis of indanes by a cation-directed 5-endo-trig cyclization.

Authors:  Craig P Johnston; Abhishek Kothari; Tetiana Sergeieva; Sergiy I Okovytyy; Kelvin E Jackson; Robert S Paton; Martin D Smith
Journal:  Nat Chem       Date:  2015-01-12       Impact factor: 24.427

10.  Induction effects in metal cation-benzene complexes.

Authors:  Ignacio Soteras; Modesto Orozco; F Javier Luque
Journal:  Phys Chem Chem Phys       Date:  2008-04-07       Impact factor: 3.676

View more
  44 in total

1.  Discovery of Imidazoisoindole Derivatives as Highly Potent and Orally Active Indoleamine-2,3-dioxygenase Inhibitors.

Authors:  Wangyang Tu; Fanglong Yang; Guoji Xu; Jiangtao Chi; Zhiwei Liu; Wei Peng; Bing Hu; Lei Zhang; Hong Wan; Nan Yu; Fangfang Jin; Qiyue Hu; Lianshan Zhang; Feng He; Weikang Tao
Journal:  ACS Med Chem Lett       Date:  2019-06-03       Impact factor: 4.345

Review 2.  Physical Chemistry of Cellular Liquid-Phase Separation.

Authors:  Emily P Bentley; Benjamin B Frey; Ashok A Deniz
Journal:  Chemistry       Date:  2019-02-07       Impact factor: 5.236

3.  Comparative roles of charge, π, and hydrophobic interactions in sequence-dependent phase separation of intrinsically disordered proteins.

Authors:  Suman Das; Yi-Hsuan Lin; Robert M Vernon; Julie D Forman-Kay; Hue Sun Chan
Journal:  Proc Natl Acad Sci U S A       Date:  2020-11-02       Impact factor: 11.205

4.  Explicit Representation of Cation-π Interactions in Force Fields with 1/r4 Nonbonded Terms.

Authors:  Aysegul Turupcu; Julian Tirado-Rives; William L Jorgensen
Journal:  J Chem Theory Comput       Date:  2020-10-13       Impact factor: 6.006

5.  Novel Deazaflavin Analogues Potently Inhibited Tyrosyl DNA Phosphodiesterase 2 (TDP2) and Strongly Sensitized Cancer Cells toward Treatment with Topoisomerase II (TOP2) Poison Etoposide.

Authors:  Jayakanth Kankanala; Carlos J A Ribeiro; Evgeny Kiselev; Azhar Ravji; Jessica Williams; Jiashu Xie; Hideki Aihara; Yves Pommier; Zhengqiang Wang
Journal:  J Med Chem       Date:  2019-04-30       Impact factor: 7.446

6.  Tau repeat regions contain conserved histidine residues that modulate microtubule-binding in response to changes in pH.

Authors:  Rabab A Charafeddine; Wilian A Cortopassi; Parnian Lak; Ruensern Tan; Richard J McKenney; Matthew P Jacobson; Diane L Barber; Torsten Wittmann
Journal:  J Biol Chem       Date:  2019-04-16       Impact factor: 5.157

7.  Mining therapeutic targets from the antibiotic-resistant Campylobacter coli and virtual screening of natural product inhibitors against its riboflavin synthase.

Authors:  Khurshid Jalal; Kanwal Khan; Ajmal Hayat; Diyar Ahmad; Ghallab Alotaibi; Reaz Uddin; Mutaib M Mashraqi; Ahmad Alzamami; Muhammad Aurongzeb; Zarrin Basharat
Journal:  Mol Divers       Date:  2022-06-14       Impact factor: 2.943

8.  A Sequential Umpolung/Enzymatic Dynamic Kinetic Resolution Strategy for the Synthesis of γ-Lactones.

Authors:  Mark A Maskeri; Malte L Schrader; Karl A Scheidt
Journal:  Chemistry       Date:  2020-04-24       Impact factor: 5.236

9.  Cation-π Interactions in the Benzylic Arylation of Toluenes with Bimetallic Catalysts.

Authors:  Sheng-Chun Sha; Sergei Tcyrulnikov; Minyan Li; Bowen Hu; Yue Fu; Marisa C Kozlowski; Patrick J Walsh
Journal:  J Am Chem Soc       Date:  2018-09-19       Impact factor: 15.419

10.  Discovery of Hydroxyamidine Derivatives as Highly Potent, Selective Indoleamine-2,3-dioxygenase 1 Inhibitors.

Authors:  Fangfang Jin; Qiyue Hu; Hongbo Fei; Hejun Lv; Shenglan Wang; Bin Gui; Junzhen Zhang; Wangyang Tu; Yun Zhang; Lei Zhang; Hong Wan; Limin Zhang; Bin Hu; Fanglong Yang; Chang Bai; Feng He; Lianshan Zhang; Weikang Tao
Journal:  ACS Med Chem Lett       Date:  2021-01-20       Impact factor: 4.345

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.