Literature DB >> 34693623

Se⋅⋅⋅O/S and S⋅⋅⋅O Chalcogen Bonds in Small Molecules and Proteins: A Combined CSD and PDB Study.

Himansu S Biswal1,2, Akshay Kumar Sahu1,2, Bartomeu Galmés3, Antonio Frontera3, Deepak Chopra4.   

Abstract

The importance of selenium-centered noncovalent chalcogen bonds represented as Se⋅⋅⋅A (A=O/S) has been explored for short directional contacts in small molecules and proteins. In addition, S⋅⋅⋅O centered contacts have been analyzed. Computational analyses involving the quantitative assessment of the associated energetics, the molecular electrostatic potentials (MEP), and electron density derived topological parameters, namely, quantum theory of atom in molecules (QTAIM) analyses, and NBO (natural bond orbital) based calculations, have been performed to unequivocally establish the strength, stability, and attractive role of chalcogen bonds in the solid-state. This investigation has been performed in molecules from both the Cambridge Structural Database (CSD) and Protein Data Bank (PDB). Thus futuristic materials may be designed keeping in mind the significance of these interactions, including their relevance in biology.
© 2021 The Authors. ChemBioChem published by Wiley-VCH GmbH.

Entities:  

Keywords:  CSD analysis; PDB analysis; chalcogen bonding; density functional calculations; sigma-hole interactions

Mesh:

Substances:

Year:  2021        PMID: 34693623      PMCID: PMC9298333          DOI: 10.1002/cbic.202100498

Source DB:  PubMed          Journal:  Chembiochem        ISSN: 1439-4227            Impact factor:   3.461


Introduction

The relevance of σ‐hole centered noncovalent interactions is now well recognized in the literature. Recent investigations establish the importance of halogen,[ , ] chalcogen, and tetrel bonding in molecules. This also has implications in reactivity profiles, namely catalysis in these classes of compounds. Selenium is an important main group element and is present in molecules that find application in materials, in drugs, and in proteins (containing selenocysteine) as well. Over the last few decades, several investigations on chalcogen bonded systems have been performed. 17O and 77Se NMR spectroscopic measurements have been utilized to probe intramolecular Se⋅⋅⋅O contact in substituted benzeneselenyl derivatives. The concept of molecular balances has been utilized to experimentally evaluate the chalcogen bonding interactions, wherein the role of solvent effects are important. In situ cryo‐crystallization technique has been utilized to probe the role of different chalcogen‐centered interactions that are present in the crystal packing of Ph2Se and Ph2Te. The directional preferences of non‐covalent contacts with sulfur, which also suggested the non‐spherical nature of sulfur atom, were established by studies done via a statistical analysis of the molecules from the Cambridge Structural Database. Nonbonded sulfur/ selenium‐oxygen interactions in thiazole and selenazole nucleosides have been evaluated from computational studies. This was followed by the work of Iwaoka and co‐workers who have investigated S⋅⋅⋅O contacts, and suggested that directionality plays a key role and their relevance in protein structures was also suggested. Quantitative molecular orbital analysis in chalcogen bonded D2Ch⋅⋅⋅A− [Ch=chalcogen, D, A=F, Cl, Br] complexes, establish that there exists significant covalent character on account of the strong HOMO‐LUMO interactions along with structural distortions arising from charge transfer from A‐ to D2Ch. In‐depth characterization of chalcogen interactions, in terms of geometrical changes at donor and acceptor atoms, including detailed topological analysis have also been done as well. Keeping in mind the features mentioned above, we have undertaken to perform a quantum mechanical study of noncovalent interactions involving Ch⋅⋅⋅Ch contacts, namely, Se⋅⋅⋅S, Se⋅⋅⋅O and S⋅⋅⋅O in both small molecules (from CSD) and proteins (from PDB). The contacts considered in the study involve the contact distance as the sum of the van der Waals radii of participating atoms with the directionality being near linear (≥160°). DFT‐based calculations were performed, and the results establish the stabilizing nature of these interactions, with the strength being “weak” to “moderately strong.” Furthermore, quantum theory of “atoms in molecules” (QTAIM), noncovalent interaction (NCI) index, and natural bond orbital (NBO) analyses have been performed to characterize these noncovalent interactions′ bonding features and provide quantitative insights into the role of orbital contributions towards the overall stabilization of the formed complexes.

Results and Discussion

Investigation of intermolecular interactions from the crystal structures in CSD

A search of organic X‐ray structures from the CSD (updated May 2021) was initially performed, limiting the searches to those CIFs containing 3D coordinates and “no errors”. Moreover, only those X‐ray structures with Ch⋅⋅⋅Ch (Ch=Se, S, and O) distances ≤ΣRvdw+0.2 Å and Y−Ch⋅⋅⋅Ch (Y=any atom) angle ≥140° were selected and further inspected manually and statistically (see ESI for further details) and the statistical plots are represented in Figure 1. From the above analyses, we found 6992, 514, and 340 hits for the S⋅⋅⋅O, Se⋅⋅⋅O and Se⋅⋅⋅S interactions, respectively. A detailed geometrical distribution for all Ch⋅⋅⋅Ch interactions is represented in Figure 1. An approximate Gaussian like distribution was observed for S/Se⋅⋅⋅O, showing a maximum population near 3.3 Å (Figure 1(a), (d)). The distance and angle distribution were simultaneously plotted in a radial density plot (Figure 1(c), (f)) evidencing that the maximum density is found around 165° for both O⋅⋅⋅S and O⋅⋅⋅Se interactions [∠O⋅⋅⋅S−C (Figure 1(b)), and ∠O⋅⋅⋅Se−C (Figure 1(e))].
Figure 1

(a) Distribution of distance between S and O, (b) cone corrected O⋅⋅⋅S−C angle distribution where the frequency is generated by counts divided by respective sin(θ), (c) radial density distribution plot where the distance between O and S is normalized with respect to the sum of their van der Waals radii ( ), the density bar normalized to the maximum count, (d) Distribution of distances between Se and O, (e) cone corrected O⋅⋅⋅Se−C angle distribution where the frequency is generated by counts divided by respective sin(θ), (f) radial density distribution plot where the distance between O and Se is normalized with respect to their sum of the van der Waals radii ( ), the density bar normalized to the maximum count, (g) Distribution of distances between Se and S, (h) cone corrected S⋅⋅⋅Se−C angle distribution where the frequency is generated by counts divided by respective sin(θ), (i) radial density distribution plot where the distance between S and Se is normalized with respect to their sum of the van der Waals radii ( ), the density bar normalized to the maximum count.

(a) Distribution of distance between S and O, (b) cone corrected O⋅⋅⋅S−C angle distribution where the frequency is generated by counts divided by respective sin(θ), (c) radial density distribution plot where the distance between O and S is normalized with respect to the sum of their van der Waals radii ( ), the density bar normalized to the maximum count, (d) Distribution of distances between Se and O, (e) cone corrected O⋅⋅⋅Se−C angle distribution where the frequency is generated by counts divided by respective sin(θ), (f) radial density distribution plot where the distance between O and Se is normalized with respect to their sum of the van der Waals radii ( ), the density bar normalized to the maximum count, (g) Distribution of distances between Se and S, (h) cone corrected S⋅⋅⋅Se−C angle distribution where the frequency is generated by counts divided by respective sin(θ), (i) radial density distribution plot where the distance between S and Se is normalized with respect to their sum of the van der Waals radii ( ), the density bar normalized to the maximum count. For the Se⋅⋅⋅S interaction, the geometrical distribution is given in Figures 1(g)–(h). The radial distribution (Figure 1(i)) is slightly different, showing two different zones where the density of hits is important. One intense region is found at around 160° (∠S⋅⋅⋅Se−C) similar to S⋅⋅⋅O and Se⋅⋅⋅O interactions and another one (the maximum) at a lower angle (∼140°). This different behavior is likely due to a mixing of two possible situations, one where the Se is the σ‐hole donor and the S‐atom is the σ‐lump donor and another one that corresponds to the reverse combination, where the S is the σ‐hole donor and the Se‐atom is the σ‐lump donor. From the CSD searches, we have selected several hits with high directionality and short Ch⋅⋅⋅Ch distance to analyze them energetically using DFT calculations, MEP analysis, and a combination of QTAIM and NCIplot index computational tools (see Scheme 1).Tables S1–S3 (ESI) reflect the interaction energies of ChB dimers extracted from the X‐ray structures, the MEP values at the Se/S ChB donors and O/S ChB acceptors, and the magnitudes of the electron density at the bond CPs connecting both the Ch atoms. From the examination of the results, several issues can be extracted. First, the interaction energies in neutral systems range from −3.2 to −14.9 kcal/mol. However, most of the Ch⋅⋅⋅Ch dimers present additional contacts, mostly H‐bonds (see QTAIM analyses represented in Figures S1 to S3). Second, the MEP values at the selenium‘s σ‐holes are quite different depending on the structure, ranging from just 1.5 kcal/mol in DUNSAG to 115 kcal/mol in FIDZUG (due to its cationic nature). As expected, the MEP values at the Se σ‐holes are more positive than those at the S σ‐holes in most cases. Finally, the MEP values at the O‐atoms are more negative than those at the S‐atoms, thus anticipating stronger Se⋅⋅⋅O interactions.
Scheme 1

Chemical diagrams of the selected hits from the CSD searches with the indication of the CSD codes.

Chemical diagrams of the selected hits from the CSD searches with the indication of the CSD codes. We have selected some representative hits of each CSD search based on those structures where the dimers present a minimum number of ancillary interactions (Scheme 1). Consequently, in those hits the ChB is most likely the dominant interaction. Figure 2 shows the MEP surfaces of the three selected Se‐bond donor molecules that participate in Se⋅⋅⋅O ChBs (see ESI for computational details). Moreover, Figure 3 shows the QTAIM/NCIplot index analyses of the dimers extracted from their solid‐state structures. The CSD reference code LEDGAD presents a six‐membered diselenacycle in the structure, where each Se‐atom exhibits two symmetric σ‐holes (+33.8 kcal/mol, see Figure 2a). In the solid‐state, it forms dimers stabilized by a combination of Se⋅⋅⋅O and O(lp)⋅⋅⋅π interactions, both characterized by the corresponding bond CPs and green NCI plot isosurfaces (see Figure 3a). The dimerization energy is moderately strong (−6.5 kcal/mol), confirming the importance of the ChB contact in the crystal packing. The O(lp)⋅⋅⋅π interacts between the electrostatically more negative oxygen with the electrostatically positive four‐membered ring indicating the attractive nature of the interaction in the NCI plot.
Figure 2

MEP surfaces of refcodes LEDGAD (a), MUSCIM (b), and RAKMAR (c). The MEP values at the σ‐holes are indicated in kcal/mol. See Scheme 1 for the chemical diagrams of the compounds.

Figure 3

Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04<(signλ2) ρ<0.04 a.u. of refcodes LEDGAD (a), MUSCIM (b) and RAKMAR (c). Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for antibonding sigma orbital.

MEP surfaces of refcodes LEDGAD (a), MUSCIM (b), and RAKMAR (c). The MEP values at the σ‐holes are indicated in kcal/mol. See Scheme 1 for the chemical diagrams of the compounds. Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04<(signλ2) ρ<0.04 a.u. of refcodes LEDGAD (a), MUSCIM (b) and RAKMAR (c). Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for antibonding sigma orbital. In the case of bis(methylseleno)cyclopropenone (MUSCIM), the Se‐atom exhibits two different σ‐holes, the one opposite to the cyclopropenone ring being the most positive (+25.7 kcal/mol see Figure 2b). Interestingly, the Se⋅⋅⋅O interactions observed in the solid‐state of MUSCIM corresponds to the most positive σ‐hole that interacts with the O‐atom of the adjacent molecule (see Figure 2b). The dimerization energy of MUSCIM is −3.7 kcal/mol, which is smaller than that in LEDGAD and is likely due to the strongest contribution of the O⋅⋅⋅π interaction in the latter. Finally, the MEP surface of Se,Se′‐diphenyl carbonodiselenoate (RAKMAR, see Figure 2c) structure presents three σ‐holes (MESP in the range of 5.7–6.1 kcal/mol, two σ‐holes merged into one) that are similar in energy. All of them participate in the dimer formation, establishing the bifurcated nature of Se⋅⋅⋅O and two Se⋅⋅⋅π ChBs (see Figure 2c). These interactions are characterized by bond CPs (red spheres) and bond paths connecting the Se‐atoms to the O‐atom of the adjacent molecule and two C‐atoms of the phenyl rings. The green NCI isosurfaces establish the attractive nature of the Se⋅⋅⋅O and Se⋅⋅⋅π interaction. The interaction energy is large (−8.4 kcal/mol) due to the cooperative formation of these three concurrent ChB interactions. For the sake of comparison, we have computed at the same level of theory, two prototypical H‐bonded complexes, that are the water dimer and the formamide dimer. For the water dimer (only one H‐bond, see Figure S4a) the interaction energy is −5.7 kcal/mol and for the self‐assembled dimer of formamide (two H‐bonds, see Figure S4b), the interaction energy is −14.4 kcal/mol. Therefore, the ChBs energies reported for the Se⋅⋅⋅O contacts are more or less comparable to those of the H‐bonds. In order to further evidence the existence of chalcogen bonding interaction and the importance of orbital effects, we have carried out the natural bond orbital analysis of the homodimers represented in Figure 3. We have focused the study on the second‐order perturbation analysis since it is useful to account for orbital donor‐acceptor interactions. The results (E (2) energies) are indicated in Figure 3, showing in all three cases an electron donation from the lone pair (LP) at the O‐atom to the antibonding (Se−C) orbital with concomitant stabilization energies that range from 0.60 kcal/mol in RAKMAR to 1.72 kcal/mol in MUSCIM. The existence of such electron donation is typical in σ‐hole interactions. For comparison purposes, we have also computed the E (2) energies in the prototypical H‐bonded dimers commented above (see Figure S4), which are significantly larger than those obtained for the ChBs reported herein. For instance, for the formamide dimer, each N−H⋅⋅⋅O H‐bonds exhibits an electron donation from the lone pair (LP) at the O‐atom to the antibonding (N−H) orbital with a consequent stabilization of the system in E (2)=−12.3 kcal/mol. Figure 4 shows the MEP surfaces of the two Ch‐bond donors selected to highlight the Se⋅⋅⋅S interactions. For bis(methylmercapto)selenane (GAJLAF, Figure 4a), two symmetric σ‐holes are present in the Se atom.
Figure 4

MEP surfaces of refcodes GAJLAF (a) and XASBID (b). The MEP values at the σ‐holes are indicated in kcal/mol. See Scheme 1 for the chemical diagrams of the compounds

MEP surfaces of refcodes GAJLAF (a) and XASBID (b). The MEP values at the σ‐holes are indicated in kcal/mol. See Scheme 1 for the chemical diagrams of the compounds This molecule in the solid‐state forms self‐assembled dimers (see Figure 5a) stabilized by two Se⋅⋅⋅S contacts characterized by the corresponding bond CP and green NCI plot index isosurface. The dimerization energy is −3.9 kcal/mol, thus revealing that each ChB is modest (around −2 kcal/mol), in line with the moderate σ‐hole MEP value. The second selected example corresponds to(Z,Z)‐1,5‐diphenyl‐3‐thia‐1,5‐diselena‐2,4‐diaza‐2,3‐pentadiene (refcode XASBID)that exhibits one large and positive σ‐hole opposite to the Se−N bond (+15 kcal/mol, see Figure 4b). This compound forms infinite 1D supramolecular chains in the solid‐state. One dimer extracted from this chain is represented in Figure 5b. It forms two symmetrically equivalent ChBs, and the dimerization energy is −7.6 kcal/mol, which is stronger than GAJLAF. This is in accordance with the more intense σ‐hole observed in the Se‐atom of XASBID. The presence of green isosurfaces in both the molecules GAJLAF and XASBID establishes the attractive nature of Se⋅⋅⋅S contacts in the solid‐state. In contrast, the orbital contribution [LP(S)→σ*(Se−Y), Y=N,S] is larger in GAJLAF (3.40 kcal/mol) than in XASBID (0.64 kcal/mol), likely due to the better directionality of the interaction in GAJLAF, allowing a better orbital overlap. In case of GAJLAF dimer, the interaction energy using the highly correlated MP4(SDTQ) method has been also calculated in order to give reliability to the DFT method. It can be observed that the difference between both methods is very small (0.3 kcal/mol) thus supporting the PBE0 energies included in the Figures and Tables S1 to S4 (ESI).
Figure 5

Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04<(signλ2) ρ<0.04 a.u. of refcodes GAJLAF (a) and XASBID (b). Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for antibonding sigma orbital.

Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04<(signλ2) ρ<0.04 a.u. of refcodes GAJLAF (a) and XASBID (b). Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for antibonding sigma orbital. Figure 6 shows the MEP surfaces of the two Ch‐bond donors used to highlight the S⋅⋅⋅O interactions. For sulfathiazole (ADEDIX, Figure 6a). The S‐atom exhibits a σ‐hole that is significantly positive (+24.5 kcal/mol). This molecule in the solid‐state forms self‐assembled dimers (see Figure 7a) stabilized by two S⋅⋅⋅O interactions and one S⋅⋅⋅S contact that is expected to be very weak due to its poor directionality. Each interaction is characterized by the corresponding bond CP and green NCI plot index isosurface that establishes the attractive nature of the S⋅⋅⋅O interaction. The dimerization energy is large −12.3 kcal/mol, in accordance with the large MEP value at the σ‐hole. The second selected example corresponds to 4‐hydroxy‐1,2‐dithiolane 1,1‐dioxide (refcode ADOFEF), exhibiting two quite different σ‐holes (+21.9 and +10.6 kcal/mol, see Figure 6b), where the most positive one is the opposite to the ‐SO2 electron‐withdrawing group. This compound also forms self‐assembled dimers in the solid‐state (see Figure 7b) where two S⋅⋅⋅O and one O⋅⋅⋅O contacts are present. In this case, the dimerization energy is modest (−4.2 kcal/mol) because the relatively less positive σ‐hole at the S‐atom is involved in the interaction. However, the orbital contribution is similar in both compounds that exhibit electron donation from the LP orbital at the O‐atom to the antibonding σ*(S−C) orbital with concomitant stabilization energies of 2.10 and 2.25 kcal/mol for ADEDIX and ADOFEF, respectively.
Figure 6

MEP surfaces of refcodes ADEDIX (a) and ADOFEF (b). The MEP values at the σ‐holes are indicated in kcal/mol. See Scheme 1 for the chemical diagrams of the compounds.

Figure 7

Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04<(signλ2) ρ<0.04 a.u. of refcodes ADEDIX (a) and ADOFEF (b). Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained by using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for sigma antibonding orbital.

MEP surfaces of refcodes ADEDIX (a) and ADOFEF (b). The MEP values at the σ‐holes are indicated in kcal/mol. See Scheme 1 for the chemical diagrams of the compounds. Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04<(signλ2) ρ<0.04 a.u. of refcodes ADEDIX (a) and ADOFEF (b). Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained by using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for sigma antibonding orbital. In order to further assess the role of the ChBs in the crystal packing, we have fully optimized two structures, which are represented in Figure 8 along with the dimerization energies. The optimized dimer of refcode GAJLAF shows a very similar geometry compared to the one retrieved from the X‐ray structure (see Figure 5a). The Se⋅⋅⋅S distance is slightly longer in the optimized structure (3.69 Å) than the experimental one (3.50 Å) likely due to packing effects. The dimerization energy (−4.2 kcal/mol) is also very similar to the one obtained using the X‐ray coordinates (−3.9 kcal/mol). Similar results have been found for MUSCIM dimer. In this case the fully optimized geometry exhibits a shorter H⋅⋅⋅O distance and longer Se⋅⋅⋅O distance compared to the X‐ray dimer (values in red).
Figure 8

Fully optimized geometries of GAJLAF (a) and MUSCIM (b) dimers at the PBE0‐D3/def2‐TZVP level of theory. Distances in Å, in black the optimized ones and in red the experimental ones.

Fully optimized geometries of GAJLAF (a) and MUSCIM (b) dimers at the PBE0‐D3/def2‐TZVP level of theory. Distances in Å, in black the optimized ones and in red the experimental ones. The dimerization energy is identical to the one obtained for the X‐ray structure, thus suggesting a compensating effect between the ChB and HB in the theoretical geometry with respect to the experimental one. The fact that the assemblies and ChBs are similar in the fully optimized and X‐ray geometries supports the relevance and structure guiding role of the ChBs in the solid state.

Investigation of intermolecular interactions from the crystal structures in PDB

The home‐written python code was used to analyze the 13178 PDB files downloaded from the RCSB website (October 2020 release) [Resolution ≤1.5 Å]. The criteria used to look into the interactions in PDB files were as follows: (a) the distances between S and O: 2 Å The lower limit of the distance was kept at 2 Å for S⋅⋅⋅O, Se⋅⋅⋅O and 2.5 Å for Se⋅⋅⋅S, which is greater than their respective sum of their covalent radius. The covalent radius for O, S, and Se was considered to be 0.66 Å, 1.05 Å, and 1.20 Å, respectively. The upper limits of the distances for S⋅⋅⋅O and Se⋅⋅⋅O were less than their respective sum of the van der Waals radius, whereas the distance for Se⋅⋅⋅S is kept less than 4.07 Å, which is 10 % higher than their sum of the van der Waals radii. The secondary structure analysis of the interacting residues was performed using the software STRIDE. The PDB files with the angles ∠C−S⋅⋅⋅O, ∠S−Se⋅⋅⋅O/S ≥160° were chosen for model systems. We considered i−1, i, i+1 residues from the selected PDB files where i is the interacting residue. The Cα position of the C‐terminus (i−1) and N‐terminus (i+1) residue was replaced with a methyl group. A detailed geometrical distribution for S⋅⋅⋅O and Se⋅⋅⋅O interactions are represented in Figure 9. A Gaussian like distribution was observed for S⋅⋅⋅O, which shows that the maximum population is present near 3.3 Å (Figure 9a). The distance and angle distribution were simultaneously plotted in a radial density plot (Figure 9b). The presence of maximum density around 165° for ∠O⋅⋅⋅S−C confirms the directional nature of the interaction. From analysis of the secondary structures, it was observed that these interactions are present in all secondary structures. However, the O atoms preferred the turn (25.37 %) secondary structure followed by strand (24.32 %), coil (22.41 %), α‐helix (19.63 %), 310‐helix (6.54 %), and bridge (1.73 %) (Figure 9c) structures, respectively.
Figure 9

(a) Distribution of distance between S and O, (b) radial density distribution plot where the distance between O and S is normalized with respect to the sum of their van der Waals radii ( ), the density bar normalized to the maximum count, (c) the distribution of secondary structures of O atom in O⋅⋅⋅S interaction, (d) distribution of distances between Se and O, (e) radial density distribution plot where the distance between O and Se is normalized with respect to their sum of the van der Waals radii ( ), the density bar normalized to the maximum count (f) the distribution of secondary structures of O atom in Se⋅⋅⋅O interaction.

(a) Distribution of distance between S and O, (b) radial density distribution plot where the distance between O and S is normalized with respect to the sum of their van der Waals radii ( ), the density bar normalized to the maximum count, (c) the distribution of secondary structures of O atom in O⋅⋅⋅S interaction, (d) distribution of distances between Se and O, (e) radial density distribution plot where the distance between O and Se is normalized with respect to their sum of the van der Waals radii ( ), the density bar normalized to the maximum count (f) the distribution of secondary structures of O atom in Se⋅⋅⋅O interaction. From the above analysis, we found 1689, 50, and 10 hits for the S⋅⋅⋅O, Se⋅⋅⋅O and Se⋅⋅⋅S interactions, respectively. Similarly, the Se⋅⋅⋅O distance is maximum near 3.3 Å (Figure 8d). Nevertheless, the maximum density around 175° for ∠O⋅⋅⋅Se−C in radial distribution plot showed the highly directional nature of the interaction (Figure 9e). The O atom of O⋅⋅⋅Se interactions is present in all secondary structures as well but preferred the coil (30.6 %) secondary structure followed by strand (22.4 %), α‐helix (18.4 %), 310‐helix (14.3 %), turn (10.2 %) and bridge (4.1 %) (Figure 9f) structures respectively. Tables S4 and S5 depict the interaction energies of the chalcogen bonding adducts, the MEP values at the Se/S and O/S in the monomers, and the electron density values at the bond CPs of the examples extracted from the PDB searches. The inspection of the results included in Tables S4 and S5 reveals that (i) the interaction energies in neutral systems range from −0.8 to −7.8 kcal/mol, where most of the dimers present additional contacts (see Figures S5 and S6); (ii) the MEP values at the selenium‘s σ‐holes are modest ranging from 5.3 kcal/mol in 4IRX to 13.8 kcal/mol in 3CHV. This is reasonable taking into consideration that the Se is not bonded to electron‐withdrawing groups; (iii) the MEP values at the S‐atoms acting as electron acceptors (σ‐hole donors) in the S⋅⋅⋅O contacts are in most cases negative at the extension of the C−S bonds, indicating the absence of positive σ‐hole in most of the hits. Therefore, the favorable interaction energies are likely due to the additional contacts and are not due to the ChB itself, and (iv) the MEP values at the O‐atoms are more negative than those at the S‐atoms (acting as electron donors), apart from 3CHV, because the electron donor is cationic. We have also selected some representative hits of both PDB searches based on those structures where the model dimers presented the lowest ancillary interactions. Consequently, in those hits the ChB interaction is dominant. Figure 10 shows the MEP surfaces of two selected Se‐bond donor molecules that participate in Se⋅⋅⋅S ChBs, and Figure 10 shows the QTAIM/NCI plot index analyses of the dimers. The model of the PDB code 3S9X presents selenomethionine in the structure, where the Se‐atom exhibits a modest σ‐hole (+6.3 kcal/mol, see Figure 9a). In the X‐ray structure, this Se atom is in short contact with the S‐atom of a nearby cysteine amino acid, forming a ChB that is characterized by the corresponding bond CPs and green NCI plot isosurfaces (see Figure 11a). The dimerization energy is very small (−0.8 kcal/mol) and is likely due to the poor directionality of this interaction since the lone pair at the electron donor S‐atom does not point toward the selenium‘s σ‐hole. In the case of 3DSB, the σ‐hole that participates in the interaction is the opposite of the methyl group, which shows a slightly weaker σ‐hole (+4.7 kcal/mol, see Figure 10b). This amino acid also forms a ChB with a nearby cysteine and the computed interaction energy −3.8 kcal/mol. It is greater (in absolute value) than that of the model of 3S9X because the lone pair at the S‐atoms points to the σ‐hole with better directionality. This explanation agrees well with the NBO analysis results that show a larger orbital contribution for 3DSB (0.95 kcal/mol) compared to 3S9X (0.53 kcal/mol).
Figure 10

MEP surfaces of the models of the PDB codes 3S9X (a) and 3DSB (b). The MEP values at the σ‐holes are indicated in kcal/mol.

Figure 11

Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04 <(signλ2) ρ<0.04 a.u. of the models of the PDB codes 3S9X (a) and 3DSB (b). Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained by using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for sigma antibonding orbital.

MEP surfaces of the models of the PDB codes 3S9X (a) and 3DSB (b). The MEP values at the σ‐holes are indicated in kcal/mol. Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04 <(signλ2) ρ<0.04 a.u. of the models of the PDB codes 3S9X (a) and 3DSB (b). Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained by using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for sigma antibonding orbital. Figure 12a shows the MEP surface of the unique Ch‐bond donors that presents a positive MEP value at the S‐atom (see Table S5), which is a cysteine residue of PDB code 1GT9. The S‐atom exhibits a σ‐hole that is very small but positive (+1.7 kcal/mol) partially due to the influence of the adjacent S−H bond. This residue interacts with an O‐atom of the protein backbone in the X‐ray structure of 1GT9. The interaction is characterized by the corresponding bond CP and green NCI plot index isosurface connecting the S to the O‐atom (see Figure 12b). The interaction energy is small −3.2 kcal/mol, which is in accordance with the small MEP value at the σ‐hole. The NBO analysis reveals an orbital contribution from the LP at the O‐atom to the antibonding σ*(C−S) that is 1.01 kcal/mol, further confirming the existence of the ChB interaction in the model of 1GT9.
Figure 12

(a) MEP surfaces of the model of PDB code 1GT9. The MEP values at the σ‐hole are indicated in kcal/mol. (b) Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04<(signλ2) ρ<0.04 a.u. of the model of PDB code 1GT9. Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained by using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for sigma antibonding orbital.

(a) MEP surfaces of the model of PDB code 1GT9. The MEP values at the σ‐hole are indicated in kcal/mol. (b) Combined QTAIM (bond and red critical points represented as red and ring spheres, respectively) and NCIPlot (ρ cut‐off 0.04 a.u., |RGD|=0.5, color range −0.04<(signλ2) ρ<0.04 a.u. of the model of PDB code 1GT9. Only intermolecular interactions are represented for clarity. The orbital stabilization energies obtained by using the second‐order perturbation analysis are also indicated along with the orbital involved. LP stands for lone pair and σ* for sigma antibonding orbital.

Conclusions

The results obtained from a systematic analysis of the molecules in the CSD and PDB bring out the relevance of chalcogen‐centered Se⋅⋅⋅O, Se⋅⋅⋅S, and S⋅⋅⋅O interactions in the solid‐state. The stabilization energies for the dimers are in the range of −2.3 to −7.8 kcal/mol, −0.8 to −3.8 kcal/mol, and −0.7 to −14.1 kcal/mol for Se⋅⋅⋅O, Se⋅⋅⋅S and S⋅⋅⋅O chalcogen bonds in PDB. The corresponding magnitudes, i. e. the stabilization energies for molecules in CSD are −3.7 to −78.0 kcal/mol, −3.8 to −13.2 kcal/mol and −3.6 to −14.9 kcal/mol for the above‐mentioned interactions in CSD. The second‐order interaction energies are in the range of 0.6 to 3.4 kcal/mol for lone pair donation from O/S to σ* antibonding orbital of C−Se bond, and 2.10 to 2.24 kcal/mol for lone pair donation from O to the σ* antibonding orbital of the C−S bond in case of small molecules from CSD. The corresponding values are in the range of 0.53 to 0.95 kcal/mol for lone pair donation from S to σ* antibonding orbital of C−Se bond and 1.01 kcal/mol for lone pair donation from O to σ* antibonding orbital of the C−S bond in case of molecules in PDB. The MEP analysis clearly depicts the presence of the σ‐holes on the chalcogen atoms. The weak attractive nature of these chalcogen bonds was established from the NCI plot visualization index. Such studies establish the significance of these interactions unequivocally and are relevant in the design of materials related to applications in energy and medicine.

Experimental Section

Computational methods

The energies of all assemblies included in this study were computed at the PBE0‐D3/def2‐TZVP level of theory by using the program TURBOMOLE v7.0. For the calculations we have used the Weigend def2‐TZVP[ , ] basis set and the PBE0 DFT functional. The terms interaction energy and binding energy have been used indistinctly in this manuscript and calculated as the energy difference between the complex and the sum of the energies of the monomers. The molecular electrostatic potential surfaces have been plotted using the 0.001 a.u. isosurface using Gaussian‐16 software at the PBE1PBE‐D3/def2‐TZVP level of theory. The PBE1PBE is the notation used by Gaussian‐16 for the hybrid functional PBE0. The MP4SDQT/def2‐TZVP ab initio calculations were performed using Gaussian‐16. The QTAIM method has been used to obtain the critical points and bond paths via analysis of the topology of the electron density,[ , ] using the PBE1PBE‐D3/def2‐TZVP wavefuntion and the AIMALL program. The NCIPLOT is a computational index for visualization and identification of NCIs in real space. The noncovalent contacts are identified with the peaks that emerge in the reduced density gradient (RDG) at low densities. These are plotted by mapping an isosurface of s (s=|∇ρ|/ρ4/3) for a low value of RDG. When a supramolecular dimer is formed, the RDG changes at the CPs in between the monomers due to the annihilation of the density gradient at these CPs. Therefore, this index visualizes the extent to which NCIs stabilize a supramolecular assembly qualitatively and reveals which molecular regions interact. The color code is red‐yellow‐green‐blue where the red and yellow colors are used for strong and weak repulsive (ρcut +), respectively and the blue and green colors for strong and weak attractive (ρcut −) forces, respectively. The Natural Bond Orbital analysis of orbital donor‐acceptor interactions has been performed at the same level using the NBO 3.1 program.

Cambridge Structural Database (CSD) study (small molecules)

The Cambridge Structure Database (CSD) version 5.42 with updates until May 2021 was inspected using ConQuest version 2021.1.0 (build 319319). All queries were limited to crystal structures that contained 3D coordinates and had no errors. Only those .cif files with Ch⋅⋅⋅Ch (Ch=Se, S and O) distances≤than the sum of van der Waals radii +0.2 Å and C−Ch⋅⋅⋅Ch angle ≥140° were selected and further inspected manually and statistically. To avoid H‐bonded hits, no H‐atoms were allowed to be bonded to O and S atoms. Finally, “only organics” option was chosen to eliminate metal bridged hits and related structures.

Conflict of interest

The authors declare no conflict of interest. As a service to our authors and readers, this journal provides supporting information supplied by the authors. Such materials are peer reviewed and may be re‐organized for online delivery, but are not copy‐edited or typeset. Technical support issues arising from supporting information (other than missing files) should be addressed to the authors. Supporting Information Click here for additional data file.
  27 in total

1.  STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins.

Authors:  Matthias Heinig; Dmitrij Frishman
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  Accurate Coulomb-fitting basis sets for H to Rn.

Authors:  Florian Weigend
Journal:  Phys Chem Chem Phys       Date:  2006-01-03       Impact factor: 3.676

3.  Characterization of chalcogen bonding interactions via an in-depth conceptual quantum chemical analysis.

Authors:  Freija De Vleeschouwer; Mats Denayer; Balazs Pinter; Paul Geerlings; Frank De Proft
Journal:  J Comput Chem       Date:  2017-11-10       Impact factor: 3.376

4.  The Origin of Chalcogen-Bonding Interactions.

Authors:  Dominic J Pascoe; Kenneth B Ling; Scott L Cockroft
Journal:  J Am Chem Soc       Date:  2017-10-17       Impact factor: 15.419

Review 5.  Selenocysteine-containing proteins in mammals.

Authors:  V N Gladyshev; D L Hatfield
Journal:  J Biomed Sci       Date:  1999 May-Jun       Impact factor: 8.410

6.  The Chalcogen Bond in Crystalline Solids: A World Parallel to Halogen Bond.

Authors:  Patrick Scilabra; Giancarlo Terraneo; Giuseppe Resnati
Journal:  Acc Chem Res       Date:  2019-05-13       Impact factor: 22.384

7.  Selenium chalcogen bonds are involved in protein-carbohydrate recognition: a combined PDB and theoretical study.

Authors:  Jose A Fernández Riveras; Antonio Frontera; Antonio Bauzá
Journal:  Phys Chem Chem Phys       Date:  2021-08-10       Impact factor: 3.676

8.  Noncovalent Carbon-Bonding Interactions in Proteins.

Authors:  V Rao Mundlapati; Dipak Kumar Sahoo; Suman Bhaumik; Subhrakant Jena; Amol Chandrakar; Himansu S Biswal
Journal:  Angew Chem Int Ed Engl       Date:  2018-11-12       Impact factor: 15.336

9.  A Quantitative Molecular Orbital Perspective of the Chalcogen Bond.

Authors:  Lucas de Azevedo Santos; Stephanie C C van der Lubbe; Trevor A Hamlin; Teodorico C Ramalho; F Matthias Bickelhaupt
Journal:  ChemistryOpen       Date:  2021-02-17       Impact factor: 2.630

View more
  1 in total

1.  Se⋅⋅⋅O/S and S⋅⋅⋅O Chalcogen Bonds in Small Molecules and Proteins: A Combined CSD and PDB Study.

Authors:  Himansu S Biswal; Akshay Kumar Sahu; Bartomeu Galmés; Antonio Frontera; Deepak Chopra
Journal:  Chembiochem       Date:  2021-11-05       Impact factor: 3.461

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.