Literature DB >> 35571354

Packing Preferences of Chalcones: A Model Conjugated Pharmaceutical Scaffold.

Louise S Price1, Sarah L Price1.   

Abstract

We sought the crystal packing preferences of the chalcone scaffold by analyzing 232 single-component crystal structures of chalcones with a small (six or fewer non-hydrogen atoms) substituent on either or both rings, including the unsubstituted molecule. This covers 216 molecules, as some are polymorphic, and 277 independent molecular conformations, as 16% of the crystal structures have more than one symmetry independent molecule. Quantum mechanical conformational profiles of the unsubstituted molecule and the almost 5000 crystal structures within 20 kJ mol-1 of the global minimum generated in a crystal structure prediction (CSP) study have been used to complement this analysis. Although π conjugation would be expected to favor a planar molecule, there are a significant number of crystal structures containing nonplanar molecules with an approximately 50° angle between the aromatic rings. The relative orientations of the molecules in the inversion-related dimers and translation-related dimers in the experimental crystal structures show the same trends as in the CSP-generated structures for the unsubstituted molecule, allowing for the substituent making the side-to-side distances larger. There is no type of dimer geometry associated with particularly favorable lattice energies for the chalcone core. Less than a third of the experimental structures show a face-to-face contact associated with π···π stacking. Analysis of the experimental crystal structures with XPac and Mercury finds various pairs of isostructural crystals, but the largest isostructural set had only 15 structures, with all substituents (mainly halogens) in the para position. The most common one-dimensional motif, found in half of the experimental crystal structures, is a translation-related side-to-side packing, which can be adopted by all the observed conformations. This close-packed motif can be adopted by chalcones with a particularly wide variety of substituents as the substituents are at the periphery. Thus, although the crystal structures of the substituted chalcones show thermodynamically plausible packings of the chalcone scaffold, there is little evidence for any crystal engineering principle of preferred chalcone scaffold packing beyond close packing of the specific molecule.
© 2022 American Chemical Society.

Entities:  

Year:  2022        PMID: 35571354      PMCID: PMC9097456          DOI: 10.1021/acs.cgd.1c01381

Source DB:  PubMed          Journal:  Cryst Growth Des        ISSN: 1528-7483            Impact factor:   4.010


Introduction

The expanding field of crystal engineering aims to aid the design of new functional organic materials through understanding the preferred intermolecular contacts.[1−7] There are many well-known trends, such as the preference of carboxylic acids to form the R22(8) hydrogen-bonded dimer,[8] but this does not define how this dimer packs to form the extended crystal structure, and the preference can be weak, as tetrolic acid shows this motif in one polymorph and a hydrogen-bonded chain in another.[9] When there are multiple hydrogen-bonding donors and acceptors in a molecule, then Etter’s rule that the strongest donor pairs with the strongest acceptor[10] is not always reliable. It can be refined by taking into account the bonding environment of the donors and acceptors in calculating the hydrogen-bond propensity[11] from known crystal structures in the Cambridge Structural Database (CSD).[12] Exceptions may be understood by looking at whether steric factors prevent the formation of an anticipated hydrogen bond using Full Interaction Maps,[13,14] again based on CSD data. Hence, even the preferred hydrogen-bond pairing in the crystal structure is not always readily predicted.[11,15,16] Hydrogen bonding may be in competition with other strong interactions identified with only a few substituent atoms, such as halogen bonding. The competition between hydrogen and halogen bonds[17] is affected by substituent position[18] and better rationalized by the electrostatic potential around the molecules (i.e., taking into account the covalent bonding environment within each molecule).[19] Our knowledge of the competitive strength of different intermolecular interactions identified by small substituents (e.g., -NH2, OH, F, Cl, Br, I, C≡N) can be estimated from crystal structures in the CSD, but the data often need supplementing by systematic competitive studies.[20] The situation is even less clear for aromatic groups. Benzene rings prefer a T-shaped or offset parallel geometry because the electrostatic effects of the π electrons destabilize the simple stacked geometry favored by the dispersion interactions.[21] However, this preference is readily changed by substituents, e.g., C6H6/C6F6 forms columns,[22] or the introduction of heteroatoms into the π system that results in changes to the σ framework that interact favorably with the π system.[21] The π···π interactions between two aromatic species are broadly classified by geometry into three categories: edge-to-face T-shape, parallel displaced, and cofacial parallel stacked.[23] The small, unsubstituted aromatic compounds prefer edge-to-face T-shaped geometry, whereas substituted and large multiring aromatic compounds prefer parallel displaced geometry. Cofacial parallel stacked geometry is rather rarely observed.[23] However, as many optoelectronic functional materials are based on the stacking of the π systems,[24] there is huge interest in developing strategies for controlling π···π stacking and its influence on charge transport properties.[25] The terms π stacking and π···π interactions persist for their relationships to many important properties, despite the arguments that these terms should no longer be used.[26] Thus, the complex balance of the different types of intermolecular interaction (hydrogen bonding, halogen bonding, π···π, etc.) and the variety of molecular shapes that can be adopted by flexible molecules usually has to be evaluated by computer in a Crystal Structure Prediction (CSP) study.[27−29] The results are very specific to the molecule but can often show that there are a variety of different packings, often with different hydrogen-bonding motifs and molecular conformations, that are very similar in energy. These competitive structures are different compromises between the geometries of the named atom–atom intermolecular interactions[30] and the requirement for close packing to optimize the dispersion energy.[31,32] Indeed, this compromise can result in the first coordination shell including a destabilizing molecule–molecule interaction.[33] Instead of building up the crystal structure from the interactions between different atoms or small functional groups, it is desirable to consider the packing of a molecular core or scaffold, expecting that molecules that differ only in small substituents should have similarities in their crystal packing. Such a series of molecules may arise from a drug or functional materials discovery program, where the core has desired functionality that can be tuned by varying the substituents. Being able to find isostructural crystal structures of related molecules is often of interest as a route to tunable solid solutions[34] or templating the first nucleation of a desired polymorph.[35] There is an increasing number of analyses of large sets of crystal structures of molecules with a large common component, such as the analysis of mandelic acids[36] and 4,4′ benzenesulfonamidobenzenes.[37] Such analyses are limited to the members of the series of related molecules that crystallize readily and well enough for structure determination. (Unfortunately there are only few published systematic studies as to the factors which determine whether a series of molecules will crystallize readily, badly, or not at all, for example, a study of acylanilide structures.[38]) The experimental structures of a set of related molecules can be complemented by CSP-generated structures, as has been done to understand the conductivity of chiral [6]-helicene crystals[6] helicene molecule. Nanoscale. 2018 ">39] and the packing preferences of quinoxalines.[40] One family of molecules that require such analysis is the chalcones ((2E)-1,3-diphenylprop-2-en-1-ones, Figure ). This scaffold is pharmacologically significant[41] because of the biological activity of chalcones, with some being antioxidants,[42] cytotoxic, antimicrobial, anticancer,[43,44] anti-inflammatory,[45] and antibacterial.[46,47] Two clinically approved chalcones are Metochalcone, a choleretic drug, and Sofalcone which is both an antiulcer and mucoprotective drug.[48] Chalcones are used as agrochemicals for the prevention of fungal/insect infestation,[49] as well as for viral prevention.[50] It is thought that the main active functional group of these compounds is the α,β unsaturated ketone.[51] The overlap of the π molecular orbitals of some chalcones leads to a variety of optical properties. Indeed, the name is derived from the Greek word for copper, chalcos, with allusion to the red color of some chalcone derivatives. The bright colors of fruits and vegetables are due in part to the flavonoids, derived from chalcones. Some chalcone derivatives are polymorphic, with different colors exhibited by different polymorphs,[52] arising from the packing differences. Chalcones are also used as fluorescent probes in imaging, such as a library of dialkyaminochalcones,[53] many of which showed high fluorescence in DMSO, and a sharp structure–activity relationship in cellular cytotoxicity. A chalcone amide library,[54] with an amido and an amino on each side of the scaffold as electron donors, was also produced to provide a range of fluorescent probes. Thus, the chalcone chemical scaffold shown in Figure is present in a wide variety of crystal structures in the Cambridge Structural Database. A recent collaboration has generated a significant number of additional crystal structures with small substituents to aid this investigation (Supporting Information, Table S2). Nonetheless, difficulties have been reported in crystallizing chalcones with multiple methoxy substituents[55] and some chalcones that required laser-assisted crystallization.[56]
Figure 1

(a) The molecular diagram of the unsubstituted chalcone, with the atomic numbering used in this work. The main torsion angles are marked, with θ1 defined by C4–C1–C2=C3, θ2 defined by C5–C4–C1–C2, and θ3 defined by C2=C3–C10–C11. C5 and C11 are chosen so that θ2 and θ3 are closer to 180° than 0°. (b) The codes used in this work for the most common substituents. See Table S1 in the Supporting Information for the full list. The systematic naming of molecules studied in this work is XiYj, where X denotes the substituent on the 1-ring as a letter, Y denotes the substituent on the 3-ring as a number, and i,j denote whether the substituents on the respective rings are in the ortho, meta, or para positions.

(a) The molecular diagram of the unsubstituted chalcone, with the atomic numbering used in this work. The main torsion angles are marked, with θ1 defined by C4–C1–C2=C3, θ2 defined by C5–C4–C1–C2, and θ3 defined by C2=C3–C10–C11. C5 and C11 are chosen so that θ2 and θ3 are closer to 180° than 0°. (b) The codes used in this work for the most common substituents. See Table S1 in the Supporting Information for the full list. The systematic naming of molecules studied in this work is XiYj, where X denotes the substituent on the 1-ring as a letter, Y denotes the substituent on the 3-ring as a number, and i,j denote whether the substituents on the respective rings are in the ortho, meta, or para positions. The unsubstituted chalcone ((2E)-1,3-diphenylprop-2-en-1-one, Figure ) has only one heteroatom and so cannot form any traditional hydrogen bonds, and only a limited number of C–H···O hydrogen bonds, but the two phenyl rings will give rise to aromatic interactions. The molecule’s two aromatic rings are linked by the enone group and should display some degree of conjugation throughout the molecule, with the fully planar conformation showing the maximum conjugation. Thus, we might expect the crystal structures of chalcones to show π-stacks of planar molecules. However, such stacks are not seen in either of the polymorphs of the unsubstituted chalcone. In this paper, we have carried out a systematic review of the molecular conformations and crystal structures of a few hundred chalcones with one relatively small substituent (phenyl or smaller) on one or both rings. These analyses are contrasted with the structures for the unsubstituted chalcone generated by a CSP study, to see the extent to which the packing preferences of the core are reflected in the structures of the substituted molecules. The analysis compares first the conformations and then the coordination environments, looking both at pairs of molecules cut from the crystal structure (dimers) and the coordination using the Crystal Packing Similarity tool in Mercury[57] to establish how many molecules of the 20 molecule cluster can be overlaid and the optimum overlay (RMSD). Extended motifs are analyzed using XPac[58] pairwise comparisons for seeking 1- to 3-D similarity. Crystal Packing Similarity and XPac are used to establish the isostructural families of chalcone crystal structures. This novel combination of complementary analyses and use of CSP could be applied to other families of molecules.

Experimental Section

Naming Convention

The naming convention for molecules studied in this work is described in Figure .

Data Set of Chalcone Experimental Structures

Experimental structures from the 2019 Crystal Structure Database (CSD),[12,59] including Feb, May, and Aug 2019 updates, and recent structures determined by collaborators, were included in the conformational and structural analysis. A search of the CSD was carried out to find all structures containing the 18 atoms of Figure a (i.e., not including ring substituents), where 3-D coordinates were available, discounting any structures containing ions, organometallics, or elements heavier than iodine. This was manually refined to remove zwitterions, multicomponent systems, substituents with more than six non-hydrogen atoms, and molecules with more than one substituent on either ring. This data set was augmented by recently solved structures from our collaborators’ groups. The full criteria for inclusion are given in the Supporting Information, Section S2. A small number of structures were disordered or did not have hydrogen atoms located, and these were edited manually as necessary (see Supporting Information, Section S2), adding hydrogen atoms and removing minor disorder components, to allow automatic analysis of geometries.

CSP Methodology

Conformational analysis of the unsubstituted chalcone backbone was carried out in GAUSSIAN,[60] at PBE0, B3LYP, MP2, and PBE levels of theory with the 6-31G(d,p) basis set. CrystalPredictor_2.2[61] was used as the search algorithm. Two separate Z′ = 1 searches were carried out, first with a flexible molecule where θ1, θ2, and θ3 were allowed to vary, with θ1 around 180° (denoted Region A), and second with a rigid gas phase optimized molecule with θ1 close to −30° (denoted Region B). Intramolecular energy and point charges were evaluated at the PBE0/6-31G(d,p) level of theory, and the FIT empirical exp-6 repulsion-dispersion parameters were used.[62,63] CrystalOptimizer_2.4.7[64] was used to refine the structures, with DMACRYS_2.3.0.[65] The intramolecular energy at the PBE0/6-31G(d,p) level of theory was evaluated using GAUSSIAN,[60] and the distributed multipoles that represent this charge density were evaluated using GDMA_2.2.[66] The intermolecular lattice energy was evaluated from the distributed multipoles and an empirical repulsion-dispersion potential with the FIT parameters.[62,63] In addition to the three torsion angles shown in Figure , two further torsion angles and six bond angles around the center of the molecule were allowed to change in response to the packing forces as the crystal structure was optimized. Duplicate structures were removed according to similarities in the powder patterns and coordination environment overlay, using RMSD15 or RMSD30 for structures in the same or different space groups, respectively. Any structures that DMACRYS indicated were high-symmetry saddle points between two lower symmetry structures were reminimized with CrystalOptimizer in the lower symmetry subgroup. Full details of the generation of the CSP crystal structures are in the Supporting Information, Section S5.

Geometric Analyses

The crystal structures generated in the CSP and those in the Experimental Set were inspected using the CCDC Python API on a Linux cluster. This allowed measurement of geometric parameters within the molecules and between pairs of molecules in each crystal structure. The full definition of all parameters measured and the Python code are given in the Supporting Information, Section S7.

Molecular Parameters

The most significant conformational parameters (Figure ) are θ1 (C4–C1–C2=C3), the central torsion angle of the molecule, and the phenyl torsion angles, θ2 (C5–C4–C1–C2) and θ3 (C2=C3–C10–C11). To give an assessment of the overall planarity of the molecule, the dihedral angle C8–C5–C11–C14 was also measured and is termed planar1.

Intermolecular Parameters

The intermolecular contacts for all pairs of molecules in van der Waals contact were analyzed. The distance between pairs of molecules was measured as C2···C2′ (defined as D1), and we started the analysis with all pairs of molecules whose D1 separation was less than 30 Å for the experimental structures and less than 20 Å for CSP structures. Further geometric parameters were measured (see Supporting Information, Section S7), including an intermolecular torsion angle, C12–C2···C2′–C12′, which was used to define contacts as inversion or translation if this was exactly 180° or 0°, respectively, and these were analyzed in detail. For each inversion or translation contact, an average plane through each molecule was defined, and the separation of these planes measured as S1. As the C2–H1 and C2–C12 directions and the interplanar separation are approximately orthogonal (see Section S7.3.1 of the Supporting Information for testing of this assumption), the component of D1 parallel to C2–C12 was defined as S2, and the component of D1 parallel to C2–H1 was defined as S3 (Figure ).
Figure 2

Illustration of the three quantities S1, S2, and S3 for translation-related molecules. For each dimer pair, typical values for S1, S2, and S3 are given for closest approach of the colored molecules with respect to the black molecule.

Illustration of the three quantities S1, S2, and S3 for translation-related molecules. For each dimer pair, typical values for S1, S2, and S3 are given for closest approach of the colored molecules with respect to the black molecule. Given the anisotropy of the molecular shape, a “coordination sphere” does not encompass the molecules in van der Waals contact. A “coordination ellipsoid” was defined, and all contacts where were defined as the first coordination ellipsoid for the structures in the Experimental Set and for the CSP structures. The values for the denominators were determined by inspecting the full 3-D plots of S1, S2, and S3 and deciding from the absence of structures where the ellipsoids lay. The substituent volumes meant that the maximum values for S1 and S2 are larger for the Experimental Set. Analysis of the conformation-dependent box that contains the molecule[67] was used to confirm that the maximum values of S1, S2, and S3 chosen for the CSP Set were appropriate (see Supporting Information, Section S6).

Structural Correspondence

The geometries (as S1, S2, and S3) of all inversion and translation van der Waals contact dimers were plotted, to enable a comparison of the entire sets of crystal structures. Where there was a cluster of points from close contacts, the corresponding crystal structures were manually inspected for common dimer packing motifs.

XPac and Mercury Crystal Packing Similarity

Both XPac[58,68] and Mercury (Crystal Packing Similarity tool)[69] were used to carry out pairwise analysis of all structures in the Experimental Set. The program’s default settings were used for XPac, and a 20-molecule coordination sphere, 20% distance tolerance, and 20° angle tolerance was used for Mercury. The XPac matrix of pairwise similarities had the structures reordered to group together clusters of higher similarity and the same ordering used for the matrix of n values from the calculation of RMSDn in Mercury (Figure S22 and Excel spreadsheet in Supporting Information).

Results

Data Set of Chalcones Being Analyzed

The list of 232 experimental crystal structures analyzed in this work is given in Table S2 of the Supporting Information. This Experimental Set constitutes 216 different compounds, of which 12 compounds are dimorphic and two are trimorphic. In addition, 16% of the experimental structures have Z′ > 1 (33 structures are Z′ = 2, two are Z′ = 3, one structure is Z′ = 4 and another is Z′ = 5; see Table S3 in Supporting Information), and so some crystal structures contribute multiple conformations to the set of 277 molecular conformations. The most common substituents of the molecules in the Experimental Set are Br (67), Cl (61), NO2 (45), OH (41), OMe (39), and F (36). The full set of substituents is given in the Supporting Information, both within Table S2 (full list of structures) and in Figure S2 (breakdown of numbers of each substituent). Figure a shows that the distribution of space groups in chalcone crystal structures is very similar to that observed for small organic molecules overall,[70] with only 20% of the crystal structures having a unit cell containing more than four asymmetric units (equivalent to molecules for Z′ = 1). Figure b shows that para substituents dominate. This may reflect less interest in synthesizing molecules with ortho and meta substituents or greater difficulties in forming crystals suitable for structure determination.
Figure 3

(a) Pie chart showing the space groups (as given in the CSD) of the Experimental Set. For clarity, not all space groups are labeled on the pie chart. (b) Frequency of different combinations of substituent positions within the Experimental Set with o for ortho, m for meta, p for para, and (.) for no substituent. The first letter is the position of the substituent on the 1-ring, and the second letter is the position of the substituent on the 3-ring.

(a) Pie chart showing the space groups (as given in the CSD) of the Experimental Set. For clarity, not all space groups are labeled on the pie chart. (b) Frequency of different combinations of substituent positions within the Experimental Set with o for ortho, m for meta, p for para, and (.) for no substituent. The first letter is the position of the substituent on the 1-ring, and the second letter is the position of the substituent on the 3-ring.

Crystal Structure Prediction

The lattice energy landscape of the CSP search of the unsubstituted chalcone (Figure ) is a successful prediction of the crystal structures of the unsubstituted chalcone. Form II has a 88:12 disorder in the enone C=O and C–H bonds in the determination with the best R-factor (A-1–Pbcn[71]) and contains some contacts where aromatic rings have some π-overlap with the enone part of an adjacent molecule. The major component of form II is found as the global minimum in the CSP. Form I (A-1–Pbc21[72]), a structure without any face-to-face aromatic or T-shaped aromatic contacts, is the third lowest energy structure, only 2.2 kJ mol–1 higher in energy than form II. This positioning of the known structures of chalcone in the CSP (Figure ) shows that the model for the lattice energy is realistic. While there are only a few structures that are sufficiently competitive in energy to be possible polymorphs for the specific molecule, there are clearly a vast number of ways of packing the chalcone core that are quite favorable.
Figure 4

A summary of the CSP lattice energy landscape of the unsubstituted chalcone. Each point represents the lattice energy and packing coefficient of a crystal structure that is a minimum in the lattice energy, denoted by the space group. The lattice energies of the dominant component of form II and that of form I are also shown as open symbols. The minor (12%) component of the disordered form II had an energy of −91.7 kJ mol–1 and so is outside the energy range shown.

A summary of the CSP lattice energy landscape of the unsubstituted chalcone. Each point represents the lattice energy and packing coefficient of a crystal structure that is a minimum in the lattice energy, denoted by the space group. The lattice energies of the dominant component of form II and that of form I are also shown as open symbols. The minor (12%) component of the disordered form II had an energy of −91.7 kJ mol–1 and so is outside the energy range shown.

Conformation

The central angle of the molecule, θ1 (C4–C1–C2=C3), shows two minima in the torsion angle scan, at 180° and ±30° (Figure a), dividing the conformational space into Regions A (s-cis) and B (s-trans). The vast majority of conformations in the experimental crystal structures have the molecule in the Region A conformation. This would be expected as Region B is higher in energy, but the complexity of the balance of the conformation and packing is exemplified by the Z′ = 3 structure of Jm8p (1-(3-hydroxyphenyl)-3-(4-nitrophenyl)prop-2-en-1-one), which has two molecules in Region A and one in Region B.
Figure 5

Relaxed torsion angle scans about (a) θ1 (defining Region A and B) and (b) planar1 (starting from conformational minima in Region A and Region B) at the PBE0, B3LYP, MP2, and PBE levels of theory with the 6-31G(d,p) basis set. The conformational energy penalty is calculated relative to the global minima in the gas phase optimized geometry. Distributions of (c) the C4–C1–C2=C3 angle (θ1) in the Experimental Set (blue) and CSP set (orange) and (d) the C8–C5–C11–C14 angle (planar1) in the Experimental Set (blue) and CSP set (orange). 360° has been added to all angles below −90°, so all angles have a range of −90° to 270°. For the planar1 angle in parts (b) and (d), only one of the two symmetry-equivalent ranges of values was used, so the conformational regions could be visually separated.

Relaxed torsion angle scans about (a) θ1 (defining Region A and B) and (b) planar1 (starting from conformational minima in Region A and Region B) at the PBE0, B3LYP, MP2, and PBE levels of theory with the 6-31G(d,p) basis set. The conformational energy penalty is calculated relative to the global minima in the gas phase optimized geometry. Distributions of (c) the C4–C1–C2=C3 angle (θ1) in the Experimental Set (blue) and CSP set (orange) and (d) the C8–C5–C11–C14 angle (planar1) in the Experimental Set (blue) and CSP set (orange). 360° has been added to all angles below −90°, so all angles have a range of −90° to 270°. For the planar1 angle in parts (b) and (d), only one of the two symmetry-equivalent ranges of values was used, so the conformational regions could be visually separated. Both phenyl groups have a tendency to be coplanar with the central atoms of the molecule but can vary by ±30° with only a small energy penalty (see Figure S3 in the Supporting Information). Hence, the overall shape of the molecule, as measured by the torsion angle between atoms in the two phenyl rings, planar1, can vary significantly with only a small conformational energy penalty (Figure b), such that a wide range of conformations could plausibly be found in crystal structures. This is shown to be the case, as the distribution of the crystalline conformers (Figure c) is in good agreement, with the conformations within Region A showing a broad distribution of planar1, centered on a planar molecule. The Region B conformers cannot be planar as this produces an intramolecular steric clash between hydrogen atoms. It is notable that there is a significant proportion (24%) of the molecules in the Experimental Set with planar1 around ±50°, a significantly higher proportion than in the CSP set. This feature appears to reduce the qualitative agreement between the trends in the isolated molecule energies and the distributions seen in crystal structures, as is often expected,[73,74] but rarely demonstrated for such a large number of closely related molecules. There are an increasing number of examples[75−78] of a lack of agreement between conformational profiles and observed crystal structures, and the likely causes can be molecule-dependent.[74,79] In this case, we note that the variation in the calculated molecular conformational surfaces with method is significant, though not unprecedented.[80] The MP2 conformation scan has local minima such that molecules with a planar1 angle of ±50° are not only low in energy but also two crystals, one with planar1 around 0° and the other with planar1 around 50°, would be considered conformational polymorphs. The high frequency of experimental crystal structures with planar1 close to ±50° thus could reflect the low energy of that conformation or a packing effect. Figure shows the only significant correlation between the crystalline molecular conformation and the type and position of substituents. (Other plots are given in the Supporting Information, Figure S7.) If the ortho substituent on the 1-ring is amino or hydroxy, it can form an internal hydrogen bond to the carbonyl oxygen, resulting in an unusually narrow spread of θ2 values. Larger ortho substituents on the 1-ring, such as chloro or nitro, cause θ2 to deviate more, and nitro, bromo, or iodo substituents can cause the entire molecule to adopt the Region B conformation.
Figure 6

Conformations of the ortho-substituted molecules in the Experimental Set, plotted as the overall planarity of the molecule (planar1, C8–C5–C11–C14) versus (a) the main torsion angle in the molecule (θ1, C4–C1–C2=C3) and (b) the angle between the 1-ring and the backbone (θ2, C5–C4–C1–C2). Data points are marked by the substituent in the ortho position on the 1-ring. Comparable plots for substituents in other positions of the 1-ring and for substituents on the 3-ring are given in the Supporting Information, Section S7.3.2. 360° has been added to all angles below −90°, so all angles have a range of −90° to 270°.

Conformations of the ortho-substituted molecules in the Experimental Set, plotted as the overall planarity of the molecule (planar1, C8–C5–C11–C14) versus (a) the main torsion angle in the molecule (θ1, C4–C1–C2=C3) and (b) the angle between the 1-ring and the backbone (θ2, C5–C4–C1–C2). Data points are marked by the substituent in the ortho position on the 1-ring. Comparable plots for substituents in other positions of the 1-ring and for substituents on the 3-ring are given in the Supporting Information, Section S7.3.2. 360° has been added to all angles below −90°, so all angles have a range of −90° to 270°.

Dimer Motifs

All pairs of molecules where the two molecules were related by inversion (C12–C2···C2′–C12′ = 180°) or translation (C12–C2···C2′–C12′ = 0°) symmetry that lay within the first coordination ellipsoid (see section ) were plotted in Figure . Overall, there are 747 inversion and 408 translation contacts in the 232 structures of the Experimental Set and 8540 inversion and 6526 translation contacts in the 4985 structures of the CSP Set (further details of the statistics are in the Supporting Information, Section S7.3.6). The inversion motifs are more numerous than translation motifs because there is the possibility of different inversions on either side of the molecule, and only one of the two symmetry related translations on either side is counted. Structures in chiral space groups will not have any inversion dimers.
Figure 7

Inversion ((a) and (c)) and translation ((b) and (d)) contacts in the Experimental Set ((a) and (b)) and CSP Set ((c) and (d)) crystal structures, plotted as S2 vs S3, and colored by S1. All distances are in Å. The scale of color used for S1 is constant, to allow comparison of the plots, although no points with S1 > 4 are in the CSP Set. For the plots of translation, the plots are approximately rotationally symmetrical due to a translation relationship between molecules occurring on both sides at equal distances. Since only one of these dimer relationships is included (arbitrarily chosen by the CCDC Python API), the plots are not completely symmetrical, but any point at S2, S3 is equivalent to a point at -S2, -S3.

Inversion ((a) and (c)) and translation ((b) and (d)) contacts in the Experimental Set ((a) and (b)) and CSP Set ((c) and (d)) crystal structures, plotted as S2 vs S3, and colored by S1. All distances are in Å. The scale of color used for S1 is constant, to allow comparison of the plots, although no points with S1 > 4 are in the CSP Set. For the plots of translation, the plots are approximately rotationally symmetrical due to a translation relationship between molecules occurring on both sides at equal distances. Since only one of these dimer relationships is included (arbitrarily chosen by the CCDC Python API), the plots are not completely symmetrical, but any point at S2, S3 is equivalent to a point at -S2, -S3. There are far more CSP-generated structures than experimental structures, and the substituents of the molecules in the structures of the Experimental Set, being larger than hydrogen atoms to different degrees, increase the end-to-end S2 values by varying amounts. (The larger substituents are predominantly in the para position.) Nonetheless, it is apparent that the clustering of points observed for the CSP structures is reflected in that observed for the experimental structures, albeit with fewer points in the experimental clusters and a more diffuse shape because of the effect of the substituents on the molecular separations. For the face-to-face contacts, where S2 and S3 are approximately 0 Å, the S1 distances would be affected by substituents that are bigger than the aromatic carbons. The variation in S1 in the face-to-face translation contacts is not as great as in the inversion contacts, showing that for the chalcones, translation packing is more space-efficient than inversion, as it is for spoons in a dishwasher. Figure shows the same plots for the CSP Set as Figure c,d, with the points colored by the lattice energy of the crystal structure. There is very little correlation between the dimer contacts in the structures of the CSP set and the overall lattice energy, showing that the energy is not dominated by one dimer contact but has significant contributions from many molecule–molecule contacts within the crystal structures.
Figure 8

Scatterplots for S1, S2, and S3 dimer contacts in the CSP Set (cf., Figure c,d) colored by lattice energy.

Scatterplots for S1, S2, and S3 dimer contacts in the CSP Set (cf., Figure c,d) colored by lattice energy.

Common Dimer Contacts

The 3-D space defining the close contacts within the Experimental Set and CSP Set (Figure ) was divided into boxes, spaced at 0.25 Å intervals in S1 and 1 Å intervals in S2 and S3. All the close contacts within each volume were counted, and boxes containing 5+ interactions for the Experimental Set or 30+ for the CSP Set are defined as highly populated. Further details are given in Section S7.3.7 of the Supporting Information. The dimers that fall in the most populated areas of the plots in Figure are given in Table . The points nearer the centers of the plots in Figure (small S2, S3, large S1, named I1, I2, I3, and T3 in Table correspond to the motifs that will have a larger degree of π-electron overlap and are analyzed as π···π stacking motifs in section . The other common dimer geometries, seen at the top and bottom of Figure b and Figure d (small S2, large S3, variable S1) correspond to motifs named T1 and T2 in Table and are analyzed as side-to-side contacts in section .
Table 1

Definitions and Frequencies (in Experimental and CSP Sets) of the Most Populated Areas of the Plots in Figure a

Definitions in terms of S1, S2, and S3 are given for the Experimental Set, with the definitions in parentheses for the CSP Set where they differ. Diagrams showing these definitions on the 3-D plots of contacts are included at the top. The number of contacts of this type is given, with the percentage of structures containing that type of contact. Structures may contain the same contact type more than once.

Definitions in terms of S1, S2, and S3 are given for the Experimental Set, with the definitions in parentheses for the CSP Set where they differ. Diagrams showing these definitions on the 3-D plots of contacts are included at the top. The number of contacts of this type is given, with the percentage of structures containing that type of contact. Structures may contain the same contact type more than once.

π···π Stacking (Face-to-Face Contacts)

As seen in Table (based on Figures S15–S18 and Table S11 of the Supporting Information), there are three distinct inversion relationships that have extensive overlap of the conjugated systems. These do not form a continuum, and there are regions between the populated areas of the coordination ellipsoids that do not have many contacts on Figure . There are also translation contacts, named as T3 in Table , which cover a broader range of S2 and S3 values in the CSP structures. It is possible for a crystal structure to have more than one of the inversion contacts listed in Table , although having translation contact T3 precludes having any of the inversion contacts I1, I2, or I3. Within the Experimental Set, 68 structures contain at least one face-to-face inversion and 5 structures contain the T3 face-to-face translation. This totals 31% of all structures in the Experimental Set containing face-to-face contacts. Within the CSP Set, 297 structures contain at least one face-to-face inversion, and 935 structures contain the T3 face-to-face translation, totaling 25% of structures. Thus, the vast majority of structures do not contain substantial amounts of π overlap in the inversion or translation contacts.

Side-to-Side Contacts

Two common van der Waals contact dimer geometries are T1 and T2 (Table ). These are translation contacts and differ in the separation in S1. T1 contacts have the molecular planes almost coinciding (<1 Å separation), and almost all have planar1 ∼ ±50°, whereas T2 has the molecular planes separated by between 1.5 and 2.75 Å and planar1 ∼ 0°. In Figure b, this is seen at the bottom middle and top middle, where S3 distances are able to get smaller as S1 distances increase. These two types of contacts look like a continuum in the 2-D plot (Figure ), but Table S11 and the 3-D plots (Figure S16, Figure S18, Figure S19) show that the regions with S1 between 0.5 and 2.0 Å in the Experimental Set (and between 1.0 and 1.5 Å in the CSP Set) do not fulfill the criteria for being highly populated. Hence, there is a subdivision into T1 and T2 dimers, but this depends on the quantitative definition of “highly populated”. In the case of the translation dimers, equivalent contact on both sides of the molecule form 1-D motifs. Hence, we can define a 1-D translational motif with molecules side-to-side, TrSS, which covers both T1 and T2, above, and can be subdivided by molecular conformation. The TrSStwist subset of structures almost all contain the T1 motif defined in Table . As only translational dimers will necessarily form a 1-D motif, it is necessary to use XPac analysis to find other common extended motifs, such as chains, ribbons, sheets, or layers (section ).

Analysis of CSP Structures in Terms of Common Translational 1-D Motifs Seen in the Experimental Set

The distribution and nature of common inversion and translation dimer motifs in the CSP Set are the same as that seen in the Experimental Set (section ), indicating that the main driving force for the packing is the chalcone backbone. Any link between the dimer motifs and the energies of the crystals can only be assessed for the CSP structures. Figure shows how the TrSS motifs with the planar molecule (TrSSplan), twisted molecule (TrSStwist), and related motif with the Region B conformations (TrSSB) are distributed among the computational structures. There is a significantly higher proportion of structures with the TrSSplan motif than in the Experimental Set, which may be an artifact of the CSP method (see section ). The TrSSB motif is seen in a small number of structures, and these are high in energy. However, the lowest energy structure with the Region B conformational type has an energy of −105.7 kJ mol–1, and the lowest energy with the TrSSB motif has an energy of −103.1 kJ mol–1, so this motif is not particularly unfavorable for this conformation.
Figure 9

CSP landscape with points classified by whether the crystal structure includes the TrSS motif or not and the conformation within the motif.

CSP landscape with points classified by whether the crystal structure includes the TrSS motif or not and the conformation within the motif.

Pairwise Comparisons to Identify Extended Motifs, up to Isostructural Crystals

The analysis for the common one and higher dimensional similarities between pairs of crystal structures, carried out using XPac, is summarized by the heatmap in Figure . This is compared with the corresponding diagram for the number (n ≤ 20) of molecules which are overlaid in Mercury’s Crystal Packing Similarity RMSDn analysis in Section S9 of the Supporting Information. The key finding is that the large groups of structures with significant structural similarity on Figure contain the translation motif, TrSS. It is also notable that the XPac analysis of periodically repeating motifs gives a similar distribution of similar structures to the Mercury analysis in terms of the coordination sphere (Figure S22). Hence, this TrSS motif is the most common motif found in both the experimental and CSP crystal structures, seen in approximately 50% of crystal structures in the Experimental Set.
Figure 10

Heatmap of pairwise XPac analysis. The key is degree of similarity, from 1 = common dimer to 9 = 3-D similarity, with 10 for identical structures (i.e., the diagonal). Regions of high similarity where most of the structures contain a variant of the TrSS motif are labeled. The Set labels refer to the groups defined in the isostructurality analysis in Figure . (As the Set 4 structures are contained within both the T1 and the Other TrSS sets, they are also included in the analysis in Figure .) A large version of this figure with full definitions is in Section S9 of the Supporting Information, and an expandable version is provided in the Excel spreadsheet Supporting Information file.

Heatmap of pairwise XPac analysis. The key is degree of similarity, from 1 = common dimer to 9 = 3-D similarity, with 10 for identical structures (i.e., the diagonal). Regions of high similarity where most of the structures contain a variant of the TrSS motif are labeled. The Set labels refer to the groups defined in the isostructurality analysis in Figure . (As the Set 4 structures are contained within both the T1 and the Other TrSS sets, they are also included in the analysis in Figure .) A large version of this figure with full definitions is in Section S9 of the Supporting Information, and an expandable version is provided in the Excel spreadsheet Supporting Information file.
Figure 12

Structural similarities of the 68 crystal structures of the Experimental Set, which contain the T1 contact (and usually the twisted molecule with planar1 ∼ ±50°). Some of the common features are labeled as defined by the key.

Figure 13

Structural similarities of the 45 crystal structures of the Experimental Set, which contain the Other TrSS contact shown on Figure . Some of the common features are labeled as defined by the key.

Figure shows comparable views of crystal structures containing the TrSStwist and TrSSplan motifs. The spacefilling diagram shows that the molecules pack efficiently in this motif. The end-on capped-sticks view of each structure shows how the tilt of the planar molecules with respect to one another is similar to the angle of twist in the twisted molecules so that the relationship between one aromatic ring and the corresponding ring in the next molecule of the 1-D motif is similar, despite the different conformations. The balance of conformation and packing in these two structures gives virtually identical intermolecular interactions. These intermolecular interactions between the chalcone scaffold will not be disrupted by p-substituents, as shown by the bromine and fluorine atoms in Figure . Indeed, the molecules that have the TrSS motif contain a wide range of substituents, namely, F, Cl, Br, I, CH3, N(CH3)2, NO2, OCH3, OH, NH2, SMe, Ph, OEt, iPr, CF3, OCF3 CN, C≡CH, piperidine, OC(=O)C(=CH2)CH3, O(CH2)3CH3, and O(CH2)4Br. Almost all are seen in the para position, with a few in the meta position and a very few (namely, hydroxyl and fluorine) in the ortho position for the TrSSplan motif.
Figure 11

Various views of (top) A-4p(3-(4-bromophenyl)-1-phenyl-2-propen-1-one), which has the TrSStwist motif and (bottom) A-2p-I (3-(4-fluorophenyl)-1-phenyl-2-propen-1-one), which has the TrSSplan motif. Molecules are shown in gray where they are related by a translation to the molecule in atomic colors and in magenta for the glide related molecules. The final views on each line are a pair of molecules extracted from the third and first views respectively, with the 1-ring colored red and the 3-ring colored black to show that the corresponding rings in adjacent molecules of the 1-D packing motif have virtually equivalent relative orientations, regardless of the molecular conformation.

Various views of (top) A-4p(3-(4-bromophenyl)-1-phenyl-2-propen-1-one), which has the TrSStwist motif and (bottom) A-2p-I (3-(4-fluorophenyl)-1-phenyl-2-propen-1-one), which has the TrSSplan motif. Molecules are shown in gray where they are related by a translation to the molecule in atomic colors and in magenta for the glide related molecules. The final views on each line are a pair of molecules extracted from the third and first views respectively, with the 1-ring colored red and the 3-ring colored black to show that the corresponding rings in adjacent molecules of the 1-D packing motif have virtually equivalent relative orientations, regardless of the molecular conformation. In the dimer analysis, there is a particularly concentrated area (T1) corresponding to mainly twisted molecules, and a more diffuse set of translation dimers extending beyond the five structures of the T2 dimer motif that completes the TrSS set, the remainder of which mostly contain the planar molecule (see Supporting Information, Section S8). Hence, the crystal structures containing the T1 dimer were analyzed separately (Figure ) from the rest of the TrSS structures (Figure ) to see how these translational 1-D motifs build up to form more complex packings and detect isostructurality. Structural similarities of the 68 crystal structures of the Experimental Set, which contain the T1 contact (and usually the twisted molecule with planar1 ∼ ±50°). Some of the common features are labeled as defined by the key. Structural similarities of the 45 crystal structures of the Experimental Set, which contain the Other TrSS contact shown on Figure . Some of the common features are labeled as defined by the key. Eight structures contain the Region B conformation (a further structure contains both Region A and Region B conformations), and six of these contain the side-to-side translation interaction that is so similar to the TrSStwist and TrSSplan motifs that it is denoted TrSSB. (The different shape of the molecule means that this is not included within the TrSS definition in terms of S1, S2, and S3, Section S8 of the Supporting Information). Of these, four are isostructural, and a fifth contains a sheet in common with them. The other isostructural groups which do not contain the TrSS or related TrSSB motifs are (Cp3o, Ip1-, Ip2o, Ip2p, Ip3o), (Cp21p, Dm21p, Dp21p), (Jm2p, Jm3p, Jm6p), and (Fp2p, Fp9p, Up9p) and a few isostructural pairs that can be read from the spreadsheets in the Excel spreadsheet Supporting Information file.

Discussion

It is obvious from looking at only a small sample of chalcone structures that there is no simple strong packing preference, such as a favored π···π stacking of the chalcone core. This study has analyzed a large number of experimental crystal structures, most determined because of the pharmacological and optical applications of chalcones but supplemented with a significant number of new structures from our collaborators. The sample is limited to only one substituent per ring, with the substituents limited to being a phenyl ring or smaller, i.e., less than half the size of the core, so that we could study the preferences of the core in determining the packing. The sample was dominated by para substitution and halide substituents (Figure b, Figure S2); i.e., the position and nature of the majority of substituents would be expected to have a relatively minor effect on the conformation and relatively weak substituent-specific intermolecular interactions. However, this study faced the challenge of identifying common motifs in a large set of crystal structures, which are not formed from specific intermolecular synthons, such as hydrogen bonding, but are held together by more diffuse, dispersion-dominated interactions. The large number of chalcone crystal structures reveals that there is a wide variety in the packings. However, this variety is closely mirrored by the structures generated by the CSP study of the unsubstituted chalcone. This shows that the packing is dominated by the core, but since this has a wide variety of packings of similar stability, the packing changes necessitated by the substituents produce a wide range of observed structures. Since there are relatively few extensive sets of crystal structures of similar molecules, we cannot yet say whether this observation is general. However, we do note that the crystal structures of derivative families of compounds often show a surprising overall diversity of crystal structures[75,81,82] which would warrant comparing with a CSP of the core. The substituents affect the relative energies of the different molecular conformations. An ortho hydroxy or amino group on the 1-ring will form an intramolecular hydrogen bond strongly favoring a planar conformation; yet, a large ortho substituent will clearly lead to a steric clash if the molecule is planar. Larger groups in the ortho position of the 1-ring lead to a complete change in the shape of the molecule to the B conformation as seen for the molecules with iodo or nitro at that position (no groups larger than this were found at the ortho position in the data set examined). Four of the eight crystal structures that contain the Region B conformation were isostructural, with a further two containing a 1-D side-to-side translation contact, similar to TrSS. For cases where the conformation was not dictated by steric clashes or internal hydrogen bonding, there could be an electronic effect. However, there was no correlation of greater planarity with Hammett parameters as might be expected if the deviation from planarity was purely a molecular property (see Supporting Information, Section S7.3.3). The large number of twisted molecules (planar1 ∼ ±50° in Region A) in the experimental crystal structures is somewhat surprising, given the general tendency of crystal packing to favor extended conformations[79] as well as the expectation that the chalcone molecule would be planar because of its extended π system. The energy penalty for twisting the molecule is smaller than for adopting the Region B conformation, but both are dependent on the ab initio method (Figure ), with the MP2 calculation producing a local conformational energy minimum around the twisted conformation. Correctly estimating the intramolecular dispersion, which would favor the twisted conformation, is demanding of the ab initio method. The PBE0 functional may overstabilize the planar conformation through a delocalization error (although this error is usually less for PBE0 than PBE,[83−86] the difference appears to be small for the isolated chalcone molecule, Figure ). The MP2 calculations are likely to be affected by an intramolecular basis set superposition error[87] and are still far from the accuracy that can be needed for accurate CSP molecular calculations.[83] However, if we assume that the PBE0 functional calculations used in the CSP are overestimating the intramolecular energy penalty for twisting the molecule, this would at least partially account for why there is a smaller proportion of twisted molecules in the CSP than experimental structures (Figure ). The ability of the molecules that are twisted by 50° to close pack with translational symmetry for a variety of substituents in the TrSStwist motif (Figure ), may well be a more important factor in explaining the large number of twisted molecules in the Experimental Set. Whether or not there is another conformational minimum around 50° in the isolated molecule PES surface would affect the definition of conformational polymorphs[88] as polymorphs where the molecular conformation would optimize to different conformational minima. In this case, whether polymorphic pairs are conformational polymorphs depends on the ab initio method used (Figure , Table S4). So the pairs of polymorphs, A-6p-I and A-6p-IV, and Bp4p-P21/c and Bp4p-P2/n, are conformational polymorphs on the MP2/6-31G(d,p) but not on the PBE0/6-31G(d,p) conformational energy surface. There is the same difference in the chalcone backbone in the polymorphs of Cp9p-o and Cp9p-m, but since the orientation of the methoxy group also differs, these are unambiguously conformational polymorphs (Supporting Information, Section S3.3). The dependence of intramolecular energies on the computational method exacerbates the problem of accurately calculating polymorphic energy differences and obtaining an accurate ranking of structures in CSP.[83]

Crystal Packing

Discussion of Isostructurality

The unsubstituted molecule contains no hydrogen-bond donor atoms, and only one acceptor, so only the hydroxy and amino substituted molecules are capable of forming strong hydrogen bonds which are expected to determine their crystal packing. Hence, it is noteworthy that A-10m-II, which has the hydroxy group in the meta position of the 3-ring hydrogen bonding to the central carbonyl of a neighboring molecule, is isostructural (RMSD30 = 0.34 Å) with one of the polymorphs of the unsubstituted chalcone. Hence, two molecules that would be expected to have very different intermolecular interactions have isostructural polymorphs. This is just one of the many unusual isostructural pairs of molecules found among the chalcones, in addition to those with the more usual isostructural substitutions of Cl/CH3 and F/H. This is illustrated by the largest isostructural group of 15 crystals (Set 1 in Figure ) given in Table . All the substituents are in the para positions on the two rings, and all are halogens or similar-sized groups. There are many structures that might have been expected to be in this isostructural group, but are not, for example A-3p, A-6p (3 polymorphs), A-9p, Bp3p, Bp4p (2 polymorphs) Bp9p, Ip3p, Ip4p, and Ip9p. The conclusions that can be drawn from the isostructurality found in this survey will be limited by the probable lack of polymorph screening for most of the chalcones.[89] However, the isostructurality found in the detailed study of substituted quinoxalines[40] involved metastable polymorphs, which emphasizes that any change of substituent does involve a change in the relative thermodynamics and kinetic differences between the observed and any hypothetical crystal structures of similar energy.
Table 2

Molecules Which Crystallized in the Largest Isostructural Set and Their Crystal Structures

codeREFCODE1-ring3-ringcodeREFCODE1-ring3-ring
A-27pQEDHOYHCF3Dp3pLEPYIPBrCl
Bp6pMIYCAZFCH3Dp4pLEHROGBrBr
Cp3pGAVBELClClDp5pIWALAVBrI
Cp4pGEJJUBClBrDp6pIZEFOIBrCH3
Cp5pTADPEXClIDp27pPAQJOJBrCF3
Cp6pLOBVEEClCH3Ip27pARUGUROCH3CF3
Cp9p-mMEGYON02ClOCH3Sp3pCIQFEOSCH3Cl
C/Dp3/4paCl/BrCl/Br    

A structure that could not be refined to a publishable standard.

A structure that could not be refined to a publishable standard.

Dominant versus Expected Motifs

The dominant packing of the chalcone backbone, the side-to-side translation contacts of the molecules (TrSS, Figure ), is not seen in the observed crystal structures of the unsubstituted molecule, A-1-, though it is in putative polymorphs, including one that is more stable than form I (Figure ). This type of 1-D packing is observed with a wide variety of substituents, although not with any ortho substituents when the molecule is in the twisted conformation. Figure shows that these 1-D motifs can accommodate some meta and any size para substituents. The overlap region on Figure and the similarity in the phenyl–phenyl contacts emphasize that TrSS should be considered a single motif, which can accommodate both planar and twisted molecules. The expected π-overlap of the face-to-face interactions of adjacent molecules is present in many crystal structures and associated with the color,[52] but there is not a strongly preferred way of overlapping the molecules in this way. The face-to-face inversion-related contacts, I1 and I2 (Table ), are present in a significant number of structures. However, the maximum π···π overlap (T3 with no offset (S2, S3) displacement) is rarely observed.

Development of Crystal Structure Comparison Methods

This approach of using the CCDC Python API to systematically analyze a subset of the dimer contacts in the entire set of crystal structures, and then manually inspect this to find trends, is an efficient way of analyzing a large data set to find common structural features. This approach is useful for analyzing the large data sets of structures generated in a CSP study but does require some molecule-specific definitions. The chalcone core is very nonspherical, but its approximate planarity and symmetry allow the definition of a coordination ellipsoid with axes S1, S2, and S3 (the interplanar, end-to-end, and side-to-side molecular distances) to determine which molecules are in van der Waals contact. This first application to simultaneously compare a set of crystal structures of different molecules required the limits on S1, S2, and S3 to be adjusted for variations in substituent size. The coordination ellipsoid definition encompassed all inversion and translation pairs in van der Waals contact but did not include all molecule pairs with a different symmetry relationships (Supporting Information, Section S7.3.5). This analysis of the inversion and translation contacts in the crystal structures showed the structural diversity of the experimental structures and, by comparison with the much larger set of CSP-generated structures of the chalcone core, allowed the identification of some common dimer geometries (Table ). For many of the crystal structures, the inversion and translation dimers go a long way to defining the overall crystal packing as the other molecules have to complete the overall close packing of the coordination ellipsoid. Nonetheless, the focus on inversion and translation dimers could in principle miss a strongly interacting dimer related by another (or approximate) symmetry element. This was not the case for the chalcones as this would have been detected by the XPac analysis. The more established methods of comparing crystal structures, Mercury’s Crystal Packing Similarity tool and XPac, have the major disadvantage of being pairwise comparisons. These comparisons then need to be grouped, which is difficult to automate to find common motifs that are not 3-D (isostructurality). For example, where two structures, A and B, have a 2-D similarity, and B also has a 2-D similarity with C, it does not imply that A and C have a 2-D similarity. If A and C contain the same molecule only and are Z′ = 1, it can be assumed that they have at least a 1-D similarity (since the two planes in B must intersect at a line). Both XPac and Mercury’s Crystal Packing Similarity tool require parameters values defining cutoffs and hence at least manual inspection that the default values are appropriate for the specific group of molecules. For example, Mercury’s Crystal Packing Similarity tool, using the default 15 molecule coordination sphere, can appear to show isostructurality when it has matched a 15 molecule double layer of very anisotropic molecules, but the double layers can stack differently. This can be avoided by increasing the number of molecules to be matched to 20 or even 30 molecules, but this can exceed the capabilities of desktop PCs for larger molecules. XPac is a valuable complement to coordination environment comparisons as it looks for matching motifs of different dimensions. However, XPac can match alternate molecules and report a 3-D similarity, which is not isostructurality as the unmatched molecules will be different.

Systematics of the Chalcone Packings - Is This Unusual?

The investigation of the packing preference of the chalcone core compared the set of experimental substituted crystal structures with the lower energy structures generated by a CSP study of the unsubstituted chalcone. This showed sufficient similarity in the distribution of geometries of inversion and translation-related dimers to confirm that the substituted chalcones had packing preferences similar to the core. However, there were no dimer geometries associated with particularly favorable lattice energies (Figure ), implying that there is no strongly preferred dimer interaction. There is a reassuring consistency that the more common chalone dimer geometries mapped approximately onto the more common motifs identified in the isostructurality analysis using Mercury’s Crystal Packing Similarity tool and XPac. The definitions are different, and since there are some structures that are close to the limits, there can be small differences in the numbers of structures, for example, in T1 and TrSStwist. Face-to-face contacts between the chalcone core with some degree of π···π overlap are seen in less than a third of the crystal structures. However, the TrSS translational motif is significantly more common, appearing in approximately half the experimental structures. This TrSS motif is nicely close-packed in a way that is not affected by any para or smaller meta substituents (Figure ). These substituents will therefore determine how the motif is packed in 3-D. Hence, this translational motif is seen in the crystals of many molecules because it can accommodate a wide range of substituents and can pack in a variety of ways, not because the intermolecular interactions within the TrSS motif are particularly strong. The observation that the chalcones adopt a wide variety of crystal packings reflects the different spatial requirements of substituents requiring different structures to maintain close packing. Even substituents in the same position and of almost the same size will have different charge distributions affecting the local electrostatic and repulsive forces. When there are many packing possibilities, then very small changes will tip the balance of which structures are the most favorable. What is reassuring is that the packing preferences of the chalcone core are reflected in all the crystal structures (i.e., no structures have been seen with highly unfavorable conformations or poor packing of the core). This is possible because there are a large number of ways of packing the chalcone core, as shown by the CSP (Figure , Figure ). This may be a general finding. If the chalcone core had a very strong energetic preference to be planar and pack in a particular 1-D motif (e.g., π-stack), with a large energy penalty for a sideways shift in the stacking, then would so many substituted chalcones be able to crystallize? There would be relatively few substitution patterns that would enable this motif to crystallize with close packing of the atoms at the edges of the stack and favorable substituent intermolecular interactions. (Solvent molecules might be incorporated into the structure if it would otherwise have large voids around the substituents.) Substituents that were larger than the aromatic carbons would provide a force trying to increase the separation of the π-stacked molecules. Thus, attempts to generate a large library of crystal structures of related molecules may be hampered by many just not crystallizing[38] if the interactions within the core packing motif were so strong as to make this motif ubiquitous and virtually rigid. The CSD contains hundreds of crystal structures of simple chalcones because the interest in these molecules for their biological and optical properties has coincided with the ability to form at least one crystal suitable for structural determination.

Conclusion

We have carried out a range of complementary systematic analyses of the crystal structures of 232 chalcone molecules with up to one small substituent on each ring and made comparisons with the low-energy structures generated in a CSP study of the unsubstituted molecule. The similarities between the low-energy crystal structures of the core and the structures of the substituted chalcones confirm that the packing of the core plays a major role in determining the crystal structures. However, there are a wide range of crystal structures in both the Experimental and CSP sets of crystal structures. The molecular conformation can be twisted, with an angle of about 50° between the phenyl rings, as well as the planar conformation expected from the conjugation of the molecule. The diversity of experimental crystal structures is such that no isostructural group contains more than 15 structures, and most contain only a pair of crystal structures. Many of the two dozen crystal structures observed for more than one compound (i.e., isostructural) are unexpected from the similarity in substituent size and position. The most common 1-D motif, observed in half the experimental chalcone structures, is not a typical crystal engineering motif but a translation packing that can be adopted by different conformations of the molecule, with differently sized substituents, particularly in the para position. This study underlines the subtle balance of intermolecular interactions and conformational strain that makes the observed crystal structure quite molecule specific, illustrating why closely related molecules that have similar biological properties will often have very different crystallization behaviors.
  38 in total

1.  The lines-of-force landscape of interactions between molecules in crystals; cohesive versus tolerant and 'collateral damage' contact.

Authors:  Angelo Gavezzotti
Journal:  Acta Crystallogr B       Date:  2010-05-06

2.  Knowledge-based model of hydrogen-bonding propensity in organic crystals.

Authors:  Peter T A Galek; László Fábián; W D Samuel Motherwell; Frank H Allen; Neil Feeder
Journal:  Acta Crystallogr B       Date:  2007-09-14

3.  Conformational polymorphism.

Authors:  Aurora J Cruz-Cabeza; Joel Bernstein
Journal:  Chem Rev       Date:  2013-12-18       Impact factor: 60.622

4.  Why don't we find more polymorphs?

Authors:  Sarah L Price
Journal:  Acta Crystallogr B Struct Sci Cryst Eng Mater       Date:  2013-07-18

5.  Comparison of conformer distributions in the crystalline state with conformational energies calculated by ab initio techniques.

Authors:  F H Allen; S E Harris; R Taylor
Journal:  J Comput Aided Mol Des       Date:  1996-06       Impact factor: 3.686

Review 6.  Exploring pharmacological significance of chalcone scaffold: a review.

Authors:  N K Sahu; S S Balbhadra; J Choudhary; D V Kohli
Journal:  Curr Med Chem       Date:  2012       Impact factor: 4.530

7.  Inhibition of gastric H+,K(+)-ATPase by the anti-ulcer agent, sofalcone.

Authors:  S Murakami; M Muramatsu; H Aihara; S Otomo
Journal:  Biochem Pharmacol       Date:  1991-09-12       Impact factor: 5.858

8.  C-geranylated chalcones from the stems of Angelica keiskei with superoxide-scavenging activity.

Authors:  Nobuwa Aoki; Mai Muko; Emi Ohta; Shinji Ohta
Journal:  J Nat Prod       Date:  2008-06-18       Impact factor: 4.050

9.  Accurate and efficient representation of intramolecular energy in ab initio generation of crystal structures. I. Adaptive local approximate models.

Authors:  Isaac Sugden; Claire S Adjiman; Constantinos C Pantelides
Journal:  Acta Crystallogr B Struct Sci Cryst Eng Mater       Date:  2016-12-01

10.  A systematic structural study of halogen bonding versus hydrogen bonding within competitive supramolecular systems.

Authors:  Christer B Aakeröy; Christine L Spartz; Sean Dembowski; Savannah Dwyre; John Desper
Journal:  IUCrJ       Date:  2015-07-30       Impact factor: 4.769

View more
  1 in total

Review 1.  The Relevance of Crystal Forms in the Pharmaceutical Field: Sword of Damocles or Innovation Tools?

Authors:  Dario Braga; Lucia Casali; Fabrizia Grepioni
Journal:  Int J Mol Sci       Date:  2022-08-12       Impact factor: 6.208

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.