Literature DB >> 29445316

Evaluating the Energetic Driving Force for Cocrystal Formation.

Abstract

We present a periodic density functional theory study of the stability of 350 organic cocrystals relative to their pure single-component structures, the largest study of cocrystals yet performed with high-level computational methods. Our calculations demonstrate that cocrystals are on average 8 kJ mol-1 more stable than their constituent single-component structures and are very rarely (<5% of cases) less stable; cocrystallization is almost always a thermodynamically favorable process. We consider the variation in stability between different categories of systems-hydrogen-bonded, halogen-bonded, and weakly bound cocrystals-finding that, contrary to chemical intuition, the presence of hydrogen or halogen bond interactions is not necessarily a good predictor of stability. Finally, we investigate the correlation of the relative stability with simple chemical descriptors: changes in packing efficiency and hydrogen bond strength. We find some broad qualitative agreement with chemical intuition-more densely packed cocrystals with stronger hydrogen bonding tend to be more stable-but the relationship is weak, suggesting that such simple descriptors do not capture the complex balance of interactions driving cocrystallization. Our conclusions suggest that while cocrystallization is often a thermodynamically favorable process, it remains difficult to formulate general rules to guide synthesis, highlighting the continued importance of high-level computation in predicting and rationalizing such systems.

Entities: Chemical

Year: 2017 PMID： 29445316 PMCID： PMC5806084 DOI： 10.1021/acs.cgd.7b01375

Source DB: PubMed Journal: Cryst Growth Des ISSN： 1528-7483 Impact factor: 4.076

Introduction

Co-crystals, where two or more neutral molecules crystallize together within a single crystal lattice, continue to receive high levels of interest in the field of crystal engineering. A major driver for cocrystal studies is the increased opportunity for the modification of solid form properties that is introduced by the combination of two different chemical entities, rather than chemical modification of the original molecules. This has been exemplified by the modification of dissolution rate,[1] mechanical properties,[2] optical properties,[3] stability to hydration,[4] and melting point[5] through cocrystallization. Like polymorphism, cocrystallization is a phenomenon that is straightforward in principle, but where the definition of guiding rules and the development of predictive methods is challenging. In the area of polymorphism, the use of computational modeling has helped understand the typical energy difference between observed structures: most pairs of polymorphs are separated by lattice energy differences of less than 2 kJ mol–1, and 95% by less than 7.2 kJ mol–1,[6] and we are gaining a better understanding of the magnitude of entropic contributions and their impact on polymorph free energy differences.[6,7] We are not aware of a general guideline for the difference in energy between a typical cocrystal and the pure crystal structures of its individual components. This is despite the utility of such a characteristic value; most obviously, it would indicate whether cocrystallization in general is likely to be spontaneous (i.e., thermodynamically favorable with respect to formation of the separate, pure structures). A better quantitative understanding of the typical energetic driving force for cocrystallization also provides context for the introduction or modification of intermolecular interactions in attempts to induce the cocrystallization of two molecules. Computational chemists have developed a range of methods for predicting cocrystallization. These include the use of simplified models for the possible interactions between molecules,[8] specific criteria based on electrostatic potentials between donor and acceptor atoms on coformers,[9] models trained from crystallographic databases for predicting the propensity of competing hydrogen bond donor–acceptor combinations,[10] the adaptation of liquid-phase models to predict mixing enthalpies between components[11] and, recently, the use of machine learning based on molecular descriptors.[12] Furthermore, the methods developed for crystal structure prediction (CSP) have been applied to cocrystal prediction by exploring the complete space of crystal packing possibilities of the single components and their cocrystals and an evaluation of the possible energetic gain of cocrystallization.[13−15] The possibilities for different stoichiometric ratios of cocrystal components must be considered in predicting cocrystallization; in the case of CSP, this involves predicting the landscapes of possible structures at a range of stoichiometries.[13,16] From a computational CSP perspective, a quantification of the characteristic energy of observed cocrystals with respect to their separate single components would help define the relevant energy range within which a CSP study should explore possible cocrystal structures. Discussions of whether a cocrystal is thermodynamically more or less stable than the corresponding pure structures has largely been limited to studies of specific sets of cocrystal systems. Computationally, these typically involve a small closely related set of target molecules[17,18] rather than a broad, chemically diverse space, making the conclusions drawn from such studies useful for those specific systems but difficult to apply to cocrystallization generally. A recent effort to investigate cocrystallization as a general phenomenon by Gavezzotti et al.[19] considered a large set of around 1500 cocrystal structures, but only calculated energies relative to the single-component structures for a small fraction (97) of those, using the efficient but physically approximate PIXEL method.[20] That being said, both their work and the work of others on cocrystals of specific composition indicate that cocrystallization is likely a thermodynamically driven process; observed cocrystals are usually found to be more stable than the pure structures, but more general and quantitative conclusions regarding the magnitude of this stability cannot be reached in these studies. The aim of the present work is to accurately compute the thermodynamic stabilities of a large set of experimentally known cocrystals with respect to the pure structures of the molecules concerned. Ideally, in order to reach generally applicable conclusions, the set of structures must be both sufficiently large and chemically diverse to make generally representative conclusions. While the relative stabilities will likely show considerable variation depending on the physical nature of each system, it is our hypothesis that cocrystallization is in general thermodynamically driven to a degree that is resolvable with current computational methods. We also explore whether the calculated cocrystal stability could be linked with a straightforward structural descriptor. We consider several relatively simple descriptors of both the crystal structures and the molecules within them. In particular, we calculate and compare descriptors for two typical strong, directional intermolecular interactions—hydrogen bonding and halogen bonding—as well as the packing efficiency of the structures. We assess whether the change in these descriptors between the pure structures and the cocrystal can be linked to the observed stability of the cocrystal relative to its single component structures. Considering a hypothetical binary cocrystal with composition AB, we hereafter refer to the constituent molecular species A and B as the components of the cocrystal. Hence, we term crystal structures of purely constituent A or B as the corresponding single-component structures of the cocrystal. We define the relative thermodynamic stability of the cocrystal aswhere Etot is the total energy of a crystal structure per formula unit of constituent(s): Etot (AB) is the calculated energy of the cocrystal; Etot(A) and Etot(B) are energies of the single-component crystal structures of coformers A and B, respectively. Where polymorphism has been observed in the single-component structures, we present the cocrystal stability relative to the single-component structures with the lowest total calculated energy per formula unit. The cost-effective energy evaluation and structural optimization of molecular crystal structures is itself a difficult problem and an area of considerable research. Here, we apply periodic density functional theory (DFT) calculations, performed in a plane wave basis and supplemented by a dispersion correction (DFT+D). Thus, the energy differences calculated using eq account for both intermolecular energy differences and changes in intramolecular energy between the single component and cocrystal structures. The best DFT+D methods have been demonstrated to obtain values correct to within a few kJ mol–1 relative to experimentally derived lattice energies for a molecular crystal benchmark set of 23 organic crystal structures.[21] While such periodic DFT calculations are quite computationally demanding compared to force field based methods, they remain tractable for large sets of structures using large scale high-performance computing.

Method and Computational Details

General Cocrystal Structure Set

Our aim was to select a representative set of organic cocrystal structures, from which general observations on the energetics of cocrystallization can be made. The set of cocrystals and their corresponding single-component structures were obtained from the Cambridge Structural Database (CSD)[22] using the ConQuest software package. For the general cocrystal set, constituent elements were limited to C, H, N, O, F, S, and Cl. We define cocrystals for the purposes of this work as crystal structures: containing at least two chemically different polyatomic units, containing only overall neutral molecules (i.e., nonionic structures), and not containing any of a set (defined in the Supporting Information) of common solvents or small molecules known to be liquid/gaseous at room temperature, ruling out solvates and clathrates. To avoid duplicate and low-quality structures, all searches were initially restricted to the predefined “best hydrogens” subset[23] of the CSD. Matching of molecules between cocrystals and possible single component structures was handled using SMILES strings. Further details of our search procedure are provided in the Supporting Information. For brevity, we will refer to structures in the text by their CSD reference codes.

Halogen-Bonded Cocrystal Set

The general set of cocrystal structures was supplemented by a specific search performed to identify halogen-bonded cocrystals and their corresponding single-component structures. We identified all cocrystals containing at least one possible halogen bond, which we defined as a contact D···X—A, where D is one of N, O, S, or Cl; X is either Br or I; the D···X distance is less than the sum of the van der Waals radii of the two atoms plus a tolerance of 0.1 Å and the angle formed by D···X—A is greater than 160°. This process, combined with the additional restriction that crystal structures were also available for the individual components of each cocrystal, yielded 3303 unique cocrystal structures matching the general cocrystal criteria, and 34 unique cocrystal structures for the halogen-bonded set. Of the latter, only 28 successfully completed our optimization procedure—we then randomly selected cocrystals from the general set until 322 (and their corresponding single component structures) were successfully geometry optimized. This gave a total combined set of 350 cocrystals for our computational study. The assembled set of single component structures include all crystal polymorphs available within the CSD of each molecule found in the cocrystal set. While our set excludes any structures that are reported as salts in the CSD, such an unambiguous definition is not always possible due to dynamic proton transfer or disorder. Indeed, the experimental determination of cocrystal versus salt can be sensitive to conditions, as described for the cocrystal target of the most recent (sixth) Blind Test of CSP methods.[24] Since our optimization procedure allows all structural degrees of freedom to relax, it is feasible for proton transfer between species to occur if the cocrystal configuration is unstable at the level of theory used in our calculations, potentially changing structures defined in the CSD as being neutral cocrystals into ionic salts.

Periodic DFT Optimizations

All experimental cocrystal and single-component structures in our sets were optimized using plane-wave-based periodic DFT using the CASTEP[25] and VASP[26−29] packages. Structural optimization was performed in two steps to aid convergence—an initial optimization in which only atomic positions were optimized (i.e., the lattice parameters are fixed) followed by a second optimization in which both atomic positions and cell parameters are fully flexible. Such a procedure has been noted to improve convergence of DFT optimizations of experimentally obtained organic crystal structures.[30] Throughout our calculations, the PBE exchange-correlation functional[31] was employed, supplemented with the Becke–Johnson-damped Grimme dispersion corrections[32,33]—the D2 version in CASTEP and the more recent D3 version in VASP. Except where otherwise noted, we present only the final resulting structures and energies from our VASP calculations (PBE+D3) due to the more accurate dispersion correction. All calculations in VASP made use of the projector-augmented wave (PAW) method[34] and the standard supplied pseudopotentials.[35] The convergence criteria for all VASP calculations were as follows: a plane-wave energy cutoff of 500 eV, an energy tolerance in convergence of the electronic minimization of 1 × 10–7 eV per atom, and a force tolerance in the geometry optimization of 3 × 10–2 eV Å–1. These tolerances were found to yield sufficiently accurate energies (and converged minimizations) while moderating the considerable total cost of the many optimizations to be performed. The computational settings used are also in line with those used in other work employing DFT in VASP for accurate energy evaluations of organic crystal structures.[17,30,36,37] Further details regarding our optimization procedure are given in the Supporting Information.

Calculation of Descriptors

Hydrogen Bonding

Analysis of the hydrogen-bonding environments in crystal structures was carried out using in-house code in conjunction with the CSD Python API.[38] A molecular shell was constructed around each unique species in a crystal structure to count all hydrogen bonds in which the species participates (i.e., ignoring crystal symmetry). In cases of chemically equivalent but symmetrically unrelated moieties (e.g., a Z′ = 2 single-component crystal), hydrogen bond counts were averaged over the distinct moieties. Contacts were considered hydrogen bonds if they satisfied all of the following criteria: the H···A contact distance is less than the sum of the van der Waals’ radii of the two atoms, the D—H···A contact angle is greater than or equal to 130 degrees, the donor atom D is any of N, O, or S, and the acceptor atom A is any of N, O, or S. Additionally, we calculated a basic measure of the strength of a hydrogen bond based on the Coulombic attraction between the partial charges of the (positively charged) hydrogen and (negatively charged) acceptor atom involved (the H···A electrostatic interaction). We computed these charges (qH and qA respectively) based on fitting to the molecular electrostatic potential in vacuo of each unique moiety in the optimized crystal structure using the CHELPG method[39] in Gaussian09.[40] The strength of an individual hydrogen bond is then straightforwardly calculated aswhere rHA is the hydrogen-acceptor distance. A total hydrogen-bond attraction energy was then calculated for a given crystal structure C aswhere the sum runs over all hydrogen bonds present in the unit-cell. Finally, the change in this measure of hydrogen bond strength upon forming the cocrystal can be calculated in analogy with eq :

Packing Coefficient

To determine the efficiency of packing of a given crystal structure, we use the packing coefficient CK as defined by Kitaigorodsky.[41] Each molecule is assumed to exclude a volume Vmol based on the van der Waals’ radii of its constituent atoms. The packing coefficient for a crystal structure with unit cell volume Vcell is thereforeand this value was calculated for each of our optimized crystal structures using the PLATON package,[42] with a spherical probe of radius 1.2 Å, and an integration grid of 0.1 Å spacing. Analogously to eq , the change in packing coefficient upon forming the cocrystal can be defined as the difference between CK in the cocrystal and the stoichiometrically normalized sum of CK values of the single components:

Results and Discussion

Distribution of the Energetic Stability of Cocrystals

We first analyze the distribution of cocrystal stabilities in the overall cocrystal set, comprising 350 cocrystal structures, relative to the single-component structures of their constituent coformers (Figure ). The results are presented in two ways: as an energy gain per cocrystal formula unit (blue bars) and as an energy gain per molecule (orange bars, the energy gain per formula unit divided by the number of molecules in the formula unit).

Figure 1

Distributions of calculated (PBE+D3) relative stabilities (defined in eq ) for the full set of 350 cocrystals relative to their corresponding single-component structures. Blue bars are the energy per formula unit of the cocrystal (as defined within its CSD entry), while orange bars are the energy per molecule (the dark regions of bars indicate overlap). Values are expressed in kJ mol–1 and collected into bins of width 5 kJ mol–1. It is clear from Figure that the calculations generally and reliably predict observed cocrystals to be more stable than their corresponding single-component structures. The mean value of the data is −8.0 kJ per mole of molecules or −19.1 kJ per mole of cocrystal formula unit—the average gain in stability achieved by cocrystallization is greater than the typical energy difference between crystal polymorphs of organic molecules (around 2 kJ mol–1).[6] The median stability is −6.7 kJ mol–1 of molecules (−16.2 kJ mol–1 of formula unit), and the distribution is skewed toward increasing cocrystal stability. Note the magnitude of the energy per formula unit depends on the definition of the latter (obtained from the cocrystal’s CSD entry), hence the obvious outlier value at −129 kJ mol–1; this cocrystal, CSD refcode RAWFAW, has an unusually large formula unit containing seven molecules (tris(piperazine) tetrakis(1,1,1-tris(4-hydroxyphenyl)ethane)). Thus, the stability per molecule of RAWFAW relative to the single component structures is a much less extreme value: −18.4 kJ mol–1. Of the 350 cocrystals considered in Figure , only 18 (5.1%) are found to be less stable than the corresponding single-component structures—i.e., our level of theory indicates that 95% of experimentally observed cocrystals in the CSD are thermodynamically favorable. Of those that are found to be less stable, the instability never exceeds +10.2 kJ mol–1 of molecules (+20.5 kJ mol–1 of formula unit); the range of positive (in)stabilities is considerably smaller than the range of negative stabilities. The mean relative stability per formula unit of cocrystal in our data set, −19.1 kJ mol–1, is in reasonably good agreement with the work of Chan et al.,[17] who obtained a value of −11.5 kJ mol–1 also using periodic DFT calculations in VASP (with the PW91 functional and a custom dispersion correction). Chan et al. considered only cocrystals of a few select molecules, so the distribution of cocrystal stabilities that we compute is likely to be more generally representative of organic cocrystals. We also agree extremely well with Chan et al. concerning the proportion of cocrystal structures that are predicted to be more stable than the single-component structures, 94.9% in both cases (though their figure also includes several structures that were known a priori to be salts). Other efforts to compute relative stabilities of specific sets of cocrystals have used the force field and electrostatic models common in CSP, with varying success. Issa et al.[18] used a combination of electrostatic multipoles (derived from MP2 charge densities) and “exp-6” repulsion-dispersion potentials to compute relative stabilities in DMAREL[43] for a set of 26 cocrystals of three molecules of interest with assorted coformers. This approach yielded stabilities of similar magnitude to our work in cases where cocrystallization was predicted to be favorable; however, it did not consistently predict cocrystals to be more stable, and at times strongly disfavored the cocrystal. It is difficult to fully quantify the uncertainty in calculated relative stabilities. On the basis of a benchmark comparing to lattice energies derived from measured sublimation enthalpies, the PBE+D3 energy model achieves a mean absolute error of 4.5 kJ mol–1 with a random error (one standard deviation) of 5.6 kJ mol–1.[37] This is a relatively small error compared to magnitude of the cocrystal stability for the majority of systems; approximately one-third of the cocrystals studied have a stability relative to the single component structures within about 5 kJ mol–1 of zero—this is the proportion of cocrystal structures where the uncertainty in the energy calculation makes it unclear whether they are truly stable relative to their pure components. However, given that our relative stabilities are differences in energies between crystal structures, we expect that fortuitous cancellation of error might reduce the size of this uncertainty, in particular with regards to systematic errors in the PBE+D3 description of the intramolecular physics. It has also been shown[6] that within an empirical force-field and electrostatic multipole description of organic crystals, the contribution of vibrational energy terms to free energy differences between polymorphs rarely exceeds 2 kJ mol–1. Errors of this magnitude, due to us not including vibrational energies, are small relative to the static lattice energy differences. We therefore suggest that while we are unable to fully quantify the uncertainty in our calculated stabilities, we remain confident that the distribution of stabilities is broadly correct and certainly that the qualitative findings—cocrystallization is very often energetically favorable, and very rarely significantly unfavorable—are reliable.

Distribution Subsets: The Influence of Hydrogen and Halogen Bonding

Given our specific inclusion of the halogen-bonding cocrystal subset and the large number of more general cocrystals considered, we can categorize our systems into three different types of cocrystals: hydrogen-bonded, halogen-bonded, and those without one of these specific strong, directional interactions, which for convenience we will refer to as “weakly bound”.

Hydrogen-Bonded Cocrystals

The vast majority (82%) of cocrystals in our set exhibit hydrogen bonding, despite our making no particular efforts to favor such systems. This likely reflects the conventional synthetic routes to the formation of cocrystals being constructed around the complementary pairing of hydrogen bond donor and acceptor species. The distribution of hydrogen-bonded relative stabilities seen in Figure a is therefore unsurprisingly similar to our overall distribution in Figure . The range in Δcocryst in this set is from −24.6 kJ mol–1 (the most stable cocrystal, relative to the corresponding single-component structures) to +10.2 kJ mol–1 (the least stable).

Figure 2

PBE+D3 relative stabilities of our test set of 350 cocrystals, broken down into subsets of (a) hydrogen-bonded (red), (b) halogen-bonded (green), and (c) “weakly-bound” (gray) systems. Each histogram spans the same energy range on the horizontal axis, but note the different frequency ranges on the vertical axes, due to the different subset sizes. The most stable of our hydrogen-bonded cocrystals is RADHEL, a 1:2 cocrystal of 1,4-di[bis(4′-hydroxyphenyl)methyl]benzene (RADGUA) and 1,4-diazanaphthalene (HEYJOK01). Two additional hydrogen bonds are formed per molecule in the cocrystal, as the diazanaphthalene molecule cannot form hydrogen bonds on its own. The least stable hydrogen-bonded cocrystal, which is also the least-stable cocrystal overall, is CSD refcode AWAJOX02—a cocrystal of 2,2′-diethoxy-α-truxilic acid and trans-2-ethoxycinnamic acid. This particular case seems to be due to conformational strain in one of the component molecules, as discussed in a later section.

Halogen-Bonded Cocrystals

As there are fewer structures than in any of the other cocrystal subsets, the energies of the individual structures are also presented as a pair of bar charts in Figure .

Figure 3

PBE+D3 relative stabilities of the subset of halogen-bonded cocrystals. Orange bars indicate structures in which bromine forms the halogen bond, while purple bars indicate iodine. Each structure is labeled by the CSD refcode of the cocrystal. The most stable halogen-bonded cocrystal is also the most stable of all the cocrystals that we consider in this work: HUJNUX, a 1:1 cocrystal of tetraiodo-para-benzoquinone and tetrathiafulvalene (TTF), with a relative stability of −26.0 kJ mol–1 (Figure a). The cocrystal gains two halogen bonds per molecule, though this is also true of a number of other, less-stable cocrystals in our set. As well as the I···S halogen bonds that are formed in the cocrystal, the large stability of this cocrystal is most likely related to charge transfer between the stacked, planar donor and acceptor molecules.[44] The next most stable halogen bonded cocrystals (Figure b–d) have no other obvious strong intermolecular interactions, so most likely gain their stability through the halogen bonds themselves. These involve highly polarized halogen atoms, such as the iodine in 1,4-diiodo-2,3,5,6-tetrafluorobenzene (ISIHUN, DIVCUH) or the bromine in N-bromosuccinimide (IBIYUP).[45]

Figure 4

Four halogen bonded cocrystals found to be most stable with respect to their single component structures: (a) HUJNUX, tetraiodo-para-benzoquinone: tetrathiafulvalene (TTF), (b) ISIHUN, (c) DIVCUH, and (d) IBIYUP. Atoms are colored by element type: gray = carbon; white = hydrogen; blue = nitrogen; red = oxygen, light yellow = fluorine, yellow = sulfur; purple = iodine; brown = bromine. Dashed blue lines indicate halogen bonds. Cocrystal energies are given in kJ mol–1 per molecule with respect to the single-component crystals of their components. Meanwhile, the least stable halogen-bonded cocrystal, ZIJHEH, is a 1:1 system of 3,6-diiodotetrachlorobenzene (ZZZAVM02) and 4-methylbenzenesulfinamide (ZIJGOQ). This structure gains only one halogen bond per molecule, among the fewest in our set, and Δcocryst is only −0.1 kJ mol–1. It is not possible to state that either halogen leads to more stable cocrystals than the other—though the bromine-bonded sample is considerably smaller, both samples display similar distributions of Δcocryst. This is particularly noticeable when considering the ZOHVEZ/ZOHVID pair, 1:1 cocrystals of 3,4,5-trichlorophenol and para-haloaniline featuring halogen···Cl interactions, that are chemically identical apart from the halogen atom present (Br in ZOHVEZ and I in ZOHVID). Despite the change in halogen atom, the Δcocryst values obtained are −6.7 and −4.7 respectively; the difference between these is small and within the estimated uncertainty of the energy calculations. The two cocrystal structures are isostructural with very small differences in atomic coordinates, indicating that the considerable increase in size of the I atom relative to the Br atom does not affect the packing of the cocrystal. This is despite the fact that the single-component structures of the haloanilines pack differently. At least in this case, substituting Br with I does not substantially affect the relative stability of the resulting cocrystal.

Weakly Bound Cocrystals

Our “weakly bound” subset of cocrystals (Figure c) is perhaps the most visibly distinct from the overall distribution of relative stabilities (Figure ). A larger proportion of weakly bound structures have small or slightly positive relative stability values compared with the overall set or the other subsets. The least-stable weakly bound structure is IFIMAL, a 2:1 (defined in the CSD as being 1:1/2) cocrystal of endo4a,5,8,8a-tetrahydro-5,8-ethano-1,4-naphthoquinone (HETNQU) and para-benzoquinone (BNZQUI). However, there is also a larger proportion of very stable (25–30 kJ mol–1) cocrystals in the weakly bound set compared to the other subsets. This is perhaps counterintuitive, as one might expect the lack of strong, directional interactions to reduce the likelihood of the cocrystal being significantly more stable. As we found in the halogen bond set, the most stable cocrystal structure is likely a charge transfer complex. This most stable structure is PINHID, a 1:1 cocrystal of 6,12-dioxa-anthanthrene (or peri-xanthenoxanthene, QUPMUJ10) and tetracyanoquinodimethane (TCNQ, CSD refcode TCYQME), with Δcocryst = −23.6 kJ mol–1 (per molecule). In fact, many of the more-stable half of the weakly bound cocrystal subset are found to contain TCNQ, tetracyanoethylene (TCNE, CSD refcode TCYETY), or trinitrobenzene (TNBENZ) (Figure ), which all have electron-deficient π-systems and are typical acceptor species in charge-transfer systems, paired with planar aromatic donors. All of the low energy cocrystals (up to Δcocryst = −4.2 kJ mol–1) in this set contain aromatic donor and acceptor molecules, all with face-to-face packing in their cocrystal structures and the possibility for charge transfer between coformers. Therefore, it is likely that many of the unusually stable “weakly bound” systems are in fact charge-transfer cocrystals with strong interactions between donor and acceptor species.

Figure 5

Three examples of common molecules in our weakly bound cocrystal set: (a) tetracyanoquinodimethane (TCYQME), (b) tetracyanoethylene (TCYETY02), and (c) trinitrobenzene (TNBENZ13). All three species are electron-deficient and commonly used as acceptor species in charge-transfer complexes. The most stable cocrystal in this subset that lacks face-to-face packing of aromatic donor and acceptor molecules is the 1:1 cocrystal of 4,4′-bipyridine with 1,4-diethynylbenzene (RUXMAZ, Δcocryst = −4.2 kJ mol–1), which contains C–H···N interactions (C–H···A interactions were not classed as hydrogen bonds in our definition of cocrystal subsets). As we did not specifically target our search to include or exclude particular types of systems in this case (unlike for the halogen bonds), this indicates a considerable experimental bias in the CSD toward charge-transfer systems among cocrystals that lack hydrogen bonds or halogen bonds. It is possible this is precisely because the lack of other sources of strong, directional interactions in weakly bound cocrystals discourages attempts at their synthesis.

Comparison of Subsets

Table presents a summary of the calculated energies categorized by the type of interactions present. We preface our analysis by highlighting that the considerably smaller halogen-bonded and weakly bound sets may be less representative of their overall category than the hydrogen-bonded set.

Table 1

Minimum, Mean, and Maximum Relative Stabilities (kJ mol–1 per molecule) Calculated Using PBE+D3 for our Subsets of Experimental Cocrystal Structures, Categorized by the Presence or Absence of Strong Interactionsa

		ΔE_cocryst/kJ mol^–1
type	structures	min	mean	max	N(ΔE_cocryst > 0)
hydrogen-bonded	287	–24.6	–7.7	+10.3	15 (5.2%)
halogen-bonded	28	–26.0	–12.6	–0.1	0
weakly bound	35	–23.6	–6.5	+1.1	3 (8.6%)

Also shown is the proportion of structures with relative stabilities greater than zero (indicating instability compared to the single-component structures).

Also shown is the proportion of structures with relative stabilities greater than zero (indicating instability compared to the single-component structures). Regarding the energetics of the three different categories, we see that there is only a slight difference in the average relative stability between the hydrogen-bonded and weakly bound subsets, but the average relative stability of halogen-bonded systems appears to be markedly greater (by around 5 kJ mol–1). Given their relative scarcity in the CSD (we obtained approximately 1% as many halogen-bonded cocrystals compared to general cocrystals in our searches), they are overrepresented in our overall set of structures, and their greater average relative stability could therefore bias our overall relative stability values slightly. It is clear that, at least for the structures considered in our searches, halogen-bonding leads, on average, to considerably more stable cocrystals relative to the single component structures. The difference between the average values for the hydrogen-bonded and weakly bound sets, on the other hand, is considerably smaller, only 1.2 kJ mol–1 in favor of hydrogen-bonded cocrystals. The sign of this difference agrees with chemical intuition that cocrystal synthesis via formation of more complementary (or stronger) hydrogen bonding relative to the single component might lead to more stable cocrystals. In turn, the energetics of weakly bound systems might be expected to be less sensitive to their composition due to the nonspecificity of the interactions, leading to less strongly favored cocrystals on average. As discussed above, all of the most stable cocrystals in the “weakly bound” subset are composed of charge donor–acceptor molecules that probably lead to charge-transfer complexes. Apart from likely charge-transfer complexes, the Δcocryst values in the “weakly bound” subset are −4.2 kJ mol–1 or smaller. The minimum values of Δcocryst across the three sets show relatively little variation (no more than 1.5 kJ mol–1), suggesting that the greatest possible stability is also not necessarily intrinsic to a particular category, and therefore that the most stable of all cocrystals are not necessarily those with strongly directional, specific interactions. However, the maximum value of Δcocryst (i.e., the greatest instability) is considerably larger for the hydrogen-bonded set. The least stable cocrystal is AWAJOX02, with Δcocryst = 10.3 kJ mol–1; this structure, which is discussed further below, is a result of a photodimerization reaction of ortho-ethoxy-trans-cinnamic acid, leading to a 1:1 cocrystal of the photodimerization product with unreacted ortho-ethoxy-trans-cinnamic acid.[46] Given the unusual route to this cocrystal, it is unsurprising that the energy of the cocrystal structure is an outlier in our data set. Apart from AWAJOX02, the next least stable cocrystal is ABUCIP, a cocrystal of the agrochemical thiophanate-ethyl and 2,2′-bipyridine with Δcocryst = 5.8 kJ mol–1, which is still noticeably higher than any structures in the halogen bonding or “weakly bound” subsets. This higher maximum Δcocryst may simply be due to greater sampling of that population, or perhaps a greater systematic error in the description of hydrogen-bonding using the PBE+D3 level of theory (though the latter might be expected to largely cancel out when considering energy differences). Alternatively, it may be that hydrogen bonding does occasionally result in cocrystals that are thermodynamically unfavorable—perhaps due to the hydrogen bonds encouraging nucleation pathways that yield the cocrystal (i.e., perhaps kinetic effects driving structures to higher-energy local minima are more significant in hydrogen-bonded systems). On the other hand, when looking at the proportion of cocrystals that are unstable in a given subset, this quantity is greatest for weakly bound cocrystals, which suggests these combinations of systems are more likely to yield less-stable cocrystals, as might be expected from chemical intuition. However, the effect of small sample size is particularly noticeable here—if just one of these structures with Δcocryst > 0 were excluded, this proportion would move much closer to the hydrogen-bonded set. While our halogen-bonded subset is also quite small, it is a much larger proportion of all halogen-bonded cocrystal systems in the CSD than the other subsets are of their categories. Hence, we can more confidently state that the lack of any significantly unstable halogen-bonded systems indicates that these interactions are particularly useful for synthesis favoring cocrystals. To improve our understanding and rationalize the distributions we observe, we now consider further specific cases, as well as to attempt to correlate the distributions we obtain with chemical descriptors.

Individual Cocrystal Energetics of Note

Instability Caused by Molecular Strain

The most unstable cocrystal in our set is AWAJOX02, a hydrogen-bonded cocrystal of 2,2′-diethoxy-α-truxilic acid (XOSKEV) and ortho-ethoxy-trans-cinnamic acid (ZZZNQS05), which our procedure predicts to be 10.3 kJ mol–1 higher in energy than the single components. For both component molecules, the single-component structures display molecular geometries that resemble the gas-phase minima that chemical intuition would suggest. However, in the cocrystal, both molecules are noticeably distorted from those geometries—the 2,2′-diethoxy-α-truxilic acid geometry is compressed such that the overall shape is flattened (Figure ), while ortho-ethoxy-trans-cinnamic acid’s conjugated backbone is noticeably nonplanar (approximately 10° deviation, while the molecule in the single-component case is nearly perfectly planar).

Figure 6

Molecular conformations of 2,2′-diethoxy-α-truxilic acid in (a) its single-component crystal structure (XOSKEV) and (b) the cocrystal (AWAJOX02) with ortho-ethoxy-trans-cinnamic acid.

Molecular conformations of 2,2′-diethoxy-α-truxilic acid in (a) its single-component crystal structure (XOSKEV) and (b) the cocrystal (AWAJOX02) with ortho-ethoxy-trans-cinnamic acid. Molecular geometry optimizations at a comparable level of theory to our crystal calculations (PBE+D3/6-311G** in Gaussian09[40]) confirm this hypothesis; in the cocrystal, AWAJOX02, the strain energy (the energy required to adopt the in-crystal conformation relative to the nearest local gas-phase minimum) is 48 kJ mol–1 for the truxilic acid and 20 kJ mol–1 for the cinnamic acid. In comparison, the pure truxilic acid crystal XOSKEV exhibits a molecular strain energy of 37 kJ mol–1, and the pure cinnamic acid ZZZNQS05 a strain energy of 17 kJ mol–1—the single-component structures place the molecules (particularly the truxilic acid) under markedly less strain than the cocrystal. Note also that these strain energies are relative to the nearest gas-phase minimum to the observed in-crystal conformer—the XOSKEV gas-phase minimum is 6 kJ mol–1 higher in energy than that of the same molecule in AWAJOX02, meaning that the formation of the AWAJOX02 in-crystal conformation is further disfavored. Interestingly, but perhaps fortuitously, the sum of these strain energies relative to the lowest energy conformer and averaged over the two molecules in AWAJOX02 is about +10 kJ mol–1, i.e., very similar to the overall relative cocrystal stability we calculate. The unusual energetics of this cocrystal likely relate to its formation as the product of photodimerization of the α′ (Z′ = 3) polymorph of ortho-ethoxy-trans-cinnamic acid, in which two out of three independent molecules in the asymmetric unit combine to form the product, leaving one molecule unreacted in an ordered 1:1 cocrystal. Thus, the cocrystal exists as a result of this solid-state reaction, rather than the stabilization gained from combining the two molecules in a crystal. Our observations regarding intramolecular strain are in agreement with the observed buildup of molecular strain as the photodimerization reaction proceeds.[46] The molecules are strained because they are constrained by the packing of the reactant ortho-ethoxy-trans-cinnamic acid polymorph.

Indications of Entropic Contributions to Small Stabilities

Another interesting, specific case is the cocrystal VIGDAR01, of celecoxib (DIBBUL) and nicotinamide (NICOAM), which recent experimental work by Zhang et al.[47] has determined to be entropy-stabilized–the difference in enthalpy between the cocrystal and the single-component structures is positive. Being a cocrystal of nicotinamide, this is one of the systems previously investigated by Chan et al.,[17] who predicted it to be slightly less stable than the single component structures in advance of the experimental work. (However, Chan et al. used the only available structure in the CSD at the time, VIGDAR, a very similar structure but one which they proposed had assignment errors that they corrected.[17]) Our work predicts a slight energetic favorability of −1.9 kJ mol–1 for VIGDAR01, contrary to experiment. However, given that the uncertainty in our calculated energies is likely to be on the order of a few kilojoules per mole, we cannot definitively state that a value of this magnitude is indicative in either direction. Indeed, Chan et al. predict a similarly marginal value of +1.5 kJ mol–1 using PW91 with a dispersion correction (after a proposed structural reassignment), and it is known that periodic DFT can produce considerable variation in such small energy differences between crystal structures based on the functional employed (see, e.g., qualitative reordering of stability of polymorphs of organic semiconductors[48]). While cocrystals with such small magnitude (in)stabilities cannot be firmly classified as more or less stable, such small computed values likely indicate cases where the cocrystal is potentially experimentally accessible, due to (for example) entropic effects at finite temperature reversing the relative stability computed at zero temperature. In particular, a small calculated stability of a cocrystal which contains at least one molecule with marked conformational freedom within the crystal structure might indicate potentially entropically stabilized cases like VIGDAR01. In this way, computing the relative stability can still be a powerful tool for predicting and validating experimental cocrystallization outcomes, even in cases where the cocrystal is not definitively more stable.

Cocrystal Packing Efficiency

We first consider as a descriptor the change in the packing efficiency (defined in eq ) upon forming the cocrystal. Given that, according to our results in Figure and Table , cocrystals are usually thermodynamically more stable than their single-component counterparts, we consider the possibility that this is purely down to improved efficiency of packing in the cocrystal (and therefore improved nonspecific interactions in general). PBE+D3 relative stabilities of cocrystals plotted against the change in packing coefficient upon forming the cocrystal. A negative value of ΔCK indicates less efficient packing in the cocrystal. Red triangles are from our hydrogen-bonded set, green squares are halogen-bonded, and gray circles are weakly bound cocrystals. One significant finding is that the majority of cocrystals (245 cocrystals, 70% of the set) pack less efficiently than their single-component counterparts—evidently the energetic drivers of cocrystallization are more complex than simply improved packing in the cocrystal. One interpretation of this observation is that the directionality of the strong, specific interactions (such as hydrogen or halogen bonds) used to form cocrystals forces their structures to sacrifice close packing to improve the geometry of these interactions. Unfortunately, our calculations reveal no clear correlation between relative stability and the change in packing efficiency. The overall set displays virtually no relationship; examining the subsets, only the halogen-bonded subset displays even a qualitative relationship, which in this case is negative—less densely packed cocrystals tend to have greater relative stabilities. Even this, however, is subject to a number of exceptions. This is perhaps unsurprising, as it has been observed that the packing efficiency alone is a very poor predictor for many physical properties of crystals.[49] It can clearly be said that cocrystallization occurs despite generally resulting in less favorable packing, but no significant trend, either overall or within the subsets of interaction types, can be seen. Evidently, packing information alone is insufficient to rationalize the relative stabilities we obtain.

Hydrogen-Bonding Analysis

As discussed, chemical intuition often leads to synthetic routes to cocrystals that attempt to either create new hydrogen bonds that are not possible in the single-component structures, or to improve the quality of the hydrogen bonds that do form. However, our results (Figure and Table ) indicate that the presence alone of hydrogen bonding is not sufficient to yield more-stable cocrystals compared to the other sets of cocrystals. We now consider whether the change in hydrogen-bonding in the cocrystal can be correlated with its stability—in the first instance by simply considering the number of hydrogen bonds gained or lost, and in the second by quantifying the quality of the hydrogen bonds formed. The presence of hydrogen bonding is not confined to our “hydrogen-bonding” subset; however, 11 of the cocrystals in our halogen-bonded subset also display hydrogen bonding as well. Hence, we include these structures in the following analysis. It was found in the course of analyzing the hydrogen bonding in our structures that six of our cocrystals underwent proton transfer during the periodic DFT optimizations. All six structures feature O–H···N hydrogen bonds, along which coordinate the hydrogen atom transfers to protonate the N atom during the optimization. Evidently, at this level of theory, the experimental structures as presented in the CSD were unstable with respect to proton transfer between the species, resulting in these structures forming salts. It would, of course, be interesting to revisit these structures experimentally to verify the proton positions and monitor whether proton transfer occurs at low temperature (recalling that our calculations are temperature-free), as well as examining the sensitivity of proton transfer to the computational method (e.g., DFT functional). These structures are listed in the Supporting Information; their energies have been retained in the data in Figure and Table , but they are omitted from the hydrogen-bond analysis.

Simple Hydrogen-Bond Count

Presented in Figure is a plot of the computed relative stability of each cocrystal against the overall change in the average number of hydrogen bonds per molecule. For obvious reasons, here we only include structures within the hydrogen-bonded subset plus any from the halogen-bonded subset which also feature hydrogen bonding. Immediately apparent is the considerable majority (246, 84.3% of the set) of cocrystals which have exactly zero net change in the number of hydrogen bonds present compared to the single-component structures. Those cocrystals which do change the number of hydrogen bonds per molecule are nearly five times more likely to gain than lose them—13.0% of all structures considered compared to 2.7%, respectively—as might be expected if synthetic procedures are guided by complementary pairing of donor and acceptor molecules.

Figure 8

PBE+D3 relative stabilities of cocrystals plotted against the overall change in the number of hydrogen bonds per molecule upon forming the cocrystal. Positive values indicate more hydrogen bonds in the cocrystal. Only the 292 cocrystals from our set in which hydrogen bonding occurs are plotted here; 281 from the hydrogen-bonded subset (287-6 that underwent proton transfer) and 11 additional structures in the halogen-bonded subset. More surprisingly, Figure indicates that, even in the minority of cases in which the hydrogen bond count does change, there is no significant correlation with the computed relative stability. The overall picture is that an increased number of hydrogen bonds per molecule has no systematic effect on the relative stability of the cocrystal. Indeed, there is a slight trend for structures which do gain in hydrogen bonds to be less stable on average as the number of hydrogen bonds increases above 0.5 per molecule, though this is unlikely to be significant. Interesting, both the most extreme gain (+2) and the most extreme loss (−31/2) in hydrogen bonds occur in the halogen-bonded set. Examining the systems that behave contrary to the presumed relationship that more hydrogen bonds leads to greater stability does not reveal anything unusual. Consider VEKKEB, a 2:1 cocrystal of acetophenone (ACETPH) and trans-9,10-dihydroxy-9,10-diphenyl-9,10-dihydroanthracene (DIKCOP01), and one of the cocrystals with the largest gain in hydrogen bonds per molecule (11/3 on average). Yet the relative stability of VEKKEB is only −2.5 kJ mol–1, well below the average value. Its single-component structures exhibit no hydrogen bonding, and both molecules are reasonably rigid, so there are no obvious indicators that either of the single-component structures are unusually stable and would therefore lead to only a marginally stable cocrystal. On the other hand, CEKKOU, a 1:1 cocrystal of benzene-1,2,3-triol (PYRGAL02) and 1,10-phenanthroline (OPENAN), has a considerable and above-average relative stability of −15.9 kJ mol–1 despite losing an average of one hydrogen bond per molecule upon forming the cocrystal. While OPENAN cannot form hydrogen bonds in the pure crystal due to only possessing acceptor atoms, PYRGAL02 has three hydroxyl groups and forms six hydrogen bonds per molecule in the crystal, which one might assume would make it unusually stable and thus disfavor formation of the cocrystal CEKKOU, the opposite of what we calculate. Contrary to chemical intuition, it appears that the simple change in hydrogen bond count does not correlate with the stability of the resulting cocrystal. One weakness of this descriptor is that it considers all H-bonds equally—no chemical or geometric information is involved (apart from the criteria used to define what contacts are H-bonds in the first place). Hence, the H-bonds are binary—they either exist or do not—and there is no measure of their quality or strength. Clearly, a descriptor which very often shows a change of exactly zero between cocrystal and single components is of minimal benefit.

Weighting by Hydrogen-Bond Strength

A more physically meaningful descriptor instead considers the change in both the number and quality of hydrogen bonds, using the electrostatic H-bond attraction measure we define in eq . While this now requires specific geometric information from the crystal structures, it allows us to quantify the strength of a hydrogen bond and therefore numerically describe the change in strength of H-bonding between structures. Figure shows the cocrystal relative stability as a function of the total change in the hydrogen bond electrostatic attraction upon forming the cocrystal. Note that the magnitudes of the hydrogen bond energies are large because we have not included the repulsion energy that typically cancels much of the electrostatic stabilization of hydrogen bonds.

Figure 9

Relative stabilities of cocrystals as a function of the change in the H-bond strength measure defined in eq . Note that this measure considers only the Coulombic attraction between H and A, hence the magnitude of the values along the x-axis. Although the correlation between this descriptor and the relative stability is very weak (R2 = 0.05), there is a statistically significant proportional relationship (P-value = 4 × 10–4, relative to a random normal distribution of points). Broadly speaking, a gain in overall hydrogen bonding strength usually leads to a gain in cocrystal stability, though the magnitude of the latter is very weakly correlated with the gain in strength. As an attempt to incorporate greater physical accuracy into our hydrogen bond strength measure, we also recalculated the changes in hydrogen bond strength including the acceptor–donor Coulombic interaction as well. The strength measure therefore considers a system akin to a model dipole, with the acceptor Coulombically interacting with both the hydrogen atom and the donor atom. Unfortunately, this model not only failed to improve the relationship observed between the change in strength of the hydrogen bond and the cocrystal, it qualitatively altered the character of the hydrogen bond measure (around 20% of all our crystal structures had a net positive, i.e., repulsive hydrogen bond energy when the D···A interaction was included). We present this data in the Supporting Information, but here simply state that this attempt to improve our energetic description of the hydrogen bonding was unsuccessful.

Halogen-Bond Counts

None of the corresponding single-component structures of the halogen bond cocrystal subset exhibit halogen bonding, so when analyzing the effect of halogen bonding on the cocrystal stability, we may count those in the cocrystals alone. Figure presents the relative stabilities of the halogen-bonded cocrystals as a function of the number of halogen bonds gained per molecule in the cocrystal structure.

Figure 10

PBE+D3 relative stabilities of halogen-bonded cocrystals plotted against the gain in halogen bonds per molecule. Orange circles indicate structures in which bromine forms the halogen bond, while purple squares indicate iodine. The bromine-bonded systems exhibit no particular relationship between their relative stabilities and bromine bond counts; there are only six structures amd it is not possible to discern a meaningful trend. However, the iodine-bonded systems, while displaying considerable variation, do exhibit a noticeable trend toward increasing stability with increasing iodine bond count. The distributions for each value of the bond count (1, 11/3, and 2 per molecule) shift toward increasing stability as the bond count increases–the minimum, maximum, and average values all become more negative. (Note that the noninteger change in halogen bonds of 11/3 results from cases where a system of 2:1 stoichiometry, such as the cocrystal LADDEB, gains one halogen bond to the first molecule and two to the second, giving a total of four halogen bonds averaged over three molecules.) The reasons for this relationship applying only to the iodine-bonded cases are uncertain, though it is possible that iodine bonds are sufficiently strong compared to bromine bonds that they make up a larger proportion of the change in energy between the cocrystal and single-component structures. Regardless, the number of new iodine bonds created in the cocrystal appears to offer modest predictive power as to the degree to which it is more stable than the single-component structures. Perhaps the clearest outlier is the bromine-bonded cocrystal EXEJUN (relative to components FATLER01 and HXMTAM07), the fifth-most stable halogen-bonded cocrystal (Δcocryst = −23.8 kJ mol–1) despite only gaining one halogen bond per molecule. However, upon inspection, this cocrystal’s stability may be due to a considerable change in hydrogen bonding; EXEJUN contains two hydrogen bonds to each molecule in addition to the halogen bond, while no hydrogen bonding is present in either of the single component structures. (Note that this is already a more extreme change in hydrogen bonding per molecule than exhibited by any of the hydrogen-bonded cocrystal set.)

Combining Hydrogen Bonding and Packing Efficiency

The most informative description is obtained by simultaneously considering the change in packing coefficient in the cocrystal and the measure of the change in hydrogen bond strength, as is shown in Figure , where points are colored according to Δcocryst.

Figure 11

Relative stabilities of cocrystals (indicated by color) as a function of both the change in packing coefficient and the change in the H-bond strength measure. Note that a more negative value for ΔH-bond indicates a net gain in strength in the cocrystal. This plot includes all of our cocrystals except for the six structures found to undergo proton transfer in their structural optimizations. While the data presented still show considerable variation, there is a general agreement with chemical intuition when our descriptors are combined. More stable cocrystals tend to be those that both increase the strength of the H-bonds present and increase their efficiency of crystal packing (i.e., those points in the upper-left quadrant) relative to the single component structures. Conversely, the less-stable cocrystals tend to be those that show little gain or a net loss in H-bond strength, as well as having less efficient packing. As mentioned, a majority of the cocrystals considered pack less efficiently than their single-component structures. This is despite the overwhelming proportion of cocrystals that our PBE+D3 optimizations predict to be more stable. In some cases, this less-efficient packing is offset by a gain in H-bond strength. However, the spread of data in Figure indicate that a simple balance of these two factors is insufficient to explain the variation in the relative stabilities observed; several comparatively stable cocrystals demonstrate both a loss in packing efficiency and a considerable loss in H-bond strength. Moreover, two of our most stable cocrystals either suffer a large loss in hydrogen-bonding (and very little change in packing) or a very large loss in packing efficiency (and have no hydrogen bonding). Evidently, the complexities of intermolecular interactions driving cocrystallization cannot be completely described by such comparatively simple descriptors.

Limitations of Descriptors

A major drawback of both the packing efficiency and the hydrogen bond strength descriptor is that they are dependent on the crystal structure, not merely molecular information. (The straightforward count of hydrogen bonds, while also dependent the crystal structure, is at least constrained by the structure of the individual molecules and the stoichiometry in the cocrystal.) This reduces the utility of these descriptors for predicting the stability of cocrystals based solely on the molecules involved—a desirable goal for crystal engineering purposes. However, these descriptors still can be useful to provide a qualitative model for the relative stability of a hypothesized structure without requiring an expensive periodic DFT optimization, and the relative difference in descriptors may be more informative than the absolute values. If a hypothetical cocrystal structure demonstrates both reduced packing efficiency and weaker H-bonding compared to its single components, it is unlikely to be considerably more stable according to our calculations. Such a structure may therefore be less worthy of synthetic targeting than another cocrystal structure in which both descriptors improve. It is notable that these descriptors are not necessarily independent of each other–the H-bond strength measure (depending as it does on rHA) can in principle be improved through denser packing without any substantial rearrangement of molecules, e.g., in the trivial case of compression of a given crystal structure. However, there is no systematic physical link between the two descriptors, and the data in Figure show the values obtained tend to be largely independent of each other in practice. It should also be mentioned that our method for computing the partial charges for measuring the H-bond strength acts on the isolated gas-phase molecules in their crystal geometry—the only influence of the surrounding crystal environment on these charges is indirect, in enforcing the precise molecular conformation adopted. Our H-bond strength measure lacks any description of polarization due to neighboring molecules (even the other molecule involved in the H-bond) and is therefore only a basic description of this interaction. A stronger correlation between the descriptors and stability might be also seen in a set which does not focus solely on observed structures. Our use of structures from the CSD restricts our test set to systems which are already experimentally known, and there are physical factors that could promote the formation of a cocrystal beyond the packing and specific interactions—for example, kinetic or solvent effects during the crystallization process—such that it is observed regardless of the change in packing efficiency or the H-bond interaction network. However, given our focus on rationalizing cocrystal formation for experimentally observed cases, and the expense of considering a chemically as well as crystallographically diverse set of hypothetical computationally generated structures, we leave such exploration to future efforts.

Conclusions

We have demonstrated that periodic DFT using the PBE+D3 level of theory predicts the vast majority of organic cocrystals to be energetically more stable than their single-component counterparts. Cocrystallization seems to almost always be a thermodynamically favorable process, with 95% of cocrystals more stable than their single-component structures. The characteristic gain in stability upon cocrystallization averages 8 kJ mol–1 (per molecule), a value considerably greater than that associated with crystalline polymorphism but well within the region of relevance for computational CSP studies. Both the sign and magnitude of this stability are in very good agreement with previous, smaller studies of specific, chemically related sets of molecules. We find that considering our own specific set of halogen-bonded cocrystals reveals a slightly greater stability than the general lighter-element organic case, and that all of the halogen-bonded systems we consider are more stable as the cocrystal than as separate single-component crystals. We have also shown that the gain in stability upon cocrystallization is a complex phenomenon that is not necessarily well-described by changes in basic environmental or crystalline descriptors like molecular hydrogen bond count, strength of hydrogen bonding, or crystalline packing efficiency. Qualitative overall trends can be observed that match chemical intuition—more strongly hydrogen-bonded and efficiently packed cocrystals tend to be more stable. However, the considerable variance and significant number of exceptions in the general data set support the idea that cocrystallization is driven by a wide range and balance of physical factors that are not encapsulated by these descriptors. That said, when looking at extremes of stability or at groups of specific, more closely related systems such as halogen-bonded cocrystals, it appears that these basic descriptors can be of some utility in rationalizing the observed stability. Such descriptors may therefore be more appropriately used in narrowly targeted surveys of related systems. This is particularly true of, for example, the increase in number of halogen bonds, which tracks well with the typical relative stability in the specific case of iodine-bonded systems. The size of our test set and the expense of periodic DFT limit us to employing only one exchange-correlation functional and dispersion correction scheme, but we suggest that the wide variety of cocrystal systems considered and the sheer size of the set (at least three times larger, and much more chemically diverse, than any other DFT study yet undertaken) make our conclusions reasonably applicable to the general case. Even if the precise value of the stability may vary due to a systematic inaccuracy of the functional, we emphasize that the size and breadth of the test set make our study more illuminating of cocrystallization generally than conclusions obtained from a considerably smaller test set assessed with a more expensive method or a range of functionals. We have made our set of optimized structures available in the Supporting Information, along with the identification of the relevant single-component structures for each cocrystal. It is our hope that, beyond our evaluation of the characteristic relative stability, this data set enables further work exploring the phenomenon of cocrystallization in general. The availability of our benchmark value for the relative stability of organic cocrystal systems will also inform computational CSP studies, both for the prediction of specific cocrystal structures as well as for indicating when single-component structures may compete with cocrystals in the configuration space. Experimental work can also be assured that cocrystallization is predicted by theory to generally be a thermodynamic process, which may help to provide routes for the simplification or streamlining of cocrystal screening procedures. Along these lines, a particularly relevant avenue for further computational work is consideration of the effects of thermal motion on the relative cocrystal stability via the calculation of, e.g., lattice free energies as has been performed in our group for crystal polymorphs.[6] The range of calculated values of the stability highlights the importance of using the most physically accurate possible description of systems that is still computationally tractable, as attempting to explain or rationalize such complex and multifaceted interactions using simple descriptors or highly approximate interaction models can be unrepresentative of the true energetics of cocrystallization. The current work demonstrates the power of high-level quantum mechanical methods to provide insights that should be valuable in interpreting and guiding the experimental study of cocrystals, as well as computational efforts aimed at predicting cocrystallization.

24 in total

1. The Cambridge Structural Database: a quarter of a million crystal structures and rising.

Authors: Frank H Allen
Journal: Acta Crystallogr B Date: 2002-05-29

2. Ab initio molecular-dynamics simulation of the liquid-metal-amorphous-semiconductor transition in germanium.

Authors:
Journal: Phys Rev B Condens Matter Date: 1994-05-15

3. Ab initio molecular dynamics for liquid metals.

Authors:
Journal: Phys Rev B Condens Matter Date: 1993-01-01

4. Predicting stoichiometry and structure of solvates.

Authors: Aurora J Cruz-Cabeza; Shyam Karki; László Fábián; Tomislav Friscić; Graeme M Day; William Jones
Journal: Chem Commun (Camb) Date: 2010-02-26 Impact factor: 6.222

5. Searching the Cambridge Structural Database for the 'best' representative of each unique polymorph.

Authors: Jacco van de Streek
Journal: Acta Crystallogr B Date: 2006-07-12

6. Energy ranking of molecular crystals using density functional theory calculations and an empirical van der waals correction.

Authors: Marcus A Neumann; Marc-Antoine Perrin
Journal: J Phys Chem B Date: 2005-08-18 Impact factor: 2.991

7. A benchmark for non-covalent interactions in solids.

Authors: A Otero-de-la-Roza; Erin R Johnson
Journal: J Chem Phys Date: 2012-08-07 Impact factor: 3.488

8. Towards prediction of stoichiometry in crystalline multicomponent complexes.

Authors: Aurora J Cruz-Cabeza; Graeme M Day; William Jones
Journal: Chemistry Date: 2008 Impact factor: 5.020

9. Report on the sixth blind test of organic crystal structure prediction methods.

Authors: Anthony M Reilly; Richard I Cooper; Claire S Adjiman; Saswata Bhattacharya; A Daniel Boese; Jan Gerit Brandenburg; Peter J Bygrave; Rita Bylsma; Josh E Campbell; Roberto Car; David H Case; Renu Chadha; Jason C Cole; Katherine Cosburn; Herma M Cuppen; Farren Curtis; Graeme M Day; Robert A DiStasio; Alexander Dzyabchenko; Bouke P van Eijck; Dennis M Elking; Joost A van den Ende; Julio C Facelli; Marta B Ferraro; Laszlo Fusti-Molnar; Christina Anna Gatsiou; Thomas S Gee; René de Gelder; Luca M Ghiringhelli; Hitoshi Goto; Stefan Grimme; Rui Guo; Detlef W M Hofmann; Johannes Hoja; Rebecca K Hylton; Luca Iuzzolino; Wojciech Jankiewicz; Daniël T de Jong; John Kendrick; Niek J J de Klerk; Hsin Yu Ko; Liudmila N Kuleshova; Xiayue Li; Sanjaya Lohani; Frank J J Leusen; Albert M Lund; Jian Lv; Yanming Ma; Noa Marom; Artëm E Masunov; Patrick McCabe; David P McMahon; Hugo Meekes; Michael P Metz; Alston J Misquitta; Sharmarke Mohamed; Bartomeu Monserrat; Richard J Needs; Marcus A Neumann; Jonas Nyman; Shigeaki Obata; Harald Oberhofer; Artem R Oganov; Anita M Orendt; Gabriel I Pagola; Constantinos C Pantelides; Chris J Pickard; Rafal Podeszwa; Louise S Price; Sarah L Price; Angeles Pulido; Murray G Read; Karsten Reuter; Elia Schneider; Christoph Schober; Gregory P Shields; Pawanpreet Singh; Isaac J Sugden; Krzysztof Szalewicz; Christopher R Taylor; Alexandre Tkatchenko; Mark E Tuckerman; Francesca Vacarro; Manolis Vasileiadis; Alvaro Vazquez-Mayagoitia; Leslie Vogt; Yanchao Wang; Rona E Watson; Gilles A de Wijs; Jack Yang; Qiang Zhu; Colin R Groom
Journal: Acta Crystallogr B Struct Sci Cryst Eng Mater Date: 2016-08-01

10. Modelling temperature-dependent properties of polymorphic organic molecular crystals.

Authors: Jonas Nyman; Graeme M Day
Journal: Phys Chem Chem Phys Date: 2016-11-16 Impact factor: 3.676

8 in total

1. An experimental and theoretical charge density study of theophylline and malonic acid cocrystallization.

Authors: Bryson A Hawkins; Jonathan J Du; Felcia Lai; Stephen A Stanton; Peter A Williams; Paul W Groundwater; James A Platts; Jacob Overgaard; David E Hibbs
Journal: RSC Adv Date: 2022-05-23 Impact factor: 4.036

2. Comparative studies on structure, sensitivity and mechanical properties of CL-20/DNDAP cocrystal and composite by molecular dynamics simulation.

Authors: Binghui Duan; Yuanjie Shu; Ning Liu; Yingying Lu; Bozhou Wang; Xianming Lu; Jiaoqiang Zhang
Journal: RSC Adv Date: 2018-10-09 Impact factor: 4.036

3. Halogen-bonded cocrystallization with phosphorus, arsenic and antimony acceptors.

Authors: Katarina Lisac; Filip Topić; Mihails Arhangelskis; Sara Cepić; Patrick A Julien; Christopher W Nickels; Andrew J Morris; Tomislav Friščić; Dominik Cinčić
Journal: Nat Commun Date: 2019-01-04 Impact factor: 14.919

4. New insights and innovation from a million crystal structures in the Cambridge Structural Database.

Authors: Jason C Cole; Seth Wiggin; Francesca Stanzione
Journal: Struct Dyn Date: 2019-08-28 Impact factor: 2.920

5. Efficient Screening of Coformers for Active Pharmaceutical Ingredient Cocrystallization.

Authors: Isaac J Sugden; Doris E Braun; David H Bowskill; Claire S Adjiman; Constantinos C Pantelides
Journal: Cryst Growth Des Date: 2022-06-15 Impact factor: 4.010

6. One class classification as a practical approach for accelerating π-π co-crystal discovery.

Authors: Aikaterini Vriza; Angelos B Canaj; Rebecca Vismara; Laurence J Kershaw Cook; Troy D Manning; Michael W Gaultois; Peter A Wood; Vitaliy Kurlin; Neil Berry; Matthew S Dyer; Matthew J Rosseinsky
Journal: Chem Sci Date: 2020-12-08 Impact factor: 9.825

7. Challenging the Ostwald rule of stages in mechanochemical cocrystallisation.

Authors: Luzia S Germann; Mihails Arhangelskis; Martin Etter; Robert E Dinnebier; Tomislav Friščić
Journal: Chem Sci Date: 2020-08-12 Impact factor: 9.825

8. Cocrystals of Praziquantel: Discovery by Network-Based Link Prediction.

Authors: Jan-Joris Devogelaer; Maxime D Charpentier; Arnoud Tijink; Valérie Dupray; Gérard Coquerel; Karen Johnston; Hugo Meekes; Paul Tinnemans; Elias Vlieg; Joop H Ter Horst; René de Gelder
Journal: Cryst Growth Des Date: 2021-05-20 Impact factor: 4.076

8 in total