Literature DB >> 35423757

Two models to estimate the density of organic cocrystals.

Jun-Hong Zhou1, Li Zhao2, Liang-Wei Shi3.   

Abstract

Two models for predicting the density of organic cocrystals composed of energetic organic cocrystals and general organic cocrystals containing nitro groups were obtained. Sixty organic cocrystals in which the ratio of component molecules is 1 : 1 were studied as the dataset. Model-I was based on the artificial neural network (ANN) to predict the density of the cocrystals, which used (six) input parameters of the component molecules. The root mean square error (RMSE) of the ANN model was 0.033, the mean absolute error (MAE) was 0.023, and the coefficient of determination (R 2) was 0.920. Model-II used the surface electrostatic potential correction method to predict the cocrystal density. The corresponding RMSE, MAE, and R 2 were 0.055, 0.045, and 0.716, respectively. The performance of Model-I is better than that of Model-II. This journal is © The Royal Society of Chemistry.

Entities:  

Year:  2021        PMID: 35423757      PMCID: PMC8696992          DOI: 10.1039/d0ra10241e

Source DB:  PubMed          Journal:  RSC Adv        ISSN: 2046-2069            Impact factor:   3.361


Introduction

Nowadays, with the development of modern national defense and military industry, research on energetic materials (EMs) has attracted considerable attention. Pure crystals of EMs could not meet the needs of today's military development; therefore, numerous researchers have put their hearts into the study of energetic cocrystals (ECCs). ECCs are built by combining an energetic molecule with one or more molecules through non-covalent interactions in the same lattice. ECCs show great performance with high energy and low sensitivity compared with pure EMs. For example, Yang[1] has prepared a 1 : 1 cocrystal explosive by combining 2,4,6,8,10,12-hexanitrohexaazaisowurtzitane (CL-20) and benzotrifuroxan (BTF), and the cocrystal exhibits excellent performance compared with the pure components. Bolton[2]et al. have discovered and characterized an ECC, which is composed of CL-20 and 2,4,6-trinitrotoluene (TNT) at a molar ratio of 1 : 1. This cocrystal combines the economic and stability factors of TNT with the density and power of CL-20 into a homogenous energetic compound with high explosive power and excellent insensitivity. Xue[3]et al. have found that the cocrystal of CL-20/HMX can mediate the thermal stability of the pure crystal. Zhang and Guo[4] have discovered and characterized five novel 1 : 1 molar ratio cocrystals, which were composed of BTF and a variety of energetic materials. They found that not all cocrystals exhibited excellent performance in comparison with pure BTF. The solid-state density of the energetic material is the primary physical factor in detonation performance. The energy is usually characterized by detonation velocity and pressure, which are proportional to the density according to the Kamlet–Jacobs equations.[5] For ECC research, the high density cocrystals are our research goal. The loading density of a cocrystal explosive is determined by chemical composition, crystal packing, and intermolecular binding strength. However, the relationship between the cocrystal density and the pure component density is uncertain. Kira et al.[6] have researched 17 cocrystals of the benchmark energetic material, TNT, and many of these cocrystals have a density in between those of both components. However, this is not always the case, and some cocrystals have a density higher than those of two pure crystals. Therefore, it is important to choose the appropriate compound to get the cocrystal that has a high density. Nowadays, numerous researchers have obtained cocrystals with some excellent properties by numerous experimental attempts and it is time consuming and dangerous. So, it is urgent to find an accurate model to predict the cocrystal density before the experimental operation. Zhang[7]et al. have supplied a method (eqn (1)) for calculating the cocrystal density, and they supposed that the systems are composed of mixtures of pure components. mi is the mass of component i, and d298K,i is the density of component i. This equation only supplied a rough calculation of the cocrystal density. Strictly speaking, the density of the mixture of pure components is different from that of the cocrystal of pure components because the mixture density does not consider the intermolecular interactions between the pure components. Fathollahi et al.[8] have built models for predicting the densities of the energetic cocrystals using artificial neural network and multiple linear regression (MLR) based on three dragon descriptors (Ms, Elu, and RTm). In their study, while building the model, a cocrystal descriptor is denoted by CD (eqn (2)), and R1 and R2 are the mole fractions of the first and second components, respectively. D1 and D2 are the descriptors of the first and second components, respectively. The correlation coefficient (R2) of the ANN and MLR models (for the whole dataset) was 0.9716 and 0.9309, respectively. The average absolute relative deviation of the ANN model for the complete dataset was 2.48%. Zohari[9] has researched the relationship between the densities of energetic cocrystals through a quantitative structure–property relationship (QSPR) model (eqn (3)).where ρ is the density of the compound in g cm−3, sp is the sum of the atomic polarizabilities, OB is the value of the oxygen balance, DU is the degree of unsaturation of the compound, nAT is the number of atoms, and ρ+ is a correction factor. The research methodology provides a new model that can relate the density of an energetic co-crystal to several molecular structural descriptors, which are calculated by the Dragon[10] software. Dragon is a well-known software that can supply the calculation of more than 1600 molecular descriptors from several input formats (MDL, SYBYL, HyperChem, and Smiles). The determination coefficient (R2) of the derived correlation was 0.937. Krishna et al.[11] have developed a model for predicting the density of cocrystals using artificial neural network based on some descriptors, such as mass weight, binding energy, melting point, and pKa. In these existing models about the prediction of the density of the energetic cocrystals, most models did not consider the interactions between the two components of the cocrystals, so these models are not accurate enough to predict the density of the energetic cocrystals. In the present study, to predict the density of the energetic cocrystal, we chose the energetic organic cocrystals that have been synthesized as the main research objects. To increase the dataset and enhance the credibility of the model, some general organic cocrystals, which contain nitro groups and whose densities are higher than 1.4 g cm−3 and the ratio of whose components is 1 : 1, were also selected as the dataset. The two models to predict the densities of ECCs were built. For Model-I, we used the ANN model to predict the density of the organic cocrystal that uses three input parameters as the factors affecting the density of the organic cocrystal. For Model-II, we have used the Politzer[12,13] method, which was based on the molecular surface electrostatic potential (MESP) to predict the energetic cocrystal density. The method based on MESP has always been used to predict the density of the pure energetic compound. In the present study, we tried to predict the density of the cocrystal. In order to compare the prediction results with those of Model-I, the same dataset as Model-I was used as the research object.

Methods and calculations

In the present study, our major objective is to look for suitable expression parameters, so directly referring to the ref. 11, the ANN was selected as the machine learning algorithm. The ANN model was built as shown in Fig. 1, which includes an input layer, a hidden layer, weights, a sum function, an activation function, and an output layer. The input layer acts as the training sample, and the number of nodes in the input layer is the sample size.
Fig. 1

Architecture of the constructed ANN model consists of three main layers: input, hidden, and output layers.

The hidden layer is the operation black box used for connecting the input layer and output layer, and the number of nodes and number of layers can be customized. The output layer is the calculation result, which is mainly used for different calculations with the expected output. Through error feedback, the weights between the nodes of the hidden layer are adjusted, and then the new result is the output. The error feedback is repeated until the error is within the allowed range. Table 1 lists the main parameters used in the ANN model using the MATLAB toolbox. This includes network topology, training algorithm, and the number of data points of each dataset (training, test, and validation). In the present study, only one hidden layer network was chosen because the number of the samples is only 60, and it is too few.

The network parameters in the MATLAB toolbox

Topology6 inputs, 1 output, and 1 hidden layer with 3 neurons (6 × 3 × 1)
DataTraining set: 42 randomly selected cocrystals
Test set: 9 randomly selected cocrystals
Validation set: 9 randomly selected cocrystals
Beginning functionlog-sigmoid
Training algorithmLevenberg–Marquardt
Loss function conditionsMinimum MSE
Stopping conditionThe network stops in one of three ways: validation check > 10, minimum gradient < 10−7, momentum speed > 1010
The names of the components of the 60 organic cocrystals are listed in Table S1 (see ESI).† The three input parameters of the 60 organic cocrystals, that is, the densities of the two components that make up the cocrystals, ρ1 and ρ2, the strongest hydrogen bond interaction (Ehb), and the three dragon descriptors (Ms, RTm, and E1u)[8] are listed in Table S2 (see ESI).† The strongest hydrogen bond interaction (Ehb) can be calculated using the following formula.[14,15]where MEPmax and MEPmin are the maximum and minimum values on the map of electrostatic potential surface (MEPS) of the gaseous molecule. αmax and βmax are the parameters of the strongest hydrogen bond donor and acceptor, respectively. Suppose that the cocrystal AB is formed by the compounds A and B.andandwhere E(max,A) and E(max,B) denote the pairing energies of the strongest hydrogen bond between A–A and B–B in the pure crystal of compounds A and B. E(max,AB) denotes those in cocrystal AB. ΔEmax denotes the energy difference. The higher the −ΔEmax, the more probable is the formation of the cocrystal. −ΔEmax is taken as the criterion to indicate the possibility of cocrystal formation. The method based on the above description can be called the strongest intermolecular site pairing energy method (SISPE). The corresponding computations were implemented in multiwfn3.6.[16] The program multiwfn can realize the electronic wavefunction analysis. Normalization is to facilitate the rapid learning of neural networks and grasp the logical relationship between the data. Therefore, before performing the artificial neural network calculation, all inputs (descriptors values) were normalized between −1 and +1 using the following equation:where xi is the input or output of the model, Ai is the normalized value of xi, xmin and xmax are the minimum and maximum values of xi, respectively, and rmin and rmax describe the limits of the range where xi should be scaled. Model-II is based on the surface electrostatic potential correction method. The following eqn (16) reflects the features of the molecules' surface electrostatic potentials.where M is the molecular mass and Vm is the volume of the isolated gas phase molecule that is enclosed by the 0.001 au contour of its electronic density. The υσtot2 reflects the features of the molecules' surface electrostatic potentials. The two parameter values of Vm and υσtot2 can be computed using the Multiwfn software, and the value of M can be calculated according to the cocrystal molecular formula. The calculation values of M, Vm, and υσtot2 for the 60 cocrystals are listed in Table S3 (see ESI).† In order to assess the prediction results of the artificial neural network model and the surface electrostatic potential correction model, the relative percentage error (Re%) of the 60 cocrystal samples in the artificial neural network model and the surface electrostatic potential correction model were calculated, respectively. The RMSE, MAE, and R2 of the artificial neural network model and the surface electrostatic potential correction model were also calculated. The specific calculation of Re%, RMSE, MAE, and R2 are showed in the following formula.where the predicted value of the cocrystal density was abbreviated as ypre. The corresponding experimental value of the cocrystal density was abbreviated as yexp, the mean values of the experimental densities of all the cocrystals was abbreviated as ym, and N represents the total number of the cocrystals.

Results and discussion

The training set was 42 randomly selected cocrystals, the test set was 9 randomly selected cocrystals, and the validation set was 9 randomly selected cocrystals in the ANN model. The dataset whose serial numbers ranged from 1 to 42 was taken as the training set. The dataset whose serial numbers ranged from 43 to 51 was taken as the test set. The dataset whose serial numbers ranged from 52 to 60 was taken as the validation set. The descriptors (ρ1, ρ2, ΔEhb, Ms, RTm, and E1u) were taken as the input data and trained. ρ 1 and ρ2 are the experimental densities of the cocrystals from the Cambridge Structural Database. They are calculated according to the experimental crystal cell parameters or are directly determined by experimental measurements. ΔEhb is the energy difference of the strongest hydrogen bond interactions. Ms, RTm, and E1u are the dragon descriptors, and they have been indicated that they have a relation to the density in the ref. 8. For the choice of the descriptor, one method is that the important descriptors are decided by the relative analysis from thousands of descriptors. The other method is that the important descriptor with the physical meaning is directly selected by the expert's experiences. In the present study, six parameters were selected mainly according to the ref. 8, 11 and 15. The ANN model was taken as Model-I. Table 2 lists the predicted density using the artificial neural network model (ρANN), experimental density (ρexp), and relative percentage error (Re%) of the 60 organic cocrystals. From Table 2, it can be seen that the predicted densities agree well with the experimental densities for all the cocrystals in the research study. The maximum absolute value of Re% is 6.48%, and the smallest absolute value of Re% is 0%. 88.3% of the absolute value of Re% of all 60 cocrystals was less than 3%, and 8.3% was between 3 and 5, and 3.3% was more than 5. The RMSE, MAE, and R2 of 60 cocrystal densities predicted by the ANN model were 0.033, 0.023, and 0.920, respectively. 88.3% of the total results had an error of less than 0.05 g cm−3. The RMSE and MAE of 50 energetic cocrystals, according to the prediction results of the ref. 9 were 0.077 and 0.066. In order to compare, R2 of 50 energetic cocrystals from the ref. 9 was also calculated according to eqn (20), and the value was 0.825. 40% of the total results had an error of less than 0.05 g cm−3. The densities of 50 energetic cocrystals from the ref. 9 were predicted according to the method of the ref. 8, and the RMSE, MAE, and R2 were 0.490, 0.413, and 0.023, respectively. 40% of the total results had an error of less than 0.05 g cm−3. From the view of RMSE, MAE, R2, and the error range, the ANN model in the present work has a better prediction performance.

The prediction results of the 60 organic cocrystals using artificial neural network models (g cm−3 for the density units)

No.Co-formersRef. code ρ exp ρ ANN Re%
Training dataset
1CL-20:TNTIZUZUZ1.9111.9300.994
2CL-20:AZ2TETTAQ1.9391.938−0.052
3CL-20:NEX-1WEPGEG1.8821.874−0.425
4CL-2:TODAAZHIVGAW1.9711.958−0.660
5CL-20:BQNROSMOD1.7371.7450.461
6CL-20:DNBTIVJUF1.8801.8810.053
7CL-20:4,5-MDNINILCIX1.8821.877−0.266
8HMX:PNOWEPTAP1.7001.697−0.176
9HMX:FAZEZHET1.6871.6870
10BTF:TNAZEVNUL1.8111.8190.442
11BTF:MATNBGEXMON1.8041.8140.554
12BTF:TNAGEXMIH1.8841.867−0.902
13TNT:NNAPTOZMUS1.5391.5651.689
14TNT:1-BNURIJAH1.7371.698−2.245
15TNT:AntURIJEL1.5151.5321.122
16TNT:9-BNURIJIP1.6881.7151.600
17TNT:PerURIJUB1.5311.5360.327
18TNT:T2URIKEM1.6771.675−0.119
19TNT:DMBURILEN1.5011.5080.466
20ABA:TNTURILUD1.5941.589−0.314
21MACIC:TZMACERAD1.6051.6231.121
22MBD:MTNBDIFZOK1.5221.480−2.760
23PM:UREAEFOZAB031.6441.6480.243
24MC:PCFIXROV011.6061.6613.425
25NDT:THTZTFOYSUJ1.6641.657−0.421
26IDT:NTZFUFSOQ1.6441.6510.426
27DNBA:BAGAUTAM151.6971.655−2.475
28PZ:OAGUDSUV1.6091.6271.119
29TNP:MDNIHARJOB1.7691.768−0.057
30NF:CALEWTAK1.6271.6270
31NF:UREAORUXUV1.6611.652−0.542
32NPO:PAOWIYEZ1.6821.653−1.724
33UREA:CAPANVUV1.6721.654−1.077
34PZCX:DHXBEDPAQNOM1.6281.608−1.229
35DNPA:ODADAQARQUY1.7751.772−0.169
36TNP:TADQONYUP1.6851.655−1.780
37IZO:DLTARUWPEG1.6561.648−0.483
38IZO:LTAUHACIQ1.6311.6460.920
39IZO:LTAUHAFEP1.6071.6160.560
40DNBZA:TZUNAWUD1.6401.6550.915
41TZTM:HPYAFFUJ1.6361.635−0.061
42BM:TNPYUQHEY1.6161.6632.908
Test dataset
43CL-20:MTNPQAPNAZ1.9321.928−0.207
44CL-20:GTAXAQFUS1.6501.571−4.788
45CL-20:NFQNROSMIX1.7741.659−6.483
46DHDS:TZMACETEJ1.6251.6551.846
47AB:MTNBFONHOH1.4421.5134.924
48DNBA:TAIJAKAH1.6351.6551.223
49NMI:NMIITIXUE1.6601.657−0.181
50AN:HPJOZZED1.6141.6472.045
51Urea:OAUROXAM1.6791.605−4.407
Validation dataset
52CL-20:DNGJABYOD1.7501.7701.143
53HMX:PDCAZEZGOC1.6301.6581.718
54BTF:TNBGEXMED1.8061.8381.772
55TNT:DMDBTURIKUC1.4961.5231.805
56TNT:PDAURILAJ1.5781.561−1.077
57TNT:TNBNIBJUF1.6401.6530.793
58DNBZA:NAAWUDEB1.6071.6713.983
59PZCX:OAUZODUK1.6281.6511.413
60TZA:NDTZIVAZBIJ1.7901.698−5.140
In order to compare the prediction results of the two models, 42 cocrystals used in Model-I were also used as the training set in Model-II, and the rest were used for verification. The three parameters α, β, and γ in eqn (16) were obtained by the least-squares method. The specific formula is as follows: The predicted densities of the 60 cocrystals using the surface electrostatic potential correction model are presented in Table 3. Table 3 lists the predicted densities using the surface electrostatic potential correction model (ρP), experimental densities (ρexp), and relative percentage errors (Re%) of the 60 cocrystals. From Table 3, it can be seen that the predicted densities are also in good agreement with the experimental densities for all the cocrystals in the study. The maximum absolute value of Re% is 8.346%, and the smallest absolute value of Re% is 0.094%. In the 60 predicted results of the cocrystal densities, 60% of the absolute values of Re% was between 0 and 3, 28.3% was between 3 and 5, and 11.67% was greater than 5. The RMSE, MAE, and R2 of 60 cocrystal densities predicted by the surface electrostatic potential correction model are 0.055, 0.045, and 0.716, respectively. 65.0% of the total results had an error of less than 0.05 g cm−3. According to the ref. 17, for CHNO molecular crystals, the RMSE, MAE, and R2 of 36 molecular crystals are 0.045, 0.036, and 0.918, respectively. 77.8% of the total results had an error of less than 0.05 g cm−3. According to the ref. 18, an R2 value greater than 0.5 indicates the significant predictivity of the model. In the ref. 17, Politzer et al. categorized the quality of the density predictions according to the criteria provided by Kim et al.,[18] that is, (a) “excellent” (having an error less than 0.03 g cm−3), (b) “informative” (having an error between 0.03 and 0.05 g cm−3), (c) “barely usable” (an error between 0.05 and 0.10 g cm−3), and (d) “deceptive” (error greater than 0.10 g cm−3). Compared to the pure energetic crystal, the prediction based on MESP exhibits worse performance. However, according to the ref. 17, Model-II was also acceptable.

Prediction results of the 60 organic cocrystals using surface electrostatic potential correction models (g cm−3 for the density unit)

Co-formersRef. code ρ exp ρ P Re%
1CL-20:TNTIZUZUZ1.9111.853−3.023
2CL-20:DNGJABYOD1.7501.8113.464
3CL-20:MTNPQAPNAZ1.9321.882−2.574
4CL-20:AZ2TETTAQ1.9391.877−3.213
5CL-20:NEX-1WEPGEG1.8821.8980.837
6CL-20:GTAXAQFUS1.6501.7496.010
7CL-2:TODAAZHIVGAW1.9711.878−4.721
8CL-20:NFQNROSMIX1.7741.8122.155
9CL_20:BQNROSMOD1.7371.8395.864
10CL-20:DNBTIVJUF1.8801.860−1.070
11CL-20:4,5-MDNINILCIX1.8821.849−1.770
12HMX:PNOWEPTAP1.7001.698−0.094
13HMX:FAZEZHET1.6871.7413.205
14HMX:PDCAZEZGOC1.6301.6984.164
15BTF:TNAZEVNUL1.8111.8763.612
16BTF:TNBGEXMED1.8061.8230.940
17BTF:MATNBGEXMON1.8041.8070.178
18BTF:TNAGEXMIH1.8841.820−3.414
19TNT:NNAPTOZMUS1.5391.6275.740
20TNT:1-BNURIJAH1.7371.7400.151
21TNT:AntURIJEL1.5151.5653.305
22TNT:9-BNURIJIP1.6881.7121.404
23TNT:PerURIJUB1.5311.5400.616
24TNT:T2URIKEM1.6771.556−7.213
25TNT:DMDBTURIKUC1.4961.5443.187
26TNT:PDAURILAJ1.5781.6232.882
27TNT:DMBURILEN1.5011.5855.585
28TNT:TNBNIBJUF1.6401.7446.364
29ABA:TNTURILUD1.5941.6362.632
30MACIC:TZMACERAD1.6051.6211.027
31DHDS:TZMACETEJ1.6251.6672.555
32DNBZA:NAAWUDEB1.6071.6331.648
33MBD:MTNBDIFZOK1.5221.5894.434
34PM:UREAEFOZAB031.6441.574−4.243
35MC:PCFIXROV011.6061.6140.470
36AB:MTNBFONHOH1.4421.5094.618
37NDT:THTZTFOYSUJ1.6641.6851.237
38IDT:NTZFUFSOQ1.6441.6761.931
39DNBA:BAGAUTAM151.6971.555−8.347
40PZ:OAGUDSUV1.6091.577−1.977
41TNP:MDNIHARJOB1.7691.745−1.367
42DNBA:TAIJAKAH1.6351.6792.712
43NMI:NMIITIXUE1.6601.6680.504
44AN:HPJOZZED1.6141.579−2.151
45NF:CALEWTAK1.6271.6702.634
46NF:UREAORUXUV1.6611.6891.673
47NPO:PAOWIYEZ1.6821.7292.806
48UREA:CAPANVUV1.6721.6800.477
49PZCX:DHXBEDPAQNOM1.6281.6491.302
50DNPA:ODADAQARQUY1.7751.708−3.761
51TNP:TADQONYUP1.6851.652−1.930
52IZO:DLTARUWPEG1.6561.645−0.686
53IZO:LTAUHACIQ1.6311.617−0.873
54IZO:LTAUHAFEP1.6071.6301.420
55DNBZA:TZUNAWUD1.6401.6893.015
56Urea:OAUROXAM1.6791.6910.707
57PZCX:OAUZODUK1.6281.613−0.942
58TZA:NDTZIVAZBIJ1.7901.705−4.740
59TZTM:HPYAFFUJ1.6361.562−4.515
60BM:TNPYUQHEY1.6321.6491.028
While the Politzer model was built, the Politzer parameters were calculated based on the packing unit structure of the experimental crystal. However, in fact, while the model was used, the packing unit structure of the experimental crystal was not obtained, and it was only obtained by theoretical optimization. In order to compare the error caused by the packing unit structure, the densities of the six cocrystals were predicted based on the packing unit structures coming from the experimental cocrystals, which were theoretically optimized and unoptimized, respectively. The specific calculated values are shown in Table 4 and 5, respectively. By comparing the predicted densities of the cocrystals based on the optimized and unoptimized packing unit structures, it could be found that the Re% of the predicted density values was comparable with that of the predicted density values based on the unoptimized cocrystals. Therefore, the Politzer model built in the present study can be used to predict the densities of the cocrystals.

Parameters and the predicted density of the 6 optimized cocrystalsa

Co-formersRef. code M V m M/Vm υσ tot 2 ρ exp ρ pre Re%
CL-20:AZ2TETTAQ672.320498.7631.34842.2821.9391.9922.733
TNT:NNAPTOZMUS400.302361.8631.10635.1071.5391.6205.263
TNT:1-BNURIJAH434.201359.3161.20828.7431.7371.7772.303
TNT:DMDBTURIKUC431.363421.0631.04424.2161.4961.5241.872
TNT:PDAURILAJ341.275310.7561.07931.5151.5781.5780
TNT:DMBURILEN365.297344.7861.05924.1051.5011.5473.065

M are in g mol−1, Vm in Å3, the υσtot2 in (kcal mol)2 and all the density units are in g cm−3.

Parameters and the predicted density of the 6 unoptimized cocrystalsa

Co-formersRef. code M V m M/Vm υσ tot 2 ρ exp ρ pre Re%
CL-20:AZ2TETTAQ672.320492.0171.36645.3181.9391.929−0.516
TNT:NNAPTOZMUS400.302349.4621.14537.7731.5391.5742.274
TNT:1-BNURIJAH434.201347.5321.24931.6241.7371.7410.23
TNT:DMDBTURIKUC431.363401.1621.07526.4261.4961.461−2.34
TNT:PDAURILAJ341.275299.5561.13943.1381.5781.564−0.887
TNT:DMBURILEN365.297328.3831.11226.5511.5011.5211.332

M are in g mol−1, Vm in Å3, υσtot2 in (kcal mol)2, and all the density units are in g cm−3.

M are in g mol−1, Vm in Å3, the υσtot2 in (kcal mol)2 and all the density units are in g cm−3. M are in g mol−1, Vm in Å3, υσtot2 in (kcal mol)2, and all the density units are in g cm−3. In order to compare the relative accuracy of the artificial neural network model and the surface electrostatic potential correction model in predicting the cocrystal density, the differences between the absolute values of the Re% of the two models were calculated. Table 6 shows the values of RANN%, RP%, and the differences between the absolute values of RANN% and RP% (|RANN%| − |RP%|). The regression performance of the two models for predicting the densities of the cocrystals is shown in Fig. 2. When the value of |RANN%| − |RP%| is negative, it indicates that the artificial neural network model is more accurate in predicting the density value of the cocrystal.

Comparison of the prediction results of the two organic cocrystal density prediction models

No.Co-formersRef. code R ANN% R p%|RANN%| − |RP%|
1CL-20:TNTIZUZUZ0.994−3.023−2.029
2CL-20:DNGJABYOD−0.0523.464−3.412
3CL-20:MTNPQAPNAZ−0.425−2.574−2.149
4CL-20:AZ2TETTAQ−0.660−3.213−2.553
5CL-20:NEX-1WEPGEG0.4610.837−0.376
6CL-20:GTAXAQFUS0.0536.010−5.957
7CL-2:TODAAZHIVGAW−0.266−4.721−4.455
8CL-20:NFQNROSMIX−0.1762.155−1.979
9CL-20:BQNROSMOD05.864−5.864
10CL-20:DNBTIVJUF0.442−1.070−0.628
11CL-20:4,5-MDNINILCIX0.554−1.770−1.216
12HMX:PNOWEPTAP−0.902−0.0940.808
13HMX:FAZEZHET1.6893.205−1.516
14HMX:PDCAZEZGOC−2.2454.164−1.919
15BTF:TNAZEVNUL1.1223.612−2.49
16BTF:TNBGEXMED1.6000.9400.66
17BTF:MATNBGEXMON0.3270.1780.149
18BTF:TNAGEXMIH−0.119−3.414−3.295
19TNT:NNAPTOZMUS0.4665.740−5.274
20TNT:1-BNURIJAH−0.3140.1510.163
21TNT:AntURIJEL1.1213.305−2.184
22TNT:9-BNURIJIP−2.7601.4041.356
23TNT:PerURIJUB0.2430.616−0.373
24TNT:T2URIKEM3.425−7.213−3.788
25TNT:DMDBTURIKUC−0.4213.187−2.766
26TNT:PDAURILAJ0.4262.882−2.456
27TNT:DMBURILEN−2.4755.585−3.11
28TNT:TNBNIBJUF1.1196.364−5.245
29ABA:TNTURILUD−0.0572.632−2.575
30MACIC:TZMACERAD01.027−1.027
31DHDS:TZMACETEJ−0.5422.555−2.013
32DNBZA:NAAWUDEB−1.7241.6480.076
33MBD:MTNBDIFZOK−1.0774.434−3.357
34PM:UREAEFOZAB03−1.229−4.243−3.014
35MC:PCFIXROV01−0.1690.470−0.301
36AB:MTNBFONHOH−1.7804.618−2.838
37NDT:THTZTFOYSUJ−0.4831.237−0.754
38IDT:NTZFUFSOQ0.9201.931−1.011
39DNBA:BAGAUTAM150.560−8.347−7.787
40PZ:OAGUDSUV0.915−1.977−1.062
41TNP:MDNIHARJOB−0.061−1.367−1.306
42DNBA:TAIJAKAH2.9082.7120.196
43NMI:NMIITIXUE−0.2070.504−0.297
44AN:HPJOZZED−4.788−2.1512.637
45NF:CALEWTAK−6.4832.6343.849
46NF:UREAORUXUV1.8461.6730.173
47NPO:PAOWIYEZ4.9242.8062.118
48UREA:CAPANVUV1.2230.4770.746
49PZCX:DHXBEDPAQNOM−0.1811.302−1.121
50DNPA:ODADAQARQUY2.045−3.761−1.716
51TNP:TADQONYUP−4.407−1.9302.477
52IZO:DLTARUWPEG1.143−0.6860.457
53IZO:LTAUHACIQ1.718−0.8730.845
54IZO:LTAUHAFEP1.7721.4200.352
55DNBZA:TZUNAWUD1.8053.015−1.21
56Urea:OAUROXAM−1.0770.7070.37
57PZCX:OAUZODUK0.793−0.942−0.149
58TZA:NDTZIVAZBIJ3.983−4.740−0.757
59TZTM:HPYAFFUJ1.413−4.515−3.102
60BM:TNPYUQHEY−5.1401.0284.112
Fig. 2

Predicted densities of the cocrystals vs. experimental data for all the datasets ((a) for the ANN model, and (b) for the Politzer model).

However, the surface electrostatic potential correction model is more accurate in predicting the density value of the cocrystal. From Table 6, it can be seen that among the 60 cocrystals, 18 are positive values, accounting for 30%, and 42 are negative values, accounting for 70%. According to the RMSE and MAE values of the two models calculated above, it is also found that both values of the artificial neural network model are smaller than those obtained using the surface electrostatic potential correction model. From Fig. 2, it can also be seen that the performance of the artificial neural network model is better than that of the surface electrostatic potential correction model. Therefore, in these two models, the density value of the cocrystal predicted by the artificial neural network model is relatively accurate. However, the surface electrostatic potential correction model is more convenient to predict the cocrystal density because it provides a unique and specific formula for calculating the cocrystal density. It is also simple and time-saving to calculate the two parameters of the surface electrostatic potential correction model.

Conclusions

In this study, two types of prediction models for the organic cocrystal density were established. One is the artificial neural network model, and the other is the surface electrostatic potential correction model. For the artificial neural network model, the maximum absolute value of Re% is 6.483%, and the smallest absolute value of Re% is 0. 88.3% of 60 cocrystals for the absolute values of Re% were less than 3%. The RMSE and MAE of 60 organic cocrystal densities predicted by the artificial neural network model are 0.033 and 0.023, respectively. For the surface electrostatic potential correction model, maximum absolute value of Re% is 8.346%, and smallest absolute value of Re% is 0.094%. In the 60 predicted results of the cocrystal densities, 60% of the absolute values of Re% were between 0 and 3, 28.3% was between 3 and 5, and 11.67% was greater than 5. The RMSE and MAE of the 60 cocrystal densities predicted by the surface electrostatic potential correction model are 0.055 and 0.045, respectively. To compare the prediction accuracy of the two models, the values of |RANN%| − |RP%| were also calculated. By comparing the values of Re%, RMSE, MAE, and |RANN%| − |RP%|, it can be inferred that the artificial neural network model is more accurate than the surface electrostatic potential correction model. However, the surface electrostatic potential correction model is more convenient and practical than the artificial neural network model. Therefore, the two models could be selected according to the actual requirements.

Conflicts of interest

The authors declare there are no conflicts of interest regarding the publication of this paper.
  5 in total

1.  Multiwfn: a multifunctional wavefunction analyzer.

Authors:  Tian Lu; Feiwu Chen
Journal:  J Comput Chem       Date:  2011-12-08       Impact factor: 3.376

2.  Virtual computational chemistry laboratory--design and description.

Authors:  Igor V Tetko; Johann Gasteiger; Roberto Todeschini; Andrea Mauri; David Livingstone; Peter Ertl; Vladimir A Palyulin; Eugene V Radchenko; Nikolay S Zefirov; Alexander S Makarenko; Vsevolod Yu Tanchuk; Volodymyr V Prokopenko
Journal:  J Comput Aided Mol Des       Date:  2005-06       Impact factor: 3.686

3.  Improved stability and smart-material functionality realized in an energetic cocrystal.

Authors:  Onas Bolton; Adam J Matzger
Journal:  Angew Chem Int Ed Engl       Date:  2011-08-25       Impact factor: 15.336

4.  Assessment of density prediction methods based on molecular surface electrostatic potential.

Authors:  Ayushi Nirwan; Alka Devi; Vikas D Ghule
Journal:  J Mol Model       Date:  2018-06-19       Impact factor: 1.810

5.  Prediction of densities for solid energetic molecules with molecular surface electrostatic potentials.

Authors:  Chan Kyung Kim; Soo Gyeong Cho; Chang Kon Kim; Hyung-Yeon Park; Hui Zhang; Hai Whang Lee
Journal:  J Comput Chem       Date:  2008-08       Impact factor: 3.376

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.