Jun-Hong Zhou, Li Zhao, Liang-Wei Shi.
Abstract
Two models were developed for predicting the density of organic cocrystals, covering both energetic cocrystals and general organic cocrystals containing nitro groups. The dataset comprised sixty organic cocrystals in which the ratio of component molecules is 1 : 1. Model-I used an artificial neural network (ANN) with six input parameters of the component molecules to predict the cocrystal density. For the ANN model, the root mean square error (RMSE) was 0.033, the mean absolute error (MAE) was 0.023, and the coefficient of determination (R²) was 0.920. Model-II used the surface electrostatic potential correction method to predict the cocrystal density; the corresponding RMSE, MAE, and R² were 0.055, 0.045, and 0.716, respectively. Model-I therefore performs better than Model-II on all three metrics. This journal is © The Royal Society of Chemistry.
Year: 2021 PMID: 35423757 PMCID: PMC8696992 DOI: 10.1039/d0ra10241e
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 3.361
Fig. 1 Architecture of the constructed ANN model, which consists of three main layers: input, hidden, and output.
The network parameters in the MATLAB toolbox
| Parameter | Setting |
|---|---|
| Topology | 6 inputs, 1 output, and 1 hidden layer with 3 neurons (6 × 3 × 1) |
| Data | Training set: 42 randomly selected cocrystals |
| | Test set: 9 randomly selected cocrystals |
| | Validation set: 9 randomly selected cocrystals |
| Transfer function | log-sigmoid |
| Training algorithm | Levenberg–Marquardt |
| Loss function conditions | Minimum MSE |
| Stopping condition | The network stops in one of three ways: validation check > 10, minimum gradient < 10⁻⁷, momentum speed > 10¹⁰ |
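The settings in the table can be pictured with a small sketch. The pure-Python example below is an illustration only, not the authors' MATLAB code: it splits 60 sample indices at random into 42/9/9 subsets and runs one forward pass through a 6 × 3 × 1 network with a log-sigmoid hidden layer. The linear output neuron, the random placeholder weights, the seed, the sample input vector, and all function names are assumptions made for the example; Levenberg–Marquardt training is not shown.

```python
import math
import random


def logsig(x):
    """Log-sigmoid transfer function: 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))


def split_indices(n_samples=60, n_train=42, n_test=9, seed=0):
    """Randomly split sample indices into training, test, and validation subsets."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    train = indices[:n_train]
    test = indices[n_train:n_train + n_test]
    validation = indices[n_train + n_test:]
    return train, test, validation


def init_network(n_in=6, n_hidden=3, seed=0):
    """Random placeholder weights and zero biases for a 6 x 3 x 1 network."""
    rng = random.Random(seed)
    return {
        "w_hidden": [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_hidden)],
        "b_hidden": [0.0] * n_hidden,
        "w_out": [rng.uniform(-1, 1) for _ in range(n_hidden)],
        "b_out": 0.0,
    }


def forward(net, x):
    """One forward pass: log-sigmoid hidden layer, linear output (assumed)."""
    hidden = [
        logsig(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(net["w_hidden"], net["b_hidden"])
    ]
    return sum(w * h for w, h in zip(net["w_out"], hidden)) + net["b_out"]


def count_parameters(n_in=6, n_hidden=3, n_out=1):
    """Number of weights and biases in a fully connected network of this shape."""
    return (n_in * n_hidden + n_hidden) + (n_hidden * n_out + n_out)


train, test, validation = split_indices()
net = init_network()
prediction = forward(net, [0.1, 0.2, 0.3, 0.4, 0.5, 0.6])  # hypothetical inputs
print(len(train), len(test), len(validation), count_parameters())
```

With this topology the network has 25 adjustable weights and biases, which is small relative to the 42 training samples.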
The prediction results of the 60 organic cocrystals using artificial neural network models (g cm⁻³ for the density units)
| No. | Co-formers | Ref. code | Experimental density | Predicted density | Re% |
|---|---|---|---|---|---|
| Training set | | | | | |
| 1 | CL-20:TNT | IZUZUZ | 1.911 | 1.930 | 0.994 |
| 2 | CL-20:AZ2 | TETTAQ | 1.939 | 1.938 | −0.052 |
| 3 | CL-20:NEX-1 | WEPGEG | 1.882 | 1.874 | −0.425 |
| 4 | CL-20:TODAAZ | HIVGAW | 1.971 | 1.958 | −0.660 |
| 5 | CL-20:BQN | ROSMOD | 1.737 | 1.745 | 0.461 |
| 6 | CL-20:DNB | TIVJUF | 1.880 | 1.881 | 0.053 |
| 7 | CL-20:4,5-MDNI | NILCIX | 1.882 | 1.877 | −0.266 |
| 8 | HMX:PNO | WEPTAP | 1.700 | 1.697 | −0.176 |
| 9 | HMX:FA | ZEZHET | 1.687 | 1.687 | 0 |
| 10 | BTF:TNA | ZEVNUL | 1.811 | 1.819 | 0.442 |
| 11 | BTF:MATNB | GEXMON | 1.804 | 1.814 | 0.554 |
| 12 | BTF:TNA | GEXMIH | 1.884 | 1.867 | −0.902 |
| 13 | TNT:NNAP | TOZMUS | 1.539 | 1.565 | 1.689 |
| 14 | TNT:1-BN | URIJAH | 1.737 | 1.698 | −2.245 |
| 15 | TNT:Ant | URIJEL | 1.515 | 1.532 | 1.122 |
| 16 | TNT:9-BN | URIJIP | 1.688 | 1.715 | 1.600 |
| 17 | TNT:Per | URIJUB | 1.531 | 1.536 | 0.327 |
| 18 | TNT:T2 | URIKEM | 1.677 | 1.675 | −0.119 |
| 19 | TNT:DMB | URILEN | 1.501 | 1.508 | 0.466 |
| 20 | ABA:TNT | URILUD | 1.594 | 1.589 | −0.314 |
| 21 | MACIC:TZM | ACERAD | 1.605 | 1.623 | 1.121 |
| 22 | MBD:MTNB | DIFZOK | 1.522 | 1.480 | −2.760 |
| 23 | PM:UREA | EFOZAB03 | 1.644 | 1.648 | 0.243 |
| 24 | MC:PC | FIXROV01 | 1.606 | 1.661 | 3.425 |
| 25 | NDT:THTZT | FOYSUJ | 1.664 | 1.657 | −0.421 |
| 26 | IDT:NTZ | FUFSOQ | 1.644 | 1.651 | 0.426 |
| 27 | DNBA:BA | GAUTAM15 | 1.697 | 1.655 | −2.475 |
| 28 | PZ:OA | GUDSUV | 1.609 | 1.627 | 1.119 |
| 29 | TNP:MDNI | HARJOB | 1.769 | 1.768 | −0.057 |
| 30 | NF:CA | LEWTAK | 1.627 | 1.627 | 0 |
| 31 | NF:UREA | ORUXUV | 1.661 | 1.652 | −0.542 |
| 32 | NPO:PA | OWIYEZ | 1.682 | 1.653 | −1.724 |
| 33 | UREA:CA | PANVUV | 1.672 | 1.654 | −1.077 |
| 34 | PZCX:DHXBED | PAQNOM | 1.628 | 1.608 | −1.229 |
| 35 | DNPA:ODADA | QARQUY | 1.775 | 1.772 | −0.169 |
| 36 | TNP:TAD | QONYUP | 1.685 | 1.655 | −1.780 |
| 37 | IZO:DLTA | RUWPEG | 1.656 | 1.648 | −0.483 |
| 38 | IZO:LTA | UHACIQ | 1.631 | 1.646 | 0.920 |
| 39 | IZO:LTA | UHAFEP | 1.607 | 1.616 | 0.560 |
| 40 | DNBZA:TZ | UNAWUD | 1.640 | 1.655 | 0.915 |
| 41 | TZTM:HP | YAFFUJ | 1.636 | 1.635 | −0.061 |
| 42 | BM:TNP | YUQHEY | 1.616 | 1.663 | 2.908 |
| | | | | | |
| 43 | CL-20:MTNP | QAPNAZ | 1.932 | 1.928 | −0.207 |
| 44 | CL-20:GTA | XAQFUS | 1.650 | 1.571 | −4.788 |
| 45 | CL-20:NFQN | ROSMIX | 1.774 | 1.659 | −6.483 |
| 46 | DHDS:TZM | ACETEJ | 1.625 | 1.655 | 1.846 |
| 47 | AB:MTNB | FONHOH | 1.442 | 1.513 | 4.924 |
| 48 | DNBA:TA | IJAKAH | 1.635 | 1.655 | 1.223 |
| 49 | NMI:NMI | ITIXUE | 1.660 | 1.657 | −0.181 |
| 50 | AN:HP | JOZZED | 1.614 | 1.647 | 2.045 |
| 51 | Urea:OA | UROXAM | 1.679 | 1.605 | −4.407 |
| | | | | | |
| 52 | CL-20:DNG | JABYOD | 1.750 | 1.770 | 1.143 |
| 53 | HMX:PDCA | ZEZGOC | 1.630 | 1.658 | 1.718 |
| 54 | BTF:TNB | GEXMED | 1.806 | 1.838 | 1.772 |
| 55 | TNT:DMDBT | URIKUC | 1.496 | 1.523 | 1.805 |
| 56 | TNT:PDA | URILAJ | 1.578 | 1.561 | −1.077 |
| 57 | TNT:TNB | NIBJUF | 1.640 | 1.653 | 0.793 |
| 58 | DNBZA:NA | AWUDEB | 1.607 | 1.671 | 3.983 |
| 59 | PZCX:OA | UZODUK | 1.628 | 1.651 | 1.413 |
| 60 | TZA:NDTZI | VAZBIJ | 1.790 | 1.698 | −5.140 |
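The Re% column and the summary metrics quoted in the abstract can be reproduced from pairs of experimental and predicted densities. The sketch below uses only rows 52–60 of the table above, so its RMSE, MAE, and R² describe that nine-row subset and are not expected to match the whole-dataset values. The function names are hypothetical, and R² is computed here as 1 − SS_res/SS_tot, which is an assumption about the definition used.

```python
import math

# Experimental and ANN-predicted densities (g cm^-3), rows 52-60 of the table above.
experimental = [1.750, 1.630, 1.806, 1.496, 1.578, 1.640, 1.607, 1.628, 1.790]
predicted = [1.770, 1.658, 1.838, 1.523, 1.561, 1.653, 1.671, 1.651, 1.698]


def relative_error_percent(exp, pred):
    """Signed relative error of a prediction, in percent."""
    return 100.0 * (pred - exp) / exp


def rmse(exp_values, pred_values):
    """Root mean square error."""
    squared = [(p - e) ** 2 for e, p in zip(exp_values, pred_values)]
    return math.sqrt(sum(squared) / len(squared))


def mae(exp_values, pred_values):
    """Mean absolute error."""
    return sum(abs(p - e) for e, p in zip(exp_values, pred_values)) / len(exp_values)


def r_squared(exp_values, pred_values):
    """Coefficient of determination as 1 - SS_res / SS_tot (assumed definition)."""
    mean_exp = sum(exp_values) / len(exp_values)
    ss_res = sum((e - p) ** 2 for e, p in zip(exp_values, pred_values))
    ss_tot = sum((e - mean_exp) ** 2 for e in exp_values)
    return 1.0 - ss_res / ss_tot


re_values = [round(relative_error_percent(e, p), 3) for e, p in zip(experimental, predicted)]
print(re_values)
print(rmse(experimental, predicted), mae(experimental, predicted), r_squared(experimental, predicted))
```

The first and last entries of `re_values` agree with the tabulated Re% for rows 52 and 60 (1.143 and −5.140).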
Prediction results of the 60 organic cocrystals using surface electrostatic potential correction models (g cm⁻³ for the density unit)
| No. | Co-formers | Ref. code | Experimental density | Predicted density | Re% |
|---|---|---|---|---|---|
| 1 | CL-20:TNT | IZUZUZ | 1.911 | 1.853 | −3.023 |
| 2 | CL-20:DNG | JABYOD | 1.750 | 1.811 | 3.464 |
| 3 | CL-20:MTNP | QAPNAZ | 1.932 | 1.882 | −2.574 |
| 4 | CL-20:AZ2 | TETTAQ | 1.939 | 1.877 | −3.213 |
| 5 | CL-20:NEX-1 | WEPGEG | 1.882 | 1.898 | 0.837 |
| 6 | CL-20:GTA | XAQFUS | 1.650 | 1.749 | 6.010 |
| 7 | CL-20:TODAAZ | HIVGAW | 1.971 | 1.878 | −4.721 |
| 8 | CL-20:NFQN | ROSMIX | 1.774 | 1.812 | 2.155 |
| 9 | CL-20:BQN | ROSMOD | 1.737 | 1.839 | 5.864 |
| 10 | CL-20:DNB | TIVJUF | 1.880 | 1.860 | −1.070 |
| 11 | CL-20:4,5-MDNI | NILCIX | 1.882 | 1.849 | −1.770 |
| 12 | HMX:PNO | WEPTAP | 1.700 | 1.698 | −0.094 |
| 13 | HMX:FA | ZEZHET | 1.687 | 1.741 | 3.205 |
| 14 | HMX:PDCA | ZEZGOC | 1.630 | 1.698 | 4.164 |
| 15 | BTF:TNA | ZEVNUL | 1.811 | 1.876 | 3.612 |
| 16 | BTF:TNB | GEXMED | 1.806 | 1.823 | 0.940 |
| 17 | BTF:MATNB | GEXMON | 1.804 | 1.807 | 0.178 |
| 18 | BTF:TNA | GEXMIH | 1.884 | 1.820 | −3.414 |
| 19 | TNT:NNAP | TOZMUS | 1.539 | 1.627 | 5.740 |
| 20 | TNT:1-BN | URIJAH | 1.737 | 1.740 | 0.151 |
| 21 | TNT:Ant | URIJEL | 1.515 | 1.565 | 3.305 |
| 22 | TNT:9-BN | URIJIP | 1.688 | 1.712 | 1.404 |
| 23 | TNT:Per | URIJUB | 1.531 | 1.540 | 0.616 |
| 24 | TNT:T2 | URIKEM | 1.677 | 1.556 | −7.213 |
| 25 | TNT:DMDBT | URIKUC | 1.496 | 1.544 | 3.187 |
| 26 | TNT:PDA | URILAJ | 1.578 | 1.623 | 2.882 |
| 27 | TNT:DMB | URILEN | 1.501 | 1.585 | 5.585 |
| 28 | TNT:TNB | NIBJUF | 1.640 | 1.744 | 6.364 |
| 29 | ABA:TNT | URILUD | 1.594 | 1.636 | 2.632 |
| 30 | MACIC:TZM | ACERAD | 1.605 | 1.621 | 1.027 |
| 31 | DHDS:TZM | ACETEJ | 1.625 | 1.667 | 2.555 |
| 32 | DNBZA:NA | AWUDEB | 1.607 | 1.633 | 1.648 |
| 33 | MBD:MTNB | DIFZOK | 1.522 | 1.589 | 4.434 |
| 34 | PM:UREA | EFOZAB03 | 1.644 | 1.574 | −4.243 |
| 35 | MC:PC | FIXROV01 | 1.606 | 1.614 | 0.470 |
| 36 | AB:MTNB | FONHOH | 1.442 | 1.509 | 4.618 |
| 37 | NDT:THTZT | FOYSUJ | 1.664 | 1.685 | 1.237 |
| 38 | IDT:NTZ | FUFSOQ | 1.644 | 1.676 | 1.931 |
| 39 | DNBA:BA | GAUTAM15 | 1.697 | 1.555 | −8.347 |
| 40 | PZ:OA | GUDSUV | 1.609 | 1.577 | −1.977 |
| 41 | TNP:MDNI | HARJOB | 1.769 | 1.745 | −1.367 |
| 42 | DNBA:TA | IJAKAH | 1.635 | 1.679 | 2.712 |
| 43 | NMI:NMI | ITIXUE | 1.660 | 1.668 | 0.504 |
| 44 | AN:HP | JOZZED | 1.614 | 1.579 | −2.151 |
| 45 | NF:CA | LEWTAK | 1.627 | 1.670 | 2.634 |
| 46 | NF:UREA | ORUXUV | 1.661 | 1.689 | 1.673 |
| 47 | NPO:PA | OWIYEZ | 1.682 | 1.729 | 2.806 |
| 48 | UREA:CA | PANVUV | 1.672 | 1.680 | 0.477 |
| 49 | PZCX:DHXBED | PAQNOM | 1.628 | 1.649 | 1.302 |
| 50 | DNPA:ODADA | QARQUY | 1.775 | 1.708 | −3.761 |
| 51 | TNP:TAD | QONYUP | 1.685 | 1.652 | −1.930 |
| 52 | IZO:DLTA | RUWPEG | 1.656 | 1.645 | −0.686 |
| 53 | IZO:LTA | UHACIQ | 1.631 | 1.617 | −0.873 |
| 54 | IZO:LTA | UHAFEP | 1.607 | 1.630 | 1.420 |
| 55 | DNBZA:TZ | UNAWUD | 1.640 | 1.689 | 3.015 |
| 56 | Urea:OA | UROXAM | 1.679 | 1.691 | 0.707 |
| 57 | PZCX:OA | UZODUK | 1.628 | 1.613 | −0.942 |
| 58 | TZA:NDTZI | VAZBIJ | 1.790 | 1.705 | −4.740 |
| 59 | TZTM:HP | YAFFUJ | 1.636 | 1.562 | −4.515 |
| 60 | BM:TNP | YUQHEY | 1.632 | 1.649 | 1.028 |
Parameters and the predicted density of the 6 optimized cocrystalsᵃ
| Co-formers | Ref. code | M | Vm | M/Vm | υσ²tot | Experimental density | Predicted density | Re% |
|---|---|---|---|---|---|---|---|---|
| CL-20:AZ2 | TETTAQ | 672.320 | 498.763 | 1.348 | 42.282 | 1.939 | 1.992 | 2.733 |
| TNT:NNAP | TOZMUS | 400.302 | 361.863 | 1.106 | 35.107 | 1.539 | 1.620 | 5.263 |
| TNT:1-BN | URIJAH | 434.201 | 359.316 | 1.208 | 28.743 | 1.737 | 1.777 | 2.303 |
| TNT:DMDBT | URIKUC | 431.363 | 421.063 | 1.044 | 24.216 | 1.496 | 1.524 | 1.872 |
| TNT:PDA | URILAJ | 341.275 | 310.756 | 1.079 | 31.515 | 1.578 | 1.578 | 0 |
| TNT:DMB | URILEN | 365.297 | 344.786 | 1.059 | 24.105 | 1.501 | 1.547 | 3.065 |
ᵃ M is in g mol⁻¹, Vm in Å³, υσ²tot in (kcal mol⁻¹)², and all densities are in g cm⁻³.
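The quantities in this table feed a surface electrostatic potential correction of the kind proposed by Politzer and co-workers, which is commonly written as a linear combination of M/Vm and υσ²tot. The sketch below assumes that linear form; the coefficient values are not given in this section, so `alpha`, `beta`, and `gamma` are left as caller-supplied arguments. All function names are hypothetical. The example also recomputes M/Vm and Re% for the first two rows of the table.

```python
def mass_to_volume_ratio(molar_mass, molecular_volume):
    """M / Vm, with M in g mol^-1 and Vm in cubic angstroms, as tabulated."""
    return molar_mass / molecular_volume


def corrected_density(ratio, nu_sigma2_tot, alpha, beta, gamma):
    """Assumed linear correction: alpha * (M/Vm) + beta * (nu sigma^2 tot) + gamma.

    alpha, beta, and gamma are fitted coefficients that must be supplied by
    the caller; no values are taken from the document.
    """
    return alpha * ratio + beta * nu_sigma2_tot + gamma


def relative_error_percent(exp, pred):
    """Signed relative error of a prediction, in percent."""
    return 100.0 * (pred - exp) / exp


# (ref. code, M, Vm, experimental density, predicted density) from the table above.
rows = [
    ("TETTAQ", 672.320, 498.763, 1.939, 1.992),
    ("TOZMUS", 400.302, 361.863, 1.539, 1.620),
]

checks = {}
for ref_code, molar_mass, volume, rho_exp, rho_pred in rows:
    ratio = round(mass_to_volume_ratio(molar_mass, volume), 3)
    re_percent = round(relative_error_percent(rho_exp, rho_pred), 3)
    checks[ref_code] = (ratio, re_percent)
    print(ref_code, ratio, re_percent)
```

For these two rows the recomputed ratios and relative errors agree with the tabulated M/Vm and Re% values.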
Parameters and the predicted density of the 6 unoptimized cocrystalsᵃ
| Co-formers | Ref. code | M | Vm | M/Vm | υσ²tot | Experimental density | Predicted density | Re% |
|---|---|---|---|---|---|---|---|---|
| CL-20:AZ2 | TETTAQ | 672.320 | 492.017 | 1.366 | 45.318 | 1.939 | 1.929 | −0.516 |
| TNT:NNAP | TOZMUS | 400.302 | 349.462 | 1.145 | 37.773 | 1.539 | 1.574 | 2.274 |
| TNT:1-BN | URIJAH | 434.201 | 347.532 | 1.249 | 31.624 | 1.737 | 1.741 | 0.23 |
| TNT:DMDBT | URIKUC | 431.363 | 401.162 | 1.075 | 26.426 | 1.496 | 1.461 | −2.34 |
| TNT:PDA | URILAJ | 341.275 | 299.556 | 1.139 | 43.138 | 1.578 | 1.564 | −0.887 |
| TNT:DMB | URILEN | 365.297 | 328.383 | 1.112 | 26.551 | 1.501 | 1.521 | 1.332 |
ᵃ M is in g mol⁻¹, Vm in Å³, υσ²tot in (kcal mol⁻¹)², and all densities are in g cm⁻³.
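One simple way to compare the optimized and unoptimized results for these six cocrystals is to average the absolute Re% values from the two tables. The short sketch below does only that arithmetic; the helper name is hypothetical and the input values are copied from the Re% columns above.

```python
# Re% values for the same six cocrystals, copied from the two tables above.
optimized_re = [2.733, 5.263, 2.303, 1.872, 0.0, 3.065]
unoptimized_re = [-0.516, 2.274, 0.23, -2.34, -0.887, 1.332]


def mean_absolute(values):
    """Mean of the absolute values of a list of signed errors."""
    return sum(abs(v) for v in values) / len(values)


mean_optimized = mean_absolute(optimized_re)
mean_unoptimized = mean_absolute(unoptimized_re)
print(round(mean_optimized, 3), round(mean_unoptimized, 3))
```

For these six structures the mean absolute Re% is about 2.54 with optimized geometries and about 1.26 with unoptimized ones.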
Comparison of the prediction results of the two organic cocrystal density prediction models
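| No. | Co-formers | Ref. code | Re% (Model-I, ANN) | Re% (Model-II, surface electrostatic potential correction) | Difference in absolute Re% (Model-I − Model-II) |
|---|---|---|---|---|---|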
| 1 | CL-20:TNT | IZUZUZ | 0.994 | −3.023 | −2.029 |
| 2 | CL-20:DNG | JABYOD | −0.052 | 3.464 | −3.412 |
| 3 | CL-20:MTNP | QAPNAZ | −0.425 | −2.574 | −2.149 |
| 4 | CL-20:AZ2 | TETTAQ | −0.660 | −3.213 | −2.553 |
| 5 | CL-20:NEX-1 | WEPGEG | 0.461 | 0.837 | −0.376 |
| 6 | CL-20:GTA | XAQFUS | 0.053 | 6.010 | −5.957 |
| 7 | CL-20:TODAAZ | HIVGAW | −0.266 | −4.721 | −4.455 |
| 8 | CL-20:NFQN | ROSMIX | −0.176 | 2.155 | −1.979 |
| 9 | CL-20:BQN | ROSMOD | 0 | 5.864 | −5.864 |
| 10 | CL-20:DNB | TIVJUF | 0.442 | −1.070 | −0.628 |
| 11 | CL-20:4,5-MDNI | NILCIX | 0.554 | −1.770 | −1.216 |
| 12 | HMX:PNO | WEPTAP | −0.902 | −0.094 | 0.808 |
| 13 | HMX:FA | ZEZHET | 1.689 | 3.205 | −1.516 |
| 14 | HMX:PDCA | ZEZGOC | −2.245 | 4.164 | −1.919 |
| 15 | BTF:TNA | ZEVNUL | 1.122 | 3.612 | −2.49 |
| 16 | BTF:TNB | GEXMED | 1.600 | 0.940 | 0.66 |
| 17 | BTF:MATNB | GEXMON | 0.327 | 0.178 | 0.149 |
| 18 | BTF:TNA | GEXMIH | −0.119 | −3.414 | −3.295 |
| 19 | TNT:NNAP | TOZMUS | 0.466 | 5.740 | −5.274 |
| 20 | TNT:1-BN | URIJAH | −0.314 | 0.151 | 0.163 |
| 21 | TNT:Ant | URIJEL | 1.121 | 3.305 | −2.184 |
| 22 | TNT:9-BN | URIJIP | −2.760 | 1.404 | 1.356 |
| 23 | TNT:Per | URIJUB | 0.243 | 0.616 | −0.373 |
| 24 | TNT:T2 | URIKEM | 3.425 | −7.213 | −3.788 |
| 25 | TNT:DMDBT | URIKUC | −0.421 | 3.187 | −2.766 |
| 26 | TNT:PDA | URILAJ | 0.426 | 2.882 | −2.456 |
| 27 | TNT:DMB | URILEN | −2.475 | 5.585 | −3.11 |
| 28 | TNT:TNB | NIBJUF | 1.119 | 6.364 | −5.245 |
| 29 | ABA:TNT | URILUD | −0.057 | 2.632 | −2.575 |
| 30 | MACIC:TZM | ACERAD | 0 | 1.027 | −1.027 |
| 31 | DHDS:TZM | ACETEJ | −0.542 | 2.555 | −2.013 |
| 32 | DNBZA:NA | AWUDEB | −1.724 | 1.648 | 0.076 |
| 33 | MBD:MTNB | DIFZOK | −1.077 | 4.434 | −3.357 |
| 34 | PM:UREA | EFOZAB03 | −1.229 | −4.243 | −3.014 |
| 35 | MC:PC | FIXROV01 | −0.169 | 0.470 | −0.301 |
| 36 | AB:MTNB | FONHOH | −1.780 | 4.618 | −2.838 |
| 37 | NDT:THTZT | FOYSUJ | −0.483 | 1.237 | −0.754 |
| 38 | IDT:NTZ | FUFSOQ | 0.920 | 1.931 | −1.011 |
| 39 | DNBA:BA | GAUTAM15 | 0.560 | −8.347 | −7.787 |
| 40 | PZ:OA | GUDSUV | 0.915 | −1.977 | −1.062 |
| 41 | TNP:MDNI | HARJOB | −0.061 | −1.367 | −1.306 |
| 42 | DNBA:TA | IJAKAH | 2.908 | 2.712 | 0.196 |
| 43 | NMI:NMI | ITIXUE | −0.207 | 0.504 | −0.297 |
| 44 | AN:HP | JOZZED | −4.788 | −2.151 | 2.637 |
| 45 | NF:CA | LEWTAK | −6.483 | 2.634 | 3.849 |
| 46 | NF:UREA | ORUXUV | 1.846 | 1.673 | 0.173 |
| 47 | NPO:PA | OWIYEZ | 4.924 | 2.806 | 2.118 |
| 48 | UREA:CA | PANVUV | 1.223 | 0.477 | 0.746 |
| 49 | PZCX:DHXBED | PAQNOM | −0.181 | 1.302 | −1.121 |
| 50 | DNPA:ODADA | QARQUY | 2.045 | −3.761 | −1.716 |
| 51 | TNP:TAD | QONYUP | −4.407 | −1.930 | 2.477 |
| 52 | IZO:DLTA | RUWPEG | 1.143 | −0.686 | 0.457 |
| 53 | IZO:LTA | UHACIQ | 1.718 | −0.873 | 0.845 |
| 54 | IZO:LTA | UHAFEP | 1.772 | 1.420 | 0.352 |
| 55 | DNBZA:TZ | UNAWUD | 1.805 | 3.015 | −1.21 |
| 56 | Urea:OA | UROXAM | −1.077 | 0.707 | 0.37 |
| 57 | PZCX:OA | UZODUK | 0.793 | −0.942 | −0.149 |
| 58 | TZA:NDTZI | VAZBIJ | 3.983 | −4.740 | −0.757 |
| 59 | TZTM:HP | YAFFUJ | 1.413 | −4.515 | −3.102 |
| 60 | BM:TNP | YUQHEY | −5.140 | 1.028 | 4.112 |
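The last column of this table appears to be the absolute Re% of the first model minus the absolute Re% of the second, so a negative entry means the first model's prediction is closer to experiment for that row. The sketch below checks this reading against rows 1, 12, and 16 as printed; the function and variable names are hypothetical.

```python
def abs_error_difference(re_model_1, re_model_2):
    """Absolute Re% of the first model minus absolute Re% of the second."""
    return abs(re_model_1) - abs(re_model_2)


# (row number, first Re% column, second Re% column) as printed in the table above.
rows = [
    (1, 0.994, -3.023),
    (12, -0.902, -0.094),
    (16, 1.600, 0.940),
]

differences = {no: round(abs_error_difference(a, b), 3) for no, a, b in rows}
model_1_closer = [no for no, diff in differences.items() if diff < 0]
print(differences, model_1_closer)
```

The computed differences (−2.029, 0.808, and 0.66) match the tabulated entries for those three rows.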
Fig. 2 Predicted densities of the cocrystals vs. experimental data for the full dataset: (a) the ANN model and (b) the Politzer model.