| Literature DB >> 29950681 |
Nastaran Meftahi1, Michael L Walker1, Marta Enciso1, Brian J Smith2.
Abstract
The enthalpy and Gibbs energy of sublimation are predicted using quantitative structure property relationship (QSPR) models. In this study, we compare several approaches previously reported in the literature for predicting the enthalpy of sublimation. These models, which were reproduced successfully, exhibit high correlation coefficients, in the range 0.82 to 0.97. There are significantly fewer examples of QSPR models currently described in the literature that predict the Gibbs energy of sublimation; here we describe several models that build upon the previous models for predicting the enthalpy of sublimation. The most robust and predictive model constructed using multiple linear regression, with the fewest number of descriptors for estimating this property, was obtained with an R2 of the training set of 0.71, an R2 of the test set of 0.62, and a standard deviation of 9.1 kJ mol-1. This model could be improved by training using a neural network, yielding an R2 of the training and test sets of 0.80 and 0.63, respectively, and a standard deviation of 8.9 kJ mol-1.Entities:
Year: 2018 PMID: 29950681 PMCID: PMC6021403 DOI: 10.1038/s41598-018-28105-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison of original models for estimating the enthalpy of sublimation and models re-derived in the current study.
| Politzer | Gharagheizi | Bagheri | Salahinejad | Mathieu | |
|---|---|---|---|---|---|
| Equation |
|
|
|
| |
| Number of descriptors | 2 | 5 | 3 | 4 | 35 |
|
| |||||
| Dataset sizes | |||||
| (train.) | 34 | 1079 | 1269 | 1042 | 814 |
| (test) | 5 | 269 | 317 | 260 | 486 |
| R2 (train.) | 0.95 | 0.97 | 0.93 | 0.95 | 0.99 |
| R2 (test) | NAc | 0.97 | 0.93 | 0.95 | 0.99 |
| Errord | 11.7 | 5.5 | 9.8 | 7.3 | 4.1 |
| Re-derived here:e | |||||
| R2 | 0.82 | 0.97 | 0.96 | 0.89 | 0.70 |
| Std. dev.f | 13.9 | 5.4 | 5.1 | 10.3 | 17.1 |
| Largest deviationg | |||||
| Positive | 127.7 | 15.8 | 14.5 | 34.7 | 132.1 |
| Negative | −163.9 | −31.5 | −20.8 | −90.4 | −92.9 |
aThis is a modified form of the original Salahinejad et al. model, with the W1 descriptor replaced by the Hy descriptor. bResults reported in the original analysis in the literature. cNot reported. dAverage error for the Politzer et al., root mean square error (RMSE) for Gharagheizi et al., Bagheri et al. and Mathieu, and standard error estimate (SEE) for Salahinejad et al. models in kJ mol−1. eResults for the training set of 260 compounds from Salahinejad et al. re-derived here. fStandard deviation in kJ mol−1. gDeviation from experiment in kJ mol−1.
Comparison of models for estimating the Gibbs energy of sublimation.
| Model | R2 | Std. dev.b |
|---|---|---|
| Politzer | 0.23 | 17.2 |
| Gharagheizi | 0.55 | 13.1 |
| Bagheri | 0.58 | 12.7 |
| Salahinejad | 0.59 | 12.5 |
| Mathieu | 0.25 | 17.0 |
aR2 for the training set of 278 compounds from Perlovich and Raevsky. bStandard deviation.
Figure 1Comparison of predicted values of Gibbs energy of sublimation versus experimental. Training set (blue) and test set (red) generated by the MLR-based model, equation 13. Energies are in units of kJ mol−1.
Comparison of MLR models for estimating the Gibbs energy of sublimation.
| Equation | R2 | Std. dev.a | Largest deviationa | R2 | Std. dev.a,b | |
|---|---|---|---|---|---|---|
| (train.) | Positive | Negative | (test) | |||
|
| 0.71 | 10.5 | 28.8 | −45.4 | 0.66 | 8.6 |
|
| 0.51 | 12.6 | 50.2 | −38.3 | 0.29 | 12.4 |
|
| 0.66 | 10.1 | 31.3 | −37.5 | 0.54 | 9.9 |
|
| 0.67 | 10.1 | 29.9 | −33.3 | 0.56 | 9.7 |
akJ mol−1. bStandard deviation for the test set.
Bounding box definitions of domain of applicability for models for estimating the Gibbs energy of sublimation.
| Descriptor | Minimum | Maximum |
|---|---|---|
|
| −6.5 × 10−4 | 2.1 |
|
| 49.5 | 460.7 |
|
| 0 | 270.1 |
|
| 0 | 9.9 × 10−2 |
|
| 0 | 9 |
|
| 0 | 1 |
|
| 29.0 | 505.7 |
| (νσ2tot)0.5 | 4.3 × 10−1 | 10.5 |
|
| 1.2 | 4.4 |
|
| 0 | 3 |
|
| 0 | 92.4 |
|
| 6 | 216 |
|
| 1.4 | 16.9 |
|
| 3.7 | 260.8 |
Comparison of ANN and SVR models for estimating the Gibbs energy of sublimation.
| Equation |
|
|
|
|
|---|---|---|---|---|
|
| ||||
| R2 (train.) | 0.80 | 0.59 | 0.71 | 0.73 |
| Std. dev.a | 8.9 | 16.6 | 9.4 | 9.6 |
| Largest deviationa | ||||
| Positive | 28.1 | 38.8 | 40.2 | 36.9 |
| Negative | −31.3 | −41.3 | −32.8 | −33.0 |
| R2 (test) | 0.63 | −0.32 | 0.59 | 0.57 |
| Std. dev.a,b | 8.7 | 11.5 | 9.3 | 9.5 |
|
| ||||
| R2 (train.) | 0.77 | 0.51 | 0.70 | 0.71 |
| Std. dev.a | 9.2 | 12.4 | 9.8 | 9.6 |
| Largest deviationa | ||||
| Positive | 38.5 | 38.0 | 37.3 | 39.5 |
| Negative | −36.5 | −53.3 | −33.0 | −34.8 |
| R2 (test) | 0.61 | 0.26 | 0.56 | 0.58 |
| Std. dev.a,b | 9.0 | 11.9 | 9.8 | 9.5 |
akJ mol−1. bStandard deviation for the test set.