| Literature DB >> 29880730 |
Mauro Banchero1, Luigi Manna2.
Abstract
Critical properties and acentric factor are widely used in phase equilibrium calculations but are difficult to evaluate with high accuracy for many organic compounds. Quantitative Structure-Property Relationship (QSPR) models are a powerful tool to establish accurate correlation between molecular properties and chemical structure. QSPR multi-linear (MLR) and radial basis-function-neural-network (RBFNN) models have been developed to predict the critical temperature, critical pressure and acentric factor of a database of 306 organic compounds. RBFNN models provided better data correlation and higher predictive capability (an AAD% of 0.92⁻2.0% for training and 1.7⁻4.8% for validation sets) than MLR models (an AAD% of 3.2⁻8.7% for training and 6.2⁻12.2% for validation sets). The RMSE of the RBFNN models was 20⁻30% of the MLR ones. The correlation and predictive performances of the models for critical temperature were higher than those for critical pressure and acentric factor, which was the most difficult property to predict. However, the RBFNN model for the acentric factor resulted in the lowest RMSE with respect to previous literature. The close relationship between the three properties resulted from the selected molecular descriptors, which are mostly related to molecular electronic charge distribution or polar interactions between molecules. QSPR correlations were compared with the most frequently used group-contribution methods over the same database of compounds: although the MLR models provided comparable results, the RBFNN ones resulted in significantly higher performance.Entities:
Keywords: QSPR models; acentric factor; critical properties; heuristic method; molecular descriptors; radial basis function neural networks
Mesh:
Substances:
Year: 2018 PMID: 29880730 PMCID: PMC6100065 DOI: 10.3390/molecules23061379
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Parity plot of the calculated versus the experimental values of critical temperature: (a) MLR model; (b) RBFNN model.
Figure 2Parity plot of the calculated versus the experimental values of critical pressure: (a) MLR model; (b) RBFNN model.
Figure 3Parity plot of the calculated versus the experimental values of acentric factor: (a) MLR model; (b) RBFNN model.
Comparison between the MLR and the RBFNN models for critical temperature, critical pressure and acentric factor.
| MLR Model | RBFNN Model | ||||
|---|---|---|---|---|---|
| Training Set | Validation Set | Training Set | Validation Set | ||
|
| total number of compounds | 215 | 91 | 215 | 91 |
| compounds with AD% > 10% | 8 | 9 | - | 1 | |
| compounds with AD% < 5% | 184 | 49 | 203 | 80 | |
| AAD% | 3.2% | 6.2% | 0.92% | 1.7% | |
| RMSE (K) | 22.0 | 37.4 | 7.2 | 11.9 | |
|
| total number of compounds | 215 | 91 | 215 | 91 |
| compounds with AD% > 10% | 40 | 25 | - | 3 | |
| compounds with AD% < 5% | 124 | 45 | 171 | 60 | |
| AAD% | 6.1% | 8.5% | 1.9% | 3.5% | |
| RMSE (MPa) | 0.40 | 0.47 | 0.11 | 0.18 | |
|
| total number of compounds | 215 | 91 | 215 | 91 |
| compounds with AD% > 10% | 65 | 45 | 1 | 7 | |
| compounds with AD% < 5% | 98 | 25 | 168 | 39 | |
| AAD% | 8.7% | 12.2% | 2.0% | 4.8% | |
| RMSE (−) | 0.040 | 0.066 | 0.0086 | 0.023 | |
Comparison between this work and literature QSPR methods for critical temperature, critical pressure and acentric factor.
|
|
|
| ||
|---|---|---|---|---|
| RMSE for MLR models | Egolf and coworkers [ | 12 K | - | - |
| Katritzky and coworkers [ | 15 K | - | - | |
| Turner and coworkers [ | 7.7 K | 0.16 MPa | - | |
| Sola and coworkers [ | 12 K | 0.25 MPa | - | |
| Sobati and Abooali [ | 16.3 K | 0.27 MPa | - | |
| this work (1) | 27.5 K | 0.42 MPa | 0.049 | |
| RMSE for ANN models | Espinosa and coworkers [ | 5.6 K | 0.08 MPa | - |
| Gharagheizi and Mehrpooya [ | 18 K | 0.17 MPa | 0.032 | |
| Yao and coworkers [ | 14 K | - | - | |
| Yao and coworkers [ | 0.15 MPa | - | ||
| this work (1) | 8.8 K | 0.13 MPa | 0.015 | |
(1) Calculated on the whole database (training + validation sets).
Comparison between MLR, RBFNN and Gani’s GC methods [4,5,6] for critical temperature, critical pressure and acentric factor.
|
|
|
| ||
|---|---|---|---|---|
| AAD% | MLR model (1) | 4.1% | 6.8% | 9.7% |
| RBFNN model (1) | 1.2% | 2.3% | 2.8% | |
| Gani’s GC method (2) | 2.7% | 8.5% | 14.1% | |
| RMSE | MLR model (1) | 27.5 K | 0.42 MPa | 0.049 |
| RBFNN model (1) | 8.8 K | 0.13 MPa | 0.015 | |
| Gani’s GC method (2) | 31.1 K | 0.48 MPa | 0.099 | |
(1) Calculated on the whole database (training + validation sets); (2) AAD% and RMSE for Gani’s GC methods were calculated on the same database of this work.
Selected molecular descriptors for the QSPR models of critical temperature, critical pressure and acentric factor.
| Descriptor | Group | |
|---|---|---|
|
| Relative number of F atoms | Constitutional descriptor |
| Number of aromatic bonds | Constitutional descriptor | |
| Relative number of rings | Constitutional descriptor | |
| Relative molecular weight | Constitutional descriptor | |
| Moment of inertia B | Geometrical descriptor | |
| HASA2/TMSA 1/2 | Electrostatic descriptor | |
| HDCA2/TMSA | Electrostatic descriptor | |
| Topographic electronic index (all pairs) | Electrostatic descriptor | |
| Randic index (order 1) | Topological descriptor | |
| Structural Information content (order 0) | Topological descriptor | |
|
| Number of Cl atoms | Constitutional descriptor |
| Relative number of rings | Constitutional descriptor | |
| Molecular volume | Geometrical descriptor | |
| Moment of inertia C | Geometrical descriptor | |
| HASA1 | Electrostatic descriptor | |
| HDSA1/TMSA | Electrostatic descriptor | |
| FPSA3 | Electrostatic descriptor | |
| Relative negative charged SA | Electrostatic descriptor | |
| Relative positive charged SA | Electrostatic descriptor | |
| count of H-donors sites | Electrostatic descriptor | |
|
| Relative number of double bonds | Constitutional descriptor |
| Molecular surface area | Geometrical descriptor | |
| Gravitation index (all bonds) | Geometrical descriptor | |
| HDCA2 | Electrostatic descriptor | |
| PNSA3 | Electrostatic descriptor | |
| Polarity parameter (Qmax − Qmin) | Electrostatic descriptor | |
| count of H-donors sites | Electrostatic descriptor | |
| Topographic electronic index (all bonds) | Electrostatic descriptor | |
| Structural Information content (order 0) | Topological descriptor | |
| Kier & Hall index (order 2) | Topological descriptor |