| Literature DB >> 27527144 |
José F Aranda1, Juan C Garro Martinez2, Eduardo A Castro3, Pablo R Duchowicz4.
Abstract
We predict the soil sorption coefficient for a heterogeneous set of 643 organic non-ionic compounds by means of Quantitative Structure-Property Relationships (QSPR). A conformation-independent representation of the chemical structure is established. The 17,538 molecular descriptors derived with PaDEL and EPI Suite softwares are simultaneously analyzed through linear regressions obtained with the Replacement Method variable subset selection technique. The best predictive three-descriptors QSPR is developed on a reduced training set of 93 chemicals, having an acceptable predictive capability on 550 test set compounds. We also establish a model with a single optimal descriptor derived from CORAL freeware. The present approach compares fairly well with a previously reported one that uses Dragon descriptors.Entities:
Keywords: Correlation and Logic software; Estimation Program Interface Suite software; Pharmaceutical Data Exploration Laboratory software; Quantitative Structure-Property Relationships; Replacement Method; soil sorption coefficient
Mesh:
Substances:
Year: 2016 PMID: 27527144 PMCID: PMC5000645 DOI: 10.3390/ijms17081247
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
The best linear QSPR models obtained from a pool of 3491 geometry independent descriptors obtained from PaDEL freeware; the selected model appears in bold.
| Descriptors | |||||
|---|---|---|---|---|---|
| 1 | 0.72 | 0.68 | 0.65 | 0.67 | |
| 2 | 0.80 | 0.76 | 0.55 | 0.59 | |
| 3 | 0.84 | 0.79 | 0.49 | 0.56 | |
| 5 | 0.87 | 0.81 | 0.44 | 0.52 | |
| 6 | 0.89 | 0.81 | 0.41 | 0.53 |
Figure 1Predicted and experimental values according to QSPR based on Equation (1).
The stepwise search for finding the best structural attributes contributing the optimal descriptor; the selected result appears in bold. , Nearest Neighboring Code; , Morgan Extended Connectivity of zero-th order; , the presence of Nitrogen, Oxygen, Sulfur or Phosphorus.
| Structural Attributes | |||||
|---|---|---|---|---|---|
| 0.84 | 0.73 | 0.49 | 0.64 | 50 | |
| 0.86 | 0.75 | 0.46 | 0.62 | 70 | |
An example of the calculation of the optimal descriptor for formaldehyde by summing CW values: .
| Structural Attribute | |
|---|---|
| EC0-O...1... | 0.12508 |
| EC0-C...3... | 1.00094 |
| EC0-H...1... | −0.18254 |
| EC0-H...1... | −0.18254 |
| NNC-O...101. | 0.24867 |
| NNC-C...303. | −0.75284 |
| NNC-H...101. | −0.07978 |
| NNC-H...101. | −0.07978 |
| NOSP01000000 | −0.74613 |
The best linear QSPR models obtained from a pool of 3493 geometry independent descriptors obtained from PaDEL and EPI Suite softwares; the selected model appears in bold.
| Descriptors | |||||
|---|---|---|---|---|---|
| 1 | 0.77 | 0.76 | 0.59 | 0.59 | |
| 2 | 0.86 | 0.83 | 0.46 | 0.50 | |
| 4 | 0.88 | 0.84 | 0.42 | 0.48 | |
| 5 | 0.90 | 0.84 | 0.40 | 0.49 | |
| 6 | 0.91 | 0.84 | 0.37 | 0.49 |
Figure 2Predicted and experimental values according to QSPR based on Equation (3).