| Literature DB >> 31497695 |
David van der Spoel1, Sergio Manzetti1,2, Haiyang Zhang3, Andreas Klamt4,5.
Abstract
The partitioning of compounds between aqueous and other phases is important for predicting toxicity. Although thousands of octanol-water partition coefficients have been measured, these represent only a small fraction of the anthropogenic compounds present in the environment. The octanol phase is often taken to be a mimic of the inner parts of phospholipid membranes. However, the core of such membranes is typically more hydrophobic than octanol, and other partition coefficients with other compounds may give complementary information. Although a number of (cheap) empirical methods exist to compute octanol-water (log k OW) and hexadecane-water (log k HW) partition coefficients, it would be interesting to know whether physics-based models can predict these crucial values more accurately. Here, we have computed log k OW and log k HW for 133 compounds from seven different pollutant categories as well as a control group using the solvation model based on electronic density (SMD) protocol based on Hartree-Fock (HF) or density functional theory (DFT) and the COSMO-RS method. For comparison, XlogP3 (log k OW) values were retrieved from the PubChem database, and KowWin log k OW values were determined as well. For 24 of these compounds, log k OW was computed using potential of mean force (PMF) calculations based on classical molecular dynamics simulations. A comparison of the accuracy of the methods shows that COSMO-RS, KowWin, and XlogP3 all have a root-mean-square deviation (rmsd) from the experimental data of ≈0.4 log units, whereas the SMD protocol has an rmsd of 1.0 log units using HF and 0.9 using DFT. PMF calculations yield the poorest accuracy (rmsd = 1.1 log units). Thirty-six out of 133 calculations are for compounds without known log k OW, and for these, we provide what we consider a robust prediction, in the sense that there are few outliers, by averaging over the methods. The results supplied may be instrumental when developing new methods in computational ecotoxicity. The log k HW values are found to be strongly correlated to log k OW for most compounds.Entities:
Year: 2019 PMID: 31497695 PMCID: PMC6713992 DOI: 10.1021/acsomega.9b01277
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Statistics for Prediction of log kOW per Method and Compound Class and Number of (Neutral) Compounds Included Is Determined by the Availability of Experimental Dataa
| HF | BP86 | COSMO-RS | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| class | rmsd | MSE | rmsd | MSE | rmsd | MSE | ||||||
| control | 12 | 0.98 | 0.57 | –0.33 | 12 | 0.99 | 0.43 | –0.08 | 12 | 0.99 | 0.22 | –0.04 |
| haloalkane | 8 | 0.98 | 0.42 | 0.37 | 8 | 0.97 | 0.46 | 0.41 | 8 | 0.98 | 0.18 | 0.01 |
| haloaromatic | 12 | 0.99 | 0.47 | –0.44 | 12 | 0.67 | 0.63 | –0.11 | 12 | 0.74 | 0.42 | –0.05 |
| PAH | 17 | 0.89 | 1.51 | –1.35 | 17 | 0.71 | 1.36 | –1.15 | 17 | 0.93 | 0.41 | –0.18 |
| PCB | 23 | 0.87 | 0.69 | –0.64 | 23 | 0.86 | 0.51 | –0.44 | 23 | 0.88 | 0.30 | –0.19 |
| PFC | 6 | 0.88 | 0.38 | –0.13 | 6 | 0.54 | 0.60 | –0.09 | 6 | 0.67 | 0.89 | 0.76 |
| PRB | 7 | 0.98 | 1.82 | –1.81 | 7 | 0.95 | 1.77 | –1.75 | 7 | 0.99 | 0.31 | –0.15 |
| PHT | 12 | 0.83 | 1.09 | –0.42 | 12 | 0.93 | 1.00 | –0.76 | 12 | 0.99 | 0.56 | 0.47 |
| all | 97 | 0.87 | 0.99 | –0.64 | 97 | 0.87 | 0.92 | –0.52 | 97 | 0.96 | 0.42 | 0.01 |
Number of compounds N, squared correlation coefficient r2, rmsd from experiment (rmsd) and MSE, both in log P units.
Predictions for log kOW from Multiple Calculations for Neutral Compounds Where No Experimental Data Are Availablea
| compound | category | HF | BP86 | COSMO-RS | XlogP3 | KowWin | average |
|---|---|---|---|---|---|---|---|
| (2 | haloalkane | 3.03 | 3.17 | 2.30 | 2.40 | 2.7 (0.4) | |
| 1,2,3,4,5-pentafluoro-6-(2,3,4,5,6-pentafluorophenyl)benzene | haloaromatic | 3.73 | 4.36 | 5.84 | 4.60 | 5.76 | 4.9 (0.9) |
| 5-methylchrysene | PAH | 4.40 | 4.71 | 5.76 | 6.00 | 6.07 | 5.4 (0.7) |
| 7 | PAH | 4.17 | 4.39 | 5.15 | 5.70 | 5.19 | 4.9 (0.6) |
| benzo[ | PAH | 4.39 | 4.69 | 5.83 | 6.40 | 6.11 | 5.5 (0.8) |
| cyclopenta[ | PAH | 4.03 | 4.31 | 5.12 | 5.50 | 5.70 | 4.9 (0.7) |
| indeno[1,2,3- | PAH | 4.72 | 5.12 | 6.22 | 7.00 | 6.70 | 6.0 (0.9) |
| 1,2,3,4,5-pentachloro-6-(2,3,4,6-tetrachlorophenyl)benzene | PCB | 7.45 | 7.76 | 7.72 | 8.20 | 9.56 | 8.1 (0.8) |
| 1,2,3,5-tetrachloro-4-(2,3,5,6-tetrachlorophenyl)benzene | PCB | 6.98 | 7.27 | 7.29 | 7.70 | 8.91 | 7.6 (0.7) |
| 2,2,3,3,4,4,4-heptafluorobutanoic acid | PFC | 1.18 | 0.98 | 2.88 | 2.20 | 2.14 | 1.9 (0.8) |
| (3 | PFC | 1.52 | 1.67 | 2.94 | 2.10 | 1.72 | 2.0 (0.6) |
| 2,3,4,5,6-pentafluorobenzoic acid | PFC | 1.99 | 1.42 | 2.82 | 2.00 | 1.78 | 2.0 (0.5) |
| (4 | PFC | 4.55 | 4.42 | 5.31 | 5.40 | 6.02 | 5.1 (0.6) |
| 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,12,12,12-tricosafluorododecanoic acid | PFC | 4.43 | 5.55 | 7.74 | 7.60 | 7.49 | 6.6 (1.4) |
| 2,2,3,3,4,4,5,5,6,6,7,7,7-tridecafluoroheptanoic acid | PFC | 2.54 | 2.78 | –1.82 | 4.30 | 4.15 | 2.4 (2.3) |
| 1,1,2,2,3,3,4,4,5,5,6,6,6-tridecafluorohexane-1-sulfonic acid | PFC | 0.79 | 1.21 | 5.12 | 3.70 | 3.16 | 2.8 (1.7) |
| 2,2,3,3,4,4,5,5,6,6,6-undecafluorohexanoic acid | PFC | 2.30 | 1.87 | 4.20 | 3.60 | 3.48 | 3.1 (0.9) |
| 1,1,1,2,2,3,3,4,5,5,5-undecafluoro-4-(trifluoromethyl)pentane | PFC | 4.09 | 4.26 | 4.89 | 5.10 | 5.02 | 4.7 (0.5) |
| 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,9-heptadecafluorononanoic acid | PFC | 3.09 | 4.64 | 6.16 | 5.60 | 5.48 | 5.0 (1.1) |
| 2,2,3,3,4,4,5,5,5-nonafluoropentanoic acid | PFC | 1.01 | 1.80 | 3.50 | 2.90 | 2.81 | 2.4 (0.9) |
| 1,1,2,2,3,3,4,4,5,5,5-undecafluoro- | PFC | 8.21 | 8.39 | 10.83 | 11.90 | 10.21 | 9.9 (1.5) |
| 2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,11-henicosafluoroundecanoic acid | PFC | 3.63 | 4.04 | 7.44 | 6.90 | 6.82 | 5.8 (1.6) |
| 2-methylpropyl 4-hydroxybenzoate | PRB | 1.74 | 2.05 | 3.17 | 3.40 | 3.40 | 2.8 (0.8) |
| propan-2-yl 4-hydroxybenzoate | PRB | 1.07 | 1.58 | 2.73 | 2.80 | 2.91 | 2.2 (0.8) |
| octyl 4-hydroxybenzoate | PRB | 4.05 | 3.70 | 5.55 | 5.40 | 5.43 | 4.8 (0.8) |
| pentyl 4-hydroxybenzoate | PRB | 1.91 | 2.55 | 3.88 | 3.80 | 3.96 | 3.2 (0.9) |
| 1- | PHT | 4.36 | 4.64 | 5.30 | 5.30 | 5.41 | 5.0 (0.5) |
| dicyclohexyl benzene-1,2-dicarboxylate | PHT | 4.17 | 5.00 | 5.66 | 5.20 | 6.20 | 5.2 (0.7) |
| bis(7-methyloctyl) benzene-1,2-dicarboxylate | PHT | 7.75 | 8.15 | 9.80 | 9.60 | 9.37 | 8.9 (0.9) |
| diundecyl benzene-1,2-dicarboxylate | PHT | 9.34 | 11.85 | 12.33 | 12.30 | 11.49 | 11.5 (1.2) |
| butyl decyl phthalate | PHT | 5.68 | 6.61 | 8.05 | 8.00 | 7.56 | 7.2 (1.0) |
| bis(5-methylhexyl) benzene-1,2-dicarboxylate | PHT | 5.58 | 6.56 | 7.69 | 7.40 | 7.41 | 6.9 (0.8) |
| bis(4-methylpentyl) benzene-1,2-dicarboxylate | PHT | 4.96 | 5.61 | 6.53 | 6.30 | 6.43 | 6.0 (0.6) |
| bis(11-methyldodecyl) benzene-1,2-dicarboxylate | PHT | 11.77 | 12.61 | 13.90 | 13.30 | 12.9 (0.8) | |
| bis(9-methyldecyl) benzene-1,2-dicarboxylate | PHT | 8.23 | 9.18 | 11.99 | 11.70 | 10.3 (1.7) | |
| bis(2-propylheptyl) benzene-1,2-dicarboxylate | PHT | 8.12 | 9.05 | 10.72 | 9.60 | 10.36 | 9.6 (1.0) |
XlogP3 values taken from PubChem.[99] KowWin values produced using EPI suite.[42] Standard deviations within brackets.
Figure 1Correlation between experimental and calculated log P (residual) for six methods separately for all the classes considered.
Figure 2Comparison between log kHW computed using the COSMO-RS (X-axis) and the quantum chemical method SMD (Y-axis) for the HF and BP86 methods.
Figure 3Comparison between log kOW and log kHW computed using the quantum chemical method SMD-HF and SMD-BP86 as well as COSMO-RS for all compounds. The green lines correspond to log kOW = log kHW, and it is plotted to guide the eye.