Literature DB >> 19325840

A new approach on estimation of solubility and n-octanol/water partition coefficient for organohalogen compounds.

Shuo Gao1, Chenzhong Cao2.   

Abstract

The aqueous solubility (logW) and n-octanol/water partition coefficient (logP(OW)) are important properties for pharmacology, toxicology and medicinal chemistry. Based on an understanding of the dissolution process, the frontier orbital interaction model was suggested in the present paper to describe the solvent-solute interactions of organohalogen compounds and a general three-parameter model was proposed to predict the aqueous solubility and n-octanol/water partition coefficient for the organohalogen compounds containing nonhydrogen-binding interactions. The model has satisfactory prediction accuracy. Furthermore, every item in the model has a very explicit meaning, which should be helpful to understand the structure-solubility relationship and may be provide a new view on estimation of solubility.

Entities:  

Keywords:  HOMO-LUMO interaction; aqueous solubility; n-Octanol/water partition coefficient; organohalogen compounds; quantitative structure-property relationship

Year:  2008        PMID: 19325840      PMCID: PMC2658778          DOI: 10.3390/ijms9060962

Source DB:  PubMed          Journal:  Int J Mol Sci        ISSN: 1422-0067            Impact factor:   6.208


1. Introduction

Aqueous solubility (logW) and n-octanol/water partition coefficient (logPOW) of compounds have long been recognized as the key molecular properties and are widely used in such diverse areas as pharmaceutics, biochemistry, environmental chemistry, toxicology, chemistry and chemical engineering. Drug delivery, transport, and distribution; prediction of environmental fate; and development of analytical methods depend on solubility and partition properties [1, 2]. As a consequence, it is of considerable value to have practical knowledge of the logW and logPOW values for molecules. The measurement of logW or logPOW through the synthesis of a compound and then its subsequent experimental determination is time-consuming and expensive. Hence, there is strong interest in the structure-based prediction of logW or logPOW for rational development of new drugs and for reasonable assessment of the environmental impact of chemicals before they were released into the environment. Not surprisingly, numerous methods for the prediction of aqueous solubility or partition coefficients have been suggested in the literature [3-19]. Fortunately, Jorgensen did a very good review on the prediction methods of logW for organic compounds [20]. Recently, Kühne has made a comparison among those widely used methods and pointed out that “every method has its method-specific application domains” [21]. Thus, new methods for supplementing existing approaches are required. It is well known that organohalogen compounds have been manufactured and used in the chemical industry as solvents, propellants, additives, cooling agents, and insecticides for many years [22]. In addition, these compounds can be formed during combustion processes in waste incineration. Generally, organohalogen compounds, such as polychlorinated biphenyls (PCBs), polybrominated biphenyls (PBBs), polychlorinated benzenes, polybrominated benzenes and polychlorinated naphthalenes (PCNs) and so on, have some extent negative impact on the environment and the ecology. Thus the assessment of the environmental risk of these compounds, which can be roughly done by studying their logW or logPOW, is very important. Recently, Padmanabhan [23], Lü [24], and Zou [25] proposed QSPR models to predict the logPOW of PCBs, and obtained good prediction accuracy. In the present paper, based on an understanding of the processes involved in dissolution, a new and very simple method was suggested to predict the logPOW and logW for some halogen containing organic compounds. The present method has a good prediction accuracy and every term in the presented equation has an explicit physical and chemical meaning.

2. Methodology

The dissolution of a solute in a solvent can conceptually take place in two stages: (i) a sizable hole or cavity has to be formed in the solvent phase to accommodate the solute molecule; (ii) the solute molecule is then inserted into the hole, and then interacts with the solvent molecules around it. After the above two steps, a stable solution is formed [26]. At the first stage of the dissolution, an input energy or enthalpy (Einput) is needed to separate the solvent molecules, i.e., to overcome the solvent-solvent cohesive interactions. This energy is proportional to the size or volume of the solute molecule. The second stage of the dissolution is an exothermic process. The output energy (Eoutput) in this stage for organic compounds having no (or very weak) hydrogen-binding interactions with solvent molecules, is, in our opinion, correlated with the interaction of the frontier orbitals of the solute molecules (FMOsolute) and solvent molecules (FMOsolvent). In other words, ignoring the interaction of the hydrogen-binding interactions resulting from solute and solvent molecules, Eoutput is mainly determined by the interactions between solvent’s HOMO (HOMOsolvent) and solute’s LUMO (LUMOsolute), and between solute’s HOMO (HOMOsolute) and solvent’s LUMO (LUMOsolvent), viz.: According to the above statements, the following equation was proposed to predicted the logW for the organohalogen compounds having no (or very weak) hydrogen-binding interactions with solvent, where, E, E, E, and E are the energies of the HOMOs and LUMOs of solute and solvent molecule, respectively. For a solvent of interest, e.g. water, its frontier orbitals energies are given. The last term in Eq. (2a) is an invariable, thus Eq. (2a) can be rewritten as: Here, a, b, c and d are the coefficients; V, E and E are the volume, the HOMO energy and the LUMO energy of the solute, respectively. The parameter V of a solute can be calculated by additive method, for the details one should consult Ref. [10]; The E and E were calculated by Gaussian 98 program (using Gaussian program packages in SYBYL 6.7 of Tripos, Inc.) at the HF/6-31G(d) level.

3. Regression Analysis

3.1. Aqueous Solubility of PCBs

Taking some experimental logW of PCBs [11] (listed in Table 1) as the training set, we employed Eq. (2b) to carry out a regression analysis and got the following equation: where, R is the correlation coefficient, S is the standard error between the experimental and estimated logW by Eq. (3), n is the number of the sample in the training set. Figure 1 is the plot of the experimental versus the calculated aqueous solubility of PCBs by Eq. (3). From the R and S value of Eq. (3) and Fig. 1, one can see that Eq. (3) has a good correlation.
Table 1.

The aqueous solubility (logW) and n-octanol/water partition coefficient (logPOW) of some organohalogen compounds.

Substitution patternsVEHOMO(a.u.)ELUMO(a.u.)-logWexp.alogPOW exp.b-logWcalc.clogPOW calc.d-ΔlogWeΔlogPOWe
Chlorobiphenyls
2-172.9−0.316840.117734.384.824.55−0.17
3-172.9−0.310440.104905.394.665.154.710.24−0.05
4-172.9−0.306100.105075.334.635.084.670.25−0.04
2,2’-185.8−0.333980.121775.724.725.314.890.41−0.17
2,3-185.8−0.325960.109915.354.995.585.02−0.23−0.03
2,3’-185.8−0.327600.108025.264.845.665.07−0.40−0.23
2,4-185.8−0.322480.105945.565.155.665.06−0.100.09
2,4’-185.8−0.321890.106975.465.095.625.04−0.160.05
2,5-185.8−0.324640.105525.005.705.09−0.09
2,6-185.8−0.333220.116424.935.474.97−0.04
3,3’-185.8−0.320760.094206.455.276.025.250.430.02
3,4-185.8−0.314290.095286.395.235.895.170.500.06
3,4’-185.8−0.316010.094196.405.155.955.210.45−0.06
3,5-185.8−0.320350.093645.406.035.250.15
4,4’-185.8−0.311900.094396.375.235.885.170.490.06
2,2’,3-198.7−0.338980.112096.105.126.075.360.03−0.24
2,2’,4-198.7−0.340120.110416.495.396.145.400.35−0.01
2,2’,5-198.7−0.336460.110286.175.336.095.370.08−0.04
2,2’,6-198.7−0.335580.114385.905.045.945.29−0.04−0.25
2,3,3’-198.7−0.337010.102005.606.375.520.08
2,3,4-198.7−0.329890.099886.185.686.335.49−0.150.19
2,3,4’-198.7−0.330200.100325.805.296.325.49−0.52−0.20
2,3,6-198.7−0.337950.104646.495.446.295.480.20−0.04
2,3’,4-198.7−0.332680.097306.115.546.465.56−0.35−0.02
2,3’,5-198.7−0.330370.098756.145.656.385.52−0.240.13
2,4,4’-198.7−0.336690.097296.225.716.525.59−0.300.12
2,4,5-198.7−0.336410.109535.746.115.380.36
2,4,6-198.7−0.327160.096355.506.415.53−0.03
2,4’,5-198.7−0.328660.095546.185.686.465.56−0.280.12
2,4’,6-198.7−0.339490.102936.165.246.375.52−0.21−0.28
2,3’,4’-198.7−0.329130.095916.215.716.455.55−0.240.16
2,3’,5’-198.7−0.335550.109336.305.716.115.380.190.33
3,3’,5-198.7−0.330390.083695.706.875.77−0.07
3,4,4’-198.7−0.319500.085335.226.665.65−0.43
2,2’,3,3’-211.6−0.344690.104426.835.676.775.800.06−0.13
2,2’,3,4-211.6−0.343470.101807.005.796.845.840.16−0.05
2,2’,3,4’-211.6−0.346020.102886.965.726.845.840.12−0.12
2,2’,3,5’-211.6−0.340730.102786.915.736.765.800.15−0.07
2,2’,3,6-211.6−0.339880.103176.304.846.745.78−0.44−0.94
2,2’,3,6’-211.6−0.341640.107916.304.846.615.72−0.31−0.88
2,2’,4,4’-211.6−0.347560.101357.235.946.915.880.320.06
2,2’,4,5-211.6−0.343280.100176.865.696.895.86−0.03−0.17
2,2’,4,5’-211.6−0.341810.101257.125.876.835.830.290.04
2,2’,4,6-211.6−0.341160.101546.945.756.815.820.13−0.07
2,2’,4,6’-211.6−0.343550.106506.655.516.685.76−0.03−0.25
2,2’,5,5’-211.6−0.341360.101187.005.796.825.830.18−0.04
2,2’,5,6’-211.6−0.337580.106496.655.556.605.710.05−0.16
2,2’,6,6’-211.6−0.340620.111596.205.246.475.65−0.27−0.41
2,3,3’,4-211.6−0.340430.092876.776.107.085.97−0.310.13
2,3,4,4’-211.6−0.333930.091276.866.247.045.94−0.180.30
2,3,4,5-211.6−0.335700.089626.057.125.980.07
2,3,4’,5-211.6−0.336940.089186.776.107.156.00−0.380.10
2,3,4’,6-211.6−0.339800.098217.025.766.905.870.12−0.11
2,3,5,6-211.6−0.342190.093747.255.967.085.970.17−0.01
2,3’,4,4’-211.6−0.335180.089056.635.987.135.99−0.50−0.01
2,3’,4,5-211.6−0.337900.087476.327.226.040.28
2,3’,4,6-211.6−0.341620.096777.266.036.975.910.290.12
2,3’,4’,5-211.6−0.336650.088476.696.227.176.01−0.480.21
2,3’,4’,6-211.6−0.337380.089407.025.767.156.00−0.13−0.24
2,4,4’,5-211.6−0.344860.102276.776.106.845.84−0.070.26
2,4,4’,6-211.6−0.332830.086767.266.037.176.010.090.02
2,3’,4’,5’-211.6−0.341180.096566.715.986.975.91−0.260.07
3,3’,4,4’-211.6−0.326960.077036.217.406.120.09
3,3’,5,5’-211.6−0.340460.074186.107.706.28−0.18
2,2’,3,3’,6-224.5−0.345380.097936.785.607.366.19−0.58−0.59
2,2’,3,4,4’-224.5−0.351090.094257.626.187.566.300.06−0.12
2,2’,3,4,5-224.5−0.347360.091477.876.387.606.310.270.07
2,2’,3,4,5’-224.5−0.344840.094197.666.237.476.250.19−0.02
2,2’,3,4,6-224.5−0.344240.092407.926.507.526.270.400.23
2,2’,3,4,6’-224.5−0.349230.099956.785.607.356.18−0.57−0.58
2,2’,3,4’,5-224.5−0.349060.092757.826.327.586.310.240.01
2,2’,3,4’,6-224.5−0.347800.094057.175.877.526.27−0.35−0.40
2,2’,3,5,5’-224.5−0.347260.096677.826.327.436.220.390.10
2,2’,3,5,6-224.5−0.346780.096347.406.067.436.22−0.03−0.16
2,2’,3,5’,6-224.5−0.343900.092557.195.927.516.27−0.32−0.35
2,2’,3,4’,5’-224.5−0.341570.096737.766.307.346.170.420.13
2,2’,3,4’,6’-224.5−0.342920.101127.406.047.226.110.18−0.07
2,2’,4,4’,5-224.5−0.348640.092677.956.417.586.300.370.11
2,2’,4,4’,6-224.5−0.349080.095107.666.237.506.270.16−0.04
2,2’,4,5,5’-224.5−0.345670.092646.657.546.280.37
2,2’,4,5’,6-224.5−0.342630.095167.476.117.416.210.06−0.10
2,3,3’,4,4’-224.5−0.342510.085617.526.797.726.37−0.200.42
2,3,3’,4,5-224.5−0.344960.082897.686.927.846.44−0.160.48
2,3,3’,4’,6-224.5−0.345920.084727.656.207.806.42−0.15−0.22
2,3,3’,5,6-224.5−0.351280.082377.956.417.956.500.00−0.09
2,3,3’,5’,6-224.5−0.344290.087947.766.457.676.350.090.10
2,3,4,4’,5-224.5−0.348710.092307.506.717.596.31−0.090.40
2,3,4,4’,6-224.5−0.339190.081727.966.447.806.410.160.03
2,3,4’,5,6-224.5−0.345180.084717.886.397.796.410.09−0.02
2,3’,4,4’,5-224.5−0.343600.087717.336.577.676.35−0.340.22
2,3’,4,4’,6-224.5−0.340270.080027.916.407.876.450.04−0.05
2,3’,4,5,5’-224.5−0.341740.080626.307.876.45−0.15
2,3’,4,5’,6-224.5−0.347150.091517.926.427.596.310.330.11
2,3’,4,4’,5’-224.5−0.349730.090687.426.647.666.35−0.240.29
2,2’,3,3’,4,4’-237.4−0.357170.088546.968.216.710.25
2,2’,3,3’,4,5-237.4−0.352840.086208.426.768.226.710.200.05
2,2’,3,3’,4,5’-237.4−0.351950.087197.308.176.690.61
2,2’,3,3’,4,6-237.4−0.349630.087718.486.788.126.660.360.12
2,2’,3,3’,4,6’-237.4−0.350750.092007.656.208.006.60−0.35−0.40
2,2’,3,3’,5,5’-237.4−0.352530.085906.728.226.720.00
2,2’,3,3’,5,6-237.4−0.349160.087887.656.208.116.65−0.46−0.45
2,2’,3,3’,5,6’-237.4−0.348560.090877.826.328.006.60−0.18−0.28
2,2’,3,3’,6,6’-237.4−0.346110.095686.967.816.500.46
2,2’,3,4,4’,5-237.4−0.353840.084948.526.828.276.740.250.08
2,2’,3,4,4’,5’-237.4−0.351730.087048.386.738.176.690.210.04
2,2’,3,4,4’,6’-237.4−0.355600.090508.246.588.126.660.12−0.08
2,2’,3,4,5,5’-237.4−0.348440.084948.426.758.206.700.220.05
2,2’,3,4,5,6’-237.4−0.350210.090528.136.568.046.620.09−0.06
2,2’,3,4,5’,6-237.4−0.345450.086518.016.458.106.65−0.09−0.20
2,2’,3,4’,5,5’-237.4−0.352300.085748.586.858.236.720.350.13
2,2’,3,4’,5’,6-237.4−0.348660.090707.946.418.016.60−0.07−0.19
2,2’,3,5,5’,6-237.4−0.345230.086577.936.428.096.64−0.16−0.22
2,2’,4,4’,5,5’-237.4−0.351720.085618.496.808.226.710.270.09
2,2’,4,4’,5,6’-237.4−0.349810.089248.126.658.076.640.050.01
2,2’,4,4’,6,6’-237.4−0.358850.092958.126.548.096.650.03−0.11
2,3,3’,4,4’,5-237.4−0.347080.076358.317.448.466.83−0.150.61
2,3,3’,4,4’,6-237.4−0.349780.083018.486.788.286.740.200.04
2,3,3’,4’,5,6-237.4−0.349450.083148.486.788.276.740.210.04
2,3,3’,4’,5’,6-237.4−0.354150.087658.276.638.196.700.08−0.07
2,3,3’,5,5’,6-237.4−0.352260.082437.008.336.770.23
2,3’,4,4’,5,5’-237.4−0.346700.072718.217.298.576.89−0.360.40
3,3’,4,4’,5,5’-237.4−0.340850.061198.857.558.867.04−0.010.51
2,2’,3,3’,4,4’,5-250.3−0.357070.080428.907.088.847.110.06−0.03
2,2’,3,3’,4,4’,6-250.3−0.358140.082857.118.777.080.03
2,2’,3,3’,4,5,5’-250.3−0.355370.079309.107.218.857.120.250.09
2,2’,3,3’,4,5,6’-250.3−0.353380.084778.596.858.647.01−0.05−0.16
2,2’,3,3’,4,5’,6-250.3−0.352380.081928.686.928.727.05−0.04−0.13
2,2’,3,3’,4,6,6’-250.3−0.354620.083008.156.558.727.05−0.57−0.50
2,2’,3,3’,4,5’,6’250.3−0.350790.085628.426.738.576.97−0.15−0.24
2,2’,3,3’,5,5’,6-250.3−0.352070.082118.596.858.717.04−0.12−0.19
2,2’,3,3’,5,6,6’-250.3−0.349860.085567.946.418.566.96−0.62−0.55
2,2’,3,4,4’,5,5’-250.3−0.355150.079119.107.218.857.120.250.09
2,2’,3,4,4’,5,6-250.3−0.354180.077948.977.138.887.130.090.00
2,2’,3,4,4’,5,6’-250.3−0.355070.083478.686.928.717.04−0.03−0.12
2,2’,3,4,4’,5’,6-250.3−0.352430.081708.857.048.737.050.12−0.01
2,2’,3,4,5,5’,6-250.3−0.347920.077778.756.998.797.08−0.04−0.09
2,2’,3,4’,5,6,6’-250.3−0.352250.084598.496.788.637.00−0.14−0.22
2,3,3’,4,4’,5,5’-250.3−0.353500.069688.727.729.147.27−0.420.45
2,3,3’,4,4’,5,6-250.3−0.351970.074728.917.088.957.17−0.04−0.09
2,3,3’,4,4’,5’,6-250.3−0.359300.078389.107.218.947.170.160.04
2,3,3’,4,5,5’,6-250.3−0.354850.074079.107.219.017.200.090.01
2,3,3’,4’,5,5’,6-250.3−0.358000.078539.107.218.917.150.190.06
2,2’,3,3’,4,4’,5,5’-263.2−0.360030.073899.707.629.467.510.240.11
2,2’,3,3’,4,4’,5,6-263.2−0.360140.074779.297.359.437.50−0.14−0.15
2,2’,3,3’,4,4’,5,6’-263.2−0.357600.077499.427.439.317.430.110.00
2,2’,3,3’,4,4’,6,6’-263.2−0.358280.081009.107.219.207.38−0.10−0.17
2,2’,3,3’,4,5,5’,6-263.2−0.354700.073759.427.439.397.470.03−0.04
2,2’,3,3’,4,5,5’,6’-263.2−0.356880.077659.107.219.297.42−0.19−0.21
2,2’,3,3’,4,5,6,6’-263.2−0.353090.076869.207.309.267.41−0.06−0.11
OctaCl-
2,2’,3,3’,4,5’,6,6’-263.2−0.354690.081209.297.359.147.350.150.00
2,2’,3,4,4’,5,5’,6-263.2−0.354690.073369.507.499.407.480.100.01
2,2’,3,4,4’,5,6,6’-263.2−0.358220.075899.487.489.377.470.110.01
2,3,3’,4,4’,5,5’,6-263.2−0.361430.075079.707.629.447.510.260.11
2,2’,3,3’,4,4’,5,5’,6-276.1−0.359750.0705910.187.949.937.830.250.11
2,2’,3,3’,4,4’,5,6,6’-276.1−0.360050.0727810.077.889.877.800.200.08
2,2’,3,3’,4,5,5’,6,6’-276.1−0.356840.072918.209.817.770.43
DecaCl-289.0−0.361820.070138.2010.358.120.08
Chloronaphthalenes
1-143.1−0.295160.089504.244.230.01
2-143.1−0.297950.090624.144.24−0.10
1,2-156.0−0.304230.079994.424.74−0.32
1,4-156.0−0.302580.076294.664.79−0.13
1,5-156.0−0.302860.076364.674.79−0.12
1,7-156.0−0.305070.077574.564.79−0.23
1,8-156.0−0.298120.077504.414.73−0.32
2,3-156.0−0.305530.079674.714.75−0.04
2,7-156.0−0.309120.078574.814.800.01
1,3,7-168.9−0.313400.066305.355.310.04
2,3,6-168.9−0.315120.068185.125.30−0.18
1,2,3,4-181.8−0.315690.059845.755.710.04
1,2,3,5-181.8−0.316960.057725.775.750.02
1,3,5,7-181.8−0.319190.054796.195.820.37
1,3,5,8-181.8−0.313550.054425.765.78−0.02
1,4,6,7-181.8−0.318130.055615.815.800.01
Chlorobenzenes
Mono-102.2−0.334660.131952.983.00−0.02
1,2-113.1−0.342310.117873.383.53−0.15
1,3-114.6−0.344360.115983.483.61−0.13
1,4-115.2−0.338300.115603.383.58−0.20
1,2,3-125.3−0.352270.104974.044.08−0.04
1,2,4-128.1−0.346490.102903.984.13−0.15
1,3,5-128.1−0.358430.101314.024.26−0.24
1,2,3,4-141.0−0.352440.092744.554.62−0.07
1,2,3,5-141.0−0.355580.091004.654.67−0.02
1,2,4,5-141.0−0.351470.091044.514.64−0.13
Penta-153.9−0.358090.081395.035.12−0.09
Hexa-166.8−0.364230.072415.475.59−0.12
3,4-Dimethyl-131.6−0.319010.132663.823.460.36
Chlorotoluenes
2-116.9−0.329020.132903.423.240.18
3-118.1−0.329810.131033.283.30−0.02
4-118.3−0.323960.130903.333.260.07
2,4-129.0−0.335160.117254.243.810.43
2,6-126.9−0.340550.118044.293.800.49
2,3-diCl-p-cymene190.0−0.329960.122065.504.940.56
2,5-diCl-p-cymene190.0−0.327140.119945.604.950.65
2,3,6-triCl-p-cymene202.9−0.336720.108716.205.490.71
TetraCl-p-cymene215.8−0.343330.096156.806.020.78
Bromobenzenes
Mono105.5−0.330820.129523.023.08−0.06
1,2-121.6−0.336840.114533.643.72−0.08
1,3-121.6−0.338980.112803.753.76−0.01
1,4-121.6−0.332320.111433.793.730.06
1,3,5-137.7−0.351590.098774.514.440.07
1,2,4,5-153.8−0.342760.086395.134.910.22
Hexa-186.0−0.351760.061845.736.06−0.33
Bromotoluenes
2-120.2−0.326320.130183.423.330.09
3-121.3−0.326840.129983.283.36−0.08
4-122.5−0.320960.129623.333.34−0.01
Bromochlorobenzenes
2-116.8−0.339030.115933.833.610.22
4-116.8−0.334940.113463.833.620.21

Taken from Ref. [11];

Taken from Ref. [10] and Ref. [23];

Calculated by Eq. (3);

Calculated by Eq. (8);

ΔlogW=logWexp. − logWcalc.; ΔlogPOW= logPOW exp. − logPOW calc.

Figure 1.

The plot of experimental aqueous solubility vs. the ones calculated by Eq. (3)

The characteristics and interrelations of descriptors in Eq. (3) are given in Tables 2 and 3, respectively, which suggested that the three descriptors (V, E, and E) are significant descriptors and not strongly correlated with each other. According to the t-test values (in Table 2), the more significant descriptor appearing in Eq. (3) is the descriptor V, which indicated that the volume of PCB molecules is the predominant factor determining the PCB’s aqueous solubility. The t-score value of parameter E implied that the interaction of LUMO of PCB molecule with the HOMO of water is also play a very importance role in the determination of the PCB’s aqueous solubility.
Table 2.

The characteristics of descriptors in Eq. (3)

DescriptorVEHOMOELUMO
S0.00333.23084.2833
t-score9.0395−3.4612−7.6414
Significance0.00000.00070.0000
Table 3.

Interrelations of descriptors in Eq. (3)

VEHOMOELUMO
V1
EHOMO−0.59811
ELUMO−0.57040.20041

3.2. n-Octanol/Water Partition Coefficient of PCBs

Hantsch et al. [27] have indicated that there exists a linear relationship between the aqueous solubility (logW) and the n-octanol/water partition coefficient (logPOW) of a solute. As Eq. (2b) can express well the relationship of the structure-aqueous solubility for PCB congeners, it was also expected to be able to predict the n-octanol/water partition coefficient. Thus, taking some experimental logPOW of PCBs [10, 23] (see Table 1) as the training set, we employed Eq. (2b) to carry out the regression analysis and got the following equation: Equation (4) has a high correlation coefficient R and small standard deviation S for predicting the n-octanol/water partition coefficient of PCB congeners.

4. Discussion

A closer analysis of the coefficients in front of the parameters in Eq. (3) can provide physical insights to understand structure-solubility relationship. The negative coefficient of the parameter V implied that the PCB molecule with a larger volume has a smaller logW value than that of the smaller PCB. That is to say, the larger PCB has lower solubility in water than that of the smaller PCB, because a larger hole has to be carved in the water layer for accepting the larger PCB molecule, which needs a larger energy input. The positive coefficient of E means that the higher the HOMO of the PCB molecule, the larger the logW value of the PCB is. In our opinion, the higher energy of HOMO of the solute can interact with the LUMO of water more effectively, which is more energetically favorable for the formation of the solution. Thus the higher the HOMO of the PCB molecule, the more soluble it is. The higher the energy of LUMO of the solute molecule, the more effectively the LUMO of the solute molecule interact with the HOMO of water. Thus, the solubility of PCBs increases with the increase of the E values. In order to test the robustness and prediction ability of Eq. (3), a cross-validation analysis was performed. In the cross-validation analysis, a model is calculated with groups of objects (i.e., PCB congeners) omitted subsequently, followed by the prediction of the logW for the omitted objects. In the present study, Leave-One-Out (LOO) cross-validation method is employed. The internal predicted ability and the robustness of the models are characterized in terms of the corresponding leave-one-out cross-validation correlation coefficient (RCV) and the cross-validation predicted standard error (SCV), which are defined as: where y and ŷ are the experimental and predicted value, respectively. ȳ is the mean value of y. where N is the number of samples used for model building, M is the number of descriptors. The RCV and SCV of Eq. (3) showed that Eq. (3) is robust with only 0.27 log unit for the prediction error of PCBs’ logW. The obtained parameters RCV and SCV also show that Eq. (4) is robust. Recently, Padmanabhan et al. [23], and Lü et al. [24] developed the QSPR models for estimating logPOW of 133 PCB congeners with prediction errors of 0.225 and 0.205 log units, respectively. In order to compare with Padmanabhan’s and Lü’s work, we employed the same data set used in their work and employed Eq.(2b) to perform the correlation analysis, namely: The results of Eq. (4) and Eq. (7) showed that the predicted accuracy of the present model is better than that of Padmanabhan’s QSPR model and is comparable to that of Lü’s model. Examination of Eq. (4) or Eq. (7) may lead to the following significant interpretations: the value of logPOW increases with the increase of V, which means that increase in solute size, V, favors wet octanol phase. The reason is that water molecule is more polar than the n-octanol molecule, so the cohesive energy is larger between water molecules than that between the n-octanol ones. Thus, more energy input is needed to create a similarly-sized hole in the more polar solvent (i.e., water phase) than that in the less polar solvent (i.e., n-octanol phase). Consequently, the PCB molecule tends to enter into the n-ocatanol phase, which is energetically more favorable. Increase the E or E solute favors the aqueous layer. It is relatively easy to build a QSPR model for the congeners, while it is somewhat difficult to correlate a data set of heterogeneous compounds. Besides having a high correlation and low deviation, a valuable QSPR model should also have a large application range. Thus, in order to verify the application of Eq. (2) in more complex data sets, we combined the logPOW of 157 PCBs and some other halogen substituted aromatic compounds [10, 11] (including PBBs, PCNs, and HBs, listed in Table 1) as a data set, and used Eq. (2b) to perform a correlation analysis, the following correlation equation was obtained: Figure 2 is the plot of experimental logPOW versus the calculated ones by Eq. (8), which shows that Eq. (8) has a good correlation and prediction ability. The standard deviation of the correlation equation is only 0.247 log units, which is within the experimental uncertainties. The result of Eq. (8) showed that the model (i.e., Eq. (2b)) can be employed to predict logW or logPOW for a wide range compounds with various structures.
Figure 2.

The plot of experimental n-octanol/water partition coefficient vs. the ones calculated by Eq. (8).

It should be noted that some excellent software (such as ACD/LogP, CLogP [28], and so on) have been developed to compute logPOW. In order to compare the presented results with the data calculated by these softwares, some leading compounds were selected (see Table 4) and their logPOW were computed by Eq. (8) and by CLogP software (using CLogP packages in SYBYL 6.7 of Tripos, Inc.), respectively. For these compounds in Table 4, the average absolute deviation is 0.45 log units between the experimental logPOW exp. and the logPOW CLogP calculated by CLogP software. While, for the same compounds in Table 4, the average absolute deviation is only 0.14 log units between the logPOW exp. and the logPOW calc. predicted by Eq. (8). Seen from the average absolute deviation, the precision of present method is a little better than that of CLogP software.
Table 4.

The results of logPOW calculation by the presented method and the ClogP software for a few leading organohalogen compounds.

Substitution patternslogPOW exp.alogPOW calc.blogPOW CLogPcΔlogPOWdΔlogPOWe
Chlorobiphenyls
2-4.384.554.49−0.17−0.11
4-4.634.674.74−0.04−0.11
2,2’-4.724.894.96−0.17−0.24
2,3-4.995.025.09−0.03−0.10
2,6-4.934.974.96−0.04−0.03
4,4’-5.235.175.460.06−0.23
2,2’,4-5.395.405.67−0.01−0.28
2,2’,5-5.335.375.67−0.04−0.34
2,3,4-5.685.495.680.190.00
2,3,4’-5.295.495.80−0.20−0.51
2,4,6-5.505.535.67−0.03−0.17
2,2’,3,3’-5.675.806.14−0.13−0.47
2,2’,3,5’-5.735.806.26−0.07−0.53
2,2’,4,4’-5.945.886.380.06−0.44
2,3,4,5-6.055.986.390.07−0.34
2,3,5,6-5.965.976.26−0.01−0.30
3,3’,4,4’-6.216.126.640.09−0.43
2,2’,3,3’,6-5.606.196.60−0.59−1.00
2,2’,3,4,4’-6.186.306.85−0.12−0.67
2,2’,3,5,5’-6.326.226.970.10−0.65
2,3,4,4’,5-6.716.317.100.40−0.39
2,3,4,4’,6-6.446.416.970.03−0.53
2,2’,3,4,4’,5-6.826.747.570.08−0.75
2,3,3’,4,4’,5-7.446.837.700.61−0.26
2,3,3’,4,4’,6-6.786.747.570.04−0.79
2,2’,3,3’,4,5,5’-7.217.128.160.09−0.95
2,2’,3,4,4’,5,6-7.137.138.150.00−1.02
2,2’,3,4,5,5’,6-6.997.088.15−0.09−1.16
2,3,3’,4,4’,5,5’-7.727.278.290.45−0.57
2,2’,3,3’,4,4’,5,5’-7.627.518.750.11−1.13
2,2’,3,3’,4,4’,5,6-7.357.508.62−0.15−1.27
2,2’,3,3’,4,5’,6,6’-7.357.358.490.00−1.14
2,2’,3,3’,4,4’,5,5’,6-7.947.839.340.11−1.40
2,2’,3,3’,4,4’,5,6,6’-7.887.809.210.08−1.33
Deca-8.208.129.920.08−1.72
Chloronapthalenes
2-4.144.244.03−0.100.11
1,4-4.664.794.74−0.13−0.08
1,7-4.564.794.74−0.23−0.18
2,3-4.714.754.62−0.040.09
2,3,6-5.125.305.34−0.18−0.22
1,2,3,5-5.775.755.930.02−0.16
1,4,6,7-5.815.806.050.01−0.24
Chlorobenzenes
1,2-3.383.533.45−0.15−0.07
1,2,3-4.044.084.04−0.040.00
1,3,5-4.024.264.28−0.24−0.26
1,2,3,4-4.554.624.63−0.07−0.08
Penta-5.035.125.35−0.09−0.32
Hexa-5.475.596.06−0.12−0.59
3,4-Dimethyl-3.823.463.800.360.02
Chlorotoluenes
2-3.423.243.350.180.07
2,6-4.293.804.070.490.22
2,5-diCl-p-cymene5.604.955.490.650.11
Bromobenzenes
Mono-3.023.083.01−0.060.01
1,3-3.753.763.87−0.01−0.12
Hexa-5.736.066.72−0.33−0.99
4-Cl-3.833.623.720.210.11
Bromotoluenes
4-3.333.343.50−0.01−0.17

Taken from Ref. [10] and Ref. [23];

Calculated by Eq. (8);

Calculated by CLogP software;

logW=logWexp. − logWcalc.;

ΔlogPOW=logPOW exp. − logPOW CLogP

5. Conclusions

Based on the comprehension of the dissolution process, a very simple three-parameter model was proposed to predict the aqueous solubility and n-octanol/water partition coefficients for organohalogen compounds containing nonhydrogen-binding interactions. The model has satisfactory prediction accuracy. Furthermore, every item in the model has a very explicit meaning, which would be helpful to understand the structure-solubility relationships.
  10 in total

1.  UPPER III: unified physical property estimation relationships. Application to non-hydrogen bonding aromatic compounds.

Authors:  N Jain; S H Yalkowsky
Journal:  J Pharm Sci       Date:  1999-09       Impact factor: 3.534

2.  The n-octanol and n-hexane/water partition coefficient of environmentally relevant chemicals predicted from the mobile order and disorder (MOD) thermodynamics.

Authors:  P Ruelle
Journal:  Chemosphere       Date:  2000-03       Impact factor: 7.086

Review 3.  Prediction of drug solubility from structure.

Authors:  William L Jorgensen; Erin M Duffy
Journal:  Adv Drug Deliv Rev       Date:  2002-03-31       Impact factor: 15.470

4.  A new method for the estimation of the aqueous solubility of organic compounds.

Authors:  N Bodor; M J Huang
Journal:  J Pharm Sci       Date:  1992-09       Impact factor: 3.534

5.  Model selection based on structural similarity-method description and application to water solubility prediction.

Authors:  Ralph Kühne; Ralf-Uwe Ebert; Gerrit Schüürmann
Journal:  J Chem Inf Model       Date:  2006 Mar-Apr       Impact factor: 4.956

6.  Prediction of partition coefficient based on atom-type electrotopological state indices.

Authors:  J J Huuskonen; A E Villa; I V Tetko
Journal:  J Pharm Sci       Date:  1999-02       Impact factor: 3.534

7.  QSPR models for polychlorinated biphenyls: n-Octanol/water partition coefficient.

Authors:  J Padmanabhan; R Parthasarathi; V Subramanian; P K Chattaraj
Journal:  Bioorg Med Chem       Date:  2005-10-07       Impact factor: 3.641

8.  Molecular hologram derived quantitative structure-property relationships to predict physico-chemical properties of polychlorinated biphenyls.

Authors:  Xiaodong Wang; Songlin Tang; Shushen Liu; Shihai Cui; Liansheng Wang
Journal:  Chemosphere       Date:  2003-05       Impact factor: 7.086

9.  Application of TLSER method in predicting the aqueous solubility and n-octanol/water partition coefficient of PCBs, PCDDs and PCDFs.

Authors:  Jun Huang; Gang Yu; Zu-lin Zhang; Yi-lei Wang; Wei-hua Zhu; Guo-shi Wu
Journal:  J Environ Sci (China)       Date:  2004       Impact factor: 5.565

10.  QSPR prediction of n-octanol/water partition coefficient for polychlorinated biphenyls.

Authors:  Wenjuan Lü; Yonglei Chen; Mancang Liu; Xingguo Chen; Zhide Hu
Journal:  Chemosphere       Date:  2007-06-12       Impact factor: 7.086

  10 in total
  1 in total

1.  Biological Evaluation of 3-Benzylidenechromanones and Their Spiropyrazolines-Based Analogues.

Authors:  Angelika A Adamus-Grabicka; Magdalena Markowicz-Piasecka; Marcin Cieślak; Karolina Królewska-Golińska; Paweł Hikisz; Joachim Kusz; Magdalena Małecka; Elzbieta Budzisz
Journal:  Molecules       Date:  2020-04-01       Impact factor: 4.411

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.