| Literature DB >> 20706622 |
Omar Deeb1, Padmakar V Khadikar, Mohammad Goodarzi.
Abstract
The terms bioaccumulation and bioconcentration refer to the uptake and build-up of chemicals that can occur in living organisms. Experimental measurement of bioconcentration is time-consuming and expensive, and is not feasible for a large number of chemicals of potential regulatory concern. A highly effective tool depending on a quantitative structure-property relationship (QSPR) can be utilized to describe the tendency of chemical concentration organisms represented by, the important ecotoxicological parameter, the logarithm of Bio Concentration Factor (log BCF) with molecular descriptors for a large set of non-ionic organic compounds. QSPR models were developed using multiple linear regression, partial least squares and neural networks analyses. Linear and non-linear QSPR models to predict log BCF of the compounds developed for the relevant descriptors. The results obtained offer good regression models having good prediction ability. The descriptors used in these models depend on the volume, connectivity, molar refractivity, surface tension and the presence of atoms accepting H-bonds.Entities:
Keywords: BCF; non-ionic organic compounds; partial least square (PLS); principal components artificial neural networks (PC-ANN); structure property relationships (QSPR)
Year: 2010 PMID: 20706622 PMCID: PMC2918358 DOI: 10.4137/ehi.s5168
Source DB: PubMed Journal: Environ Health Insights ISSN: 1178-6302
Non-ionic organic compounds used in this study and their experimental log BCF values.
| 1 | 1,2-Dichloroethane | 0.30 |
| 2 | Trichloromethane | 0.78 |
| 3 | 1,1,2,2-Tetrachloroethane | 0.90 |
| 4 | Trichloroetheylene | 1.59 |
| 5 | 1,1,1-Trichloroethane | 0.95 |
| 6 | Tetrachloroetyelene | 1.74 |
| 7 | Tetrachloromethane | 1.48 |
| 8 | Pentachloroethane | 1.83 |
| 9 | Hexachloroethane | 2.92 |
| 10 | 1,1,2,3,4,4-Hexachloro-1, 3-Butadiene | 3.83 |
| 11 | Benzene | 0.64 |
| 12 | Toulene | 1.12 |
| 13 | Styrene | 1.13 |
| 14 | Ethylebenzene | 1.19 |
| 15 | o-Xylene | 1.24 |
| 16 | m-Xylene | 1.27 |
| 17 | p-Xylene | 1.27 |
| 18 | p-Methyle styrene | 1.50 |
| 19 | m-Methyle styrene | 1.55 |
| 20 | Isopropylebenzene | 1.55 |
| 21 | 2-Phenyledodecane | 2.65 |
| 22 | Octachlorostyrene | 4.52 |
| 23 | Napthalene | 1.64 |
| 24 | Acenaphtylene | 2.58 |
| 25 | Acenaphtalene | 2.59 |
| 26 | Biphenyle | 2.64 |
| 27 | Anthracene | 2.83 |
| 28 | 2-Methylenapthalene | 3.20 |
| 29 | Fluorene | 3.23 |
| 30 | Phenanthrene | 3.42 |
| 31 | Benzo[a]pyrene | 3.42 |
| 32 | Pyrene | 3.43 |
| 33 | 2-Methylephenanthrene | 3.48 |
| 34 | 2-Chlorophenanthrene | 3.63 |
| 35 | 9-Methylanthracene | 3.66 |
| 36 | Benzo[a]anthracene | 4.00 |
| 37 | Chlorobenzene | 1.85 |
| 38 | 1,2-Dichlorobenzene | 2.48 |
| 39 | 1,4-Dichlorobenzene | 2.52 |
| 40 | 1,3-Dichlorobenzene | 2.65 |
| 41 | 1,2,3-Trichlorobenzene | 3.11 |
| 42 | 1,2,4-Trichlorobenzene | 3.26 |
| 43 | 1,2,3,5-Tetrachlorobenzene | 3.36 |
| 44 | 1,3,5-Trichlorobenzene | 3.38 |
| 45 | 1,2,4,5-Tetrachlorobenzene | 3.76 |
| 46 | 1,2,3,4-Tetrachlorobenzene | 3.77 |
| 47 | Pentachlorobenzene | 3.86 |
| 48 | 2,4,5-Trichlorotoulene | 3.87 |
| 49 | Hexachlorobenzene | 4.26 |
| 50 | Bromobenzene | 1.70 |
| 51 | 1,3-Dibromobenzene | 2.80 |
| 52 | 1,2,3-Tribromobenzene | 2.83 |
| 53 | Hexabromobenzene | 3.04 |
| 54 | 1,2-Dibromobenzene | 3.10 |
| 55 | 1,2,4-Dibromobenzene | 3.66 |
| 56 | 1,2,4,5-Tetrabromobenzene | 3.79 |
| 57 | 1,3,5-Tribromobenzene | 3.85 |
| 58 | Octachloronapthalene | 3.44 |
| 59 | 1,4-Dichloronapthalene | 3.56 |
| 60 | 2-Monochloronapthalene | 3.63 |
| 61 | 1,8-Dichloronapthalene | 3.79 |
| 62 | 2,3-Dichloronapthalene | 4.04 |
| 63 | 2,7-Dichloronapthalene | 4.04 |
| 64 | 1,2,3,4-Tetrachloronapthalene | 4.10 |
| 65 | 1,3,5,8-Tetrachloronapthalene | 4.40 |
| 66 | 1,3,7-Trichloronapthalene | 4.43 |
| 67 | 1,3,5,7-Tetrachloronapthalene | 4.53 |
| 68 | 4-Chlorobipheyl | 2.69 |
| 69 | 2,2′-Dichlorobiphenyl | 3.26 |
| 70 | 4,4′-Dichlorobiphenyl | 3.28 |
| 71 | 2,2′,4,4′,6-Pentachlorobiphenyl | 3.37 |
| 72 | 2,4′-Dichlorobiphenyl | 3.55 |
| 73 | 2,4′,5-Trichlorobiphenyl | 3.75 |
| 74 | 3,5-Dichlorobiphenyl | 3.78 |
| 75 | 2,2′,6,6′-Tetrachlorobiphenyl | 3.85 |
| 76 | 3,3′,4,4′-Tetrachlorobiphenyl | 3.90 |
| 77 | 2,2′,4,4′-Tetrachlorobiphenyl | 4.02 |
| 78 | 2,4,5-Trichlorobiphenyl | 4.02 |
| 79 | 2,2′,3,3′,4,4′,5,5′,6,6′-Decachlorobiphenyl | 4.02 |
| 80 | 2,5-Dichlorobiphenyl | 4.20 |
| 81 | 2,2′,3,3′-Tetrachlorobiphenyl | 4.23 |
| 82 | 2,3-Dichlorobiphenyl | 4.25 |
| 83 | 2,2′,5-Trichlorobiphenyl | 4.27 |
| 84 | 2,4,4′-Trichlorobiphenyl | 4.63 |
| 85 | 2,3′,4′,5-Tetrachlorobiphenyl | 4.77 |
| 86 | 2,2′,4,4′,5,5′-Hexachlorobiphenyl | 4.83 |
| 87 | 2,2′,3,5′-Tetrachlorobiphenyl | 4.84 |
| 88 | 2,2′,4,5′-Tetrachlorobiphenyl | 4.84 |
| 89 | 2,2′,5,5′-Tetrachlorobiphenyl | 4.87 |
| 90 | 2,2′,4,4′,6,6′-Hexachlorobiphenyl | 4.93 |
| 91 | 2,2′,4,5-Tetrachlorobiphenyl | 5.00 |
| 92 | 2,2′,3,3′,4,4′,5,5′-Octachlorobiphenyl | 5.08 |
| 93 | 2,2′,3,4,5′-Pentachlorobiphenyl | 5.38 |
| 94 | 2,2′,4,5,5′-Pentachlorobiphenyl | 5.40 |
| 95 | 2,2′,3′,4,5-Pentaclorobiphenyl | 5.43 |
| 96 | 2,2′,3,3′,6,6′-Hexachlorobiphenyl | 5.43 |
| 97 | 2,2′,3′,5,5′,6′-Hexachlorobiphenyl | 5.54 |
| 98 | 2,2′,3,3′,4,4′,5,5′,6,-Nonachlorobiphenyl | 5.71 |
| 99 | 2,2′,3,3′,4,4′-Hexachlorobiphenyl | 5.77 |
| 100 | 3,3′,4,4′,5′-Pentachlorobiphenyl | 5.81 |
| 101 | 2,2′,3,4,5,5′-Hexachlorobiphenyl | 5.81 |
| 102 | 2,2′,3,3′,5,5,6,6′-Octachlorobiphenyl | 5.82 |
| 103 | 2,2′,3,4,4′,5,6′-Heptachlorobiphenyl | 5.84 |
| 104 | 2,2′,3,4,4′,5′-Hexachlorobiphenyl | 5.88 |
| 105 | 2,2′,3,3′,4′,5,5′,6-Octachlorobiphenyl | 5.88 |
| 106 | 2,2′,3,3′,4,4′,5,6-Octachlorobiphenyl | 5.92 |
| 107 | 2,2′,3,4,5,5′,6′-Heptachlorobiphenyl | 5.93 |
| 108 | 3,3′,4,4′,5,5′-Hexachlorobiphenyl | 5.97 |
| 109 | 2,4,6-Tribromobiphenyl | 3.93 |
| 110 | 2,2′,4,4′,6,6′-Hexabromobiphenyl | 3.96 |
| 111 | 4,4′-Dibromobiphenyl | 4.19 |
| 112 | 2,2′,5,5′-Tetrabromobiphenyl | 4.80 |
| 113 | 2,7-Dichlorodobenzo-p-dioxin | 2.13 |
| 114 | 1,2,4-Trichlorodibenzo-p-dioxine | 2.36 |
| 115 | 1,2,3,4-Tetrachlorodibenzo-p-dioxine | 2.55 |
| 116 | Octachlorodibenzo-p-dioxine | 2.76 |
| 117 | 2,8-Dichlorodibenzo-p-dioxine | 2.82 |
| 118 | 1,2,3,4,6,7,8-Heptachlorodibenzo-p-dioxine | 3.16 |
| 119 | 1,2,3,4,7-Pentachlorodibenzo-p-dioxine | 3.21 |
| 120 | 1,2,3,7-Tetrachlorodibenzo-p-dioxine | 3.24 |
| 121 | 1,3,6,8-Tetrachlorodibenzo-p-dioxine | 3.36 |
| 122 | 1,2,3,4,7,8-Hexachlorodibenzo-p-dioxine | 3.54 |
| 123 | Dibenzo(1,4)dioxine | 3.85 |
| 124 | 2,3,7,8-Tetrachlorodibenzo-p-dioxine | 4.06 |
| 125 | 1,2,3,7,8-Pentachlorodibenzo-p-dioxine | 4.50 |
| 126 | Benzo[b]furan | 2.56 |
| 127 | Octachlorodibenzofuran | 2.94 |
| 128 | Dibenzofuran | 3.34 |
| 129 | 2,3,7,8-Tetrachlorodibenzofuran | 3.53 |
| 130 | 1,2,3,4,6,7,8-Heptachlorodibenzofuran | 3.62 |
| 131 | 2,3,4,7,8-Pentachlorodibenzofuran | 4.03 |
| 132 | 4-Cyanophenol | 0.91 |
| 133 | 2-Methylphenol | 1.03 |
| 134 | Phenol | 1.24 |
| 135 | 3-Chlorophenol | 1.25 |
| 136 | 2,4-Dichlorophenol | 1.50 |
| 137 | 4-Bromophenol | 1.56 |
| 138 | p-sec-Butylphenol | 1.57 |
| 139 | Hydroquinone | 1.60 |
| 140 | 2,6-Dibromo-4-cyanophenol | 1.67 |
| 141 | 4,6-Dichloroguaiacol | 1.74 |
| 142 | 4-t-Butyylphenol | 1.86 |
| 143 | 4,5,6-Trichloroguaiacol | 1.97 |
| 144 | 4,5-Dichloroguaiacol | 2.03 |
| 145 | 2,3,5,6-Tetrachlorophenol | 2.15 |
| 146 | 2,4-Dimethylphenol | 2.18 |
| 147 | 2-Chlorophenol | 2.33 |
| 148 | 3,4,5-Trichloroguaiacol | 2.41 |
| 149 | 2,4,6-Trichlorophenol | 2.43 |
| 150 | p-Nonyl phenol | 2.45 |
| 151 | Tetrachloroguaiacol | 2.71 |
| 152 | 2,4,6-Tribromophenol | 2.71 |
| 153 | Pentachlorophenol | 2.74 |
| 154 | p-Dodecyl phenol | 3.78 |
| 155 | 4-Chloroaniline | 0.23 |
| 156 | 3-Chloroaniline | 0.34 |
| 157 | Aniline | 0.41 |
| 158 | 2-Chloroaniline | 0.57 |
| 159 | Diphenylamine | 1.48 |
| 160 | 3,4-Dichloroaniline | 1.48 |
| 161 | 2,4-Dichloroaniline | 1.98 |
| 162 | N-Phenyl-2-napthylamine | 2.17 |
| 163 | 2,3,4-Trichloroaniline | 2.31 |
| 164 | 2,4,5-Trichloroanaline | 2.61 |
| 165 | 2,3,4,5-Tetrachloroaniline | 2.69 |
| 166 | 3,4,5-Trichloroaniline | 2.70 |
| 167 | 2,4,6-Trichloroaniline | 2.73 |
| 168 | 3,3′-Dichlorobenzidine | 2.79 |
| 169 | 2,3,5,6-Tetrachloroaniline | 3.03 |
| 170 | Pentachloroaniline | 3.17 |
| 171 | Ethyl acetate | 1.48 |
| 172 | Dimethyl phthalate | 1.76 |
| 173 | Diethyl phthalate | 2.07 |
| 174 | Bis(2-ethylehexyl)phalate | 2.34 |
| 175 | Deltamethrin | 2.66 |
| 176 | Fenalerate | 2.79 |
| 177 | Benzyl butyl phatalate | 2.89 |
| 178 | Cypermethrine | 2.91 |
| 179 | 2-t-Butoxy ethanol | −0.22 |
| 180 | t-Butyl methyl ether | 0.18 |
| 181 | t-Butyl isopropylether | 0.76 |
| 182 | Bis(2-chloroethyl)ether | 1.04 |
| 183 | 2,4,6-Trichloroanisole | 2.94 |
| 184 | 2,4,6-Tribromoanisole | 2.94 |
| 185 | Methoxychlore | 3.10 |
| 186 | 2,4,5-Trichlorodiphenyle ether | 4.18 |
| 187 | 3,3′,4,4′-Tetrachlorodiphenyl ether | 4.51 |
| 188 | 2-Methyl-4,6-dinitrophenol | 0.16 |
| 189 | 4-Nitroaniline | 0.64 |
| 190 | 2-Nitroaniline | 0.91 |
| 191 | 3-Nitroaniline | 0.92 |
| 192 | 3-Nitrophenol | 1.40 |
| 193 | 2-Nitrophenol | 1.60 |
| 194 | 2,4,5-Trichloronitrobenzene | 1.84 |
| 195 | 3-Chloronitrobenzene | 1.89 |
| 196 | 2,3,4,5-Tetrachloronitrobenzene | 1.89 |
| 197 | 4-Chloronitrobenzene | 2.00 |
| 198 | 2,5-Dichloronitrobenzene | 2.05 |
| 199 | 2,4-Dichloronitrobenzen | 2.07 |
| 200 | 3,4-Dichloronitrobenzene | 2.07 |
| 201 | 2-Chloronitrobenzene | 2.10 |
| 202 | 2,3-Dichloronitrobenzene | 2.16 |
| 203 | 2,3,4-Trichloronitrobenzene | 2.20 |
| 204 | 3,5-Dichloronitrobenzene | 2.23 |
| 205 | Pentachloronitrobenzene | 2.40 |
| 206 | 2,4,6-Trichloronitrobenzene | 2.88 |
| 207 | Chloronitrofen | 3.04 |
| 208 | 2,3,5,6-Tetrachloronitrobenzene | 3.20 |
| 209 | Phenthoate | 1.56 |
| 210 | Fenthion | 2.68 |
| 211 | EPN | 3.05 |
| 212 | Leptophos | 3.78 |
| 213 | Carbaryl | 1.22 |
| 214 | Molinate | 1.41 |
| 215 | BPMC | 1.41 |
| 216 | Acrylonitrile | 1.68 |
| 217 | Thiobencarb | 2.03 |
| 218 | Acridine | 2.61 |
| 219 | Lindane | 2.84 |
| 220 | B-HCH | 2.86 |
| 221 | α-HCH | 2.95 |
| 222 | Hexachlorocyclopentadiene | 3.09 |
| 223 | Xanthene | 3.62 |
| 224 | Dieldrine | 3.71 |
| 225 | Heptachlore | 4.14 |
| 226 | o-p′-DDT | 4.57 |
| 227 | Chlordane | 4.58 |
| 228 | p,p-DDE | 4.71 |
| 229 | p,p′-DDT | 4.84 |
Notes:
Compounds were considered as outliers;
Compound classified in the test set.
Correlation coefficient for MLR, PLS and ANN models 3–15 and cross validation parameters obtained from PLS and ANN analysis.
| 3 | V1MD,deg, nHAcc, MATS2m | 0.895 | 0.619 | 3 | 0.906 | 0.820 | 0.876 | 0.851 | 3 | 0.899 | 0.764 | 0.878 | 0.192 |
| 4 | V1MD,deg, nHAcc, MATS2m, GATS2e | 0.906 | 0.589 | 4 | 0.915 | 0.837 | 0.877 | 0.847 | 4 | 0.913 | 0.800 | 0.899 | 0.174 |
| 5 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv | 0.910 | 0.5800 | 4 | 0.891 | 0.793 | 0.818 | 1.026 | 4 | 0.914 | 0.804 | 0.887 | 0.184 |
| 6 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv, ST | 0.912 | 0.574 | 5 | 0.898 | 0.806 | 0.822 | 1.023 | 5 | 0.917 | 0.810 | 0.907 | 0.168 |
| 7 | V1MD,deg, nHAcc, MATS2m, GATS2e,2Xv, ST, JhetZ | 0.914 | 0.568 | 5 | 0.902 | 0.813 | 0.810 | 1.052 | 4 | 0.918 | 0.814 | 0.892 | 0.180 |
| 8 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv, ST, JhetZ, Jhetv | 0.916 | 0.565 | 5 | 0.902 | 0.813 | 0.808 | 1.060 | 6 | 0.919 | 0.815 | 0.895 | 0.177 |
| 9 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv, ST, JhetZ, Jhetv, MR | 0.917 | 0.561 | 5 | 0.900 | 0.810 | 0.822 | 1.015 | 5 | 0.921 | 0.821 | 0.912 | 0.163 |
| 10 | V1MD,deg, nHAcc, MATS2m, GATS2e, X2v, ST, JhetZ, Jhetv, MR, pl | 0.918 | 0.559 | 4 | 0.886 | 0.784 | 0.804 | 1.052 | 5 | 0.921 | 0.822 | 0.900 | 0.174 |
| 11 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv, ST, JhetZ, Jhetv, MR, pl, 0Xv | 0.919 | 0.557 | 4 | 0.887 | 0.786 | 0.810 | 1.031 | 4 | 0.921 | 0.822 | 0.892 | 0.180 |
| 12 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv, ST, JhetZ, Jhetv, MR, pl, 0Xv, H6p | 0.920 | 0.556 | 5 | 0.896 | 0.802 | 0.813 | 1.028 | 7 | 0.922 | 0.824 | 0.902 | 0.172 |
| 13 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv, ST, JhetZ, Jhetv, MR, pl, 0Xv, H6p, BAC | 0.921 | 0.555 | 5 | 0.896 | 0.802 | 0.815 | 1.032 | 8 | 0.923 | 0.827 | 0.888 | 0.184 |
| 14 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv, ST, JhetZ, Jhetv, MR, pl, 0Xv, H6p, BAC, J | 0.922 | 0.552 | 5 | 0.903 | 0.814 | 0.848 | 0.931 | 6 | 0.924 | 0.829 | 0.884 | 0.187 |
| 15 | V1MD,deg, nHAcc, MATS2m, GATS2e, 2Xv, ST, JhetZ, Jhetv, MR, pl, 0Xv, H6p, BAC, J, PC | 0.923 | 0.552 | 4 | 0.893 | 0.796 | 0.839 | 0.954 | 5 | 0.924 | 0.830 | 0.879 | 0.192 |
M# refers to model number;
refers for calibration set;
refers for prediction set.
Figure 1.Correlation of R2CV with MLR model number.
Figure 2.Correlation of 1st principal component with 2nd principal component for the factor spaces of the descriptors and their BCF.
Figure 3.A) correlation of PRESS with ANN models (3–15). B) correlation of PRESS with PLS models (3–15). C) correlation of PRESS with different numbers of hidden nodes for ANN model 4. D) correlation of PRESS with different numbers of hidden nodes for ANN model 14.
Note: Blue and pink columns indicate PRESS values for the training and test sets, respectively.
Figure 4.Correlation of the predicted log BCF against observed one as well as their residues for A) training set. B) validation set. C) external test set of model 4 obtained by PC-ANN analysis using 10 hidden nodes.
Figure 5.Correlation of the predicted log BCF against observed one as well as their residues for A) training set. B) external test set of model 9 obtained by PLS analysis.
Comparison between the different QSAR models.a
| Zhao et al | (HM & GA) -RBFNN | 473 | 378 | 95 | 0.83 | 0.79 | NR | NR |
| Gramatica and Papa (2006) | GA-VSS + OLS | 84 | 53 | 31 | 0.77 | 0.73 | 0.74 | NR |
| Gramatica and Papa (2003) | GA-VSS + OLS | 238 | 179 | 59 | 0.8 | 0.78 | 0.88 | NR |
| Fatemi et al | GA-ANN | 53 | 44 | 9 | 0.88 | 0.89–92 | NR | NR |
| Wang et al | PLS | 238 | 202 | 36 | 0.84 | 0.83 | 0.8 | NR |
| Chen et al | PLS | 192 | 122 | 70 | 0.87 | 0.86 | 0.76 | NR |
| The current model | PC-ANN | 227 | 135 | 46 | 0.84 | 0.84 | NR | R |
| The current model | PLS | 227 | 181 | 46 | 0.85 | 0.82 | NR | NR |
Notes:
nA, number of all compounds; nT, number of compounds in the training set; nP, number of compounds in the test set;
See Ref.40 for R2CV, ex or (Q2ex) definition;
Val. Set, validation set.
Abbreviations: HM, Heuristic; GA, genetic algorithms; RBFNN, radial biased function neural network; VSS, variable subset selection; OLS, ordinary least square regression; R, reported; NR, not reported.