| Literature DB >> 21954348 |
Mihai V Putz1, Cosmin Ionaşcu, Ana-Maria Putz, Vasile Ostafe.
Abstract
Given the modeling and predictive abilities of quantitative structure activity relationships (QSARs) for genotoxic carcinogens or mutagens that directly affect DNA, the present research investigates structural alert (SA) intermediate-predicted correlations A(SA) of electrophilic molecular structures with observed carcinogenic potencies in rats (observed activity, A = Log[1/TD(50)], i.e., [Formula: see text]). The present method includes calculation of the recently developed residual correlation of the structural alert models, i.e., [Formula: see text]. We propose a specific electrophilic ligand-receptor mechanism that combines electronegativity with chemical hardness-associated frontier principles, equality of ligand-reagent electronegativities and ligand maximum chemical hardness for highly diverse toxic molecules against specific receptors in rats. The observed carcinogenic activity is influenced by the induced SA-mutagenic intermediate effect, alongside Hansch indices such as hydrophobicity (LogP), polarizability (POL) and total energy (Etot), which account for molecular membrane diffusion, ionic deformation, and stericity, respectively. A possible QSAR mechanistic interpretation of mutagenicity as the first step in genotoxic carcinogenesis development is discussed using the structural alert chemoinformation and in full accordance with the Organization for Economic Co-operation and Development QSAR guidance principles.Entities:
Keywords: OECD principles; electronegativity and chemical hardness reactivity principles; genotoxic carcinogenesis; residual-QSAR; structural alerts
Mesh:
Substances:
Year: 2011 PMID: 21954348 PMCID: PMC3179155 DOI: 10.3390/ijms12085098
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1.The alert-QSAR method uses structural alerts to assemble a molecular fragment QSAR model that has predictive power similar to that of full molecular modeling.
Molecules from the Gaussian training set (Figure 2) and corresponding rat TD50 toxicities (in mg/kg body wt/day) [16] and activities A = Log(1/TD50) using semi-empirical PM3-computed (Hyperchem [30]) structural parameters: hydrophobicity (LogP), polarizability (POL) [Ǻ3], total optimized energy (Etot) [kcal/mol], electronegativity (χ = −0.5(ɛLUMO + ɛHOMO)) [eV], and chemical hardness (η = 0.5(ɛLUMO – ɛHOMO)) [eV].
| 1. | 3,3′-Dimethoxy-4,4′-biphenylene diisocyanate | 1630 | 2.07 | 30.03 | −82478.58594 | 4.74077805 | 3.85074395 | ||
| 2. | Chrysazin (Danthron) | 245 | 1.87 | 24.44 | −68162.28125 | 5.4079765 | 3.9369375 | ||
| 3. | Acetaldehyde | 153 | −0.58 | 4.53 | −13662.00781 | 4.94880425 | 5.75505575 | ||
| 4. | Allyl isothiocyanate | 96 | 1.17 | 11.74 | −20700.27344 | 4.9388987 | 4.2117593 | ||
| 5. | Isobutyl nitrite | 54.1 | 1.63 | 9.96 | −31363 | 5.294418075 | 5.263031925 | ||
| 6. | Urethane | 41.3 | −0.06 | 8.35 | −27989.58203 | 4.573154 | 5.741656 | ||
| 7. | Ethylene oxide | 21.3 | −0.16 | 4.31 | −13626.54297 | 4.4747555 | 6.8617045 | ||
| 8. | Hexa(hydroxymethyl)mela mine | 10.2 | 1.96 | 27.19 | −108827.09 | 4.05956015 | 4.50969485 | ||
| 9. | 1,2-Dichloroethane | 8.04 | 1.59 | 8.3 | −21506.41406 | 5.0714835 | 5.6050665 | ||
| 10. | Tris(2,3-dibromopropyl) phosphate | 3.83 | 5.37 | 35.91 | −108827.09 | 5.6512295 | 4.5231705 | ||
| 11. | Beta-Propiolactone | 1.46 | −0.25 | 6.23 | −23148.73047 | 5.2018966 | 6.0842834 | ||
| 12. | Chlorambucil | 0.896 | 4.14 | 31.04 | −76933.42969 | 4.350258535 | 4.405313465 | ||
| 13. | Azaserine | 0.793 | −1.03 | 14.25 | −54439.625 | 5.2656847 | 4.7215543 | ||
| 14. | Dacarbazine | 0.71 | −0.92 | 17.95 | −49126.58594 | 4.9880568 | 4.1820822 | ||
| 15. | Thiotepa (Tris(aziridinyl)-phosphine sulfide) | 0.164 | 0.54 | 17.63 | −38905.46484 | 5.2831755 | 3.8071835 | ||
| 16. | Aflatoxin-B1 | 0.0032 | 0.99 | 29.86 | −91307.82331 | 5.3273625 | 3.9567405 | ||
| 17. | 2,3,7,8-Tetrachlorodibenzo-p-dioxin | 0.0000457 | 4.93 | 28.31 | −76933.75 | 4.7914412 | 4.0075488 | ||
| 18. | Aflatoxicol | 0.00247 | 0.46 | 30.41 | −91979.58594 | 5.140259 | 3.945276 | ||
| 19. | 1-(2-Hydroxyethyl)-1-nitrosourea | 0.244 | −0.95 | 10.92 | −42184.19141 | 5.42904375 | 5.08170625 | ||
| 20. | N'-Nitrosonornicotine-1-N-oxide | 0.876 | 0.25 | 19.48 | −53174.95313 | 5.04527 | 4.273811 | ||
| 21. | Benzo(a)pyrene | 0.956 | 5.37 | 36.04 | −58881.02734 | 4.631374 | 3.410258 | ||
| 22. | 2-Acetylaminofluorene | 1.22 | 2.61 | 26.26 | −56110.60547 | 4.38615285 | 4.02819215 | ||
| 23. | 1,2-Dibromoethane | 1.52 | 1.71 | 9.7 | −28203.0625 | 6.1527065 | 5.0695035 | ||
| 24. | Michler's ketone | 5.64 | 3.8 | 19.85 | −67801.28125 | 4.3453716 | 4.1924714 | ||
| 25. | Ethylene thiourea (ETU) | 8.13 | 0.33 | 11.45 | −22095.42578 | 4.40057075 | 4.20081425 | ||
| 26. | Thioacetamide | 11.5 | −0.21 | 9.04 | −15263.96289 | 4.72959049 | 3.99513951 | ||
| 27. | o-Nitroanisole | 15.6 | −0.18 | 14.75 | −45613.03906 | 5.5631575 | 4.3657605 | ||
| 28. | 2-Aminodipyrido[1,2-a:3`,2`-d]imidazole (Glu-P-2) | 42.3 | 2.35 | 20.73 | −45103.06641 | 4.5267029 | 3.7506371 | ||
| 29. | Dichlorodiphenyltrichloroet hane (DDT) | 84.7 | 6.39 | 33.4 | −77956.60156 | 4.95182645 | 4.50488155 | ||
| 30. | p-Cresidine | 98 | 1.48 | 16.09 | −36280.75391 | 3.9300665 | 4.3473585 | ||
| 31. | Ethyl 2-(4-chlorophenoxy)-2-methylpropionate (Clofibrate) | 169 | 2.97 | 24.73 | −65740.6875 | 4.49111609 | 4.53578491 | ||
| 32. | Vinyl acetate | 341 | −0.01 | 8.65 | −26598.12305 | 4.6849081 | 5.2657279 | ||
| 33. | Salicylazosulfapyridine | 1590 | 4.54 | 36.79 | −107222.1719 | 5.209331 | 3.898064 |
Molecules from the quasi-Gaussian test set (Figure 2), withthe activities and structural parameters as in Table 1.
| 34. | Phenacetin | 1250 | 0.99 | 19.85 | −49230.08203 | 4.063315 | 4.307675 | ||
| 35. | Dimethylvinyl chloride (DMVC) | 31.8 | 1.51 | 9.85 | −20725.60325 | 4.32596855 | 4.98083445 | ||
| 36. | Sulfallate | 26.1 | 2.73 | 24.79 | −46435.69922 | 4.8447835 | 3.8753115 | ||
| 37. | beta-Butyrolactone | 13.8 | 0.17 | 8.06 | −26599.55273 | 5.1344426 | 6.0826774 | ||
| 38. | Vinyl Chloride | 6.11 | 1.01 | 6.18 | −13820.70898 | 4.56666095 | 5.27117005 | ||
| 39. | Acrylamide | 3.75 | −0.28 | 7.52 | −20478.92578 | 4.77457395 | 4.91861805 | ||
| 40. | Mirex | 1.77 | 6.41 | 38.39 | −114919.4688 | 5.27780275 | 5.22349725 | ||
| 41. | Dimethylnitramine | 0.547 | 0.97 | 7.64 | −28551.91406 | 5.288693895 | 5.374516105 | ||
| 42. | N-Nitrosodimethylami ne | 0.0959 | 0.01 | 7.01 | −21802.08203 | 4.6046239 | 5.1639551 | ||
| 43. | N-Methyl-N`-nitro-N-nitrosoguanidine (1-Methyl-3-nitro-1-nitroso-guanidine) | 0.803 | 1.5 | 11.13 | −46112.81641 | 5.475207 | 4.654173 | ||
| 44. | 1-Phenyl-3,3-dimethyltriazene | 2.31 | 2.53 | 17.51 | −36944.65625 | 4.65555575 | 4.28693125 | ||
| 45. | Hydrazobenzene | 5.59 | 3.4 | 22.8 | −44481.07422 | 3.65518885 | 3.99645815 | ||
| 46. | 1'-Acetoxysafrole | 25 | −0.11 | 22.47 | −64108.48047 | 4.516422835 | 4.517086165 | ||
| 47. | o-Nitrosotoluene | 50.7 | 2.29 | 13.48 | −32074.53516 | 5.20234765 | 4.40152935 | ||
| 48. | p-Nitrosodiphenyl amine | 201 | 3.07 | 22.66 | −50526.36328 | 4.57337225 | 3.74357475 | ||
| 49. | 1,4-Dichlorobenzene (p-dichlorobenzene) | 644 | 3.08 | 14.29 | −32415.54297 | 4.73892295 | 4.49613405 |
Figure 2.Graphical representation of the working activities for molecules in Tables 1 and 2, classified under the “Gaussian” and “quasi-Gaussian” series for the training and testing QSARs, respectively.
QSAR models for the activities of the trial molecules with the physicochemical parameters of the SAs in Table 1.
| 5.62 − 0.07 | 0.026 | ||
| 6.016 − 0.149 | 0.086 | ||
| 4.677 + 0.05 | 0.27 | ||
| 5.066 + 0.159 | 0.16 | ||
| 4.402 − 0.00031
| 0.35 | ||
| 6.51 − 0.09 | 0.093 | ||
| 5.1 − 0.08 | 0.27 | ||
| 5.54 − 0.1 | 0.16 | ||
| 5.58 − 0.255 | 0.371 | ||
| 2.43 + 0.38 | 0.316 | ||
| 4.75 + 0.05 | 0.167 | ||
| 2.6 + 0.314 | 0.38 | ||
| 4.36 + 0.12 | 0.337 | ||
| 4.38 − 0.0625 | 0.382 | ||
| 4.09 − 0.412 | 0.4305 | ||
| 2.62 − 0.035 | 0.31 | ||
| 5.25 − 0.095 | 0.17 | ||
| 3.78 − 0.254 | 0.399 | ||
| 4.52 − 0.0325 | 0.337 | ||
| 6.46 − 0.45 | 0.4141 | ||
| 5.75 − 0.36 | 0.451 | ||
| 2.05 + 0.39 | 0.373 | ||
| 3.09 + 0.22 | 0.392 | ||
| 3.23 + 0.156 | 0.436 | ||
| 4.07 + 0.015 | 0.4312 | ||
Multi-linear QSAR models for the trial molecular activities with the full molecular (M) physicochemical parameters of Table 1 and the corresponding activities of the structural alerts (ASA or ASA) from Table 3.
| 2.18 + 0.63 | 0.187 | ||
| − 65.66 + 12.1
| 0.295 | ||
| 7.13 − 0.405 | 0.192 | ||
| 11.38 − 0731
| 0.198 | ||
| 4.71 + 0.029 | 0.18 | ||
| − 0.438 + 1.1
| 0.275 | ||
| 5.29 − 0.0007 | 0.009 | ||
| − 13.64 + 3.71
| 0.325 | ||
| 4.67 − 0.00001
| 0.203 | ||
| − 0.05 + 1.01
| 0.358 | ||
| 3.92 + 0.68 | 0.278 | ||
| 0.015 + 0.73 | 0.28 | ||
| 1.36 + 0.67 | 0.27 | ||
| − 6.009 + 0.86 | 0.373 | ||
| 2.12 + 0.639 | 0.188 | ||
| − 19.46 + 0.81 | 0.409 | ||
| 1.64 + 0.62 | 0.27 | ||
| − 4.81 + 0.83 | 0.443 | ||
| 6.18 − 0.264 | 0.204 | ||
| 1.47 − 0.64 | 0.378 | ||
| 7.66 − 0.49 | 0.213 | ||
| −12.17 − 0.52 | 0.403 | ||
| 5.993 − 0.24 | 0.226 | ||
| 1.585 − 0.442 | 0.426 | ||
| 4.339 + 0.72 | 0.29 | ||
| − 0.81 + 0.04 | 0.422 | ||
| 4.67 − 0.00097 | 0.203 | ||
| − 0.299 − 0.0304 | 0.3907 | ||
| 4.578 − 0.162 | 0.26 | ||
| − 0.603 − 0.2337 | 0.488 | ||
| 2.89 + 0.686 | 0.288 | ||
| − 3.19 + 0.86 | 0.452 | ||
| 4.458 + 0.65 | 0.287 | ||
| − 16.899 + 0.794 | 0.462 | ||
| 3.016 + 0.65 | 0.298 | ||
| − 2.689 + 0.841 | 0.497 | ||
| 1.481 + 0.58 | 0.338 | ||
| − 5.593 + 0.81 | 0.483 | ||
| 1.503 + 0.64 | 0.276 | ||
| − 4.009 + 0.725 | 0.469 | ||
| 1.961 + 0.538 | 0.304 | ||
| − 4.251 + 0.68 | 0.552 | ||
| 4.87 − 0.093 | 0.294 | ||
| − 0.253 − 0.153 | 0.466 | ||
| 6.84 − 0.39 | 0.239 | ||
| 3.415 − 1.08 | 0.507 | ||
| 6.176 − 0.303 | 0.287 | ||
| 1.433 − 0.449 | 0.525 | ||
| 4.341 + 0.06 | 0.298 | ||
| − 0.598 + 0.031 | 0.495 | ||
Residual-QSARs for the structural alert models of Table 3.
| 5.2856 +
| 0.999 | ||
| 5.2856 +
| 0.996 | ||
| 5.2856 +
| 0.961 | ||
| 5.2856 +
| 0.986 | ||
| 5.2856 +
| 0.933 | ||
| 5.2856 + 0.886
| 0.999 | ||
| 5.2856 + 0.987
| 0.999 | ||
| 5.2856 + 0.963
| 0.999 | ||
| 5.2856 + 0.98
| 0.999 | ||
| 5.2856 + 1.194
| 0.997 | ||
| 5.2856 + 1.101
| 0.996 | ||
| 5.2856 + 1.105
| 0.996 | ||
| 5.2856 − 0.709
| 0.991 | ||
| 5.2856 + 1.642
| 0.966 | ||
| 5.2856 + 1.395
| 0.991 | ||
| 5.2856 + 0.852
| 0.999 | ||
| 5.2856 + 0.884
| 0.999 | ||
| 5.2856 + 0.892
| 0.999 | ||
| 5.2856 + 0.941
| 0.999 | ||
| 5.2856 + 1.005
| 0.999 | ||
| 5.2856 + 0.983
| 0.999 | ||
| 5.2856 + 0.951
| 0.997 | ||
| 5.2856 + 1.212
| 0.997 | ||
| 5.2856 + 0.955
| 0.997 | ||
| 5.2856 − 0.423
| 0.991 | ||
| 5.2856 + 0.815
| 0.999 | ||
| 5.2856 + 0.966
| 0.999 | ||
| 5.2856 + 0.966
| 0.999 | ||
| 5.2856 + 0.902
| 0.999 | ||
| 5.2856 + 0.858
| 0.999 | ||
| 5.2856 + 0.830
| 0.999 | ||
| 5.2856 +
| 0.99 | ||
| 5.2856 +
| 0.96 | ||
| 5.2856 +
| 0.985 | ||
| 5.2856 +
| 0.928 | ||
| 5.2856 +
| 0.985 | ||
| 5.2856 +
| 0.985 | ||
| 5.2856 +
| 0.92 | ||
| 5.2856 +
| 0.93 | ||
| 5.2856 +
| 0.923 | ||
| 5.2856 +
| 0.902 | ||
| 5.2856 + 1.200
| 0.996 | ||
| 5.2856 + 1.0003
| 0.995 | ||
| 5.2856 + 0.9991
| 0.995 | ||
| 5.2856 − 0.490
| 0.988 | ||
| 5.2856 + 1.033
| 0.96 | ||
| 5.2856 + 0.945
| 0.961 | ||
| 5.2856 + 1.319
| 0.987 | ||
| 5.2856 + 1.148
| 0.986 | ||
| 5.2856 + 1.086
| 0.985 | ||
| 5.2856 − 0.084
| 0.948 | ||
| 5.2856 − 0.388
| 0.99 | ||
| 5.2856 + 0.308
| 0.944 | ||
| 5.2856 + 0.870
| 0.949 | ||
| 5.2856 + 1.155
| 0.987 | ||
| 5.2856 + 0.342
| 0.947 | ||
| 5.2856 +
| 0.948 | ||
| 5.2856 +
| 0.985 | ||
| 5.2856 +
| 0.916 | ||
| 5.2856 +
| 0.941 | ||
| 5.2856 +
| 0.91 | ||
| 5.2856 +
| 0.89 | ||
| 5.2856 +
| 0.927 | ||
| 5.2856 +
| 0.919 | ||
| 5.2856 +
| 0.899 | ||
| 5.2856 +
| 0.902 | ||
Residual-alert QSARs for the models of Table 5 that fulfill Equation (10) with highest trial correlation factors. These are compared with the respective direct structural alert models of Table 3 using their correlation performances for the trial and test molecules in Tables 1 and 2, respectively.
| 0.368 | 0.168 | |||
| 0.371 | 0.127 | |||
| 0.078 | 0.505 | |||
| 0.316 | 0.043 | |||
| 0.063 | 0.725 | |||
| 0.167 | 0.052 | |||
| 0.384 | 0.087 | |||
| 0.38 | 0.131 | |||
| 0.040 | 0.222 | |||
| 0.337 | 0.357 | |||
| 0.369 | 0.015 | |||
| 0.382 | 0.132 | |||
| 0.386 | 0.016 | |||
| 0.430 | 0.007 | |||
| 0.373 | 0.056 | |||
| 0.414 | 0.018 | |||
| 0.371 | 0.149 | |||
| 0.451 | 0.136 | |||
| 0.018 | 0.592 | |||
| 0.31 | 0.027 | |||
| 0.122 | 0.286 | |||
| 0.373 | 0.112 | |||
| 0.304 | 0.178 | |||
| 0.392 | 0.152 | |||
| 0.382 | 0.039 | |||
| 0.436 | 0.012 | |||
| 0.019 | 0.399 | |||
| 0.337 | 0.370 | |||
| 0.394 | 0.038 | |||
| 0.431 | 0.034 | |||
| 0.289 | 0.277 | |||
| 0.373 | 0.007 | |||
| 0.372 | 0.296 | |||
| 0.452 | 0.106 | |||
| 0.21 | 0.184 | |||
| 0.441 | 0.033 | |||
| 0.348 | 0.300 | |||
| 0.455 | 0.066 | |||
| 0.274 | 0.225 | |||
| 0.416 | 0.021 | |||
Trial-test averages of the correlations’ connected paths between the endpoint models of Table 6, computed using the Euler Equation (11).
| Δ | Δ | |||||
|---|---|---|---|---|---|---|
| 0.005099 | 0.264847 | 0.134973 | 0.057384 | 0.140089 | 0.098736 | |
| 0.003162 | 0.148222 | 0.080006 | 0.031320 | |||
| 0.023194 | 0.152190 | 0.087692 | 0.080099 | 0.070576 | 0.075337 | |
| 0.098386 | 0.241588 | 0.169987 | 0.088729 | 0.104890 | 0.09681 | |
| 0.244769 | 0.327055 | 0.285912 | 0.090426 | 0.161375 | 0.125901 | |
| 0.105948 | 0.450693 | 0.278321 | 0.216933 | 0.099201 | 0.158067 | |
| 0.177115 | 0.439092 | 0.308104 | 0.206000 | 0.120933 | 0.163467 | |
| 0.362415 | 0.701156 | 0.531786 | 0.269046 | 0.045177 | 0.157112 | |
| 0.320806 | 0.733973 | 0.52739 | 0.269670 | 0.067201 | 0.168436 | |
| 0.123434 | 0.091197 | 0.107316 | 0.050447 | 0.120838 | 0.085643 | |
| 0.085440 | 0.102420 | 0.026832 | 0.132672 | |||
| 0.172011 | 0.152738 | 0.162375 | 0.056222 | 0.120838 | 0.08853 | |
| 0.034058 | 0.265377 | 0.149718 | 0.059135 | 0.130678 | 0.094907 | |
| 0.120282 | 0.120415 | 0.077369 | 0.257421 | 0.167395 | ||
| 0.431226 | 0.187882 | 0.309554 | 0.094530 | 0.323154 | 0.208842 | |
| 0.114284 | 0.163110 | 0.138697 | 0.050009 | 0.120668 | 0.085339 | |
| 0.185690 | 0.147800 | 0.166745 | 0.050009 | 0.098005 | 0.074007 | |
| 0.027459 | 0.201221 | 0.11434 | 0.021377 | 0.146768 | 0.084073 | |
| 0.172046 | 0.146812 | 0.159429 | 0.007810 | 0.021587 | ||
| 0.034234 | 0.262011 | 0.148123 | 0.019924 | 0.054230 | 0.037077 | |
| 0.184173 | 0.147648 | 0.165911 | 0.010049 | 0.027018 | 0.018534 | |
Figure 3.The electrophilic docking structure-reactivity algorithm correlating electronegativity and chemical hardness with chemical carcinogenesis. The algorithm starts with electronegativity docking (equalization) between the ligand and the receptor (the middle dashed line). Next, intra-molecular (in connection with specific structural alerts) maximization of the HOMO-LUMO gap (i.e., of chemical hardness) is accomplished by exo-electrophilic transfer of an electron from ligand to receptor.
Figure 4.Illustration on a ligand-receptor cyclic interaction coordinate of the molecular mechanism of genotoxic carcinogenesis as given by the residual-alert-QSAR correlation-path hierarchy of Equations (13a)–(13c) then summarized in Equation (17). The mechanism is superimposed over an immunohistochemical analysis of paraffin-embedded sections of rat intestinal cancer using the Caspase-2 antibody [51]. In these evolving molecular graphs (the SA region is circumvented), steric movement is represented by mirroring, electronegativity docking by changing SA colors, diffusion by translation arrows; polarizability by vibration arrows, and electrophilic docking (the final stage including the maximum hardness principle) by positive charging.