| Literature DB >> 21668999 |
Abstract
INTRODUCTION: Both main types of carcinogenesis, genotoxic and epigenetic, were examined in the context of non-congenericity and similarity, respectively, for the structure of ligand molecules, emphasizing the role of quantitative structure-activity relationship ((Q)SAR) studies in accordance with OECD (Organization for Economic and Cooperation Development) regulations. The main purpose of this report involves electrophilic theory and the need for meaningful physicochemical parameters to describe genotoxicity by a general mechanism. RESIDUAL-QSAREntities:
Year: 2011 PMID: 21668999 PMCID: PMC3141620 DOI: 10.1186/1752-153X-5-29
Source DB: PubMed Journal: Chem Cent J ISSN: 1752-153X Impact factor: 4.215
Figure 1Representation of the residual-QSAR algorithm from a given computed activity (.
The molecules listed with their effect on rat TD50 activity [28] and the semi-empirical PM3 (Hyperchem [29]) computed structural parameters of hydrophobicity (LogP), polarizability (POL, in Å3) and total optimized energy (Etot, in kcal/mol) belonging to the Gaussian training set illustrated in Figure 2.
| Chemical Compound | Formula | CASRN | TD50_Rat(a) | A(b) | logP | POL | Etot | |
|---|---|---|---|---|---|---|---|---|
| 3,3'-Dimethoxy-4,4'-biphenylene diisocyanate | C16H12N2O4 | 91-93-0 | 1630 | 2.79 | 2.07 | 30.03 | -82478.58594 | |
| Chrysazin (Danthron) | C14H8O4 | 117-10-2 | 245 | 3.61 | 1.87 | 24.44 | -68162.28125 | |
| Acetaldehyde | C2H4O | 75-07-0 | 153 | 3.82 | -0.58 | 4.53 | -13662.00781 | |
| Allyl isothiocyanate | C4H5NS | 57-06-7 | 96 | 4.02 | 1.17 | 11.74 | -20700.27344 | |
| Isobutyl nitrite | C4H9NO2 | 542-56-3 | 54.1 | 4.27 | 1.63 | 9.96 | -31363 | |
| Urethane | C3H7NO2 | 51-79-6 | 41.3 | 4.38 | -0.06 | 8.35 | -27989.58203 | |
| Ethylene oxide | C2H4O | 75-21-8 | 21.3 | 4.67 | -0.16 | 4.31 | -13626.54297 | |
| Hexa(hydroxymethyl)melamine | C9H18N6O6 | 531-18-0 | 10.2 | 4.99 | 1.96 | 27.19 | -108827.0859 | |
| 1,2-Dichloroethane | C2H4Cl2 | 107-06-2 | 8.04 | 5.09 | 1.59 | 8.3 | -21506.41406 | |
| Tris(2,3-dibromopropyl) phosphate | C9H15Br6O4P | 126-72-7 | 3.83 | 5.42 | 5.37 | 35.91 | -108827.0859 | |
| Beta-Propiolactone | C3H4O2 | 57-57-8 | 1.46 | 5.84 | -0.25 | 6.23 | -23148.73047 | |
| Chlorambucil | C14H19Cl2NO2 | 305-03-3 | 0.896 | 6.048 | 4.14 | 31.04 | -76933.42969 | |
| Azaserine | C5H7N3O4 | 115-02-6 | 0.793 | 6.10 | -1.03 | 14.25 | -54439.625 | |
| Dacarbazine | C6H10N6O | 4342-03-4 | 0.71 | 6.15 | -0.92 | 17.95 | -49126.58594 | |
| Thiotepa (Tris(aziridinyl)-phosphine sulfide) | C6H12N3PS | 52-24-4 | 0.164 | 6.789 | 0.54 | 17.63 | -38905.46484 | |
| Aflatoxin-B1 | C17H12O6 | 1162-65-8 | 0.0032 | 8.49 | 0.99 | 29.86 | -91307.82331 | |
| 2,3,7,8-Tetrachlorodibenzo-p-dioxin | C12H4 Cl4 O2 | 1746-01-6 | 0.0000457 | 10.34 | 4.93 | 28.31 | -76933.75 | |
| Aflatoxicol | C17H14O6 | 29611-03-8 | 0.00247 | 8.61 | 0.46 | 30.41 | -91979.58594 | |
| 1-(2-Hydroxyethyl)-1-nitrosourea | C3H7N3O3 | 13743-07-2 | 0.244 | 6.61 | -0.95 | 10.92 | -42184.19141 | |
| N'-Nitrosonornicotine-1-N-oxide | C9H11N3O2 | 78246-24-9 | 0.876 | 6.06 | 0.25 | 19.48 | -53174.95313 | |
| Benzo(a)pyrene | C20H12 | 50-32-8 | 0.956 | 6.02 | 5.37 | 36.04 | -58881.02734 | |
| 2-Acetylaminofluorene | C15H13NO | 53-96-3 | 1.22 | 5.91 | 2.61 | 26.26 | -56110.60547 | |
| 1,2-Dibromoethane | C2H4Br2 | 106-93-4 | 1.52 | 5.82 | 1.71 | 9.7 | -28203.0625 | |
| Hydrazobenzene | C12H12N2 | 122-66-7 | 5.59 | 5.25 | 3.8 | 19.85 | -67801.28125 | |
| Ethylene thiourea (ETU) | C3H6N2S | 96-45-7 | 8.13 | 5.09 | 0.33 | 11.45 | -22095.42578 | |
| Thioacetamide | C2H5NS | 62-55-5 | 11.5 | 4.94 | -0.21 | 9.04 | -15263.96289 | |
| o-Nitroanisole | C7H7NO3 | 91-23-6 | 15.6 | 4.81 | -0.18 | 14.75 | -45613.03906 | |
| 2-Aminodipyrido[1,2-a:3',2'-d]imidazole | C10H8N4 | 67730-10-3 | 42.3 | 4.37 | 2.35 | 20.73 | -45103.06641 | |
| Dichlorodiphenyltrichloroethane (DDT) | C14H9Cl5 | 50-29-3 | 84.7 | 4.07 | 6.39 | 33.4 | -77956.60156 | |
| p-Cresidine | C8H11NO | 120-71-8 | 98 | 4.01 | 1.48 | 16.09 | -36280.75391 | |
| Ethyl 2-(4-chlorophenoxy)-2-methylpropionate | C12H15ClO3 | 637-07-0 | 169 | 3.77 | 2.97 | 24.73 | -65740.6875 | |
| Vinyl acetate | C4H6O2 | 108-05-4 | 341 | 3.47 | -0.01 | 8.65 | -26598.12305 | |
| Salicylazosulfapyridine | C18H14N4O5S | 599-79-1 | 1590 | 2.799 | 4.54 | 36.79 | -107222.1719 |
(a) in [mg/kg body wt/day]; (b) computed as Log[1/TD
The molecules belonging to the quasi-Gaussian test set, as illustrated in Figure 2, with the same type of activity and structural parameters as those reported in Table 1.
| Chemical Compound | Formula | CASRN | TD50_Rat(a) | A(b) | logP | POL | Etot | |
|---|---|---|---|---|---|---|---|---|
| 34 | Phenacetin | C10H13NO2 | 62-44-2 | 1250 | 2.90 | 0.99 | 19.85 | -49230.08203 |
| 35 | Dimethylvinyl chloride (DMVC) | C4H7Cl | 513-37-1 | 31.8 | 4.498 | 1.51 | 9.85 | -20725.60325 |
| 36 | Sulfallate | C8H14ClNS2 | 95-06-7 | 26.1 | 4.58 | 2.73 | 24.79 | -46435.69922 |
| 37 | beta-Butyrolactone | C4H6O2 | 3068-88-0 | 13.8 | 4.86 | 0.17 | 8.06 | -26599.55273 |
| 38 | Vinyl Chloride | C2H3Cl | 75-01-4 | 6.11 | 5.21 | 1.01 | 6.18 | -13820.70898 |
| 39 | Acrylamide | C3H5NO | 79-06-1 | 3.75 | 5.43 | -0.28 | 7.52 | -20478.92578 |
| 40 | Mirex | C10Cl12 | 2385-85-5 | 1.77 | 5.75 | 6.41 | 38.39 | -114919.4688 |
| 41 | Dimethylnitramine | C2H6N2O2 | 4164-28-7 | 0.547 | 6.26 | 0.97 | 7.64 | -28551.91406 |
| 42 | N-Nitrosodimethylamine | C2H6N2O | 62-75-9 | 0.0959 | 7.02 | 0.01 | 7.01 | -21802.08203 |
| 43 | N-Methyl-N'-nitro-N-nitrosoguanidine | C2H5N5O3 | 70-25-7 | 0.803 | 6.1 | 1.5 | 11.13 | -46112.81641 |
| 44 | 1-Phenyl-3,3-dimethyltriazene | C8H11N3 | 7227-91-0 | 2.31 | 5.64 | 2.53 | 17.51 | -36944.65625 |
| 45 | Michler's ketone | C17H20N2O | 90-94-8 | 5.64 | 5.25 | 3.4 | 22.8 | -44481.07422 |
| 46 | 1'-Acetoxysafrole | C12H12 O4 | 34627-78-6 | 25 | 4.6 | -0.11 | 22.47 | -64108.48047 |
| 47 | o-Nitrosotoluene | C7H7NO | 611-23-4 | 50.7 | 4.29 | 2.29 | 13.48 | -32074.53516 |
| 48 | p-Nitrosodiphenylamine | C12H10 N2O | 156-10-5 | 201 | 3.7 | 3.07 | 22.66 | -50526.36328 |
| 49 | 1,4-Dichlorobenzene (p-dichlorobenzene) | C6H4Cl2 | 106-46-7 | 644 | 3.19 | 3.08 | 14.29 | -32415.54297 |
(a) in [mg/kg body wt/day]; (b) computed as Log[1/TD
Figure 2Graphical representation of the working activities for the molecules in Tables 1 and 2, classified to build up the "Gaussian" and "quasi-Gaussian" series that are specific to the training and testing QSAR purposes, respectively. The interpolating function, A = f(N), to be used in Equation (10) is also shown as the contour of the Gaussian set of trial molecules.
The parameters and statistical correlation coefficients for the residual-QSAR algorithm of Equations (1) and (2), as applied to the molecules of Table 1 in all possible combinations of variables.
| LogP | 5.297587 | -0.007280 | 0.0091 | 5.285636 | 1 | 0.9999 | ||
| POL | 4.712835 | 0.029613 | 0.1832 | 5.285636 | 1 | 0.9831 | ||
| Etot | 4.676954 | -0.000011 | 0.2033 | 5.285636 | 1 | 0.9791 | ||
| LogP, POL | 4.339331 | -0.279746 | 0.072662 | 0.2925 | 5.285636 | 1 | 0.9563 | |
| LogP, Etot | 4.578059 | -0.162902 | -0.000018 | 0.2608 | 5.285636 | 1 | 0.9654 | |
| POL, Etot | 4.679442 | -0.000978 | -0.000012 | 0.2033 | 5.285636 | 1 | 0.9791 | |
| LogP, POL, Etot | 4.341697 | -0.273668 | 0.06646 | -0.000002 | 0.2929 | 5.285636 | 1 | 0.9562 |
Residual-QSAR self-consistent (SC), factor (F1), averaged (AV, with ) models of Equations (3), (7), and (8) for the Hansch parameters of Table 3, with the modeling and predictive powers for the "Gaussian" and "Quasi-Gaussian" molecules of Tables 1 and 2 represented by their associated correlation factors, respectively.
| Structural | Activity Model | |||
|---|---|---|---|---|
| 0.99996 | 0.99994 | |||
| -119.51 + 72.8[ | 0.0091 | 0.1240 | ||
| 0.0091 | 0.1240 | |||
| 0.98307 | 0.97713 | |||
| 33.8936-1.75225[ | 0.1832 | 0.23179 | ||
| 0.1832 | 0.23179 | |||
| 0.98362 | 0.97238 | |||
| 29.1235 + 5.26316 × 10-4[ | 0.2033 | 0.04250 | ||
| 0.2033 | 0.04250 | |||
| 0.95626 | 0.94916 | |||
| 21.6546 + 6.40151[ | 0.2925 | 0.21906 | ||
| 0.2925 | 0.21906 | |||
| 0.96686 | 0.96164 | |||
| 20.4502 + 4.70815[ | 0.2608 | 0.0524 | ||
| 0.2608 | 0.0524 | |||
| 0.97838 | 0.97017 | |||
| 29.0045 + 0.046793[ | 0.2033 | 0.03654 | ||
| 0.2033 | 0.03654 | |||
| 0.95628 | 0.94927 | |||
| 21.5511 + 6.24813[ | 0.2929 | 0.19871 | ||
| 0.2929 | 0.19871 | |||
Synopsis of the statistical paths connecting the correlation factors for the models of Table 4.
| Statistical Path | Self-Consistent res-QSARs | Factor and Averaged res-QSARs | ||
|---|---|---|---|---|
| 0.11541 | ||||
| 0.04368 | 0.05067 | 0.2838 | 0.21791 | |
| 0.04368 | 0.05067 | 0.2838 | ||
| 0.02683 | 0.02808 | 0.1097 | ||
| 0.02679 | 0.3257 | |||
| 0.02786 | 0.1097 | 0.35742 | ||
| 0.02738 | 0.02333 | 0.0896 | 0.19691 | |
| 0.02311 | 0.0896 | |||
| 0.02734 | 0.16813 | |||
Figure 3Illustration of the molecular mechanism for genotoxic carcinogenesis according to the present residual-QSAR correlation-path hierarchy superimposed over an immunohistochemcial analysis of paraffin-embedded sections of rat intestinal cancer using the Caspase-2 antibody [42].