| Literature DB >> 18414635 |
Hao Zhu1, Ivan Rusyn, Ann Richard, Alexander Tropsha.
Abstract
BACKGROUND: To develop efficient approaches for rapid evaluation of chemical toxicity and human health risk of environmental compounds, the National Toxicology Program (NTP) in collaboration with the National Center for Chemical Genomics has initiated a project on high-throughput screening (HTS) of environmental chemicals. The first HTS results for a set of 1,408 compounds tested for their effects on cell viability in six different cell lines have recently become available via PubChem.Entities:
Keywords: QSAR; carcinogenesis; computational toxicology; high-throughput screening
Mesh:
Substances:
Year: 2008 PMID: 18414635 PMCID: PMC2291015 DOI: 10.1289/ehp.10573
Source DB: PubMed Journal: Environ Health Perspect ISSN: 0091-6765 Impact factor: 9.031
Summary of the biological activity of chemical agents screened in NTP–HTS assays.
| Classification | BJ | HEK293 | HepG2 | Jurkat | MRC-5 | SK-N-SH | All tests |
|---|---|---|---|---|---|---|---|
| Actives | 42 | 63 | 41 | 121 | 37 | 74 | 140 |
| Inconclusives | 44 | 79 | 47 | 89 | 44 | 54 | 90 |
| Inactives | 1,203 | 1,147 | 1,201 | 1,079 | 1,208 | 1,161 | 1,059 |
Rodent carcinogenicity classification (CPDB database) for 314 NTP–HTS compounds.
| Rats
| Mice
| |||
|---|---|---|---|---|
| Classification | Male | Female | Male | Female |
| Active | 121 | 111 | 123 | 134 |
| Inactive | 150 | 154 | 153 | 140 |
| Total | 271 | 265 | 276 | 274 |
Statistical information of the 15 most statistically significant kNN QSAR models based on the 275-compound modeling set.
| Model ID | N-training | Pred.-training | N-test | Pred.-test | NNN |
|---|---|---|---|---|---|
| 1 | 141 | 0.90 | 119 | 0.73 | 1 |
| 2 | 140 | 0.91 | 121 | 0.71 | 1 |
| 3 | 141 | 0.92 | 120 | 0.69 | 1 |
| 4 | 140 | 0.88 | 123 | 0.73 | 1 |
| 5 | 141 | 0.88 | 120 | 0.73 | 1 |
| 6 | 190 | 0.90 | 85 | 0.71 | 1 |
| 7 | 228 | 0.86 | 47 | 0.74 | 1 |
| 8 | 140 | 0.92 | 121 | 0.67 | 1 |
| 9 | 140 | 0.89 | 116 | 0.70 | 4 |
| 10 | 149 | 0.88 | 122 | 0.70 | 1 |
| 11 | 140 | 0.87 | 124 | 0.72 | 1 |
| 12 | 190 | 0.85 | 85 | 0.73 | 1 |
| 13 | 140 | 0.88 | 125 | 0.70 | 1 |
| 14 | 149 | 0.87 | 118 | 0.71 | 1 |
| 15 | 141 | 0.92 | 123 | 0.66 | 1 |
| Average | 154 | 0.89 | 111 | 0.71 | 1 |
Abbreviations: N-training, number of compounds in the training set; Pred.-training, the overall predictivity of the training set; N-test, number of compounds in the test set; Pred.-test, the overall predictivity of the test set; NNN, number of the nearest neighbors used for prediction.
Consensus prediction for 109 compounds in the external validation set.
| Consensus prediction
| After applicability domain applied
| |||
|---|---|---|---|---|
| Model characteristics | Exp. actives | Exp. inactives | Exp. actives | Exp. inactives |
| Pred. actives ( | 21 | 7 | 17 | 5 |
| Pred. inactives ( | 16 | 65 | 9 | 65 |
| Sensitivity (%) | 56.8 | 65.4 | ||
| Specificity (%) | 90.2 | 92.9 | ||
| Overall predictive power (%) | 73.5 | 79.2 | ||
Abbreviations: Exp., experimental; Pred., predicted.
The overall predictive power is the average value of sensitivity (predictive rate of actives) and specificity (predictive rate of inactives).
The relationship between HTS activity and rodent carcinogenicity of 314 compounds.
| Content of CPDB | HTS actives | HTS inconclusives | HTS inactives |
|---|---|---|---|
| CPDB actives ( | 30 | 12 | 136 |
| CPDB inactives ( | 9 | 13 | 114 |
| Correlation (%) | 77 | — | 46 |
Figure 1Seven HTS descriptors with their frequency of use in the 198 kNN QSAR model.
Consensus prediction of 50 compounds in the external validation set using the kNN QSAR models based on two different descriptor sets.
| CAS no. | Name | CPDB actives | MZ | MZHTS |
|---|---|---|---|---|
| 79005 | 1,1,2-Trichloroethane | + | + | + |
| 106934 | 1,2-Dibromoethane | + | + | + |
| 90120 | 1-Methylnaphthalene | – | – | – |
| 86577 | 1-Nitronaphthalene | – | + | + |
| 634935 | 2,4,6-Trichloroaniline | + | + | + |
| 120832 | 2,4-Dichlorophenol | – | + | – |
| 99558 | 5-Nitro- | + | + | + |
| 67630 | Isopropanol | – | + | – |
| 96695 | 4,4-Thiobis(6- | – | + | + |
| 619170 | 4-Nitroanthranilic acid | – | – | – |
| 298817 | 8-Methoxypsoralen | + | + | + |
| 75058 | Acetonitrile | – | + | + |
| 50782 | Acetylsalicylic acid | – | – | – |
| 50760 | Actinomycin D | + | I | + |
| 86500 | Azinphosmethyl | – | – | – |
| 92875 | Benzidine | + | + | + |
| 57578 | Propiolactone | + | + | + |
| 80057 | Bisphenol A | – | + | – |
| 75274 | Bromodichloromethane | + | + | + |
| 115286 | Chlorendic acid | + | I | I |
| 91645 | Coumarin | + | + | + |
| 4342034 | Dacarbazine | + | – | – |
| 103231 | Di(2-ethylhexyl)adipate | + | + | + |
| 333415 | Diazinon | – | – | – |
| 62737 | Dichlorvos | + | + | + |
| 828002 | Dimethoxane | + | – | – |
| 98011 | Furfural | + | + | + |
| 87683 | Hexachloro-1,3-butadiene | + | – | + |
| 67721 | Hexachloroethane | + | + | + |
| 122667 | Hydrazobenzene | + | – | – |
| 58935 | Hydrochlorothiazide | – | I | I |
| 121755 | Malathion | – | – | – |
| 298000 | Methyl parathion | – | – | – |
| 150685 | Monuron | + | – | – |
| 1212299 | – | I | I | |
| 759739 | + | + | + | |
| 98953 | Nitrobenzene | + | + | + |
| 67209 | Nitrofurantoin | + | I | + |
| 59870 | Nitrofurazone | + | I | I |
| 55185 | + | + | + | |
| 636215 | + | – | – | |
| 106478 | – | – | – | |
| 122601 | Phenyl glycidyl ether | + | + | + |
| 103855 | Phenylthiourea | – | – | + |
| 1918021 | Picloram | – | – | – |
| 57681 | Sulfamethazine | + | – | – |
| 79196 | Thiosemicarbazide | – | + | + |
| 108054 | Vinyl acetate | + | + | + |
| 1330207 | Xylenes (mixed) | – | + | + |
| 17924924 | Zearalenone | + | – | + |
Abbreviations: +, carcinogenic; –, noncarcinogenic; I, inconclusive because out of the applicability domain; MZ, models based on MolConnZ descriptors only; MZHTS, models based on the combination of MolConnZ and HTS descriptors.
Summary of the statistical parameters of the prediction results of 50 external compounds.
| Chemical descriptors only
| Combined descriptors
| |||
|---|---|---|---|---|
| Model characteristics | Exp. actives | Exp. inactives | Exp. actives | Exp. inactives |
| Pred. actives | 18 | 8 | 22 | 6 |
| Pred. inactives | 8 | 10 | 6 | 12 |
| Sensitivity (%) | 69.2 | 78.6 | ||
| Specificity (%) | 55.5 | 66.7 | ||
| Overall predictivity (%) | 62.3 | 72.7 | ||
| Coverage (%) | 88 | 92 | ||
Abbreviations: Exp., experimental; Pred., predicted.
Summary of the top 10 atom and bond type MozConnZ chemical descriptors used in successful kNN QSAR models before and after using HTS descriptors.
| No. | Descr_Name | Illustration | Freq_MZ | Ratio_MZ | Freq_MZHTS | Ratio_MZHTS |
|---|---|---|---|---|---|---|
| 1 | Snitroso | Sum of E-states of nitroso group
| 38 | 36.9% | 73 | 36.9% |
| 2 | nnitroso | Number of nitroso group
| 34 | 33.0% | 69 | 34.8% |
| 3 | nHBint3 | Number of hydrogen bond acceptor and donor pairs separated by 3 skeletal bonds
| 27 | 26.2% | 31 | 15.7% |
| 4 | naasN | Number of aromatic nitrogen with substitute
| 25 | 24.3% | 42 | 21.2% |
| 5 | SHBint3 | Sum of E-state of strength for potential hydrogen bonds if separated by 3 skeletal bonds
| 24 | 23.3% | 41 | 20.7% |
| 6 | nHssNH | Number of amine groups
| 24 | 23.3% | 23 | 11.6% |
| 7 | SdsN | Sum of E-states for nitrogens with one single bond and one double bond
| 24 | 23.3% | 48 | 24.2% |
| 8 | SdsssP | Sum of E-states for phosphors with three single bonds and one double bond
| 19 | 18.4% | 21 | 10.6% |
| 9 | SsBr | Sum of E-states for bromines
| 19 | 18.4% | 45 | 22.7% |
| 10 | SHssNH | Sum of H E-states for hydrogens in amine groups.
| 18 | 17.5% | 25 | 12.6% |
Abbreviations: Descr_Name, name of descriptor; Freq_MZ, frequency of occurrence in successful kNN models only using only MolConnZ descriptors; Ratio_MZ, ratio of occurrence in successful QSAR models using only MolConnZ descriptors; Freq_MZHTS, frequency of occurrence in successful kNN models using MolConnZ and HTS descriptors; Ratio_MZHTS, ratio of occurrence in successful QSAR models using MolConnZ and HTS descriptors.