| Literature DB >> 29847084 |
Jessica A Wignall1, Eugene Muratov2, Alexander Sedykh2, Kathryn Z Guyton3, Alexander Tropsha2, Ivan Rusyn4, Weihsueh A Chiu4.
Abstract
BACKGROUND: Human health assessments synthesize human, animal, and mechanistic data to produce toxicity values that are key inputs to risk-based decision making. Traditional assessments are data-, time-, and resource-intensive, and they cannot be developed for most environmental chemicals owing to a lack of appropriate data.Entities:
Mesh:
Year: 2018 PMID: 29847084 PMCID: PMC6071978 DOI: 10.1289/EHP2998
Source DB: PubMed Journal: Environ Health Perspect ISSN: 0091-6765 Impact factor: 9.031
List of the established toxicity values and chemical descriptors that were used for modeling in this study.
| Toxicity value name (abbreviation) [units] | Mean (90% CI) [min–max] | Number of compounds | Global domain of applicability coverage |
|---|---|---|---|
| Oral exposure noncancer | |||
| Reference dose (RfD) [ | 7.3 (5.4, 9.6) [3.6–14.5] | 671 | 86.1% |
| RfD No observed adverse effect level (RfD NOAEL) [ | 4.8 (2.8, 6.9) [1.8–8.4] | 487 | 90.4% |
| RfD Benchmark dose (RfD BMD) [ | 4.0 (1.8, 6.1) [1.1–6.8] | 137 | 81.8% |
| RfD Benchmark dose lower limit (RfD BMDL) [ | 4.4 (2.1, 6.5) [1.3–7.9] | 137 | 81.8% |
| Oral exposure cancer | |||
| Oral slope factor (OSF) [ | 5.0 (2.8, 7.5) [1.5–10.7] | 302 | 86.7% |
| Cancer potency value (CPV) [ | 5.2 (3.0, 7.6) [2.5–10.7] | 225 | 81.6% |
| Inhalation exposure (noncancer and cancer) | |||
| Reference concentration (RfC) [ | 6.8 (3.9, 9.4) [3.1–12.9] | 152 | 61.7% |
| Inhalation unit risk (IUR) [ | 4.5 (2.2, 6.9) [ | 150 | 71.8% |
Note: CI, confidence interval.
Depending on the toxicity value, the number of compounds with an established number varied and was determined as described in “Methods.” See Table S1 for the chemical names, toxicity values, and their source (i.e., the federal or state agency that derived each value).
Percent of compounds from the Collaborative Estrogen Receptor Activity Prediction Project (CERAPP) chemical database of 32,464 compounds (Mansouri et al. 2016) within the global domain of applicability for the compounds with the corresponding toxicity values, based on Chemistry Development Kit (CDK) molecular descriptors and a Z-score cutoff . For a more restrictive Z-score cutoff , the percentages are 70%, 78%, 65%, 65%, 69%, 67%, 43%, and 54%.
Figure 1.Chemical space coverage of 886 Conditional toxicity value (CTV) chemicals compared to 32,464 structures in the Collaborative Estrogen Receptor Activity Prediction Project (CERAPP) “prediction” data set, based on chemical descriptors generated from the Chemistry Development Kit (CDK). (A) Three-dimensional (3-D) plot of coverage comparison based on first three principal components (PC); (B–E) Coverage comparisons based on easily interpretable descriptors octanol:water partition coefficient (ALogP), molecular weight (MW), and topological polar surface area (TopoPSA). Panel (B) is a 3-D plot, and panels (C–E) are histograms comparing the distribution of compounds for each descriptor. The 3-D plots were created in R (version 3.3.2; R Core Team).
Summary of the modeling outcomes.
| Toxicity value | Consensus model | Model prediction absolute error | Mean value prediction absolute error | |
|---|---|---|---|---|
| RfD | 0.41 | 0.77 (0.06, 1.80) | 0.96 (0.08, 2.53) | |
| RfD NOAEL | 0.45 | 0.70 (0.06, 1.82) | 0.93 (0.05, 2.37) | |
| RfD BMD | 0.31 | 0.88 (0.13, 2.08) | 1.08 (0.09, 2.34) | 0.0098 |
| RfD BMDL | 0.28 | 0.93 (0.07, 2.19) | 1.13 (0.13, 2.44) | 0.0098 |
| OSF | 0.33 | 0.92 (0.07, 2.45) | 1.19 (0.12, 2.60) | |
| CPV | 0.25 | 0.97 (0.07, 2.53) | 1.19 (0.15, 2.65) | 0.0008 |
| RfC | 0.42 | 1.11 (0.12, 2.71) | 1.49 (0.20, 3.54) | 0.0015 |
| IUR | 0.42 | 0.93 (0.07, 2.69) | 1.29 (0.06, 2.85) |
Note: BMD, benchmark dose; BMDL, benchmark dose lower confidence limit; CPV, cancer potency value; IUR, inhalation unit risk; NOAEL, no observed adverse effect level; OSF, oral slope factor; QSAR, quantitative structure activity relationship; RfC = reference concentration; RfD = reference dose.
is the fraction of the variance explained by each model, estimated by 5-fold cross-validation. In all cases, the accuracy of the model built with original data was significantly higher than that of models built using y-randomized data sets ().
The mean and confidence interval (90%) of absolute error for the external prediction of each compound’s toxicity value against the QSAR model prediction under 5-fold cross-validation.
The mean and confidence interval (90%) of absolute error for the external prediction of each compound’s toxicity value against the mean value of the compounds with that particular toxicity value.
Kolmogorov-Smirnov statistics for the difference between model and “mean value” prediction absolute errors.
Figure 5.Example of mechanistic interpretation of QSAR model for the RfD. (A) Ranks of the molecular descriptors by their frequency of use in the model. The top twenty are denoted by dashed lines. (B) The top twenty descriptors shown separately, with the descriptor names. (C) comparison of the descriptor values for the top two descriptors between the highest and lowest potency RfDs. Note: QSAR, quantitative structure activity relationship; RfD, reference dose. Definitions of each molecular descriptor can be found online: https://cdk.github.io/cdk/1.5/docs/api/org/openscience/cdk/qsar/descriptors/molecular/package-summary.html.
Figure 6.Screen shots illustrating use of the Conditional Toxicity Value (CTV) Predictor web portal (http://toxvalue.org). Steps include searching for a chemical, verifying its identity, selecting toxicity values of interest, running quantitative structure–activity relationship (QSAR) models, and exporting predictions to a comma-separated-values file. The entire process can be completed in approximately 60 s. CTV is a public web portal maintained by two of the authors (IR and WAC).
Summary of conditional toxicity value versus high-throughput screening toxicity value comparisons.
| “Gold standard” toxicity value | Linear model slope (log10-transformed) ( | Median absolute deviation | Adjusted | |
|---|---|---|---|---|
| NOAEL ( | CTV NOAEL | 1.25 ( | 0.66 | 0.47 |
| 0.52 (0.02) | 1.11 | 0.12 | ||
| BMDL ( | CTV BMDL | 1.17 ( | 0.58 | 0.59 |
| 0.35 (0.20) | 1.06 | 0.06 | ||
| RfD ( | CTV RfD | 0.99 ( | 0.72 | 0.36 |
| 0.32 (0.02) | 2.68 | 0.09 |
Note: BMDL, benchmark dose lower confidence limit; CTV, conditional toxicity value; , high-throughput screening–based oral equivalent dose lower 5% confidence limit; NOAEL, no observed adverse effect level; RfD, reference dose.
The median of absolute residuals between predicted and “gold standard” toxicity values.
Figure 7.Comparison of conditional toxicity value (CTV)-based (A, C, E) or high-throughput screening (HTS) assay-based (B, D, F) toxicity value predictions with “gold standard” regulatory toxicity values. In each panel, the x-axis is the toxicity value predicted from CTV (left panels) or based on HTS assays (right panels), which is compared with regulatory toxicity values on the y-axis (all values are in units of ). Comparisons are made for regulatory NOAELs (panels A and B), BMDLs (panels C and D), or RfDs (panels E and F). In all cases, the predictions from CTV are based on cross-validation (panels A, C, and E). Panels A–E also include lines indicating equality and a factor of 10 greater or less than equality (solid and dotted lines); for panel F, the line is offset by a factor of to account for the fact that the HTS-based oral equivalent dose lower 5% confidence limit () is a point of departure and is not meant to be equivalent to an RfD. The offset is approximately equivalent to treating as a surrogate for the RfD. The value of this offset was determined by the intercept of the linear regression. Each panel also includes linear regression lines (dashed lines), along with the number of compounds (n) and the adjusted . Note: BMDL, benchmark dose lower confidence limit; HTS OED05, high-throughput screening–based oral equivalent dose lower 5% confidence limit; NOAEL, no observed adverse effect level; RfD, reference dose.