| Literature DB >> 26863198 |
Thomas Luechtefeld1, Alexandra Maertens1, Daniel P Russo2, Costanza Rovida3, Hao Zhu2,4, Thomas Hartung1,3.
Abstract
The European Chemicals Agency, ECHA, made available a total of 13,832 oral toxicity studies for 8,568 substances up to December 2014. 75% of studies were from the retired OECD Test Guideline 401 (11% TG 420, 11% TG 423 and 1.5% TG 425). Concordance across guidelines, evaluated by comparing LD50 values ≥ 2000 or < 2000 mg/kg body weight from chemicals tested multiple times between different guidelines, was at least 75% and for their own repetition more than 90%. In 2009, Bulgheroni et al. created a simple model for predicting acute oral toxicity using no observed adverse effect levels (NOAEL) from 28-day repeated dose toxicity studies in rats. This was reproduced here for 1,625 substances. In 2014, Taylor et al. suggested no added value of the 90-day repeated dose oral toxicity test given the availability of a low 28-day study with some constraints. We confirm that the 28-day NOAEL is predictive (albeit imperfectly) of 90-day NOAELs, however, the suggested constraints did not affect predictivity. 1,059 substances with acute oral toxicity data (268 positives, 791 negatives, all Klimisch score 1) were used for modeling: The Chemical Development Kit was used to generate 27 molecular descriptors and a similarity-informed multilayer perceptron showing 71% sensitivity and 72% specificity. Additionally, the k-nearest neighbors (KNN) algorithm indicated that similarity-based approaches alone may be poor predictors of acute oral toxicity, but can be used to inform the multilayer perceptron model, where this was the feature with highest information value.Entities:
Keywords: LD50; acute toxicity; chemical safety; computational toxicology; regulatory toxicology
Mesh:
Substances:
Year: 2016 PMID: 26863198 PMCID: PMC5461469 DOI: 10.14573/altex.1510054
Source DB: PubMed Journal: ALTEX ISSN: 1868-596X Impact factor: 6.043
Fig. 1Number of substances with studies for each of the acute oral toxicity OECD guidelines
The Limit Test (OECD TG 401, now deleted) was performed on 8,482 substances, the Fixed Dose Procedure (OECD TG 420) on 1,140 substances, the Chemical Classification Test (OECD TG 423) on 1,081 substances, and Up and Down Dosing (OECD TG 425) on 147 substances.
Fig. 2Histograms for use of OECD TG 401, 420, 423, and 425 in registrations of acute oral toxicant OECD TG 401, 423, 420, and 425
Y-axes are not equivalent. The x-axis represents LD50 for each OECD guideline. Density plot with overlapping densities between 0 and 5,000 mg/kg dosage. Notice the LD50 clustering around 2,000 and 5,000 mg/kg dosage; this is due to dosing schemes.
Five oral toxicity hazards extracted for 6,027 substances from ECHA dossiers “identification and labelling” information
H300 has 36 failed extractions.
| Description | Hazard | Positive | Negative | Data lacking | Inconclusive |
|---|---|---|---|---|---|
| Fatal if swallowed | H300 | 33 | 5,709 | 237 | 12 |
| Toxic if swallowed | H301 | 225 | 5,518 | 272 | 12 |
| Harmful if swallowed | H302 | 1,072 | 4 677 | 266 | 12 |
| May be harmful if swallowed | H303 | 23 | 5,720 | 272 | 12 |
| May be fatal if swallowed and enters airways | H304 | 453 | 2,913 | 2,626 | 35 |
| May be harmful if swallowed and enters airways | H305 | 3 | 3,316 | 2,628 | 35 |
Negative refers to the ECHA “conclusive but not sufficient for classification” category.
Percent agreement of OECD test guidelines for acute oral toxicity tests averaged over substances
Number of substances with both tests in parentheses. Calculated by finding substances having studies with both guidelines where guidelines agree (defined as both having toxicity ≥ 2,000 or < 2,000 mg/kg b.w.) and dividing by total number of substances tested with both guidelines.
| OECD 401 | OECD 420 | OECD 423 | OECD 425 | |
|---|---|---|---|---|
| 93% (8,541) | 90% (1,966) | 74% (1,303) | 83% (127) | |
| 92% (1,521) | 81% (400) | 92% (25) | ||
| 90% (656) | 84% (44) | |||
| 94% (76) |
Evaluation of Bulgheroni et al. testing strategy on extracted ECHA data
White cells only count experimental key in vivo oral toxicity studies. Grey cells count all oral toxicity studies (including read-across, and non-key studies).
| NOAEL (mg/kg bw) from 28d study | LD50 | (mg/kg bw) | Total |
|---|---|---|---|
| < 2,000 | ≥ 2,000 | ||
| ≤ 200 | 237 | 411 | 648 |
| > 200 | 49 | 928 | 977 |
| Total | 286 | 1,339 | 1,625 |
| ≤ 200 | 660 | 1,301 | 1,961 |
| > 200 | 183 | 3,126 | 3,309 |
| Total | 843 | 4,427 | 5,270 |
Fig 3Prevalence frequency histogram of NOAELs for 28 and 90 day subchronic oral toxicity tests
This figure was made by aggregating results of 2,400 90-day tests and 1,933 28-day tests.
Fig. 428- and 90-day acute oral toxicity matched NOAELs
Circles represent the averaged 28 day (x-axis) and 90 day (y-axis) NOAEL for a given chemical taken from ECHA 28-day and 90-day oral toxicity studies. Red circles represent 200 substances matching the constraints given by Taylor et al. (2014).
Cross-validated feature importance via information gain, ranker evaluation and wrapper evaluation
Summary descriptions of molecular descriptors used in perceptron training (as implemented by CDK).
| Name | Description | Ranker Evaluation | Ranker stDev | Wrapper Evaluation |
|---|---|---|---|---|
| KNN (K=5,T=0.7) | K nearest neighbors with k=5 and neighbor threshold = 0.7 | 0.07 | 0.004 | 100 |
| TPSA | Calculation of topological polar surface area based on fragment contributions | 0.023 | 0.002 | 84 |
| AcidicGroupCount | Returns the number of acidic groups | 0 | 0 | 19 |
| Apol | Sum of the atomic polarizabilities (including implicit hydrogens). Polarizabilities are taken from | 0.05 | 0.003 | 15 |
| HBondAcceptorCount | This descriptor calculates the number of hydrogen bond acceptors using a slightly simplified version of the PHACIR atom types | 0.042 | 0.003 | 15 |
| RuleOfFive | The number failures of Lipinski’s Rule of 5. | 0.042 | 0.003 | 6 |
| EccentricConnectivity | A topological descriptor combining distance and adjacency information | 0.044 | 0.003 | 5 |
| MannholdLogP | Prediction of logP based on the number of carbon and hetero atoms | 0.041 | 0.003 | 4 |
| AromaticAtomsCount | Number of aromatic atoms | 0.008 | 0.005 | 4 |
| Bpol | Sum of the absolute value of the difference between atomic polarizabilities of all bonded atoms in the molecule (including implicit hydrogens) with polarizabilities taken from | 0.067 | 0.008 | 3 |
| ZagrebIndex | The sum of the squares of atom degree over all heavy atoms i | 0.044 | 0.003 | 3 |
| FractionalPSA | Polar surface area expressed as a ratio to molecular size | 0.004 | 0.009 | 3 |
| LargestPiSystem | Number of atoms in the largest pi system | 0.052 | 0.003 | 2 |
| XLogP | Prediction of logP based on the atom-type method called XLogP | 0.045 | 0.004 | 2 |
| LargestChain | The number of atoms in the largest chain | 0.047 | 0.006 | 1 |
| HybridizationRatio | Reports the fraction of sp3 carbons to sp2 carbons | 0.041 | 0.004 | 1 |
| AromaticBondsCount | Number of aromatic atoms | 0.067 | 0.005 | 0 |
| RotatableBondsCount | The number of rotatable bonds is given by the SMARTS specified by Daylight on SMARTS tutorial | 0.061 | 0.007 | 0 |
| AtomCount | Number of atoms | 0.054 | 0.007 | 0 |
| FragmentComplexity | C=abs(B^2-A^2+A)+H/100 where C=complexity; A=number of non-hydrogen atoms; B=number of bonds and H=number of heteroatoms | 0.054 | 0.004 | 0 |
| Weight | Molecular weight | 0.052 | 0.003 | 0 |
| BondCount | Number of bonds of a given bond order (single, double, triple) | 0.046 | 0.003 | 0 |
| VadjMaDescriptor | Vertex adjacency information (magnitude): 1 + log2 m where m is the number of heavy-heavy bonds. If m is zero, then zero is returned (Definition from MOE tutorial on-line) | 0.046 | 0.003 | 0 |
| LongestAliphaticChain | Number of atoms in the longest aliphatic chain | 0.045 | 0.008 | 0 |
| PetitjeanNumber | Molecular graph descriptor measuring graph eccentricity | 0.039 | 0.003 | 0 |
| FMF | Ratio of heavy atoms in the Murcko framework to the total number of heavy atoms in the molecule | 0.025 | 0.002 | 0 |
| HBondDonorCount | Number of hydrogen bond donors using a slightly simplified version of the PHACIR atom types | 0.02 | 0.004 | 0 |
| BasicGroupCount | Returns the number of basic groups | 0 | 0 | 0 |
Sensitivity, specificity and balanced accuracy (BAC) of classifiers trained and tested on all substances existing in a module (see Fig. 5)
Stratified 10-fold cross-validation was performed on the multilayer perceptron and indicates some overfitting of the decision tree.
| Classifier | Train/Test | Sensitivity | Specificity | BAC |
|---|---|---|---|---|
| KNN (K=5,T=0.7) | 1,059 substances, 268 positive 791 negative | 53.7% | 89.1% | 71.4% |
| Decision tree | 1,059 substances, 268 positive 791 negative | 68.7% | 71.0% | 69.9% |
| Multilayer perceptron | 1,059 substances, 268 positive 791 negative | 93.7% | 99.6% | 96.7% |
| Multilayer perceptron | stratified 10-fold cross validation | 70.7% | 71.6% | 71.2% |
indicates that the classifier balanced positive and negative substances prior to training by weighting positive substances 2.95x heavier than negative examples, thus creating a balanced weight between negative and positive examples.
Fig. 5Chemical similarity maps for substances with acute oral toxicity LD50 data that could be mapped from REACH to PubChem
The map contains 613 substances and is built from 3,122 substances mapped from REACH to PubChem and for which similarity and structure data could be determined from the chemistry development kit (Bolton et al., 2008; Steinbeck et al., 2003). Edges are shown between substances with similarity ≥ 0.7 as determined by their Tanimoto distance (Lourenço et al., 2004). The force layout algorithm is used to distribute substances (Fruchterman and Reingold, 1991).
A. Similarity map modules: Nine modules are created by maximization of the Q-metric, a measure of module coherence (Blondel et al., 2008). Chemical nodes are colored by their module identification.
B. Chemical similarity map colored by experimental LD: Dark pink = low LD50, white = 1,000 mg/kg b.w./day, dark green = 2,000 mg/kg b.w./day. Results based on average LD50 values. Clusters of low LD50 values can be seen in module 5 and 0 with some otherwise sporadic distribution.
C. Chemical similarity map colored by oral toxicant status: Pink substances denote LD50 < 2,000 mg/kg b.w. per day. Green substances denote LD50 ≥ 2,000 mg/kg b.w. per day (not classified).
D. KNN classifications for LD: Pink = predicted LD50 < 2,000 mg/kg b.w. per day. Green = predicted LD50 ≥ 2,000 mg/kg b.w. per day.
Substances are predicted as toxicant if the majority of the closest 5 neighbors are toxicants. A chemical is considered a neighbor if it has Tanimoto similarity > 70% (Lourenço et al., 2004).
E. Multilayer perceptron classifications for LD: Pink = predicted LD50 < 2,000 mg/kg b.w. per day. Green = predicted LD50 ≥ 2,000 mg/kg b.w. per day. Classifier built on 1,059 substances referenced by at least one acute oral study with Klimisch score = 1. Classifications made by the multilayer perceptron appear to be well clustered; this indicates that chemical descriptors are influenced by substructure presence.
Modular sensitivity and specificity of KNN
These measurements give an idea of the domains of applicability for KNN and can be compared to the same domains in Table 8. KNN was not trained or tested in cross-validation but instead using the entire dataset for training and testing. Module 3 has no positive substances out of its 45 members making sensitivity not a number (NAN). BAC = balanced accuracy. FN = false negatives, TP = true positives, TN = true negatives, FP = false positives.
| Module | Sensitivity | Specificity | BAC | FN | TP | TN | FP | Total |
|---|---|---|---|---|---|---|---|---|
| 0 | 38.24% | 94.83% | 66.53% | 21 | 13 | 110 | 6 | 150 |
| 1 | 50.00% | 89.66% | 69.83% | 5 | 5 | 26 | 3 | 39 |
| 2 | 0.00% | 85.71% | 42.86% | 2 | 0 | 6 | 1 | 9 |
| 3 | NAN | 100.00% | 100.00% | 0 | 0 | 45 | 0 | 45 |
| 4 | 0.00% | 97.50% | 48.75% | 8 | 0 | 39 | 1 | 48 |
| 5 | 72.41% | 55.17% | 63.79% | 8 | 21 | 16 | 13 | 58 |
| 6 | 20.00% | 94.50% | 57.25% | 8 | 2 | 103 | 6 | 119 |
| 7 | 23.08% | 93.75% | 58.41% | 10 | 3 | 45 | 3 | 61 |
| 8 | 0.00% | 100.00% | 50.00% | 6 | 0 | 80 | 0 | 86 |
Modules sensitivity and specificity of multilayer perceptron trained in leave-one-out cross-validation
Regions where molecular descriptors fail to predict oral toxicity accurately may be a consequence of the arbitrary threshold picked, or may indicate more subtle chemical effects not described by descriptors. FN = false negatives, TP = true positives, TN = true negatives, FP = false positives.
| Module | Sensitivity | Specificity | BAC | FN | TP | TN | FP | Total |
|---|---|---|---|---|---|---|---|---|
| 0 | 70.59% | 69.83% | 70.21% | 10 | 24 | 81 | 35 | 150 |
| 1 | 60.00% | 82.76% | 71.38% | 4 | 6 | 24 | 5 | 39 |
| 2 | 50.00% | 85.71% | 67.86% | 1 | 1 | 6 | 1 | 9 |
| 3 | NAN | 100.00% | 100.00% | 0 | 0 | 45 | 0 | 45 |
| 4 | 37.50% | 67.50% | 52.50% | 5 | 3 | 27 | 13 | 48 |
| 5 | 79.31% | 41.38% | 60.34% | 6 | 23 | 12 | 17 | 58 |
| 6 | 20.00% | 96.33% | 58.17% | 8 | 2 | 105 | 4 | 119 |
| 7 | 61.54% | 45.83% | 53.69% | 5 | 8 | 22 | 26 | 61 |
| 8 | 0.00% | 93.75% | 46.88% | 6 | 0 | 75 | 5 | 86 |