| Literature DB >> 26863516 |
Hao Zhu1, Mounir Bouhifd2, Elizabeth Donley3, Laura Egnash3, Nicole Kleinstreuer4, E Dinant Kroese5, Zhichao Liu6, Thomas Luechtefeld2, Jessica Palmer3, David Pamies2, Jie Shen7, Volker Strauss8, Shengde Wu9, Thomas Hartung2,10.
Abstract
Read-across, i.e. filling toxicological data gaps by relating to similar chemicals, for which test data are available, is usually done based on chemical similarity. Besides structure and physico-chemical properties, however, biological similarity based on biological data adds extra strength to this process. In the context of developing Good Read-Across Practice guidance, a number of case studies were evaluated to demonstrate the use of biological data to enrich read-across. In the simplest case, chemically similar substances also show similar test results in relevant in vitro assays. This is a well-established method for the read-across of e.g. genotoxicity assays. Larger datasets of biological and toxicological properties of hundreds and thousands of substances become increasingly available enabling big data approaches in read-across studies. Several case studies using various big data sources are described in this paper. An example is given for the US EPA's ToxCast dataset allowing read-across for high quality uterotrophic assays for estrogenic endocrine disruption. Similarly, an example for REACH registration data enhancing read-across for acute toxicity studies is given. A different approach is taken using omics data to establish biological similarity: Examples are given for stem cell models in vitro and short-term repeated dose studies in rats in vivo to support read-across and category formation. These preliminary biological data-driven read-across studies highlight the road to the new generation of read-across approaches that can be applied in chemical safety assessment.Entities:
Keywords: big data; biological similarity; read-across; safety assessment
Mesh:
Substances:
Year: 2016 PMID: 26863516 PMCID: PMC4834201 DOI: 10.14573/altex.1601252
Source DB: PubMed Journal: ALTEX ISSN: 1868-596X Impact factor: 6.043
General information and properties of the analogs
| Property | Flusilazole | Hexaconazole | Propiconazole | Triadimefon | Myclobutanil |
|---|---|---|---|---|---|
| Fungicide/antibacterial drug | Fungicide | Fungicide | Fungicide | Fungicide | |
|
|
|
|
|
| |
| 85509-19-9 | 79983-71-4 | 60207-90-1 | 43121-43-3 | 88671-89-0 | |
| 315.3927 | 313.0749 | 341.0698 | 293.0931 | 288.1142 | |
| Solid | Solid | Liquid | Solid | Solid | |
| 43 | 1.29 | 100 | 71.5 | 142 | |
| 4.68 | 3.7 | 3.7 | 2.8 | 2.9 |
Summary of test article ADME and toxicity data
| Flusilazole [1] | Hexaconazole [2] | Propiconazole [3] | Triadimefon [4] | Myclobutanil [5] | |
|---|---|---|---|---|---|
| NA | NA | 24–31 hr | ~4 hr | Biphasic Rapid Phase: 5.25 | |
| Rapid & Extensive (up to 80%) | NA | >80% in 48h | 28% in females, 67% in males as urinary excretion | Rapidly absorbed (> 89%) | |
| Widely | Widely distributed; highest concentrations in liver, intestinal tract and adrenal cortex | Widely distributed; highest concentrations in the liver and kidney | Widely distributed in kidneys and liver | Widely distributed | |
| Extensive | Extensive | Extensive | Rapid & Extensive | Rapid & Extensive | |
| NA | NA | NA | CYP2C and CYP3A | CYP2C and CYP3A | |
| 96 hr | 72 hr | 24 hr | 96 hr | 96 hrs | |
| urine | 43% urine/53% feces (m) | Even distribution in urine & feces | Feces (m) | Even distribution in urine & feces | |
| 674 | 2189 | 1517 | 363–1855 | 1600 | |
| Negative | Negative | Negative | Negative | Negative | |
| Liver and Urinary bladder | Liver | Body weight, liver, erythrocytes | Liver | Liver | |
| 9 | 2.5 | 76 | 150 | 51.5 | |
| Skeletal anomalies, malformations at higher doses | Fetal Toxicity, skeletal variations | Skeletal variations | Skeletal variations | Fetal toxicity/increased number of early resorptions and lower fetal weights | |
| 2 | 2.5 | 30 | 30 | 93.8 | |
| 10 | 25 | 90 | 10 | 93.8 | |
Note: NA: Data not available. In vivo data summarized from rat studies.
Excretion is greater than or equal to 90% of radiolabel.
Developmental toxicity includes embryo/fetal toxicity and teratogenicity.
[1] Adcock and Tasheva, 2009
[2] Tonkelaar and van Koten-Vermeulen, 1991
[3] Dewhurst and Dellarco, 2006
[4] Zarn et al., 2006
[5] Yoshida and McGregor, 2015
Public databases of toxicity data
| Name | General Information | Data description |
|---|---|---|
| PubChem | Over 50 million compounds, over 700,000 bioassays, over 13 billion data points | Toxicity, genomics and literature data |
| ChEMBL | Over 600,000 compounds, 3.3 million bioassay readout data | Literature toxicity data |
| ACToR | The toxicity results from 100 various data resources | Both in vitro and in vivo toxicity data |
| ToxNET | Over 50,000 environmental compounds from 16 different resources | Both in vitro and in vivo toxicity data |
| SEURAT | Over 5,500 cosmetic-type compounds in the current COSMOS database web portal | Animal toxicity data |
| REACH | 816,048 studies for 9,800 substances and 3,600 study types | Data submitted in EU chemical legislation, made machine-readable by |
| CTD | Over 13,000 compounds, over 32,000 genes, over 6000 diseases | Compound, gene and disease relationships |
| CEBS | About 10,000 toxicity bioassays from various sources | Gene expression data |
| DrugMatrix | About 600 drug molecules and 10,000 genes | Gene expression data |
| Cmap | About 1,300 compounds and 7,000 genes | Gene expression data |
Fig. 1The response space of 962 ToxCast compounds represented by the data obtained from 193 PubChem bioassays
The red dots represent active responses; the blue dots represent inactive responses, and the yellow dots represent no available testing data or inconclusive results.
Three REACH compounds (the first compound) with their chemical nearest neighbor (the second compound) and biological nearest neighbor (the third compound)
| Compounds | LD50 (mg/kg) | Bioprofiles | |
|---|---|---|---|
| 1 |
| 181 |
|
|
| 730 | N/A | |
|
| 320 |
| |
| 2 |
| 949 |
|
|
| 2,100 | N/A | |
|
| 520 |
| |
| 3 |
| 206 |
|
|
| 6,490 |
| |
|
| 1,041 |
|
The bioprofile consists of 18 PubChem assays (PubChem assay AIDs 427, 542, 544, 545, 546, 921, 963, 964, 966, 968, 973, 974, 993, 504832, 651802, 686979, 743041, 743086) which were selected for calculation since they contain the largest number of active responses per assay in REACH compounds. The red color indicates active response, blue color indicates inactive response and white color indicates no data available.
N/A indicates there is no data available for this compound within these assays.
Fig. 2Random Forest Proximity Matrix for 1,056 ToxCast Phase I/II chemicals
Chemicals are clustered based on their similarity across all 800 ToxCast in vitro assay targets. Chemical ordering is the same on each axis, and the unity correlation is shown along the diagonal. Darker red coloring indicates a higher degree of similarity.