| Literature DB >> 25195622 |
Hao Zhu1, Jun Zhang, Marlene T Kim, Abena Boison, Alexander Sedykh, Kimberlee Moran.
Abstract
High-throughput screening (HTS) assays that measure the in vitro toxicity of environmental compounds have been widely applied as an alternative to in vivo animal tests of chemical toxicity. Current HTS studies provide the community with rich toxicology information that has the potential to be integrated into toxicity research. The available in vitro toxicity data is updated daily in structured formats (e.g., deposited into PubChem and other data-sharing web portals) or in an unstructured way (papers, laboratory reports, toxicity Web site updates, etc.). The information derived from the current toxicity data is so large and complex that it becomes difficult to process using available database management tools or traditional data processing applications. For this reason, it is necessary to develop a big data approach when conducting modern chemical toxicity research. In vitro data for a compound, obtained from meaningful bioassays, can be viewed as a response profile that gives detailed information about the compound's ability to affect relevant biological proteins/receptors. This information is critical for the evaluation of complex bioactivities (e.g., animal toxicities) and grows rapidly as big data in toxicology communities. This review focuses mainly on the existing structured in vitro data (e.g., PubChem data sets) as response profiles for compounds of environmental interest (e.g., potential human/animal toxicants). Potential modeling and mining tools to use the current big data pool in chemical toxicity research are also described.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25195622 PMCID: PMC4203392 DOI: 10.1021/tx500145h
Source DB: PubMed Journal: Chem Res Toxicol ISSN: 0893-228X Impact factor: 3.739
Figure 1Construction of big data for chemical toxicity research.
Available Public Toxicity Data Resources
| name | general information | data description |
|---|---|---|
| PubChem[ | Around 47 million compounds, over 700 000 bioassays, over 13 billion data points | Toxicity, pharmaceutical, genomics, and literature data |
| ChEMBL[ | Over 600 000 compounds, 3.3 million bioassay readout data | Literature data |
| ACToR[ | The toxicity results from 100 various data resources | Both |
| ToxNET[ | Over 50 000 environmental compounds from 16 different resources | Both |
| SEURAT[ | Over 5500 cosmetic-type compounds in the current COSMOS database web portal | Animal toxicity data |
| CTD[ | Over 13 000 compounds, over 32 000 genes, over 6000 diseases | Compound, gene, and disease relationships |
| CEBS[ | About 10 000 toxicity bioassays from various sources | Gene expression data |
| DrugMatrix[ | About 600 drug molecules and 10 000 genes | Gene expression data |
| Cmap[ | About 1300 compounds and 7000 genes | Gene expression data |
Figure 2Increase of compounds recorded in PubChem within 5 years (from September 2008 to September 2013).
Twenty Human Toxicants with Their Relevant PubChem Bioassay Responses
| chemicals | CAS | no. of active responses | no. of inactive responses |
|---|---|---|---|
| Chlordecone | 143-50-0 | 328 | 539 |
| Toxaphene | 8001-35-2 | 294 | 112 |
| Hexachlorocyclopentadiene | 77-47-4 | 208 | 262 |
| Dichlorvos | 62-73-7 | 181 | 633 |
| Pentachlorophenol | 87-86-5 | 95 | 690 |
| Heptachlor | 76-44-8 | 85 | 624 |
| DDT, p,p′- | 50-29-3 | 76 | 386 |
| DDD, p,p′- | 72-54-8 | 70 | 186 |
| Endosulfan | 115-29-7 | 65 | 259 |
| Naphthalene | 91-20-3 | 61 | 890 |
| DDD, o,p′- | 53-19-0 | 61 | 964 |
| 1,4-Dichlorobenzene | 106-46-7 | 57 | 362 |
| 4,6-Dinitro- | 534-52-1 | 57 | 213 |
| Phenol | 108-95-2 | 53 | 518 |
| Chlorpyrifos | 2921-88-2 | 48 | 739 |
| Methoxychlor | 72-43-5 | 47 | 710 |
| 2,4-Dinitrophenol | 51-28-5 | 46 | 672 |
| Tetrachlorophenol | 25167-83-3 | 45 | 515 |
| Benzo(a)pyrene | 50-32-8 | 39 | 358 |
| 4,4′-Methylenebis(2-chloroaniline) | 101-14-4 | 32 | 431 |
Figure 3Response spaces of different ToxCast compound categories represented by the data obtained from 193 PubChem bioassays: (a) 171 consumer use chemicals (not including pharmaceuticals or pesticides), (b) 470 pesticides, (c) 245 pharmaceuticals, and (d) 34 phthalate plasticizers and alternatives.
Figure 4A potential in vitro–in vivo relationship in toxicology studies.