| Literature DB >> 29123642 |
Stéphanie Boué1, Thomas Exner2, Samik Ghosh3, Vincenzo Belcastro1, Joh Dokler2, David Page1, Akash Boda1, Filipe Bonjour1, Barry Hardy2, Patrick Vanscheeuwijck1, Julia Hoeng1, Manuel Peitsch1.
Abstract
The US FDA defines modified risk tobacco products (MRTPs) as products that aim to reduce harm or the risk of tobacco-related disease associated with commercially marketed tobacco products. Establishing a product's potential as an MRTP requires scientific substantiation including toxicity studies and measures of disease risk relative to those of cigarette smoking. Best practices encourage verification of the data from such studies through sharing and open standards. Building on the experience gained from the OpenTox project, a proof-of-concept database and website ( INTERVALS) has been developed to share results from both in vivo inhalation studies and in vitro studies conducted by Philip Morris International R&D to assess candidate MRTPs. As datasets are often generated by diverse methods and standards, they need to be traceable, curated, and the methods used well described so that knowledge can be gained using data science principles and tools. The data-management framework described here accounts for the latest standards of data sharing and research reproducibility. Curated data and methods descriptions have been prepared in ISA-Tab format and stored in a database accessible via a search portal on the INTERVALS website. The portal allows users to browse the data by study or mechanism (e.g., inflammation, oxidative stress) and obtain information relevant to study design, methods, and the most important results. Given the successful development of the initial infrastructure, the goal is to grow this initiative and establish a public repository for 21 st-century preclinical systems toxicology MRTP assessment data and results that supports open data principles.Entities:
Keywords: Data sharing; Database; Harm reduction; Open data; Systems toxicology; Website
Year: 2017 PMID: 29123642 PMCID: PMC5657032 DOI: 10.12688/f1000research.10493.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Systems Toxicology.
To understand the effect and mode of action of chemicals or drugs on Human, different studies can be conducted. Epidemiology will provide the final evidence but requires long periods of observation. Phenotypic observations may be obtained at the individual level from biopsies or tissue collection. Animal studies can provide surrogate information in a controlled setup and allow the collection of various tissues and fluids. Alternatively, new in vitro methods are developed to provide information on toxicity and pathways of toxicity. It is possible to obtain organ-tissue level information from macroscopic observation of tissues, but also to understand cellular level or even molecular level by mining data from –omics profilings using modeled knowledge and dedicated algorithms.
Sources of information on toxicology: database repositories for predictive systems toxicology investigations and risk assessment.
| Toxicology Databases | ||||
|---|---|---|---|---|
| Database Portals/Projects | ||||
| Inh | Portal/Project
| Description | Data type | URL |
| Safety
| • Cluster of 7 projects and portal to
| Access regulated
|
| |
| BioSharing | • Portal of curated web-based, user-
| Several databases |
| |
| EU-ToxRisk | • Integrated European program aiming
| Website under
|
| |
| Data
| • Collection of European toxicogenomics
| Individual cigarette
|
| |
| TOXNET | • Collection of several databases and
| Mostly publication
|
| |
| ToxCast™/Tox21 | • Tox21 is a United States federal
| Data available via
|
| |
| Inh | Individual
| |||
| PubChem | • Provides information on small-molecule
| Publications and
|
| |
| ChEBI | • Dictionary of identifiable, distinct
| Individual CS
|
| |
| Comparative
| • Collection of interlinked public databases
| Independent studies
|
| |
| Aggregated
| • Publicly available online resource for
| Publications, reports/
|
| |
| Chemical
| • Toxicogenomics database with a
| Publications,
|
| |
| CompTox
| • Access to >740,000 chemical substances
|
| ||
| Online Chemical
| • Modeling tool for development of
|
| ||
| ToxBank | • Subsidiary of the SEURAT-1 project that
| Access is regulated
|
| |
The table highlights sources of information on in vivo chemical inhalation and individual in vitro chemical toxicity. The type of data available and, where known, user accessibility (e.g., open source vs licensing) have also been highlighted. While a number of databases and portals are still active, a few of them are no longer maintained. Green color in the “Inh” column means that the resource contains inhalation data.
Figure 2. Concepts of infrastructure and data sharing.
Ideally, as experiments are performed, protocols and metadata are recorded for each of the data entries and curated in ISA-Tab files. They all are imported into a common database that supports defined ontologies. Raw data can be exported from this database and processed with different scripts and/or software to generate analyses results, some of which are usually shared in a publication. All of the results can be saved into the database and the data and results can be accessed through an API to be browsed on and downloaded from the website named INTERVALS. The website also keeps track of publications associated with the studies.
Figure 3. Schematic of the hierarchical structure of interconnected ISA-Tab instances.
The schema depicts the theoretical splitting strategy of data and metadata from two different studies into ISA-Tab files. The highest level will describe all subjects or samples analyzed in a study. Then, for each endpoint, a file describes the data production step, and links out to a raw data file. Another file will describe data processing steps and link out to processed data files. It is also possible that the two steps are combined into a single file. Eventually, analysis and data modelling could consider data from multiple studies.
Figure 4. Study design and organization of the ISA-Tab instances for the C57BL6-pMRTP-SW study.
A. Switching study concept and study design and setup. B. ISA-Tab splitting strategy of endpoints. The data production (DP) and processing (PR) instances describe the experimental setup and processing steps, from raw to processed data. Transcriptomics processing is separated by tissue, resulting in individual PR ISA-Tab instances. C. A more complicated lipidomics scheme was necessary because the experiment was performed independently for different groups of lipids, and hence separate DP instances were used for each mass spectrometry platform/set of methods. Processing was then performed per tissue, resulting in separate PR instances for blood, right lung, and liver. For simplicity, data files are not depicted here.
Figure 5. Organotypic dataset – Study design, setup, and ISA-file splitting.
A. Study design and setup. B. Measurement type per insert. For each condition (test item type and concentration), a set of up to seven inserts was used to measure endpoints at different post-exposure times. Longitudinal measurements were conducted for CBF and CYP1A1/1B1 activity. For other endpoints, a new insert was used for each post-exposure time point. C. ISA-Tab splitting strategy of endpoints data production and processing across ISA-Tab files. Raw and Processed data files are illustrated with green and orange backgrounds, respectively.
Figure 6. Data repository overview and links to website and tools through an API.
Figure 7. Faceted search user interface.
Users can filter the datasets by Organism, Study, Mechanism, Tissue/Organ, and Endpoint Type. A toggle switch provides a choice between downloading raw and processed data.
Figure 8. Schematic of an AOP.
The schematic includes biological assays to test the molecular initiating event and specific key events on different levels, which could be combined into a weight of evidence supporting risk assessment or integrated testing strategies. The in vivo tests (orange) should be increasingly replaced by a combination of in vitro assays and in silico tools (green) to reduce animal testing according to the 3Rs principle [26].
Figure 9. Concepts of an intelligent, knowledge mining and visualization platform for systems toxicology.