| Literature DB >> 25326818 |
Diana M Hendrickx1, Rebecca R Boyles, Jos C S Kleinjans, Allen Dearry.
Abstract
A joint US-EU workshop on enhancing data sharing and exchange in toxicogenomics was held at the National Institute for Environmental Health Sciences. Currently, efficient reuse of data is hampered by problems related to public data availability, data quality, database interoperability (the ability to exchange information), standardization and sustainability. At the workshop, experts from universities and research institutes presented databases, studies, organizations and tools that attempt to deal with these problems. Furthermore, a case study showing that combining toxicogenomics data from multiple resources leads to more accurate predictions in risk assessment was presented. All participants agreed that there is a need for a web portal describing the diverse, heterogeneous data resources relevant for toxicogenomics research. Furthermore, there was agreement that linking more data resources would improve toxicogenomics data analysis. To outline a roadmap to enhance interoperability between data resources, the participants recommend collecting user stories from the toxicogenomics research community on barriers in data sharing and exchange currently hampering answering to certain research questions. These user stories may guide the prioritization of steps to be taken for enhancing integration of toxicogenomics databases.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25326818 PMCID: PMC4247478 DOI: 10.1007/s00204-014-1387-3
Source DB: PubMed Journal: Arch Toxicol ISSN: 0340-5761 Impact factor: 5.153
Summary of the databases, studies, organizations and tools presented at the workshop and how they deal with problems regarding communication, quality of the data, sustainability and education
| Database/study/organization/tool | Deals with | How? | |||
|---|---|---|---|---|---|
| Communication | Quality | Sustainability | Training and support | ||
| RDA | X | X | Working and interest groups Forum | ||
| CEBS | X | X | X | Unified format (SIFT) CEBS data dictionary Curation of data Possibility to deposit new data | |
| diXa | X | X | X | X | InChI/Keys for chemicals Unified format for metadata (ISA-TAB) Quality control pipeline (Genedata) Possibility to deposit new data Training and workshops |
| CTD | X | X | X | Use of ontologies CTD’s merged disease vocabulary Developing exposome ontology Manually curated Updated regularly | |
| LINCS | X | Reproducibility between replicates checked | |||
| TG-GATEs | X | X | Same experimental design for all experiments Data checked on reproducibility between replicates | ||
| DrugMatrix & ToxFX | X | X | Standardized experimental protocol ToxFX provides tools for quality control | ||
| ChEMBL | X | X | Data in a unified format Regularly updated | ||
| UniChem | X | Unified chemical identifier | |||
| MAQC & SEQC | X | X | Data deposited in public resources that used standardized formats, e.g. CEBS Assess reliability, repeatability and reproducibility of microarrays and NGS data Comparing different quality control pipelines currently in use for NGS data | ||
| WikiLIMS | X | Can handle many different data types/formats Easy to update | |||
Fig. 1Links between the different data sources presented at the workshops. Dashed lines indicate ongoing efforts