| Literature DB >> 31244651 |
Gopal Pawar1, Judith C Madden1, David Ebbrell1, James W Firman1, Mark T D Cronin1.
Abstract
A plethora of databases exist online that can assist in in silico chemical or drug safety assessment. However, a systematic review and grouping of databases, based on purpose and information content, consolidated in a single source, has been lacking. To resolve this issue, this review provides a comprehensive listing of the key in silico data resources relevant to: chemical identity and properties, drug action, toxicology (including nano-material toxicity), exposure, omics, pathways, Absorption, Distribution, Metabolism and Elimination (ADME) properties, clinical trials, pharmacovigilance, patents-related databases, biological (genes, enzymes, proteins, other macromolecules etc.) databases, protein-protein interactions (PPIs), environmental exposure related, and finally databases relating to animal alternatives in support of 3Rs policies. More than nine hundred databases were identified and reviewed against criteria relating to accessibility, data coverage, interoperability or application programming interface (API), appropriate identifiers, types of in vitro, in vivo,-clinical or other data recorded and suitability for modelling, read-across, or similarity searching. This review also specifically addresses the need for solutions for mapping and integration of databases into a common platform for better translatability of preclinical data to clinical data.Entities:
Keywords: chemicals; databases; drugs; in silico; safety assessment
Year: 2019 PMID: 31244651 PMCID: PMC6580867 DOI: 10.3389/fphar.2019.00561
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Previous review articles for identification of databases relevant to chemistry and toxicology.
| Alexander-Dann et al., | Gene expression | 12 | Microarray software, database management systems |
| Ayvaz et al., | Potential DDI information resources | 14 | Clinical, natural language corpora, pharmacovigilance data sources |
| Benigni et al., | Chemical mutagenicity and carcinogenecity | 18 | QSAR, Cluster of toxicity databases, risk assessment |
| Bianco et al., | Genetic disease research databases | 18 | Sample sequence, gene expression and post-transcriptional regulation |
| Bower et al., | Toxicity databases | Toxicity data resources and format (ToxML) discussed | |
| Cha et al., | Drug repurposing databases | 29 | Drugs and disease (omics, genomics, transcriptomics, proteomics, epigenetic) databases, omics tools also available |
| Chen et al., | Drug-target interaction databases | 15 | Webserver databases and computational models included |
| Cheng et al., | Drug Target interaction databases | 28 | 3D structure, binding affinities Db, screening programs and data repositories, Curated drug-target interactions |
| Cronin, | Toxicology databases | 26 | Sources of chemical structures also described |
| Cronin, | Toxicology databases | 33 | Data for QSAR modelling purposes |
| Ekins and Williams, | ADME/Tox databases | 13 | Targeted data types required for ADME/Tox and PK databases |
| Ekins et al., | Systems biology and ADMET | 33 | HT techniques, systems biology modelling and ADMET modelling included |
| Ekins et al., | Tuberculosis (TB) databases | 13 | Computational databases, pathways, cheminformatics tools for TB |
| Fostel et al., | Toxicogenomics | 14 | Relevant Databases and Consortia Supporting Systems Toxicology Research |
| Fouretier et al., | Pharmacovigilance (PV) | 11 | North American PV databases not covered |
| Fotis et al., | Omic repositories | 48 | Omics and pathways, tools provided |
| González-Medina et al., | Chemical biology databases | 11 | Online servers and tools for mining chemical and target spaces |
| Hersey et al., | Chemical databases | 10 | Bioactivity, Patents, drugs and target, available compound and other |
| Ji et al., | Proteins associated with drug therapeutic effects, ADR and ADME | 44 | Targets related databases and their websites |
| Jonsdottir et al., | Prediction methods, cheminformatics DBs | 23 | General, screening compounds, medicinal agents, physicochemical and ADMET properties |
| Judson, | Toxicology databases | 15 | |
| Kiyosawa et al., | Microarray databases | 7 | Large scale toxicogenomics databases |
| Koutsoukas et al., | Bioactivity and target predictions | 20 | Bioactivity and target-based databases, WS for target prediction of small molecules |
| Katsila et al., | Drug target identification databases | 19 | Human metabolome, pathway analysis, chemogenomic data, drug-target, protein, disease specific target DB, pharmacogenomic, toxicogenomic, target-toxin, protein expression, therapeutic target |
| Loging et al., | Drug repurposing | 11 | Public resources |
| Luo et al., | DILI databases | 11 | Liver specific injury and broader drug databases |
| Madden, | Toxicity, reactivity, chemical property and structural data | 30 | Assessment of quality data |
| Madden et al., | PBPK and ADME Resources | ~100 | Resources to predict external exposure, physico-chemical properties, ADME properties, physiological/anatomical parameters and model structures for specific organs, PBPK modelling softwares, similar chemicals |
| Nicola et al., | Medicinal chemistry databases | 12 | Databases of binding and bioactivity data for small molecules |
| Opassi et al., | Chemical-Biology databases | 28 | Virtually accessible chemical spaces, biology databases |
| Oprea and Tropsha, | Target, chemical and bioactivity | 24 | Integration of the databases |
| Papadopoulos et al., | Omics databases on kidney disease | 18 | General omics and kidney specific databases |
| Peach et al., | Metabolism related content | 11 | Software for metabolism predictions |
| Polen et al., | Online drug databases | 14 | Drug databases for infectious disease therapies |
| Rana et al., | Receptor and binding databases | 26 | Websites for computational, GPCR specific and nuclear receptors |
| Rigden et al., | Molecular biology databases | 157 | Nucleic acids, genetic basis of cancer, patented drugs, their side effects, withdrawn drugs, and potential drug targets |
| Sato et al., | hERG inhibitors, cardiotoxicity | 4 | hERG inhibition by small molecules |
| Sim et al., | Pharmacogenetics | 7 | Pharmacogenomics, CYP, NAT, Transporters, UGT, ADME Dbs |
| Smalter Hall et al., | Chemical and biological databases | 20 | Protein interaction, pathways, drug discovery, mathematical models databases, data formats for proteomics and genomics and cheminformatics provided. |
| Toropov et al., | Drug toxicity databases | 27 | Software for QSAR analysis of toxic endpoints also given |
| Williams, | Chemical property databases | 15 | Publicly available databases |
| Wishart, | Drug metabolism research | 13 | Online databases and prediction software for drug metabolism |
| Wooden et al., | Big data analysis resources | 18 | Big data for gastro intestinal and liver diseases |
| Young, | Genetic toxicology web resources | 13 | EPA, FDA, US NLM toxicity databases discussed |
| Zou et al., | Biological databases for human research | >100 | DNA, RNA, Proteins, expressions, pathways, disease, ontology and literature-based databases listed |
| Zhang et al., | Pharmacogenomics | 8 | Web resources |
Considerations for characterising the databases.
| Accessibility (open access; registration; license required) |
| Interoperability (linkage via API or importable) |
| Acceptable ontology and units (or readily converted) |
| Appropriate identifiers used (e.g., InChI) |
| Relevance of endpoint (s) project: physico-chemical properties; ADME (including metabolite data); pharmacological activity; toxicity; clinical trial data; adverse events reports |
| Access to metadata |
| Information provided on study protocols/statistics |
| Data quality assessment and accuracy of information |
| Ease of use/navigation |
| Appropriate classification codes (e.g., therapeutic group classification) |
| Currency of information (historical; frequency of updates); size of resource (amount of data / level of detail) |
| Type of data recorded ( |
| Relevance to overall aim of any project (e.g., extrapolation from preclinical to clinical) |
| Experimental vs. predicted values |
| Insights into mechanisms of action/elicitation of molecular initiating event |
| Suitability for modelling, read-across, or similarity searching |
Complete listing of all databases identified in this study grouped according to content (URL links available on-line).
| EDETOX Db | ||||||
| Tox 21 | ||||||
| HIV Molecular Immu DB | ||||||
| Proteome Isoelectric Point | ||||||
| Omics DI | ||||||
| LifeMap Discovery®, Cells & Tiss | ||||||
Figure 1Chart showing the number of databases within each group. DI, Drug Information; CT, Clinical trials; PV, Pharmacovigilance; PPI, Protein-protein interactions; Animal Alt, Animal alternatives.