| Literature DB >> 27303447 |
Ashenafi Legehar1, Henri Xhaard2, Leo Ghemtio1.
Abstract
BACKGROUND: The disposition of a pharmaceutical compound within an organism, i.e. its Absorption, Distribution, Metabolism, Excretion, Toxicity (ADMET) properties and adverse effects, critically affects late stage failure of drug candidates and has led to the withdrawal of approved drugs. Computational methods are effective approaches to reduce the number of safety issues by analyzing possible links between chemical structures and ADMET or adverse effects, but this is limited by the size, quality, and heterogeneity of the data available from individual sources. Thus, large, clean and integrated databases of approved drug data, associated with fast and efficient predictive tools are desirable early in the drug discovery process. DESCRIPTION: We have built a relational database (IDAAPM) to integrate available approved drug data such as drug approval information, ADMET and adverse effects, chemical structures and molecular descriptors, targets, bioactivity and related references. The database has been coupled with a searchable web interface and modern data analytics platform (KNIME) to allow data access, data transformation, initial analysis and further predictive modeling. Data were extracted from FDA resources and supplemented from other publicly available databases. Currently, the database contains information regarding about 19,226 FDA approval applications for 31,815 products (small molecules and biologics) with their approval history, 2505 active ingredients, together with as many ADMET properties, 1629 molecular structures, 2.5 million adverse effects and 36,963 experimental drug-target bioactivity data.Entities:
Keywords: ADMET; Adverse effects; Data analysis; Database; Drug-target database; FDA approved drugs; Predictive modeling; Targets
Year: 2016 PMID: 27303447 PMCID: PMC4906584 DOI: 10.1186/s13321-016-0141-7
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 1Simplified entity-relational model of IDAAPM. Eight entities or tables (main group of data) are shown linked by different types of association (see legend) to characterize how they are related to one another
Summary of IDAAPM content
| FDA applications | Products | Active ingredients | Structures | Drug areas | Target classes | Adeverse effects | Targets | Drug–targets interactions |
|---|---|---|---|---|---|---|---|---|
| 14,260 Generics | 31,815 | 2505 | 1629 | 20 | 6 | 2,472,329 | 3382 | 36,963 |
Fig. 2a Composition of IDAAPM drugs by mode of administration. Each histogram bar represents the amount of drugs present in IDAAPM for a selected mode of administration and is colored by commercial status of the drug. b Target distribution in IDAAPM by protein family. Each histogram bar represents the amount of target present in IDAAPM for the five most populated protein classes and is colored by their corresponding drug area
Fig. 3a Frequency of adverse effects. For the 26 adverse effects SOC MedDRA terms, each histogram bar corresponds to the amount of the selected adverse effect in IDAAPM and is colored by their corresponding drug area. b Systemic drugs distribution with ocular adverse effects. Adverse effects reported have relative frequency >0.1, each histogram bar corresponds to the primary area of systemic drugs with ocular adverse effect and is colored by their route of administration
Fig. 4IDAAPM utility examples. Examples of KNIME workflows to access IDAAPM (a), export data (b), make preliminary analysis (c, d) and build predictive classification models (e)
Fig. 5Ocular pharmaceutics adverse effect network with relative frequency >0.1. Adverse effects are reported using the system organ class of the medical dictionary for regulatory activities