| Literature DB >> 25522365 |
Joseline Ratnam1, Barbara Zdrazil2, Daniela Digles2, Emiliano Cuadrado-Rodriguez1, Jean-Marc Neefs3, Hannah Tipney4, Ronald Siebes5, Andra Waagmeester6, Glyn Bradley4, Chau Han Chau4, Lars Richter2, Jose Brea1, Chris T Evelo6, Edgar Jacoby3, Stefan Senger4, Maria Isabel Loza1, Gerhard F Ecker2, Christine Chichester7.
Abstract
Integration of open access, curated, high-quality information from multiple disciplines in the Life and Biomedical Sciences provides a holistic understanding of the domain. Additionally, the effective linking of diverse data sources can unearth hidden relationships and guide potential research strategies. However, given the lack of consistency between descriptors and identifiers used in different resources and the absence of a simple mechanism to link them, gathering and combining relevant, comprehensive information from diverse databases remains a challenge. The Open Pharmacological Concepts Triple Store (Open PHACTS) is an Innovative Medicines Initiative project that uses semantic web technology approaches to enable scientists to easily access and process data from multiple sources to solve real-world drug discovery problems. The project draws together sources of publicly-available pharmacological, physicochemical and biomolecular data, represents it in a stable infrastructure and provides well-defined information exploration and retrieval methods. Here, we highlight the utility of this platform in conjunction with workflow tools to solve pharmacological research questions that require interoperability between target, compound, and pathway data. Use cases presented herein cover 1) the comprehensive identification of chemical matter for a dopamine receptor drug discovery program 2) the identification of compounds active against all targets in the Epidermal growth factor receptor (ErbB) signaling pathway that have a relevance to disease and 3) the evaluation of established targets in the Vitamin D metabolism pathway to aid novel Vitamin D analogue design. The example workflows presented illustrate how the Open PHACTS Discovery Platform can be used to exploit existing knowledge and generate new hypotheses in the process of drug discovery.Entities:
Mesh:
Year: 2014 PMID: 25522365 PMCID: PMC4270790 DOI: 10.1371/journal.pone.0115460
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Open PHACTS v1.3 API calls (orange boxes) used to address use cases A, B and C, as described in Methods.
Operations performed outside Open PHACTS, viz., sequence similarity searches via BLAST and access to proprietary databases (dark grey boxes) are facilitated by information derived from the platform. Sample input URIs for each API call is shown in S2 Table.
Figure 2Use case A workflow.
Schematic representation of the workflow for use case A. Starting with a free text search for the desired target(s), Uniprot AC identifiers, protein sequences and gene symbols are obtained using ‘Free Text to Concept’ and ‘Target Information’ API calls. A gene symbol list is obtained for targets from the same family (based on GO) using a ‘Target Classification’ API call. Alternatively, UniProt ACs obtained for related protein sequences via a BLAST search are used to get corresponding gene symbols using the ‘Target Information’ API call. Using this gene list, corresponding pharmacology records in the public domain are obtained via the ‘Pharmacology by Target’ API. In parallel, the gene symbol list is used to retrieve target pharmacology information in Thomson Reuters Integrity, World Drug Index, PharmaProjects, GVKBio GOSTAR, and Janssen pharmacology proprietary databases. Public pharmacology records (additional targets) for the retrieved compounds are then obtained using the ‘Pharmacology by compound’ API call with equivalent searches in Janssen pharmacology proprietary databases. If required, a structure similarity search is performed with the retrieved compounds to identify additional compounds, followed by another round of searches in Open PHACTS and proprietary databases as before. A Pipeline Pilot script was developed to run the above steps and produce an integrated list of compounds, activity data and target information from all databases. Proprietary components developed at Janssen were used to parse Janssen pharmacology data. All data processing was performed within the Pipeline Pilot framework.
Number of DRD2-targeted compounds found in different databases.
| Activity Data Source | Number of Compounds found |
| Open PHACTS | 2278 (active) +164 (inactive) |
| Patent Reporting Databases | 3148 |
| Janssen Compound Screening Databases | 8959 |
Active compounds have % activity values>50% or -log(IC50) values>6.
Figure 3case B workflow.
Open PHACTS v 1.3 API calls are shown in orange boxes along with the results obtained. Bioactivity filters and other data processing operations are shown in yellow boxes with results obtained in light grey boxes. Blue colored boxes show results included in the manuscript. Compound pharmacology at the pathway level was retrieved by consecutive execution of the API calls ‘Pathway Information: Get targets’ and ‘Target Pharmacology: List’ - the latter includes a filtering for desired activity endpoints and units - and other filtering, transformation, and normalization steps: transformation into ‘- logActivity values [molar]’, setting a threshold for binary representation, and subsequent filtering by keeping only the max. activity value for each compound/target pair. Retrieving GO annotations for a list of targets, and ChEBI annotations for compounds that have been tested against those targets was achieved by using the API calls ‘Target Classifications’ and ‘Compound Classifications’ and subsequent restriction to terms of the type ‘biological process’ and ‘has role’, respectively.
List of 23 targets (possessing more than 100 active compounds) with their ChEMBL Target IDs, target names, target types, and the number of active and inactive compounds that have been tested on those targets (considering a threshold of 6).
| ChEMBL target ID | Target Name | Target Type | number of actives | number of inactives |
| CHEMBL4096 | Cellular tumor antigen p53 | Single Protein | 3670 | 32405 |
| CHEMBL203 | Epidermal growth factor receptor erbB1 | Single Protein | 2268 | 2760 |
| CHEMBL267 | Tyrosine-protein kinase SRC | Single Protein | 1567 | 2243 |
| CHEMBL262 | Glycogen synthase kinase-3 beta | Single Protein | 1536 | 1547 |
| CHEMBL2842 | FK506 binding protein 12 | Single Protein | 1328 | 3244 |
| CHEMBL4040 | MAP kinase ERK2 | Single Protein | 1230 | 13342 |
| CHEMBL1862 | Tyrosine-protein kinase ABL | Single Protein | 1077 | 614 |
| CHEMBL1824 | Receptor protein-tyrosine kinase erbB-2 | Single Protein | 964 | 1214 |
| CHEMBL2276 | c-Jun N-terminal kinase 1 | Single Protein | 607 | 1215 |
| CHEMBL299 | Protein kinase C alpha | Single Protein | 528 | 483 |
| CHEMBL3587 | Dual specificity mitogen-activated protein kinase kinase 1 | Single Protein | 444 | 475 |
| CHEMBL1907601 | Cyclin-dependent kinase 4/cyclin D1 | Protein Complex | 381 | 379 |
| CHEMBL4501 | Ribosomal protein S6 kinase 1 | Single Protein | 299 | 582 |
| CHEMBL2695 | Focal adhesion kinase 1 | Single Protein | 230 | 825 |
| CHEMBL4816 | Serine/threonine-protein kinase AKT3 | Single Protein | 163 | 876 |
| CHEMBL2095188 | Glycogen synthase kinase-3 | Protein Family | 158 | 201 |
| CHEMBL3663 | Growth factor receptor-bound protein 2 | Single Protein | 145 | 168 |
| CHEMBL4482 | Serine/threonine-protein kinase PAK 4 | Single Protein | 129 | 1016 |
| CHEMBL5023 | p53-binding protein Mdm-2 | Single Protein | 127 | 307 |
| CHEMBL2095942 | Cyclin-dependent kinase 4/cyclin D | Protein Complex | 113 | 42 |
| CHEMBL1907611 | Tumour suppressor p53/oncoprotein Mdm2 | Protein Protein Interaction | 107 | 252 |
| CHEMBL2096618 | Bcr/Abl fusion protein | Chimeric Protein | 107 | 106 |
| CHEMBL3009 | Receptor protein-tyrosine kinase erbB-4 | Single Protein | 102 | 683 |
List of targets, compounds and approved drugs in Vitamin D metabolism pathway obtained from Workflow 1.
| No. | UniProt Accession | Name | Active compounds (bioactivities) | Approved drugs obtained via 2 methods | Comment | |
| Target information API (DrugBank 3.0) | Target Pharmacology API (ChEMBL_16) | |||||
|
| Q07973 | CYP24A1 cytochrome P450 | 26 (30) | 0 | ||
|
| Q548T3 | CYP27B1 cytochrome P450 | 1 (1) | Ergocalciferol | ||
| Calcidiol | ||||||
| Calcitriol | ||||||
|
| Q02318 | Sterol 26-hydroxylase, mitochondrial | 5 (5) | Cholecalciferol | ||
|
| P02774 | Vitamin D-binding protein | 39 (112) | 0 | ||
|
| Q9UBM7 | DHCR7 7-dehydrocholesterol reductase | 4 (5) | NADH | ||
| Q15125, Q9UBM7 | AEBS | 66 (122) | 0 | Tamoxifen, Doxorubicin | Target is protein complex AEBS (made of DHCR7 and D8-D7 sterol isomerase). Not listed in DrugBank 3.0 | |
|
| Q6VVX0 | CYP2R1 cytochrome P450 | 0 (0) | Cholecalciferol, Ergocalciferol | ||
|
| P01270 | PTH parathyroid hormone | 0 (0) | 0 | ||
|
| P19793 | RXRA retinoid X receptor, alpha | 545 (1274) | Alitretinoin | ||
| Acitretin | ||||||
| Adapalene | ||||||
| P37231, P19793 | RXR alpha/PPAR gamma | 33 (65) | 0 | 0 | ||
|
| P28702 | RXRB retinoid X receptor, beta | 149 (261) | Alitretinoin | ||
| Bexarotene | ||||||
| Acitretin | ||||||
| Adapalene | ||||||
| Tazarotene | ||||||
| Tretinoin | ||||||
|
| P11473 | VDR vitamin D (1,25- dihydroxyvitamin D3) receptor | 4139 (5918) | Calcipotriol | ||
| Calcitriol | ||||||
| Ergocalciferol | ||||||
| Paricalcitol | ||||||
| Dihydrotachysterol | ||||||
| Calcidiol | ||||||
Compounds active against CYP24A1 obtained from Workflow 2.
| No. | Compound name | Assay Description | Activity Type | Value | Unit | pChembl | Active against other targets in pathway? | Other targets in general? |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4'-chloro-N-[2-(1H-imidazol-1-yl)-2-phenylethyl]biphenyl-4-carboxamide | Inhibition of CYP24 in human keratinocytes | IC50 | 15 | nM | 7.82 | 25-hydroxyvitamin D-1 alpha hydroxylase, mitochondrial | No |
|
| 1-[4-(4-{[(2S,4R)-2-(2,4-Dichlorophenyl)-2-(1H-imidazol-1-ylmethyl)-1,3-dioxolan-4-yl]methoxy}phenyl)-1-piperazinyl]ethanone | Inhibition of CYP24A1 in human epidermal keratinocytes | IC50 | 126 | nM | 6.90 | RXRA, VDR | 340 different targets (ketoconazole) |
|
| 2-(2-Ethylbenzyl)-6-methoxy-3,4-dihydro-1(2H)-naphthalenone | Inhibition of CYP24A1 expressed in CHO cells | IC50 | 1920 | nM | 5.72 | Sterol 26-hydroxylase, mitochondrial | No |
|
| 6-Methoxy-2-[2-(trifluoromethyl)benzyl]-3,4-dihydro-1(2H)-naphthalenone | Inhibition of CYP24A1 expressed in CHO cells | IC50 | 2080 | nM | 5.68 | Sterol 26-hydroxylase, mitochondrial | No |
|
| (2E)-6-Methoxy-2-{2-[(E)-2-phenylvinyl]benzylidene}-3,4-dihydro-1(2H)-naphthalenone | Inhibition of CYP24A1 expressed in CHO cells | IC50 | 5080 | nM | 5.29 | Sterol 26-hydroxylase, mitochondrial | No |
Compounds 1–7 ranked according to potency (in bold) have no activity against additional targets based on polypharmacology records, whereas compounds 8–12 inhibit calcitriol activating enzymes, VDR and RXRA.
Figure 4Use case C workflows 1 and 2.
Open PHACTS v 1.3 API calls are shown in orange boxes along with the results obtained. Bioactivity filters and other data processing operations are shown in yellow boxes with results obtained in light grey boxes. Blue colored boxes show results included in the manuscript. Sample input URLs are shown in S2 Table. For workflow 1, a description of the pathway and targets contained were obtained using the ‘Pathway information’ and ‘Pathway Information: Get targets’ API calls. Other pathways where these targets are present were obtained using ‘Pathways for Target: List’ API call. Approved drugs against single protein targets were obtained using ‘Target Information’ API call by specifying target type - approved. Compounds tested against all targets in the pathway were retrieved using ‘Target Pharmacology: List’ API call. Approved drugs targeting protein complexes (containing any member of the pathway) were identified by filtering for protein complexes and ‘approved’ target type via the ‘Compound Information’ API call. For workflow 2, compounds hitting CYP24A1 from the previous results were used as input to find additional targets using the ‘Compound Pharmacology: List’ API. Additional pathways containing these new targets were obtained using ‘Pathways for Target: List’ API.
Regulators of Vitamin D signaling obtained from Workflow 3.
| GO terms associated with Vitamin D regulation | Associated Genes (Human) | In pathway? | ||
| Parents | Children | UniProt Accession | TargetName/Gen Name | |
| GO:0010979 regulation of vitamin D 24-hydroxylase activity |
| O15528 | 25-hydroxyvitamin D-1 alpha hydroxylase, mitochondrial | YES |
|
|
|
| ||
| Q9GZV9 | Fibroblast growth factor 23 | NO | ||
| GO:0060556 regulation of vitamin D biosynthetic process | GO:0060557 | P01579 | Interferon gamma | NO |
| P01375 | Tumor necrosis factor | NO | ||
| GO:0070562 regulation of vitamin D receptor signaling pathway | GO:0070564 | O15528 | 25-hydroxyvitamin D-1 alpha hydroxylase, mitochondrial | YES |
| Q13573 | SNW domain-containing protein 1 | NO | ||
| GO:0060556 regulation of vitamin D biosynthetic process | GO:0010957 | O43623 |
| NO |
| O95863 |
| NO | ||
| P19838 | Nuclear factor NF-kappa-B p105 subunit | NO | ||
| Q99684 | Zinc finger protein Gfi-1 | NO | ||
| GO:0070562 regulation of vitamin D receptor signaling pathway | GO:0070563 | O43623 |
| NO |
Terms in bold are discussed in the text.
Benefits of using the Open PHACTS Discovery Platform for drug discovery research.
| Benefits of using the Open PHACTS platform for drug discovery research |
| Mapping identifiers to external databases not required |
| Avoids different interfaces to online knowledge and the need to go back and forth between protein, pathway and bioactivity databases |
| All integrated data available under “Creative Commons” type licenses |
| Bioactivity values normalized via QUDT ontology |
| Getting approved drug status for a list of compounds, including those responsible for off-target or non-approved indications |
| Getting a list of homologues for a given target |
| Integration of ontology tags, i.e. from GO and ChEBI and hierarchies with other datasets |
| Data provenance facilitates enrichment of knowledge from primary literature |
| Possibility to create specialized API-compatible KNIME nodes to enable other user-defined queries |
| Existing pipelining workflows can be re-used entirely or in modules to answer other research questions |
| Results can be easily updated to benefit from future upgrades to the Open PHACTS platform |