| Literature DB >> 27774140 |
D Digles1, B Zdrazil1, J-M Neefs2, H Van Vlijmen2, C Herhaus3, A Caracoti4, J Brea5, B Roibás5, M I Loza5, N Queralt-Rosinach6, L I Furlong6, A Gaulton7, L Bartek8, S Senger8, C Chichester9, O Engkvist10, C T Evelo11, N I Franklin12, D Marren13, G F Ecker1, E Jacoby2.
Abstract
Phenotypic screening is in a renaissance phase and is expected by many academic and industry leaders to accelerate the discovery of new drugs for new biology. Given that phenotypic screening is per definition target agnostic, the emphasis of in silico and in vitro follow-up work is on the exploration of possible molecular mechanisms and efficacy targets underlying the biological processes interrogated by the phenotypic screening experiments. Herein, we present six exemplar computational protocols for the interpretation of cellular phenotypic screens based on the integration of compound, target, pathway, and disease data established by the IMI Open PHACTS project. The protocols annotate phenotypic hit lists and allow follow-up experiments and mechanistic conclusions. The annotations included are from ChEMBL, ChEBI, GO, WikiPathways and DisGeNET. Also provided are protocols which select from the IUPHAR/BPS Guide to PHARMACOLOGY interaction file selective compounds to probe potential targets and a correlation robot which systematically aims to identify an overlap of active compounds in both the phenotypic as well as any kinase assay. The protocols are applied to a phenotypic pre-lamin A/C splicing assay selected from the ChEMBL database to illustrate the process. The computational protocols make use of the Open PHACTS API and data and are built within the Pipeline Pilot and KNIME workflow tools.Entities:
Year: 2016 PMID: 27774140 PMCID: PMC5063042 DOI: 10.1039/c6md00065g
Source DB: PubMed Journal: Medchemcomm ISSN: 2040-2503 Impact factor: 3.597
Fig. 1Outline of data showing the links between the different key identifiers (URI, uniform resource identifiers) assessed for annotation of compounds and targets in the provided computational protocols. The data provenance is shown in brackets.
Fig. 2Schematic overview of protocols 1 to 4.
Fig. 3Schematic overview of protocol 5.
Fig. 4Schematic overview of protocol 6.
API calls and input, output and aggregation parameters
| Protocol | Used API calls | Input | Output | Aggregation |
| 1-2 ChEMBL, GO | Compound Pharmacology: List, | List of Compound URIs | Cache 1: CompoundID and URICompound, canonical smiles | Join cache 3 and 2 based on target name; Join cache 1 on URICompound |
| Target Classifications | Cache 2: URI compound, URI assay, pChembl, TargetName, URITarget | Merge and group on classification | ||
| Cache 3: Target Name, URIClassification, Classification | ||||
| 1 Chebi | Compound Classifications | List of Compound URIs | Cache 1: CompoundID and URICompound, canonical smiles | Join cache 2 and 1 based on URICompound. |
| Cache 2: URI compound, URI Chebi, ChebiDescription | Merge and group on ChebiDescription | |||
| 3 Pathways | Compound Pharmacology: List, | List of Compound URI | Cache 1: CompoundID and URICompound, canonical smiles | Join cache 3 and 2 based on target name; Join cache 1 on URICompound |
| Pathways for Target: List | Cache 2: URI compound, URI assay, pChembl, TargetName, URITarget | Merge and group on PathwayName | ||
| Cache 3 : Target Name, URIPathway, PathwayID, PathwayName | ||||
| 4 Disease | Compound Pharmacology: List, | List of Compound URIs | Cache 1 : CompoundID and URICompound, canonical smiles | Join cache 3 and 2 based on target name; Join cache 1 on URICompound |
| Diseases for Target: List | Cache 2 : URI compound, URI assay, pChembl, TargetName, URITarget | Merge and group on DiseaseName | ||
| Cache 3 : Target Name, URIDisease, DiseaseName | To limit the runtime of the protocol merging is done directly on the data stream of Cache 3. | |||
| 5 Correlation Robot | Target Class Member: List, | Use CHEMBL_PC_6 kinase family key to launch query | Cache 1 : List of 455 human kinases for which ChEMBL holds data | Join Cache 2 based on INCHIKEY to each assay from Cache 1 data stream |
| Target Pharmacology: List | Cache 2 : Lamin A/C splicing assay data | |||
| 6 GtoPdb Box | Compound Pharmacology: List, | List of Compound URIs. Cache 1 : Read GtoPdb interaction file | Cache 2 Keep URITarget and URIUniprot and extract UniprotID from URIUniprot | Join cache 1 GtoPdb interaction file to cache 2 based UniprotID. Merge by ligand name |
| Map URL |