Bin Chen, Ying Ding, David J Wild.
Abstract
BACKGROUND: Systems chemical biology and chemogenomics are considered critical, integrative disciplines in modern biomedical research, but require data mining of large, integrated, heterogeneous datasets from chemistry and biology. We previously developed an RDF-based resource called Chem2Bio2RDF that enabled querying of such data using the SPARQL query language. Whilst this work has proved useful in its own right as one of the first major resources in these disciplines, its utility could be greatly improved by the application of an ontology for annotation of the nodes and edges in the RDF graph, enabling a much richer range of semantic queries to be issued.
Year: 2012 PMID: 22401035 PMCID: PMC3320537 DOI: 10.1186/1758-2946-4-6
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Figure 1. Workflow for the development of Chem2Bio2OWL.
Primary classes, their description, sample instance data sources and the number of sample annotated instances.
| primary classes | description | sample instance data sources | # of sample instances |
|---|---|---|---|
| SmallMolecule | a small bioactive molecule | PubChem, ChEBI | 15509 |
| Drug | a chemical used in the treatment, cure, prevention, or diagnosis of disease | DrugBank, PharmGKB, TTD | 6544 |
| Protein | a physical entity consisting of a sequence of amino acids | UniProt, HGNC, GOA | 12242 |
| BioAssay | an experiment to measure the effects of some substance on a target, a cell or a living organism | PubChem BioAssay, ChEMBL, BindingDB, PDSP | 26861 |
| Disease | any condition that causes pain, dysfunction, distress or social problems | OMIM, DO | 8724 |
| SideEffect | undesired effect from a medicine | SIDER | 1385 |
| Literature | a scientific article | Medline | 28392 |
| Pathway | a set or series of biological interactions | KEGG, Reactome | 347 |
| Interaction | | | |
| DrugDrug-Interaction | a drug affects the activity of another drug | DrugBank, DCDB | 9690 |
| ProteinProtein-Interaction | two or more proteins bind together | HPRD, DIP, BioGRID | 54345 |
| DrugInduced-SideEffect | a drug interaction that results in a side effect | SIDER | 61102 |
| DrugTreatment | the use of a drug to treat a disease | Diseasome | 812 |
| ChemicalProtein-Interaction | genomic response to chemical compounds | ChEMBL, BindingDB, PDSP Ki, TTD, BindingMOAD, DrugBank, CTD, MATADOR, ArrayExpress, KEGG | 47282 |
Data sources were described in [32].
Figure 2. Overview of Chem2Bio2OWL. Only a subset of the classes (presented as nodes) and their relations (presented as edges) is visualized. Some classes in ChemicalProteinInteraction are omitted due to limited space.
Figure 3. Ontological representation of Troglitazone, PPARG and their binding association tested in a bioassay experiment. The real data are available on the Chem2Bio2RDF website.
Figure 4. Workflow for ontology population.
Figure 5. Thiazolidinediones side effect study: the left panel shows the association between Troglitazone and liver toxicity; the right panel shows the association between Rosiglitazone and heart disease.
Figure 6. Paths between a benzimidazole analogue (CID:44143441) and the target KCNH2 in the Chem2Bio2OWL dataset. Nodes are colored by class and some edges are labeled by interaction type.