| Literature DB >> 27336457 |
Andra Waagmeester1, Martina Kutmon1,2, Anders Riutta3, Ryan Miller1, Egon L Willighagen1, Chris T Evelo1,2, Alexander R Pico3.
Abstract
The diversity of online resources storing biological data in different formats provides a challenge for bioinformaticians to integrate and analyse their biological data. The semantic web provides a standard to facilitate knowledge integration using statements built as triples describing a relation between two objects. WikiPathways, an online collaborative pathway resource, is now available in the semantic web through a SPARQL endpoint at http://sparql.wikipathways.org. Having biological pathways in the semantic web allows rapid integration with data from other resources that contain information about elements present in pathways using SPARQL queries. In order to convert WikiPathways content into meaningful triples we developed two new vocabularies that capture the graphical representation and the pathway logic, respectively. Each gene, protein, and metabolite in a given pathway is defined with a standard set of identifiers to support linking to several other biological resources in the semantic web. WikiPathways triples were loaded into the Open PHACTS discovery platform and are available through its Web API (https://dev.openphacts.org/docs) to be used in various tools for drug development. We combined various semantic web resources with the newly converted WikiPathways content using a variety of SPARQL query types and third-party resources, such as the Open PHACTS API. The ability to use pathway information to form new links across diverse biological data highlights the utility of integrating WikiPathways in the semantic web.Entities:
Mesh:
Year: 2016 PMID: 27336457 PMCID: PMC4918977 DOI: 10.1371/journal.pcbi.1004989
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1A construct query is type of SPARQL query that enables the conversion of one graph pattern to another.
Here an interaction described by its spatial properties (GPML) is converted into a semantic representation reflecting its biological interpretation (WP). The SPARQL query is available in the supporting information section.
Example queries handled by the WikiPathways SPARQL endpoint.
| List the species captured in WikiPathways and the number of pathways per species | |
| Get all gene products on a particular pathway (WP615 as an example) | |
| Return all PubChem compounds in WikiPathways and the pathways they are in |
Example federated queries handled by the WikiPathways SPARQL endpoint.
| From DisGeNET get disease-gene pairs on asthma and get all pathways where these genes have a role |
| For the genes differentially expressed in asthma (found in the EBI Expression Atlas), get the gene products associated to a WikiPathways pathway |
Fig 2The colored boxes represent genes which are up (red) or down (blue) regulated in diabetes mellitus.
PIK3R2, MYO1C, PRKAA2, LIPE are down regulated in pre-diabetes. STX4A is down regulated in type 1 diabetes longstanding. PRKCQ, PTPN11, FOXO3A are down regulated in type 2 diabetes. GAB1, RHEB, MAP4K4, SNAP23 are up regulated in pre-diabetes. RHOJ, PRKCB are up regulated in type 1 diabetes recent onset. MAPK14UP, EIF4EBP1 are up regulated in type 1 diabetes clinical onset. From these 17 up or down regulated genes, 9 are being reported as being in the top 10 disease and phenotype associations for the selected gene in DisGeNET (i.e. PIK3R2, PRKAA2, LIPE, STX4A, PRKCQ, FOXO3A, MAP4K4, SNAP23, and PRKCB) (Gene-disease association data were retrieved from the DisGeNET Database, GRIB/IMIM/UPF Integrative Biomedical Informatics Group, Barcelona. (http://www.disgenet.org/). 04, 2016)