| Literature DB >> 30085931 |
Marco Brandizi1, Ajit Singh1, Christopher Rawlings1, Keywan Hassani-Pak1.
Abstract
The speed and accuracy of new scientific discoveries - be it by humans or artificial intelligence - depends on the quality of the underlying data and on the technology to connect, search and share the data efficiently. In recent years, we have seen the rise of graph databases and semi-formal data models such as knowledge graphs to facilitate software approaches to scientific discovery. These approaches extend work based on formalised models, such as the Semantic Web. In this paper, we present our developments to connect, search and share data about genome-scale knowledge networks (GSKN). We have developed a simple application ontology based on OWL/RDF with mappings to standard schemas. We are employing the ontology to power data access services like resolvable URIs, SPARQL endpoints, JSON-LD web APIs and Neo4j-based knowledge graphs. We demonstrate how the proposed ontology and graph databases considerably improve search and access to interoperable and reusable biological knowledge (i.e. the FAIRness data principles).Entities:
Keywords: FAIR data principles; bio-ontologies; biological knowledge networks; data integration; graph databases; linked data; semantic web
Mesh:
Year: 2018 PMID: 30085931 PMCID: PMC6340125 DOI: 10.1515/jib-2018-0023
Source DB: PubMed Journal: J Integr Bioinform ISSN: 1613-4516
Figure 1:The top-level organisation of the BioKNO ontology.
Specific biological entities are defined as subclasses of bk:Concept and relation types (i.e. OWL object properties) between them are based on subproperties of bk:conceptRelation. bk:Relation can be used to model reified relations. Both concepts and reified relations can have bk:attribute and other elements attached.
Figure 2:The new architecture designed for the KnetMiner ecosystem.
BioKNO-based modelling powers both data acquisition (extraction, loading, transformation, or ELT) and data querying from our and 3rd-party applications. RDF serves open data publishing and integration with other data sets.
Figure 3:Using Cypher to query genome-scale knowledge networks.
The query corresponds to the graph pattern on the bottom.