| Literature DB >> 25863278 |
Robert Hoehndorf, Paul N Schofield, Georgios V Gkoutos.
Abstract
Ontologies are widely used in biological and biomedical research. Their success lies in their combination of four main features present in almost all ontologies: provision of standard identifiers for classes and relations that represent the phenomena within a domain; provision of a vocabulary for a domain; provision of metadata that describes the intended meaning of the classes and relations in ontologies; and the provision of machine-readable axioms and definitions that enable computational access to some aspects of the meaning of classes and relations. While each of these features enables applications that facilitate data integration, data access and analysis, a great potential lies in the possibility of combining these four features to support integrative analysis and interpretation of multimodal data. Here, we provide a functional perspective on ontologies in biology and biomedicine, focusing on what ontologies can do and describing how they can be used in support of integrative research. We also outline perspectives for using ontologies in data-driven science, in particular their application in structured data mining and machine learning applications.Entities:
Keywords: Semantic Web; data integration; data mining; ontology
Mesh:
Year: 2015 PMID: 25863278 PMCID: PMC4652617 DOI: 10.1093/bib/bbv011
Source DB: PubMed Journal: Brief Bioinform ISSN: 1467-5463 Impact factor: 11.622
The main features provided by ontologies in support of biological and biomedical research
| Ontology feature | Utility in research |
|---|---|
| Classes and relations | The use of standard identifiers for classes and relations in ontologies is what enables data integration across multiple databases because the same identifiers can be used across multiple, disconnected databases, files, or web sites. |
| Domain vocabulary | Through labels associated with classes and relations, ontologies provide a domain vocabulary that can be exploited for applications ranging from natural language processing, creation of user interfaces, etc. |
| Metadata and descriptions | Textual definitions, descriptions, examples and further metadata associated with classes in ontologies are what enable domain experts to understand the precise meaning of class in the ontology. The definitions and related metadata should allow consistent understanding of the meaning of classes in ontologies. |
| Axioms and formal definitions | Formal definitions and axioms enable automated and computational access to (some parts of) the meaning of a class or relation. |
Figure 1.A part of the Plant Ontology. The figure shows classes as circles, labels and definitions in boxes and axioms as edges between classes. The label and definition of the relation OBO_REL:0000002 is a label for an axiom pattern.
Query and representation languages, and key concepts around ontologies in biology
| Language | Description |
|---|---|
| Resource Description Framework (RDF) | RDF [ |
| Web Ontology Language (OWL) | OWL [ |
| SPARQL Protocol and RDF Query Language (SPARQL) | SPARQL [ |
| Linked Data | Linked Data [ |
| OBO Flatfile Format | The OBO Flatfile Format [ |
| Proprietary graph-based ontology representation formats | A number of graph-based representations of ontologies have been developed that primarily specify labeled graphs. Examples include the representation of the Medical Subject Headings thesaurus [ |
Overview of main ontology repositories in the life science domain
| Repository | Key features | URL |
|---|---|---|
| BioPortal | BioPortal [ | |
| OntoBee | Ontobee [ | |
| Ontology Lookup Service | The Ontology Lookup Service [ | |
| OBO Library | The Open Biological and Biomedical Ontologies (OBO) library [ |
A selection of automated reasoners for OWL ontologies
| Reasoner | OWL support | Description |
|---|---|---|
| Pellet [ | OWL 2, OWL EL | General purpose OWL reasoner with a large set of features, including specialized OWL EL reasoning, support for rules, support of epistemic operators, integration in SPARQL, explanation of inferences, incremental reasoning. |
| HermiT [ | OWL 2, OWL EL | General purpose, highly optimized OWL reasoner. |
| FacT++ [ | OWL-DL, OWL 2 (partially) | Highly optimized reasoner implemented in C++. |
| Konklude [ | OWL 2 | Highly optimized OWL reasoner supporting parallel reasoning. |
| RacerPro 2.0 [ | OWL 2 (partial) | Optimized OWL reasoner, with integration in the AllegroGraph [ |
| TrOWL [ | OWL 2 | Scalable OWL reasoner with support for limited closed-world reasoning (negation as failure) and stream reasoning. |
| ELK [ | OWL-EL | Optimized and feature-rich OWL EL reasoner with support for incremental and parallel reasoning. |
An overview over tools and software libraries for processing and interacting with ontologies
| Tool | Description | Web site |
|---|---|---|
| Protege, WebProtege | Protege [ | |
| OWL API | The OWL API [ | |
| Owlcpp | owlcpp [ | |
| Brain | Brain [ | |
| Redland RDF API | An RDF library written in C. It provides a large set of commonly used command line tools to transform or collect basic statistics about an RDF file. | |
| Apache Jena | Jena is a Java library and collection of tools consisting of an RDF library, integration of SPARQL queries and support for OWL ontologies. |
An overview over generic ontology analysis and visualization tools and libraries
| Tool | Description | Web site |
|---|---|---|
| Gephi | Gephi [ | |
| Cytoscape | Cytoscape [ | |
| Semantic Measures Library | The Semantic Measures Library and Toolkit [ | |
| GO enrichment analysis tools | Enrichment analysis uses the graph-structure underlying ontologies (usually the GO) together with transitive inference over the edges in the graph to statistically test a hypothesis. The graph structure is used to ‘enrich’ statistical power by propagating annotations transitively over the graph and performing a test at each level of the ontology hierarchy. | |
| OntoFUNC | OntoFUNC [ |