| Literature DB >> 28265499 |
Abstract
BACKGROUND: In the era of semantic web, life science ontologies play an important role in tasks such as annotating biological objects, linking relevant data pieces, and verifying data consistency. Understanding ontology structures and overlapping ontologies is essential for tasks such as ontology reuse and development. We present an exploratory study where we examine structure and look for patterns in BioPortal, a comprehensive publicly available repository of live science ontologies.Entities:
Keywords: Biomedical ontology; Graph theory; Ontology evolution; Ontology mappings; Semantic web
Year: 2017 PMID: 28265499 PMCID: PMC5337086 DOI: 10.7717/peerj.2990
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
An example illustrating identified communities at two different time points.
| tp1 (2 communities) | tp2 (3 communities) | ||||
|---|---|---|---|---|---|
| Com | Ont | BC | Com | Ont | BC |
| c1 | A | 2 | c3 | B | 6 |
| B | 4 | F | 2 | ||
| C | 3 | c4 | A | 1 | |
| c2 | D | 0 | C | 4 | |
| E | 1 | c5 | D | 2 | |
| F | 4 | E | 4 | ||
| G | 2 | H | 5 | ||
Figure 1Illustration of identified communities at two different time points tp1 and tp2.
Ontologies (circles) that belong to the same community are coloured the same.
An example of similar and same communities.
| Community | Similar to | Closest to |
|---|---|---|
| c3 | c1 and c2 | / |
| c4 | c1 | c1 |
| c5 | c2 | c2 |
Statistics for different versions of the graph.
| Oct12 | Feb13 | Aug13 | Dec13 | Jul14 | |
|---|---|---|---|---|---|
| MAV | 0.346 | 0.339 | 0.343 | 0.435 | 0.402 |
| #All | 283 | 294 | 317 | 359 | 367 |
| #Map | 254 | 268 | 259 | 321 | 318 |
| %Map | 90% | 91% | 82% | 89% | 87% |
| #NoMap | 29 | 26 | 58 | 38 | 49 |
| %NoMap | 10% | 9% | 18% | 11% | 13% |
| #Com | 5 | 6 | 7 | 7 | 6 |
| #C1 | 87 | 127 | 88 | 211 | 160 |
| #C2 | 85 | 54 | 46 | 49 | 65 |
| #C3 | 31 | 35 | 43 | 28 | 48 |
| #C4 | 31 | 20 | 39 | 11 | 30 |
| #C5 | 20 | 28 | 36 | 11 | 12 |
| #C6 | / | 4 | 5 | 7 | 3 |
| #C7 | / | / | 2 | 4 | / |
Notes.
modularity analysis value
number of all ontologies
number of ontologies with at least one/no mappings
percentage of ontologies with at least one/no mappings
number of communities
community x
Figure 2Illustration of identified communities in a graph.
Different colours represent communities, while nodes represent ontologies. Node labels are ontology abbreviations and node sizes correspond to BC values. Grey nodes are ontologies with no mappings.
Identified communities and their main hub ontologies for all versions of the graph.
Please note that the communities are not aligned.
| Oct12 | Feb13 | Aug13 | Dec13 | Jul14 | |
|---|---|---|---|---|---|
| EP | NIF | ERO | NCIT | SWEET | |
| NCIT | NCIT | NCIT | |||
| UBERON | UBERON | RADLEX | NIFSTD | NIFSTD | |
| RADLEX | RADLEX | BIOMODELS | SYN | ||
| TRAK | MESH | MESH | |||
| / | NCBITaxon | NDFRT | NCBITaxon | SWO | |
| / | / | HIMC-CPT | SWO | / |
Notes.
Cardiac Electrophysiology Ontology
National Cancer Institute Thesaurus
Uber Anatomy Ontology
Radiology Lexicon
Systematized Nomenclature of Medicine—Clinical Terms
Neuroscience Information Framework
National Center for Biotechnology Information Organismal Classification
Eagle-I Research Resource Ontology
Taxonomy for Rehabilitation of Knee Conditions
National Drug File—Reference Terminology
Software Ontology
Sage Bionetworks Synapse Ontology
Semantic Web for Earth and Environment Technology Ontology
Figure 3Proportion of ontology pairs found in different number of abstracts
The horizontal axis displays the number of abstracts, while the vertical axis displays the proportion of ontology pairs for each number of abstracts.
Figure 4Similarities between graph pairs.
(A) Feb13 vs Oct12, (B) Aug12 vs Feb13, (C) Dec13 vs Aug 13, and (D) Jul14 vs Dec13. Column and row names represent the main hub ontology for each identified community. Different shades of green correspond to similarity scores where darker colours represent higher numbers and lighter colours represent lower numbers. The NoMap row represents ontologies that had no mappings in the previous version of the graph but are members of one of the communities in the newer version. The New row represents ontologies that did not exist in the previous version of the graph.
Aligned closest communities, and their main hub ontologies.
| Oct12 | Feb13 | Aug13 | Dec13 | Jul14 | |
|---|---|---|---|---|---|
| G1 | EP | NIF | ERO | NCIT | SWEET |
| G2 | SNOMEDCT | SNOMEDCT | SNOMEDCT | SNOMEDCT | SNOMEDCT |
| G3 | RADLEX | RADLEX | RADLEX | NIFSTD | NIFSTD |
| G4 | NCIT | NCIT | NCIT | MESH | MESH |
| G5 | UBERON | UBERON | TRAK | BioModels | SYN |
| G6 | / | / | / | SWO | SWO |
Figure 5Proportion of ontologies that stay in the same closest community between graph pairs.
Figure 6Top 5 categories.
Distribution of top 5 categories with highest number of members (Health, Anatomy, Gross Anatomy, Phenotype, Animal Gross Anatomy) for all 5 graph versions: Oct 12 (A), Feb13 (B), Aug13 (C), Dec13 (D), Jul14 (E).
Number of connected ontologies in each graph version for two anatomy ontologies.
| #Communities | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| #Projects | 12 | 26 | 27 | 7 | 5 | 0 |
Comparison of community information for November 2015 with and without considering ontology size.
| Size | MAV | #Ontologies | #Comm | #C1 | #C2 | #C3 | #C4 | #C5 | #C6 |
|---|---|---|---|---|---|---|---|---|---|
| No | 0.346 | 437 | 6 | 255 | 107 | 30 | 26 | 12 | 7 |
| Yes | 0.339 | 437 | 6 | 259 | 106 | 29 | 24 | 12 | 7 |