| Literature DB >> 29363423 |
Liang Cheng1, Yue Jiang2, Hong Ju3, Jie Sun1, Jiajie Peng4, Meng Zhou5, Yang Hu6.
Abstract
BACKGROUND: Since the establishment of the first biomedical ontology Gene Ontology (GO), the number of biomedical ontology has increased dramatically. Nowadays over 300 ontologies have been built including extensively used Disease Ontology (DO) and Human Phenotype Ontology (HPO). Because of the advantage of identifying novel relationships between terms, calculating similarity between ontology terms is one of the major tasks in this research area. Though similarities between terms within each ontology have been studied with in silico methods, term similarities across different ontologies were not investigated as deeply. The latest method took advantage of gene functional interaction network (GFIN) to explore such inter-ontology similarities of terms. However, it only used gene interactions and failed to make full use of the connectivity among gene nodes of the network. In addition, all existent methods are particularly designed for GO and their performances on the extended ontology community remain unknown.Entities:
Keywords: Biomedical ontology; Information flow; Random walk; Term similarities
Mesh:
Year: 2018 PMID: 29363423 PMCID: PMC5780854 DOI: 10.1186/s12864-017-4338-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Sub-graph of the Directed Acyclic Graph of three GO sub-ontologies. Each node indicates a term of GO, and each arrow symbol represents an ‘IS_A’ relationship of GO. For example, “catalytic complex” is linked to “protein complex” by an ‘IS_A’ relationship
Fig. 2Overview of InfAcrOnt demonstrating the basic ideas of measuring similarity between terms across ontologies
Data sources used for identifying novel relationships across ontologies
| Data source | Web site |
|---|---|
| GO | http://geneontology.org/page/download-ontology |
| GOA for yeast |
|
| GOA for human |
|
| YeastNet |
|
| HumanNet | |
| HPO & HPOA | |
| DO | |
| DOA |
|
| PubMedA |
|
Fig. 3ROC analysis of the benchmark set and random sets for human. a ROC curves for the experimental results on the benchmark set and a random set for human. It shows 1-specificity versus sensitivity of each method for calculating the similarities of terms across BP and MF. b Average of AUC for 100 iterators for human
Fig. 4The correlation between the term similarity based on ontology annotations and prior knowledge in HPO project. a The distribution of the similarity scores by InfAcrOnt method. b Pearson Correlation Coefficient between similarity scores based on TF-IDF and other methods
Fig. 5The correlation between the term similarity based on ontology annotations and prior knowledge in PubMed. a The distribution of the similarity scores by InfAcrOnt method. b Pearson Correlation Coefficient between similarity score based on EMI and other methods