| Literature DB >> 29560823 |
Jiajie Peng1,2,3, Xuanshuo Zhang4, Weiwei Hui4, Junya Lu4, Qianqian Li4, Shuhui Liu4, Xuequn Shang4,5.
Abstract
BACKGROUND: Gene Ontology (GO) is one of the most popular bioinformatics resources. In the past decade, Gene Ontology-based gene semantic similarity has been effectively used to model gene-to-gene interactions in multiple research areas. However, most existing semantic similarity approaches rely only on GO annotations and structure, or incorporate only local interactions in the co-functional network. This may lead to inaccurate GO-based similarity resulting from the incomplete GO topology structure and gene annotations.Entities:
Keywords: Gene Ontology; Random walk with restart; Semantic similarity
Mesh:
Year: 2018 PMID: 29560823 PMCID: PMC5861498 DOI: 10.1186/s12918-018-0539-0
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fig. 1The workflow of NETSIM2
Fig. 2Performance comparison of different measures on GO’s molecular function terms in yeast (a) and Arabidopsis (b)
The LFC scores of five methods for the molecular function category on yeast data
| Method | Resnik | Relevance | Wang | NETSIM | NETSIM2 |
|---|---|---|---|---|---|
| 25% | 0.07 | 0.14 | 0.08 | 0.15 | 0.64 |
| 50% | 0.18 | 0.23 | 0.15 | 0.31 | 1.18 |
| 75% | 0.36 | 0.42 | 0.25 | 0.58 | 1.76 |
Fig. 3Number of ECs for which NETSIM2, NETSIM, Wang and Relevance measures performed the best for yeast (a) and Arabidopsis (b) based on molecular function terms
The LFC scores of five methods for the molecular function category on Arabidopsis data
| Method | Resnik | Relevance | Wang | NETSIM | NETSIM2 |
|---|---|---|---|---|---|
| 25% | 0.12 | 0.25 | 0.22 | 0.26 | 1.69 |
| 50% | 0.35 | 0.59 | 0.51 | 0.65 | 3.19 |
| 75% | 0.75 | 1.27 | 1.07 | 1.87 | 5 |
Fig. 4Performance comparison on LFC scores of similarity measures on GO’s biological process in yeast (a) and Arabidopsis (b)
The LFC scores of five methods for the biological process category on yeast data
| Method | Resnik | Relevance | Wang | NETSIM | NETSIM2 |
|---|---|---|---|---|---|
| 25% | 0.01 | 0.08 | 0.10 | 0.06 | 0.11 |
| 50% | 0.12 | 0.26 | 0.24 | 0.23 | 0.78 |
| 75% | 0.31 | 0.49 | 0.47 | 0.64 | 3.37 |
Fig. 5Number of ECs for which NETSIM2, NETSIM, Wang and Relevance measures performed the best for yeast (a) and Arabidopsis (b) based on biological process terms
The LFC scores of five methods for the biological process category on Arabidopsis data
| Method | Resnik | Relevance | Wang | NETSIM | NETSIM2 |
|---|---|---|---|---|---|
| 25% | 0.07 | 0.15 | 0.17 | 0.12 | 0.002 |
| 50% | 0.17 | 0.43 | 0.40 | 0.47 | 1.94 |
| 75% | 0.42 | 1.03 | 0.91 | 1.19 | 3.75 |