| Literature DB >> 29297309 |
Jiajie Peng1, Honggang Wang2, Junya Lu1, Weiwei Hui1, Yadong Wang3, Xuequn Shang4.
Abstract
BACKGROUND: The Gene Ontology (GO) is a community-based bioinformatics resource that employs ontologies to represent biological knowledge and describes information about gene and gene product function. GO includes three independent categories: molecular function, biological process and cellular component. For better biological reasoning, identifying the biological relationships between terms in different categories are important. However, the existing measurements to calculate similarity between terms in different categories are either developed by using the GO data only or only take part of combined gene co-function network information.Entities:
Keywords: Cross categories; Gene Ontology; Term similarity
Mesh:
Year: 2017 PMID: 29297309 PMCID: PMC5751813 DOI: 10.1186/s12859-017-1959-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The workflow of C r o G O2
Fig. 2Illustration example for iterative ranking based association score. The nodes and edges represent genes and their interactions respectively
Fig. 3ROC curves for the four methods on the gold-standard sets of yeast. The red, blue, yellow and green lines represent CroGO2 (red), CroGO (blue), and ASR (yellow) and VSM (green) method respectively. Most portion of ROC curves of ASR and VSM are overlapping
The performance of ASR, VSM, CroGO and CroGO2 measures on yeast gold-standard set
| Organism | Measure | TP rate (when | TP rate (when | TP rate (when |
|---|---|---|---|---|
| FP rate =5%) | FP rate =10%) | FP rate =15%) | ||
| *Yeast | ASR | 59% | / | / |
| VSM | 59% | / | / | |
| CroGO | 56% | 65% | 67% | |
| CroGO2 | 66% | 69% | 71% |
Fig. 4ROC curves for the robustness test of C r o G O2 with different co-function network densities
Fig. 5Venn diagram of T and T MF′. T is the set of enriched MF terms. T MF′ is the set of MF terms associated with the enriched BP terms
Top 20 term associations that were identified by C r o G O2
| BP Name | MF Name | Evidence |
|---|---|---|
| butanediol biosynthetic process | (R,R)-butanediol dehydrogenase activity | New |
| glutamine biosynthetic process | glutamate-ammonia ligase activity | [ |
| putrescine biosynthetic process | ornithine decarboxylase activity | [ |
| acetyl-CoA biosynthetic process from acetate | acetate-CoA ligase activity | New |
| alanine catabolic process | L-alanine:2-oxoglutarate aminotransferase activity | [ |
| siroheme biosynthetic process | precorrin-2 dehydrogenase activity | [ |
| trehalose catabolic process | alpha,alpha-trehalase activity | [ |
| asparagine catabolic process | asparaginase activity | [ |
| lysine biosynthetic process | aromatic-amino-acid:2-oxoglutarate aminotransferase activity | [ |
| glycerol biosynthetic process | glycerol-1-phosphatase activity | New |
| threonine catabolic process | L-threonine ammonia-lyase activity | New |
| peptide alpha-N-acetyltransferase activity | N-terminal protein amino acid acetylation | [ |
| glutathione catabolic process | gamma-glutamyltransferase activity | [ |
| alanine biosynthetic process | L-alanine:2-oxoglutarate aminotransferase activity | [ |
| positive regulation of histone H3-K36 methylation | TFIIF-class binding TF activity | New |
| siroheme biosynthetic process | uroporphyrin-III C-methyltransferase activity | [ |
| siroheme biosynthetic process | sirohydrochlorin ferrochelatase activity | [ |
| glutathione biosynthetic process | glutamate-cysteine ligase activity | [ |
| positive regulation of telomere maintenance via telomerase | Hsp90 protein binding | [ |
| chorismate biosynthetic process | 3-deoxy-7-phosphoheptulonate synthase activity | [ |