| Literature DB >> 25785184 |
Junguk Hur1, Arzucan Özgür2, Zuoshuang Xiang3, Yongqun He4.
Abstract
BACKGROUND: Literature mining of gene-gene interactions has been enhanced by ontology-based name classifications. However, in biomedical literature mining, interaction keywords have not been carefully studied and used beyond a collection of keywords.Entities:
Keywords: Biomedical ontology; Gene-gene interaction; Interaction enrichment; Interaction network ontology; Literature mining
Year: 2015 PMID: 25785184 PMCID: PMC4362819 DOI: 10.1186/2041-1480-6-2
Source DB: PubMed Journal: J Biomed Semantics
The 2x2 contingency table
| # of gene-gene | Concept- | Whole |
|---|---|---|
| Interaction pairs | specific sub-network | Network |
| With the INO term | 30 – 1 | 500 |
| Without the INO term | 150 | 30000 |
Note: The sub-network has 30 gene pairs associated with this INO term out of a total of 180 gene pairs. A modified Fisher’s exact test, with the “- 1” modification made to the typical Fisher’s exact test to make the statistical test more conservative, was employed to identify significantly over-represented terms (p-value of 6.9E-20).
Figure 1INO hierarchy and selected INO key terms. INO is aligned with BFO. It imports most PSI-MI interaction type terms to represent the various interaction types. Some bottom level interaction terms (e.g., phosphorylation) are replaced with corresponding GO terms. Many INO-specific terms (e.g., regulation) that do not exist in PSI_MI or GO are also generated. Note that there are different interactors but only input interactor is shown here. The network and pathway related terms are not shown.
Figure 2The visualization of one term ‘protein myristoylation’ (GO_0018377) in INO. Originated from GO, this term and its branch of child terms are imported and placed with the framework of PSI-MI interaction types which are also imported into INO. The upper level terms are from BFO. The OntoFox tool [9] was used for importing external ontology terms and their axioms. The image is a screenshot generated from Ontobee [10]. To facilitate literature mining tagging, different synonyms of the term are collected under an annotation note.
Figure 3The workflow of INO-based gene-gene interaction enrichment analysis. This workflow illustrates the overall procedures of ontology-based gene pair enrichment analysis.
Significantly over-represented INO terms among the gene-gene interaction pairs of vaccine-associated sub-network
| INO_ID | Reference term | Enrichment fold | BH * | Most frequent gene-pair (#) |
|---|---|---|---|---|
| INO_0000140 | Neutralization | 6.6 | 0 | IFNG_IL12A (5)** |
| INO_0000096 | induction of production | 6.2 | 0 | TNF_IFNG (2) |
| INO_0000106 | gene fusion | 5.6 | 0 | CD40LG_CD40 (3) |
| INO_0000103 | accessory regulation | 3.9 | 0 | CD8A_CD4 (55) |
| INO_0000062 | Costimulation | 3.7 | 0 | CD40_CD8A (4) |
| INO_0000169 | Synergization | 3.0 | 0 | CD8A_CD40 (5) |
| INO_0000089 | co-regulation | 2.9 | 0 | CD8A_CD40 (5) |
| MI_0559 | glycosylation reaction | 2.9 | 0 | IL17A_MUC6 (1) |
| MI_0195 | covalent binding | 2.5 | 0 | CSF2_ACPP (2) |
| MI_0208 | genetic interaction | 4.9 | 1.82E-10 | CD40LG_CD40 (3) |
| MI_0571 | mRNA cleavage | 23.2 | 2.58E-07 | CFI_SUPT5H (1) |
| MI_0902 | RNA cleavage | 16.2 | 2.21E-06 | CFI_SUPT5H (1) |
| MI_0910 | nucleic acid cleavage | 6.4 | 6.11E-04 | CFI_SUPT5H (1) |
| GO_0018377 | protein myristoylation | 2.3 | 2.68E-03 | CD4_S100B (2) |
*BH: Benjamini-Hochberg; **IFNG_IL12A (5): represents the IFNG and IL12A gene pair with the ‘neutralization’ interaction keyword in five papers.
Significantly under-represented INO terms among the gene-gene interaction pairs of vaccine-associated sub-network
| INO_ID | Reference term | Enrichment fold | BH* P-value |
|---|---|---|---|
| MI_0203 | dephosphorylation reaction | 0.06 | 0 |
| INO_0000178 | tyrosine-phosphorylation | 0.09 | 0 |
| INO_0000044 | gene expression regulation | 0.26 | 0 |
| INO_0000172 | transactivation | 0.26 | 0 |
| INO_0000060 | coprecipitation | 0.28 | 0 |
| GO_0016310 | phosphorylation | 0.36 | 0 |
| MI_0403 | colocalization | 0.36 | 0 |
| MI_0414 | enzymatic reaction | 0.42 | 0 |
| MI_0194 | cleavage reaction | 0.49 | 0 |
| MI_0213 | methylation reaction | 0.37 | 6.84E-16 |
| INO_0000092 | dissociation | 0.28 | 6.27E-15 |
| INO_0000048 | coimmunoprecipitation | 0.35 | 1.00E-13 |
| INO_0000115 | hyperphosphorylation | 0.27 | 2.54E-08 |
| INO_0000084 | destabilization | 0.28 | 1.49E-05 |
| GO_0006461 | protein complex assembly | 0.24 | 1.97E-05 |
| INO_0000088 | protein dimerization | 0.26 | 6.41E-05 |
| INO_0000171 | Termination | 0.42 | 3.98E-03 |
*BH: Benjamini-Hochberg.
Figure 4The hierarchies of over- and under-represented INO interaction terms. (A) The hierarchy of 14 over-represented INO interaction terms. (B) The hierarchy of 17 under-represented INO interaction terms. The results were generated using OntoFox [9] with the OntoFox setting “includeComputedIntermediates”, and visualized using the Protege-OWL editor (http://protege.stanford.edu/). The box-enclosed terms are over- or under-represented interaction types directly identified in our program (see Tables 2 and 3). Other terms not enclosed in boxes are terms retrieved by OntoFox to ensure the completeness of the hierarchies.