| Literature DB >> 28443230 |
Abstract
In the current study, we planned to predict the optimal gene functions for osteosarcoma (OS) by integrating network-based method with guilt by association (GBA) principle (called as network-based gene function inference approach) based on gene oncology (GO) data and gene expression profile. To begin with, differentially expressed genes (DEGs) were extracted using linear models for microarray data (LIMMA) package. Then, construction of differential co-expression network (DCN) relying on DEGs was implemented, and sub-DCN was identified using Spearman correlation coefficient (SCC). Subsequently, GO annotations for OS were collected according to known confirmed database and DEGs. Ultimately, gene functions were predicted by means of GBA principle based on the area under the curve (AUC) for GO terms, and we determined GO terms with AUC >0.7 as the optimal gene functions for OS. Totally, 123 DEGs and 137 GO terms were obtained for further analysis. A DCN was constructed, which included 123 DEGs and 7503 interactions. A total of 105 GO terms were identified when the threshold was set as AUC >0.5, which had a good classification performance. Among these 105 GO terms, 2 functions had the AUC >0.7 and were determined as the optimal gene functions including angiogenesis (AUC =0.767) and regulation of immune system process (AUC =0.710). These gene functions appear to have potential for early detection and clinical treatment of OS in the future.Entities:
Keywords: Area under the curve; Differential co-expression network; Gene ontology; Guilt by association; Osteosarcoma; Spearman correlation coefficient
Year: 2017 PMID: 28443230 PMCID: PMC5396855 DOI: 10.1016/j.jbo.2017.04.003
Source DB: PubMed Journal: J Bone Oncol ISSN: 2212-1366 Impact factor: 4.072
List of the top 20 differentially expressed genes (DEGs).
| Genes | |log (fold change)| | False discovery rate (FDR) |
|---|---|---|
| HOXB7 | 2.085898 | 1.33E−07 |
| RHPN2 | 2.147406 | 6.32E−07 |
| SRGN | 5.174341 | 3.34E−06 |
| FOXF2 | 2.318869 | 4.51E−06 |
| PLVAP | 3.404251 | 1.98E−05 |
| COX7A1 | 2.988735 | 5.41E−05 |
| APOE | 3.399852 | 6.21E−05 |
| SPINT2 | 2.301193 | 6.65E−05 |
| LXN | 2.523381 | 8.93E−05 |
| TNFRSF1B | 2.427194 | 1.10E−04 |
| VAMP8 | 3.494650 | 1.61E−04 |
| CBS | 2.868625 | 1.76E−04 |
| HCLS1 | 2.686705 | 1.81E−04 |
| PHGDH | 2.362045 | 2.15E−04 |
| GIMAP7 | 2.258177 | 2.74E−04 |
| C1QC | 2.885076 | 2.76E−04 |
| HBB | 5.191561 | 2.86E−04 |
| C1QA | 3.521570 | 2.93E−04 |
| TYROBP | 3.511873 | 3.17E−04 |
| C1QB | 3.184835 | 3.17E−04 |
Fig. 1Differentially co-expressed network (DCN) construction for osteosarcoma (OS) based on differentially expressed genes (DEGs). A. Degree distribution of genes in the DCN. B. Weights distribution of edges in the DCN. Heatmap clarified weight distribution for each interaction.
Fig. 2Sub-DCN using the cut-off threshold of weight value >0.8. In the sub-DCN, there were 46 nodes and 438 interactions.
Degree distribution of 46 genes in the sub-differential co-expression network (DCN).
| Genes | Degree | Genes | Degree |
|---|---|---|---|
| TNFRSF1B | 33 | HBD | 20 |
| ZFP36 | 32 | STOM | 19 |
| HLA-E | 31 | DOCK2 | 18 |
| ARHGDIB | 28 | HLA-DMA | 17 |
| HLA-DPA1 | 28 | APOE | 17 |
| VAMP8 | 26 | LAPTM5 | 17 |
| FABP4 | 24 | ICAM2 | 17 |
| GIMAP7 | 24 | FCGRT | 16 |
| CD74 | 24 | ACP5 | 15 |
| PLEK | 23 | APOD | 15 |
| COMP | 23 | COX7A1 | 15 |
| HCLS1 | 23 | HLA-DMB | 15 |
| PLAC9 | 23 | CD93 | 14 |
| JCHAIN | 23 | ATP8B4 | 14 |
| SERPINA3 | 23 | FAM46C | 14 |
| C2orf40 | 22 | CD14 | 14 |
| SRGN | 22 | SPP1 | 13 |
| CTSK | 21 | HLA-DRA | 12 |
| CSF1R | 21 | FCER1G | 12 |
| CAT | 21 | ALOX5AP | 12 |
| ADIRF | 21 | CKM | 8 |
| HCST | 21 | VWF | 4 |
| C1QC | 21 |
Fig. 3Gene function prediction performance using guilt by association (GBA). The histogram of AUCs across all GO terms which can be obtained using a single list constructed from number of coexpression partners.