| Literature DB >> 29142286 |
Chen Peng1,2, Ao Li3,4, Minghui Wang1,5.
Abstract
In human health, a fundamental challenge is the identification of disease-related genes. Bladder cancer (BC) is a worldwide malignant tumor, which has resulted in 170,000 deaths in 2010 up from 114,000 in 1990. Moreover, with the emergence of multi-omics data, more comprehensive analysis of human diseases become possible. In this study, we propose a multi-step approach for the identification of BC-related genes by using integrative Heterogeneous Network Modeling of Multi-Omics data (iHNMMO). The heterogeneous network model properly and comprehensively reflects the multiple kinds of relationships between genes in the multi-omics data of BC, including general relationships, unique relationships under BC condition, correlational relationships within each omics and regulatory relationships between different omics. Besides, a network-based propagation algorithm with resistance is utilized to quantize the relationships between genes and BC precisely. The results of comprehensive performance evaluation suggest that iHNMMO significantly outperforms other approaches. Moreover, further analysis suggests that the top ranked genes may be functionally implicated in BC, which also confirms the superiority of iHNMMO. In summary, this study shows that disease-related genes can be better identified through reasonable integration of multi-omics data.Entities:
Mesh:
Year: 2017 PMID: 29142286 PMCID: PMC5688092 DOI: 10.1038/s41598-017-15890-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Flowchart of iHNMMO. The detailed process is described in Section “Pipeline of iHNMMO”.
Figure 2Overall process of the heterogeneous network model construction. (A) The collection of seeds. (B) Extraction of correlational relationships. (C) Four networks within each omics. (D) The heterogeneous network model.
Figure 3Performance comparison of iHNMMO and network-based models with single-omics data. (A) ROC curves. The x axis and y axis represent 1-Sp and Sn, respectively. (B) Fold enrichment. (C) Rank cutoff curves.
Performance comparison between iHNMMO and network-based models with single-omics data using Sn values at stringent levels of Sp.
| method |
| NMSO- Meth | NMSO-CNV | NMSO-Expr |
|---|---|---|---|---|
|
|
| 67.5% | 67.9% | 84.6% |
|
|
| 0% | 3.7% | 16.2% |
|
| 99.0% | |||
|
|
| 22.2% | 14.8% | 44.4% |
|
| 95.0% | |||
|
|
| 22.2% | 22.2% | 66.7% |
|
| 90.0% | |||
Here TP and FP stand for true positives and false positives, TN and FN for true negatives and false negatives, respectively.
The fractions and corresponding p-values of known BC-related genes predicted by iHNMMO, NSD-CNV, NSD-Meth and NSD-Expr.
|
| NMSO-CNV | NMSO-Meth | NMSO-Expr | |||||
|---|---|---|---|---|---|---|---|---|
| fraction |
| fraction |
| fraction |
| fraction |
| |
| Top 5% |
|
| 22.2% | 6.3 × 10 | 14.8% | 3.3 × 10 | 41.4% | 6.3 × 10 |
| Top 10% |
|
| 22.2% | 1.7 × 10 | 22.2% | 3.2 × 10 | 63.6% | 5.0 × 10 |
| Top 15% |
|
| 22.2% | 2.5 × 10 | 37.0% | 3.0 × 10 | 73.7% | 1.3 × 10 |
| Top 20% |
|
| 22.2% | 3.0 × 10 | 37.0% | 1.9 × 10 | 76.8% | 2.9 × 10 |
Figure 4Performance comparison of iHNMMO and existing approaches. (A) AUC values and Sn values at different levels of Sp. (B) Precision-recall curves. (C) The rank cutoff curves. The x and y axis respectively represents the threshold and the fraction of known BC-related genes.
Functional enrichment analysis of the top 100 ranked genes.
| Category | Term | Count |
|
|---|---|---|---|
| KEGG_PATHWAY | hsa04151:PI3K-Akt signaling pathway | 19 | 2.8 × 10−9 |
| GOTERM_BP_DIRECT | GO:0048146~positive regulation of fibroblast proliferation | 5 | 9.2 × 10−5 |
| KEGG_PATHWAY | hsa05200:Pathways in cancer | 14 | 1.0 × 10−4 |
| GOTERM_BP_DIRECT | GO:0043065~positive regulation of apoptotic process | 7 | 2.6 × 10−4 |
| KEGG_PATHWAY | hsa04014:Ras signaling pathway | 10 | 4.1 × 10−4 |
| KEGG_PATHWAY | hsa04350:TGF-beta signaling pathway | 6 | 1.4 × 10−3 |
| KEGG_PATHWAY | hsa04010:MAPK signaling pathway | 9 | 3.8 × 10−3 |
| GOTERM_MF_DIRECT | GO:0004714~transmembrane receptor protein tyrosine kinase activity | 3 | 5.9 × 10−3 |
| GOTERM_BP_DIRECT | GO:0007219~Notch signaling pathway | 4 | 1.3 × 10−2 |
| GOTERM_BP_DIRECT | GO:0042127~regulation of cell proliferation | 5 | 1.5 × 10−2 |
| GOTERM_BP_DIRECT | GO:0060548~negative regulation of cell death | 3 | 1.6 × 10−2 |
| KEGG_PATHWAY | hsa04115:p53 signaling pathway | 4 | 3.2 × 10−2 |
The information of the top 10 ranked predicted genes.
| Ranking | Genes | normalized scores |
|---|---|---|
| 1 | CCNE2 | 0.77 |
| 2 | FSCN1 | 0.66 |
| 3 | KLK3 | 0.63 |
| 4 | FGFR4 | 0.63 |
| 5 | CDC25A | 0.62 |
| 6 | CGB7 | 0.60 |
| 7 | TFAP2A | 0.59 |
| 8 | WT1 | 0.58 |
| 9 | PRDM16 | 0.57 |
| 10 | SNAI1 | 0.57 |