| Literature DB >> 25708115 |
Hyun-hwan Jeong, So Kim, Kyubum Wee, Kyung-Ah Sohn.
Abstract
BACKGROUND: Network-based approaches have recently gained considerable popularity in high- dimensional regression settings. For example, the Cox regression model is widely used in expression analysis to predict the survival of patients. However, as the number of genes becomes substantially larger than the number of samples, the traditional Cox or L2-regularized Cox models are still prone to noise and produce unreliable estimations of regression coefficients. A recent approach called the network-based Cox (Net-Cox) model attempts to resolve this issue by incorporating prior gene network information into the Cox regression. The Net-Cox model has shown to outperform the models that do not use this network information.Entities:
Mesh:
Year: 2015 PMID: 25708115 PMCID: PMC4331683 DOI: 10.1186/1752-0509-9-S1-S8
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Prediction accuracy of the mutual information network-based Net-Cox model as a function of the network significance parameter σ. Average time-dependent AUC using 5-fold cross validation on training data is shown across σ.
Optimal parameters and averaged time-dependent AUCs of each network using 5-fold cross validation on training data
| Profile | Network |
|
|
| Mean (time AUC) |
|---|---|---|---|---|---|
| Mutual Information | 0.30 | 10−3 | 0.1 | ||
| Correlation | - | 10−3 | 0.9 | 0.5817 | |
| Functional Linkage | - | 10−3 | 0.3 | 0.5786 | |
| - | 10−3 | 1.0 | 0.5810 | ||
| Mutual Information | 0.10 | 10−4 | 0.1 | ||
| Correlation | - | 10−4 | 0.9 | 0.6280 | |
| Functional Linkage | - | 10−4 | 0.3 | 0.6242 | |
| - | 10−4 | 1.0 | 0.6288 | ||
| Mutual Information | 0.30 | 10−4 | 0.5 | ||
| Correlation | - | 10−4 | 0.7 | 0.5894 | |
| Functional Linkage | - | 10−4 | 0.3 | 0.5860 | |
| - | 10−4 | 1.0 | 0.5899 | ||
Figure 2Performance comparison of different network types in terms of 5-fold cross-validation accuracy.
Figure 3Performance comparison of different network types on validation set.
Figure 4Kaplan-Meier survival curves (high-risk group: red line; low risk group: blue line) and log-rank test results on validation set for each profile and network (A--Mutual information based network; B--Correlation network; C--Functional linkage network; D--.
Figure 5Heat-map for the regression coefficients of 15 selected genes. The five genes having the largest regression coefficients are selected for each profile and merged together.
Significantly enriched terms (using ToppGene) in the 100 genes that have the largest regression coefficients for each profile
| Profile | Category | ID | Name | p-value | Adjusted p-value | Count | Total |
|---|---|---|---|---|---|---|---|
| GO:BP | GO:0048706 | embryonic skeletal system development | 5.99E-06 | 1.25E-02 | 7 | 129 | |
| GO:BP | GO:0048704 | embryonic skeletal system morphogenesis | 1.43E-05 | 1.50E-02 | 6 | 98 | |
| GO:BP | GO:0009952 | anterior/posterior pattern specification | 2.17E-05 | 1.52E-02 | 8 | 217 | |
| Disease | ctd:D010051 | Ovarian Neoplasms | 5.24E-04 | 2.84E-02 | 4 | 78 | |
| Disease | ctd:D002277 | Carcinoma | 7.42E-03 | 3.66E-02 | 4 | 161 | |
| GO:BP | GO:0031442 | positive regulation of mRNA 3'-end processing | 7.94E-05 | 4.15E-02 | 3 | 16 | |
| GO:BP | GO:0031440 | regulation of mRNA 3'-end processing | 1.15E-04 | 4.80E-02 | 3 | 18 | |
| GO:MF | GO:0004568 | chitinase activity | 4.80E-06 | 1.78E-03 | 3 | 7 | |
| GO:BP | GO:0006032 | chitin catabolic process | 4.83E-06 | 4.94E-03 | 3 | 7 | |
| GO:BP | GO:0006030 | chitin metabolic process | 4.83E-06 | 4.94E-03 | 3 | 7 | |
| GO:BP | GO:1901072 | glucosamine-containing compound catabolic process | 7.70E-06 | 5.24E-03 | 3 | 8 | |
| GO:MF | GO:0086080 | protein binding involved in heterotypic cell-cell adhesion | 8.12E-05 | 1.50E-02 | 2 | 3 | |
| GO:MF | GO:0098631 | protein binding involved in cell adhesion | 1.62E-04 | 1.50E-02 | 2 | 4 | |
| GO:MF | GO:0098632 | protein binding involved in cell-cell adhesion | 1.62E-04 | 1.50E-02 | 2 | 4 | |
| GO:BP | GO:0046348 | amino sugar catabolic process | 3.86E-05 | 1.97E-02 | 3 | 13 | |
| GO:MF | GO:0008061 | chitin binding | 2.69E-04 | 1.99E-02 | 2 | 5 | |
| GO:MF | GO:0030492 | hemoglobin binding | 7.45E-04 | 4.61E-02 | 2 | 8 | |
| GO:CC | GO:0030424 | axon | 9.91E-06 | 1.95E-03 | 11 | 409 | |
| GO:CC | GO:0043025 | neuronal cell body | 2.06E-04 | 1.57E-02 | 10 | 478 | |
| GO:CC | GO:0043195 | terminal bouton | 3.15E-04 | 1.57E-02 | 4 | 62 | |
| GO:CC | GO:0036477 | somatodendritic compartment | 3.19E-04 | 1.57E-02 | 12 | 705 | |
| GO:CC | GO:0043005 | neuron projection | 4.07E-04 | 1.61E-02 | 14 | 945 | |
| GO:CC | GO:0044297 | cell body | 5.10E-04 | 1.68E-02 | 10 | 536 | |
| GO:CC | GO:0032983 | kainate selective glutamate receptor complex | 7.48E-04 | 2.11E-02 | 2 | 8 | |
| GO:MF | GO:0004952 | dopamine neurotransmitter receptor activity | 2.80E-04 | 4.01E-02 | 2 | 5 | |
| GO:MF | GO:0001965 | G-protein alpha-subunit binding | 3.95E-04 | 4.01E-02 | 3 | 27 | |
| GO:MF | GO:0015075 | ion transmembrane transporter activity | 4.14E-04 | 4.01E-02 | 13 | 819 | |
| GO:MF | GO:0022857 | transmembrane transporter activity | 5.21E-04 | 4.01E-02 | 14 | 951 | |
| GO:MF | GO:0015026 | coreceptor activity | 5.98E-04 | 4.01E-02 | 3 | 31 | |
| GO:MF | GO:0022891 | substrate-specific transmembrane transporter activity | 7.68E-04 | 4.29E-02 | 13 | 875 | |
| GO:MF | GO:0022892 | substrate-specific transporter activity | 9.57E-04 | 4.58E-02 | 14 | 1012 | |
Figure 6Gene-gene interaction sub-network constructed by using the 100 edges with the largest mutual information values on each profile (A: CNA, B: mRNA, C: METH).
Network properties of mutual information sub-network for each profile
| Properties | CNA | mRNA | METH |
|---|---|---|---|
| 78 | 154 | 149 | |
| 7 | 54 | 50 | |
| 0.312 | 0.058 | 0.052 | |
| 2.117 | 2.187 | 2.164 | |
| 2.564 | 1.299 | 1.342 | |
| 0.033 | 0.008 | 0.009 | |
| 1.383 | 0.828 | 0.809 | |
| 0.672 | 0.922 | 0.909 |
Significantly enriched terms (using ToppGene) in the sub-network consisting of the gene pairs that have the 100 largest mutual information values.
| Profile | Category | ID | Name | p-value | Adjusted p-value | Count | Total |
|---|---|---|---|---|---|---|---|
| Disease | ctd:D006949 | Hyperlipidemias | 1.76E-05 | 2.24E-03 | 3 | 13 | |
| Disease | ctd:D009765 | Obesity | 1.98E-03 | 1.21E-02 | 4 | 132 | |
| Disease | ctd:D007333 | Insulin Resistance | 8.96E-03 | 2.42E-02 | 2 | 35 | |
| Disease | ctd:D009369 | Neoplasms | 1.05E-02 | 2.78E-02 | 2 | 38 | |
| Disease | ctd:D002277 | Carcinoma | 1.19E-04 | 1.23E-02 | 8 | 161 | |
| Disease | ctd:D009364 | Neoplasm Recurrence, Local | 1.25E-02 | 4.96E-02 | 3 | 52 | |
| GO:MF | GO:0008528 | G-protein coupled peptide receptor activity | 4.15E-05 | 1.21E-02 | 7 | 120 | |
| GO:MF | GO:0001653 | peptide receptor activity | 4.86E-05 | 1.21E-02 | 7 | 123 | |
| GO:MF | GO:0004942 | anaphylatoxin receptor activity | 1.78E-04 | 1.78E-02 | 2 | 3 | |
| GO:MF | GO:0004948 | calcitonin receptor activity | 1.78E-04 | 1.78E-02 | 2 | 3 | |
| GO:MF | GO:0004800 | thyroxine 5'-deiodinase activity | 1.78E-04 | 1.78E-02 | 2 | 3 | |