| Literature DB >> 28259158 |
Raghvendra Mall1, Luigi Cerulo2,3, Halima Bensmail4, Antonio Iavarone5, Michele Ceccarelli6,7.
Abstract
BACKGROUND: Biological networks contribute effectively to unveil the complex structure of molecular interactions and to discover driver genes especially in cancer context. It can happen that due to gene mutations, as for example when cancer progresses, the gene expression network undergoes some amount of localized re-wiring. The ability to detect statistical relevant changes in the interaction patterns induced by the progression of the disease can lead to the discovery of novel relevant signatures. Several procedures have been recently proposed to detect sub-network differences in pairwise labeled weighted networks.Entities:
Keywords: Differential networks; Gene regulatory network inference; Master regulators
Mesh:
Substances:
Year: 2017 PMID: 28259158 PMCID: PMC5336651 DOI: 10.1186/s12918-017-0412-6
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Time complexity comparison
| dGHD | Closed-form |
|---|---|
|
|
|
Here K represents the number of nodes for which p-value is greater than θ and generally K≪N. An important remark is that the cGHD calculation after removal of each node can be done independently in parallel. So, in case we have T processors, the complexity of the proposed approach will reduce ≈ linearly w.r.t. T
Fig. 1Correlation between topological overlap and cosine similarity on 250 random networks
Fig. 2Sensitivity Analysis of Parameter θ. The boxplots represents the distribution of True Positive Rate (TPR) identified by Closed-Form approach for 100 random runs of the experiment
Fig. 3Comparison of proposed Closed-Form approach with dGHD algorithm. Figure a and b correspond to the ROC and PR plot for permuted sub-network (d=0.15) respectively. Figure c and d represents the ROC and PR plot corresponding to denser sub-network (d=0.3 and d ′=0.5) respectively. Clearly, the Closed-Form technique has better performance than the dGHD algorithm
Comparison of proposed Closed-Form (CF) approach with dGHD algorithm. We compared the proposed Closed-Form approach with dGHD, Louvain, Infomap and Spinglass techniques w.r.t. various evaluation metrics for random geometric (RG) and power law (PL) networks
| Parameters | Method | AUC_ROC | Precision | Recall | Accuracy | Specificity | Kappa | Time |
|---|---|---|---|---|---|---|---|---|
| Mean ± Sd | Mean ± Sd | Mean ± Sd | Mean ± Sd | Mean ± Sd | Mean ± Sd | Mean | ||
|
|
| 0.935 ± 0.051 |
| 0.846 ± 0.102 |
|
| 0.828 ± 0.068 | 0.078 |
|
|
| 0.926 ± 0.018 | 0.793 ± 0.021 | 0.878 ± 0.036 | 0.965 ± 0.005 | 0.974 ± 0.003 | 0.813 ± 0.026 | 1.0 |
|
|
|
| 0.767 ± 0.052 |
| 0.965 ± 0.028 | 0.960 ± 0.031 |
|
|
|
|
| 0.843 ± 0.012 | 0.262 ± 0.015 |
| 0.718 ± 0.022 | 0.685 ± 0.024 | 0.304 ± 0.024 | 0.018 |
|
|
| 0.832 ± 0.011 | 0.249 ± 0.012 |
| 0.699 ± 0.018 | 0.665 ± 0.021 | 0.285 ± 0.020 | 0.85 |
|
|
| 0.927 ± 0.048 | 0.839 ± 0.031 | 0.862 ± 0.098 | 0.969 ± 0.008 |
| 0.825 ± 0.054 | 0.081 |
|
|
| 0.922 ± 0.022 | 0.806 ± 0.027 | 0.868 ± 0.045 | 0.966 ± 0.006 | 0.977 ± 0.004 | 0.816 ± 0.032 | 1.0 |
|
|
|
|
| 0.974 ± 0.042 |
|
|
|
|
|
|
| 0.849 ± 0.008 | 0.269 ± 0.009 |
| 0.728 ± 0.015 | 0.698 ± 0.016 | 0.316 ± 0.016 | 0.020 |
|
|
| 0.859 ± 0.009 | 0.284 ± 0.013 |
| 0.747 ± 0.016 | 0.719 ± 0.017 | 0.339 ± 0.019 | 0.92 |
|
|
|
|
| 0.789 ± 0.135 |
|
|
| 0.083 |
|
|
| 0.724 ± 0.029 | 0.645 ± 0.049 | 0.577 ± 0.059 | 0.921 ± 0.007 | 0.971 ± 0.006 | 0.504 ± 0.051 | 1.0 |
|
|
| 0.866 ± 0.019 | 0.406 ± 0.061 |
| 0.850 ± 0.034 | 0.833 ± 0.038 | 0.505 ± 0.072 |
|
|
|
| 0.677 ± 0.011 | 0.147 ± 0.004 |
| 0.419 ± 0.019 | 0.354 ± 0.022 | 0.100 ± 0.008 | 0.021 |
|
|
| 0.678 ± 0.011 | 0.148 ± 0.004 |
| 0.420 ± 0.018 | 0.355 ± 0.021 | 0.100 ± 0.008 | 0.90 |
|
|
|
|
| 0.930 ± 0.082 |
|
|
| 0.09 |
|
|
| 0.848 ± 0.071 | 0.700 ± 0.038 | 0.731 ± 0.148 | 0.941 ± 0.010 | 0.964 ± 0.009 | 0.672 ± 0.078 | 1.0 |
|
|
| 0.932 ± 0.029 | 0.478 ± 0.118 |
| 0.879 ± 0.054 | 0.866 ± 0.059 | 0.582 ± 0.128 |
|
|
|
| 0.674 ± 0.010 | 0.145 ± 0.004 |
| 0.413 ± 0.018 | 0.348 ± 0.020 | 0.097 ± 0.008 | 0.023 |
|
|
| 0.711 ± 0.007 | 0.162 ± 0.003 |
| 0.481 ± 0.013 | 0.423 ± 0.014 | 0.128 ± 0.006 | 0.94 |
|
|
|
|
| 0.792 ± 0.099 |
|
|
| 0.09 |
|
|
|
| 0.294 ± 0.009 | 0.809 ± 0.027 | 0.787 ± 0.008 | 0.333 ± 0.015 | 0.784 ± 0.009 | 1.0 |
|
|
| 0.780 ± 0.014 | 0.212 ± 0.010 |
| 0.703 ± 0.018 | 0.272 ± 0.016 | 0.690 ± 0.011 |
|
|
|
| 0.665 ± 0.013 | 0.141 ± 0.004 |
| 0.603 ± 0.018 | 0.162 ± 0.012 | 0.484 ± 0.019 | 0.026 |
|
|
| 0.687 ± 0.014 | 0.153 ± 0.006 |
| 0.645 ± 0.021 | 0.194 ± 0.011 | 0.527 ± 0.016 | 0.90 |
|
|
|
|
| 0.825 ± 0.035 |
|
|
| 0.085 |
|
|
| 0.808 ± 0.027 | 0.327 ± 0.018 | 0.799 ± 0.050 | 0.816 ± 0.008 | 0.375 ± 0.031 | 0.817 ± 0.004 | 1.0 |
|
|
| 0.774 ± 0.015 | 0.233 ± 0.011 |
| 0.736 ± 0.019 | 0.301 ± 0.009 | 0.732 ± 0.019 |
|
|
|
| 0.670 ± 0.014 | 0.168 ± 0.005 |
| 0.635 ± 0.017 | 0.210 ± 0.014 | 0.532 ± 0.014 | 0.027 |
|
|
| 0.694 ± 0.013 | 0.179 ± 0.007 |
| 0.670 ± 0.023 | 0.232 ± 0.012 | 0.571 ± 0.017 | 0.94 |
Bold represents the best results
Fig. 4Comparison of proposed Closed-Form approach with dGHD method w.r.t. AUC and AUC for 100 random runs of the experiment. These metrics are calculated using p-value 0.01 as cut-off. Figure a and b correspond to the AUC and AUC for permuted sub-network (d=0.15) respectively. Figure c and d represents the AUC and AUC corresponding to denser sub-network (d=0.3 and d ′=0.5) respectively
Fig. 5Differential sub-networks between IDH-mutant and IDH wild-type detected by the closed form approach. In red the connection present only in the IDH-mutant sub-network, while in green those present only in the IDH-wild-type sub-network. In black are represented common connections
The top most different transcription factors detected between IDH-mutant and IDH-wildtype in the TCGA dataset
| TF | Z-score | GHD |
| MRA fdr |
|---|---|---|---|---|
| FOXD3 | 0.000 | 1.000 | 1.000 | 1.000E+00 |
| FOXJ3 | 0.000 | 1.000 | 1.000 | 8.442E-03 |
| MLX | 0.000 | 1.000 | 1.000 | 8.075E-01 |
| NFIA | 0.000 | 1.000 | 1.000 | 4.502E-01 |
| ETV1 | 0.062 | 0.058 | 0.058 | 1.000E+00 |
| E2F1 | 0.085 | 0.058 | 0.058 | 1.007E-01 |
| CREB1 | 0.208 | 0.058 | 0.058 | 8.580E-01 |
| SOX10 | 0.234 | 0.058 | 0.058 | 8.442E-03 |
| KLF13 | 0.338 | 1.000 | 0.278 | 1.240E-02 |
| STAT3 | 0.354 | 0.058 | 0.058 | 8.442E-03 |
| RUNX3 | 0.387 | 0.058 | 0.059 | 1.671E-02 |
| IRF3 | 0.406 | 0.840 | 0.455 | 8.442E-03 |
| ZNF354C | 0.498 | 0.058 | 0.057 | 1.000E+00 |
| HOXD13 | 0.540 | 0.059 | 0.059 | 2.492E-01 |
| ZIC1 | 0.622 | 0.058 | 0.058 | 5.787E-02 |
| HOXA2 | 0.700 | 0.059 | 0.059 | 1.405E-01 |
| FOXO1 | 0.743 | 0.058 | 0.058 | 8.183E-02 |
| MAFG | 0.817 | 0.862 | 0.467 | 6.857E-01 |
| RFX1 | 0.865 | 0.059 | 0.059 | 3.131E-01 |
| NR1H2 | 0.871 | 0.058 | 0.058 | 8.176E-01 |
| PAX6 | 1.003 | 0.058 | 0.057 | 4.147E-01 |
| GLIS2 | 1.035 | 0.058 | 0.059 | 8.442E-03 |
| NR4A2 | 1.118 | 0.058 | 0.058 | 1.000E+00 |
| STAT4 | 1.137 | 0.848 | 0.486 | 9.615E-01 |
| DLX6 | 1.208 | 0.058 | 0.059 | 1.000E+00 |
| SIX4 | 1.232 | 0.058 | 0.058 | 1.000E+00 |
| MEF2D | 1.379 | 0.058 | 0.059 | 8.442E-03 |
| MTF1 | 1.388 | 0.058 | 0.057 | 1.000E+00 |
| MBD2 | 1.480 | 0.820 | 0.495 | 1.969E-01 |
| OTP | 1.493 | 0.058 | 0.057 | 2.970E-01 |
| ETV4 | 1.529 | 0.059 | 0.059 | 2.122E-01 |
| ZBTB12 | 1.566 | 0.194 | 0.189 | 4.255E-02 |
| HOXB4 | 1.595 | 0.058 | 0.057 | 3.019E-01 |
| PLAG1 | 1.622 | 0.195 | 0.190 | 3.434E-01 |
| E2F6 | 1.668 | 0.197 | 0.192 | 8.442E-03 |
| CREM | 1.674 | 0.765 | 0.506 | 2.122E-01 |
| IRF9 | 1.700 | 0.058 | 0.057 | 5.950E-02 |
| KLF6 | 1.709 | 0.059 | 0.059 | 8.442E-03 |
| TFE3 | 1.716 | 0.199 | 0.193 | 1.049E-01 |
| HSF2 | 1.759 | 0.201 | 0.195 | 1.671E-02 |
| NR2C1 | 1.800 | 0.058 | 0.058 | 2.122E-01 |
| ONECUT2 | 1.804 | 0.202 | 0.196 | 3.657E-02 |
| HOXD3 | 1.847 | 0.204 | 0.198 | 1.000E+00 |
| BACH1 | 1.888 | 0.058 | 0.059 | 2.897E-01 |
| GSX1 | 1.895 | 0.207 | 0.200 | 1.000E+00 |
| HOXA13 | 1.930 | 0.058 | 0.057 | 1.000E+00 |
| VAX2 | 1.937 | 0.208 | 0.201 | 1.609E-01 |
The columns reports the differential measures in terms of Z-score of the proposed differencing test (Eq. (2)), the GHD computed between the two networks, the mean of the null GHD distribution. The last column reports the False Discovery Rate of the GSEA enrichment obtained with a Master Regulator Analysis
The top most different transcription factors detected between IDH-mutant and IDH-wildtype in the REMBRANDT dataset
| TF | Z-score | GHD |
| MRA fdr |
|---|---|---|---|---|
| MGA | 0.000 | 1.000 | 1.000 | 2.166E-03 |
| TEAD1 | 0.000 | 1.000 | 1.000 | 8.017E-04 |
| FOS | 0.000 | 1.000 | 1.000 | 5.137E-04 |
| JUNB | 0.000 | 1.000 | 1.000 | 5.137E-04 |
| MEF2C | 0.015 | 0.015 | 0.015 | 8.001E-04 |
| LEF1 | 0.058 | 0.014 | 0.014 | 5.137E-04 |
| NEUROD2 | 0.096 | 0.016 | 0.016 | 1.221E-03 |
| EGR2 | 0.110 | 0.013 | 0.013 | 6.263E-03 |
| JUN | 0.123 | 0.333 | 0.500 | 5.137E-04 |
| ARX | 0.144 | 0.012 | 0.012 | 9.301E-02 |
| BBX | 0.173 | 0.012 | 0.012 | 7.333E-04 |
| TCF3 | 0.198 | 0.011 | 0.011 | 5.137E-04 |
| LHX6 | 0.205 | 0.017 | 0.017 | 8.492E-04 |
| EGR1 | 0.211 | 0.011 | 0.011 | 9.696E-03 |
| BCL6B | 0.214 | 0.011 | 0.011 | 5.137E-04 |
| E2F2 | 0.217 | 0.011 | 0.011 | 7.786E-04 |
| E2F7 | 0.220 | 0.012 | 0.012 | 5.137E-04 |
| E2F8 | 0.223 | 0.012 | 0.012 | 5.137E-04 |
| ELF4 | 0.226 | 0.012 | 0.012 | 5.137E-04 |
| ETV5 | 0.229 | 0.013 | 0.012 | 5.137E-04 |
| FLI1 | 0.232 | 0.013 | 0.013 | 5.137E-04 |
| FOXG1 | 0.235 | 0.013 | 0.013 | 1.000E+00 |
| HOXD9 | 0.239 | 0.014 | 0.014 | 9.728E-04 |
| ID4 | 0.242 | 0.014 | 0.014 | 7.786E-04 |
| IRF8 | 0.246 | 0.014 | 0.014 | 2.393E-02 |
| MYBL2 | 0.250 | 0.015 | 0.015 | 5.137E-04 |
| NFIA | 0.254 | 0.015 | 0.015 | 4.085E-03 |
| NFIB | 0.258 | 0.016 | 0.016 | 7.796E-04 |
| KLF13 | 0.258 | 0.360 | 0.515 | 8.001E-04 |
| OLIG2 | 0.262 | 0.016 | 0.016 | 7.893E-02 |
| PROX1 | 0.266 | 0.017 | 0.017 | 1.020E-02 |
| SOX2 | 0.270 | 0.017 | 0.017 | 2.995E-03 |
| TEF | 0.275 | 0.018 | 0.018 | 8.221E-04 |
| ZBTB7A | 0.280 | 0.019 | 0.018 | 7.700E-04 |
| ZIC1 | 0.284 | 0.019 | 0.019 | 7.700E-01 |
| SOX13 | 0.295 | 0.021 | 0.020 | 8.086E-04 |
| TCF7L2 | 0.300 | 0.021 | 0.021 | 7.487E-04 |
| BCL6 | 0.305 | 0.022 | 0.022 | 5.137E-04 |
| MAF | 0.317 | 0.024 | 0.024 | 5.137E-04 |
| CEBPB | 0.330 | 0.024 | 0.024 | 5.137E-04 |
| CEBPD | 0.337 | 0.025 | 0.025 | 5.137E-04 |
| HLF | 0.344 | 0.018 | 0.018 | 3.029E-03 |
| ELK1 | 0.349 | 0.025 | 0.025 | 8.017E-04 |
| FOXJ3 | 0.369 | 0.027 | 0.026 | 5.137E-04 |
| MTF1 | 0.377 | 0.028 | 0.027 | 5.137E-04 |
| TP53 | 0.388 | 0.028 | 0.028 | 5.137E-04 |
| GABPA | 0.407 | 0.030 | 0.029 | 5.137E-04 |
| CDC5L | 0.417 | 0.031 | 0.031 | 7.899E-04 |
| RORA | 0.422 | 0.329 | 0.467 | 7.796E-04 |
| IRF9 | 0.426 | 0.031 | 0.031 | 3.062E-03 |
| STAT1 | 0.437 | 0.033 | 0.032 | 5.137E-04 |
| CREB1 | 0.456 | 0.035 | 0.034 | 5.137E-04 |
| SOX10 | 0.462 | 0.036 | 0.035 | 8.250E-04 |
| HOXD1 | 0.475 | 0.038 | 0.037 | 5.137E-04 |
| SOX8 | 0.479 | 0.038 | 0.037 | 1.760E-03 |
| HOXD11 | 0.480 | 0.047 | 0.046 | 2.975E-02 |
| NR2F2 | 0.490 | 0.042 | 0.041 | 5.186E-04 |
| DLX1 | 0.491 | 0.046 | 0.045 | 7.700E-04 |
| TCF12 | 0.493 | 0.040 | 0.040 | 9.117E-04 |
| THRB | 0.495 | 0.051 | 0.050 | 9.850E-04 |
| DLX2 | 0.496 | 0.045 | 0.044 | 8.492E-04 |
| HOXD10 | 0.498 | 0.050 | 0.049 | 5.137E-04 |
| ATF5 | 0.505 | 0.057 | 0.055 | 5.137E-04 |
| STAT4 | 0.515 | 0.055 | 0.054 | 9.220E-04 |
| TBR1 | 0.519 | 0.020 | 0.020 | 9.272E-04 |
| MESP1 | 0.521 | 0.092 | 0.087 | 8.746E-04 |
| POU3F2 | 0.523 | 0.063 | 0.061 | 5.137E-04 |
| TFEC | 0.530 | 0.082 | 0.079 | 5.137E-04 |
| TCF4 | 0.533 | 0.071 | 0.069 | 7.487E-04 |
| ETS2 | 0.543 | 0.176 | 0.163 | 9.728E-04 |
| CREM | 0.558 | 0.110 | 0.104 | 5.140E-04 |
| TP63 | 0.561 | 0.105 | 0.099 | 9.220E-04 |
| STAT6 | 0.563 | 0.091 | 0.087 | 5.137E-04 |
| NPAS2 | 0.575 | 0.136 | 0.127 | 1.889E-01 |
| GLI3 | 0.601 | 0.313 | 0.455 | 4.663E-02 |
The columns reports the differential measures in terms of Z-score of the proposed differencing test (Eq. (2)), the GHD computed between the two networks, the mean of the null GHD distribution. The last column reports the False Discovery Rate of the GSEA enrichment obtained with a Master Regulator Analysis