| Literature DB >> 20673354 |
Andrew E Teschendorff1, Simone Severini.
Abstract
BACKGROUND: The statistical study of biological networks has led to important novel biological insights, such as the presence of hubs and hierarchical modularity. There is also a growing interest in studying the statistical properties of networks in the context of cancer genomics. However, relatively little is known as to what network features differ between the cancer and normal cell physiologies, or between different cancer cell phenotypes.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20673354 PMCID: PMC2925356 DOI: 10.1186/1752-0509-4-104
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1Local entropy changes in integrated PIN-mRNA networks. In the above hypothetical networks, an edge represents a documented interaction between the corresponding proteins in the PIN. The color of the edge codes for the pairwise correlation in mRNA expression in two different conditions, here non-metastatic and metastatic cancer. In this hypothetical example, gene-1 is positively correlated with maximal value (C = 1) to the first 6 genes, but negatively correlated (C = -1) to the remaining four in the non-metastatic PIN-mRNA network. These values translate to a stochastic positive flux vector p = 1/6 (1, 1, 1, 1, 1, 1, 0, 0, 0, 0) and to an entropy value of 0.78 (Methods). In this hypothetical example, gene-1 is lost/mutated in metastatic cancer, leading to a loss of mRNA expression correlation and anti-correlation with the nearest neighbors. This introduces more disorder/randomness in the local flux distribution, illustrated here by zero correlation values. The resulting local entropy takes on a maximum value of 1, and so for this node there is a large increase in the local entropy which is statistically significant. In contrast, a Wilcoxon rank sum test between the correlation values C = (1, 1, 1, 1, 1, 1, -1, -1, -1, -1) and C = (0, 0, 0, 0, 0, 0, 0, 0, 0, 0) yields a non-significant P-value of 0.2.
Figure 2Local entropy is increased in metastatic breast cancer. A) Comparison of local entropies for the 1903 proteins with degrees ≥ 10 between the non-metastatic, metastatic and random weighted networks. The entropies in the metastatic network exhibit significantly higher values than those in the non-metastatic network: P-value given is from a one-tailed paired Wilcoxon rank sum test. Both non-metastatic and metastatic networks show significantly lower entropies than those of a purely random network obtained by randomisation of expression profiles. B) Differential entropy values (metastatic minus non-metastatic) are significantly greater than zero for 10 different choices of non-metastatic networks obtained by bootstrapping samples. One-tailed paired Wilcoxon rank sum test P-values are given. C) The expected variation in differential entropy under the null distribution against node degree. The green-line is a non-linear least squares fit of a power-law function of the form a/kwhere k is the node degree. Estimated parameter values are â = 0.0086; = 0.08411. D) Histogram of P-values of genes (nodes). P-values were estimated by comparing observed differential entropy values to those expected under the null using the variance estimates from C).
Figure 3Increased entropy in metastatic breast cancer is independent of breast cancer cohort. A) Comparison of local entropies for the proteins with degrees ≥ 10 between the non-metastatic, metastatic and random weighted networks in the three validation cohorts: Mainz (1900 genes), Frid (1270 genes) and LoiUnt (1899 genes). The entropies in the metastatic network exhibit significantly higher values than those in the non-metastatic network: P-value given is from a one-tailed paired Wilcoxon rank sum test. Both non-metastatic and metastatic networks show significantly lower entropies than those of a purely random network obtained by randomisation of expression profiles. B) The top 200 genes with most significant P-values (FDR < 0.001) as identified in the EMC (discovery) data set are grouped according to increases in entropy (1) or decreases (-1). The numbers of genes within each group and represented in the validation set are given just above x-axis. The y-axis labels the statistics (dS/σ) of these genes in the corresponding validation set, and the boxplot allows a direct comparison to be made. The P-value is from a one-tailed Wilcox rank sum test as the alternative hypothesis is that the statistics should be larger for those genes identified to undergo increases in entropy in the discovery set.
One tailed paired Wilcoxon test P-values comparing distribution of local measures of disruption in information flow in metastatic and non-metastatic PIN-mRNA networks across four different breast cancer cohorts.
| Metric | EMC ( | Mainz ( | Frid ( | LoiUnt ( |
|---|---|---|---|---|
| 0.001 | 0.01 | 0.54 | 2e-13 | |
| 1e-16 | 5e-10 | 1e-12 | 3e-70 | |
| 2e-16 | 6e-13 | <1e-100 | <1e-100 |
denotes the difference in mean local correlation, denotes the difference in mean local absolute correlation and dS denotes the differential local entropy. In the case of the non-metastatic networks, values were averaged over 10 distinct bootstraps before computation of the P-values. The number of pairs (nodes) in the test, n, corresponding to the number of nodes in the network with degree ≥ 10 are given.
Figure 4Local entropy better characterises the metastatic network. A) For each of the four breast cancer cohorts, we count the number of genes showing increases (grey) and decreases (black) in local entropy (S), negative local mean correlation (D), and heterogeneity (negative local mean absolute correlation) H in the metastatic PIN-mRNA networks. In all cases, nodes of degree ≥ 10 were selected. B) Corresponding -log10(p-values) from a one-tailed binomial test. The line -log10(0.05) is shown in green.
Figure 5Biological subnetworks exhibiting significant increases in entropy. We contrast the integrated PIN-mRNA metastatic and non-metastatic networks for four nodes/genes exhibiting significant increases in entropy and related to tumour suppressor pathways and cancer hallmarks: MYBL2 and cell-cycle, IGFBP7 and IGF-signalling, IL2RB and IL2 immune-mediated tumour suppression, BCL2 and apoptosis. In each case, we only depict the nearest neighbors (interacting protein partners) of the selected nodes. The edge color shows the strength of the Pearson correlation C in expression between the two genes across the given phenotype: bright red (0.5
Gene Set Enrichment Analysis of the top 200 genes of which 133 showed increases in entropy.
| Pathway | P (dS > 0) | P(dS < 0) | Example genes |
|---|---|---|---|
| Apoptosis | 6e-7 | n.s | |
| IL2 | 6e-4 | n.s | |
| AR | 8e-4 | n.s | |
| IGF1 | 9e-4 | n.s |
While we observed enrichment of biological pathways among genes showing increases in entropy, there was none among the 67 genes showing decreases. P-values of enrichment (one tailed Fisher's exact test) against genes showing entropy increases (dS > 0) and decreases are given (dS < 0) and were calculated using all nodes of degree ≥ 10 as reference (1903 genes) to avoid intrinsic literature bias. n.s = not significant