| Literature DB >> 31727119 |
Dharmesh D Bhuva1,2, Joseph Cursons1,3, Gordon K Smyth1,2, Melissa J Davis4,5,6.
Abstract
BACKGROUND: Elucidation of regulatory networks, including identification of regulatory mechanisms specific to a given biological context, is a key aim in systems biology. This has motivated the move from co-expression to differential co-expression analysis and numerous methods have been developed subsequently to address this task; however, evaluation of methods and interpretation of the resulting networks has been hindered by the lack of known context-specific regulatory interactions.Entities:
Keywords: Breast cancer; Differential co-expression; Differential networks; Gene regulation; Immune infiltration; Systems modelling
Year: 2019 PMID: 31727119 PMCID: PMC6857226 DOI: 10.1186/s13059-019-1851-8
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Module-based differential co-expression methods
| Method | Module comparison | Module definition | Number of conditions | Citation | Availability |
|---|---|---|---|---|---|
| DICER | Between | De novo | Multiple | [ | jar file available from [ |
| GSCA | Within | Known | Multiple | [ | R package available from [ |
| GSNCA | Within | Known | Two | [ | GSAR v1.18.0 R/Bioconductor package |
| CoGA | Within | Known | Two | [ | R package available from [ |
| DiffCoEx | Both | De novo | Multiple | [ | dcanr v1.0.0 R/Bioconductor package |
| CoXpress | Within | De novo | Two | [ | R package available from [ |
| dCoxS | Within | Known | Two | [ | Supplementary material of the original publication |
| DCIM | Between | De novo | De novo | [ | R package available from [ |
| DiffCorr | Between | Known | Two | [ | DiffCorr v0.4.1 R package available from CRAN |
Network-based differential co-expression analysis methods
| Method | Statistical method | Test | Number of conditions | Citation | Availability |
|---|---|---|---|---|---|
| Correlation | Two | [ | dcanr v1.0.0 R/Bioconductor package | ||
| DGCA | Correlation | Two | [ | DGCA v1.0.1 R package available from CRAN | |
| Discordant | Correlation | Two | [ | discordant v1.8.0 R/Bioconductor package | |
| MAGIC | Correlation | Modulation test | Two | [ | dcanr v1.0.0 R/Bioconductor package |
| DICER | Correlation | Permutation test | Multiple | [ | jar file available from [ |
| DiffCoEx | Correlation | Permutation test | Multiple | [ | dcanr v1.0.0 R/Bioconductor package |
| EBcoexpress | Empirical Bayes + correlation | – | Two | [ | EBcoexpress v1.28.0 R/Bioconductor package |
| Entropy (ENT) | Entropy based on correlation | Permutation test | Two | [ | dcanr v1.0.0 R/Bioconductor package |
| FTGI | Generalised linear model | Chi-squared test | Multiple | [ | dcanr v1.0.0 R/Bioconductor package |
| ECF | Expected conditional F | Permutation test | Multiple | [ | COSINE v2.1 R package available from CRAN |
| COSINE | Expected conditional F | – | Multiple | [ | COSINE v2.1 R package available from CRAN |
| GGM-based | GGM + posterior odds | – | Two | [ | dcanr v1.0.0 R/Bioconductor package |
| LDGM | Latent differential graphical model | – | Two | [ | dcanr v1.1.4 R/Bioconductor package |
| MINDy | Conditional mutual information | Permutation test | Two | [ | MINDy module in GenePattern, dcanr v1.0.0 R/Bioconductor package |
Fig. 1A simple regulatory network demonstrating differential co-expression. a Schematic of the regulatory network. Genes A and B are input genes and co-activate gene C. b Histograms showing the distribution of expression values for A and B across 500 simulated expression profiles. Gene A is always wildtype whereas gene B is knocked down in about half of the samples. c Scatterplot of expression values for A and B. Background shading shows the activation function generated by A and B used to model regulation of C. d Scatterplots of expression values for A and C, knockdown of B (left panel) and B wildtype samples (right panel). Gene A is highly correlated with C (r = 0.716) when B is at wildtype expression levels but uncorrelated with C (r = 0.049) when B is knocked down
Fig. 2Differential co-expression analysis of an example network with 150 genes and 500 samples. a The regulatory network used to simulate the data and the two knockdown genes (KDs) (orange and purple nodes). b A differential co-expression (DC) network inferred from the simulated data using the z-score method. Interactions shown have significantly different correlations between knockdown and wildtype states (FDR < 0.1). Correct predictions for each knockdown as per the “true” differential association network are coloured respectively with false positives in grey. c Three representations of the true co-expression network obtained from a perturbation analysis of the regulatory network. Direct differential interactions are a subset of differential influences which are in turn a subset of differential associations. d Empirical z-transformed correlations for each interaction in the respective “true” networks. The association network shows a similar correlation profile to the direct and influence networks but with added points, as shown for example by the circled points
Fig. 3Most methods tend to infer the association DC network. Performance of 11 DC inference methods and 2 co-expression methods (highlighted in grey) across 812 different simulations with approximately 500 observations sampled. Performance is quantified using the F1 score and is computed for the three different representations of DC networks: direct, influence, and association. Methods are sorted based on the sum of their F1 scores across all simulations and truth networks. For co-expression methods, the difference of co-expression networks generated separately in each condition was taken as the DC network
Fig. 4A DC sub-network in ER− tumours is associated with lymphocyte infiltration. a The DC sub-network with candidate differentially regulated targets DOCK10, HSH2D, and ITGAL, and TFs TFEC, SP140, IKZF1, KLHL6, IRF4, and STAT4. Nodes are coloured based on log fold-change conditioned on ER status and edges coloured based on differences in correlations. Genes are clustered based on the target they are differentially co-expressed with. b A putative regulatory mechanism proposed from the DC network with insights gained from simulations. Dashed lines indicate a potentially indirect yet causal interaction. c Differential association of HSH2D with tumour-infiltrating lymphocytes (TILs) with infiltration estimated from a naïve T cell signature using singscore (left), and from H&E-stained slides (Saltz. Gupta, et al.). Associations indicate that HSH2D is a marker of lymphocyte infiltration specific to basal-like tumours. d correlations of genes in clusters C1-C5 with all transcription factors. The red line indicates a correlation of 0.8, showing stronger co-expression with TFs in the same cluster. e Expression of selected genes in cancer cell lines annotated with cancer sub-type and blood data annotated with immune cell type. Genes in the DC network have high expression in blood and are rarely expressed in cell lines
Network and model properties calculated to characterise simulations
| Label | Description | Type | Mapping |
|---|---|---|---|
| Avg num input TFs | Average number of input genes co-regulating the differentially regulated target. | Network | 1-to-1 |
| Clust coef (g) | Ratio of cliques over all possible cliques in the network. Large values are indicative of small-world networks. Calculated on the undirected regulatory network. | Network | 1-to-1 |
| Clust coef (l) | Average of clustering coefficients calculated per node. Calculated on the undirected network. | Network | 1-to-1 |
| Density diffnet | Network density of the true differential network. Here the differential association network is used. | Network | 1-to-1 |
| Density source | Network density of the source regulatory network. Density is calculated as the ratio of observed edges over the total number of possible edges ( | Network | 1-to-1 |
| Diameter | Calculated as the longest of all shortest paths in the network and is indicative of the linear size of the network. Calculated on the undirected network. | Network | 1-to-1 |
| Eigen centrality | Eigen centrality of each perturbed (knockdown) node. Calculated on the undirected network. | Network | 1-to-many |
| Input means | Mean value of the distribution each input gene | Model | 1-to-many |
| Input vars | Variance of the distribution of each input gene | Model | 1-to-many |
| KD sample props | The smaller proportion of the population resulting from a knockdown. 0.2 if the proportions are 0.2 and 0.8. | Model | 1-to-many |
| Num co-targeted | Number of differentially regulated targets | Network | 1-to-1 |
| Num inputs | Number of input nodes in the source regulatory network. | Network | 1-to-1 |
| Num KD genes | Number of perturbed genes (knockdown). | Model | 1-to-1 |
| Num TFs | Number of regulators in the source regulatory network (both inputs and downstream). | Network | 1-to-1 |
| Radius | Minimum eccentricity of any node where eccentricity of a node is the shortest distance to the farthest node. Calculated on the undirected network. | Network | 1-to-1 |
| Var of input cors | Variance of the correlations between relative abundances of input genes. | Model | 1-to-1 |