| Literature DB >> 22689766 |
Christopher A Penfold1, Vicky Buchanan-Wollaston, Katherine J Denby, David L Wild.
Abstract
MOTIVATION: The generation of time series transcriptomic datasets collected under multiple experimental conditions has proven to be a powerful approach for disentangling complex biological processes, allowing for the reverse engineering of gene regulatory networks (GRNs). Most methods for reverse engineering GRNs from multiple datasets assume that each of the time series were generated from networks with identical topology. In this study, we outline a hierarchical, non-parametric Bayesian approach for reverse engineering GRNs using multiple time series that can be applied in a number of novel situations including: (i) where different, but overlapping sets of transcription factors are expected to bind in the different experimental conditions; that is, where switching events could potentially arise under the different treatments and (ii) for inference in evolutionary related species in which orthologous GRNs exist. More generally, the method can be used to identify context-specific regulation by leveraging time series gene expression data alongside methods that can identify putative lists of transcription factors or transcription factor targets.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22689766 PMCID: PMC3371854 DOI: 10.1093/bioinformatics/bts222
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Graphical model representation of the CSI framework for node i. (A) In the hierarchical framework, a network structure, a, is inferred for each of the D datasets, with each possessing a unique hyperparameter θ for the system dynamics, where j={1,…, D}. The structure of the hypernetwork, a*, is also inferred along with a set of hyperparameters β controlling the influence of the individual parents on the hypernetwork. When β=0 the hypernetwork is independent of the ith network, while β≫0 induces a strong coupling between the ith parent and hypernetwork. (B) Inference using the standard CSI algorithm identifies a parent structure, a, for the union of all datasets, as well as the hyperparameter, θ, for the GP model of the system dynamics
Average AUROC and average AUPR with standard deviation shown in brackets to three decimal places for the 10 gene DREAM4 networks with and without a hypernetwork constraints
| Dataset | Pert(s). | AUROC | AUPR curve | ||||
|---|---|---|---|---|---|---|---|
| hCSI-Gibbs | hCSI-MH | CSI-Gibbs | hCSI-Gibbs | hCSI-MH | CSI-Gibbs | ||
| 1 | 1 | 0.66 (0.005) | 0.36 (0.011) | ||||
| 2 | 0.65 (0.006) | 0.26 (0.004) | |||||
| 3 | 0.64 (0.004) | 0.27 (0.024) | |||||
| 4 | 0.53 (0.004) | 0.16 (0.002) | |||||
| 5 | 0.73 (0.013) | 0.72 (0.005) | 0.28 (0.024) | 0.28 (0.009) | |||
| 2 | 1 | 0.68 (0.006) | 0.32 (0.006) | ||||
| 2 | 0.71 (0.003) | 0.34 (0.016) | |||||
| 3 | 0.62 (0.004) | 0.26 (0.002) | |||||
| 4 | 0.73 (0.012)} | 0.73 (0.015) | 0.74 (0.005) | 0.43 (0.003) | |||
| 5 | 0.61 (0.005) | 0.24 (0.007) | |||||
| 3 | 1 | 0.70 (0.004) | 0.29 (0.003) | ||||
| 2 | 0.69 (0.004) | 0.33 (0.015) | |||||
| 3 | 0.72 (0.004) | 0.26 (0.004) | |||||
| 4 | 0.69 (0.007) | 0.27 (0.005) | |||||
| 5 | 0.67 (0.004) | 0.36 (0.005) | |||||
| 4 | 1 | 0.71 (0.005) | 0.46 (0.015) | ||||
| 2 | 0.65 (0.005) | 0.38 (0.008) | |||||
| 3 | 0.67 (0.005) | 0.33 (0.005) | |||||
| 4 | 0.64 (0.005) | 0.37 (0.005) | |||||
| 5 | 0.76 (0.006) | 0.41 (0.017) | |||||
| 5 | 1 | 0.74 (0.003) | 0.33 (0.015) | ||||
| 2 | 0.79 (0.003) | 0.42 (0.005) | |||||
| 3 | 0.78 (0.003) | 0.31 (0.006) | |||||
| 4 | 0.82 (0.004) | 0.28 (0.005) | |||||
| 5 | 0.75 (0.006) | 0.37 (0.020) | |||||
An independent Gamma prior is placed over each of the Gaussian process hyperparameters θ~Γ(10, 0.1). For inference with hypernetworks, an independent Gamma prior is placed over the individual temperature parameters, β~Γ(1, 1). Values in bold indicate the score is both statistically significantly different from that achieved using a standard CSI-Gibbs (third and sixth columns) algorithm according to a Wilcoxon rank-sum test (P<0.01), and shows improved performance.
aCases marked indicate scores that were both statistically significantly different and showed worse performance in the hierarchical modelling compared with the standard implementation.
Average AUROC and average AUPR with standard deviation shown in brackets to three decimal places for the 10 gene DREAM4 networks with and without a hypernetwork constraints
| Dataset | Pert(s). | ||||||
|---|---|---|---|---|---|---|---|
| hCSI-Gibbs | hCSI-MH | CSI-Gibbs | hCSI-Gibbs | hCSI-MH | CSI-Gibbs | ||
| 1 & 2 | 1−5 | 0.77 (0.009) | 0.77 (0.014) | 0.78 (0.006) | 0.55 (0.006) | ||
| 1−5 | 0.77 0.009) | 0.77 (0.015) | 0.78 (0.008) | 0.60 (0.005) | 0.60 (0.007) | ||
| 1 & 3 | 1−5 | 0.77 (0.009) | 0.77 (0.015) | 0.78 (0.006) | 0.54 (0.006) | 0.55 (0.006) | |
| 1−5 | 0.70 (0.004) | 0.52 (0.006) | 0.52 (0.010) | 0.53 (0.003) | |||
| 1 & 4 | 1−5 | 0.78 (0.011) | 0.78 (0.018) | 0.78 (0.006) | 0.55 (0.007) | 0.55 (0.006) | |
| 1−5 | 0.76 (0.013) | 0.76 (0.020) | 0.76 (0.008) | 0.62 (0.016) | 0.62 (0.008) | ||
| 1 & 5 | 1−5 | 0.78 (0.011) | 0.77 (0.015) | 0.78 (0.006) | 0.55 (0.008) | 0.55 (0.006) | |
| 1−5 | 0.87 (0.012) | 0.87 (0.017) | 0.88 (0.008) | 0.71 (0.018) | 0.72 (0.022) | 0.72 (0.019) | |
| 2 & 3 | 1−5 | 0.78 (0.015) | 0.78 (0.008) | 0.60 (0.011) | 0.60 (0.007) | ||
| 1−5 | 0.71 (0.008) | 0.71 (0.016) | 0.70 (0.004) | 0.53 (0.005) | 0.52 (0.011) | 0.53 (0.003) | |
| 2 & 4 | 1−5 | 0.78 (0.018) | 0.78 (0.008) | 0.60 (0.007) | 0.60 (0.007) | ||
| 1−5 | 0.76 (0.022) | 0.76 (0.008) | 0.62 (0.013) | 0.62 (0.008) | |||
| 2 & 5 | 1−5 | 0.78 (0.010) | 0.78 (0.017) | 0.78 (0.008) | 0.60 (0.006) | 0.60 (0.010) | 0.60 (0.007) |
| 1−5 | 0.87 (0.013) | 0.87 (0.014) | 0.88 (0.008) | 0.71 (0.017) | 0.72 (0.021) | 0.72 (0.019) | |
| 3 & 4 | 1−5 | 0.70 (0.004) | 0.52 (0.013) | 0.53 (0.003) | |||
| 1−5 | 0.76 (0.019) | 0.76 (0.008) | 0.62 (0.015) | 0.62 (0.008) | |||
| 3 & 5 | 1−5 | 0.70 (0.010) | 0.71 (0.015) | 0.70 (0.004) | 0.53 (0.006) | 0.52 (0.013) | 0.53 (0.003) |
| 1−5 | 0.87 (0.010) | 0.87 (0.014) | 0.88 (0.008) | 0.71 (0.015) | 0.72 (0.021) | 0.72 (0.019) | |
| 4 & 5 | 1−5 | 0.76 (0.014) | 0.75 (0.018) | 0.76 (0.008) | 0.62 (0.015) | 0.62 (0.008) | |
| 1−5 | 0.88 (0.013) | 0.87 (0.012) | 0.88 (0.008) | 0.72 (0.016) | 0.72 (0.019) | ||
An independent Gamma prior is placed over each of the Gaussian process hyperparameters θ~Γ(10, 0.1). For inference with hypernetworks, an independent Gamma prior is placed over the individual temperature parameters, β~Γ(1, 1). Values in bold indicate the score is both statistically significantly different from than that achieved using a standard CSI-Gibbs algorithm (third and sixth columns) according to a Wilcoxon rank-sum test (P<0.01), and shows improved performance.
aCases marked indicate scores that were both statistically significantly different and showed worse performance in the hierarchical modelling compared with the standard implementation.
Fig. 2.(A). Heat map indicating the probability of TFs influencing RD29A expression. Here, dark blue indicates a low probability of that TF being influential in the time series expression dataset, whereas dark red indicates a high probability of the TF being influential. (B). Inferred temperature parameters for each of the experiments compared with samples generated from the prior distribution of temperature. (C). Unsigned stress signalling network adapted from Tsutsui ). Here question marks indicate speculative or indirect links that have not been verified via Y1H