| Literature DB >> 27876826 |
Luis F Iglesias-Martinez1, Walter Kolch1,2,3, Tapesh Santra1.
Abstract
Reconstructing gene regulatory networks (GRNs) from gene expression data is a challenging problem. Existing GRN reconstruction algorithms can be broadly divided into model-free and model-based methods. Typically, model-free methods have high accuracy but are computation intensive whereas model-based methods are fast but less accurate. We propose Bayesian Gene Regulation Model Inference (BGRMI), a model-based method for inferring GRNs from time-course gene expression data. BGRMI uses a Bayesian framework to calculate the probability of different models of GRNs and a heuristic search strategy to scan the model space efficiently. Using benchmark datasets, we show that BGRMI has higher/comparable accuracy at a fraction of the computational cost of competing algorithms. Additionally, it can incorporate prior knowledge of potential gene regulation mechanisms and TF hetero-dimerization processes in the GRN reconstruction process. We incorporated existing ChIP-seq data and known protein interactions between TFs in BGRMI as sources of prior knowledge to reconstruct transcription regulatory networks of proliferating and differentiating breast cancer (BC) cells from time-course gene expression data. The reconstructed networks revealed key driver genes of proliferation and differentiation in BC cells. Some of these genes were not previously studied in the context of BC, but may have clinical relevance in BC treatment.Entities:
Mesh:
Year: 2016 PMID: 27876826 PMCID: PMC5120305 DOI: 10.1038/srep37140
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Workflow of the heuristic model search algorithm.
On the first step the marginal likelihood of the null model is calculated. Then each TF is evaluated as a variable independently and only TFs whose marginal likelihood is higher than the null model’s are further expanded. The highest marginal likelihood of the single TF models is selected as a threshold or bound to evaluate the nested models with two TFs.
AUPRs for the DREAM4 Networks.
| Size 10 Networks | |||||||
|---|---|---|---|---|---|---|---|
| BGRMI | Jump3 | GENIE3 | CLR | Inferelator | ScanBMA | G1DBN | |
| Net1 | 0.635 | 0.498 | 0.555 | 0.465 | — | 0.564 | |
| Net2 | 0.396 | 0.351 | 0.447 | 0.443 | — | 0.392 | |
| Net3 | 0.44 | 0.407 | 0.414 | 0.509 | — | 0.499 | |
| Net4 | 0.751 | 0.584 | 0.519 | 0.555 | 0.653 | — | |
| Net5 | 0.615 | 0.646 | 0.787 | 0.637 | — | 0.77 | |
| Average | 0.513 | 0.524 | 0.553 | 0.577 | 0.505 | 0.597 | |
| Net 1 | 0.245 | 0.228 | 0.179 | 0.126 | — | 0.089 | |
| Net2 | 0.11 | 0.096 | 0.109 | 0.101 | — | 0.055 | |
| Net3 | 0.185 | 0.2 | 0.23 | 0.198 | — | 0.155 | |
| Net4 | 0.18 | 0.157 | 0.154 | 0.147 | — | 0.153 | |
| Net5 | 0.154 | 0.168 | 0.163 | 0.148 | — | 0.117 | |
| Average | 0.184 | 0.176 | 0.167 | 0.144 | 0.101 | 0.114 | |
| Overall average | |||||||
| All Nets | 0.35 | 0.35 | 0.36 | 0.3605 | 0.303 | 0.3555 | |
The numbers in bold represent the best performer. The authors of the scanBMA algorithm published only the average AUPR for size 10 and size 100 category, therefore we showed only the average AUPRs for scanBMA.
AUPRs of the In Vivo IRMA Network.
| BGRMI | Jump3 | GENIE3 | CLR | Inferel-ator | Scan BMA | TSN1 | G1DBN | |
|---|---|---|---|---|---|---|---|---|
| Switch -On Dataset | 0.685 | 0.62 | 0.423 | 0.718 | 0.455 | 0.706 | 0.6 | |
| Switch-Off Dataset | 0.574 | 0.347 | 0.372 | 0.649 | 0.232 | 0.511 | 0.313 |
The numbers in bold represent the best performers.
Execution times of the BGRMI algorithm.
| Network | No. of Genes | No. of Observations | No. of Regulators | Running Time |
|---|---|---|---|---|
| IRMA | 5 | ~62 | 5 | 0.03 secs |
| DREAM4 10 | 10 | 105 | 10 | 0.32 secs |
| DREAM4 100 | 100 | 210 | 100 | ~4 mins |
Figure 2Workflow of BGRMI implementation on time course gene expression profiles on human BC cells.
Figure 3The EGF and HRG induced GRN in BC cells and the clinical relevance of some of its transcriptional hubs.
(A,B) EGF and HRG induced GRNs in MCF7 cells with node size proportional to outdegree (number of targets). (C) Kaplan Meier plot for BC patient survival probability for different levels of SIX5 expression. (D) Kaplan Meier plot for survival probabilities of BC patients who underwent endocrine therapy for different levels of CHD2 expression. (E) Kaplan Meier plot for HER2 positive BC patient survival for different levels of RFX5 expression. (F) Kaplan Meier plot for TNBC patients survival probabilities for different levels of RFX5 expression. In (C–F) the red and black curves show survival probabilities for higher and lower expression of the corresponding markers respectively. (G,H) EGF and HRG induced GRNs in MCF7 cells with node size proportional to their betweenness centralities. (I,J) EGF and HRG induced GRNs in MCF7 cells with node size proportional to their page-rank.