| Literature DB >> 27547217 |
Abstract
Simulation study in systems biology involving computational experiments dealing with Wnt signaling pathways abound in literature but often lack a pedagogical perspective that might ease the understanding of beginner students and researchers in transition, who intend to work on the modeling of the pathway. This paucity might happen due to restrictive business policies which enforce an unwanted embargo on the sharing of important scientific knowledge. A tutorial introduction to computational modeling of Wnt signaling pathway in a human colorectal cancer dataset using static Bayesian network models is provided. The walkthrough might aid biologists/informaticians in understanding the design of computational experiments that is interleaved with exposition of the Matlab code and causal models from Bayesian network toolbox. The manuscript elucidates the coding contents of the advance article by Sinha (Integr. Biol. 6:1034-1048, 2014) and takes the reader in a step-by-step process of how (a) the collection and the transformation of the available biological information from literature is done, (b) the integration of the heterogeneous data and prior biological knowledge in the network is achieved, (c) the simulation study is designed, (d) the hypothesis regarding a biological phenomena is transformed into computational framework, and (e) results and inferences drawn using d-connectivity/separability are reported. The manuscript finally ends with a programming assignment to help the readers get hands-on experience of a perturbation project. Description of Matlab files is made available under GNU GPL v3 license at the Google code project on https://code.google.com/p/static-bn-for-wnt-signaling-pathway and https: //sites.google.com/site/shriprakashsinha/shriprakashsinha/projects/static-bn-for-wnt-signaling-pathway. Latest updates can be found in the latter website.Entities:
Keywords: Bayesian network; Epigenetic information; Heterogeneous data integration; Hypothesis testing; Inference; Prior biological knowledge; Wnt signaling pathway
Year: 2016 PMID: 27547217 PMCID: PMC4977324 DOI: 10.1186/s13637-016-0044-y
Source DB: PubMed Journal: EURASIP J Bioinform Syst Biol ISSN: 1687-4145
Canonical Wnt pathway from [1]
| Canonical Wnt signaling pathway. The canonical Wnt signaling
pathway is a transduction mechanism that contributes to embryo
development and controls homeostatic self-renewal in several tissues
[ |
Fig. 1A cartoon of Wnt signaling pathway contributed by [3]. Part a represents the destruction of β-catenin leading to the inactivation of the Wnt target gene. Part b represents activation of Wnt target gene
Bayesian networks from [1]
| Bayesian networks. In reverse engineering methods for control
networks [ |
| The Bayesian networks work by estimating the posterior
probability of the model given the dataset. This estimation is usually
referred to as the Bayesian score of the model conditioned on the
dataset. Mathematically, let |
|
|
| Thus the Bayesian score is computed by evaluating the
|
|
|
| Finally, the likelihood of the function can be evaluated by
averaging over all possible local conditional distributions
parameterized by |
|
|
| Work on biological systems that make use of Bayesian networks
can also be found in [ |
Epigenetic factors from [1]
| Epigenetic factors. One of the widely studied epigenetic
factors is methylation [ |
Bayesian Wnt pathway from [1]
| Bayesian Wnt pathway. Three static models have been developed
based on particular gene set measured for human colorectal cancer cases
[ |
Network with PBK+EI from [1]
| Network with PBK and EI the NB model [ |
| Roles of |
Network with PBK+EI continued from [1]
| Network with PBK and EI continued … Lastly, it is known that
concentration of |
| In order to understand indirect connections further, it is
imperative to know about |
| Conversely, separation or independence exists between nodes
|
| 1. |
| 2. |
| 3. |
| 4. |
| 5. |
| 6. |
| 7. |
| 8. |
| 9. |
| 10. |
| Knowledge of evidence regarding nodes of |
Network with NB+MPBK from [1]
| Network with minimal PBK. Lastly, a naive Bayes model
|
Conditional probability tables for nodes (excluding gene expression) of
| Conditional probability table for nodes | |||
|---|---|---|---|
| Node | Parents | Cpt values rep. | Node states |
|
| - | [0.50 0.50] | [n t] |
|
| - | [0.10 0.90] | [ia a] |
|
|
| [0.01 0.99; 0.99 0.01] | [lc hc] |
|
|
| [0.99 0.99 0.99 0.01; | [lc hc] |
|
| 0.01 0.01 0.01 0.99] | ||
|
|
| [0.99*ones(1,7) 0.01; | [ia a] |
|
| 0.01*ones(1,7) 0.99] | ||
|
| - | [0.8370 0.1630] | [nm m] |
|
| - | [0.3376 0.6624] | [nm m] |
|
| - | [0.1667 0.8333] | [nm m] |
|
| - | [0.6316 0.3684] | [nm m] |
|
| - | [0.6316 0.3684] | [nm m] |
|
| - | [0.8572 0.1428] | [nm m] |
|
| - | [0.7500 0.2500] | [nm m] |
|
| - | [0.2391 0.7609] | [ia a] |
|
| - | [0.3661 0.6339] | [ia a] |
Notations in the table mean the following “-” implies no parents exist for the particular node; “n” - normal, “t” - tumorous, “ia” - inactive, “a” - active, “lc” - low concentration, “hc” - high concentration, “nm” - non-methylated, “m” - methylation
Conditional probability tables for gene nodes of
| Conditional probability table for nodes | ||
|---|---|---|
| Node | Parents | Cpt values rep. |
|
|
| [0.84 0.16; 0.16 0.84] |
|
|
| [0.94 0.89 0.78 0.31; |
|
| 0.06 0.11 0.22 0.69] | |
|
|
| [0.95 0.89 0.81 0.28; |
|
| 0.06 0.11 0.18 0.72] | |
|
|
| [0.93 0.90 0.67 0.42; |
|
| 0.07 0.10 0.33 0.58] | |
|
|
| [0.95 0.93 0.07 0.05 0.77 0.60 0.40 0.23; |
|
| 0.05 0.07 0.93 0.95 0.23 0.40 0.60 0.76] | |
|
| ||
|
|
| [0.40 0.60; 0.60 0.40] |
|
|
| [0.36 0.64; 0.64 0.36] |
|
|
| [0.56 0.44; 0.44 0.56] |
|
|
| [0.94 0.88 0.82 0.28; |
|
| 0.06 0.11 0.18 0.72] | |
|
|
| [0.56 0.74 0.26 0.44; |
|
| 0.44 0.26 0.74 0.56] | |
|
|
| [0.60 0.71 0.29 0.40; |
|
| 0.40 0.29 0.71 0.60] | |
|
|
| [0.88 0.88 0.12 0.88 0.88 0.88 0.12 0.88; |
|
| 0.12 0.12 0.88 0.12 0.12 0.12 0.88 0.12] | |
|
| ||
|
|
| [0.88 0.98 0.02 0.12 0.20 0.96 0.04 0.80; |
|
| 0.12 0.02 0.98 0.88 0.80 0.04 0.96 0.20] | |
|
| ||
|
|
| [0.31 0.88 0.11 0.69; |
|
| 0.69 0.11 0.89 0.31] | |
|
|
| [0.20 0.80; 0.80 0.20] |
|
|
| [0.71 0.60 0.40 0.29; |
|
| 0.29 0.40 0.60 0.71] | |
|
|
| [0.31 0.89 0.11 0.69; |
|
| 0.69 0.11 0.89 0.31] | |
|
|
| [0.96 0.91 0.09 0.04 0.85 0.47 0.56 0.15; |
|
| 0.04 0.09 0.91 0.96 0.15 0.53 0.47 0.85] | |
|
|
The state of the gene nodes remains [ia a], i.e., “ia” - inactive or “a” - active [ia a]. Note that these values are from one iteration of the 2-holdout experiment
Fig. 2Influence diagram of contains partial prior biological knowledge and epigenetic information in the form of methylation and histone modification. In this model, the state of Sample is distinguished from state of TRCMPLX that constitutes the Wnt pathway
Fig. 3Cases for d-connectivity and d-separation. Black (gray) circles mean that evidence is available (not available) regarding a particular node
Fig. 4Influence diagram of is a naive Bayes model that contains minimal prior biological knowledge. In this model, the state of TRCMPLX is assumed to be indicate whether the sample is cancerous or not
Conditional probability table for D K K1 in (model - t1)
| CPT for | ||||
|---|---|---|---|---|
|
|
|
| Pr( | Pr( |
| Normal | No | Off | h (1) | l (9) |
| Tumor | No | Off | h/l (2) | l/h (10) |
| Normal | Yes | Off | h (3) | l (11) |
| Tumor | Yes | Off | h (4) | l (12) |
| Normal | No | On | h (5) | l (13) |
| Tumor | No | On | h/l (6) | l/h (14) |
| Normal | Yes | On | h (7) | l (15) |
| Tumor | Yes | On | h (8) | l (16) |
h - probability of event being high; l - probability of event being low. Serial numbers in brackets represent the ordering of numbers in vectorial format
Fig. 5Conditional probability table for node D K K1 in
Conditional probability table for D K K1 in (model - t2)
| CPT for | |||
|---|---|---|---|
|
|
| Pr( | Pr( |
| Normal | Off | h (1) | l (5) |
| Tumorous | Off | l (2) | h (6) |
| Normal | On | h (3) | l (7) |
| Tumorous | On | l (4) | h (8) |
h - probability of event being high; l - probability of event being low. Serial numbers in brackets represent the ordering of numbers in vectorial format
Fig. 6Conditional probability table for node D K K1 in
Conditional probability table for D K K1 in (model - p1)
| CPT for | ||
|---|---|---|
|
| Pr( | Pr( |
| Off | h (1) | l (3) |
| On | h (2) | l (4) |
h - probability of event being high; l - probability of event being low. Serial numbers in brackets represent the ordering of numbers in vectorial format
Fig. 7Conditional probability table for node D K K1 in
Conditional probability table for D K K2 in (model - t1)
| CPT for | ||
|---|---|---|
|
| Pr( | Pr( |
| Normal | l/h (1) | h/l (3) |
| Tumor | h/l (2) | l/h (4) |
h - probability of event being high; l - probability of event being low. Serial numbers in brackets represent the ordering of numbers in vectorial format
Fig. 8Conditional probability table for node D K K2 in and
Conditional probability table for D A C T3 in (model - t1)
| CPT for | ||||
|---|---|---|---|---|
|
|
|
| Pr( | Pr( |
| 1 | 1 | Normal | h (1) | l (9) |
| 2 | 1 | Normal | h (2) | l (10) |
| 1 | 2 | Normal | l (3) | h (11) |
| 2 | 2 | Normal | h (4) | l (12) |
| 1 | 1 | Tumor | h (5) | l (13) |
| 2 | 1 | Tumor | h (6) | l (14) |
| 1 | 2 | Tumor | l (7) | h (15) |
| 2 | 2 | Tumor | h (8) | l (16) |
h - probability of event being high; l - probability of event being low. 1 - low; 2 - high. Serial numbers in brackets represent the ordering of numbers in vectorial format
Fig. 9Conditional probability table for node D A C T3 in