| Literature DB >> 30972331 |
Hooman Sedghamiz1, Matthew Morris1, Travis J A Craddock2,3, Darrell Whitley4, Gordon Broderick1,5.
Abstract
The in silico study and reverse engineering of regulatory networks has gained in recognition as an insightful tool for the qualitative study of biological mechanisms that underlie a broad range of complex illness. In the creation of reliable network models, the integration of prior mechanistic knowledge with experimentally observed behavior is hampered by the disparate nature and widespread sparsity of such measurements. The former challenges conventional regression-based parameter fitting while the latter leads to large sets of highly variable network models that are equally compliant with the data. In this paper, we propose a bounded Constraint Satisfaction (CS) based model checking framework for parameter set identification that readily accommodates partial records and the exponential complexity of this problem. We introduce specific criteria to describe the biological plausibility of competing multi-valued regulatory networks that satisfy all the constraints and formulate model identification as a multi-objective optimization problem. Optimization is directed at maximizing structural parsimony of the regulatory network by mitigating excessive control action selectivity while also favoring increased state transition efficiency and robustness of the network's dynamic response. The framework's scalability, computational time and validity is demonstrated on several well-established and well-studied biological networks.Entities:
Keywords: constraint satisfaction; data compliance; multi-objective; multi-valued discrete logic; path robustness; regulatory networks; transition efficiency
Year: 2019 PMID: 30972331 PMCID: PMC6443719 DOI: 10.3389/fbioe.2019.00048
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Benchmark problem definition.
| HPA | 4 | 8 | 14 | 8 | 2 | 8 | 46,656 | 10 | 5 |
| IRMA | 6 | 9 | 16 | 0 | 0 | 0 | ≈604 x 106 | 50 | 12 |
| Dcell | 114 | 129 | 3 | 0 | 0 | 0 | 95,268 × 10135 | 5 | 3 |
| HPG | 5 | 25 | 16 | 25 | 18 | 25 | 3160 | 3 | 8 |
| Th | 23 | 35 | 3 | 0 | 0 | 35 | 2100 | 2 | 2 |
Specification of the five benchmark problems to which an analysis was applied.
.
Figure 1An example of HPA axis described in GDF. Each interaction (edge e) is assigned a threshold (w), polarity [u where solid (resp. dashed) stands for positive (resp. negative)] and each node is allowed to assume an expression level equal to its number of actions (outdegrees). Transition dynamics of each node is represented by a set of K(I) values. K(∅) defines the basal value of entities (no activation is present). For instance, K2({1, 4}) defines how entity v2 behaves when both its activator v1 and inhibitor v4 are present simultaneously (which is an increase in its expression since K2({1, 4}) = 1). Adapted from Figure 1A, Sedghamiz et al. (2019).
Figure 2An example visualizing the length cost (denoted by F) concept. Assuming (xt and xt+1) are two sampled experimental measurements. There are three sets of parameters that make xt+1 reachable from xt: a direct transition with F = 0 and two indirect transitions with F = 2 and F = 1, respectively.
Figure 3An illustration of robustness. Here there are two parameterizations that generate paths with equal length costs (solid and dashed). In order to compute the robustness, we only count the number of branches for the shortest trajectories. It is clear that while the dashed trajectory is still reachable to sample 2 with a similar cost length, it has a chance of deviating and missing its destination in transitions [0100] and [0101] (dotted branches).
Figure 4The T-helper network. Arrow-head and circle-head edges indicate activating and inhibiting interactions, respectively. Dashed edges highlight the interactions that were marked by the model checker not necessary in order to reproduce the three steady states reported in Garg et al. (2008). Adapted from Figure 5A, Sedghamiz et al. (2019).
Computational performance.
| HPA | 0.5 | 0.61 | 0 | 0.6 | 0.45 | 0.08 | 0.11 | 0.81 | 0.61 | 0 | 0.92 | 0.61 | 0.08 | 0.11 | 265.3 | 0.61 | 0 | 123.4 | 0.45 | 0.08 | 0.11 |
| IRMA | 5.2 | 56.1 | 1 | 0.2 | 0.037 | 715.65 | 44.36 | 1 | 0.2 | 0.037 | 70.23 | 1800.1 | 1 | 0.229 | 0.035 | ||||||
| Dcell | 1.8 | 1 | 0 | 2.9 | 1 | 0 | 0.003 | 31.2 | 82.1 | 1 | 0 | 0.003 | 60.2 | 1 | 0 | 2700.1 | 1 | 0 | 0.003 | ||
| HPG | 3,650 | 0.46 | 0.05 | 0.45 | 0.07 | 0.01 | 1,020 | 0.46 | 0.05 | 0.42 | 0.06 | 0.01 | 0.48 | 0.10 | 0.2 | ||||||
| Th | 0.4 | 0.74 | 0 | 0.4 | 0.74 | 0 | 0 | 0.5 | 0.74 | 0 | 0.5 | 0.74 | 0 | 0 | 0.63 | 0.74 | 0 | 0.63 | 0.74 | 0 | 0 |
Performance of the proposed method on five biological networks using three solvers, namely Chuffed, OR-tools and OptiMatSat, for synchronous (Sync) and asynchronous (Async) updating schemes. Metrics include the execution time in seconds T(s) as well as the objective function value for structural complexity z.
Unsatisfiable, meaning that there existed no parameterization supporting the constraints.
Parameterization tasks that were interrupted because they did not converge within the maximum computation of 9,000 s.
No solution was found within the time limit. For the HPG model none of the models found an optimal solution within the time-limit, reported solutions are the sub-optimal ones obtained within this limit. Note that in some cases a Pareto solution did not exist, in those cases we report the final solution reported by OptiMathSat. Furthermore, the objective values reported in the table are normalized. Bio-ModelChecker normalizes these values by dividing them by the maximum objective achievable in each case.