| Literature DB >> 30715136 |
Yannis Pantazis1, Ioannis Tsamardinos1,2,3.
Abstract
MOTIVATION: Temporal variations in biological systems and more generally in natural sciences are typically modeled as a set of ordinary, partial or stochastic differential or difference equations. Algorithms for learning the structure and the parameters of a dynamical system are distinguished based on whether time is discrete or continuous, observations are time-series or time-course and whether the system is deterministic or stochastic, however, there is no approach able to handle the various types of dynamical systems simultaneously.Entities:
Mesh:
Year: 2019 PMID: 30715136 PMCID: PMC6748758 DOI: 10.1093/bioinformatics/btz065
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Different choices for the dictionary, , according to the allowed chemical reaction types under mass action kinetics law
| Unknown reactions | Dictionary, | Size, |
|---|---|---|
|
|
|
|
|
|
|
|
Note: Symbols X1–X correspond to the state variables [a.k.a. (reaction) species in chemistry and systems biology]. The row vector is defined as . For the two-reactant case, the dynamical system is non-linear with respect to the state variables.
Fig. 1.(a) The network of interactions between the three species (P1, P2 and P3). This graph is a coarse high-level representation and it should not be confused with the detailed biochemical reaction network which is given in Supplementary S5 Text. (b) Time-course measurements (dots) and the estimated smoothed trajectories (dashed curves). The collocation method in conjunction with smoothing penalty is used for the estimation of the time-series. Notice that time-course data from the low noise case are shown. (c) MIP and ERC for P1–P3 as a function of the number of experiments under low (blue) and high (red) measurement uncertainty. Standard deviation of the various stochastic terms in high uncertainty regime is twice as much compared to the low uncertainty regime. From the lower plot, it is evident from the negativity of ERC that the problematic variable is P2 when only one experiment is used. (d) Precision and recall curves under low (blue) and high (red) measurement noise as a function of the number of experiments. Results from both USDL (solid lines) and SINDy (dashed lines) algorithms are presented. Perfect inference is achieved only with USDL under five experimental interventions and the low noise case. Precision (solid lines) seems to be insensitive to higher noise levels, however, recall slightly degrades
Fig. 2.Network reconstruction of protein interactions from temporal mass cytometry data. (a) Time-course measurements (dots) and the estimated smoothed trajectories (solid lines). The collocation method in conjunction with smoothing penalty was used for the estimation of the time-series. Observe the high level of stochasticity of the time-course mass cytometry data. (b) The reconstructed subnetwork with four proteins using the USDL algorithm. Bold arrows (true positives) indicate that the true network of interactions is inferred. (c) Similar to (b) for SINDy. (d) The reconstructed network with eight proteins with non-bold arrows corresponds to false positives while dotted arrows correspond to false negatives. Dynamics for the additional proteins vary less over time as it is evident from the lower panel of (a). Nevertheless, most of the interactions are directly (such as Akt Rb or Erk S6) or indirectly (like Akt Erk S6 instead of Akt S6) inferred. (e) Similar to (d) for SINDy
Fig. 3.Performance analysis and comparison between USDL and SINDy algorithms for Ornstein–Uhlenbeck stochastic process. (a) The connectivity graph for each variable of the Ornstein–Uhlenbeck process. The edges, their direction as well as the type of interaction are determined by the non-zero elements of connectivity matrix A. (b) Precision and recall are shown as functions of the number of measured time-series in two different regimes; stationary (blue) and transient (red). Both USDL (solid) and SINDy (dashed) algorithms achieve perfect reconstruction of the dynamical system for the transient regime and when enough time-series are measured. For the stationary regime, perfect reconstruction is possible only for USDL and a special type of test functions (peaky Fourier modes) while SINDy (blue dashed) fails to recover completely the dynamical system in this regime due to the high stochasticity