| Literature DB >> 25050327 |
Marco Aldinucci1, Cristina Calcagno2, Mario Coppo1, Ferruccio Damiani1, Maurizio Drocco1, Eva Sciacca1, Salvatore Spinella1, Massimo Torquati3, Angelo Troina1.
Abstract
The paper arguments are on enabling methodologies for the design of a fully parallel, online, interactive tool aiming to support the bioinformatics scientists .In particular, the features of these methodologies, supported by the FastFlow parallel programming framework, are shown on a simulation tool to perform the modeling, the tuning, and the sensitivity analysis of stochastic biological models. A stochastic simulation needs thousands of independent simulation trajectories turning into big data that should be analysed by statistic and data mining tools. In the considered approach the two stages are pipelined in such a way that the simulation stage streams out the partial results of all simulation trajectories to the analysis stage that immediately produces a partial result. The simulation-analysis workflow is validated for performance and effectiveness of the online analysis in capturing biological systems behavior on a multicore platform and representative proof-of-concept biological systems. The exploited methodologies include pattern-based parallel programming and data streaming that provide key features to the software designers such as performance portability and efficient in-memory (big) data management and movement. Two paradigmatic classes of biological systems exhibiting multistable and oscillatory behavior are used as a testbed.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25050327 PMCID: PMC4090576 DOI: 10.1155/2014/207041
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1CWC simulator with online parallel analysis: architecture.
Figure 2Screenshots of the simulation tool interface.
Figure 3Simulation results on the Schlögl model. The figures report the mean and standard deviation, two exemplificative raw simulation trajectories, and the clustering results using K-means during the simulation runs (a) and at the end of the simulation runs (b).
Figure 4Simulation results on the λ-phage model. (a) Reports (approximatively) the 480 raw trajectories and (b) shows the online clustering results using QT.
Figure 5Simulation results of the cytosolic FRQ protein of the Neurospora model.
Figure 6Speedup of the workflow of the Neurospora model on the Intel platform against number of simulation engines with 3 statistical engines, for different number of trajectories, each of them counting 104 points (a) and 105 points (b).
Performance on 1200 simulation instances of the Neurospora model (Intel 32 core platform).
| Single trajectory information |
Overall data | ||
|---|---|---|---|
| Number of samples | Interarrival time | Throughput | Output size |
| 104 | 25.86 | 2.70 MB/s | 82.40 MB |
| 105 | 2.78 | 28.59 MB/s | 823.98 MB |
| 106 | 232.68 ns | 303.86 MB/s | 8.24 GB |
Biological simulation tools comparison.
| Tool | Calculus | Simulation schema | Parallelism | Data analysis |
|---|---|---|---|---|
| SCWC | CWC | Gillespie | FastFlow | Online statistics |
| SPiM |
| Gillespie | None | None |
| Dizzy | Reaction model | Gillespie, Gibson-Bruck, Tau-Leap, ODE | None | None |
| BioPEPA | Process algebra | ODE, Gillespie | None | None |
| Cellware | Reaction model | Gillespie, Gibson-Bruck, ODE | None | None |
| DiVinE | Model checker | ODE | MPI | None |
| StochKit | Reaction model | Gillespie, Tau-leaping | MPI | Postprocessing |
| StochKit2 | Reaction model | Gillespie, Tau-leaping | Multithread | Postprocessing |
| StochKit-FF | Reaction model | Gillespie, Tau-leaping | FastFlow | Online statistics |
| Hy3S | Reaction model | Gibson-Bruck, Hybrid | MPI | Postprocessing |
| Li and Petzold's | Reaction model | Gillespie | GPGPU | None |
| StochSimGPU | Reaction model | Gillespie, Gibson-Bruck, Li | GPGPU | Postprocessing |