| Literature DB >> 30794548 |
Ali M Roumani1,2, Amgad Madkour3, Mourad Ouzzani4, Thomas McGrew2, Esraa Omran1, Xiang Zhang5.
Abstract
MOTIVATION: Systems biology faces two key challenges when dealing with large amounts of disparate data produced by different experiments: the integration of results across different experiments, and the extraction of meaningful information from the data produced by these experiments. An ongoing challenge is to provide better tools that can mine data patterns that could not have been discovered through simple visualization. Such mining capabilities also need to be coupled with intuitive visualization to portray those findings. We introduce a software toolbox entitled BioNetApp to mine these patterns and visualize them across all experiments.Entities:
Mesh:
Year: 2019 PMID: 30794548 PMCID: PMC6386483 DOI: 10.1371/journal.pone.0211277
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
A comparison between BioNetApp, Mzmine2, Graphle, and Biological Networks.
| Specs | BioNetApp | Mzmine2 | Graphle | Biological Networks |
|---|---|---|---|---|
| Data type supported | Supports all data types represented in a comma or tab delimited text file (csv). | Requires Thermo MSFileReader library for some data types. | Supports network data as nodes and edges in the form of key/value pairs in tables. | Supports values of biological network datasets. |
| Statistical Analysis | Interactive visual data mining of molecular correlations, comparative, and clustering analysis across samples, groups, and time points. | Basic methods for statistical analysis of processed data. | Search and visualization engine to view functional relationship networks predicted by the bioPIXIE system and a collection of microarray data. | Visualization and analysis services over PathSys. |
| Visualization | Comparative analysis uses boxplot display, outlier detection, and data curve fitting. Clustering uses SOM, K-Means, K-Medoids, and Farthest First algorithms. Correlation uses Kamada-Kawai, Fruchterman-Reingold Spring network layouts, with single/multiple circle and heat map layouts. | Can visualize raw data with peak picking and identification results, such as chromatogram plot and 2D plot Shows detected peaks, peak areas, and pal component analysis plots. | Displays dense biological networks as network graphs. Targeted for gene interaction networks. | Can build pathways and common targets, find intersections with curated pathways, and view genome-scale integrated networks of protein-protein, protein-DNA and genetic interactions networks. |
| Observing the results | Produced graphic presentations can be exported as images, and the molecular correlation information as simple flat text format. | Can report the quantification results in table form or using charts. | Allows exploring networks, scaling between different details of visualizations, and saving images and data. | Imported network interactions and components can be annotated and saved, along with related graphs. |
Fig 1Experiment selection window.
(A) Time points available for this experiment (in our case study we chose all three time points). (B) Analysis type to perform. (C) Molecules (peaks) frequency filter across all samples (cross-board filter) and also per group of selected features (group-based filter).
Fig 2Correlation analysis window with single circle layout.
(A) Molecular IDs. (B) Interactive graphic display of molecular correlations in a single circle display while applying the Pearson correlation calculation. (C) Information panel display for rich set of information about the selected (highlighted) molecules in the graph. Color scheme is used to show the correlation directions: red indicates positive correlation while blue indicates negative correlation. The encircled color areas readily demonstrate clusters of strong molecular correlation.
Fig 3Correlation analysis window with multiple circle layout.
Three-circle network graph display illustrating the up-regulated molecules in green borer nodes(A), down-regulated in red border (B), and other (not differentially expressed) molecules with no border (C). The fold change threshold is set at 2, meaning that molecules regulation that don’t meet this threshold are grouped in the third neutral circle (C). The fold change value can be adjusted accordingly.
Fig 4Correlation analysis.
(A) Meta information for molecule 901.7314. (B) List of correlated molecules. Further molecule details can be invoked by highlighting a molecule in this list and choosing wither the “Show Element” or “Show Correlation” buttons. (C) Molecular concentration levels across all samples. (D) Concentration details for molecule 901.7314. (E) Concentration details for correlated molecule 903.7465. (F) Correlation graph showing the expression data of the two molecules. Each point represents the expression levels of both molecules (x-axis and y-axis) in the same sample.
Fig 5Comparative and distribution analysis.
(A) A tree describing the molecules and their corresponding samples and time points. (B) Box plot displaying the concentration of a selected molecule across time points while applying the Robust Linear fitting. (C) Concentration distribution plot of the molecule across all samples. (D) and (E) display the same plots but for two sample groups (control and test) side-by-side for ease of comparison.
Fig 6Clustering.
(A) Clustering averages across all samples, (B) Cluster 2 molecules concentration details across all samples.