| Literature DB >> 25958393 |
Eric Bonnet1, Eric Viara2, Inna Kuperstein1, Laurence Calzone1, David P A Cohen1, Emmanuel Barillot1, Andrei Zinovyev3.
Abstract
Data visualization is an essential element of biological research, required for obtaining insights and formulating new hypotheses on mechanisms of health and disease. NaviCell Web Service is a tool for network-based visualization of 'omics' data which implements several data visual representation methods and utilities for combining them together. NaviCell Web Service uses Google Maps and semantic zooming to browse large biological network maps, represented in various formats, together with different types of the molecular data mapped on top of them. For achieving this, the tool provides standard heatmaps, barplots and glyphs as well as the novel map staining technique for grasping large-scale trends in numerical values (such as whole transcriptome) projected onto a pathway map. The web service provides a server mode, which allows automating visualization tasks and retrieving data from maps via RESTful (standard HTTP) calls. Bindings to different programming languages are provided (Python and R). We illustrate the purpose of the tool with several case studies using pathway maps created by different research groups, in which data visualization provides new insights into molecular mechanisms involved in systemic diseases such as cancer and neurodegenerative diseases.Entities:
Mesh:
Year: 2015 PMID: 25958393 PMCID: PMC4489283 DOI: 10.1093/nar/gkv450
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.General architecture of the NaviCell Web service server. Client software (light blue layer) communicates with the server (red layer) through standard HTTP requests using the standard JSON format to encode data (RESTful web service, dark blue layer). A session (with a unique ID) is established between the server and the browser (yellow layer) through Ajax communication channel to visualize the results of the commands send by the software client. It is worth noticing that communication channels are bidirectional, i.e. the client software can send data (e.g. an expression data matrix) to the server, but it can also receive data from the server (e.g. a list of HUGO gene symbols contained in a map).
List of biological input data types accepted by NaviCell Web Service
| ‘Omics’ data type | Internal representation |
|---|---|
| mRNA expression data | Continuous |
| microRNA expression data | Continuous |
| Protein expression data | Continuous |
| Discrete copy-number data | Discrete ordered |
| Continuous copy-number data | Continuous |
| Mutation data | Discrete unordered |
| Gene list | Set |
The first column lists the types as they appear in the interface. The second column lists the internal data representations that are used to determine what type of data visualization can be applied.
Figure 3.Visualization of two different data types on the Alzheimer's disease (AD) pathway map. (A) Top-level view expression data are visualized with map staining (see Figure 2 legend). Frequently mutated genes are indicated by glyphs (blue triangles), with the size of the glyph proportional to the mutation frequency. (B–D) represent zooms on known key regulators of (B) apoptosis, (C) blood brain barrier and (D) MAPK signaling pathway. Background colors represent expression values as in (A), while barplots illustrate tissue-specific expression values for frontal cortex (FC), thalamo-cortical area (TC) and hippocampus (HI).
Figure 2.Visualization of multiple data types for two different prostate cancer cell lines. Transcriptomic, gene copy-number values and gene mutations (24) are mapped on the Cell Cycle map. (A) Hormone-sensitive prostate cancer cell line (LNCaP). (B) Hormone-refractory prostate cancer cell line (DU145). Expression data is visualized using the map staining technique, i.e. colored territories around entities, ranging from low (green) to high expression values (red). Copy-number values are represented by glyphs (squares) with blue color indicating gene loss (values of −1 and lower) and yellow color indicating amplification (values of 1 and higher). Mutated genes are depicted by cyan triangles.
Comparison of the NaviCell Web Service features with similar web sites for pathway-based data visualization
| Features | Na | Re | KE | iP | Bc | Pa |
|---|---|---|---|---|---|---|
| Map: advanced navigation | • | • | • | • | • | |
| Map: simple zooming | • | • | • | • | • | • |
| Map: semantic zooming | • | |||||
| Visualization: node coloring | • | • | ||||
| Visualization: heatmaps | • | • | ||||
| Visualization: barplots | • | • | ||||
| Visualization: glyphs | • | |||||
| Visualization: map staining | • | |||||
| Data mapping: gene lists | • | • | • | • | • | • |
| Data mapping: expression data | • | • | • | • | ||
| Data mapping: copy-number data | • | |||||
| Data mapping: mutation data | • | |||||
| Data mapping: metabolomic data | • | • | ||||
| Data mapping: interactions | • | |||||
| Programmatic access: RESTful web | • | • | • | • | ||
| Programmatic access: data visual. | • | • |
Abbreviations: Na: NaviCell Web Service, Re: Reactome (4), KE: KEGG (3), iP: iPath (26), Bc: BioCyc (5), Pa: PATIKAweb (27).