| Literature DB >> 29688383 |
Maria Kondratova1,2,3, Nicolas Sompairac1,2,3, Emmanuel Barillot1,2,3, Andrei Zinovyev1,2,3, Inna Kuperstein1,2,3.
Abstract
Generation and usage of high-quality molecular signalling network maps can be augmented by standardizing notations, establishing curation workflows and application of computational biology methods to exploit the knowledge contained in the maps. In this manuscript, we summarize the major aims and challenges of assembling information in the form of comprehensive maps of molecular interactions. Mainly, we share our experience gained while creating the Atlas of Cancer Signalling Network. In the step-by-step procedure, we describe the map construction process and suggest solutions for map complexity management by introducing a hierarchical modular map structure. In addition, we describe the NaviCell platform, a computational technology using Google Maps API to explore comprehensive molecular maps similar to geographical maps and explain the advantages of semantic zooming principles for map navigation. We also provide the outline to prepare signalling network maps for navigation using the NaviCell platform. Finally, several examples of cancer high-throughput data analysis and visualization in the context of comprehensive signalling maps are presented.Entities:
Mesh:
Year: 2018 PMID: 29688383 PMCID: PMC5890450 DOI: 10.1093/database/bay036
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Map construction workflow scheme.The corresponding section in the texts is indicated.
Terms and definitions
| Graphical standards and exchange formats terms and definitions | |
|---|---|
| SBGN | Systems Biology Graphical Notation (SBGN) is a standard graphical syntax for representation of biological processes and interactions. SBGN is compatible with multiple pathway drawing and analytical tools, |
| SBML | Systems Biology Markup Language (SBML) is a representation format, based on XML, for communicating and storing computational models of biological processes. It is a free and open standard language with widespread software support, |
| Standard identifier (ID) | Community-accepted nomenclature for scientific naming of biomolecules as genes, proteins, chemicals, drugs and so on. The sources for standard IDs are repositories as UNIPROT, CHEB and HUGO, |
| Data and models exchange formats | Standard formats for data and models to facilitate networks and software intercompatibility. There are two major standard networks exchange formats, BIOPAX for complex networks and SIF for simple binary interactions. The CellDesigner xml format is commonly used exchange format compatible with multiple network analysis tools. |
| Map (in ACSN) | Diagram of detailed molecular interactions with meaningful layout reflecting a certain biological process, which is graphically represented in CellDesigner tool and converted to NaviCell format for exploration and curation. |
| Map layer (in ACSN) | Map area covering several molecular processes with similar functions. |
| Map module (in ACSN) | Part of the map representing a sequence of molecular interactions responsible for execution of a particular function. |
| Signal transduction pathway or sigalling cascade | Sequence of molecular interaction that transforms extracellular signals into an intercellular activity. |
| Map entity (in ACSN) | Component of the map graphically depicted using SBGN standards in CellDesigner tool. |
| Hierarchical structure of a map | System for signaling network map complexity reduction by dividing it into |
| Confidence score (ACSN) | Value representing the measure of accuracy of binary interactions on the map. There are two confidence scores in NaviCell maps. The |
| Semantic zoom | A mechanism providing several map views with different levels of details depiction achieved by gradual exclusion of details while zooming out. It simplifies navigation through large maps of molecular interactions by providing several levels of details, resembling navigation through geographical maps. Exploring the map from a detailed toward a top-level view is achieved by gradual exclusion and modification (simplification and abstraction) of details. One of the main principle of semantic zooming is in that every detail that is shown on the map at a current zoom level, should be readable. This feature supports the |
| Bird-eye view panel | Window containing top-level view of the map with indication on currently centered area; adapted from Google maps. |
| Zooming bar | Zoom control slider; adapted from Google maps. |
| Marker | Symbol indicating location of chosen objects on the map; adapted from Google maps. |
| Pop-up bubble | Small window that opens by clicking on marker. Contains short description and hyperlinks related to the marked entity. |
| Annotation post | Detailed map entity annotation created in CellDesigner by map manager. The annotation is converted to annotation post and displayed in the associated blog by NaviCell. |
Figure 2.Data model for PD diagram using CellDesigner SBGN syntax.
Figure 3.DNA repair map in NaviCell format. (A) Global layout of the map with maprker indicating BRCA1 protein distribution, (B) Individual module layout H HR, (C) Callout window with annotation of BARD: BRCA1 complex.
Figure 4.Transforming text to diagram: role of p53 and NOTCH in induction of EMT. The following statements were used for diagram construction: (1). Control of EMT program is performed by SNAIL and TWIST, the major transcription factors that can induce the executors EMT program (49). These transcription factors are under the control of several upstream mechanisms. (2) SNAIL and TWIST are inhibited by the p53 protein via a variety of microRNAs, including miR200 (50) (3) miR20 binds to the mRNAs of SNAIL and TWIST and triggers their degradation, this way preventing the translation of mRNAs into the corresponding proteins (51) (4) EMT program can be initiated due to excessive expression of NOTCH that directly activates transcription of SNAIL and TWIST (52).
Figure 5.Semantic zooming and entity visualization on DNA repair map. (A) Canonical pathways view, (B) hide-details view, (C) detailed view, (D) highlighting p53 neighbours.
Figure 6.Visualization of cancer high-throughput data in the context of DNA repair map. Visualization of gene expression from ovarian cancer samples in a form of map staining and mutation profile in a form of glyph (triangle). (A) Proliferative and (B) mesenchymal classes of ovarian cancer. Zoom in on EMT regulators in (C) proliferative and (D) mesenchymal classes of ovarian cancer. Proliferative group, n = 87, mesenchymal group, n = 96.
Figure 7.Retrieval of mechanistic model of EMT program control using EMT regulation map. (A) Comprehensive signalling map of EMT regulation, (B) hub player in functional modules of EMT map, (C) structural analysis and reduction of EMT map complexity; (D) scheme representing major mechanisms controlling EMT program.