Literature DB >> 22576176

Mosaic: making biological sense of complex networks.

Chao Zhang¹, Kristina Hanspers, Allan Kuchinsky, Nathan Salomonis, Dong Xu, Alexander R Pico.

Abstract

UNLABELLED: We present a Cytoscape plugin called Mosaic to support interactive network annotation, partitioning, layout and coloring based on gene ontology or other relevant annotations. AVAILABILITY: Mosaic is distributed for free under the Apache v2.0 open source license and can be downloaded via the Cytoscape plugin manager. A detailed user manual is available on the Mosaic web site (http://nrnb.org/tools/mosaic).

Entities: Disease Species

Mesh：

Year: 2012 PMID： 22576176 PMCID： PMC3389769 DOI： 10.1093/bioinformatics/bts278

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

1 INTRODUCTION

The increasing throughput and quality of molecular measurements in the domains of genomics, proteomics and metabolomics continue to fuel the understanding of biological processes. Collected per molecule, the scope of these data extends to physical, genetic and biochemical interactions that in turn comprise extensive networks. There are software tools available to visualize and analyze data-derived biological networks (Smoot ). One challenge faced by these tools is how to make sense of such networks often represented as massive ‘hairballs’. Many network analysis algorithms filter or partition networks based on topological features, optionally weighted by orthogonal node or edge data (Bader and Hogue, 2003; Royer ). Another approach is to mathematically model networks and rely on their statistical properties to make associations with other networks, phenotypes and drug effects, sidestepping the issue of making sense of the network itself altogether (Machado ). Acknowledging that there is still great value in engaging the minds of researchers in exploratory data analysis at the level of networks (Kelder ), we have produced a Cytoscape plugin called Mosaic to support interactive network annotation and visualization that includes partitioning, layout and coloring based on biologically relevant ontologies (Fig. 1). Mosaic shows slices of a given network in the visual language of biological pathways, which are familiar to any biologist and are ideal frameworks for integrating knowledge.

Fig. 1.

Mosaic Control Panel, context menu and tiled result windows. Mosaic highlights the subset of proteins and interactions associated with ‘cell proliferation’, a significant term from an enrichment analysis in the study of atherosclerosis (King ) Cytoscape is a free and open source network visualization platform that actively supports independent plugin development (Smoot ). For annotation, Mosaic relies primarily on the full gene ontology (GO) or simplified ‘slim’ versions (http://www.geneontology.org/GO.slims.shtml). The cellular layout of partitioned subnetworks strictly depends on the cellular component branch of GO, but the other two functions, partitioning and coloring, can be driven by any annotation associated with a major gene or protein identifier system.

2 METHODS

2.1 Annotation

Although Mosaic uses practically any annotation, its primary usage relies on GO (for best results, a reduced subset of GO-slim). GO provides a controlled vocabulary of terms describing key characteristics of gene products (i.e., process, location and function). Currently, Mosaic supports seven species and we provide four different varieties of GO annotations for each species, including three ‘slimmed’ ontologies. The Mosaic package does not contain any data files. All necessary data for each species are stored on the Mosaic web server where it will be periodically updated. Users can download the corresponding data for their species of interest prior to running Mosaic for the first time. Once the data for one species are successfully downloaded to the local machine, Mosaic can be executed for this species in both offline and online modes. Each time a user starts Mosaic with an Internet connection, Mosaic can synchronize local data information with the server automatically. These data are parsed from Ensembl and GO. Ensembl ID is recommended as the unifying identifier for user networks, although several other identifier systems are also supported in Mosaic.

2.2 Partition

The network is partitioned into a set of subnetworks based on the GO Biological Process annotation of the nodes. For example, all nodes annotated with the GO term ‘translation’ are placed in a new subnetwork entitled ‘Translation’. Subnetworks are hierarchically organized to reflect the parent–child relationships between GO terms, and Mosaic only displays those subnetworks with node counts between minimum and maximum thresholds defined in the settings panel. When a given node is annotated with more than one Biological Process term, it is replicated and placed into each corresponding subnetwork. An overview network is also created, with each node representing a Biological Process subnetwork. Node size reflects the number of genes in the corresponding subnetwork and edge weight represents the number of edges (connections) between the nodes in two subnetworks. In the Mosaic Control Panel (Fig. 1), all subnetworks are listed hierarchically, including subnetworks that fall outside defined thresholds for display. Selecting a subnetwork in the Control Panel will bring it into focus in the tiled window view. Additional functions can be accessed by right-clicking on the name of a particular subnetwork in the Control Panel. In particular, ‘partition this network to one further level’ allows users to partition a huge network to deep levels of GO efficiently without generating hundreds of other subnetworks from parallel branches.

2.3 Compartmental layout

Mosaic performs cell-based layouts using a cell template that defines graphical regions corresponding to cellular compartments and locations. Nodes are positioned into regions based on their GO cellular component annotation or in an ‘unassigned’ region (right margin) if no annotations are present. Once nodes have been assigned to regions and positioned, a force-directed layout is applied to nodes within each region. Nodes annotated as being located in more than one cellular component are replicated across regions. A suffix is added to the node IDs for replicated nodes, whereas the ‘canonical name’ is retained as the original node ID and used as the node label. In addition, replicate nodes are given a red border. Because of node replication, relevant edges are also copied. To reduce the complexity of the network, while retaining the original information, edges can be pruned using a deletion strategy to remove certain edges. The strategy is to keep those replicate edges that are completely contained within a region and delete those replicate edges that extend between different regions. The cellular layout algorithm stores region assignment and whether a node exists in multiple regions as node attributes. Information on whether an edge connects to a node annotated as ‘unassigned’ is stored as an edge attribute.

2.4 Visual style

The final step in Mosaic is to apply color to nodes based on their GO Molecular Function annotation. If a node has multiple Molecular Function annotations, the most specific annotation will determine the node color. At the top of the Mosaic Control Panel, sets of nodes can be selected to display detailed information with the ‘Select nodes’ button. By clicking the ‘Legend’ button, the color legend of all Molecular Function terms can be toggled on and off.

3 CONCLUSION

Mosaic provides researchers with an interactive tool to evaluate biological interactions within the context of well-defined processes, functions and cellular localization while retaining all original network information. Use of additional ontologies is anticipated to provide further insights into the relevance of large-scale interaction datasets and will be supported in future versions.

6 in total

1. Pathway analysis of coronary atherosclerosis.

Authors: Jennifer Y King; Rossella Ferrara; Raymond Tabibiazar; Joshua M Spin; Mary M Chen; Allan Kuchinsky; Aditya Vailaya; Robert Kincaid; Anya Tsalenko; David Xing-Fei Deng; Andrew Connolly; Peng Zhang; Eugene Yang; Clifton Watt; Zohar Yakhini; Amir Ben-Dor; Annette Adler; Laurakay Bruhn; Philip Tsao; Thomas Quertermous; Euan A Ashley
Journal: Physiol Genomics Date: 2005-06-07 Impact factor: 3.107

2. Finding the right questions: exploratory pathway analysis to enhance biological discovery in large datasets.

Authors: Thomas Kelder; Bruce R Conklin; Chris T Evelo; Alexander R Pico
Journal: PLoS Biol Date: 2010-08-31 Impact factor: 8.029

3. An automated method for finding molecular complexes in large protein interaction networks.

Authors: Gary D Bader; Christopher W V Hogue
Journal: BMC Bioinformatics Date: 2003-01-13 Impact factor: 3.169

4. Cytoscape 2.8: new features for data integration and network visualization.

Authors: Michael E Smoot; Keiichiro Ono; Johannes Ruscheinski; Peng-Liang Wang; Trey Ideker
Journal: Bioinformatics Date: 2010-12-12 Impact factor: 6.937

5. Modeling formalisms in Systems Biology.

Authors: Daniel Machado; Rafael S Costa; Miguel Rocha; Eugénio C Ferreira; Bruce Tidor; Isabel Rocha
Journal: AMB Express Date: 2011-12-05 Impact factor: 3.298

6. Unraveling protein networks with power graph analysis.

Authors: Loïc Royer; Matthias Reimann; Bill Andreopoulos; Michael Schroeder
Journal: PLoS Comput Biol Date: 2008-07-11 Impact factor: 4.475

6 in total

16 in total

1. Blood RNA profiling in a large cohort of multiple sclerosis patients and healthy controls.

Authors: Dorothee Nickles; Hsuan P Chen; Michael M Li; Pouya Khankhanian; Lohith Madireddy; Stacy J Caillier; Adam Santaniello; Bruce A C Cree; Daniel Pelletier; Stephen L Hauser; Jorge R Oksenberg; Sergio E Baranzini
Journal: Hum Mol Genet Date: 2013-06-06 Impact factor: 6.150

2. Tracing the footsteps of autophagy in computational biology.

Authors: Dipanka Tanu Sarmah; Nandadulal Bairagi; Samrat Chatterjee
Journal: Brief Bioinform Date: 2021-07-20 Impact factor: 11.622

3. Vascular endothelial growth factor pathway promotes osseointegration and CD31^hiEMCN^hi endothelium expansion in a mouse tibial implant model: an animal study.

Authors: G Ji; R Xu; Y Niu; N Li; L Ivashkiv; M P G Bostrom; M B Greenblatt; X Yang
Journal: Bone Joint J Date: 2019-07 Impact factor: 5.082

4. Expression profiling of mouse subplate reveals a dynamic gene network and disease association with autism and schizophrenia.

Authors: Anna Hoerder-Suabedissen; Franziska M Oeschger; Michelle L Krishnan; T Grant Belgard; Wei Zhi Wang; Sheena Lee; Caleb Webber; Enrico Petretto; A David Edwards; Zoltán Molnár
Journal: Proc Natl Acad Sci U S A Date: 2013-02-11 Impact factor: 11.205

5. NOA: a cytoscape plugin for network ontology analysis.

Authors: Chao Zhang; Jiguang Wang; Kristina Hanspers; Dong Xu; Luonan Chen; Alexander R Pico
Journal: Bioinformatics Date: 2013-06-07 Impact factor: 6.937

6. Mining and visualization of microarray and metabolomic data reveal extensive cell wall remodeling during winter hardening in Sitka spruce (Picea sitchensis).

Authors: Ruth Grene; Curtis Klumas; Haktan Suren; Kuan Yang; Eva Collakova; Elijah Myers; Lenwood S Heath; Jason A Holliday
Journal: Front Plant Sci Date: 2012-10-29 Impact factor: 5.753

Review 7. Mapping the multiscale structure of biological systems.

Authors: Leah V Schaffer; Trey Ideker
Journal: Cell Syst Date: 2021-06-16 Impact factor: 11.091

8. Visual analysis of biological data-knowledge networks.

Authors: Corinna Vehlow; David P Kao; Michael R Bristow; Lawrence E Hunter; Daniel Weiskopf; Carsten Görg
Journal: BMC Bioinformatics Date: 2015-04-29 Impact factor: 3.169

9. Evidence for extensive heterotrophic metabolism, antioxidant action, and associated regulatory events during winter hardening in Sitka spruce.

Authors: Eva Collakova; Curtis Klumas; Haktan Suren; Elijah Myers; Lenwood S Heath; Jason A Holliday; Ruth Grene
Journal: BMC Plant Biol Date: 2013-04-30 Impact factor: 4.215

10. Use of Gene Ontology Annotation to understand the peroxisome proteome in humans.

Authors: Prudence Mutowo-Meullenet; Rachael P Huntley; Emily C Dimmer; Yasmin Alam-Faruque; Tony Sawford; Maria Jesus Martin; Claire O'Donovan; Rolf Apweiler
Journal: Database (Oxford) Date: 2013-01-17 Impact factor: 3.451