Literature DB >> 18779234

OmicsViz: Cytoscape plug-in for visualizing omics data across species.

Tian Xia1, Julie A Dickerson.   

Abstract

MOTIVATION: OmicsViz is a Cytoscape plug-in for mapping and visualizing large-scale omics datasets across species, including those with many-to-many mappings between homologs. This allows users to map their data onto pathways of related model organisms. Mapping schemas across species or different experimental protocols allow users to comparatively analyze the omics data. The data can also be visualized in parallel-coordinate plots. AVAILABILITY: The latest version of OmicsViz with documentation, tutorial and jar files can be downloaded from http://metnet.vrac.iastate.edu/MetNet_fcmodeler.htm

Entities:  

Mesh:

Year:  2008        PMID: 18779234      PMCID: PMC2572702          DOI: 10.1093/bioinformatics/btn473

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

High-throughput technologies provide large-scale datasets which describe biointeractions at various levels of detail, such as genes, protein and metabolites. However, integrating multiple independent datasets and mapping such datasets across species and gaining biological insight are challenging. One of the main obstacles for easy use of this data with networks is multiple naming conventions for experimental data and network elements and many-to-many mappings when applying BLAST homology to map across species. Cytoscape (Shannon et al., 2003) is an open-source software platform for network modeling in biology. OmicsViz is a Cytoscape plug-in that associates experimental datasets from transcriptomic, proteomic and metabolomic datasets with biological network models in Cytoscape according to node attribute information. OmicsViz can uniquely handle many-to-many mappings between nodes in a network and experimental datasets.

2 METHODS AND IMPLEMENTATION

Node attributes in Cytoscape characterize elements with specific name IDs and diverse biological information, such as subcellular location, node type, etc. Edges reflect the interactions or other relations between the nodes.

2.1 Omics data import and mapping

OmicsViz maps labeled measurements in the input experimental data to target nodes if the names are identical. However, due to inconsistencies in nomenclature, different names may be used to describe experimental quantities and network nodes, which actually refer to the same biological component in one species. One common example is that microarray datasets use gene probe IDs but metabolic pathways use gene locus names. OmicsViz provides a mapping file which translates between locus IDs and probe IDs to load transcriptomic data into the network model. A unique feature of OmicsViz is that it deals with problems caused by many-to-many mappings between nodes due to homologs or uncertainty as to which node was matched or which compartment it occurs in. The matched homologs are represented by newly created nodes and connected to target nodes. Functionally, they are grouped into HomologGroups (implemented using CyGroups) with the target nodes. The cross-species transcriptomic data, therefore, can be loaded into target species network. Combining further visualization and comparative analysis, one can easily explore biological insights for both species based on prior knowledge and omics experimental observations. The imported data can be stored as a Cytoscape pvals file. The mapping rules between the labels in the data file, X, and the node names in the active graph, Y, F(X)→Y where X={x1,…,x} and Y={y1,…,y} can be summarized as: OmicsViz uses data files in formats ranging from comma separated value files to Excel spreadsheets to take advantage of data from PLEXdb (Shen et al., 2005), GEO, and other microarray databases. OmicsViz can map multiple datasets to networks at the same time. This helps users to compare results across datasets. The plug-in dynamically updates the CyNode attribute panel and lists all selected datasets. IfF(x)maps to a unique value of y (i.e. a one-to−one mapping), then the data associated with x is mapped as attributes of node, y. If F(x)maps to multiple values of Y, then the data are mapped to the set of nodes, {y∣y=F(x)}. If multiple values of X, defined as the set, X, map to a set of nodes in the graph, i.e. ∪ {y∣y=F(x)}, then additional ‘homolog’ nodes and edges connected to these nodes are added to the graph. The attributes of the newly created nodes are the same as the parent node.

2.2 Data visualization

OmicsViz provides line chart visualization component for loaded omics datasets using the JFreeChart library Gilbert and Morgner, 2007 (Fig. 1). The experimental data associated with nodes in the network can be displayed on a parallel coordinate plot. The parallel coordinates of line chart delineate the variability of omics data with respect to different treatments and time points. The plot window can be controlled by users by adjusting parameters, such as line colors, axis ranges and labels.
Fig. 1.

OmicsViz plugin control panel screenshot. The highlighted yellow nodes show mapped Vitis homologs onto an Arabidopsis gene locus. The parallel coordinate plot of the expression data is shown in the upper right corner.

OmicsViz plugin control panel screenshot. The highlighted yellow nodes show mapped Vitis homologs onto an Arabidopsis gene locus. The parallel coordinate plot of the expression data is shown in the upper right corner.

2.3 Usage example

OmicsViz is also able to import transcriptomic data into Cytoscape network platform across species to facilitate further comparative survey. For example, the Arabidopsis genome has been completely sequenced and the annotation is well developed. Grape (Vitis vinifera), has only recently been sequenced and most genes are electronically inferred. The reciprocal better BLAST hits method (Stuart et al., 2003) can produce whole-genome scale homolog mappings between species. This forms a basis for the mapping of omics data from a less-studied organism to the network model developed for the plant model organism, Arabidopsis. The better BLAST hit method gives a set of possible homologs between species. This example uses TBLASTN for each Arabidopsis protein sequence against the Vitis consensus sequences on the Affymetrix Vitis GeneChip to find the top five grape hits and BLASTX to find the top five hits from Arabidopsis for each Vitis sequence. If Arabidopsis gene A and Vitis probeset B are reciprocal hits, A and B then are grouped as a possible homolog pair. This gives a many-to-many mapping relationship. Mappings for many plant species can be found in the sample data files available with the OmicsViz plugin. The microarray data is from experiment VV2 in PLEXdb (www.plexdb.org). OmicsViz also manages multiple datasets which are associated with specific networks. It enables users to compare datasets of interest and dynamically update the CyNode attribute panel and switch between corresponding datasets.
  3 in total

1.  A gene-coexpression network for global discovery of conserved genetic modules.

Authors:  Joshua M Stuart; Eran Segal; Daphne Koller; Stuart K Kim
Journal:  Science       Date:  2003-08-21       Impact factor: 47.728

2.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

3.  BarleyBase--an expression profiling database for plant genomics.

Authors:  Lishuang Shen; Jian Gong; Rico A Caldo; Dan Nettleton; Dianne Cook; Roger P Wise; Julie A Dickerson
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

  3 in total
  4 in total

Review 1.  The IBD interactome: an integrated view of aetiology, pathogenesis and therapy.

Authors:  Heitor S P de Souza; Claudio Fiocchi; Dimitrios Iliopoulos
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2017-08-23       Impact factor: 46.802

2.  OmicsAnalyzer: a Cytoscape plug-in suite for modeling omics data.

Authors:  Tian Xia; John V Hemert; Julie A Dickerson
Journal:  Bioinformatics       Date:  2010-10-14       Impact factor: 6.937

3.  RefNetBuilder: a platform for construction of integrated reference gene regulatory networks from expressed sequence tags.

Authors:  Ying Li; Ping Gong; Edward J Perkins; Chaoyang Zhang; Nan Wang
Journal:  BMC Bioinformatics       Date:  2011-10-18       Impact factor: 3.169

4.  Proceedings of the 2009 MidSouth Computational Biology and Bioinformatics Society (MCBIOS) conference. Introduction.

Authors:  Jonathan D Wren; Yuriy Gusev; Raphael D Isokpehi; Daniel Berleant; Ulisses Braga-Neto; Dawn Wilkins; Susan Bridges
Journal:  BMC Bioinformatics       Date:  2009-10-08       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.