Literature DB >> 23749961

NOA: a cytoscape plugin for network ontology analysis.

Chao Zhang1, Jiguang Wang, Kristina Hanspers, Dong Xu, Luonan Chen, Alexander R Pico.   

Abstract

SUMMARY: The Network Ontology Analysis (NOA) plugin for Cytoscape implements the NOA algorithm for network-based enrichment analysis, which extends Gene Ontology annotations to network links, or edges. The plugin facilitates the annotation and analysis of one or more networks in Cytoscape according to user-defined parameters. In addition to tables, the NOA plugin also presents results in the form of heatmaps and overview networks in Cytoscape, which can be exported for publication figures. AVAILABILITY: The NOA plugin is an open source, Java program for Cytoscape version 2.8 available via the Cytoscape App Store (http://apps.cytoscape.org/apps/noa) and plugin manager. A detailed user manual is available at http://nrnb.org/tools/noa. .ucsf.edu

Entities:  

Mesh:

Year:  2013        PMID: 23749961      PMCID: PMC3722524          DOI: 10.1093/bioinformatics/btt334

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

To make use of the vast wealth of data and knowledge to elucidate the function of biological networks is one of the biggest challenges in bioinformatics. The development of high-throughput technology has given rise to an enormous increase of data on biomolecular expression and interactions, which results in protein interaction networks, gene regulatory networks, signaling networks and metabolic networks. One approach to understanding networks relies on ontologies, such as Gene Ontology (GO), and enrichment analysis. Common approaches, such as BiNGO (Maere ), regard a network as a list of genes, performing gene-level annotation and enrichment methods. However, it is apparent that the same list of genes with different interactions may perform different functions. Thus, the functional analysis of networks should take into consideration network topology. Our previous work introduced the Network Ontology Analysis (NOA) algorithm to perform gene ontology enrichment analysis on biological networks (Wang ). First, NOA assigns ontology terms to interactions based on the known annotations of connected genes via optimizing two novel indexes ‘Coverage’ and ‘Diversity’. Then, NOA generates two alternative reference sets to statistically rank the enriched functional terms for a given biological network. NOA was shown to be more efficient not only for dynamic regulatory networks but also for rewired protein interaction networks. However, there are several shortcomings with the initial implementation: no graphical interface or visualization of results; limited support for species, gene names and ontology types; no batch mode for large-scale computation; and no integration or interoperability with other network tools. To overcome these shortcomings, we have reimplemented the NOA algorithm as a Cytoscape plugin and added interfaces, extensible ontology and identifier support, new visualizations, a batch mode and interoperability with other Cytoscape plugins, as demonstrated with CyThesaurus (van Iersel ) and Mosaic (Zhang ).

2 METHODS

2.1 Algorithms

The original NOA algorithm was described in (Wang ). In the NOA plugin, we have enhanced the network annotation algorithm to include standard statistical enrichment methods (e.g. hypergeometric distribution, Fisher’s exact test and Z-score). We also added the option of running a conventional node-based analysis (i.e. gene set enrichment), to complement the edge-based approach. Several widely used correction methods have been used to control the type I error rate of multiple hypotheses testing, such as Bonferroni and Benjiamini & Hochberg methods. Taking advantage of data model and interoperability within Cytoscape, users can perform NOA analysis on any network opened in Cytoscape or any subnetwork, selected manually, by built-in filtering mechanisms, or even by another plugin. The NOA plugin also has multiple options for reference networks based on the selected algorithm and test network.

2.2 Batch mode

Batch mode is another unique feature of the NOA plugin, allowing for the simultaneous analysis of multiple networks. Batch mode recognizes network and gene set file formats and selects an appropriate algorithm (edge-based or node-based) accordingly. In the case of edge-based analysis, each network is analyzed with respect to the combination of all input networks or their derived clique (connecting all nodes). Node-based analysis uses either the combination of all input genes or a given genome as reference. In batch mode, the primary output is a ranked list of GO terms significantly overrepresented for each network based on the selected algorithm. The result list is displayed as an annotated table, including GO term type, P value, description and associated nodes/edges. An additional table displays the list of GO terms enriched in multiple networks in the batch. The plugin also generates a sorted heatmap showing the most significantly enriched GO terms against all input networks. These tables and heatmap can be exported to third-party tools for custom visualization and analysis.

2.3 Single mode

In single mode, the NOA plugin can analyze any network in Cytoscape. For edge-based analysis, the clique form of the selected network is used as reference by default. If the user selects a subnetwork for analysis, then they also have the option of using the original parent network as reference. In the case of node-based analysis, the corresponding genome is used as reference, or the complete gene set from a given network if only a subset is selected for analysis. The results include a ranked and annotated list of GO terms, similar to batch mode. And, unique to single mode, an overview network is generated with significant GO terms represented as nodes. The size of these nodes is based on the number of occurrences of a given GO term annotation on the edges (or nodes) of the analyzed network. The co-occurrence of any two GO terms is represented by edges in the overview network.

3 DISCUSSION

The following use cases demonstrate the analysis options available in the NOA plugin and the types of the results users should expect.

3.1 Human pathways

We prepared a file containing all the human pathways curated at WikiPathways (Kelder ) and translated Ensembl identifiers for batch mode analysis by the NOA plugin (Supplementary File S1). We ran edge-based analysis to make use of the interaction content in the pathways, rather than just treating them like disconnected gene sets. Each pathway was analyzed against the entire collection as reference and no correction was applied. As expected, curated pathways of known function were significantly annotated with relevant GO terms (Fig. 1 and Supplementary Fig. S2). Users could add novel pathways or interaction networks to this batch analysis to assess functional annotation of edges in the context of known biology.
Fig. 1.

A heatmap of GO-annotated human pathways based on batch mode NOA plugin analysis. The gradient presents the corresponding logarithmic P value from −10 to 0. Axes are labeled in S2 and inset, highlighting apoptosis-related pathways and GO terms

A heatmap of GO-annotated human pathways based on batch mode NOA plugin analysis. The gradient presents the corresponding logarithmic P value from −10 to 0. Axes are labeled in S2 and inset, highlighting apoptosis-related pathways and GO terms

3.2 Human diseases

We collected protein interaction networks for different types of genetic diseases constructed using a generic protein–protein interactome as the source of disease gene interactions (Wang ) (Supplementary File S3). We compared edge-based and node-based analyses using the NOA plugin in batch mode. The ranked and annotated lists of enriched GO terms in Supplementary Tables S4 and S5 show many disease-term pairings in common, not surprisingly, but also a few differences. Significantly, the inclusion of generic (not disease-specific) interactions did not lead to less specific or aberrant GO terms with the edge-based method. Thus, with the NOA plugin, users can easily run both edge- and node-based analyses on gene sets of interest, using interactome data to supply interactions.

3.3 Yeast cell cycle

We collected time-series microarray data on yeast cell cycle and constructed regulatory transcription networks for each time point using transcription factor-target pairs revealed by ChIP-chip experiments (Supplementary File S6). Once again, we compared both types of analysis and two types of reference networks in batch analysis using the NOA plugin (Supplementary Fig. S7). Edge-based analysis provided more specific GO terms than node-based and reflected the known biological processes. We also noted that the all-network reference produced better results (i.e. more dynamic range and in line with known biology) than the clique-based reference. When working with densely connected networks, like in this example, users will typically produce better results using the all-network option as reference. Funding: National Institutes of Health [NRNB P41-GM103504 to A.R.P., R01-GM100701 to D.X.], NSFC [91029301 to L.C.], the Google Summer of Code program [C.Z.] and NSFC [11131009 to J.W.]. Conflict of interest: None declared.
  6 in total

1.  BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks.

Authors:  Steven Maere; Karel Heymans; Martin Kuiper
Journal:  Bioinformatics       Date:  2005-06-21       Impact factor: 6.937

2.  NOA: a novel Network Ontology Analysis method.

Authors:  Jiguang Wang; Qiang Huang; Zhi-Ping Liu; Yong Wang; Ling-Yun Wu; Luonan Chen; Xiang-Sun Zhang
Journal:  Nucleic Acids Res       Date:  2011-05-04       Impact factor: 16.971

3.  WikiPathways: building research communities on biological pathways.

Authors:  Thomas Kelder; Martijn P van Iersel; Kristina Hanspers; Martina Kutmon; Bruce R Conklin; Chris T Evelo; Alexander R Pico
Journal:  Nucleic Acids Res       Date:  2011-11-16       Impact factor: 16.971

4.  Mosaic: making biological sense of complex networks.

Authors:  Chao Zhang; Kristina Hanspers; Allan Kuchinsky; Nathan Salomonis; Dong Xu; Alexander R Pico
Journal:  Bioinformatics       Date:  2012-05-09       Impact factor: 6.937

5.  The BridgeDb framework: standardized access to gene, protein and metabolite identifier mapping services.

Authors:  Martijn P van Iersel; Alexander R Pico; Thomas Kelder; Jianjiong Gao; Isaac Ho; Kristina Hanspers; Bruce R Conklin; Chris T Evelo
Journal:  BMC Bioinformatics       Date:  2010-01-04       Impact factor: 3.169

6.  Disease-aging network reveals significant roles of aging genes in connecting genetic diseases.

Authors:  Jiguang Wang; Shihua Zhang; Yong Wang; Luonan Chen; Xiang-Sun Zhang
Journal:  PLoS Comput Biol       Date:  2009-09-25       Impact factor: 4.475

  6 in total
  10 in total

1.  Affinity purification-mass spectrometry and network analysis to understand protein-protein interactions.

Authors:  John H Morris; Giselle M Knudsen; Erik Verschueren; Jeffrey R Johnson; Peter Cimermancic; Alexander L Greninger; Alexander R Pico
Journal:  Nat Protoc       Date:  2014-10-02       Impact factor: 13.491

2.  BridgeDb app: unifying identifier mapping services for Cytoscape.

Authors:  Jianjiong Gao; Chao Zhang; Martijn van Iersel; Li Zhang; Dong Xu; Nikolaus Schultz; Alexander R Pico
Journal:  F1000Res       Date:  2014-07-01

3.  Temporal Genetic Modifications after Controlled Cortical Impact--Understanding Traumatic Brain Injury through a Systematic Network Approach.

Authors:  Yung-Hao Wong; Chia-Chou Wu; John Chung-Che Wu; Hsien-Yong Lai; Kai-Yun Chen; Bo-Ren Jheng; Mien-Cheng Chen; Tzu-Hao Chang; Bor-Sen Chen
Journal:  Int J Mol Sci       Date:  2016-02-06       Impact factor: 5.923

4.  Characterization of slow cycling corneal limbal epithelial cells identifies putative stem cell markers.

Authors:  R Sartaj; C Zhang; P Wan; Z Pasha; V Guaiquil; A Liu; J Liu; Y Luo; E Fuchs; M I Rosenblatt
Journal:  Sci Rep       Date:  2017-06-19       Impact factor: 4.379

Review 5.  From protein-protein interactions to protein co-expression networks: a new perspective to evaluate large-scale proteomic data.

Authors:  Danila Vella; Italo Zoppis; Giancarlo Mauri; Pierluigi Mauri; Dario Di Silvestre
Journal:  EURASIP J Bioinform Syst Biol       Date:  2017-03-20

6.  Applying NGS Data to Find Evolutionary Network Biomarkers from the Early and Late Stages of Hepatocellular Carcinoma.

Authors:  Yung-Hao Wong; Chia-Chou Wu; Chih-Lung Lin; Ting-Shou Chen; Tzu-Hao Chang; Bor-Sen Chen
Journal:  Biomed Res Int       Date:  2015-08-20       Impact factor: 3.411

7.  Evolution of network biomarkers from early to late stage bladder cancer samples.

Authors:  Yung-Hao Wong; Cheng-Wei Li; Bor-Sen Chen
Journal:  Biomed Res Int       Date:  2014-09-18       Impact factor: 3.411

8.  Identification of network-based biomarkers of cardioembolic stroke using a systems biology approach with time series data.

Authors:  Yung-Hao Wong; Chia-Chou Wu; Hsien-Yong Lai; Bo-Ren Jheng; Hsing-Yu Weng; Tzu-Hao Chang; Bor-Sen Chen
Journal:  BMC Syst Biol       Date:  2015-12-09

9.  Role of plant MicroRNA in cross-species regulatory networks of humans.

Authors:  Hao Zhang; Yanpu Li; Yuanning Liu; Haiming Liu; Hongyu Wang; Wen Jin; Yanmei Zhang; Chao Zhang; Dong Xu
Journal:  BMC Syst Biol       Date:  2016-08-08

10.  Identification of pLG72-Induced Oxidative Stress Using Systemic Approaches.

Authors:  Maofeng Wang; Hsi-Ju Chen; Jun Zhang; Weimin Li; Xinyou Xie; Hao-Teng Chang
Journal:  Biomed Res Int       Date:  2015-10-11       Impact factor: 3.411

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.