Literature DB >> 24363376

JEPETTO: a Cytoscape plugin for gene set enrichment and topological analysis based on interaction networks.

Charles Winterhalter1, Paweł Widera, Natalio Krasnogor.   

Abstract

SUMMARY: JEPETTO (Java Enrichment of Pathways Extended To TOpology) is a Cytoscape 3.x plugin performing integrative human gene set analysis. It identifies functional associations between genes and known cellular pathways, and processes using protein interaction networks and topological analysis. The plugin integrates information from three separate web servers we published previously, specializing in enrichment analysis, pathways expansion and topological matching. This integration substantially simplifies the analysis of user gene sets and the interpretation of the results. We demonstrate the utility of the JEPETTO plugin on a set of misregulated genes associated with Alzheimer's disease. AVAILABILITY: Source code and binaries are freely available for download at http://apps.cytoscape.org/apps/jepetto, implemented in Java and multi-platform. Installable directly via Cytoscape plugin manager. Released under the GNU General Public Licence.

Entities:  

Mesh:

Year:  2013        PMID: 24363376      PMCID: PMC3967109          DOI: 10.1093/bioinformatics/btt732

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

The integration of heterogeneous data derived from functional genomics experiments is an essential step in providing insights into biological systems behaviour, especially disease-related processes. Nowadays this integration is becoming easier, thanks to extendable network analysis platforms such as Cytoscape (Shannon ). Although several existing Cytoscape plugins do functional analysis, not many go beyond performing a ‘term-based’ search or link ontology. In contrast, JEPETTO integrates gene sets with pathways and the molecular interaction networks they are embedded in. It uses information from three different web servers to perform network enrichment, pathways expansion and topological analysis.

2 IMPLEMENTATION

2.1 Input

Our plugin operates on target gene set provided by a user. The list of gene names can be given directly or imported from an existing network created in Cytoscape (many gene identifier formats are accepted, e.g. Ensembl, HGNC, Entrez, UniProt). The other main parameters are the reference annotation database (KEGG, BioCarta, GO, InterPro, etc.) and the molecular interaction network (STRING or a user-defined network).

2.2 Processing

There are two types of analysis available in JEPETTO based on: (1) enrichment or (2) topology. For the enrichment analysis, EnrichNet and PathExpand web servers are used and the topological analysis is performed with TopoGSA. The plugin communicates with the web servers as shown in Figure 1A, and integrates the results within the Cytoscape environment. This single-point integration eliminates the need to use the web servers individually and simplifies repeated analysis on previously obtained results.
Fig. 1.

Plugin architecture. (A) analysis via JEPETTO (single input/output), (B) analysis via existing web servers (multiple input/output points)

Plugin architecture. (A) analysis via JEPETTO (single input/output), (B) analysis via existing web servers (multiple input/output points) EnrichNet (Glaab ) maps the input gene set onto a molecular interaction network and, using a random walk, scores distances between the genes and pathways/processes in a reference database. This network-based association score (XD-score) is relative to the average distance to all pathways and represents a deviation (positive or negative) from the average distance. As an option, EnrichNet provides XD-scores for 60 human tissues, derived from tissue-specific gene expression data. PathExpand (Glaab ) maps the input pathway/process onto the human protein–protein interaction network and extends it with proteins that (1) are strongly associated with the pathway nodes; and (2) increase the pathway compactness by connecting its disconnected members. The exact expansion acceptance criteria can be modified in the plugin advanced options. TopoGSA (Glaab ) maps the input gene set on an interaction network, computes its topological signature and compares it against signatures of pathways/processes in a reference database. The topological signature is built from five distinct properties, among others: network density, centrality of nodes in the network or their tendency to form clusters.

2.3 Output

The enrichment analysis finds pathways significantly associated with the input gene set (in terms of XD-score). When a pathway is selected, its expansion is constructed and a network of interactions between the gene set, pathway and expansion is generated (see Fig. 2). The topology analysis finds pathways with a pattern of interactions most similar to those in the input gene set and visually compares their topological properties (see Fig. 3).
Fig. 2.

Target gene set within Alzheimer’s disease environmental network. Grey nodes represent the input set, green the pathway and blue the overlap between them. Orange nodes represent the expansion. Interactions between the input set and added nodes are highlighted: edges to pathway nodes are green and edges to expansion nodes are orange

Fig. 3.

Comparative analysis of topological properties. The red square next to the Wnt signalling pathway represents the target network

Target gene set within Alzheimer’s disease environmental network. Grey nodes represent the input set, green the pathway and blue the overlap between them. Orange nodes represent the expansion. Interactions between the input set and added nodes are highlighted: edges to pathway nodes are green and edges to expansion nodes are orange Comparative analysis of topological properties. The red square next to the Wnt signalling pathway represents the target network

3 CASE STUDY

We have retrieved the Alzheimer’s disease-related set of genes from Phenopedia (http://www.hugenavigator.net). The enrichment analysis of this set assigned the highest XD-score (1.94) to the Alzheimer’s disease pathway. The pathway was expanded and the disease environmental network (see Fig. 2) was generated. The following genes were added as the expansion: TMED10, METTL2B, APH1B, MMP17 and PITX3. The first three of these genes have been experimentally characterized as direct Alzheimer’s cofactors (additional details in Supplementary Information). MMP17 protein family was found to play a role in β-amyloid proteins degradation related to the increase of mitogen-activated protein kinase (MAPK), the main trigger of the Alzheimer’s disease (Yoshiyama ). PITX3 is involved in transcription of a micro-RNA that is known to have reduced expression levels in Parkinson’s diseased brains and may also contribute to the Alzheimer’s pathology development (Shioya ). The topological analysis of the largest connected component of the Alzheimer’s disease environmental network confirmed the previous findings. Among the most topologically similar pathways were Wnt signalling pathway related to MMP17 and Parkinson’s disease pathway related to PITX3. It also highlighted similarity to sclerosis and diabetes and several environmental information processing pathways (see Fig. 3 and Supplementary Information).

4 SUMMARY

JEPETTO integrates three different network-centric human gene set analysis methods under a single interface of the Cytoscape 3.x environment. It performs enrichment and topological analysis based on the interaction networks. It displays the target gene set within its interaction environment and identifies possible gene cofactors and topologically related pathways and processes that are unlikely to be detected using traditional term-based analysis. In the case study of the Alzheimer’s disease-associated genes, JEPETTO was able to identify a number of known disease cofactors and suggested directions for further investigation. Funding: Engineering and Physical Sciences Research Council [EP/J004111/1]. Conflict of Interest: none declared.
  6 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  Aberrant microRNA expression in the brains of neurodegenerative diseases: miR-29a decreased in Alzheimer disease brains targets neurone navigator 3.

Authors:  M Shioya; S Obayashi; H Tabunoki; K Arima; Y Saito; T Ishida; J Satoh
Journal:  Neuropathol Appl Neurobiol       Date:  2010-02-25       Impact factor: 8.090

3.  Selective distribution of matrix metalloproteinase-3 (MMP-3) in Alzheimer's disease brain.

Authors:  Y Yoshiyama; M Asahina; T Hattori
Journal:  Acta Neuropathol       Date:  2000-02       Impact factor: 17.088

4.  TopoGSA: network topological gene set analysis.

Authors:  Enrico Glaab; Anaïs Baudot; Natalio Krasnogor; Alfonso Valencia
Journal:  Bioinformatics       Date:  2010-03-24       Impact factor: 6.937

5.  Extending pathways and processes using molecular interaction networks to analyse cancer genome data.

Authors:  Enrico Glaab; Anaïs Baudot; Natalio Krasnogor; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2010-12-13       Impact factor: 3.169

6.  EnrichNet: network-based gene set enrichment analysis.

Authors:  Enrico Glaab; Anaïs Baudot; Natalio Krasnogor; Reinhard Schneider; Alfonso Valencia
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

  6 in total
  21 in total

1.  Loss of CYLD expression unleashes Wnt signaling in multiple myeloma and is associated with aggressive disease.

Authors:  H van Andel; K A Kocemba; A de Haan-Kramer; C H Mellink; M Piwowar; A Broijl; M van Duin; P Sonneveld; M M Maurice; M J Kersten; M Spaargaren; S T Pals
Journal:  Oncogene       Date:  2016-10-24       Impact factor: 9.867

2.  NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases.

Authors:  Pietro Di Lena; Pier Luigi Martelli; Piero Fariselli; Rita Casadio
Journal:  BMC Genomics       Date:  2015-06-18       Impact factor: 3.969

3.  Network pharmacological identification of active compounds and potential actions of Erxian decoction in alleviating menopause-related symptoms.

Authors:  Shiwei Wang; Yao Tong; Tzi-Bun Ng; Lixing Lao; Jenny Ka Wing Lam; Kalin Yanbo Zhang; Zhang-Jin Zhang; Stephen Cho Wing Sze
Journal:  Chin Med       Date:  2015-07-19       Impact factor: 5.455

4.  Gene set enrichment and topological analyses based on interaction networks in pediatric acute lymphoblastic leukemia.

Authors:  Shuxiang Sui; Xin Wang; Hua Zheng; Hua Guo; Tong Chen; Dong-Mei Ji
Journal:  Oncol Lett       Date:  2015-09-29       Impact factor: 2.967

5.  Systems biology analysis of the proteomic alterations induced by MPP(+), a Parkinson's disease-related mitochondrial toxin.

Authors:  Chiara Monti; Heather Bondi; Andrea Urbani; Mauro Fasano; Tiziana Alberio
Journal:  Front Cell Neurosci       Date:  2015-02-02       Impact factor: 5.505

6.  EP300 Protects from Light-Induced Retinopathy in Zebrafish.

Authors:  Reiko Kawase; Yuhei Nishimura; Yoshifumi Ashikawa; Shota Sasagawa; Soichiro Murakami; Mizuki Yuge; Shiko Okabe; Koki Kawaguchi; Hiroshi Yamamoto; Kazumi Moriyuki; Shinsaku Yamane; Kazuhiro Tsuruma; Masamitsu Shimazawa; Hideaki Hara; Toshio Tanaka
Journal:  Front Pharmacol       Date:  2016-05-19       Impact factor: 5.810

7.  DNA Damage Response Is Involved in the Developmental Toxicity of Mebendazole in Zebrafish Retina.

Authors:  Shota Sasagawa; Yuhei Nishimura; Tetsuo Kon; Yukiko Yamanaka; Soichiro Murakami; Yoshifumi Ashikawa; Mizuki Yuge; Shiko Okabe; Koki Kawaguchi; Reiko Kawase; Toshio Tanaka
Journal:  Front Pharmacol       Date:  2016-03-14       Impact factor: 5.810

8.  Functional and protein‑protein interaction network analysis of colorectal cancer induced by ulcerative colitis.

Authors:  Yong Dai; Jin-Bo Jiang; Yan-Lei Wang; Zu-Tao Jin; San-Yuan Hu
Journal:  Mol Med Rep       Date:  2015-07-20       Impact factor: 2.952

9.  Preferential Allele Expression Analysis Identifies Shared Germline and Somatic Driver Genes in Advanced Ovarian Cancer.

Authors:  Najeeb M Halabi; Alejandra Martinez; Halema Al-Farsi; Eliane Mery; Laurence Puydenus; Pascal Pujol; Hanif G Khalak; Cameron McLurcan; Gwenael Ferron; Denis Querleu; Iman Al-Azwani; Eman Al-Dous; Yasmin A Mohamoud; Joel A Malek; Arash Rafii
Journal:  PLoS Genet       Date:  2016-01-06       Impact factor: 5.917

10.  Changes in poly(A) tail length dynamics from the loss of the circadian deadenylase Nocturnin.

Authors:  Shihoko Kojima; Kerry L Gendreau; Elaine L Sher-Chen; Peng Gao; Carla B Green
Journal:  Sci Rep       Date:  2015-11-20       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.