Literature DB >> 27559157

PhenomeScape: a cytoscape app to identify differentially regulated sub-networks using known disease associations.

Jamie Soul1, Sara L Dunn1, Tim E Hardingham1, Ray P Boot-Handford1, Jean-Marc Schwartz1.   

Abstract

PhenomeScape is a Cytoscape app which provides easy access to the PhenomeExpress algorithm to interpret gene expression data. PhenomeExpress integrates protein interaction networks with known phenotype to gene associations to find active sub-networks enriched in differentially expressed genes. It also incorporates cross-species phenotypes and associations to include results from animal models of disease. With expression data imported into PhenomeScape, the user can quickly generate and visualise interactive sub-networks. PhenomeScape thus enables researchers to use prior knowledge of a disease to identify differentially regulated sub-networks and to generate an overview of altered biologically processes specific to that disease.
AVAILABILITY AND IMPLEMENTATION: Freely available for download at https://github.com/soulj/PhenomeScape CONTACT: jamie.soul@postgrad.manchester.ac.uk or jean-marc.schwartz@manchester.ac.uk.
© The Author 2016. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2016        PMID: 27559157      PMCID: PMC5167065          DOI: 10.1093/bioinformatics/btw545

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

The interpretation of gene expression data to gain insights into molecular mechanisms of disease after differentially expression analysis is challenging. Looking at groups of genes using enrichment or network based methods adds power to the analysis and aids interpretation. Gene set enrichment approaches are limited to predefined sets of genes and ignore the known interaction information. Finding de novo sub-networks/pathways from a protein–protein interaction (PPI) network and the expression data itself is an alternative approach employed by many tools (Ideker ). For many diseases, there are genes, which are known through human Mendelian disease and animal models, where perturbation gives rise to observed phenotypes present in the disease under study. Projects such as the International Knockout Mouse Consortium are adding much new information as they systematically identify gene to phenotype associations (Bradley ). This prior knowledge is valuable, since when combined with the gene expression data it indicates which genes and pathways/regions of the interactome are of importance in the disease. We recently described an algorithm named PhenomeExpress for analysis of disease related expression data that makes use of known cross-species phenotype to gene associations to generate sub-networks of differentially expressed genes that are linked to the observed phenotypes (Soul ). We here present PhenomeScape, a Cytoscape application allowing a user to quickly run the described PhenomeExpress algorithm on gene expression data within the Cytoscape GUI environment (Shannon ).

2 Methods

2.1 Network sources

The UberPheno network ontology was used to produce a phenotype-phenotype similarity network using Resnik semantic similarity, with a cut-off of 3 to keep strong interactions between phenotypes (Köhler ). Gene to cross-species phenotype associations were downloaded from UberPheno. Example high confidence mouse and human networks from HumanConsensusPathDB and StringDB are provided for convenience and can be loaded though the app menu (Kamburov ; von Mering ). All IDs were mapped to official gene symbols using Biomart (Smedley ).

2.2 PhenomeScape input

For input PhenomeScape requires a PPI network, either input using standard Cytoscape import methods or a provided one (Fig. 1A). Analysed expression data with gene symbols, fold change and P-values are imported into Cytoscape and matched with the network by the Gene Symbol. No thresholding by fold change or P-value is required as significance is determined at a sub-network level by calculation of empirical P-values through random sampling of the background network. Network parameters are set to sensible defaults, but can be altered to increase the size and number of the resulting sub-networks for further exploration of the data. The min sub-network size prevents creation of very small sub-networks while the sub-network size parameter is a constant introduced in the PhenomeExpress algorithm that influences the maximum size of the sub-networks. The P-value threshold allows filtering of the sub-networks by their statistical significance. Finally, phenotypes describing the disease under study are selected from a table with a filter function.
Fig. 1.

Schematic representation of the PhenomeScape app. (A) Example input data. (B) Pipeline for differentially regulated sub-network identification. (C) Example output of sub-network derived from the input PPI. Protein nodes are coloured by fold change (red increase, green decrease). Direct interactions between chosen phenotypes that describe the disease under study to their associated genes are indicated in blue

Schematic representation of the PhenomeScape app. (A) Example input data. (B) Pipeline for differentially regulated sub-network identification. (C) Example output of sub-network derived from the input PPI. Protein nodes are coloured by fold change (red increase, green decrease). Direct interactions between chosen phenotypes that describe the disease under study to their associated genes are indicated in blue

2.3 Sub-network identification

The PhenomeExpress algorithm has previously been described in detail. Briefly, the input PPI is linked to a provided phenotype-phenotype network via phenotype-gene associations (Fig. 1B). The expression data and the chosen phenotypes are used with a diffusion approach to calculate activity scores of proteins in the PPI. Subsequently, highly scoring groups of interacting proteins are identified as sub-networks.

2.4 PhenomeScape output

PhenomeScape creates all identified sub-networks for visualisation, coloured to indicate fold change for rapid identification of the most differentially expressed genes (Fig. 1C). Direct associations between the chosen phenotypes and the genes are shown allowing the user to view which genes have evidence of being previously linked to the disease phenotypes. The results panel shows the empirical P-value of the sub-networks and to the top enriched GO term to give an indication of the function of the de novo sub-networks.

3 Use case

Previously analysed gene expression data comparing human damaged and intact osteoarthritic knee cartilage was downloaded from ArrayExpress (E-MTAB-4304). This data was analysed as previously described to produce a table of all expressed genes and their corresponding fold changes and P-values (Dunn ). The default human network derived from human ConsensusPathDB was loaded from the PhenomeScape app menu. Using the standard table import tools the expression data spreadsheet was loaded and added to the network in Cytoscape by using the gene symbol as a key. The resulting human network has nodes labelled with expression data where the gene is expressed. Phenotypes relevant to the disease osteoarthritis were selected using the text box to filter the phenotype selection table. Increased and decreased susceptibility to arthritis, joint stiffness and knee osteoarthritis phenotypes were chosen based on review of the available phenotype terms. Using default network parameters, PhenomeScape identified 21 statistically significantly sub-networks including a sub-network with the top GO term enrichment of organisation of the extracellular matrix, shown in Figure 1C. This is consistent with the known perturbation of the extracellular matrix during osteoarthritis. Four phenotypes to associated gene links are shown in this sub-network, several genes of which are strongly differentially expressed demonstrating the utility of incorporating the prior disease knowledge.

4 Conclusion

PhenomeScape allows users to rapidly explore analysed gene expression data. Compared to existing tools such as ReactomeFIViz, PhenomeScape is unique in allowing integration of the prior phenotypic knowledge of disease genes into the expression analysis. No coding experience is needed to run PhenomeScape making it suitable for bench scientists who have gene expression data available. The resulting sub-networks produced in Cytoscape allow the user to extend the analysis with additional apps for regulatory analysis. PhenomeScape provides new insights in active processes revealed by expression data in the context of the prior knowledge of the disease phenotypes.
  9 in total

1.  Discovering regulatory and signalling circuits in molecular interaction networks.

Authors:  Trey Ideker; Owen Ozier; Benno Schwikowski; Andrew F Siegel
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

2.  STRING: a database of predicted functional associations between proteins.

Authors:  Christian von Mering; Martijn Huynen; Daniel Jaeggi; Steffen Schmidt; Peer Bork; Berend Snel
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

3.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

4.  Construction and accessibility of a cross-species phenotype ontology along with gene annotations for biomedical research.

Authors:  Sebastian Köhler; Sandra C Doelken; Barbara J Ruef; Sebastian Bauer; Nicole Washington; Monte Westerfield; George Gkoutos; Paul Schofield; Damian Smedley; Suzanna E Lewis; Peter N Robinson; Christopher J Mungall
Journal:  F1000Res       Date:  2013-02-01

5.  The BioMart community portal: an innovative alternative to large, centralized data repositories.

Authors:  Damian Smedley; Syed Haider; Steffen Durinck; Luca Pandini; Paolo Provero; James Allen; Olivier Arnaiz; Mohammad Hamza Awedh; Richard Baldock; Giulia Barbiera; Philippe Bardou; Tim Beck; Andrew Blake; Merideth Bonierbale; Anthony J Brookes; Gabriele Bucci; Iwan Buetti; Sarah Burge; Cédric Cabau; Joseph W Carlson; Claude Chelala; Charalambos Chrysostomou; Davide Cittaro; Olivier Collin; Raul Cordova; Rosalind J Cutts; Erik Dassi; Alex Di Genova; Anis Djari; Anthony Esposito; Heather Estrella; Eduardo Eyras; Julio Fernandez-Banet; Simon Forbes; Robert C Free; Takatomo Fujisawa; Emanuela Gadaleta; Jose M Garcia-Manteiga; David Goodstein; Kristian Gray; José Afonso Guerra-Assunção; Bernard Haggarty; Dong-Jin Han; Byung Woo Han; Todd Harris; Jayson Harshbarger; Robert K Hastings; Richard D Hayes; Claire Hoede; Shen Hu; Zhi-Liang Hu; Lucie Hutchins; Zhengyan Kan; Hideya Kawaji; Aminah Keliet; Arnaud Kerhornou; Sunghoon Kim; Rhoda Kinsella; Christophe Klopp; Lei Kong; Daniel Lawson; Dejan Lazarevic; Ji-Hyun Lee; Thomas Letellier; Chuan-Yun Li; Pietro Lio; Chu-Jun Liu; Jie Luo; Alejandro Maass; Jerome Mariette; Thomas Maurel; Stefania Merella; Azza Mostafa Mohamed; Francois Moreews; Ibounyamine Nabihoudine; Nelson Ndegwa; Céline Noirot; Cristian Perez-Llamas; Michael Primig; Alessandro Quattrone; Hadi Quesneville; Davide Rambaldi; James Reecy; Michela Riba; Steven Rosanoff; Amna Ali Saddiq; Elisa Salas; Olivier Sallou; Rebecca Shepherd; Reinhard Simon; Linda Sperling; William Spooner; Daniel M Staines; Delphine Steinbach; Kevin Stone; Elia Stupka; Jon W Teague; Abu Z Dayem Ullah; Jun Wang; Doreen Ware; Marie Wong-Erasmus; Ken Youens-Clark; Amonida Zadissa; Shi-Jian Zhang; Arek Kasprzyk
Journal:  Nucleic Acids Res       Date:  2015-04-20       Impact factor: 16.971

6.  PhenomeExpress: a refined network analysis of expression datasets by inclusion of known disease phenotypes.

Authors:  Jamie Soul; Timothy E Hardingham; Raymond P Boot-Handford; Jean-Marc Schwartz
Journal:  Sci Rep       Date:  2015-01-29       Impact factor: 4.379

7.  ConsensusPathDB--a database for integrating human functional interaction networks.

Authors:  Atanas Kamburov; Christoph Wierling; Hans Lehrach; Ralf Herwig
Journal:  Nucleic Acids Res       Date:  2008-10-21       Impact factor: 16.971

8.  The mammalian gene function resource: the International Knockout Mouse Consortium.

Authors:  Allan Bradley; Konstantinos Anastassiadis; Abdelkader Ayadi; James F Battey; Cindy Bell; Marie-Christine Birling; Joanna Bottomley; Steve D Brown; Antje Bürger; Carol J Bult; Wendy Bushell; Francis S Collins; Christian Desaintes; Brendan Doe; Aris Economides; Janan T Eppig; Richard H Finnell; Colin Fletcher; Martin Fray; David Frendewey; Roland H Friedel; Frank G Grosveld; Jens Hansen; Yann Hérault; Geoffrey Hicks; Andreas Hörlein; Richard Houghton; Martin Hrabé de Angelis; Danny Huylebroeck; Vivek Iyer; Pieter J de Jong; James A Kadin; Cornelia Kaloff; Karen Kennedy; Manousos Koutsourakis; K C Kent Lloyd; Susan Marschall; Jeremy Mason; Colin McKerlie; Michael P McLeod; Harald von Melchner; Mark Moore; Alejandro O Mujica; Andras Nagy; Mikhail Nefedov; Lauryl M Nutter; Guillaume Pavlovic; Jane L Peterson; Jonathan Pollock; Ramiro Ramirez-Solis; Derrick E Rancourt; Marcello Raspa; Jacques E Remacle; Martin Ringwald; Barry Rosen; Nadia Rosenthal; Janet Rossant; Patricia Ruiz Noppinger; Ed Ryder; Joel Zupicich Schick; Frank Schnütgen; Paul Schofield; Claudia Seisenberger; Mohammed Selloum; Elizabeth M Simpson; William C Skarnes; Damian Smedley; William L Stanford; A Francis Stewart; Kevin Stone; Kate Swan; Hamsa Tadepally; Lydia Teboul; Glauco P Tocchini-Valentini; David Valenzuela; Anthony P West; Ken-ichi Yamamura; Yuko Yoshinaga; Wolfgang Wurst
Journal:  Mamm Genome       Date:  2012-09-12       Impact factor: 2.957

9.  Gene expression changes in damaged osteoarthritic cartilage identify a signature of non-chondrogenic and mechanical responses.

Authors:  S L Dunn; J Soul; S Anand; J-M Schwartz; R P Boot-Handford; T E Hardingham
Journal:  Osteoarthritis Cartilage       Date:  2016-03-10       Impact factor: 6.576

  9 in total
  5 in total

1.  The common pathophysiologic threads between Asian Indian diabetic's 'Thin Fat Phenotype' and partial lipodystrophy: the peripheral adipose tissue transcriptomic evidences.

Authors:  Aditya Saxena; Pradeep Tiwari; Nitin Wahi; Anshul Kumar; Sandeep Kumar Mathur
Journal:  Adipocyte       Date:  2020-01-01       Impact factor: 4.534

2.  Deciphering the scalene association among type-2 diabetes mellitus, prostate cancer, and chronic myeloid leukemia via enrichment analysis of disease-gene network.

Authors:  Qiong Liu; Yingying Zhang; Pengqian Wang; Jun Liu; Bing Li; Yanan Yu; Hongli Wu; Ruixia Kang; Xiaoxu Zhang; Zhong Wang
Journal:  Cancer Med       Date:  2019-04-01       Impact factor: 4.452

3.  Stratification of knee osteoarthritis: two major patient subgroups identified by genome-wide expression analysis of articular cartilage.

Authors:  Jamie Soul; Sara L Dunn; Sanjay Anand; Ferdinand Serracino-Inglott; Jean-Marc Schwartz; Ray P Boot-Handford; Tim E Hardingham
Journal:  Ann Rheum Dis       Date:  2017-12-22       Impact factor: 19.103

4.  Subgroup analysis reveals molecular heterogeneity and provides potential precise treatment for pancreatic cancers.

Authors:  Heying Zhang; Juan Zeng; Yongqiang Tan; Lin Lu; Cheng Sun; Yusi Liang; Huawei Zou; Xianghong Yang; Yonggang Tan
Journal:  Onco Targets Ther       Date:  2018-09-12       Impact factor: 4.147

5.  Functional Interactomes of Genes Showing Association with Type-2 Diabetes and Its Intermediate Phenotypic Traits Point towards Adipo-Centric Mechanisms in Its Pathophysiology.

Authors:  Aditya Saxena; Nitin Wahi; Anshul Kumar; Sandeep Kumar Mathur
Journal:  Biomolecules       Date:  2020-04-13
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.