Literature DB >> 30124768

PathwayConnector: finding complementary pathways to enhance functional analysis.

George Minadakis1, Margarita Zachariou1, Anastasis Oulas1, George M Spyrou1.   

Abstract

SUMMARY: PathwayConnector is a web-tool that facilitates the construction of complementary pathway-to-pathway networks and subnetworks of them, based on a reference pathway network derived from the rich information available either in KEGG or Reactome database for pathway mapping. Specifically, for a given set of pathways, PathwayConnector (i) finds all the direct connections between them, (ii) adds a minimum set of complementary pathways required to achieve connectivity between the pathways, leading to informative fully connected networks and (ii) provides a series of clustering methods for the further grouping of pathways in to sub-clusters. The proposed web-tool is a simple yet informative tool towards identifying connected groups of pathways that are significantly related to specific diseases.
AVAILABILITY AND IMPLEMENTATION: http://bioinformatics.cing.ac.cy/PathwayConnector. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2019        PMID: 30124768      PMCID: PMC6394395          DOI: 10.1093/bioinformatics/bty693

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Pathway-based analysis allows for a comprehensive understanding of the molecular mechanisms related to complex diseases. Although there are several available pathway analysis methods (Jin ), the way in which human pathways are functionally linked within an overall network of existing human pathways is largely unexplored. In classical pathway analysis gene lists, usually obtained from any experimental-computational method, can be further analyzed by relevant software tools, that allow enrichment analysis to be performed based on prior knowledge gene-set libraries and pathways connected to them (Kuleshov ). Such tools may provide significant score-based information on how genes are involved into pathways and pathways to a disease, but also have several limitations that may fall in to the following criticisms. Firstly, it is not clear whether these pathways are functionally linked. Second, when filtering the pathways with a specific threshold on their score, important yet not statistically significant pathways might be excluded whereas non-relevant yet statistically important pathways might be included. Third, when dealing with large lists of pathways it is not always straightforward to relate them to a specific biological status building fully functional stories based on them. To this extend, the proposed PathwayConnector is a web-tool that provides an easy way for rapidly relating pathways together, by creating complementary networks of pathways related to a specific biological status. PathwayConnector is able to provide with respect to a specific reference network: (i) direct connections between pathways of interest (first-neighbors approach), (ii) complementary networks that show the shortest paths between pathways of interest and the intermediate pathways involved between them and (iii) additional clustering approaches to highlight communities (clusters) of pathways.

2 Implementation

For a number of species where data are available, an overall reference pathway-to-pathway network has been developed, that covers all the possible connections that exist between all the available pathways, as referenced in two major pathway databases: KEGG (Kanehisa, 2002) and Reactome (Fabregat ). The derived pathway-pathway network acts as a reference network map for retrieving information about the functional interconnections between pathways of interest. For this we focused on the connectivity information included in the selected database (KEGG: biochemical relationships, Reactome: relations of signaling and metabolic organized into biological pathways and processes). The output of a data mining process over the relationships that each database offers, provided us with all the links between pathways, which were used as edges to construct the undirected, unweighted pathway network. This network is regularly updated and works as the main pathway repository for the services/methods PathwayConnector draws from. Users can provide two types of inputs, as shown in Figure 1A and B: (i) lists of genes as notated in either KEGG’s or Reactome’s repository, and (ii) lists of pathway IDs, that can be found in either KEGG’s or Reactome’s website. When the web-tool detects a gene list it automatically applies enrichment analysis, by means of (i) the EnrichR package required for enrichment analysis of human model organism (Kuleshov ), (ii) the “clusterProfiler” package (Yu ), required for enrichment analysis of non-human species employed in KEGG’s repository and (iii) the “ReactomePA” package (Yu and He, 2016), required for the enrichment of non-human species included in Reactome’s repository. In this way, users are able to obtain the top-scored pathway list required for the network analysis. Users can either examine whether the input genes have been correctly enriched, or re-run the enrichment process with different gene synonyms and scoring methods (see Supplementary Fig. S1). Both types of input lead to an initial network of pathways, as shown Figure 1C, showing the way in which six pathways are directly connected (or not) to each other within the reference pathway network. Herein the methodology that follows, draws from the graph theory as well as the “igraph” R package (Csardi and Nepusz, 2006) that has been employed for the implementation of this web-tool.
Fig. 1.

Schematic workflow of the PathwayConnector web-tool

Schematic workflow of the PathwayConnector web-tool

2.1 The missing pathway approach

Following the initial network construction based on the given (or the derived) pathway list, our proposed methodology identifies and adds the key nodes which ensure the minimal connectivity of the network, i.e. each node in the original list is connected to at least another original node. For this, a specific algorithm finds and calculates all the shortest paths within the reference network that interconnect these nodes together. Then, the algorithm chooses those nodes which belong to the shortest path-length to be included in the final network. For more than one shortest paths of the same path-length, these paths are also included in to the final network. Figure 1D depicts the complementary added 17 nodes to the initial 6-node network which contained two standalone nodes.

2.2 Clustering the complementary network

For small and well connected pathway networks, the latter methodology is often sufficient for further investigation. However, for large and disconnected/sparse pathway networks, the missing pathway approach often leads to even larger complementary networks whose analysis and interpretation with respect to a certain biological status becomes more challenging. To address this, as part of this web-tool we provide the option of clustering the pathways in sub-clusters through a series of community structure detection algorithms (clustering algorithms, further detailed in the Supplementary Material and Supplementary Fig. S2) as shown in Figure 1E. Clustering can be used to separate the complementary networks into sub-networks, and in effect to provide more comprehensive information related to a biological status.

3 Novelty and applications

The described methodology has been applied, with noteworthy results to pathways related to Alzheimer’s Disease (AD) (Zachariou ), in a framework which: (i) integrates knowledge, (ii) prioritises genes and (iii) utilizes the PathwayConnector to optimize pathway selection for further interpretation with respect to pathway mechanisms underlying AD. Additional success stories include its application to Huntington’s disease (HD) and Spastic Ataxia (SA) (Kakouri ). PathwayConnector, when applied in post-pathway analysis on results concerning HD, SA and AD provided us with pathways that were not included in the classical pathway analysis. Specifically, in the case of HD, three pathways identified that were not included in the enrichment analysis results. In the case of SA, the shortest path algorithm provided 14 additional pathways, which were not initially included in the pathway list of the classical pathway enrichment analysis. In the case of Alzheimer’s disease seven additional pathways were provided. By considering the topological information provided by the reference pathway–pathway network through PathwayConnector, we can ensure that key pathways functionally connected to the top selected pathways are not lost due to an ad-hoc cutoff or over-representation statistics in the classical pathway enrichment analysis. We expect that the proposed PathwayConnector will be a valuable tool to complement and enhance classical pathway analysis and offer further insights on the functional inter-relation of pathways of interest.

Funding

This work was supported by the European Commission Research Executive Agency Grant BIORISE (No. 669026), under the Spreading Excellence, Widening Participation, Science with and for Society Framework. This work was partly supported by H2020- WIDESPREAD-04-2017-Teaming Phase 1, Grant Agreement 763781, Integrated Precision Medicine Technologies. Conflict of Interest: none declared. Click here for additional data file.
  8 in total

Review 1.  The KEGG database.

Authors:  Minoru Kanehisa
Journal:  Novartis Found Symp       Date:  2002

2.  clusterProfiler: an R package for comparing biological themes among gene clusters.

Authors:  Guangchuang Yu; Li-Gen Wang; Yanyan Han; Qing-Yu He
Journal:  OMICS       Date:  2012-03-28

3.  ReactomePA: an R/Bioconductor package for reactome pathway analysis and visualization.

Authors:  Guangchuang Yu; Qing-Yu He
Journal:  Mol Biosyst       Date:  2016-02

4.  Integrating multi-source information on a single network to detect disease-related clusters of molecular mechanisms.

Authors:  Margarita Zachariou; George Minadakis; Anastasis Oulas; Sotiroula Afxenti; George M Spyrou
Journal:  J Proteomics       Date:  2018-03-13       Impact factor: 4.044

5.  Revealing Clusters of Connected Pathways Through Multisource Data Integration in Huntington's Disease and Spastic Ataxia.

Authors:  Andrea C Kakouri; Christiana C Christodoulou; Margarita Zachariou; Anastasis Oulas; George Minadakis; Christiana A Demetriou; Christina Votsi; Eleni Zamba-Papanicolaou; Kyproula Christodoulou; George M Spyrou
Journal:  IEEE J Biomed Health Inform       Date:  2018-08-30       Impact factor: 5.772

6.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update.

Authors:  Maxim V Kuleshov; Matthew R Jones; Andrew D Rouillard; Nicolas F Fernandez; Qiaonan Duan; Zichen Wang; Simon Koplev; Sherry L Jenkins; Kathleen M Jagodnik; Alexander Lachmann; Michael G McDermott; Caroline D Monteiro; Gregory W Gundersen; Avi Ma'ayan
Journal:  Nucleic Acids Res       Date:  2016-05-03       Impact factor: 16.971

Review 7.  Pathway-based analysis tools for complex diseases: a review.

Authors:  Lv Jin; Xiao-Yu Zuo; Wei-Yang Su; Xiao-Lei Zhao; Man-Qiong Yuan; Li-Zhen Han; Xiang Zhao; Ye-Da Chen; Shao-Qi Rao
Journal:  Genomics Proteomics Bioinformatics       Date:  2014-10-28       Impact factor: 7.691

8.  Reactome diagram viewer: data structures and strategies to boost performance.

Authors:  Antonio Fabregat; Konstantinos Sidiropoulos; Guilherme Viteri; Pablo Marin-Garcia; Peipei Ping; Lincoln Stein; Peter D'Eustachio; Henning Hermjakob
Journal:  Bioinformatics       Date:  2018-04-01       Impact factor: 6.937

  8 in total
  10 in total

1.  Neurophysiological and Genetic Findings in Patients With Juvenile Myoclonic Epilepsy.

Authors:  Stefani Stefani; Ioanna Kousiappa; Nicoletta Nicolaou; Eleftherios S Papathanasiou; Anastasis Oulas; Pavlos Fanis; Vassos Neocleous; Leonidas A Phylactou; George M Spyrou; Savvas S Papacostas
Journal:  Front Integr Neurosci       Date:  2020-08-20

2.  A TCM Formula YYWY Inhibits Tumor Growth in Non-Small Cell Lung Cancer and Enhances Immune-Response Through Facilitating the Maturation of Dendritic Cells.

Authors:  Bei Zhao; Xiaodan Hui; Lijing Jiao; Ling Bi; Lei Wang; Piao Huang; Wenxiao Yang; Yinan Yin; Shenyi Jin; Chengyan Wang; Xue Zhang; Ling Xu
Journal:  Front Pharmacol       Date:  2020-06-09       Impact factor: 5.810

3.  ComPath: an ecosystem for exploring, analyzing, and curating mappings across pathway databases.

Authors:  Daniel Domingo-Fernández; Charles Tapley Hoyt; Carlos Bobis-Álvarez; Josep Marín-Llaó; Martin Hofmann-Apitius
Journal:  NPJ Syst Biol Appl       Date:  2018-12-13

4.  Inhibition of Breast Cancer Cell Invasion by Ras Suppressor-1 (RSU-1) Silencing Is Reversed by Growth Differentiation Factor-15 (GDF-15).

Authors:  Vasiliki Gkretsi; Maria Louca; Andreas Stylianou; George Minadakis; George M Spyrou; Triantafyllos Stylianopoulos
Journal:  Int J Mol Sci       Date:  2019-01-04       Impact factor: 5.923

5.  Topological integration of RPPA proteomic data with multi-omics data for survival prediction in breast cancer via pathway activity inference.

Authors:  Tae Rim Kim; Hyun-Hwan Jeong; Kyung-Ah Sohn
Journal:  BMC Med Genomics       Date:  2019-07-11       Impact factor: 3.063

6.  APE1 controls DICER1 expression in NSCLC through miR-33a and miR-130b.

Authors:  Giulia Antoniali; Emiliano Dalla; Giovanna Mangiapane; Xiaolong Zhao; Xinming Jing; Yi Cheng; Veronica De Sanctis; Dilara Ayyildiz; Silvano Piazza; Mengxia Li; Gianluca Tell
Journal:  Cell Mol Life Sci       Date:  2022-07-25       Impact factor: 9.207

7.  Functional Interactomes of Genes Showing Association with Type-2 Diabetes and Its Intermediate Phenotypic Traits Point towards Adipo-Centric Mechanisms in Its Pathophysiology.

Authors:  Aditya Saxena; Nitin Wahi; Anshul Kumar; Sandeep Kumar Mathur
Journal:  Biomolecules       Date:  2020-04-13

8.  Investigating the Transition of Pre-Symptomatic to Symptomatic Huntington's Disease Status Based on Omics Data.

Authors:  Christiana C Christodoulou; Margarita Zachariou; Marios Tomazou; Evangelos Karatzas; Christiana A Demetriou; Eleni Zamba-Papanicolaou; George M Spyrou
Journal:  Int J Mol Sci       Date:  2020-10-08       Impact factor: 5.923

9.  Transcriptomic characterization of tissues from patients and subsequent pathway analyses reveal biological pathways that are implicated in spastic ataxia.

Authors:  Andrea C Kakouri; Christina Votsi; Anastasis Oulas; Paschalis Nicolaou; Massimo Aureli; Giulia Lunghi; Maura Samarani; Giacomo M Compagnoni; Sabrina Salani; Alessio Di Fonzo; Thalis Christophides; George A Tanteles; Eleni Zamba-Papanicolaou; Marios Pantzaris; George M Spyrou; Kyproula Christodoulou
Journal:  Cell Biosci       Date:  2022-03-11       Impact factor: 7.133

10.  Identification of viral-mediated pathogenic mechanisms in neurodegenerative diseases using network-based approaches.

Authors:  Anna Onisiforou; George M Spyrou
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.