Literature DB >> 30847467

Automated exploration of gene ontology term and pathway networks with ClueGO-REST.

Bernhard Mlecnik1,2, Jérôme Galon1, Gabriela Bindea1.   

Abstract

SUMMARY: Large scale technologies produce massive amounts of experimental data that need to be investigated. To improve their biological interpretation we have developed ClueGO, a Cytoscape App that selects representative Gene Onology terms and pathways for one or multiple lists of genes/proteins and visualizes them into functionally organized networks. Because of its reliability, userfriendliness and support of many species ClueGO gained a large community of users. To further allow scientists programmatic access to ClueGO with R, Python, JavaScript etc., we implemented the cyREST API into ClueGO. In this article we describe this novel, complementary way of accessing ClueGO via REST, and provide R and Phyton examples to demonstrate how ClueGO workflows can be integrated into bioinformatic analysis pipelines.
AVAILABILITY AND IMPLEMENTATION: ClueGO is available in the Cytoscape App Store (http://apps.cytoscape.org/apps/cluego). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30847467      PMCID: PMC6761950          DOI: 10.1093/bioinformatics/btz163

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

High-throughput technologies produce large amounts of experimental data that need to be investigated to gain insights into biological processes. Systemic approaches for data integration, analysis and visualization were developed to reflect not only individual biological components, but also their interactions in pathways and networks. Software tools that perform such type of analyses in an automatic way are more and more needed. Cytoscape (Shannon ) is a major computational platform to visualize and analyze networks. Cytoscape Automation (https://github.com/cytoscape/cytoscape-automation) enables scientific workflows written in many languages and scales Cytoscape to large datasets and pipelines. The Cytoscape programmatic interface for this is CyREST (Ono ). We have contributed to the Cytoscape App collection (Saito ) with ClueGO (Bindea ) and CluePedia (Bindea ) apps that are broadly used by the scientific community to enhance the interpretation of biological data (Mlecnik ). Within ClueGO, representative gene ontology (GO) terms (Ashburner ) as well as KEGG (Kanehisa ), WikiPathways (Pico ) and Reactome (Croft ) pathways are integrated into a functionally organized network. Furthermore, ClueGO can compare the biological role of several lists of genes/proteins. To perform multiple analyses in complex workflows can be time consuming and prone to errors. We have thus enabled the cyREST Application Programming Interface (API) in ClueGO, to allow scientists programmatic access to functional analyses. We describe here this new functionality of the ClueGO App.

2 ClueGO functional analysis via cyREST

The Cytoscape App Manager (http://cytoscape.org/) allows the automatic download of the latest version of ClueGO from the App Store (Lotia ). The yFiles Layout Algorithms App should be installed as well. ClueGO is written in Java programing language implementing the OSGi interface of Cytoscape. Starting with version 2.5.0 ClueGO implements the cyREST core plugin API and provides programmatic access to its functionality (Fig. 1A). ClueGO features REST enabled can be explored in the cyREST API Swagger (Fig. 1B), accessible via the Cytoscape menu, Help → Automation → CyREST API.
Fig. 1.

(A) ClueGO can be accessed through the Cytoscape Graphical User Interface (GUI) and the cyREST Programmatic User Interface (UI). (B) ClueGO functions REST enabled in the CyREST API Swagger

(A) ClueGO can be accessed through the Cytoscape Graphical User Interface (GUI) and the cyREST Programmatic User Interface (UI). (B) ClueGO functions REST enabled in the CyREST API Swagger ClueGO can be hence accessed with both the graphical user interface of Cytoscape as well as programmatical through cyREST (Fig. 1 and Supplementary Material). These two ways of performing analyses complement each other and answer to different types of user requests.

2.1 ClueGO analysis steps

The four main steps of a typical ClueGO analysis and additional optional REST enabled features are shown in Supplementary Table S1. The ClueGO-REST enabled functions are accessed by a URL that is built up by the host address (e.g. localhost), the port (e.g. 1234) and the feature requested by the user. The HTTP request return types are then encoded as JSON, tab delimited text or binary data. The selection of an organism is the first step. Human and mouse data sources are included by default in ClueGO and more than 200 other organisms are available for download. The organism to analyze has to be set e.g. human:/v1/apps/cluego/cluego-manager/organisms/set-organism/‘Homo Sapiens’. The second step requires the upload of one or several lists of genes/proteins to analyze. ClueGO automatically recognizes multiple identifier types based on information from NCBI (NCBI Resource Coordinators, 2018), UniProtKB (The UniProt Consortium, 2017) and Ensembl (Aken ) databases. Different colors and shapes are automatically attributed to the clusters, to visualize them on the network. In the third step representative GO terms and pathways are selected using predefined filters based on the number of associated genes found from the uploaded list, their percentage from the total number of genes of the term or the GO tree level. Additionally, GO terms can be selected based on particular evidence codes of the gene-term associations. The significance of the pathways and their similarity in terms of associated genes are automatically mapped on the network and illustrated in different interchangeable visual styles. After running the enrichment analysis through the last Step 4 the functionally grouped network with the terms connected and grouped based on kappa score is created (Supplementary Fig. S1). The kappa score is calculated by taking into account how many genes are shared among two terms and is also used to define the functional groups of terms and pathways. The results of the statistical analysis, the network as well as other graphical representations of the results can be downloaded through cyREST functions. The analysis and visualization can be customized. Large networks can be refined by applying the fusion of similar terms or by visualizing only significant pathways (optional steps). Debug messages appear during the workflow if mandatory analysis steps are skipped, if the term selection is too restrictive/permissive or when other exceptions occur.

2.2 Use cases

R and Python ClueGO analysis examples with one or two lists of genes are provided as Supplementary Material. B and NK cell genes (Critchley-Thorne ; Edgar ) were analysed, and results are illustrated as a network of pathways showing either functional groups (Supplementary Fig. S1) or the origin of the genes in the two lists (Supplementary Fig. S2). All available ClueGO-REST endpoints are illustrated, including optional features.

3 Summary

In this article, we describe the REST enabled ClueGO functionality and show how scientists can integrate ClueGO with other Cytoscape apps and non-Cytoscape libraries to perform functional analyses in a programmatic way. Detailed information on ClueGO can be found at http://www.ici.upmc.fr/cluego. Click here for additional data file.
  16 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  The KEGG databases at GenomeNet.

Authors:  Minoru Kanehisa; Susumu Goto; Shuichi Kawashima; Akihiro Nakaya
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository.

Authors:  Ron Edgar; Michael Domrachev; Alex E Lash
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

4.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

5.  Reactome: a database of reactions, pathways and biological processes.

Authors:  David Croft; Gavin O'Kelly; Guanming Wu; Robin Haw; Marc Gillespie; Lisa Matthews; Michael Caudy; Phani Garapati; Gopal Gopinath; Bijay Jassal; Steven Jupe; Irina Kalatskaya; Shahana Mahajan; Bruce May; Nelson Ndegwa; Esther Schmidt; Veronica Shamovsky; Christina Yung; Ewan Birney; Henning Hermjakob; Peter D'Eustachio; Lincoln Stein
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

6.  UniProt: the universal protein knowledgebase.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

7.  ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks.

Authors:  Gabriela Bindea; Bernhard Mlecnik; Hubert Hackl; Pornpimol Charoentong; Marie Tosolini; Amos Kirilovsky; Wolf-Herman Fridman; Franck Pagès; Zlatko Trajanoski; Jérôme Galon
Journal:  Bioinformatics       Date:  2009-02-23       Impact factor: 6.937

8.  Cytoscape app store.

Authors:  Samad Lotia; Jason Montojo; Yue Dong; Gary D Bader; Alexander R Pico
Journal:  Bioinformatics       Date:  2013-04-16       Impact factor: 6.937

9.  CluePedia Cytoscape plugin: pathway insights using integrated experimental and in silico data.

Authors:  Gabriela Bindea; Jérôme Galon; Bernhard Mlecnik
Journal:  Bioinformatics       Date:  2013-01-16       Impact factor: 6.937

10.  WikiPathways: pathway editing for the people.

Authors:  Alexander R Pico; Thomas Kelder; Martijn P van Iersel; Kristina Hanspers; Bruce R Conklin; Chris Evelo
Journal:  PLoS Biol       Date:  2008-07-22       Impact factor: 8.029

View more
  23 in total

1.  Investigation of the Potential Mechanism Governing the Effect of the Shen Zhu San on COVID-19 by Network Pharmacology.

Authors:  Yuxuan Wang; Yuhua Ru; Guowei Zhuo; Maozheng Sheng; Shuangqiu Wang; Jiarui Ma; Chongyi Zhou; Xiaohe Sun; Yanqi Zeng; Ya Zhang; Hui Li; Zhigang Lu; Depei Wu; Minghua Wu
Journal:  Evid Based Complement Alternat Med       Date:  2020-11-07       Impact factor: 2.629

2.  ORA , FCS , and PT Strategies in Functional Enrichment Analysis.

Authors:  Marco Fernandes; Holger Husi
Journal:  Methods Mol Biol       Date:  2021

3.  Non-invasive Intrauterine Administration of Botulinum Toxin A Enhances Endometrial Angiogenesis and Improves the Rates of Embryo Implantation.

Authors:  Hwa Seon Koo; Min-Ji Yoon; Seon-Hwa Hong; Jungho Ahn; Hwijae Cha; Danbi Lee; Chan Woo Park; Youn-Jung Kang
Journal:  Reprod Sci       Date:  2021-03-01       Impact factor: 3.060

4.  Regulatory T Cells Play a Role in a Subset of Idiopathic Preterm Labor/Birth and Adverse Neonatal Outcomes.

Authors:  Nardhy Gomez-Lopez; Marcia Arenas-Hernandez; Roberto Romero; Derek Miller; Valeria Garcia-Flores; Yaozhu Leng; Yi Xu; Jose Galaz; Sonia S Hassan; Chaur-Dong Hsu; Harley Tse; Carmen Sanchez-Torres; Bogdan Done; Adi L Tarca
Journal:  Cell Rep       Date:  2020-07-07       Impact factor: 9.423

5.  Disclosing the Interactome of Leukemogenic NUP98-HOXA9 and SET-NUP214 Fusion Proteins Using a Proteomic Approach.

Authors:  Adélia Mendes; Ramona Jühlen; Sabrina Bousbata; Birthe Fahrenkrog
Journal:  Cells       Date:  2020-07-10       Impact factor: 6.600

6.  Risk of epilepsy in rheumatoid arthritis: a meta-analysis of population based studies and bioinformatics analysis.

Authors:  Huawei Zhao; Shan Li; Meijuan Xie; Rongrong Chen; Haimei Lu; Chengping Wen; Anthony J Filiano; Zhenghao Xu
Journal:  Ther Adv Chronic Dis       Date:  2020-02-07       Impact factor: 5.091

7.  A Coordinated Approach by Public Domain Bioinformatics Resources to Aid the Fight Against Alzheimer's Disease Through Expert Curation of Key Protein Targets.

Authors:  Lionel Breuza; Cecilia N Arighi; Ghislaine Argoud-Puy; Cristina Casals-Casas; Anne Estreicher; Maria Livia Famiglietti; George Georghiou; Arnaud Gos; Nadine Gruaz-Gumowski; Ursula Hinz; Nevila Hyka-Nouspikel; Barbara Kramarz; Ruth C Lovering; Yvonne Lussi; Michele Magrane; Patrick Masson; Livia Perfetto; Sylvain Poux; Milagros Rodriguez-Lopez; Christian Stoeckert; Shyamala Sundaram; Li-San Wang; Elizabeth Wu; Sandra Orchard
Journal:  J Alzheimers Dis       Date:  2020       Impact factor: 4.472

8.  High Expression Levels of CDK1 and CDC20 in Patients With Lung Squamous Cell Carcinoma are Associated With Worse Prognosis.

Authors:  Huan Deng; Qingqing Hang; Dijian Shen; Hangjie Ying; Yibi Zhang; Xu Qian; Ming Chen
Journal:  Front Mol Biosci       Date:  2021-07-07

9.  A Pipeline to Call Multilevel Expression Changes between Cancer and Normal Tissues and Its Applications in Repurposing Drugs Effective for Gastric Cancer.

Authors:  Wei Gao; Jianwei Yang; Changhua Zhuo; Sha Huang; Jinyuan Lin; Guangfeng Wu; Min Zhou
Journal:  Biomed Res Int       Date:  2020-08-05       Impact factor: 3.411

10.  Identification of Methylation-Regulated Differentially Expressed Genes and Related Pathways in Hepatocellular Carcinoma: A Study Based on TCGA Database and Bioinformatics Analysis.

Authors:  Yu Liang; Bin Ma; Peng Jiang; Hong-Mei Yang
Journal:  Front Oncol       Date:  2021-06-03       Impact factor: 6.244

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.