Literature DB >> 27993775

GENIUS: web server to predict local gene networks and key genes for biological functions.

Tomas Puelma1,2, Viviana Araus1, Javier Canales1, Elena A Vidal1, Juan M Cabello1, Alvaro Soto2, Rodrigo A Gutiérrez1.   

Abstract

Summary: GENIUS is a user-friendly web server that uses a novel machine learning algorithm to infer functional gene networks focused on specific genes and experimental conditions that are relevant to biological functions of interest. These functions may have different levels of complexity, from specific biological processes to complex traits that involve several interacting processes. GENIUS also enriches the network with new genes related to the biological function of interest, with accuracies comparable to highly discriminative Support Vector Machine methods. Availability and Implementation: GENIUS currently supports eight model organisms and is freely available for public use at http://networks.bio.puc.cl/genius . Contact: genius.psbl@gmail.com. Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2017        PMID: 27993775      PMCID: PMC5408817          DOI: 10.1093/bioinformatics/btw702

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Biologists often pursue the difficult task of finding key genes to modulate biological functions or complex traits of interest (e.g. disease, growth, yield, plant nitrogen-use efficiency, water-use efficiency). These functions and traits are governed by networks of functionally interacting genes and gene products. Unfortunately, the large number of genes, and the many and intricate interactions among them, make it difficult to discern which genes are relevant. As a consequence, biologists use a mix of prior knowledge and intuition to choose which genes to focus and which to ignore for detailed experimental work (Moreau and Tranchevent, 2012). This has motivated the development of many reverse engineering computational methods and tools that use experimental data to infer and analyze gene networks (Aoki ; Franceschini ; Hruz ; Jupiter ; Obayashi and Kinoshita, 2010; Obayashi ; Stuart ; Usadel ; Vandepoele ; Warde-Farley ). An inconvenience of current methods is that they use a wide and often fixed set of experimental data and genes, which limits their predictive capabilities. To tackle these problems, in previous work we presented DLS, a machine learning method to infer local and discriminative gene networks that can pinpoint new and key genes related to specific biological functions or even complex traits of interest (Puelma ). DLS accomplishes this by using supervised machine learning to find expression signatures: expression patterns present in particular genes and experimental conditions that are distinctive and relevant for the biological function defined by the user. These signatures are used by DLS to find coexpression associations and construct a gene network focused in specific biological contexts that are relevant for the biological function of interest. This ability is the key feature that distinguishes DLS from other network inference algorithms and popular tools like GeneMANIA (Warde-Farley ), allowing it to find relevant genes and relationships that these tools may miss. Here we present GENIUS (GEne Networks Inference Using Signatures), a web server with a user-friendly interface to DLS that allows the scientific community to fully exploit its capabilities. GENIUS incorporates Gene Ontology annotations and thousands of microarrays experiments from Gene Expression Omnibus (GEO) for eight model organisms: A. thaliana, C. elegans, D. rerio, D. melanogaster, E. coli, H. sapiens, M. musculus and S. cerevisiae. Depending on existing annotations, the prediction performance ranges from 54 to 94% (Supplementary Fig. S5). In addition, GENIUS adds tools to visualize and analyze gene networks, including integration with Cytoscape (Smoot ), an advanced network analysis software platform.

2 Results and discussion

Starting a new prediction in GENIUS is simple. The first step is to define a query list of genes related to the biological function or trait of interest. This can be done by directly adding a list of gene identifiers and/or by selecting a list of GO biological processes. Users can select which GO evidence codes to use when importing GO annotations. They can also select the stringency of predictions, allowing them to obtain a more exploratory or reliable network prediction (Supplementary Fig. S1). The predicted network contains genes from the query list, as well as newly predicted genes functionally related to them, which can be ranked according to various properties (Supplementary Fig. S2). Section 1.1 of Supplementary Information presents a case study comparing the predictions of GENIUS and GeneMANIA for nitrate response in A. thaliana. Both tools correctly predict a total of 25 genes that have been validated by published evidence. Of these, 16 genes are common for both tools, while 9 are exclusive to GENIUS and 9 to GeneMANIA, showing that GENIUS can provide complementary and valuable predictions.

2.1 GENIUS automatically selects relevant expression signatures

A key aspect of GENIUS is its ability to automatically select genes and experimental conditions containing discriminative expression patterns or ‘expression signatures’. Examining the discovered signatures can be useful to identify key genes for the biological function of interest, as well as experimental conditions under which these genes may be experimentally verified. Section 1.1.3 of Supplementary Information shows that GENIUS selects relevant and biologically sound expression signatures for the nitrate response case study.

2.2 GENIUS can uncover key regulators for complex traits

A key aspect of GENIUS is that it can infer local functional networks that expose central genes acting at the intersection of several biological processes. GENIUS allows users to easily identify these central genes by ranking them according to centrality indicators used in network theory, like degree centrality (DC) and betweenness centrality (BC) (Section 1.2.1 of Supplementary Information) (Yu ). As an example, we experimentally validated a prediction performed with GENIUS to find genes modulating nitrogen-use efficiency in A. thaliana (Araus ). Additionally, Section 1.2 of Supplementary Information illustrates this use of GENIUS in detail, applied to the drought tolerance trait in A. thaliana. In contrast to GeneMANIA, GENIUS predicts a network with a scale-free topology, allowing it to take advantage of network theory centrality indicators to pinpoint relevant genes. Thus, we believe the use of GENIUS web server can facilitate biologists to generate sophisticated hypothesis, make novel discoveries, and ultimately, improve our molecular understanding of biological systems.

Funding

This work was funded by grants from the Howard Hughes Medical Institute, Fondo de Desarrollo de Areas Prioritarias (FONDAP) Center for Genome Regulation (15090007), Millennium Nucleus Center for Plant Systems and Synthetic Biology (NC130030) and Fondo Nacional de Desarrollo Científico y Tecnológico (FONDECYT) 1141097 to RAG. Conflict of Interest: none declared. Click here for additional data file.
  15 in total

Review 1.  Computational tools for prioritizing candidate genes: boosting disease gene discovery.

Authors:  Yves Moreau; Léon-Charles Tranchevent
Journal:  Nat Rev Genet       Date:  2012-07-03       Impact factor: 53.242

Review 2.  Coexpression landscape in ATTED-II: usage of gene list and gene network for various types of pathways.

Authors:  Takeshi Obayashi; Kengo Kinoshita
Journal:  J Plant Res       Date:  2010-04-10       Impact factor: 2.629

Review 3.  Approaches for extracting practical information from gene co-expression networks in plant biology.

Authors:  Koh Aoki; Yoshiyuki Ogata; Daisuke Shibata
Journal:  Plant Cell Physiol       Date:  2007-01-23       Impact factor: 4.927

Review 4.  Co-expression tools for plant biology: opportunities for hypothesis generation and caveats.

Authors:  Björn Usadel; Takeshi Obayashi; Marek Mutwil; Federico M Giorgi; George W Bassel; Mimi Tanimoto; Amanda Chow; Dirk Steinhauser; Staffan Persson; Nicholas J Provart
Journal:  Plant Cell Environ       Date:  2009-08-27       Impact factor: 7.228

5.  Members of BTB Gene Family of Scaffold Proteins Suppress Nitrate Uptake and Nitrogen Use Efficiency.

Authors:  Viviana Araus; Elena A Vidal; Tomas Puelma; Simón Alamos; Delphine Mieulet; Emmanuel Guiderdoni; Rodrigo A Gutiérrez
Journal:  Plant Physiol       Date:  2016-04-27       Impact factor: 8.340

6.  Discriminative local subspaces in gene expression data for effective gene function prediction.

Authors:  Tomas Puelma; Rodrigo A Gutiérrez; Alvaro Soto
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

7.  Cytoscape 2.8: new features for data integration and network visualization.

Authors:  Michael E Smoot; Keiichiro Ono; Johannes Ruscheinski; Peng-Liang Wang; Trey Ideker
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

8.  Genevestigator v3: a reference expression database for the meta-analysis of transcriptomes.

Authors:  Tomas Hruz; Oliver Laule; Gabor Szabo; Frans Wessendorp; Stefan Bleuler; Lukas Oertle; Peter Widmayer; Wilhelm Gruissem; Philip Zimmermann
Journal:  Adv Bioinformatics       Date:  2008-07-08

9.  COXPRESdb: a database of comparative gene coexpression networks of eleven species for mammals.

Authors:  Takeshi Obayashi; Yasunobu Okamura; Satoshi Ito; Shu Tadaka; Ikuko N Motoike; Kengo Kinoshita
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

10.  The importance of bottlenecks in protein networks: correlation with gene essentiality and expression dynamics.

Authors:  Haiyuan Yu; Philip M Kim; Emmett Sprecher; Valery Trifonov; Mark Gerstein
Journal:  PLoS Comput Biol       Date:  2007-02-14       Impact factor: 4.475

View more
  2 in total

1.  In silico prediction and segregation analysis of putative virus defense genes based on SSR markers in sweet potato F1 progenies of cultivars 'New Kawogo' and 'Resisto'.

Authors:  Alexander Ssamula; Anthony Okiror; Liat Avrahami-Moyal; Yehudit Tam; Amit Gal-On; Victor Gaba; Settumba B Mukasa; Peter Wasswa
Journal:  Afr J Biotechnol       Date:  2019-04-17

2.  Cisplatin treatment of testicular cancer patients introduces long-term changes in the epigenome.

Authors:  Cecilie Bucher-Johannessen; Christian M Page; Trine B Haugen; Marcin W Wojewodzic; Sophie D Fosså; Tom Grotmol; Hege S Haugnes; Trine B Rounge
Journal:  Clin Epigenetics       Date:  2019-12-03       Impact factor: 6.551

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.