Literature DB >> 22693210

Inferring the regulatory network behind a gene expression experiment.

Marta Bleda1, Ignacio Medina, Roberto Alonso, Alejandro De Maria, Francisco Salavert, Joaquin Dopazo.   

Abstract

Transcription factors (TFs) and miRNAs are the most important dynamic regulators in the control of gene expression in multicellular organisms. These regulatory elements play crucial roles in development, cell cycling and cell signaling, and they have also been associated with many diseases. The Regulatory Network Analysis Tool (RENATO) web server makes the exploration of regulatory networks easy, enabling a better understanding of functional modularity and network integrity under specific perturbations. RENATO is suitable for the analysis of the result of expression profiling experiments. The program analyses lists of genes and search for the regulators compatible with its activation or deactivation. Tests of single enrichment or gene set enrichment allow the selection of the subset of TFs or miRNAs significantly involved in the regulation of the query genes. RENATO also offers an interactive advanced graphical interface that allows exploring the regulatory network found.RENATO is available at: http://renato.bioinfo.cipf.es/.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22693210      PMCID: PMC3394273          DOI: 10.1093/nar/gks573

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Understanding the regulatory mechanisms that explain a gene expression profiling experiment remains a difficult task yet. Transcription factors (TFs) and miRNAs play a crucial role in the dynamic regulation of the network of gene expression (1). Alterations in these elements have been extensively related with human malignancies, including development diseases (2) and cancer (3,4). Here, we present Regulatory Network Analysis Tool (RENATO), a network-based analysis web tool, for the interpretation and visualization of transcriptional regulatory information. RENATO contains information on regulatory elements in genes, such as transcription factor binding sites and miRNA complementarity targets. RENATO has been designed to identify common regulatory elements in a list of genes. It maps such genes to the regulatory network, extracts the corresponding regulatory connections and evaluates each regulator for significant over-representation (5) of targets in the list. Ranked gene lists can also be analysed with RENATO by using a strategy similar to the gene set enrichment analysis (GSEA) (6). Previous knowledge of the relationship between diseases and the deregulation of these elements is also included. To make these resources easily accessible, we have also implemented a set of RESTful application programming interface web services, where regulatory information can be easily retrieved.

IMPLEMENTATION

RENATO inputs a list of gene identifiers, typically (but not restricted to) differentially expressed genes in transcriptomic profiling experiments (microarray or RNA-seq). The output consists of a list of regulatory elements of the genes in the list along with the corresponding P value of enrichment. Results are represented graphically through an interactive user-friendly web interface that relates regulatory elements to their target genes. Figure 1 shows a schematic representation of the application. A RESTful web service queries the database containing information on the regulatory elements contained in the input file. Then a test is conducted that returns the TFs or miRNAs corresponding to the enriched regulatory elements. The web application displays the results. The interface is based in modern HTML5 technologies and allows the user to explore the regulatory network found as well as executing several operations, including different changes in the format (color, shape, size, labels, etc.), searching elements, filtering genes and performing some simple exploratory analyses that include adding new genes to the network, including the network in the context of Reactome (7), upload a file with gene attributes that can further be used for filtering purposes and uploading gene expression values that can be also used for filtering or color-labeling the genes.
Figure 1.

Schematic representation of the internal organization of Regulatory Network Analysis Tool and the way in which queries are performed, the RESTful Web Service communicates the user query with the database and then the response with the web interface. At the bottom are examples of web service calls.

Schematic representation of the internal organization of Regulatory Network Analysis Tool and the way in which queries are performed, the RESTful Web Service communicates the user query with the database and then the response with the web interface. At the bottom are examples of web service calls. Depending on the data available, two types of tests are possible. If the list of genes represents a set of differentially expressed genes, a single enrichment analysis of regulatory elements over-representation is carried out (5). A conventional Fisher exact test is used to calculate the enrichment. If the input data are list of genes ranked by a value that accounts for differential expression (e.g. the statistic used) then a variant of the GSEA (6) is carried out. This last approach is much more sensitive than the conventional enrichment analysis. As many TFs are simultaneously checked for enrichment, multiple testing effect is corrected by the popular false discovery rate method (8). It is worth mentioning that the results are conceptually different in the case of TF and miRNAs, given that the former are positive regulators and the latter are negative regulators. Thus, if we are analysing genes activated, then the results on TF make reference to the TFs active, whereas the results on miRNA would make reference to miRNAs inactive. The web server can maintain sessions and save data and results obtained for new sessions.

Databases and motif finding

The information on transcription factor binding sites (TFBSs) and miRNA target sites has been collected from different databases. Information on TFBSs was taken from Ensembl (Release 64, GRCh37) (9) through the corresponding application programming interface. In the case of miRNAs, we have used miRBase (release 18, November 2011) (10), the microrna.org (Release August 2010), the TargetScan (11) predictions, as well as more curated information from miRecords (release 3, November 2010) (12) and miRTarBase (release 2.5, October 15, 2011) (13). Information of diseases has been taken from The human miRNA disease database (release November 2011) (3) and Phenomir (14). At present, information for human and mouse is available. In coming versions, more species will be added. MiRNA targets are predicted by searching for matches with the miRNA seed regions (11). In the case of TF, the use of a position-specific weight matrix (15) that accounts for the DNA binding preferences of the TF, is used for detection of TFBSs. These binding sites are mapped into the promoter region of every gene, establishing the connections between TFs and genes.

Other tools

Many single enrichment tools include enrichment analysis of TFBSs, such as the FatiGO+ (16), DAVID (17) and other (18). More recently, some tools arise specifically focused on the analysis of regulatory motifs in the genes of a list. Thus, tools like Pscan (19), TransFind (20), oPOSSUM (21) or CARRIE (22) carry out different variants of enrichment analysis and report lists of over-represented TFBSs alone or in combination (23), present in the set of genes analysed. Similar applications have been published for miRNAs, like MMIA (24), GeneSet2miRNA (25), miRvestigator (26), including the analysis of ensembles of miRNAs, such as MiRror (27). A more sophisticated tool, the mirConnX (28), combines miRNA and TFs and also provides a graphical interface based on Cytoscape web (29), which is a Flash application that depends on the corresponding plug-in. RENATO, in addition to single enrichment analysis, offers the GSEA, which is known to be still more sensitive in finding enrichments (6,18,30). An advanced graphical interface is also provided, based on modern and efficient HTML5, with much more options (e.g., information on the genes and different layouts) than other interfaces and much more optimized than a Flash application.

RESULTS

An example with Fanconi anemia

We illustrate this with an example of Fanconi anemia (FA), a disease in which signaling is relevant, FA is a rare chromosome instability syndrome characterized by aplastic anemia and cancer and leukemia susceptibility (31). A recent study uses gene expression microarrays to identify differences at the transcription level between bone marrow cells from normal volunteers and from children and adults with FA (32). Eleven normal volunteers and 21 patients were studied. Gene expression datasets for FA were retrieved from the GEO database. Differential expression analysis was carried out using the Babelomics suite (33). RENATO has been used to study the set of regulatory elements compatible with the observed gene deregulation. Figure 2 shows the regulatory network that justifies the observed increase in the gene activity caused by the disease. Six TFs (E2F1, Gabp, Yy1, Nfya, Egr1, Cmyc) account for activation observed in the disease (Figure 2A). Some of these TFs have already been linked to FA (34). Similarly, the miRNAs that must be switched off to produce the observed gene expression levels are listed in Figure 2B, along with the regulatory miRNA network. Some experiments of inhibition of DNA repair link FANCG gene to several miRNAs, among which some of the detected in this analysis are included (hsa-miR-30a-5p) (35).
Figure 2.

An example with Fanconi anemia. Significantly expressed genes in a comparison of Fanconi anemia patients with healthy controls have been analysed by enrichment analysis. (A) Transcription factors with an adjusted P < 0.05 have been represented in the network (in red), along with the target genes regulated by them. (B) miRNAs with an adjusted P < 0.05 have been represented in the network. As explained in the text, the transcription factors are compatible with the genes activated, whereas the miRNAs should be downregulated to be compatible with the genes activated.

An example with Fanconi anemia. Significantly expressed genes in a comparison of Fanconi anemia patients with healthy controls have been analysed by enrichment analysis. (A) Transcription factors with an adjusted P < 0.05 have been represented in the network (in red), along with the target genes regulated by them. (B) miRNAs with an adjusted P < 0.05 have been represented in the network. As explained in the text, the transcription factors are compatible with the genes activated, whereas the miRNAs should be downregulated to be compatible with the genes activated.

DISCUSSION

RENATO is a convenient interactive web server for the analysis gene expression profiling experiment that finds the regulators (TF or miRNA) that significantly account for the gene activity observed. The program also implements a highly efficient interactive interface, based on HTML5, with many options, and able to offer much information of the genes and regulators involved on the experiment analysed.

FUNDING

Spanish Ministry of Science and Innovation (MICINN) [BIO2011-27069]; Conselleria de Educacio of the Valencian Community [PROMETEO/2010/001]; Red Tematica de Investigacion Cooperativa en Cancer (RTICC), ISCIII [RD06/0020/1019, partial]. Conflict of interest statement. None declared.
  34 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

2.  CARRIE web service: automated transcriptional regulatory network inference and interactive analysis.

Authors:  Peter M Haverty; Martin C Frith; Zhiping Weng
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

3.  FatiGO: a web tool for finding significant associations of Gene Ontology terms with groups of genes.

Authors:  Fátima Al-Shahrour; Ramón Díaz-Uriarte; Joaquín Dopazo
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

4.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

Review 5.  Online Mendelian Inheritance in Man (OMIM) as a knowledgebase for human developmental disorders.

Authors:  S A Boyadjiev; E W Jabs
Journal:  Clin Genet       Date:  2000-04       Impact factor: 4.438

Review 6.  Molecular pathogenesis of Fanconi anemia: recent progress.

Authors:  Toshiyasu Taniguchi; Alan D D'Andrea
Journal:  Blood       Date:  2006-02-21       Impact factor: 22.113

7.  miRvestigator: web application to identify miRNAs responsible for co-regulated gene expression patterns discovered through transcriptome profiling.

Authors:  Christopher L Plaisier; J Christopher Bare; Nitin S Baliga
Journal:  Nucleic Acids Res       Date:  2011-05-20       Impact factor: 16.971

8.  Ensembl 2012.

Authors:  Paul Flicek; M Ridwan Amode; Daniel Barrell; Kathryn Beal; Simon Brent; Denise Carvalho-Silva; Peter Clapham; Guy Coates; Susan Fairley; Stephen Fitzgerald; Laurent Gil; Leo Gordon; Maurice Hendrix; Thibaut Hourlier; Nathan Johnson; Andreas K Kähäri; Damian Keefe; Stephen Keenan; Rhoda Kinsella; Monika Komorowska; Gautier Koscielny; Eugene Kulesha; Pontus Larsson; Ian Longden; William McLaren; Matthieu Muffato; Bert Overduin; Miguel Pignatelli; Bethan Pritchard; Harpreet Singh Riat; Graham R S Ritchie; Magali Ruffier; Michael Schuster; Daniel Sobral; Y Amy Tang; Kieron Taylor; Stephen Trevanion; Jana Vandrovcova; Simon White; Mark Wilson; Steven P Wilder; Bronwen L Aken; Ewan Birney; Fiona Cunningham; Ian Dunham; Richard Durbin; Xosé M Fernández-Suarez; Jennifer Harrow; Javier Herrero; Tim J P Hubbard; Anne Parker; Glenn Proctor; Giulietta Spudich; Jan Vogel; Andy Yates; Amonida Zadissa; Stephen M J Searle
Journal:  Nucleic Acids Res       Date:  2011-11-15       Impact factor: 16.971

9.  miRBase: microRNA sequences, targets and gene nomenclature.

Authors:  Sam Griffiths-Jones; Russell J Grocock; Stijn van Dongen; Alex Bateman; Anton J Enright
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  Structural and functional properties of genes involved in human cancer.

Authors:  Simon J Furney; Desmond G Higgins; Christos A Ouzounis; Núria López-Bigas
Journal:  BMC Genomics       Date:  2006-01-11       Impact factor: 3.969

View more
  8 in total

1.  PTHGRN: unraveling post-translational hierarchical gene regulatory networks using PPI, ChIP-seq and gene expression data.

Authors:  Daogang Guan; Jiaofang Shao; Zhongying Zhao; Panwen Wang; Jing Qin; Youping Deng; Kenneth R Boheler; Junwen Wang; Bin Yan
Journal:  Nucleic Acids Res       Date:  2014-05-29       Impact factor: 16.971

2.  Babelomics 5.0: functional interpretation for new generations of genomic data.

Authors:  Roberto Alonso; Francisco Salavert; Francisco Garcia-Garcia; Jose Carbonell-Caballero; Marta Bleda; Luz Garcia-Alonso; Alba Sanchis-Juan; Daniel Perez-Gil; Pablo Marin-Garcia; Ruben Sanchez; Cankut Cubuk; Marta R Hidalgo; Alicia Amadoz; Rosa D Hernansaiz-Ballesteros; Alejandro Alemán; Joaquin Tarraga; David Montaner; Ignacio Medina; Joaquin Dopazo
Journal:  Nucleic Acids Res       Date:  2015-04-20       Impact factor: 16.971

3.  A role for SUV39H1-mediated H3K9 trimethylation in the control of genome stability and senescence in WI38 human diploid lung fibroblasts.

Authors:  Corinne Sidler; Rafal Woycicki; Dongping Li; Bo Wang; Igor Kovalchuk; Olga Kovalchuk
Journal:  Aging (Albany NY)       Date:  2014-07       Impact factor: 5.682

4.  Web-based network analysis and visualization using CellMaps.

Authors:  Francisco Salavert; Luz García-Alonso; Rubén Sánchez; Roberto Alonso; Marta Bleda; Ignacio Medina; Joaquín Dopazo
Journal:  Bioinformatics       Date:  2016-06-13       Impact factor: 6.937

5.  The pan-cancer pathological regulatory landscape.

Authors:  Matias M Falco; Marta Bleda; José Carbonell-Caballero; Joaquín Dopazo
Journal:  Sci Rep       Date:  2016-12-21       Impact factor: 4.379

6.  Reconstructing regulatory networks from the dynamic plasticity of gene expression by mutual information.

Authors:  Jianxin Wang; Bo Chen; Yaqun Wang; Ningtao Wang; Marc Garbey; Roger Tran-Son-Tay; Scott A Berceli; Rongling Wu
Journal:  Nucleic Acids Res       Date:  2013-03-06       Impact factor: 16.971

7.  From System-Wide Differential Gene Expression to Perturbed Regulatory Factors: A Combinatorial Approach.

Authors:  Gaurang Mahajan; Shekhar C Mande
Journal:  PLoS One       Date:  2015-11-12       Impact factor: 3.240

Review 8.  A comparison of mechanistic signaling pathway activity analysis methods.

Authors:  Alicia Amadoz; Marta R Hidalgo; Cankut Çubuk; José Carbonell-Caballero; Joaquín Dopazo
Journal:  Brief Bioinform       Date:  2019-09-27       Impact factor: 11.622

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.