Literature DB >> 25979476

CoRegNet: reconstruction and integrated analysis of co-regulatory networks.

Rémy Nicolle1, François Radvanyi2, Mohamed Elati3.   

Abstract

UNLABELLED: CoRegNet is an R/Bioconductor package to analyze large-scale transcriptomic data by highlighting sets of co-regulators. Based on a transcriptomic dataset, CoRegNet can be used to: reconstruct a large-scale co-regulatory network, integrate regulation evidences such as transcription factor binding sites and ChIP data, estimate sample-specific regulator activity, identify cooperative transcription factors and analyze the sample-specific combinations of active regulators through an interactive visualization tool. In this study CoRegNet was used to identify driver regulators of bladder cancer. AVAILABILITY: CoRegNet is available at http://bioconductor.org/packages/CoRegNet CONTACT: remy.nicolle@issb.genopole.fr or mohamed.elati@issb.genopole.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25979476      PMCID: PMC4565029          DOI: 10.1093/bioinformatics/btv305

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Recent advances in genomics enabled the profiling of thousands of tumors by large consortia and individual laboratories. While the amount of data holds great promise for our understanding of tumorigenesis, these datasets necessitate efficient methodologies to extract valuable knowledge from them. Transcriptomics is the most commonly available type of tumor large-scale data. The transcriptome reflects the genetic, epigenetic and environmental states of a tumor tissue and determines a great extent the phenotype of cells. Therefore, one of the first step towards the construction of a mechanistic model underlying cancer is the identification of the sets of transcription factors (TF) that actively maintain a malignant phenotype. This requires methodologies to model the tissue specificity of gene regulation by inferring trustful context-specific networks. More importantly, these models must take into account the complexity of mammalian gene regulation often involving the coordinated action of several TF (Panne, 2008). To identify tumor-driving active regulatory circuits, we propose a Bioconductor (Huber ) package named CoRegNet to (i) reconstruct a large-scale co-regulatory network from gene expression data and by integrating additional regulatory evidences such as TF Binding site and ChIP data, (ii) estimate the activity of each TF of the network in any given sample, (iii) predict sets of cooperative TF and (iv) identify sample-specific combination of active and driver TF using an interactive visualization tool integrating genomic aberrations. The proposed methods can be used as independent modules with alternative inputs such as networks inferred by other methods, experimentally defined networks or a different transcriptomic data for TF activity prediction and visualization (e.g. cell lines of same tissue). The following sections outline the functions of the CoRegNet package and its application to the characterization of the driver regulators of bladder cancer subtypes.

2 The CoRegNet application

Each following sections are detailed in the Supplementary Information file and the workflow is illustrated in Figure 1.
Fig. 1.

Analysis using the CoRegNet package. (a) A set of methods can be used to construct a network of cooperative TF from transcriptomic data using the h-Licorn algorithm and by integrating regulatory evidences. (b) A Shiny/Cytoscape web application is available to visually analyze the network and the datasets. (c) A dynamic heatmap shows the influence of all or only a selection of TF in all samples. (d) The view of the co-regulation network reflects the activity of each TF in the selected samples or sample subtype. (e) Copy number aberration (CNA) of TF can be integrated and will first display as a pie graph showing the proportion of each alteration status in either all samples or in the selected subtype. The selection of a single TF will display a multi-layer heatmap to visualize the relationship between sample phenotype and TF expression, activity and CNA

Analysis using the CoRegNet package. (a) A set of methods can be used to construct a network of cooperative TF from transcriptomic data using the h-Licorn algorithm and by integrating regulatory evidences. (b) A Shiny/Cytoscape web application is available to visually analyze the network and the datasets. (c) A dynamic heatmap shows the influence of all or only a selection of TF in all samples. (d) The view of the co-regulation network reflects the activity of each TF in the selected samples or sample subtype. (e) Copy number aberration (CNA) of TF can be integrated and will first display as a pie graph showing the proportion of each alteration status in either all samples or in the selected subtype. The selection of a single TF will display a multi-layer heatmap to visualize the relationship between sample phenotype and TF expression, activity and CNA

2.1 Regulatory network inference and refinement

To reconstruct a large-scale regulatory network from gene expression data, the CoRegNet package implements the h-Licorn algorithm (Chebil ; Elati ). Compared to current methods, h-Licorn focuses on the identification of cooperative regulators of genes. It was proven to have comparable TF-gene pairs prediction performance with state of the art methods in synthetic and Human datasets (Chebil ) and to retrieve more plausible cooperative TF pairs in yeast (Elati , Lai ) and Human datasets (see Supplementary Information). To obtain a trustful network, regulatory and co-regulatory interactions can be integrated as additional evidences. These can include any type of TF–gene interaction data to support regulatory interactions (e.g. ChIP-seq, TF binding site) and TF–TF interaction data to support cooperative interactions (e.g. protein interactions). The inferred network can then be refined by selecting a subset of the network that is enriched in these external regulatory and co-regulatory evidences using an integrative selection algorithm (Marbach ).

2.2 Transcription factor influence

CoRegNet implements a function to estimate the activity of a TF in a given sample. This is based on a measure of transcriptional influence, which was shown to provide a transformed view of the transcriptome in which classification algorithms are more robust (Nicolle ). Based on a comparison of the expression of the activated and the repressed targets of a regulator, the influence is computed in a sample-specific manner (details in Supplementary Information). Robustness of the measure was tested by correlating for each TF the influence using the original network and using a partially permuted version of the network with increasing levels of noise. The same was done by correlating the influence on the sub-parts of the network that are validated by regulatory evidences. In all comparisons, the influence was significantly more robust and consistent with the validated network (e.g. 97% correlation with 20% of noise, 96% for the ChIP validated network) than the other measures of TF activity that were tested (e.g. 75% with 20% of noise and 72% ChIP validation for network component analysis, details and reference in Supplementary Information). Furthermore, the influence predicts well the activity of a TF in samples in which it was activated at the protein level by a chemical agonist (details in Supplementary Information).

2.3 Constructing a network of cooperative TF

To model the active transcriptional programs, a co-regulation network is built by setting an edge between two significantly cooperative TF (details in Supplementary Information). The relevance of using the h-Licorn algorithm to directly infer a cooperative network is shown by the higher enrichment of the predicted co-regulators in experimentally validated and independently predicted protein interactions (AUPR: 14% for CoRegNet, 6% max. among the four tested algorithms, see Supplementary Information).

2.4 Integrative visualization of transcriptional activity

Transcriptomes are summarized by the influence of the regulators on the expression of their target genes in the analyzed samples. This abstraction of the transcriptomes through TF activity reduces the number of features thereby simplifying the visualization of the dataset. Moreover, the co-regulation network unravels the combination of TF at work in the studied samples. Thus, an interactive visualization tool is embedded in the CoRegNet package to analyze several layers of information through the sets of active co-regulators. The co-regulation network is accessible through a Cytoscape (Shannon ) widget with functionalities to display sets of active co-regulators in particular samples or subtypes (examples in Fig. 1 and in Supplementary Information).

2.5 Implementation and availability

CoRegNet is an R package implementing a Shiny (Winston ) and Cytoscape javascript applet for visualization. The network inference method is implemented in C and can be parallelized. The package is available as a Supplementary File or through the Bioconductor repository.

3 Case study

The CoRegNet package was used to analyze a set of bladder cancer samples for which both transcriptome and genomic alterations were available (Biton , Rebouissou ). The inferred network was used to estimate the influence of regulators and the visualization tool to identify active sets of master regulators for each bladder cancer subtype. In line with previous studies (Biton ; Choi ), PPARγ was found to be the most active TF in the luminal bladder cancer TCGA subgroup I. Association with genomic alteration suggesting PPARγ to be a major driver of these tumors (see Supplementary Information), which has been recently validated experimentally (Biton ). Moreover, FOXA1, an effector of PPARγ (Varley ) and a co-factor of GATA3 in luminal breast cancer (Kong ), is a significant co-regulator of both GATA3 and PPARγ in the inferred network.

Funding

This work is supported by the INCa (French National Cancer Institute) through the INCa_2960 [PLBIO10] project and the European Union/Framework Programme 7/2009 (“SYSCILIA” consortium, grant 241955). R.N. was supported by a fellowship from the French Ministry of Education and Research. Funding for open access charge: SYSCILIA. Conflict of Interest: none declared.
  12 in total

1.  LICORN: learning cooperative regulation networks from gene expression data.

Authors:  Mohamed Elati; Pierre Neuvial; Monique Bolotin-Fukuhara; Emmanuel Barillot; François Radvanyi; Céline Rouveirol
Journal:  Bioinformatics       Date:  2007-08-24       Impact factor: 6.937

2.  Independent component analysis uncovers the landscape of the bladder tumor transcriptome and reveals insights into luminal and basal subtypes.

Authors:  Anne Biton; Isabelle Bernard-Pierrot; Yinjun Lou; Clémentine Krucker; Elodie Chapeaublanc; Carlota Rubio-Pérez; Nuria López-Bigas; Aurélie Kamoun; Yann Neuzillet; Pierre Gestraud; Luca Grieco; Sandra Rebouissou; Aurélien de Reyniès; Simone Benhamou; Thierry Lebret; Jennifer Southgate; Emmanuel Barillot; Yves Allory; Andrei Zinovyev; François Radvanyi
Journal:  Cell Rep       Date:  2014-11-13       Impact factor: 9.423

3.  Hybrid method inference for the construction of cooperative regulatory network in human.

Authors:  I Chebil; R Nicolle; G Santini; C Rouveirol; M Elati
Journal:  IEEE Trans Nanobioscience       Date:  2014-04-22       Impact factor: 2.935

4.  Identification of distinct basal and luminal subtypes of muscle-invasive bladder cancer with different sensitivities to frontline chemotherapy.

Authors:  Woonyoung Choi; Sima Porten; Seungchan Kim; Daniel Willis; Elizabeth R Plimack; Jean Hoffman-Censits; Beat Roth; Tiewei Cheng; Mai Tran; I-Ling Lee; Jonathan Melquist; Jolanta Bondaruk; Tadeusz Majewski; Shizhen Zhang; Shanna Pretzsch; Keith Baggerly; Arlene Siefker-Radtke; Bogdan Czerniak; Colin P N Dinney; David J McConkey
Journal:  Cancer Cell       Date:  2014-02-10       Impact factor: 31.743

Review 5.  The enhanceosome.

Authors:  Daniel Panne
Journal:  Curr Opin Struct Biol       Date:  2008-02-21       Impact factor: 6.809

6.  FOXA1 and IRF-1 intermediary transcriptional regulators of PPARgamma-induced urothelial cytodifferentiation.

Authors:  C L Varley; E J Bacon; J C Holder; J Southgate
Journal:  Cell Death Differ       Date:  2008-08-08       Impact factor: 15.828

7.  EGFR as a potential therapeutic target for a subset of muscle-invasive bladder cancers presenting a basal-like phenotype.

Authors:  Sandra Rebouissou; Isabelle Bernard-Pierrot; Aurélien de Reyniès; May-Linda Lepage; Clémentine Krucker; Elodie Chapeaublanc; Aurélie Hérault; Aurélie Kamoun; Aurélie Caillault; Eric Letouzé; Nabila Elarouci; Yann Neuzillet; Yves Denoux; Vincent Molinié; Dimitri Vordos; Agnès Laplanche; Pascale Maillé; Pascale Soyeux; Karina Ofualuka; Fabien Reyal; Anne Biton; Mathilde Sibony; Xavier Paoletti; Jennifer Southgate; Simone Benhamou; Thierry Lebret; Yves Allory; François Radvanyi
Journal:  Sci Transl Med       Date:  2014-07-09       Impact factor: 17.956

8.  Predictive regulatory models in Drosophila melanogaster by integrative inference of transcriptional networks.

Authors:  Daniel Marbach; Sushmita Roy; Ferhat Ay; Patrick E Meyer; Rogerio Candeias; Tamer Kahveci; Christopher A Bristow; Manolis Kellis
Journal:  Genome Res       Date:  2012-03-28       Impact factor: 9.043

9.  Cellular reprogramming by the conjoint action of ERα, FOXA1, and GATA3 to a ligand-inducible growth state.

Authors:  Say Li Kong; Guoliang Li; Siang Lin Loh; Wing-Kin Sung; Edison T Liu
Journal:  Mol Syst Biol       Date:  2011-08-30       Impact factor: 11.429

10.  A comprehensive performance evaluation on the prediction results of existing cooperative transcription factors identification algorithms.

Authors:  Fu-Jou Lai; Hong-Tsun Chang; Yueh-Min Huang; Wei-Sheng Wu
Journal:  BMC Syst Biol       Date:  2014-12-08
View more
  22 in total

1.  Transcriptomic analysis of purified human cortical microglia reveals age-associated changes.

Authors:  Thais F Galatro; Inge R Holtman; Antonio M Lerario; Ilia D Vainchtein; Nieske Brouwer; Paula R Sola; Mariana M Veras; Tulio F Pereira; Renata E P Leite; Thomas Möller; Paul D Wes; Mari C Sogayar; Jon D Laman; Wilfred den Dunnen; Carlos A Pasqualucci; Sueli M Oba-Shinjo; Erik W G M Boddeke; Suely K N Marie; Bart J L Eggen
Journal:  Nat Neurosci       Date:  2017-07-03       Impact factor: 24.884

2.  The status of causality in biological databases: data resources and data retrieval possibilities to support logical modeling.

Authors:  Vasundra Touré; Åsmund Flobak; Anna Niarakis; Steven Vercruysse; Martin Kuiper
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

3.  Integrating transcriptional activity in genome-scale models of metabolism.

Authors:  Daniel Trejo Banos; Pauline Trébulle; Mohamed Elati
Journal:  BMC Syst Biol       Date:  2017-12-21

4.  Time-course transcriptome analysis of human cellular reprogramming from multiple cell types reveals the drastic change occurs between the mid phase and the late phase.

Authors:  Akihiro Kuno; Ken Nishimura; Satoru Takahashi
Journal:  BMC Genomics       Date:  2018-01-03       Impact factor: 3.969

5.  Inference and interrogation of a coregulatory network in the context of lipid accumulation in Yarrowia lipolytica.

Authors:  Pauline Trébulle; Jean-Marc Nicaud; Christophe Leplat; Mohamed Elati
Journal:  NPJ Syst Biol Appl       Date:  2017-08-11

6.  Identification of potential gene targets in systemic vasculitis using DNA microarray analysis.

Authors:  Yiwen Feng; Miao Zheng; Shujie Gan; Lei Zhang; Zhong Wan; Yanping Zhang; Qin Qian; Jingdong Tang
Journal:  Mol Med Rep       Date:  2017-04-11       Impact factor: 2.952

7.  Latent network-based representations for large-scale gene expression data analysis.

Authors:  Wajdi Dhifli; Julia Puig; Aurélien Dispot; Mohamed Elati
Journal:  BMC Bioinformatics       Date:  2019-02-04       Impact factor: 3.169

8.  A model for gene deregulation detection using expression data.

Authors:  Thomas Picchetti; Julien Chiquet; Mohamed Elati; Pierre Neuvial; Rémy Nicolle; Etienne Birmelé
Journal:  BMC Syst Biol       Date:  2015-12-09

9.  Constraints on signaling network logic reveal functional subgraphs on Multiple Myeloma OMIC data.

Authors:  Bertrand Miannay; Stéphane Minvielle; Florence Magrangeas; Carito Guziolowski
Journal:  BMC Syst Biol       Date:  2018-03-21

10.  Reprogramming of regulatory network using expression uncovers sex-specific gene regulation in Drosophila.

Authors:  Yijie Wang; Dong-Yeon Cho; Hangnoh Lee; Justin Fear; Brian Oliver; Teresa M Przytycka
Journal:  Nat Commun       Date:  2018-10-03       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.