Literature DB >> 21576238

ResponseNet: revealing signaling and regulatory networks linking genetic and transcriptomic screening data.

Alex Lan1, Ilan Y Smoly, Guy Rapaport, Susan Lindquist, Ernest Fraenkel, Esti Yeger-Lotem.   

Abstract

Cellular response to stimuli is typically complex and involves both regulatory and metabolic processes. Large-scale experimental efforts to identify components of these processes often comprise of genetic screening and transcriptomic profiling assays. We previously established that in yeast genetic screens tend to identify response regulators, while transcriptomic profiling assays tend to identify components of metabolic processes. ResponseNet is a network-optimization approach that integrates the results from these assays with data of known molecular interactions. Specifically, ResponseNet identifies a high-probability sub-network, composed of signaling and regulatory molecular interaction paths, through which putative response regulators may lead to the measured transcriptomic changes. Computationally, this is achieved by formulating a minimum-cost flow optimization problem and solving it efficiently using linear programming tools. The ResponseNet web server offers a simple interface for applying ResponseNet. Users can upload weighted lists of proteins and genes and obtain a sparse, weighted, molecular interaction sub-network connecting their data. The predicted sub-network and its gene ontology enrichment analysis are presented graphically or as text. Consequently, the ResponseNet web server enables researchers that were previously limited to separate analysis of their distinct, large-scale experiments, to meaningfully integrate their data and substantially expand their understanding of the underlying cellular response. ResponseNet is available at http://bioinfo.bgu.ac.il/respnet.

Entities:  

Mesh:

Year:  2011        PMID: 21576238      PMCID: PMC3125767          DOI: 10.1093/nar/gkr359

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Cells live in a dynamic environment in which they encounter environmental changes, toxins and mutations. Cellular response to these stimuli is typically complex and involves regulatory, proteomic and metabolic changes. Identification and comprehensive understanding of these changes is very important: it may reveal the mechanism of the stimulus, such as the mode of action of a potent drug [e.g. (1)], and identify proteins and processes that can be used to manipulate the cellular response [e.g. (2,3)], potentially opening new avenues for therapeutic intervention (4). In an effort to accomplish these goals, large-scale genetic screens and transcriptomic profiling assays are applied extensively in current research [e.g. (5–7)]. Genetic screens identify proteins that when mutated significantly alter the phenotype of stimulated cells. In the past few years, genetic screens using gene inactivation or overexpression libraries identified scores of proteins whose association with specific stimuli were previously unknown [e.g. (6)]. However, while providing strong evidence that these proteins were functionally related to the underlying cellular processes, these relationships were often indirect and hard to decipher, leaving many of the identified associations unexplained. A different approach to illuminate the cellular response is provided by transcriptomic profiling assays. Using microarrays or RNA-Seq technologies, these assays reveal genes whose expression levels are significantly altered in stimulated cells. Transcriptomic profiling assays were extensively applied to identify the processes that are activated or inhibited in response to stimuli [e.g. (5,8,9)], yet, similarly to genetic screens, the series of events that led to the transcriptomic change often remained hidden. It had been previously noted in few instances that in the yeast Saccharomices cerevisiae, genetic screens do not identify the same genes as transcriptomic profiling assays conducted in the same conditions [e.g. (10–15)]. By systematically comparing the results of the two types of screens in over 150 distinct conditions, we recently showed that this discrepancy is, in fact, a general rule (16). Importantly, we found that genetic screens tend to identify response regulators, whereas transcriptomic profiling assays tend to identify metabolic responses. In order to make the most of the distinct data provided by genomic screens and transcriptomic profiling assays, we developed ResponseNet (16). ResponseNet is an integrative approach that uses known molecular interactions to bridge the gap between these data and reveal a broader image of the cellular response. Specifically, ResponseNet identifies signaling and regulatory pathways, by which the proteins detected by genetic screens may lead to the measured transcriptomic response (Figure 1). The novelty of ResponseNet lies in its formulation as a minimum-cost flow optimization problem that is solved efficiently using linear programming solvers (17). We applied ResponseNet to reveal cellular pathways that respond to the toxicity of α-synuclein, a small human protein implicated in several neurodegenerative disorders including Parkinson’s disease (16). We conducted a genetic screen of a well-established yeast model for α-synuclein toxicity and identified 77 proteins that when overexpressed altered α-synuclein toxicity. We also profiled the transcritpome of the yeast model and identified 612 differentially expressed genes (16,18). We then used ResponseNet to search the yeast molecular interaction network (interactome), which related over 11 000 proteins and genes via over 57 000 molecular interactions, for a high-probability sub-network connecting the identified proteins and genes. ResponseNet successfully mapped previously unknown as well as recognized pathways responding to α-synuclein toxicity. We experimentally validated four de novo predictions suggested by ResponseNet analysis, including the presence of nitrosative stress, the involvement of the TOR pathway, the disturbance of the sterol biosynthesis pathway and the mode-of-action of the genetic suppressor Gip2 in the response to α-synuclein toxicity (16).
Figure 1.

The signaling and regulatory sub-network, by which stimulus-related proteins detected by genetic screens may lead to the measured transcriptomic response. ResponseNet integrates the identified stimulus-related proteins and genes with known molecular interactions to find molecular interaction paths, through which a subset of the proteins may regulate the transcription of a subset of the genes. The regulation may be direct, e.g. when the stimulus-related protein is the transcriptional regulator of a stimulus-related gene, or indirect via intermediate proteins and transcriptional regulators. Stimulus-related proteins appear as orange nodes; stimulus-related genes appear as blue nodes; intermediary proteins appear as white nodes. Known protein–protein and protein–DNA interactions appear as gray edges. Transcriptional regulators appear as triangles.

The signaling and regulatory sub-network, by which stimulus-related proteins detected by genetic screens may lead to the measured transcriptomic response. ResponseNet integrates the identified stimulus-related proteins and genes with known molecular interactions to find molecular interaction paths, through which a subset of the proteins may regulate the transcription of a subset of the genes. The regulation may be direct, e.g. when the stimulus-related protein is the transcriptional regulator of a stimulus-related gene, or indirect via intermediate proteins and transcriptional regulators. Stimulus-related proteins appear as orange nodes; stimulus-related genes appear as blue nodes; intermediary proteins appear as white nodes. Known protein–protein and protein–DNA interactions appear as gray edges. Transcriptional regulators appear as triangles. In recent years, several other methodologies for identifying interactome pathways that bridge the gap between potential causal proteins and affected genes have been developed [e.g. (19–25)]. These methodologies were based on a variety of computational techniques including maximum likelihood (19), integer programming (21), Steiner trees (23,24), electric circuits (22,26) and Bayesian networks (2,27). Notably, many of these techniques were computationally intensive and therefore required the use of approximation schemes [e.g. (24)] or arbitrary constraints over the topology of the output network [e.g. (21,23)]. Unfortunately, this rich set of sophisticated techniques is not readily available for use. In fact, only in few cases downloadable code was provided (21,27). Here, we present the ResponseNet web server for predicting the signaling and regulatory network connecting stimulus-related proteins and genes. This web server accepts lists of proteins and genes from the user, and then searches within a given interactome for a high-probability sub-network connecting these proteins and genes. The web server presents the identified signaling and regulatory sub-network as a graphical network image in Cytoscape format (28). It also reports the gene ontology (GO) enrichment of the connected input lists and of the output sub-network, further illuminating the underlying cellular response to the stimulus. By default, ResponseNet analysis is carried in yeast using a recently complied version of the yeast interactome that is stored in our database. However, users may upload their own interactome data, opening the way to apply ResponseNet to analyze cellular responses in other organisms, including human. The ResponseNet web server is free and open to all users with no login requirement.

ResponseNet METHODOLOGY

Prediction of signaling and regulatory response pathways in the yeast S. cerevisiae is extremely challenging. For only a handful of stimuli, like the mating pheromone or high osmolarity, these pathways were fully characterized, and efforts to map the yeast signaling network are yet ongoing (29–31). Moreover, due to the vast number of known interactions, a search for all interaction paths connecting stimulus-related proteins to genes typically results in a ‘hairball’ sub-network that is very hard to interpret. To overcome these limitations and identify meaningful signaling and regulatory sub-networks, ResponseNet is designed as a network-optimization approach that uses a graphical model in which: (i) proteins and genes are represented as separate network nodes; (ii) a directed edge leads from a protein node to a gene node only if they correspond to a transcription factor and its target gene; and (iii) each network edge is associated with a probability that reflects its likelihood (16). We implemented ResponseNet as a ‘flow algorithm’, a well-known computational approach for efficiently finding connectivity in graphs (17). Flow algorithms deliver an abstract flow from a source node (S) to a sink node (T) through the edges of a network, which, similarly to pipes, are associated with a capacity that limits the flow and with a cost. (As a loose analogy, this resembles water finding the path of least resistance through a complex landscape.) Because S and T are the two endpoints for the flow, by linking S to the stimulus-related proteins and the stimulus-related genes to T, the flow is forced to find paths that connect the stimulus-related proteins and genes through protein–protein and protein–DNA interactions. To give preference to connecting paths of high probability, we formulated ResponseNet as a minimum-cost flow optimization problem (17). This formulation aims to maximize the flow between S and T while minimizing the cost of the connecting paths. Hence, by setting the cost of an edge to the negative log of its probability, a sparse, high-probability connecting sub-network is obtained. Importantly, the minimum-cost flow problem is solved efficiently using linear programming tools [LOQO (32)]. A typical optimal solution connects a subset of the stimulus-related proteins to a subset of the stimulus-related genes through known interactions and intermediary proteins; these interactions and proteins are weighted by the amount of flow they pass, thus illuminating core versus peripheral components of the response. A detailed description of ResponseNet formulation is given in Yeger-Lotem et al. (16). As described in Yeger-Lotem et al. (16), we tested ResponseNet on the well-characterized pheromone and DNA damage responses by applying it to proteins and genes identified by genetic and transcriptomic profiling assays, respectively. In both cases, ResponseNet revealed a connecting sub-network that was highly enriched for the relevant response pathway and exposed core pathway members. To thoroughly test ResponseNet despite the lack of known response pathways, we applied it to a set of 101 distinct, less-characterized stimuli, which corresponded to gene deletions. For each stimulus, we created a randomized solution set composed of 100 sub-networks that were obtained by applying ResponseNet to partially randomized inputs (either the stimulus-related proteins or the genes were randomly chosen, while all other inputs were fixed). We considered as success solutions that were significantly enriched for the stimulus, or ranked stimulus-related proteins significantly high, relative to the randomized solution set (P ≤ 0.05). Although in 86 of the 101 test cases screening data were very limited, ResponseNet displayed 41% sensitivity, 98% specificity and 30% precision. Moreover, when applied to α-synuclein toxicity screening data ResponseNet successfully revealed meaningful response pathways that we validated experimentally (16).

ResponseNet INPUT AND OUTPUT

The ResponseNet web server is designed for predicting stimulus-related signaling and regulatory pathways in the yeast S. cerevisiae. The basic input to ResponseNet includes a source set comprising a list of stimulus-related proteins, preferably identified through genetic screening, and a target set comprising a list of stimulus-related differentially expressed genes. An advanced form of input specifies each set as a weighted list, with weights being positive real numbers. Higher weights increase the probability that the corresponding proteins or genes will appear in the output network, and may reflect, e.g. the strength of the effect of the mutated protein or the fold-change in the gene’s expression level. If no weights are given, they are considered to be uniform. Upon submitting the data to the web server the user may choose to perform a GO enrichment analysis of the input sets and the output sub-network, to identify processes that are significantly associated with the cellular response. This analysis is carried using the GOrilla tool (33). The ResponseNet web server also offers three advanced features. By default, given the sets of proteins and genes, ResponseNet searches an up-to-date version of the yeast interactome that we compiled as described in Yeger-Lotem et al. (16). Using the advanced options the user can append or replace the interactome data: user-provided protein–protein interactions are considered bidirectional, unless specified otherwise by the user; user-provided protein–DNA interactions are directed from a protein to its target gene. Notably, by replacing the interactome data the user can apply the server to data of other organisms. The two other advanced features relate to the free parameters of the minimum-cost flow formulation: The capping value is an upper bound on the probability of each interaction, and therefore a lower bound on its cost; lowering the capping value reduces the bias toward extensively studied interactions and may lead to sparser solutions. The gamma value controls the connectivity of the solution, with increasing values enabling identification of lower probability connecting sub-networks. More information about the input options can be found in the ResponseNet tutorial that is stored on the server. The computation of the minimum-cost flow optimization problem typically takes a few minutes due to the need to solve a linear programming problem with thousands of variables. Once complete, the output is provided as a multi-panel window (Figure 2). The middle panel contains a graphical image of the identified signaling and regulatory sub-network that connects the input sets. This panel utilizes a Cytoscape plug-in that enables the user to manipulate the visualization of the sub-network (34). The left panel provides various statistics, such as the fraction of input proteins and genes that were actually connected by ResponseNet, and, if requested by the user, a link to the GO enrichment analysis of the input sets and the output sub-network. The right panel provides a link to a text version of the identified sub-network in a Cytoscape table format.
Figure 2.

The ResponseNet web server output. In this example the ResponseNet web server was applied to a protein set containing Ste2 and Ste3 and to a gene set containing Fus1 and Fus3 using default parameters. The middle panel presents the regulatory and signaling output sub-network identified by ResponseNet. The left panel presents a visual summary of the coverage of the output sub-network, as well as a link to the GO enrichment analysis of the connected input sets and the output sub-network. The right panel presents a link to a text version of the output sub-network in a Cytoscape-compatible format, and a legend for the sub-network appearing in the middle panel.

The ResponseNet web server output. In this example the ResponseNet web server was applied to a protein set containing Ste2 and Ste3 and to a gene set containing Fus1 and Fus3 using default parameters. The middle panel presents the regulatory and signaling output sub-network identified by ResponseNet. The left panel presents a visual summary of the coverage of the output sub-network, as well as a link to the GO enrichment analysis of the connected input sets and the output sub-network. The right panel presents a link to a text version of the output sub-network in a Cytoscape-compatible format, and a legend for the sub-network appearing in the middle panel. An example of ResponseNet output is shown in Figure 2. In this example, we used ResponseNet to identify the signaling and regulatory sub-network connecting the two pheromone receptor proteins, Ste2 and Ste3, to the genes Fus1 and Fus3 that are up-regulated during the pheromone response. As shown in the middle panel of Figure 2, ResponseNet indeed identified core components of the pheromone response signaling cascade. The connecting sub-network was most significantly enriched for ‘pheromone-dependent signal transduction involved in conjugation with cellular fusion’ (P-value of 9.72E-13). Notably, ResponseNet is applicable to much larger protein and gene input sets as detailed in (16).

DISCUSSION

ResponseNet is a unique web server that is designed to identify the signaling and regulatory sub-networks that connect stimulus-related proteins and genes. These stimulus-related data typically result from large-scale genetic screens and transcriptomic profiling assays. While each of these assays provides unprecedented genome-wide views into the cellular response, the function of the identified proteins in the response to the stimulus, and the signaling paths leading to transcriptomic changes, often remain hidden. By integrating these data with data of known molecular interactions, ResponseNet simultaneously illuminates the mode-of-action of some of the proteins and the signaling cascades that lead to the measured transcriptomic response. The ResponseNet algorithm is based on the minimum-cost flow optimization problem. This formulation is advantageous over other approaches for finding explanatory paths in molecular interaction networks, since it gives preference to high-confidence interactions, allows for a simultaneous solution for the input proteins and genes, puts no a priori bounds on the structure of the output sub-network, and is solved efficiently using linear programming tools. These features of ResponseNet, together with its verified success (16) substantiate the minimum-cost flow optimization as a powerful framework to illuminate cellular pathways from multidimensional, large-scale experimental data. Importantly, it is one of the few approaches that are freely available on the web and can be readily applied by users. While being designed for integrating the results of genetic screens and transcriptomic profiling assays in yeast, ResponseNet is a general framework for identifying explanatory paths underlying other types of input. For example, it can readily be used to analyze phosphoproteomic assays (23), to reveal the paths underlying eQTLs associations (22) or to identify disease-related explanatory paths in other organisms, including human (27). The ease of use of the ResponseNet web server and its enhanced functionality make it suitable for illuminating the pathways underlying a wide-range of cellular states in a variety of organisms. We believe that many researchers that are limited today to separate analysis of their large-scale experiments can take full advantage of their valuable data by analyzing them using this unique integrative web server.

FUNDING

European Union Seventh Programme under the FP7-PEOPLE-MCA-IRG Funding scheme, grant agreement n°[256360 to E.Y.-L.]; and the United States-Israel Binational Science Foundation (BSF) [2009323 to E.Y.-L. and S.L.]. Funding for open access charge: United States-Israel Binational Science Foundation (BSF) [2009323 to E.Y.-L. and S.L.]. Conflict of interest statement. None declared.
  32 in total

Review 1.  A network view of disease and compound screening.

Authors:  Eric E Schadt; Stephen H Friend; David A Shaywitz
Journal:  Nat Rev Drug Discov       Date:  2009-04       Impact factor: 84.694

2.  Integrating proteomic, transcriptional, and interactome data reveals hidden components of signaling and regulatory networks.

Authors:  Shao-Shan Carol Huang; Ernest Fraenkel
Journal:  Sci Signal       Date:  2009-07-28       Impact factor: 8.192

3.  The chemical genomic portrait of yeast: uncovering a phenotype for all genes.

Authors:  Maureen E Hillenmeyer; Eula Fung; Jan Wildenhain; Sarah E Pierce; Shawn Hoon; William Lee; Michael Proctor; Robert P St Onge; Mike Tyers; Daphne Koller; Russ B Altman; Ronald W Davis; Corey Nislow; Guri Giaever
Journal:  Science       Date:  2008-04-18       Impact factor: 47.728

4.  Integrating siRNA and protein-protein interaction data to identify an expanded insulin signaling network.

Authors:  Zhidong Tu; Carmen Argmann; Kenny K Wong; Lyndon J Mitnaul; Stephen Edwards; Iliana C Sach; Jun Zhu; Eric E Schadt
Journal:  Genome Res       Date:  2009-03-04       Impact factor: 9.043

5.  Toward accurate reconstruction of functional protein networks.

Authors:  Nir Yosef; Lior Ungar; Einat Zalckvar; Adi Kimchi; Martin Kupiec; Eytan Ruppin; Roded Sharan
Journal:  Mol Syst Biol       Date:  2009-03-17       Impact factor: 11.429

6.  Validation of candidate causal genes for obesity that affect shared metabolic pathways and networks.

Authors:  Xia Yang; Joshua L Deignan; Hongxiu Qi; Jun Zhu; Su Qian; Judy Zhong; Gevork Torosyan; Sana Majid; Brie Falkard; Robert R Kleinhanz; Jenny Karlsson; Lawrence W Castellani; Sheena Mumick; Kai Wang; Tao Xie; Michael Coon; Chunsheng Zhang; Daria Estrada-Smith; Charles R Farber; Susanna S Wang; Atila van Nas; Anatole Ghazalpour; Bin Zhang; Douglas J Macneil; John R Lamb; Katrina M Dipple; Marc L Reitman; Margarete Mehrabian; Pek Y Lum; Eric E Schadt; Aldons J Lusis; Thomas A Drake
Journal:  Nat Genet       Date:  2009-03-08       Impact factor: 38.330

7.  Functional organization of the S. cerevisiae phosphorylation network.

Authors:  Dorothea Fiedler; Hannes Braberg; Monika Mehta; Gal Chechik; Gerard Cagney; Paromita Mukherjee; Andrea C Silva; Michael Shales; Sean R Collins; Sake van Wageningen; Patrick Kemmeren; Frank C P Holstege; Jonathan S Weissman; Michael-Christopher Keogh; Daphne Koller; Kevan M Shokat; Nevan J Krogan
Journal:  Cell       Date:  2009-03-06       Impact factor: 41.582

8.  Bridging high-throughput genetic and transcriptional data reveals cellular responses to alpha-synuclein toxicity.

Authors:  Esti Yeger-Lotem; Laura Riva; Linhui Julie Su; Aaron D Gitler; Anil G Cashikar; Oliver D King; Pavan K Auluck; Melissa L Geddie; Julie S Valastyan; David R Karger; Susan Lindquist; Ernest Fraenkel
Journal:  Nat Genet       Date:  2009-02-22       Impact factor: 38.330

9.  Information flow analysis of interactome networks.

Authors:  Patrycja Vasilyev Missiuro; Kesheng Liu; Lihua Zou; Brian C Ross; Guoyan Zhao; Jun S Liu; Hui Ge
Journal:  PLoS Comput Biol       Date:  2009-04-10       Impact factor: 4.475

10.  eQED: an efficient method for interpreting eQTL associations using protein networks.

Authors:  Silpa Suthram; Andreas Beyer; Richard M Karp; Yonina Eldar; Trey Ideker
Journal:  Mol Syst Biol       Date:  2008-03-04       Impact factor: 11.429

View more
  47 in total

1.  PheNetic: network-based interpretation of molecular profiling data.

Authors:  Dries De Maeyer; Bram Weytjens; Joris Renkens; Luc De Raedt; Kathleen Marchal
Journal:  Nucleic Acids Res       Date:  2015-04-15       Impact factor: 16.971

2.  SAMNetWeb: identifying condition-specific networks linking signaling and transcription.

Authors:  Sara J C Gosline; Coyin Oh; Ernest Fraenkel
Journal:  Bioinformatics       Date:  2014-11-19       Impact factor: 6.937

Review 3.  Application of a systems approach to study developmental gene regulation.

Authors:  Joshua W K Ho
Journal:  Biophys Rev       Date:  2012-09-01

4.  Simultaneous reconstruction of multiple signaling pathways via the prize-collecting steiner forest problem.

Authors:  Nurcan Tuncbag; Alfredo Braunstein; Andrea Pagnani; Shao-Shan Carol Huang; Jennifer Chayes; Christian Borgs; Riccardo Zecchina; Ernest Fraenkel
Journal:  J Comput Biol       Date:  2013-02       Impact factor: 1.479

5.  Elucidating influenza inhibition pathways via network reconstruction.

Authors:  Arnon Mazza; Irit Gat-Viks; Roded Sharan
Journal:  J Comput Biol       Date:  2014-01-22       Impact factor: 1.479

Review 6.  Unbiased Screens for Modifiers of Alpha-Synuclein Toxicity.

Authors:  Matthias Höllerhage; Marc Bickle; Günter U Höglinger
Journal:  Curr Neurol Neurosci Rep       Date:  2019-02-09       Impact factor: 5.081

7.  Pathway and network analysis of cancer genomes.

Authors:  Pau Creixell; Jüri Reimand; Syed Haider; Guanming Wu; Tatsuhiro Shibata; Miguel Vazquez; Ville Mustonen; Abel Gonzalez-Perez; John Pearson; Chris Sander; Benjamin J Raphael; Debora S Marks; B F Francis Ouellette; Alfonso Valencia; Gary D Bader; Paul C Boutros; Joshua M Stuart; Rune Linding; Nuria Lopez-Bigas; Lincoln D Stein
Journal:  Nat Methods       Date:  2015-07       Impact factor: 28.547

Review 8.  Integrative approaches for finding modular structure in biological networks.

Authors:  Koyel Mitra; Anne-Ruxandra Carvunis; Sanath Kumar Ramesh; Trey Ideker
Journal:  Nat Rev Genet       Date:  2013-10       Impact factor: 53.242

Review 9.  Computational solutions for omics data.

Authors:  Bonnie Berger; Jian Peng; Mona Singh
Journal:  Nat Rev Genet       Date:  2013-05       Impact factor: 53.242

10.  Correcting the F508del-CFTR variant by modulating eukaryotic translation initiation factor 3-mediated translation initiation.

Authors:  Darren M Hutt; Salvatore Loguercio; Daniela Martino Roth; Andrew I Su; William E Balch
Journal:  J Biol Chem       Date:  2018-07-13       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.