Literature DB >> 22641854

DroPNet: a web portal for integrated analysis of Drosophila protein-protein interaction networks.

Yoan Renaud1, Alexandre Baillif, Jean-Baptiste Perez, Marie Agier, Engelbert Mephu Nguifo, Vincent Mirouse.   

Abstract

DroPNet (Drosophila Protein Network) is a Drosophila-dedicated web portal for generating and analyzing protein-protein interaction (PPI) networks. This platform integrates users' experimental data provided as one or two lists of genes with PPI data from Drosophila and other species. These experimental data can, for example, come from RNAi screens, for which this approach is known to be valuable. DroPNet, therefore, provides an essential basis for further biological analysis by linking functional and physical interactions and reinforcing the relevance of each. DroPNet focuses on the search of PPIs between genes of the entry list, and includes the possibility of searching for intermediate genes for which the corresponding protein indirectly links two entry data. It also offers multiple functions for editing the networks obtained, providing users with interactive possibilities to progressively improve and refine the results. This approach gives a global view of the studied process and makes it possible to highlight specific interactions that have so far been understudied. DroPNet is freely available at http://dropnet.isima.fr.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22641854      PMCID: PMC3394298          DOI: 10.1093/nar/gks434

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Recent years have seen the development of large-scale functional approaches based on RNAi applications. These techniques are able to knock down the expression of a specific gene in cell culture or in vivo, to see whether it has an effect on a specific biological event or function. This is especially true with the Drosophila model, where such screens are very easy to perform in cell culture as well as in vivo (1). In cell culture, RNAi can be very simply and efficiently induced without any need for transfection (2,3). In vivo, the RNAi, coupled to the ectopic expression system UAS/Gal4, makes it possible to knock down a selected gene in a specific population of cells during fly development or adult life (4). Moreover, this in vivo RNAi approach is now made possible at a whole-genome scale thanks to the development of Drosophila transgenic line collections that cover the entire genome (5,6). Thus, both in culture and in vivo, there is an exponentially growing number of reverse genetics screens being performed in Drosophila laboratories all over the world and covering all the domains of biology, from development to immunity or behavior. The main advantage of the reverse genetics approach over classical genetic screening is that biologists immediately know exactly which genes are involved in the process they are studying. They, therefore, end up with a list of genes that may be short (less than 10) or to several hundred depending on the sensitivity and accuracy of their screen read-out. The problem at this point is how to most efficiently handle this list―an issue that raises several questions: is it possible to get a global view of the studied process? What is already known and what is really new in the data? What are the best candidate genes for in-depth research? What are the molecular links connecting all the members of this list? One approach to addressing all these questions is to integrate the data from the functional screens with the data on protein–protein interactions (PPIs) referenced in available databases, which feature results from literature and high-throughput proteomics screens. However, these approaches generate significant numbers of false positives, which ultimately means their results are underexploited. Based on the PPIs, it is possible to map protein interaction networks that can establish molecular links between proteins coded by genes that share functional links. Grouping proteins into a common network based on a shared combination of function and physical interaction should considerably increase relevance confidence of both. This is a concept that has been validated both statistically and experimentally by independent groups (7,8). However, although protein network analysis web servers exist, none of them has made the analysis task easy to perform. We, therefore, developed a web platform called DroPNet (Drosophila Protein Network) with two main goals: (i) to include options that improve the possibilities of the approach, and these options are unique to DropNet; (ii) to build an easy-to-use tool specifically dedicated to Drosophila (although integrating data from other species) to ensure that any biologist working on the Drosophila model can perform this integrative approach and gain positive results.

OVERVIEW

There are a number of applications available for drawing PPI networks, and each has its own aims and possibilities. Cytoscape is an application that offers the possibility of drawing networks from the user’s own data (9). Many plug-ins are available for Cytoscape, and some of them, like APID, BioNetBuilder, MIMI or Droid plug-ins, allow users to search through public PPI data (including Drosophila’s data) using the user’s list of genes and retrieve the interactions (10). However, Cytoscape is not a web server-based application. STRING is a database of known and predicted protein interactions. STRING also offers an online visualization tool that gives the possibility of viewing protein interactions referenced in the String database as a network (11). Similarly, a visualization tool called IM Browser (for Interaction Map Browser) developed by the Finley Lab is able to work with DroID data (12,13). PINA provides a non-redundant protein interaction dataset for six model organisms and includes a tool for construction, visualization and management of protein interaction networks (14,15). GeneMANIA is an other web-based PPI visualization tool that also integrates other kind of data and aim to predict gene function (16). We have a step further and developed functionalities that are specific to DroPNet. DroPNet focuses on the search for both direct and indirect PPI among genes given by the user. Moreover, this search can be done on a unique list of genes or between two lists. An average number of interactions obtained from 10 random genes lists of same size as user gene list is provided for each network generated, giving an estimation of the significance of the interaction number. Another important feature of DroPNet (called ‘Upgrade’) is a system of iterative searches based on manual validation or deletion of interactions and new exploration for intermediates genes.

DROPNET WEB PLATFORM

DroPNet is a web platform that allows users to visualize, manipulate and save PPI networks with novel features for analysis. As DroPNet is geared specifically to Drosophila interactions, it can take data from the DroID database (13). Indeed, Drosophila PPI data can be found in many public databases, including Intact (17), Mint (18) and Biogrid (19), and they extend to cover many species based on the notion of ortholog genes. This is one of the factors that guided our decision to choose DroID, as it already recovers data from these different sources and species. DroID has also recently been updated with fresh datasets from large-scale co-affinity purification coupled to mass spectrometry analysis (20). The DroPNet interface is relatively simple yet complete, and features two main pages. The first page comprises a form enabling users to fill out data on their genes of interest and certain user-selected parameters. The second page gives the search results output as a Java applet displaying the PPI network and containing additional features to improve the obtained graph.

Implementation

The DroPNet web platform runs on a Transtec 2300L Data Storage Server running Intel Xeon E5506 quad-core 2.13 GHz processors with 4 Mb of L2 cache and 8 Gb of RAM. The form parts are implementing using the framework Richfaces 3.3.2 from JBOSS. Richfaces is a framework that simplifies the using of AJAX with JSF 1.2 technology. The part of drawing the networks is a Java applet, built in with the JDK6 version. This applet uses an open-source and free library for drawing networks called Piccolo2D 1.2.1.

Data entries

The input screen of the DroPNet web platform is shown in Figure 1 (containing default parameter values).
Figure 1.

Input screen of the DroPNet web server showing a predefined sample. (a) The two lists of genes to be filled in. (b) Drop-down lists to parameterize intermediate components. (c) List of the different available sources. (d) Sample Data.

Input screen of the DroPNet web server showing a predefined sample. (a) The two lists of genes to be filled in. (b) Drop-down lists to parameterize intermediate components. (c) List of the different available sources. (d) Sample Data. To generate the PPI network, the user needs to list representative genes for each protein he/she wants to consider, and to check a number of parameter boxes. Initially, the user has to provide one or two lists of genes for generating the network (Figure 1a). The major advantage of considering two lists is to allow the user to look for interactions that exclusively involve genes from one list and genes from the other list. This approach can be valuable if the user wants to analyze two lists of genes grouped by functionality to see whether they show interactions. For instance, two sets of genes sharing phenotype similarities could be analyzed using this option to look for PPIs creating physical links between the two sets. This choice option can be specified in a checklist on the input screen. List(s) of genes have to be entered using their CG number or FB number in a specific format (one number per line or inline list separated with comma or space) that is checked by the data entries web page. We chose not to use name as identifier as a gene can have many synonyms and people could misspell a name, making it time consuming to run input verifications. We also decided against using protein IDs as there are often several isoforms for a given gene, which can produce misleading results. Next, users have to specify a small set of parameters to define how they want to construct their network. In fact, the application looks for direct interactions between two of the given proteins, but sometimes two proteins can only be linked indirectly by one or more other proteins. These proteins are called intermediate components, and DroPNet offers the possibility of searching for them. It could be useful to highlight when two proteins interact indirectly via another protein that was not given as input data. This is a selectable option in the two drop-down lists on the input screen (Figure 1b). ‘Number of intermediate proteins’: users can choose to permit intermediate proteins (i.e. non-listed proteins that allow interactions by transitivity with the proteins entered). In the platform, to keep a reasonable calculation time and output easy-to-manipulate networks, number of intermediates is limited to 2. Moreover, tests have shown that with more than two intermediates, the biological meaning of these indirect interactions has little relevance and leads to inopportune extension of the networks. ‘Filter intermediate by number of interactions’: users can filter the intermediate proteins they want to consider in the network by using a maximum interaction criterion. For example, to filter out intermediate proteins that occur in more than 50 interactions, the user will choose 50 in the drop-down list. Several values between 25 and 500 can be chosen. The No Limit value means that the user does not want to apply any filter on intermediate proteins. This option is very useful for removing proteins with low binding specificity, such as chaperone proteins. The indirect links created by such intermediates is usually not relevant. We opted for this filter system rather than one based on a confidence score. First, not all the interactions referenced in databases come with a confidence score. Second, high scores are mainly due to the fact that an interaction has already been extensively studied. Consequently, filtering on such scores would remove all the potentially interesting understudied interactions, which is exactly the reverse of one of the main goals of this application. User can select different sources for interactions from different databases as classified in DroID (Figure 1c). Two methods are available for executing this task: the user can check each interaction database listed one by one, or else they can choose to select all databases or only both Drosophila and ortholog databases simply by clicking on the associated button(s). Hyperlinks are available to get a more detailed description of each database. A detailed description of all the features of the web server is available in the Tutorial Page that is accessible in the banner of each page of the DroPNet platform. Note that the web platform can be tested with four predefined samples on the top left hand corner of the input page. If the user clicks one of the samples, it will automatically fill lists of genes and check a set of parameters. For instance, the first sample is real results from a whole genome RNAi screen on centriole duplication and maturation, a key organizing structure for microtubule cytoskeleton (21).

Results page

Once users have submitted their data and filled in the different parameters, they can generate the network. A Java applet will launch and the network will be drawn. Note that if the user wants to generate a network that turns out to be huge, he/she will be informed of its size (intermediate node number >70 and interaction number >500) before the network is generated. Moreover, the user can choose to continue generating it or to abort the process and return to the data entry page. The results page is divided into three parts: a menu bar with a set of specific options (Figure 2d), the interactive network itself (Figure 2a) and an informative panel split into two boxes (Figure 2b and c). The main part of the page consists in the graph corresponding to the PPI network the user wants to obtain. Proteins are represented by colored round nodes, and interactions between two proteins are represented by colored dashed line edges. Each color of the different edges is associated with the database source corresponding to the interactions. If the user chooses to enter two distinct lists of genes in the input page, nodes will have different colors according to the list from which the gene comes. Moreover, a different color is used for intermediate nodes. Information on these different colors is available in the left part of the informative panel located at the bottom of the screen (Figure 2b). This information is followed by the number of interactions present in this network and the average number of interactions obtained from 10 randomly generated gene lists of the same size and based on the same parameters. These values give user an estimate of how their network is enriched in relevant interactions. It is worth noting that all the published RNAi screen results that we tested with DroPNet gave much bigger networks than random runs, whatever the parameters used, thus confirming the validity of the approach. For instance, we tested a series of parameters with sample 1 (Table 1). In all cases, the number of interactions found in the network is far bigger than the average obtained with random sets. It is worth noting that is also true when looking only to Curagen and DPIM data that have been obtained by whole genome proteomics approaches. Thus, integrating data from an unbiased whole genome RNAi screen with unbiased PPI data still lead to very significant results.
Figure 2.

Results screen of the DroPNet web server. (a) Graph representing the interaction network targeted. (b) Legend panel containing graph-related information. (c) Detail panel on a considered edge or node. (d) Tools menu for editing or saving the network. The network shown here corresponds to Sample 1 and parameters indicated on Figure 1.

Table 1.

Example of numbers of PPI found with different parameters applied on Sample 1

Selected databasesZero intermediateOne intermediate (filter <100)Two intermediates (filter <50)
Curagen and DPIM1/08/0.456/2.4
Drosophila only1/08/0.672/3.1
All10/0.2288/8.7257/7.4

For each the first number corresponds to number of found interactions and the second one to the result average of 10 random sets of the same size and with the same parameters. The first selection (Curagen and DPIM) corresponds to two unbiased whole proteome screen.

Results screen of the DroPNet web server. (a) Graph representing the interaction network targeted. (b) Legend panel containing graph-related information. (c) Detail panel on a considered edge or node. (d) Tools menu for editing or saving the network. The network shown here corresponds to Sample 1 and parameters indicated on Figure 1. Example of numbers of PPI found with different parameters applied on Sample 1 For each the first number corresponds to number of found interactions and the second one to the result average of 10 random sets of the same size and with the same parameters. The first selection (Curagen and DPIM) corresponds to two unbiased whole proteome screen. Users can easily interact with the network using mouse commands: The tool menu at the top of the results page offers more features for editing the network or saving the data and the resulting graph (Figure 2d). The first available option is the choice of layout that the user wants to apply for the generated interactions network. Instead of manipulating the graph, users can opt to apply predefined layout (available in a drop-down list) to their network designed to make analysis easier. However, our experience is that the different organic layouts give the best results. Three other functionalities are also designed to improve the network visualization display according to user choice. In practice, users can center and scale the network on-screen to make it more readable, and also delete all stand-alone genes to get rid of all unnecessary information. The third visual display option consists in applying filters. Thus, users can control the visibility of both the interaction species they want to consider and the number of intermediate nodes in network, which should make specific analyses easier. A user can move a node by straightforward drag and drop. They can also move all the nodes by dragging the background. For a given node, a contextual menu (right-click) can be used to open a new table with a Flybase gene description page or to delete this node. By default, an interaction stands as ‘not validated’. By right-clicking an edge, the user can either validate it if he trusts this interaction or delete it if he thinks it is not relevant. The validation of an interaction makes its edges becoming full-line. It is possible to undo this action by invalidating the interaction, which will return all the edges of the interaction to dashed line format. These options are important for improving network visualization, but they are also required for the ‘upgrade’ option described later. Left-clicking nodes or edges will bring up information on the item in the right part of the informative panel (Figure 2c). Clicking on a node brings up the gene name abbreviation with its associated CG Number and links to its corresponding web pages on both Flybase and DroID web sites. The panel also indicates the number of known interactions for each source involving the selected protein. Clicking on an edge will display the species where the interaction has been identified. The panel also gives a link to paper(s) where this interaction is mentioned and to original source information on the interaction. Scrolling the mouse wheel offers a zoom in/zoom out function. The tools menu also offers several saving features allowing the user to save their network in different ways. The user can save their network as a JPEG picture to keep a visual file, as well as export all data into an .xls file. The option enables the user to reuse the information obtained from the network, and table format data is often valuable for further analysis. Users are not required to log in to use DroPNet, but they can elect to create a log-in account to save their edited networks. Users will find it useful to be able to return later on to their network and find it at the point it was saved, and to be able to store different networks at different states of editing. It also guards against information being lost when the user exits the application. Another important feature of DroPNet is that it proposes a system of iterative searches based on manual validation or deletion of interactions, called Upgrade. To use the Upgrade function, the user has to validate a set of network edges involving intermediate nodes. Note that for each time an internode is linked to a node of the initial list(s) by a validated edge, its associated gene will be added to the initial list. A new network is generated that integrates this new list of genes with the same parameters as the previous network. For instance, rapid overview on Flybase of the intermediate genes shown on Figure 2 allows to identify at least six microtubule binding proteins or proteins involved in meiosis and mitosis, which are processes highly dependent on microtubules. We, therefore, validated interactions with these proteins and upgrade the network and nine new interactors were identified (Figure 3). To our knowledge, this option aiming to progressively improve and extend networks based on user interaction validation is highly innovative within the PPI networks domain.
Figure 3.

Results from an upgrade of the network shown in Figure 2. (a) Network where six interactions have been validated. (b) Resulting Network after upgrade. New intermediates nodes appear in pink.

Results from an upgrade of the network shown in Figure 2. (a) Network where six interactions have been validated. (b) Resulting Network after upgrade. New intermediates nodes appear in pink.

CONCLUSIONS

DroPNet is dedicated to the design of PPI networks based on a list of Drosophila genes. This approach has, for instance, been validated for interpretation of RNAi screens. The main outcomes that can be achieved using DroPNet use are (i) to get an overview of the molecular connectivity of user-entered genes in efforts to identify new functions for specific pathways or molecular complexes and (ii) to highlight understudied molecular interactions present in databases and emerge their potential functions. In practice, DroPNet was developed to manipulate lists of just a handful up to a couple of hundred genes, but it is clearly not a tool for the visualization of a complete interactome or of all the PPIs mobilizing a single protein. It has also been designed to allow major user editing interactivity so as to improve the obtained networks (searches for intermediates, filters, upgrade and saving functions, and more). Users may need to try several parameters and spend time optimize the results depending on the size of their entry list, their precise aims and the degree of connectivity between the genes. Although initially designed for the analysis of RNAi screen results, DroPNet can be used for post-analysis in other systems biology approaches, such as co-expression data. Therefore, DroPNet will prove a valuable tool for the Drosophila community as a whole, and in the extending systems biology field.

FUNDING

CNRS/INSERM ATIP/Avenir program and the (ARC) to Y.R. and V.M.; Lifegrid grant to A.B. and J.-B.P. Funding for open access charge: CNRS. Conflict of interest statement. None declared.
  21 in total

1.  An RNA-directed nuclease mediates post-transcriptional gene silencing in Drosophila cells.

Authors:  S M Hammond; E Bernstein; D Beach; G J Hannon
Journal:  Nature       Date:  2000-03-16       Impact factor: 49.962

2.  Inducible expression of double-stranded RNA directs specific genetic interference in Drosophila.

Authors:  G Lam; C S Thummel
Journal:  Curr Biol       Date:  2000-08-24       Impact factor: 10.834

3.  IntAct: an open source molecular interaction database.

Authors:  Henning Hermjakob; Luisa Montecchi-Palazzi; Chris Lewington; Sugath Mudali; Samuel Kerrien; Sandra Orchard; Martin Vingron; Bernd Roechert; Peter Roepstorff; Alfonso Valencia; Hanah Margalit; John Armstrong; Amos Bairoch; Gianni Cesareni; David Sherman; Rolf Apweiler
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

5.  Integrated network analysis platform for protein-protein interactions.

Authors:  Jianmin Wu; Tea Vallenius; Kristian Ovaska; Jukka Westermarck; Tomi P Mäkelä; Sampsa Hautaniemi
Journal:  Nat Methods       Date:  2008-12-14       Impact factor: 28.547

6.  Use of double-stranded RNA interference in Drosophila cell lines to dissect signal transduction pathways.

Authors:  J C Clemens; C A Worby; N Simonson-Leff; M Muda; T Maehama; B A Hemmings; J E Dixon
Journal:  Proc Natl Acad Sci U S A       Date:  2000-06-06       Impact factor: 11.205

Review 7.  MINT: a Molecular INTeraction database.

Authors:  Andreas Zanzoni; Luisa Montecchi-Palazzi; Michele Quondam; Gabriele Ausiello; Manuela Helmer-Citterich; Gianni Cesareni
Journal:  FEBS Lett       Date:  2002-02-20       Impact factor: 4.124

8.  PINA v2.0: mining interactome modules.

Authors:  Mark J Cowley; Mark Pinese; Karin S Kassahn; Nic Waddell; John V Pearson; Sean M Grimmond; Andrew V Biankin; Sampsa Hautaniemi; Jianmin Wu
Journal:  Nucleic Acids Res       Date:  2011-11-08       Impact factor: 16.971

9.  BioGRID: a general repository for interaction datasets.

Authors:  Chris Stark; Bobby-Joe Breitkreutz; Teresa Reguly; Lorrie Boucher; Ashton Breitkreutz; Mike Tyers
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  DroID: the Drosophila Interactions Database, a comprehensive resource for annotated gene and protein interactions.

Authors:  Jingkai Yu; Svetlana Pacifico; Guozhen Liu; Russell L Finley
Journal:  BMC Genomics       Date:  2008-10-07       Impact factor: 3.969

View more
  1 in total

1.  Neuron-specific protein interactions of Drosophila CASK-β are revealed by mass spectrometry.

Authors:  Konark Mukherjee; Justin B Slawson; Bethany L Christmann; Leslie C Griffith
Journal:  Front Mol Neurosci       Date:  2014-06-30       Impact factor: 5.639

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.