
Evidence mining and novelty assessment of protein-protein interactions with the ConsensusPathDB plugin for Cytoscape.

Konstantin Pentchev, Keiichiro Ono, Ralf Herwig, Trey Ideker, Atanas Kamburov.

Abstract

SUMMARY: Protein-protein interaction detection methods are applied on a daily basis by molecular biologists worldwide. After generating a set of potential interactions, biologists face the problem of highlighting the ones that are novel and collecting evidence with respect to literature and annotation. This task can be as tedious as searching for every predicted interaction in several interaction data repositories, or manually screening the scientific literature. To facilitate the task of evidence mining and novelty assessment of protein-protein interactions, we have developed a Cytoscape plugin that automatically mines publication references, database references, interaction detection method descriptions and pathway annotation for a user-supplied network of interactions. The basis for the annotation is ConsensusPathDB, a meta-database that integrates numerous protein-protein, signaling, metabolic and gene regulatory interaction repositories, currently for three species: Homo sapiens, Saccharomyces cerevisiae and Mus musculus.
AVAILABILITY: The ConsensusPathDB plugin for Cytoscape (version 2.7.0 or later) can be installed through Cytoscape's Plugin manager (category 'Network and Attribute I/O') on a major operating system (Windows, Mac OS, Unix/Linux) with Sun Java 1.5 or later installed. The plugin is freely available for download on the ConsensusPathDB web site (http://cpdb.molgen.mpg.de).
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Year:  2010        PMID: 20847220      PMCID: PMC2958747          DOI: 10.1093/bioinformatics/btq522

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Due to the high explanatory power of protein–protein interactions for biological processes in health and disease (Ideker and Sharan, 2008), dedicated interaction detection methods like yeast two-hybrid (Y2H) screening (Fields, 2005) and co-purification (Aebersold and Mann, 2003) are applied on a daily basis by molecular biologists worldwide and contribute to the completion of the map of protein–protein interactions for human and other species. An immediate task after generating a network of predicted interactions is to identify the ones that have not been published previously and to collect evidence for every single interaction from literature and annotation. This information is useful for estimating the performance of the interaction screen and for assessing its contribution to the protein–protein interaction map of the species in question. To accomplish this task, biologists typically search their new data against every single protein–protein interaction repository like IntAct (Huntley et al., 2007) or MINT (Chatr-aryamontri et al., 2007). Even more tedious is the manual mining for interactions in scientific literature to collect the publication references and detection methods for the novel interaction list.
Cytoscape (Shannon et al., 2003) is a widely used, freely available software tool for visualization, manipulation and analysis of biomolecular interaction networks. To aid the process of interaction evidence mining, we have developed a plugin for Cytoscape that searches all interactions from the network of interest in the interaction space stored in ConsensusPathDB. ConsensusPathDB (Kamburov et al., 2009) is an interaction meta-database that integrates functional interaction repositories, forming a heterogeneous interaction network which comprises protein–protein interactions as well as signaling, metabolic and gene regulatory interactions. Currently, the database integrates 18 open-access repositories on human interactions and eight repositories for both yeast and mouse interactions and contains around 150 000 human, 195 000 yeast and 13 000 mouse distinct interactions (many of which are of non-binary nature, i.e. contain more than two interaction partners). In this article, we describe the functionality of the ConsensusPathDB plugin for Cytoscape and demonstrate its usage and performance.

2 DESCRIPTION

After installing the plugin, the user starts by loading the network of interest (denoted query network) represented by binary interactions in Cytoscape and launching the ConsensusPathDB plugin through Cytoscape's ‘Plugins’ menu (Fig. 1A). After setting a few parameters which we describe below, the user starts the evidence mining process. The plugin then communicates with the repository of ConsensusPathDB through a web service. Once the plugin sends the query network to the server, a search is executed on the server-side for all (or, optionally, just the selected) proteins and interactions from the query network in ConsensusPathDB through SQL queries. Proteins from the query network are matched to the data repository on the basis of accession numbers such as UniProt (The UniProt Consortium, 2010) or Ensembl (Flicek et al., 2010). Interactions from the query network are matched to the repository based on their participants.
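As a rough illustration of matching interactions "based on their participants", the sketch below treats each binary interaction as an unordered set of accession numbers and splits a query network into reproduced and potentially novel interactions. It is not the plugin's server-side SQL implementation; the function names and the placeholder accession strings are assumptions made for this example only.

```python
from typing import Iterable, List, Tuple

Edge = Tuple[str, str]


def as_interaction(a: str, b: str) -> frozenset:
    """Represent a binary interaction as an unordered participant set,
    so that (A, B) and (B, A) compare as the same interaction."""
    return frozenset((a, b))


def split_by_novelty(
    query_edges: Iterable[Edge],
    known_interactions: Iterable[frozenset],
) -> Tuple[List[Edge], List[Edge]]:
    """Return (reproduced, novel) query edges.

    An edge counts as reproduced if its participant set is present in
    the known interaction space; otherwise it is potentially novel.
    """
    known = set(known_interactions)
    reproduced: List[Edge] = []
    novel: List[Edge] = []
    for a, b in query_edges:
        target = reproduced if as_interaction(a, b) in known else novel
        target.append((a, b))
    return reproduced, novel


# Hypothetical accession numbers, used purely as placeholders.
known_db = [as_interaction("ACC_B", "ACC_A")]
query = [("ACC_A", "ACC_B"), ("ACC_B", "ACC_C")]
reproduced, novel = split_by_novelty(query, known_db)
```

Using an unordered set as the lookup key makes the comparison independent of the order in which the two participants are listed in the query network.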
Fig. 1.

(A) The splash screen of the plugin showing the different parameters; (B) the ConsensusPathDB visual style where reproduced interactions are weighted by evidence and novel interactions are highlighted in green; (C) newly imported attributes of a selected interaction are shown in the ‘Interaction details’ tab of Cytoscape's results panel; (D) evidence mining time plot for networks of different size with default parameters (here, all query interactions were present in ConsensusPathDB such that the mining process took maximal time). The sizes of the networks predicted using large-scale interaction screening by Rual et al. (2005) (R), Stelzl et al. (2005) (S) and Ewing et al. (2006) (E) are marked on the x-axis.

The performance of the interaction matching depends critically on how well proteins in the query network are annotated with accession numbers. If accession numbers are not available, the user is prompted to specify whether the node labels represent accession numbers of a certain type. The interaction matching performance is influenced by two parameters, ‘protein annotation matching’ (strict/fuzzy) and ‘interaction cardinality matching’ (strict/allow containment). Strict protein annotation matching denotes that a protein from the query network and a protein from the database are considered identical only if all identifiers of a type match. Fuzzy matching means that the identifiers of the query protein may form a subset of the identifiers of the database counterpart or vice versa. Fuzzy matching is useful, e.g. when proteins on the one side are compared with protein families on the other side. The ‘interaction cardinality matching’ parameter specifies whether the binary interactions from the query network should be matched only with binary interactions from the database network (strict matching) or whether they may be matched to complex interactions, i.e. interactions of more than two proteins that contain the binary interactions.
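The two matching parameters can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the plugin's actual code: each protein is assumed to be described by a set of identifiers of a single type, and the names `proteins_match`, `interaction_matches`, `fuzzy` and `allow_containment` are hypothetical.

```python
from typing import Sequence, Set, Tuple


def proteins_match(query_ids: Set[str], db_ids: Set[str], fuzzy: bool = False) -> bool:
    """Strict: the identifier sets must be identical.
    Fuzzy: one identifier set may be a subset of the other."""
    if not query_ids or not db_ids:
        return False  # nothing to compare without identifiers
    if fuzzy:
        return query_ids <= db_ids or db_ids <= query_ids
    return query_ids == db_ids


def interaction_matches(
    query_pair: Tuple[Set[str], Set[str]],
    db_participants: Sequence[Set[str]],
    fuzzy: bool = False,
    allow_containment: bool = False,
) -> bool:
    """Check whether a binary query interaction matches a database interaction.

    With strict cardinality matching, only binary database interactions
    qualify; with containment allowed, a larger complex qualifies if it
    contains both query proteins as distinct participants.
    """
    if not allow_containment and len(db_participants) != 2:
        return False
    qa, qb = query_pair
    for i, pa in enumerate(db_participants):
        if not proteins_match(qa, pa, fuzzy):
            continue
        for j, pb in enumerate(db_participants):
            if i != j and proteins_match(qb, pb, fuzzy):
                return True
    return False


# Placeholder identifiers, for illustration only.
query = ({"ID1"}, {"ID2"})
complex_of_three = [{"ID1"}, {"ID2"}, {"ID3"}]
family_pair = [{"ID1", "ID1b"}, {"ID2"}]

strict_vs_complex = interaction_matches(query, complex_of_three)
contained = interaction_matches(query, complex_of_three, allow_containment=True)
strict_vs_family = interaction_matches(query, family_pair)
fuzzy_vs_family = interaction_matches(query, family_pair, fuzzy=True)
```

In this sketch the three-protein complex is only accepted when containment is allowed, and the family-like entry with two identifiers is only accepted under fuzzy matching, mirroring the behavior described for the two parameters.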
More details about protein and interaction mapping can be found in the Supplementary Material to this paper. After matching proteins and interactions, the web service server sends annotation attributes for matched query interactions in the form of publication references (PubMed identifiers), interaction detection methods, database references (such as IntAct and MINT) and pathway annotations (i.e. pathways that contain both participants of a protein–protein interaction) to the client plugin. The plugin creates a custom visual style in Cytoscape where the thickness of interaction edges reflects (optionally) the number of publications, number of containing interaction databases, number of distinct detection methods, or number of containing pathways for the protein interaction (Fig. 1B). Interactions that are not found in the repository, and thus represent potential novel interactions, are highlighted in green. In the results tab of Cytoscape, an interaction mapping summary is displayed together with a legend. The interaction attributes that have been retrieved from ConsensusPathDB can be viewed for selected interactions under the ‘Interaction details’ tab of the results panel (Fig. 1C). If applicable, this information is provided as web links to the primary data and can be viewed in a web browser.
Figure 1D shows the performance of the plugin implementation with respect to the mining of interaction annotation for different network sizes. Results show that even for large networks evidence mining executes in minutes, for example ∼2 min for a network with 20 000 nodes. It should be noted, however, that the Internet connection speed of the client influences the overall speed of interaction matching.
Funding: Max Planck Society (IMPRS-CBSC); Cytoscape Project (GM070743); European Union's project APO-SYS (HEALTH-F4-2007-200767); BMBF MedSys project PREDICT (0315428A).
Conflict of Interest: none declared.
  12 in total

Review 1.  Mass spectrometry-based proteomics.

Authors:  Ruedi Aebersold; Matthias Mann
Journal:  Nature       Date:  2003-03-13       Impact factor: 49.962

2.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

3.  Towards a proteome-scale map of the human protein-protein interaction network.

Authors:  Jean-François Rual; Kavitha Venkatesan; Tong Hao; Tomoko Hirozane-Kishikawa; Amélie Dricot; Ning Li; Gabriel F Berriz; Francis D Gibbons; Matija Dreze; Nono Ayivi-Guedehoussou; Niels Klitgord; Christophe Simon; Mike Boxem; Stuart Milstein; Jennifer Rosenberg; Debra S Goldberg; Lan V Zhang; Sharyl L Wong; Giovanni Franklin; Siming Li; Joanna S Albala; Janghoo Lim; Carlene Fraughton; Estelle Llamosas; Sebiha Cevik; Camille Bex; Philippe Lamesch; Robert S Sikorski; Jean Vandenhaute; Huda Y Zoghbi; Alex Smolyar; Stephanie Bosak; Reynaldo Sequerra; Lynn Doucette-Stamm; Michael E Cusick; David E Hill; Frederick P Roth; Marc Vidal
Journal:  Nature       Date:  2005-09-28       Impact factor: 49.962

4.  A human protein-protein interaction network: a resource for annotating the proteome.

Authors:  Ulrich Stelzl; Uwe Worm; Maciej Lalowski; Christian Haenig; Felix H Brembeck; Heike Goehler; Martin Stroedicke; Martina Zenkner; Anke Schoenherr; Susanne Koeppen; Jan Timm; Sascha Mintzlaff; Claudia Abraham; Nicole Bock; Silvia Kietzmann; Astrid Goedde; Engin Toksöz; Anja Droege; Sylvia Krobitsch; Bernhard Korn; Walter Birchmeier; Hans Lehrach; Erich E Wanker
Journal:  Cell       Date:  2005-09-23       Impact factor: 41.582

Review 5.  Protein networks in disease.

Authors:  Trey Ideker; Roded Sharan
Journal:  Genome Res       Date:  2008-04       Impact factor: 9.043

Review 6.  High-throughput two-hybrid analysis. The promise and the peril.

Authors:  Stanley Fields
Journal:  FEBS J       Date:  2005-11       Impact factor: 5.542

7.  IntAct--open source resource for molecular interaction data.

Authors:  S Kerrien; Y Alam-Faruque; B Aranda; I Bancarz; A Bridge; C Derow; E Dimmer; M Feuermann; A Friedrichsen; R Huntley; C Kohler; J Khadake; C Leroy; A Liban; C Lieftink; L Montecchi-Palazzi; S Orchard; J Risse; K Robbe; B Roechert; D Thorneycroft; Y Zhang; R Apweiler; H Hermjakob
Journal:  Nucleic Acids Res       Date:  2006-12-01       Impact factor: 16.971

8.  Large-scale mapping of human protein-protein interactions by mass spectrometry.

Authors:  Rob M Ewing; Peter Chu; Fred Elisma; Hongyan Li; Paul Taylor; Shane Climie; Linda McBroom-Cerajewski; Mark D Robinson; Liam O'Connor; Michael Li; Rod Taylor; Moyez Dharsee; Yuen Ho; Adrian Heilbut; Lynda Moore; Shudong Zhang; Olga Ornatsky; Yury V Bukhman; Martin Ethier; Yinglun Sheng; Julian Vasilescu; Mohamed Abu-Farha; Jean-Philippe Lambert; Henry S Duewel; Ian I Stewart; Bonnie Kuehl; Kelly Hogue; Karen Colwill; Katharine Gladwish; Brenda Muskat; Robert Kinach; Sally-Lin Adams; Michael F Moran; Gregg B Morin; Thodoros Topaloglou; Daniel Figeys
Journal:  Mol Syst Biol       Date:  2007-03-13       Impact factor: 11.429

9.  ConsensusPathDB--a database for integrating human functional interaction networks.

Authors:  Atanas Kamburov; Christoph Wierling; Hans Lehrach; Ralf Herwig
Journal:  Nucleic Acids Res       Date:  2008-10-21       Impact factor: 16.971

10.  Ensembl's 10th year.

Authors:  Paul Flicek; Bronwen L Aken; Benoit Ballester; Kathryn Beal; Eugene Bragin; Simon Brent; Yuan Chen; Peter Clapham; Guy Coates; Susan Fairley; Stephen Fitzgerald; Julio Fernandez-Banet; Leo Gordon; Stefan Gräf; Syed Haider; Martin Hammond; Kerstin Howe; Andrew Jenkinson; Nathan Johnson; Andreas Kähäri; Damian Keefe; Stephen Keenan; Rhoda Kinsella; Felix Kokocinski; Gautier Koscielny; Eugene Kulesha; Daniel Lawson; Ian Longden; Tim Massingham; William McLaren; Karine Megy; Bert Overduin; Bethan Pritchard; Daniel Rios; Magali Ruffier; Michael Schuster; Guy Slater; Damian Smedley; Giulietta Spudich; Y Amy Tang; Stephen Trevanion; Albert Vilella; Jan Vogel; Simon White; Steven P Wilder; Amonida Zadissa; Ewan Birney; Fiona Cunningham; Ian Dunham; Richard Durbin; Xosé M Fernández-Suarez; Javier Herrero; Tim J P Hubbard; Anne Parker; Glenn Proctor; James Smith; Stephen M J Searle
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971
