Literature DB >> 18984619

UniHI 4: new tools for query, analysis and visualization of the human protein-protein interactome.

Gautam Chaurasia1, Soniya Malhotra, Jenny Russ, Sigrid Schnoegl, Christian Hänig, Erich E Wanker, Matthias E Futschik.   

Abstract

Human protein interaction maps have become important tools of biomedical research for the elucidation of molecular mechanisms and the identification of new modulators of disease processes. The Unified Human Interactome database (UniHI, http://www.unihi.org) provides researchers with a comprehensive platform to query and access human protein-protein interaction (PPI) data. Since its first release, UniHI has considerably increased in size. The latest update of UniHI includes over 250,000 interactions between approximately 22,300 unique proteins collected from 14 major PPI sources. However, this wealth of data also poses new challenges for researchers due to the complexity of interaction networks retrieved from the database. We therefore developed several new tools to query, analyze and visualize human PPI networks. Most importantly, UniHI allows now the construction of tissue-specific interaction networks and focused querying of canonical pathways. This will enable researchers to target their analysis and to prioritize candidate proteins for follow-up studies.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18984619      PMCID: PMC2686569          DOI: 10.1093/nar/gkn841

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Human protein interaction maps play an increasingly important role in biomedical research. They have been shown to be highly valuable in the study of a variety of human diseases and signaling pathways (1–3). The rising popularity of network analyses is reflected in the large number of independently constructed human PPI maps based on experimental and computational approaches. However, these maps generally have limited overlap and frequently lack cross-references (4). To obtain comprehensive interaction data, researchers were required to perform time-consuming queries of different databases with subsequent error-prone matching of obtained identifiers. UniHI has been developed to overcome these difficulties (5). It has integrated separate PPI resources to provide a comprehensive platform for querying the human interactome. UniHI is not intended to replace single databases, but to offer a convenient single portal access to human protein interaction data for the biomedical research community. Notably, it allows the identification of network topologies which would not be detectable if PPI resources were examined separately. The size of UniHI has remarkably increased with more than 250 000 human PPIs currently included. However, this amount of data has its challenges. Even searches with a small number of query proteins can lead to large, highly connected—and often unstructured—networks (frequently referred to as ‘hairballs’). For efficient follow-up analysis, new tools for navigation within the network and prioritization of targets are necessary. Also, flexible visualization is a crucial prerequisite for the display and evaluation of network structures. To meet these challenges, we have implemented several new tools in UniHI for query, analysis and visualization of interaction networks. Beyond its original role as a direct entry gate to the human interactome, UniHI will serve as an integrative platform for the exploration and utilization of human PPI data.

UPDATES AND EXTENSIONS

In our aim to continually provide the most comprehensive human PPI dataset, UniHI has been substantially extended by the inclusion of interactions from two additional major protein interaction databases, i.e. IntAct and BioGRID (6,7). Currently, UniHI includes over 250 000 interactions between more than 22 000 unique proteins from 14 distinct sources, establishing it as the largest catalog for human PPIs worldwide (Table 1 and Figure 1). Although the overlap between different PPI resources included in UniHI has increased, they are still strongly divergent. Only a relatively small fraction of ∼19% can be found in two or more interaction maps, underlining the continuing need for integrative platforms such as UniHI (Supplementary Tables S1 and S2; Figures S1A and B).
Table 1.

PPI datasets currently integrated in UniHI

DatasetProteinsInteractionsMethodReferenceDatabase location
MDC-Y2H17033186Y2H screenStelzl et al. (8)http://www.mdc-berlin.de/neuroprot
CCSB-Y2H15492754Y2H screenRual et al. (9)vidal.dfci.harvard.edu (flat file only)
HPRD-BIN878832776LiteratureMishra et al. (10)http://www.hprd.org
HPRD-COMP19698107LiteratureMishra et al. (10)http://www.hprd.org
DIP10851397LiteratureSalwinski et al. (11)dip.doe-mbi.ucla.edu
BIOGRID795324624LiteratureBreitkreutz et al. (7)http://www.thebiogrid.org
INTACT727319404LiteratureKerrien et al. (6)http://www.ebi.ac.uk/intact
BIND52867394LiteratureBader et al. (12)http://www.bind.ca
REACTOME155437332LiteratureJoshi-Tope et al. (13)http://www.reactome.org
COCIT37376580Text miningRamani et al. (14)Bioinformatics.icmb.utexas.edu/idserve/
ORTHO622571466OrthologyLehner and Fraser (15)http://www.sanger.ac.uk/PostGenomics/signaltransduction/interactionmap
HOMOMINT412710174OrthologyPersico et al. (16)mint.bio.uniroma2.it
OPHID478524991OrthologyBrown and Jurisica (17)ophid.utoronto.ca

Number of proteins and interactions in each dataset as well as construction approaches and references.

Figure 1.

Coverage of the functionally annotated human genome by PPI resources. For annotation, Gene Ontology was utilized. Coverage rates were derived after mapping of proteins to corresponding Entrez Gene IDs. Notably, the coverage of UniHI is considerably larger than of the individual PPI resources.

Coverage of the functionally annotated human genome by PPI resources. For annotation, Gene Ontology was utilized. Coverage rates were derived after mapping of proteins to corresponding Entrez Gene IDs. Notably, the coverage of UniHI is considerably larger than of the individual PPI resources. PPI datasets currently integrated in UniHI Number of proteins and interactions in each dataset as well as construction approaches and references. In this study, we optimized the query interface which allows the simultaneous search for interaction partners of several proteins in a network-oriented manner. To facilitate its application, the list of possible protein identifiers has been expanded to include gene symbol, Entrez Gene, Uniprot, NCBI Geneinfo, Ensembl, Biogrid and HPRD IDs. Notably, these identifiers can now also be used for direct hyperlinks to UniHI. As in previous versions, special care was taken to indicate the origin of the interaction data to the user. Besides links to the original resource, a variety of information regarding the interacting proteins is given. Additionally, updates were carried out for measures of co-annotation and co-expression, which are not only important for the interpretation of single interactions, but also for higher network structures (18).

NEW INTERACTIVE VISUALIZATION TOOL

Visualization of the retrieved interaction networks remains to be crucial for the evaluation of query results. The complexity of retrieved networks, however, requires highly flexible graphical tools. While the former versions of UniHI only provided non-interactive display, the present update includes interactive graphical tools which offer many attractive features for rapid analysis and adjustment of the extracted information. For example, nodes (i.e. proteins) can be anchored or hidden allowing filtering of the network and manual adjustment of the layout. Also, information about proteins and interactions can now be accessed directly in the network graphics, thereby avoiding cumbersome comparisons with the textual output. The display can be restricted to direct interactions between query proteins or extended to include bridging proteins. For quality control, users can specify the PPI resource, from which interactions should be retrieved. This allows, e.g. the exclusion of less validated mapping approaches such as computational prediction. As additional criteria, interactions can be filtered based on a minimum number of PubMed references in which they have been reported.

UNIHI SCANNER

Pathway-focused interaction networks

Pathway information can provide highly useful clues about the functions and dynamics of interactions. Especially for the elucidation of local network structures, knowledge of interrelated pathways can be of crucial importance. We therefore constructed a new tool called UniHI Pathway Scanner (Figure 2A and B). It provides the possibility to examine the intersection of canonical pathways from KEGG with the extracted networks (19). In this way, it enables researchers to detect possible modifiers of pathways as well as proteins involved in the cross-talk between different pathways. Users can switch between the graphical display of the complete network and the intersection with selected pathways. UniHI Scanner does not only show the proteins included in the pathway but also the KEGG annotation of the interactions (e.g. phosphorylation, activation or inhibition) between nodes (see also Supplementary Materials). We expect that this will be a highly attractive feature for the large community of researchers working in cell signaling.
Figure 2.

Graphical representation and analysis of PPI networks using UniHI Scanner and UniHI Express. (A) Display of the interaction partners (yellow or gray) of the query proteins (red) GADD45, CDK1, CDK2 and CDK7. Gray nodes represent proteins included in the KEGG ‘cell cycle’ pathway. UniHI Scanner allows a focused display of the intersection between the retrieved PPI network and the pathway (B). Additional information is given regarding the type of interaction (e.g. phosphorylation (+P), dephosphorylation (−P), activation (- ->) or inhibition (- -|)), facilitating the assessment of the retrieved interactions. (C) Construction of tissue-specific networks by UniHI Express: Interaction partners (yellow) of HD, CRMP1, SH3GL3 and PRPF40A (red) which have mininmal expression values in brain tissue are displayed. The selection of larger expression thresholds can lead to a considerable reduction of the network, allowing the prioritization of proteins and interactions for follow-up studies (D).

Graphical representation and analysis of PPI networks using UniHI Scanner and UniHI Express. (A) Display of the interaction partners (yellow or gray) of the query proteins (red) GADD45, CDK1, CDK2 and CDK7. Gray nodes represent proteins included in the KEGG ‘cell cycle’ pathway. UniHI Scanner allows a focused display of the intersection between the retrieved PPI network and the pathway (B). Additional information is given regarding the type of interaction (e.g. phosphorylation (+P), dephosphorylation (−P), activation (- ->) or inhibition (- -|)), facilitating the assessment of the retrieved interactions. (C) Construction of tissue-specific networks by UniHI Express: Interaction partners (yellow) of HD, CRMP1, SH3GL3 and PRPF40A (red) which have mininmal expression values in brain tissue are displayed. The selection of larger expression thresholds can lead to a considerable reduction of the network, allowing the prioritization of proteins and interactions for follow-up studies (D).

UNIHI EXPRESS

Tissue-specific PPI networks

Protein interactions are known to be highly dynamic and to strongly depend on many biological factors. Current protein interaction maps, however, only give a static view of the human interactome. Experimentally validated protein interactions are generally identified under a variety of conditions in numerous cell and tissue types. Current interaction maps do not fully reflect physiological states, because only a selection of proteins is present in a cell at a certain point in time. Biomedical research, however, is usually focused on specific tissues involved in pathogenesis. Addressing this need, we developed and implemented UniHI Express as a new tool in our database (Figure 2C and D). It allows the filtering of PPIs based on gene expression in selected tissues and thus enables the construction of tissue-specific networks (see also Supplementary Materials). First preliminary studies show that the use of UniHI Express can be highly efficient to prioritize interactions. The expression data were derived from Gene Expression Atlas and merged to a number of main tissue types to facilitate their utilization (20). UniHI Express represents a first step towards a dynamic representation of the human interactome.

CONCLUSIONS AND FUTURE DIRECTIONS

Human interaction maps are rapidly increasing in size and have proven to be highly valuable for the study of human health and disease. UniHI will continue to extend its scope by the incorporation of newly available PPI resources and to consolidate the frequently divergent data. In this context, we like to invite other data providers and researchers to participate in the UniHI project. Clearly, the wealth of interaction data poses new challenges in follow-up analysis. The new tools included in UniHI allow researchers a more rapid inspection and prioritization of extracted interactions. Tissue-specific networks can help to focus on biologically relevant interactions, whereas use of pathway information can give important hints about functional modules of interacting proteins. We hope that these and further extensions of UniHI will support scientists in the exploration and utilization of the human interactome.

Supplementary Data

Supplementary Data are available at NAR Online.

FUNDING

Deutsche Forschungsgemeinschaft (grant SFB 618-subproject A5). Funding for open access charge: SFB 618 grant of the Deutsche Forschungsmeinschaft (DFG). Conflict of interest statement. None declared.
  19 in total

1.  BIND--The Biomolecular Interaction Network Database.

Authors:  G D Bader; I Donaldson; C Wolting; B F Ouellette; T Pawson; C W Hogue
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  The Database of Interacting Proteins: 2004 update.

Authors:  Lukasz Salwinski; Christopher S Miller; Adam J Smith; Frank K Pettit; James U Bowie; David Eisenberg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Towards a proteome-scale map of the human protein-protein interaction network.

Authors:  Jean-François Rual; Kavitha Venkatesan; Tong Hao; Tomoko Hirozane-Kishikawa; Amélie Dricot; Ning Li; Gabriel F Berriz; Francis D Gibbons; Matija Dreze; Nono Ayivi-Guedehoussou; Niels Klitgord; Christophe Simon; Mike Boxem; Stuart Milstein; Jennifer Rosenberg; Debra S Goldberg; Lan V Zhang; Sharyl L Wong; Giovanni Franklin; Siming Li; Joanna S Albala; Janghoo Lim; Carlene Fraughton; Estelle Llamosas; Sebiha Cevik; Camille Bex; Philippe Lamesch; Robert S Sikorski; Jean Vandenhaute; Huda Y Zoghbi; Alex Smolyar; Stephanie Bosak; Reynaldo Sequerra; Lynn Doucette-Stamm; Michael E Cusick; David E Hill; Frederick P Roth; Marc Vidal
Journal:  Nature       Date:  2005-09-28       Impact factor: 49.962

4.  A human protein-protein interaction network: a resource for annotating the proteome.

Authors:  Ulrich Stelzl; Uwe Worm; Maciej Lalowski; Christian Haenig; Felix H Brembeck; Heike Goehler; Martin Stroedicke; Martina Zenkner; Anke Schoenherr; Susanne Koeppen; Jan Timm; Sascha Mintzlaff; Claudia Abraham; Nicole Bock; Silvia Kietzmann; Astrid Goedde; Engin Toksöz; Anja Droege; Sylvia Krobitsch; Bernhard Korn; Walter Birchmeier; Hans Lehrach; Erich E Wanker
Journal:  Cell       Date:  2005-09-23       Impact factor: 41.582

5.  Online predicted human interaction database.

Authors:  Kevin R Brown; Igor Jurisica
Journal:  Bioinformatics       Date:  2005-01-18       Impact factor: 6.937

6.  A protein interaction network links GIT1, an enhancer of huntingtin aggregation, to Huntington's disease.

Authors:  Heike Goehler; Maciej Lalowski; Ulrich Stelzl; Stephanie Waelter; Martin Stroedicke; Uwe Worm; Anja Droege; Katrin S Lindenberg; Maria Knoblich; Christian Haenig; Martin Herbst; Jaana Suopanki; Eberhard Scherzinger; Claudia Abraham; Bianca Bauer; Renate Hasenbank; Anja Fritzsche; Andreas H Ludewig; Konrad Büssow; Konrad Buessow; Sarah H Coleman; Claire-Anne Gutekunst; Bernhard G Landwehrmeyer; Hans Lehrach; Erich E Wanker
Journal:  Mol Cell       Date:  2004-09-24       Impact factor: 17.970

7.  A gene atlas of the mouse and human protein-encoding transcriptomes.

Authors:  Andrew I Su; Tim Wiltshire; Serge Batalov; Hilmar Lapp; Keith A Ching; David Block; Jie Zhang; Richard Soden; Mimi Hayakawa; Gabriel Kreiman; Michael P Cooke; John R Walker; John B Hogenesch
Journal:  Proc Natl Acad Sci U S A       Date:  2004-04-09       Impact factor: 11.205

8.  Reactome: a knowledgebase of biological pathways.

Authors:  G Joshi-Tope; M Gillespie; I Vastrik; P D'Eustachio; E Schmidt; B de Bono; B Jassal; G R Gopinath; G R Wu; L Matthews; S Lewis; E Birney; L Stein
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

9.  Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome.

Authors:  Arun K Ramani; Razvan C Bunescu; Raymond J Mooney; Edward M Marcotte
Journal:  Genome Biol       Date:  2005-04-15       Impact factor: 13.583

10.  A first-draft human protein-interaction map.

Authors:  Ben Lehner; Andrew G Fraser
Journal:  Genome Biol       Date:  2004-08-13       Impact factor: 13.583

View more
  30 in total

1.  Interaction databases on the same page.

Authors:  Andrei L Turinsky; Sabry Razick; Brian Turner; Ian M Donaldson; Shoshana J Wodak
Journal:  Nat Biotechnol       Date:  2011-05       Impact factor: 54.908

2.  Characterizing the diversity and biological relevance of the MLPCN assay manifold and screening set.

Authors:  Jintao Zhang; Gerald H Lushington; Jun Huan
Journal:  J Chem Inf Model       Date:  2011-05-13       Impact factor: 4.956

Review 3.  Noise in cellular signaling pathways: causes and effects.

Authors:  John E Ladbury; Stefan T Arold
Journal:  Trends Biochem Sci       Date:  2012-02-15       Impact factor: 13.807

Review 4.  Molecular networks in Network Medicine: Development and applications.

Authors:  Edwin K Silverman; Harald H H W Schmidt; Eleni Anastasiadou; Lucia Altucci; Marco Angelini; Lina Badimon; Jean-Luc Balligand; Giuditta Benincasa; Giovambattista Capasso; Federica Conte; Antonella Di Costanzo; Lorenzo Farina; Giulia Fiscon; Laurent Gatto; Michele Gentili; Joseph Loscalzo; Cinzia Marchese; Claudio Napoli; Paola Paci; Manuela Petti; John Quackenbush; Paolo Tieri; Davide Viggiano; Gemma Vilahur; Kimberly Glass; Jan Baumbach
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2020-04-19

5.  Re-evaluation of the role of calcium homeostasis endoplasmic reticulum protein (CHERP) in cellular calcium signaling.

Authors:  Yaping Lin-Moshier; Peter J Sebastian; Leeann Higgins; Natalie D Sampson; Jane E Hewitt; Jonathan S Marchant
Journal:  J Biol Chem       Date:  2012-11-12       Impact factor: 5.157

6.  Comparison and consolidation of microarray data sets of human tissue expression.

Authors:  Jenny Russ; Matthias E Futschik
Journal:  BMC Genomics       Date:  2010-05-14       Impact factor: 3.969

7.  Gene regulatory network reveals oxidative stress as the underlying molecular mechanism of type 2 diabetes and hypertension.

Authors:  Jesmin Jesmin; Mahbubur Sm Rashid; Hasan Jamil; Raquel Hontecillas; Josep Bassaganya-Riera
Journal:  BMC Med Genomics       Date:  2010-10-13       Impact factor: 3.063

8.  iRefWeb: interactive analysis of consolidated protein interaction data and their supporting evidence.

Authors:  Brian Turner; Sabry Razick; Andrei L Turinsky; James Vlasblom; Edgard K Crowdy; Emerson Cho; Kyle Morrison; Ian M Donaldson; Shoshana J Wodak
Journal:  Database (Oxford)       Date:  2010-10-12       Impact factor: 3.451

Review 9.  Role for protein-protein interaction databases in human genetics.

Authors:  Kristine A Pattin; Jason H Moore
Journal:  Expert Rev Proteomics       Date:  2009-12       Impact factor: 3.940

10.  DASMIweb: online integration, analysis and assessment of distributed protein interaction data.

Authors:  Hagen Blankenburg; Fidel Ramírez; Joachim Büch; Mario Albrecht
Journal:  Nucleic Acids Res       Date:  2009-06-05       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.