Literature DB >> 23193266

The TissueNet database of human tissue protein-protein interactions.

Ruth Barshir1, Omer Basha, Amir Eluk, Ilan Y Smoly, Alexander Lan, Esti Yeger-Lotem.   

Abstract

Knowledge of protein-protein interactions (PPIs) is important for identifying the functions of proteins and the processes they are involved in. Although data of human PPIs are easily accessible through several public databases, these databases do not specify the human tissues in which these PPIs take place. The TissueNet database of human tissue PPIs (http://netbio.bgu.ac.il/tissuenet/) associates each interaction with human tissues that express both pair mates. This was achieved by integrating current data of experimentally detected PPIs with extensive data of gene and protein expression across 16 main human tissues. Users can query TissueNet using a protein and retrieve its PPI partners per tissue, or using a PPI and retrieve the tissues expressing both pair mates. The graphical representation of the output highlights tissue-specific and tissue-wide PPIs. Thus, TissueNet provides a unique platform for assessing the roles of human proteins and their interactions across tissues.

Entities:  

Mesh:

Year:  2012        PMID: 23193266      PMCID: PMC3531115          DOI: 10.1093/nar/gks1198

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Proteins act through interactions with other molecules, and knowledge of these interactions can help identify the functions of proteins and their involvement in various processes during health and disease (1,2). Owing to their importance, many experimental methods were developed and applied to detect physical protein–protein interactions (PPIs). These methods include in vivo assays such as affinity-based methods, as well as high-throughput in vitro assays such as protein arrays (3). The various types of PPIs have been recorded in several PPI databases that serve a large community of researchers interested in revealing the roles of proteins in various organisms [e.g. (4–7)]. Here, we focus on the human PPIs. Unlike unicellular organisms such as yeast, the human body is composed of many tissues, each expressing a distinct set of proteins [e.g. (8,9)]. Consequently, many proteins have varying PPI partners across tissues (10,11). A tissue-sensitive view of PPIs is therefore important for assessing the various roles of human proteins. However, this view is not readily available in current databases: PPI databases do not hold tissue information, potentially because many PPIs were detected using in vitro assays, regardless of specific tissues. Likewise, protein-centric databases report the expression of a protein across tissues, but do not associate its PPIs with a specific tissue [e.g. (12,13)]. TissueNet was designed to provide a tissue-sensitive view of human PPIs. To this end, we integrated current data of human PPIs from four major PPI databases with data of tissue expression of genes and proteins. The tissue expression data were assembled from three extensive resources, each based on a different profiling technique, and resulted in mapping of proteins to 16 main human tissues. We then associated each PPI with the subset of tissues that express both pair mates, thus leaving out tissues that are less likely to contain the PPI. Users can query TissueNet using a protein and retrieve its PPI partners per tissue, or using a PPI and retrieve the tissues expressing both pair mates. The graphical network output distinctly highlights tissue-specific and tissue-wide proteins and PPIs. TissueNet also provides auxiliary information regarding proteins and interactions, such as protein expression levels across tissues, PPI detection methods and gene ontology (GO) annotations. We believe that TissueNet will be of great help to researchers interested in the various roles of human protein and interactions across tissues.

METHODS

Expression data sources

Data of mRNA levels across tissues, denoted GNF (8), were downloaded from BioGPS (14), and all genes with intensity value >100 in a tissue were considered as expressed (15). Data of protein expression across tissues, denoted Human Protein Atlas (HPA) (16), were limited to proteins with a positive expression value (ranked: 1, 2, 3, 4) in a tissue. Proteins were further filtered by imposing stringent thresholds on the reliability and validity of their antibodies. Specifically, when antibody-reliability score was available, a medium or high was required; otherwise at least one supportive and no negative validity scores were required. RNA-seq data from Illumina Body Map 2.0 (17) were filtered for genes with at least 1 RPKM. All analyses were limited to proteins and protein-coding genes only, and these were mapped to their Ensembl gene ids using BioMart (18). Supplementary Table S1 presents an overview of the objects and tissues measured in each resource.

Consolidation of expression and tissue data

As RNA-seq data covered the largest number of genes per tissue, we based our analysis on the 16 main human tissues profiled with RNA-seq. As GNF and HPA profiled sub-parts of these tissues, we manually consolidated the various tissue sub-parts according to the consolidation scheme given in Supplementary Table S2.

RESULTS

TissueNet synergizes between large-scale data of PPI and expression profiling across tissues to create a unique database of tissue-associated PPIs. Below, we describe the construction of TissueNet and its usage.

TissueNet construction

In TissueNet, a PPI is associated with the tissues expressing both pair mates. This was achieved by integrating recent data of PPIs with extensive data of gene and protein expression across 16 main human tissues. Specifically, experimentally detected human PPIs were assembled for four major PPI databases, including BIOGRID (4), DIP (5), IntAct (6) and MINT (7). These data amounted to 67 439 PPIs between 11 225 human proteins. Data of gene and protein expression across tissues were assembled from three major resources: the GNF data set of Su et al. (8) based on profiling using DNA microarrays, the HPA based on protein immunehistochemistry measurements (16) and the Illumina Body Map 2.0 based on RNA-seq measurements (17). These data were subject to stringent thresholds (see Methods). As RNA-seq identified the largest number of genes per tissue (9,11), we focused on the 16 main human tissues profiled by RNA-seq. The 16 tissues were as follows: adipose, adrenal, brain, breast, colon, heart, kidney, liver, lung, lymph node, ovary, prostate, skeletal muscle, testis, thyroid and white blood cells. We then combined data of these tissues from the three resources as follows: We associated a PPI with a tissue if each pair mate was found to be expressed in that tissue according to at least one resource. For tissues consisting of multiple sub-parts, such as brain, we associated a PPI with the tissue only if the two pair mates were detected in similar or closely related sub-parts of that tissue (see Methods). Consequently, 59 640 PPIs were associated with at least 1 of the 16 tissues.

TissueNet usage

Users query TissueNet using a protein and retrieve its PPI partners per tissue, or using a PPI and retrieve the tissues expressing both pair mates. The output of TissueNet includes a network view using a Cytoscape Web plug-in (19), as well as a textual output and a context menu for consecutive user queries. Below, we describe the input, output and context menu in more detail.

Querying TissueNet

TissueNet supports protein and PPI queries that use Ensmbel gene ID, Entrez gene ID or wiki names. The full lists of proteins that are present in TissueNet are accessible from TissueNet homepage. In addition to the protein and PPI queries, TissueNet also offers a ‘sample protein’ query that retrieves the tissue PPIs of the human protein TP53 and a ‘random protein’ query that retrieves the tissue PPIs of a randomly selected protein from TissueNet.

TissueNet output

TissueNet output includes a graphical network view and textual information of the output proteins and PPIs (Figure 1). The graphical network view depicts proteins as network nodes and PPIs as network edges while highlighting their tissue specificity: tissue-specific proteins, which are expressed in at most three tissues, are coloured orange, and tissue-wide proteins, which are expressed in 14–16 tissues, are coloured blue. Other proteins are coloured grey. The width of a PPI edge increases with the number of tissues associated with it.
Figure 1.

TisseuNet presentation of the PPIs of the human protein alpha-synuclein (SNCA) in brain. The network view on the left shows SNCA as the diamond-shaped node in the middle, circled by its PPI partners in brain. The network view distinguishes between partners expressed in ≥14 tissues (coloured blue), partners expressed in at most three tissues (SNCB, coloured orange) and other partners (coloured grey). The Tissues tab on the right shows the measured expression levels in brain of SNCA’s PPI partners.

TisseuNet presentation of the PPIs of the human protein alpha-synuclein (SNCA) in brain. The network view on the left shows SNCA as the diamond-shaped node in the middle, circled by its PPI partners in brain. The network view distinguishes between partners expressed in ≥14 tissues (coloured blue), partners expressed in at most three tissues (SNCB, coloured orange) and other partners (coloured grey). The Tissues tab on the right shows the measured expression levels in brain of SNCA’s PPI partners. The textual information is divided into tabs, each relating to a different type of data regarding the output. The ‘Properties’ tab lists the tissues associated with each protein or PPI and also specifies the detection methods and source databases for each PPI. The ‘Gene ontology’ (GO) tab provides the GO annotations of proteins selected in the network view (20). The ‘Chemicals’ tab lists chemicals known to interact with selected proteins, which might suggest candidates for experimental manipulation (21). Lastly, the ‘Tissues’ tab lists per tissue the subset of proteins that are expressed in that tissue, along with the reporting expression resource and measured expression value. Thus, the various tabs enable users to assess the reliability of PPIs and of tissue associations, and to learn more about the selected proteins and PPIs.

The context menu

The context menu is activated by a right mouse click over the output network view. This menu provides an interface for several user options, including changing the graph layout, exporting the output, removing proteins from the network and adding the PPIs of selected proteins to the current network view, thereby gradually expanding the network.

DISCUSSION

The TissueNet database of human tissue PPIs at http://netbio.bgu.ac.il/tissuenet/ offers a unique service that associates experimentally detected PPIs with tissues that express both pair mates. To construct TissueNet, we gathered tissue-profiling data from three large resources, each using a different technique to measure expression within a tissue. By applying a stringent threshold for calling expression within a resource and uniting these data, we created a broad, high-confidence data set of gene and protein expression across 16 main human tissues. The integration of these data with data of PPIs resulted in tissue associations for 88% of the PPIs. Although co-expression in a tissue is a necessary condition for a PPI to occur, it is important to clarify that it does not validate the PPI within a tissue. PPIs are probabilistic events that depend on additional parameters, such as the cellular localization, post-translational modifications and concentrations of the interacting proteins, which are yet unavailable for most PPIs. For these reasons, the common practice in the field of PPI mapping is to consider experimentally identified PPIs as potential PPIs [e.g. (2)]. TissueNet further limits these potential PPIs by tissue co-expression, thereby filtering out PPIs from tissues that do not express both pair mates. To judge the filtering power of TissueNet, we calculated the fraction of mapped PPI tissue associations as follows: we summed the number of different tissue associations per PPI across all PPIs (729 675 PPI tissue associations). We then divided it by the total number of potential tissue associations for all PPI, i.e. the total number of PPIs multiplied by 16 tissues (1 079 024 PPI tissue associations). This resulted in 67.6%, meaning that TissueNet filtered out a considerable 32.4% of the potential PPI tissue associations. To help users further assess the reliability of specific PPIs, TissueNet provides for each PPI its detecting methods and the tissue expression levels of its proteins. To summarize, TissueNet is a novel platform for obtaining a tissue-sensitive view of human PPIs that can greatly facilitate the functional assessment of human proteins and their PPIs across tissues.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Tables 1 and 2.

FUNDING

Funding for open access charge: The European Union Seventh Programme under the FP7-PEOPLE-MCA-IRG funding scheme [256360 to E.Y.-L.]. Conflict of interest statement. None declared.
  21 in total

1.  The Database of Interacting Proteins: 2004 update.

Authors:  Lukasz Salwinski; Christopher S Miller; Adam J Smith; Frank K Pettit; James U Bowie; David Eisenberg
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification.

Authors:  Itai Yanai; Hila Benjamin; Michael Shmoish; Vered Chalifa-Caspi; Maxim Shklar; Ron Ophir; Arren Bar-Even; Shirley Horn-Saban; Marilyn Safran; Eytan Domany; Doron Lancet; Orit Shmueli
Journal:  Bioinformatics       Date:  2004-09-23       Impact factor: 6.937

Review 3.  A network view of disease and compound screening.

Authors:  Eric E Schadt; Stephen H Friend; David A Shaywitz
Journal:  Nat Rev Drug Discov       Date:  2009-04       Impact factor: 84.694

Review 4.  A genecentric Human Protein Atlas for expression profiles based on antibodies.

Authors:  Lisa Berglund; Erik Björling; Per Oksvold; Linn Fagerberg; Anna Asplund; Cristina Al-Khalili Szigyarto; Anja Persson; Jenny Ottosson; Henrik Wernérus; Peter Nilsson; Emma Lundberg; Asa Sivertsson; Sanjay Navani; Kenneth Wester; Caroline Kampf; Sophia Hober; Fredrik Pontén; Mathias Uhlén
Journal:  Mol Cell Proteomics       Date:  2008-10       Impact factor: 5.911

5.  GeneCards Version 3: the human gene integrator.

Authors:  Marilyn Safran; Irina Dalah; Justin Alexander; Naomi Rosen; Tsippi Iny Stein; Michael Shmoish; Noam Nativ; Iris Bahir; Tirza Doniger; Hagit Krug; Alexandra Sirota-Madi; Tsviya Olender; Yaron Golan; Gil Stelzer; Arye Harel; Doron Lancet
Journal:  Database (Oxford)       Date:  2010-08-05       Impact factor: 3.451

6.  Cytoscape Web: an interactive web-based network browser.

Authors:  Christian T Lopes; Max Franz; Farzana Kazi; Sylva L Donaldson; Quaid Morris; Gary D Bader
Journal:  Bioinformatics       Date:  2010-07-23       Impact factor: 6.937

7.  Alternative splicing of RNA triplets is often regulated and accelerates proteome evolution.

Authors:  Robert K Bradley; Jason Merkin; Nicole J Lambert; Christopher B Burge
Journal:  PLoS Biol       Date:  2012-01-03       Impact factor: 8.029

8.  neXtProt: a knowledge platform for human proteins.

Authors:  Lydie Lane; Ghislaine Argoud-Puy; Aurore Britan; Isabelle Cusin; Paula D Duek; Olivier Evalet; Alain Gateau; Pascale Gaudet; Anne Gleizes; Alexandre Masselot; Catherine Zwahlen; Amos Bairoch
Journal:  Nucleic Acids Res       Date:  2011-12-01       Impact factor: 16.971

9.  BioMart Central Portal: an open database network for the biological community.

Authors:  Jonathan M Guberman; J Ai; O Arnaiz; Joachim Baran; Andrew Blake; Richard Baldock; Claude Chelala; David Croft; Anthony Cros; Rosalind J Cutts; A Di Génova; Simon Forbes; T Fujisawa; E Gadaleta; D M Goodstein; Gunes Gundem; Bernard Haggarty; Syed Haider; Matthew Hall; Todd Harris; Robin Haw; S Hu; Simon Hubbard; Jack Hsu; Vivek Iyer; Philip Jones; Toshiaki Katayama; R Kinsella; Lei Kong; Daniel Lawson; Yong Liang; Nuria Lopez-Bigas; J Luo; Michael Lush; Jeremy Mason; Francois Moreews; Nelson Ndegwa; Darren Oakley; Christian Perez-Llamas; Michael Primig; Elena Rivkin; S Rosanoff; Rebecca Shepherd; Reinhard Simon; B Skarnes; Damian Smedley; Linda Sperling; William Spooner; Peter Stevenson; Kevin Stone; J Teague; Jun Wang; Jianxin Wang; Brett Whitty; D T Wong; Marie Wong-Erasmus; L Yao; Ken Youens-Clark; Christina Yung; Junjun Zhang; Arek Kasprzyk
Journal:  Database (Oxford)       Date:  2011-09-18       Impact factor: 3.451

10.  BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources.

Authors:  Chunlei Wu; Camilo Orozco; Jason Boyer; Marc Leglise; James Goodale; Serge Batalov; Christopher L Hodge; James Haase; Jeff Janes; Jon W Huss; Andrew I Su
Journal:  Genome Biol       Date:  2009-11-17       Impact factor: 13.583

View more
  30 in total

Review 1.  Network.assisted analysis to prioritize GWAS results: principles, methods and perspectives.

Authors:  Peilin Jia; Zhongming Zhao
Journal:  Hum Genet       Date:  2014-02       Impact factor: 4.132

2.  In silico prediction of physical protein interactions and characterization of interactome orphans.

Authors:  Max Kotlyar; Chiara Pastrello; Flavia Pivetta; Alessandra Lo Sardo; Christian Cumbaa; Han Li; Taline Naranian; Yun Niu; Zhiyong Ding; Fatemeh Vafaee; Fiona Broackes-Carter; Julia Petschnigg; Gordon B Mills; Andrea Jurisicova; Igor Stagljar; Roberta Maestro; Igor Jurisica
Journal:  Nat Methods       Date:  2014-11-17       Impact factor: 28.547

3.  Identification of shared and unique susceptibility pathways among cancers of the lung, breast, and prostate from genome-wide association studies and tissue-specific protein interactions.

Authors:  David C Qian; Jinyoung Byun; Younghun Han; Casey S Greene; John K Field; Rayjean J Hung; Yonathan Brhane; John R Mclaughlin; Gordon Fehringer; Maria Teresa Landi; Albert Rosenberger; Heike Bickeböller; Jyoti Malhotra; Angela Risch; Joachim Heinrich; David J Hunter; Brian E Henderson; Christopher A Haiman; Fredrick R Schumacher; Rosalind A Eeles; Douglas F Easton; Daniela Seminara; Christopher I Amos
Journal:  Hum Mol Genet       Date:  2015-10-19       Impact factor: 6.150

4.  Analyzing the Role of MicroRNAs in Schizophrenia in the Context of Common Genetic Risk Variants.

Authors:  Mads Engel Hauberg; Panos Roussos; Jakob Grove; Anders Dupont Børglum; Manuel Mattheisen
Journal:  JAMA Psychiatry       Date:  2016-04       Impact factor: 21.596

5.  The DifferentialNet database of differential protein-protein interactions in human tissues.

Authors:  Omer Basha; Rotem Shpringer; Chanan M Argov; Esti Yeger-Lotem
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

6.  GWAS and network analysis of co-occurring nicotine and alcohol dependence identifies significantly associated alleles and network.

Authors:  Bo Xiang; Bao-Zhu Yang; Hang Zhou; Henry Kranzler; Joel Gelernter
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2018-11-28       Impact factor: 3.568

Review 7.  Molecular network analysis enhances understanding of the biology of mental disorders.

Authors:  Kay S Grennan; Chao Chen; Elliot S Gershon; Chunyu Liu
Journal:  Bioessays       Date:  2014-04-14       Impact factor: 4.345

8.  A context-sensitive framework for the analysis of human signalling pathways in molecular interaction networks.

Authors:  Alexander Lan; Michal Ziv-Ukelson; Esti Yeger-Lotem
Journal:  Bioinformatics       Date:  2013-07-01       Impact factor: 6.937

Review 9.  Applications and analytical tools of cell communication based on ligand-receptor interactions at single cell level.

Authors:  Fen Ma; Siwei Zhang; Lianhao Song; Bozhi Wang; Lanlan Wei; Fengmin Zhang
Journal:  Cell Biosci       Date:  2021-07-03       Impact factor: 7.133

10.  Cataloging the biomedical world of pain through semi-automated curation of molecular interactions.

Authors:  Daniel G Jamieson; Phoebe M Roberts; David L Robertson; Ben Sidders; Goran Nenadic
Journal:  Database (Oxford)       Date:  2013-05-23       Impact factor: 3.451

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.