Literature DB >> 32296183

A reference map of the human binary protein interactome.

Katja Luck1,2,3, Dae-Kyum Kim1,4,5,6, Luke Lambourne1,2,3, Kerstin Spirohn1,2,3, Bridget E Begg1,2,3, Wenting Bian1,2,3, Ruth Brignall1,2,3, Tiziana Cafarelli1,2,3, Francisco J Campos-Laborie7,8, Benoit Charloteaux1,2,3, Dongsic Choi9, Atina G Coté1,4,5,6, Meaghan Daley1,2,3, Steven Deimling10, Alice Desbuleux1,2,3,11, Amélie Dricot1,2,3, Marinella Gebbia1,4,5,6, Madeleine F Hardy1,2,3, Nishka Kishore1,4,5,6, Jennifer J Knapp1,4,5,6, István A Kovács1,12,13, Irma Lemmens14,15, Miles W Mee4,5,16, Joseph C Mellor1,4,5,6,17, Carl Pollis1,2,3, Carles Pons18, Aaron D Richardson1,2,3, Sadie Schlabach1,2,3, Bridget Teeking1,2,3, Anupama Yadav1,2,3, Mariana Babor1,4,5,6, Dawit Balcha1,2,3, Omer Basha19,20, Christian Bowman-Colin2,3, Suet-Feung Chin21, Soon Gang Choi1,2,3, Claudia Colabella22,23, Georges Coppin1,2,3,11, Cassandra D'Amata10, David De Ridder1,2,3, Steffi De Rouck14,15, Miquel Duran-Frigola18, Hanane Ennajdaoui1,4,5,6, Florian Goebels4,5,16, Liana Goehring2,3, Anjali Gopal1,4,5,6, Ghazal Haddad1,4,5,6, Elodie Hatchi2,3, Mohamed Helmy4,5,16, Yves Jacob24,25, Yoseph Kassa1,2,3, Serena Landini2,3, Roujia Li1,4,5,6, Natascha van Lieshout1,4,5,6, Andrew MacWilliams1,2,3, Dylan Markey1,2,3, Joseph N Paulson26,27,28, Sudharshan Rangarajan1,2,3, John Rasla1,2,3, Ashyad Rayhan1,4,5,6, Thomas Rolland1,2,3, Adriana San-Miguel1,2,3, Yun Shen1,2,3, Dayag Sheykhkarimli1,4,5,6, Gloria M Sheynkman1,2,3, Eyal Simonovsky19,20, Murat Taşan1,4,5,6,16, Alexander Tejeda1,2,3, Vincent Tropepe10, Jean-Claude Twizere11, Yang Wang1,2,3, Robert J Weatheritt4, Jochen Weile1,4,5,6,16, Yu Xia1,29, Xinping Yang1,2,3, Esti Yeger-Lotem19,20, Quan Zhong1,2,3,30, Patrick Aloy18,31, Gary D Bader4,5,16, Javier De Las Rivas7,8, Suzanne Gaudet1,2,3, Tong Hao1,2,3, Janusz Rak9, Jan Tavernier14,15, David E Hill32,33,34, Marc Vidal35,36, Frederick P Roth37,38,39,40,41,42, Michael A Calderwood43,44,45.   

Abstract

Global insights into cellular organization and genome function require comprehensive understanding of the interactome networks that mediate genotype-phenotype relationships1,2. Here we present a human 'all-by-all' reference interactome map of human binary protein interactions, or 'HuRI'. With approximately 53,000 protein-protein interactions, HuRI has approximately four times as many such interactions as there are high-quality curated interactions from small-scale studies. The integration of HuRI with genome3, transcriptome4 and proteome5 data enables cellular function to be studied within most physiological or pathological cellular contexts. We demonstrate the utility of HuRI in identifying the specific subcellular roles of protein-protein interactions. Inferred tissue-specific networks reveal general principles for the formation of cellular context-specific functions and elucidate potential molecular mechanisms that might underlie tissue-specific phenotypes of Mendelian diseases. HuRI is a systematic proteome-wide reference that links genomic variation to phenotypic outcomes.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32296183      PMCID: PMC7169983          DOI: 10.1038/s41586-020-2188-x

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


The reference human genome sequence has enabled systematic study of genetic[6] and expression[4] variability at the organism[6], tissue[4], cell type[7] and single cell level[8]. Despite advances in sequencing genomes, transcriptomes, and proteomes, we still understand little about the cellular mechanisms that mediate phenotypic and tissue or cell type variability. A mechanistic understanding of cellular function and organization emerges from studying how genes and their products, primarily proteins, interact with each other, forming a dynamic interactome that drives biological function. Analogous to the reference human genome sequence[9,10], a reference map of the human protein interactome, generated systematically and comprehensively, is needed to provide a scaffold for the unbiased proteome-wide study of biological mechanisms, generally and within specific cellular contexts. It remains infeasible to assemble a reference interactome map by systematically identifying endogenous PPIs in thousands of physiological and pathological cellular contexts[11,12]. However, systematic affinity purification of exogenously expressed bait proteins in immortalized-cell contexts[13] as well as binary PPI detection assays in cell models[2,14] have generated biophysical human protein interactome maps of demonstrated high functional relevance. Specifically, yeast two-hybrid (Y2H) represents the only binary PPI assay that can be operated at sufficient throughput to systematically screen the human proteome for binary PPIs. Using Y2H followed by validation in orthogonal assays, we previously generated HI-II-14 consisting of ~14,000 PPIs involving 4,000 proteins from screening ~40% of the genome-by-genome search space[2]. In contrast to curated interactions from hypothesis-driven small-scale studies and protein complex interactome maps favoring highly expressed proteins, HI-II-14 covers the proteome more uniformly and is relatively free from ascertainment and expression biases. To increase interactome coverage and generate a reference map of human binary PPIs, we expanded the ORFeome collection to encompass ~90% of the protein-coding genome. We screened this search space a total of nine times with a panel of three Y2H assay versions (Fig. 1a, b). The resulting PPI map quadruples the number of identified PPIs overall, and upon integration with genome, transcriptome and proteome resources enables biological discovery across most cellular contexts, offering a reference map of the binary human protein interactome.
Fig. 1 |

Generation of a reference interactome map using a panel of binary assays.

a, Overview of HuRI generation. b, Schematic of the Y2H assay versions. c, Experimental validation. Lit-BM: literature-curated binary PPIs with multiple evidence; RRS: random protein pairs. Error bars are 68.3% Bayesian confidence interval, n = 2,281, 383, 475 (MAPPIT) 1,639, 382, 465 (GPCA). d, Number of PPIs and proteins, detected with each additional screen. e, Fraction of direct contact pairs among five PPI networks. Error bar is standard error of proportion, n = 121, 410, 1,169, 584, 1,211 PPIs. f, Number of PPIs identified over time from screening at CCSB and Lit-BM.

Generation and characterization of HuRI

The newly established human ORFeome v9.1 covers 17,408 protein-coding genes (Extended Data Fig. 1a, b, Supplementary Table 1, 2), forming a search space (Space III) that encompasses over 150 million pairwise combinations (Fig. 1a). This search space more than doubles the space screened to generate HI-II-14 and represents the most comprehensive search space systematically screened for human PPIs. Limitations in PPI assay sensitivity can be overcome by using different assays[15,16] or different versions of the same assay[17,18]. To maximize sensitivity we used three Y2H assay versions (Fig. 1b, Supplementary Table 3, 4) that showed good sensitivity and low false positive rates when benchmarked against gold-standard positive and random reference sets (PRSv1 and RRSv1, respectively)[15], while detecting complementary PPI sets (Extended Data Fig. 1c, d, Supplementary Table 5). We assessed assay performance using a test space of ~2,000 by ~2,000 human genes[2] (Extended Data Fig. 1e, Supplementary Table 6). The Y2H versions were complementary, in that three screens for each of three versions doubled the number of detected PPIs and proteins relative to nine screens using a single version (Extended Data Fig. 1f, Supplementary Table 7).
Extended Data Fig. 1 |

Y2H assay development and validation of HuRI.

a, Number of protein-coding genes in hORFeome v9.1 and GTEx, FANTOM, and HPA transcriptome projects. The number of genes in hORFeome v9.1 is on par with the number of genes found to be expressed in three comprehensive individual transcriptome sequencing studies and includes 94% of the genes with robust evidence of expression in all three. b, Overlap between hORFeome v9.1 and intersection of transcriptomes in a. c, Individual and combined recovery of PRSv1 and RRSv1 pairs by Y2H assay versions (n = 252, 270). d, Colored squares showing which protein pairs were detected in PRSv1 (left) and RRSv1 (right) by Y2H assay versions. e, Recovery rates of Lit-BM and PPIs from screens of a 2k-by-2k gene test space per Y2H assay version in MAPPIT. f, Cumulative PPI count performing three screens with each Y2H assay version in the test space compared to nine screens with Y2H assay version 1. g, h, MAPPIT and GPCA recovery of Lit-BM and PPIs from screens of Space III when split by screen at a RRS rate of 1% (g) or across a range of thresholds (h). All error bars, in c, e, g, are 68.3% Bayesian confidence interval, shaded error band in h is standard error of proportion and n = between 101 and 395 pairs successfully tested for each category. i, Number of proteins in HuRI, detected with each additional screen.

To map the reference interactome, we performed nine screens of Space III, followed by pairwise verification by quadruplicate retesting and sequence confirmation. PPIs verified by two orthogonal binary PPI assays, MAPPIT[19] and GPCA[20], were recovered at rates on par with high-confidence binary PPIs from the literature (each having ≥2 pieces of experimental evidence, with at least one from a binary assay type; Lit-BM) over a large range of score thresholds (Fig. 1c, Extended Data Fig. 1g, h, Supplementary Table 8). Each additional screen identified novel PPIs and proteins, with the largest gains obtained by switching assay versions (Fig. 1d, Extended Data Fig. 1i). The dataset, versioned HI-III-20 (Human Interactome obtained from screening Space III, published in 2020), contains 52,569 verified PPIs involving 8,275 proteins (Supplementary Table 9). Although our knowledge of the interactome remains incomplete, we refer to HI-III-20 as a reference map of the human binary protein interactome (HuRI) given its systematic nature, extensive coverage, and scale. Molecular mechanisms can be more readily inferred from direct than indirect PPIs, yet the fraction of PPIs reported in various human protein interactome maps that are direct remains unknown. Structures from protein complexes with at least three subunits[21] show that, within those complexes, more PPIs in HuRI correspond to direct biophysical contacts than do PPIs from Lit-BM (90% vs. 81%, P = 0.019, two-sided Fisher’s exact test, n = 121 and 410 for HuRI and Lit-BM, respectively) or from protein complex interactome maps (< 50%, P < 0.001 for all tested maps, two-sided Fisher’s exact test; Fig. 1e, Supplementary Table 10). Combining HuRI with all previously published systematic screening efforts at CCSB yields 64,006 binary PPIs involving 9,094 proteins (HI-union; Supplementary Table 11), which is approximately five-fold more PPIs than the entirety of high-quality binary PPIs curated from the literature (Fig. 1f, Extended Data Fig. 2, Supplementary Table 12–14). The union of Lit-BM and HI-union represents the most complete collection of high-quality direct PPI data available to date (http://interactome-atlas.org).
Extended Data Fig. 2 |

Definition of literature-curated PPI datasets.

a, Categorization of literature-curated PPIs into distinct subsets based on the experimental methods in which they were detected and the number of pieces of experimental evidence. b-e, Results of testing the different categories of literature-curated pairs in Y2H (b, d) and MAPPIT (c, e) where the pairs have been further divided into HT - high throughput and LT - low throughput subsets (b, c). BM: binary multiple; BS: binary singleton; NB: non-binary. Between n = 191–471 successfully tested PPIs for each category.

Complementarity of the three Y2H versions might stem from steric constraints that differ between protein fusions used in the assays. Integrating HuRI with structures of PPIs[21], we observed reduced sensitivity of Y2H assays where the interaction interface was close (< 20Å) to whichever terminus was fused to the Gal4 activation domain (AD; Extended Data Fig. 3a, b, Supplementary Table 15). PPIs found in multiple screens had larger interaction interfaces (P = 0.03, two-sided permutation test, n = 234) and corresponded more often to direct PPIs within, rather than between, protein complexes (P = 3 × 10−18, two-sided Fisher’s exact test, n = 1,817), however, HuRI PPIs found in a single screen were observed to have precision as high as those found in multiple screens (Extended Data Fig. 3c–g, 4, Supplementary Table 16–18, Supplementary Note 1). These results reinforce previous observations[12,22] that the protein interactome might be dominated by weaker and more transient PPIs that are difficult to detect, as indicated by the fact that the majority of PPIs in HuRI were found in only one screen (Extended Data Fig. 3h, i). This is reflected in an increased estimate of the binary protein interactome size (Extended Data Fig. 3j), so that HuRI is estimated to represent 2–11% of the binary protein interactome[23] (Supplementary Note 2).
Extended Data Fig. 3 |

Stericity and interaction strength contribute to PPI detectability.

a, b, Fraction of PPIs with N or C-terminus < 10 Å (a) or 20 Å (b) to PPI interface, for PPIs with known structure in and not in HuRI (n = 37–1,891 PPIs). Error bars are standard error of proportion. The structure of UBE2D3 bound to RNF115 illustrates an example of a PPI found only by Y2H assay version 3 (PDB code: 5ulh). c, MAPPIT recovery rates of HuRI and Lit-BM PPIs that were also detected in HuRI by the number of screens each pair was detected in. Error bars are 68.3% Bayesian confidence interval (n = 22–793 PPIs successfully tested in each category). d, MAPPIT recovery rates of Lit-BM PPIs that were also detected in HuRI, for increasing number of pieces of experimental evidence per PPI. Error bars are 68.3% Bayesian confidence interval (n = 24–61 PPIs successfully tested in each category). e-f, Distributions of interaction interface area (e) or number of atomic contacts (f) by the number of HuRI screens in which a PPI is detected, with boxplots showing median, interquartile range (IQR), and 1.5 × IQR (with outliers), n = 1004 PPIs. g, Examples of within-complex interactions detected in HuRI (purple) and BioPlex (orange). Fraction of HuRI PPIs between proteins of protein complexes that link proteins of the same complex, split by PPIs found in single and multiple screens (dark purple). Error bars are standard error of proportion, n = 1,042 and 775 PPIs. h, Number of screens each PPI in HuRI was detected in, split by Y2H assay version. i, Number of Y2H assay versions each PPI in HuRI was detected in. j, Estimates of the size of the total binary protein interactome and the fraction covered by HuRI, as a function of the minimum number of publications per gene and the minimum number of evidence for the Lit-BM reference. Error bands are 68.3% Bayesian confidence interval, n ≥ 170 Lit-BM PPIs.

Extended Data Fig. 4 |

HuRI provides direct contact information for proteins in complexes.

Intra-complex PPIs are shown for protein complexes from CORUM as found in BioPlex (orange) or HuRI (purple). HuRI PPIs are further distinguished into PPIs found in single (light purple) or multiple screens (dark purple).

Functional relationships in HuRI

Based on the observation that HuRI is enriched in direct PPIs, we also hypothesize that proteins in HuRI with similar interaction interfaces tend to share interaction partners. For example, retinoic acid receptors RXR-γ and -β (Extended Data Fig. 5a, left panel) share previously reported interaction partners involving binding to retinoic acid receptor RAR types[24] and oxysterol receptors NR1 group H types[25]. We derived a profile similarity network (PSN) from HuRI (Supplementary Table 19), and found that the number of pairs of proteins in HuRI with similar interaction profiles is higher than random (P < 0.01, one-sided empirical test; Extended Data Fig. 5b) and proteins of overall higher sequence identity tend to exhibit higher interaction profile similarities (P < 0.01, one-sided empirical test; Extended Data Fig. 5c). We also found that proteins with a tendency to share interaction partners often have interaction interfaces that are similar, as opposed to complementary, and therefore tend not to interact with one another, except where proteins originate from a common ancestor that could self-interact[26] (Extended Data Fig. 5d). Indeed, only 5% of the protein pairs found to interact in HuRI share more than 10% of their interaction partners. Both HuRI PPIs and the PSN are enriched for links between proteins of similar function (P < 0.01, one-sided empirical test; Fig. 2a, Supplementary Table 20, Extended Data Fig. 5e, f) and both contain more functional modules[27] than our previously published interactome maps[2,14] (Fig. 2b, Extended Data Fig. 5g). Thus, HuRI and the PSN are complementary maps of functional relationships between proteins.
Extended Data Fig. 5 |

Topological and functional significance of HuRI.

a, Examples of protein pairs in HuRI with high interaction profile similarity and both high (left) and low sequence identity (right). b, The number of pairs of proteins in HuRI and 100 random networks at increasing Jaccard similarity cutoffs, with boxplots showing median, interquartile range (IQR), and 1.5 × IQR (with outliers). PSN: profile similarity network. c, Enrichment over random networks of the sum of Jaccard similarities of pairs of proteins in HuRI above at increasing thresholds of sequence identity. Error bars are 95% confidence intervals, center is relative to mean of random networks. d, Fraction of PSN edges that are also PPIs in HuRI, split by the PPIs involving no, one or two self-interacting proteins (SIPs), at increasing Jaccard similarity cutoffs. Error bars are standard error of proportion. e, f, Enrichment over random networks of the PPI count (left) or sum of Jaccard similarities (right) of HuRI PPIs or PSN pairs, respectively, at increasing co-expression (e) and co-fitness (f) cutoffs. Error bars are 95% confidence interval, center is relative to mean of random networks. g, Functional modules in HuRI (top) and its PSN (bottom) with functional annotations. h, Heatmaps of PPI counts, ordered by number of publications, for our previous human interactome maps and Lit-BM i, j, Fraction of genes with at least one PPI for biomedically interesting genes. Heatmap of HuRI and Lit-BM PPI counts between proteins, ordered by number of publications, restricted to PPIs involving genes from the corresponding gene set. k, Schematic of relation between variables: observed PPI degree, abundance, study bias and lethality. l, Correlation matrices. LoF: Loss-of-Function. PPI datasets refer to their network degree. m, Degree distribution of various PPI networks, together. n, Empirical determination of significance of correlation between various network degrees and gene properties. HuRI-2s = subset of HuRI found in at least two screens, (n = 13,441–53,704 PPIs per network).

Fig. 2 |

Complementary functional relationships in HuRI between genes.

a, Enrichment of HuRI and its profile similarity network (PSN) for protein pairs with shared functional annotation, showing mean and 95% interval of 100 random networks. b, Functional modules in HuRI and its PSN and in previously published interactome maps from CCSB.

As shown above, global sequence identity between two proteins can be indicative of shared interaction interfaces, however, it likely fails to identify pairs of proteins whose shared interaction interface is small. Indeed, 50% (502) of all protein pairs in HuRI with interaction profile similarities ≥ 0.5 exhibit ≤ 20% sequence identity, so that functional relationships identified by the PSN are not necessarily identifiable by sequence identity. One such protein pair is the endoplasmic reticulum (ER) transmembrane protein TMEM258 and the uncharacterized protein C19ORF18, which have only 10% sequence identity but share 80% of their interactors (Extended Data Fig. 5a, right panel). TMEM258 catalyzes the first step in N-glycosylation of proteins in the ER and might play a role in protein translocation across the ER[28]. Roles in protein transport and ER function have also been ascribed to two of the four shared interaction partners, ARL6IP1[29] and IER3IP1[30], suggesting that C19ORF18 and potentially the other two shared interaction partners (MFSD6 and AC012254.2) contribute to ER-related functions of protein maturation and transport.

Uncharted disease-related interactions

Unlike Lit-BM, HuRI was generated by systematically testing protein pairs for interaction. While Lit-BM is highly biased towards the most studied genes[2], HuRI covers the genome-by-genome space more uniformly and at increased depth compared to Lit-BM and our previous screening efforts (Fig. 3a, Extended Data Fig. 5h). Interestingly, we find that the agreement between Lit-BM and HuRI is highest among the best-studied genes, where Lit-BM is most complete and where ~40% of the PPIs in HuRI have been previously identified (Fig. 3b). HuRI substantially expands the number of biomedically interesting genes for which high-quality direct PPI data is available (Fig. 3c, Extended Data Fig. 5i), and finds new interaction partners for these genes in previously uncharted regions of the protein interactome (Fig. 3d, Extended Data Fig. 5j).
Fig. 3 |

Unbiased proteome coverage of HuRI reveals uncharted network neighborhoods of disease-related genes.

a, Heatmaps of Y2H PPI counts, ordered by number of publications. b, Fraction of HuRI PPIs in Lit-BM, for increasing values of the minimum number of publications per protein. Error bar is standard error of proportion, n = 52,569–170 PPIs. c, Fraction of genes with at least one PPI for biomedically interesting genes. d, As a, but restricted to PPIs involving genes from the indicated gene sets. e, Correlation between degree and variables of interest, before (top) and after (bottom) correcting for the technical confounding factors (n = 13,441–53,704 PPIs per network, two-tailed permutation test).

Essential proteins were often found to have significantly more interaction partners[31]. However, correlation between two variables does not necessarily imply causality, especially where there are other confounding variables (Extended Data Fig. 5k). We find that protein popularity (measured by publication count) and endogenous expression level strongly correlate with: i) each other; ii) the number of interaction partners (‘degree’) in protein complex and literature-curated protein interaction networks but not in HuRI; as well as iii) gene properties such as essentiality, age[32], fitness effect[33] and loss-of-function intolerance[6] (Extended Data Fig. 5l, Supplementary Table 21). After correcting for popularity and expression level, we found substantially reduced correlations between interaction degree and other gene properties, including gene essentiality (Fig. 3e, Extended Data Fig. 5m, n). In line with previous observations[34], this suggests that correlations between degree and essentiality, age, loss-of-function intolerance, and fitness effects are confounded by underlying expression and study biases, and thus may not reflect causal relationships. These results highlight the value of HuRI as a uniformly-mapped reference for the study of systems properties of genes and networks.

Compartment-specific roles of PPIs

Proteins are localized to specific compartments, carrying out functions that depend both on the subcellular environment and the local PPI network. Despite available proteome-wide datasets on the localization of individual proteins[5], experimental determination of cellular localization-specific PPI networks remains challenging. We find that proteins localizing to a diverse range of subcellular compartments are evenly represented in HuRI (Extended Data Fig. 6a), suggesting that cellular localization-specific PPI networks can be inferred for many different cellular compartments via integration of HuRI with available protein localization data.
Extended Data Fig. 6 |

Incomplete protein localization annotation likely underlies apparent lack of co-localization of proteins interacting in HuRI.

a, Odds ratios of proteins in different subcellular compartments and PPI datasets. n = 125–3,941 proteins per compartment, two-tailed Fisher’s exact test. b, The subnetwork of HuRI involving extracellular vesicle (EV) proteins. Names of high-degree proteins are shown. c, Number of PPIs in HuRI between EV proteins compared to the distribution from randomized networks (grey). d, Western Blot of SDCBP (left panel) and ACTB (loading control, right panel) in wild-type (WT) and three knockout (KO) cell lines (#7-#9), repeated twice in two independent laboratories. Full scanned image was displayed, obtained by ChemiDoc MP imager (Bio-Rad, Hercules, CA). Cell line #8 was used for EV proteomics. e, Fraction of proteins whose abundance in EVs was significantly reduced in the SDCBP KO cell line, split by proteins interacting and not interacting with SDCBP as identified in HuRI. Error bars are standard error of proportion (n = 6 interactors, 638 non-interactors, *p = 0.042, one-tailed empirical test). f, Schematic illustrating that the number of HuRI PPIs between proteins from two different compartments should correlate with the enrichment of both compartment pairs to overlap, if co-localization annotation is incomplete. g, Scatter plot showing, for each pair of subcellular compartments, odds ratios quantifying the enrichment for proteins located in both compartments versus the enrichment of the density of PPIs between proteins located to either compartment. Size of points is scaled by the standard error of the x axis variable. Regression line and 95% confidence interval are shown. h, The z-score of the regression slope of g compared to those of random networks.

Extracellular vesicles (EVs) have been studied intensively using proteomics approaches[35]. However, our understanding of the molecular mechanisms that lead to protein recruitment into EVs and subsequent secretion remains limited. The subnetwork of interactions between EV proteins (Extended Data Fig. 6b) shows significantly higher connectivity in HuRI than in degree-controlled randomized networks (P < 0.001, one-sided empirical test; Extended Data Fig. 6c), enabling prediction of EV recruiters using the number of EV interaction partners. Seven of the top 21 most connected proteins in this EV network have established roles in EV biogenesis or cargo recruitment[36]. SDCBP (syntenin-1) functions in ESCRT-dependent EV generation and its knockout shows reduced EV production[37]. SDCBP has 48 PPIs with other EV proteins and is frequently detected in EVs, suggesting that it regulates recruitment of interacting proteins into EVs. To test this hypothesis (Fig. 4a), we knocked out SDCBP in the U373vIII cell line (Extended Data Fig. 6d). Of six SDCBP partners detected in the U373vIII EV proteome, three (CALM1, CEP55, HPRT1) displayed significantly reduced protein levels in EVs in the SDCBP knockout line (P < 0.05, one-sided empirical test, depletion < 0.66; Fig. 4b). In contrast, only 15% of the non-interaction partners of SDCBP were reduced (P = 0.042, one-sided empirical test; Extended Data Fig. 6e). Thus, SDCBP may play a role in the recruitment of proteins into EVs, highlighting the potential value of HuRI in studying protein function within specific subcellular contexts.
Fig. 4 |

Identification of potential recruiters of proteins into extracellular vesicles.

a, Schematic of experimental design to test EV recruitment function of proteins. MS: Mass Spectrometry. b, Protein abundance from EVs for each gene in WT (wild-type) and SDCBP KO (knockout). Mean values of n = 3 biological replicates.

Proteins often change subcellular localization, e.g. as part of their maturation process or in response to external or internal signaling events. These dynamics are difficult to comprehensively capture at proteome-scale, yet existing efforts have already revealed numerous proteins that localize to multiple subcellular compartments with some pairs of compartments exhibiting significantly more crosstalk than others[5]. Despite a tendency for interactions in HuRI to link proteins localized to the same compartment (P < 0.01, one-sided empirical test; Fig. 2a), a considerable number of interactions were identified between proteins never reported to co-localize. To explore whether lack of co-localization of interacting proteins in HuRI might originate from incomplete localization data, we assessed whether these non-co-localized interacting proteins tend to reside in compartments with significant crosstalk and observed a significant positive correlation (P < 0.001, one-sided empirical test; Extended Data Fig. 6f–h). This suggests that HuRI could prove useful in predicting protein localization dynamics.

Principles of tissue-specific function

Despite recent advances in systematic genome-wide identification of tissue-preferentially expressed (TiP) genes[4] (Extended Data Fig. 7, Supplementary Table 22), we lack a concrete understanding of how the surprisingly small set of TiP genes operate together and coordinate their activity with the core ‘housekeeping’ machinery to mediate tissue-specific functions. Insights can be obtained from investigation of the tissue-specific network context of TiP proteins, inferred from integrating protein interactome data with tissue transcriptomes (Supplementary Table 23). However, we should not expect uniform coverage of TiP proteins with PPIs using experimental methods that demand expression of both partners within a single immortalized cell line or result from screening an incomplete ORFeome. Indeed, contrary to protein complex[11-13] and literature-curated interactome maps[38] as well as our previously published binary PPI datasets[2,14], we find that TiP proteins are well-represented in HuRI (Fig. 5a, Extended Data Fig. 8a).
Extended Data Fig. 7 |

Investigation of tissue-preferential expression data.

a, Examples of genes displaying different levels of tissue-preferential (TiP) expression across the GTEx tissue panel (left), with boxplots showing median, interquartile range (IQR), and 1.5 × IQR (with outliers), n = 90–779 samples per tissue. Equation to calculate tissue-preferential expression for every gene-tissue pair and the maximum TiP value for every gene (middle). Number of genes showing tissue-preferential expression for increasing tissue-preferential expression cutoffs (right). b, Relative number of TiP genes for every tissue for increasing tissue-preferential expression cutoffs. c-d, Differences in number of TiP genes upon removal of testis prior to TiP value calculation per tissue (TiP value cutoff = 2) (c) and in total for increasing tissue-preferential expression cutoffs (d). e, Number of TiP genes and number of TiP genes that are also exclusively expressed in one tissue (sglTis: single tissue) for increasing tissue-preferential expression cutoffs.

Fig. 5 |

Tissue-specific functions are largely mediated by interactions between TiP proteins and uniformly expressed proteins.

a, Tissue-preferentially expressed (TiP) protein coverage by PPI networks for increasing levels of tissue-preferential expression (shaded error bands proportional to standard error on proportion, n ≥ 233 genes). b, Tissue-preferential sub-networks. *P < 0.001, 1-sided empirical test for TiP proteins being close to each other (n = 19,960–30,217 PPIs per subnetwork). c, Empirical test of closeness of TiP proteins in the brain sub-network, 1,000 random networks. d, Tissue-specific diseases split by tissue-preferential expression levels of causal genes. e, Tested tissue-specific diseases split by PPI perturbation result. f, Expression profile of PNKP and interactors in brain tissues and PPI perturbation pattern of disease causing (Glu326Lys) and benign (Pro20Ser) mutation. Yeast growth phenotypes on SC-Leu-Trp (upper) or SC-Leu-Trp-His+3AT (3-Amino-1,2,4-triazole) media (lower). Green gene symbols: preferentially expressed. Only interactors expressed in brain shown.

Extended Data Fig. 8 |

PPIs between TiP proteins and uniformly expressed proteins likely adapt basic cellular processes to mediate cellular context-specific functions.

a, TiP protein coverage by CCSB PPI networks for increasing levels of tissue-preferential expression, (shaded error bands proportional to standard error on proportion, n ≥ 233 genes). b, Spearman correlation coefficients and 95% confidence intervals for correlations between degree or betweenness and tissue specificity for HuRI and Lit-BM (n = 6,684 and 4,971 proteins). c, Fraction of HuRI and Lit-BM that involve TiP proteins compared to fraction of genome that are TiP genes for increasing levels of tissue-preferential expression. d, Number of PPIs in HuRI, involving proteins in GTEx, where both proteins are expressed in the same tissue, and the mean of the tissue-specific subnetworks where error bar is standard deviation. e, Test for enrichment of TiP-TiP PPIs (left) and significance of average shortest path between TiP proteins (middle) in each tissue subnetwork, number of TiP proteins in each subnetwork, interacting with other TiP proteins, being part of Keratin (KRT) or Late-cornified envelope (LCE) protein family (right). f, g, Transcript expression levels across the BLUEPRINT hematopoietic cell lineage (f) and GTEx tissue panel (g) for three candidate genes predicted to function in apoptosis. EG = esophagus gastroesophageal. h, Histogram of number of untransfected cells and their time of death (left) without (top) and with (bottom) addition of TRAIL. Time of death of cells expressing OTUD6A-GFP fusions versus OTUD6A expression measured as fluorescence (right) without (top) and with (bottom) addition of TRAIL. i, Apoptosis-related network context of OTUD6A and C6ORF222 in HuRI, unfiltered (left) and filtered using colon transverse or mature eosinophil transcript levels (right).

Restricting HuRI to PPIs between proteins expressed in the same tissue, we observe that TiP proteins engage in as many PPIs and are as central as more uniformly expressed proteins (Extended Data Fig. 8b), contrary to previous observations using literature-curated PPI networks[39,40]. This result, paired with the fact that PPIs mediated by a TiP protein are effectively also tissue-specific, leads to the finding that the fraction of tissue-specific PPIs in the protein interactome as characterized by HuRI is higher than that of tissue-specific genes in the expressed genome, indicating that substantial information on tissue-specific functions can only be obtained from the interactome. The opposite is observed for Lit-BM, likely owing to its bias against TiP genes (Extended Data Fig. 8c). To investigate the local network neighborhoods of TiP proteins within their respective tissue contexts, we used HuRI to derive protein interactome maps for 35 tissues[4,41], each of which contains about 25,000 PPIs (Supplementary Table 24, Extended Data Fig. 8d). Within each tissue PPI network, we focused on the interactions involving at least one TiP protein (Fig. 5b). The TiP PPI networks show extensive interactions between TiP and non-TiP proteins, but with few TiP-TiP PPIs. Despite significant enrichments for HuRI to link proteins that work in the same biological process, TiP-TiP PPIs, as highlighted for brain in Fig. 5c, are not enriched, nor is the average shortest path among TiP proteins shorter than in degree-controlled randomized networks (P > 0.05, empirical test). Using either metric, TiP proteins were found to be significantly close to each other in only six of 35 tissues. In four of these six tissues, enrichment for network proximity was driven by clusters of specifically expressed keratins or late-cornified envelope proteins (Extended Data Fig. 8e). These results support a model in which tissue-specific functions emerge through interactions between TiP proteins and more uniformly expressed members of the basic cellular machinery, presumably modulating and adapting common cellular processes for cellular context-specific needs[42]. One biological process with both cell-type and developmental stage-specific homeostatic roles is apoptosis. We used HuRI to identify proteins with interaction partners that were enriched for known apoptosis regulators (Supplementary Table 25). Five proteins among the top ten predictions had previously demonstrated roles in apoptosis (Supplementary Note 3). Among three TiP genes predicted to be implicated in apoptosis (Extended Data Fig. 8f, g), we further examined OTUD6A. Abundance of OTUD6A negatively correlated with time-of-death after addition of TRAIL (TNF-related apoptosis-inducing ligand; P = 0.012, two-sided, empirical test, n = 40 cells), but not after expression of OTUD6A alone (Extended Data Fig. 8h, Supplementary Table 26), suggesting that OTUD6A participates in the apoptosis pathway but is not an inducer of cell death. This and other evidence (Extended Data Fig. 8f, i, Supplementary Note 3) suggests that OTUD6A exerts an apoptosis sensitization function via transcriptional activation in a haematopoietic cellular context.

Mechanisms of tissue-specific diseases

Many Mendelian diseases display tissue-specific phenotypes, rarely explained by tissue-specific expression of genes with disease-associated mutations[43] (Fig. 5d, Extended Data Fig. 9a). Such mutations broadly or specifically affect PPIs involving the mutated protein[44]. Perturbations of PPIs between uniformly expressed disease-associated proteins and TiP proteins in the affected tissues have been suggested to underlie the tissue-specific phenotypes of those diseases[43]. In HuRI-derived tissue PPI networks, we find 130 such PPIs involving 63 distinct non-TiP disease-causal proteins and 94 TiP proteins. Although we see no enrichment for PPIs between causal proteins and TiP proteins (Extended Data Fig. 9b, Supplementary Note 4), this does not rule out the possibility that perturbations of some of these interactions mediate tissue-specific phenotypes of Mendelian diseases.
Extended Data Fig. 9 |

Potential mechanisms of tissue-specific diseases.

a, Histogram of the number of Mendelian diseases showing symptoms in a number of tissues. b, Test for enrichment of causal proteins associated with tissue-specific Mendelian diseases to interact with TiP proteins of affected tissues. c, Network neighborhood of uniformly expressed causal proteins of tissue-specific diseases found to interact with TiP proteins in HuRI, indicating PPI perturbation by mutations. d, Causal genes split by mutation found to perturb PPI to TiP protein (dashed) or not (solid). e, Expression profile of PNKP and interactors in brain tissues and PPI perturbation pattern of disease causing (Glu326Lys) and benign (Pro20Ser) mutation. Yeast growth phenotypes on SC-Leu-Trp (upper) or SC-Leu-Trp-His+3AT media (lower) are shown; green/grey gene symbols: preferentially/not expressed.

To explore this hypothesis, we experimentally tested whether pathogenic variants associated with Mendelian diseases were able to perturb these PPIs. Of ten causal proteins tested, seven showed perturbation of PPIs to preferentially expressed interaction partners in the corresponding “disease tissues” (Fig 5e, Extended Data Fig. 9c, d, Supplementary Table 27, 28). One example is PNKP (polynucleotide kinase 3’-phosphatase) whose mutations have been associated with microcephaly, seizures, and developmental delay. The pathogenic PNKP mutation Glu326Lys affects neither the DNA kinase nor DNA phosphatase activity of PNKP, rendering the mechanism of pathogenicity unclear[45]. We observed that Glu326Lys perturbed PPIs with two partners preferentially expressed in the brain, SYNGR1 and TRIM37, whereas a benign control mutation Pro20Ser[46] did not affect any PNKP PPIs (Fig. 5f, Extended Data Fig. 9c, e). TRIM37 facilitates DNA repair[47], suggesting a potential mechanism by which perturbation of this interaction could affect the brain-specific DNA repair function of PNKP. In other examples, HuRI identified CTNNA3 and SUCLA2 to have respective TiP interaction partners TRIM54 and ARL6IP1 (Extended Data Fig. 10), which cause similar diseases with overlapping symptoms[48,49], supporting the relevance of these interactions in the physiopathology. Overall, this study yields hypotheses of molecular mechanisms for otherwise unexplained tissue-specific phenotypes of seven Mendelian diseases (Extended Data Fig. 9d) and demonstrates the utility of HuRI as a reference to study biological mechanisms within specific disease contexts.
Extended Data Fig. 10 |

Mutations in uniformly expressed causal proteins associated with tissue-specific Mendelian diseases perturb interactions to TiP proteins.

Expression profile and interaction perturbation profile of nine causal proteins and their interaction partners. Affected tissues were selected for display (top). Control of AD and DB (Gal 4 DNA binding domain) plasmid presence and cell density by spotting yeast colonies on SC-Leu-Trp media (upper). Detection of PPIs by spotting yeast on SC-Leu-Trp-His+3AT media (lower), where yeast growth indicates PPIs. WT = wild-type, red letters = causal proteins or alleles, grey gene symbols = interaction partners not expressed in affected tissues, grey alleles = not pathogenic, green gene symbols = TiP interaction partners in affected tissues.

Discussion

Here, we present HuRI, a systematically generated human protein interactome map with more than 50,000 PPIs of high biophysical quality. While HuRI displays highly significant overlap with known functional relationships, the cellular function of most individual PPIs remains to be elucidated. We show that follow-up studies on the function of proteins and PPIs can be guided by integration of HuRI with contextual genome, transcriptome and proteome data to infer the cellular context in which subnetworks of PPIs operate together to mediate a function. With advances in single cell transcriptomics[8] as well as systematic determination of subcellular protein localization, inference of functional PPI subnetworks that are specific for a given cellular state will further increase in precision. However, a priori removal of PPIs from HuRI because they are not currently known to function together in a physiological cellular context could discard data that can help, e.g., to understand the functional consequences of dysregulated gene and protein expression causing disease. Despite our extensive screening, many PPIs remained undetected as Y2H, like all characterized assays[15,16,18], has limited sensitivity, detecting no interactions for half of the tested proteins. This negative result can guide the design of future interactome mapping efforts to target these proteins. Due to the limitations of Y2H, we expect HuRI to be depleted for PPIs that depend on post-translational processing of human proteins that the yeast cell is unable to catalyze or that require additional partners to stabilize the interaction. Screening only one isoform per gene also misses interaction partners specific to alternative spliceforms[50]. Accurate estimation of the total size of the interactome remains challenging. PPIs display a continuum of binding strength or stability that, along with other factors, could underlie a continuum of detectability, as this study suggests. Furthermore, the results obtained by us and others[12] indicate that very stable and functionally conserved PPIs constitute a minority of the interactome. While incomplete, HuRI’s proteome and interactome coverage and uniformity enable its use as a reference for the study of most aspects of human cellular function. Efforts to further complete this reference will require development of new technologies as well as integration with complementary reference maps of protein complex assemblies[13]. Although multiple challenges remain to be solved for a complete and context-specific map of protein functions, interactions, and higher-level organization, HuRI provides an unbiased genome-scale scaffold with which to coordinate this information as it emerges.

Y2H assay development and validation of HuRI.

a, Number of protein-coding genes in hORFeome v9.1 and GTEx, FANTOM, and HPA transcriptome projects. The number of genes in hORFeome v9.1 is on par with the number of genes found to be expressed in three comprehensive individual transcriptome sequencing studies and includes 94% of the genes with robust evidence of expression in all three. b, Overlap between hORFeome v9.1 and intersection of transcriptomes in a. c, Individual and combined recovery of PRSv1 and RRSv1 pairs by Y2H assay versions (n = 252, 270). d, Colored squares showing which protein pairs were detected in PRSv1 (left) and RRSv1 (right) by Y2H assay versions. e, Recovery rates of Lit-BM and PPIs from screens of a 2k-by-2k gene test space per Y2H assay version in MAPPIT. f, Cumulative PPI count performing three screens with each Y2H assay version in the test space compared to nine screens with Y2H assay version 1. g, h, MAPPIT and GPCA recovery of Lit-BM and PPIs from screens of Space III when split by screen at a RRS rate of 1% (g) or across a range of thresholds (h). All error bars, in c, e, g, are 68.3% Bayesian confidence interval, shaded error band in h is standard error of proportion and n = between 101 and 395 pairs successfully tested for each category. i, Number of proteins in HuRI, detected with each additional screen.

Definition of literature-curated PPI datasets.

a, Categorization of literature-curated PPIs into distinct subsets based on the experimental methods in which they were detected and the number of pieces of experimental evidence. b-e, Results of testing the different categories of literature-curated pairs in Y2H (b, d) and MAPPIT (c, e) where the pairs have been further divided into HT - high throughput and LT - low throughput subsets (b, c). BM: binary multiple; BS: binary singleton; NB: non-binary. Between n = 191–471 successfully tested PPIs for each category.

Stericity and interaction strength contribute to PPI detectability.

a, b, Fraction of PPIs with N or C-terminus < 10 Å (a) or 20 Å (b) to PPI interface, for PPIs with known structure in and not in HuRI (n = 37–1,891 PPIs). Error bars are standard error of proportion. The structure of UBE2D3 bound to RNF115 illustrates an example of a PPI found only by Y2H assay version 3 (PDB code: 5ulh). c, MAPPIT recovery rates of HuRI and Lit-BM PPIs that were also detected in HuRI by the number of screens each pair was detected in. Error bars are 68.3% Bayesian confidence interval (n = 22–793 PPIs successfully tested in each category). d, MAPPIT recovery rates of Lit-BM PPIs that were also detected in HuRI, for increasing number of pieces of experimental evidence per PPI. Error bars are 68.3% Bayesian confidence interval (n = 24–61 PPIs successfully tested in each category). e-f, Distributions of interaction interface area (e) or number of atomic contacts (f) by the number of HuRI screens in which a PPI is detected, with boxplots showing median, interquartile range (IQR), and 1.5 × IQR (with outliers), n = 1004 PPIs. g, Examples of within-complex interactions detected in HuRI (purple) and BioPlex (orange). Fraction of HuRI PPIs between proteins of protein complexes that link proteins of the same complex, split by PPIs found in single and multiple screens (dark purple). Error bars are standard error of proportion, n = 1,042 and 775 PPIs. h, Number of screens each PPI in HuRI was detected in, split by Y2H assay version. i, Number of Y2H assay versions each PPI in HuRI was detected in. j, Estimates of the size of the total binary protein interactome and the fraction covered by HuRI, as a function of the minimum number of publications per gene and the minimum number of evidence for the Lit-BM reference. Error bands are 68.3% Bayesian confidence interval, n ≥ 170 Lit-BM PPIs.

HuRI provides direct contact information for proteins in complexes.

Intra-complex PPIs are shown for protein complexes from CORUM as found in BioPlex (orange) or HuRI (purple). HuRI PPIs are further distinguished into PPIs found in single (light purple) or multiple screens (dark purple).

Topological and functional significance of HuRI.

a, Examples of protein pairs in HuRI with high interaction profile similarity and both high (left) and low sequence identity (right). b, The number of pairs of proteins in HuRI and 100 random networks at increasing Jaccard similarity cutoffs, with boxplots showing median, interquartile range (IQR), and 1.5 × IQR (with outliers). PSN: profile similarity network. c, Enrichment over random networks of the sum of Jaccard similarities of pairs of proteins in HuRI above at increasing thresholds of sequence identity. Error bars are 95% confidence intervals, center is relative to mean of random networks. d, Fraction of PSN edges that are also PPIs in HuRI, split by the PPIs involving no, one or two self-interacting proteins (SIPs), at increasing Jaccard similarity cutoffs. Error bars are standard error of proportion. e, f, Enrichment over random networks of the PPI count (left) or sum of Jaccard similarities (right) of HuRI PPIs or PSN pairs, respectively, at increasing co-expression (e) and co-fitness (f) cutoffs. Error bars are 95% confidence interval, center is relative to mean of random networks. g, Functional modules in HuRI (top) and its PSN (bottom) with functional annotations. h, Heatmaps of PPI counts, ordered by number of publications, for our previous human interactome maps and Lit-BM i, j, Fraction of genes with at least one PPI for biomedically interesting genes. Heatmap of HuRI and Lit-BM PPI counts between proteins, ordered by number of publications, restricted to PPIs involving genes from the corresponding gene set. k, Schematic of relation between variables: observed PPI degree, abundance, study bias and lethality. l, Correlation matrices. LoF: Loss-of-Function. PPI datasets refer to their network degree. m, Degree distribution of various PPI networks, together. n, Empirical determination of significance of correlation between various network degrees and gene properties. HuRI-2s = subset of HuRI found in at least two screens, (n = 13,441–53,704 PPIs per network).

Incomplete protein localization annotation likely underlies apparent lack of co-localization of proteins interacting in HuRI.

a, Odds ratios of proteins in different subcellular compartments and PPI datasets. n = 125–3,941 proteins per compartment, two-tailed Fisher’s exact test. b, The subnetwork of HuRI involving extracellular vesicle (EV) proteins. Names of high-degree proteins are shown. c, Number of PPIs in HuRI between EV proteins compared to the distribution from randomized networks (grey). d, Western Blot of SDCBP (left panel) and ACTB (loading control, right panel) in wild-type (WT) and three knockout (KO) cell lines (#7-#9), repeated twice in two independent laboratories. Full scanned image was displayed, obtained by ChemiDoc MP imager (Bio-Rad, Hercules, CA). Cell line #8 was used for EV proteomics. e, Fraction of proteins whose abundance in EVs was significantly reduced in the SDCBP KO cell line, split by proteins interacting and not interacting with SDCBP as identified in HuRI. Error bars are standard error of proportion (n = 6 interactors, 638 non-interactors, *p = 0.042, one-tailed empirical test). f, Schematic illustrating that the number of HuRI PPIs between proteins from two different compartments should correlate with the enrichment of both compartment pairs to overlap, if co-localization annotation is incomplete. g, Scatter plot showing, for each pair of subcellular compartments, odds ratios quantifying the enrichment for proteins located in both compartments versus the enrichment of the density of PPIs between proteins located to either compartment. Size of points is scaled by the standard error of the x axis variable. Regression line and 95% confidence interval are shown. h, The z-score of the regression slope of g compared to those of random networks.

Investigation of tissue-preferential expression data.

a, Examples of genes displaying different levels of tissue-preferential (TiP) expression across the GTEx tissue panel (left), with boxplots showing median, interquartile range (IQR), and 1.5 × IQR (with outliers), n = 90–779 samples per tissue. Equation to calculate tissue-preferential expression for every gene-tissue pair and the maximum TiP value for every gene (middle). Number of genes showing tissue-preferential expression for increasing tissue-preferential expression cutoffs (right). b, Relative number of TiP genes for every tissue for increasing tissue-preferential expression cutoffs. c-d, Differences in number of TiP genes upon removal of testis prior to TiP value calculation per tissue (TiP value cutoff = 2) (c) and in total for increasing tissue-preferential expression cutoffs (d). e, Number of TiP genes and number of TiP genes that are also exclusively expressed in one tissue (sglTis: single tissue) for increasing tissue-preferential expression cutoffs.

PPIs between TiP proteins and uniformly expressed proteins likely adapt basic cellular processes to mediate cellular context-specific functions.

a, TiP protein coverage by CCSB PPI networks for increasing levels of tissue-preferential expression, (shaded error bands proportional to standard error on proportion, n ≥ 233 genes). b, Spearman correlation coefficients and 95% confidence intervals for correlations between degree or betweenness and tissue specificity for HuRI and Lit-BM (n = 6,684 and 4,971 proteins). c, Fraction of HuRI and Lit-BM that involve TiP proteins compared to fraction of genome that are TiP genes for increasing levels of tissue-preferential expression. d, Number of PPIs in HuRI, involving proteins in GTEx, where both proteins are expressed in the same tissue, and the mean of the tissue-specific subnetworks where error bar is standard deviation. e, Test for enrichment of TiP-TiP PPIs (left) and significance of average shortest path between TiP proteins (middle) in each tissue subnetwork, number of TiP proteins in each subnetwork, interacting with other TiP proteins, being part of Keratin (KRT) or Late-cornified envelope (LCE) protein family (right). f, g, Transcript expression levels across the BLUEPRINT hematopoietic cell lineage (f) and GTEx tissue panel (g) for three candidate genes predicted to function in apoptosis. EG = esophagus gastroesophageal. h, Histogram of number of untransfected cells and their time of death (left) without (top) and with (bottom) addition of TRAIL. Time of death of cells expressing OTUD6A-GFP fusions versus OTUD6A expression measured as fluorescence (right) without (top) and with (bottom) addition of TRAIL. i, Apoptosis-related network context of OTUD6A and C6ORF222 in HuRI, unfiltered (left) and filtered using colon transverse or mature eosinophil transcript levels (right).

Potential mechanisms of tissue-specific diseases.

a, Histogram of the number of Mendelian diseases showing symptoms in a number of tissues. b, Test for enrichment of causal proteins associated with tissue-specific Mendelian diseases to interact with TiP proteins of affected tissues. c, Network neighborhood of uniformly expressed causal proteins of tissue-specific diseases found to interact with TiP proteins in HuRI, indicating PPI perturbation by mutations. d, Causal genes split by mutation found to perturb PPI to TiP protein (dashed) or not (solid). e, Expression profile of PNKP and interactors in brain tissues and PPI perturbation pattern of disease causing (Glu326Lys) and benign (Pro20Ser) mutation. Yeast growth phenotypes on SC-Leu-Trp (upper) or SC-Leu-Trp-His+3AT media (lower) are shown; green/grey gene symbols: preferentially/not expressed.

Mutations in uniformly expressed causal proteins associated with tissue-specific Mendelian diseases perturb interactions to TiP proteins.

Expression profile and interaction perturbation profile of nine causal proteins and their interaction partners. Affected tissues were selected for display (top). Control of AD and DB (Gal 4 DNA binding domain) plasmid presence and cell density by spotting yeast colonies on SC-Leu-Trp media (upper). Detection of PPIs by spotting yeast on SC-Leu-Trp-His+3AT media (lower), where yeast growth indicates PPIs. WT = wild-type, red letters = causal proteins or alleles, grey gene symbols = interaction partners not expressed in affected tissues, grey alleles = not pathogenic, green gene symbols = TiP interaction partners in affected tissues.
  49 in total

1.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

Review 2.  Interactome networks and human disease.

Authors:  Marc Vidal; Michael E Cusick; Albert-László Barabási
Journal:  Cell       Date:  2011-03-18       Impact factor: 41.582

3.  [Somatic polysaccharide-containing antigens of the plague bacterium].

Authors:  E E Bakhrakh; V I Veĭnblat
Journal:  Zh Mikrobiol Epidemiol Immunobiol       Date:  1972-03

4.  Human genomics. The human transcriptome across tissues and individuals.

Authors:  Marta Melé; Pedro G Ferreira; Ferran Reverter; David S DeLuca; Jean Monlong; Michael Sammeth; Taylor R Young; Jakob M Goldmann; Dmitri D Pervouchine; Timothy J Sullivan; Rory Johnson; Ayellet V Segrè; Sarah Djebali; Anastasia Niarchou; Fred A Wright; Tuuli Lappalainen; Miquel Calvo; Gad Getz; Emmanouil T Dermitzakis; Kristin G Ardlie; Roderic Guigó
Journal:  Science       Date:  2015-05-08       Impact factor: 47.728

5.  A subcellular map of the human proteome.

Authors:  Peter J Thul; Lovisa Åkesson; Mikaela Wiking; Diana Mahdessian; Aikaterini Geladaki; Hammou Ait Blal; Tove Alm; Anna Asplund; Lars Björk; Lisa M Breckels; Anna Bäckström; Frida Danielsson; Linn Fagerberg; Jenny Fall; Laurent Gatto; Christian Gnann; Sophia Hober; Martin Hjelmare; Fredric Johansson; Sunjae Lee; Cecilia Lindskog; Jan Mulder; Claire M Mulvey; Peter Nilsson; Per Oksvold; Johan Rockberg; Rutger Schutten; Jochen M Schwenk; Åsa Sivertsson; Evelina Sjöstedt; Marie Skogs; Charlotte Stadler; Devin P Sullivan; Hanna Tegel; Casper Winsnes; Cheng Zhang; Martin Zwahlen; Adil Mardinoglu; Fredrik Pontén; Kalle von Feilitzen; Kathryn S Lilley; Mathias Uhlén; Emma Lundberg
Journal:  Science       Date:  2017-05-11       Impact factor: 47.728

6.  The sequence of the human genome.

Authors:  J C Venter; M D Adams; E W Myers; P W Li; R J Mural; G G Sutton; H O Smith; M Yandell; C A Evans; R A Holt; J D Gocayne; P Amanatides; R M Ballew; D H Huson; J R Wortman; Q Zhang; C D Kodira; X H Zheng; L Chen; M Skupski; G Subramanian; P D Thomas; J Zhang; G L Gabor Miklos; C Nelson; S Broder; A G Clark; J Nadeau; V A McKusick; N Zinder; A J Levine; R J Roberts; M Simon; C Slayman; M Hunkapiller; R Bolanos; A Delcher; I Dew; D Fasulo; M Flanigan; L Florea; A Halpern; S Hannenhalli; S Kravitz; S Levy; C Mobarry; K Reinert; K Remington; J Abu-Threideh; E Beasley; K Biddick; V Bonazzi; R Brandon; M Cargill; I Chandramouliswaran; R Charlab; K Chaturvedi; Z Deng; V Di Francesco; P Dunn; K Eilbeck; C Evangelista; A E Gabrielian; W Gan; W Ge; F Gong; Z Gu; P Guan; T J Heiman; M E Higgins; R R Ji; Z Ke; K A Ketchum; Z Lai; Y Lei; Z Li; J Li; Y Liang; X Lin; F Lu; G V Merkulov; N Milshina; H M Moore; A K Naik; V A Narayan; B Neelam; D Nusskern; D B Rusch; S Salzberg; W Shao; B Shue; J Sun; Z Wang; A Wang; X Wang; J Wang; M Wei; R Wides; C Xiao; C Yan; A Yao; J Ye; M Zhan; W Zhang; H Zhang; Q Zhao; L Zheng; F Zhong; W Zhong; S Zhu; S Zhao; D Gilbert; S Baumhueter; G Spier; C Carter; A Cravchik; T Woodage; F Ali; H An; A Awe; D Baldwin; H Baden; M Barnstead; I Barrow; K Beeson; D Busam; A Carver; A Center; M L Cheng; L Curry; S Danaher; L Davenport; R Desilets; S Dietz; K Dodson; L Doup; S Ferriera; N Garg; A Gluecksmann; B Hart; J Haynes; C Haynes; C Heiner; S Hladun; D Hostin; J Houck; T Howland; C Ibegwam; J Johnson; F Kalush; L Kline; S Koduru; A Love; F Mann; D May; S McCawley; T McIntosh; I McMullen; M Moy; L Moy; B Murphy; K Nelson; C Pfannkoch; E Pratts; V Puri; H Qureshi; M Reardon; R Rodriguez; Y H Rogers; D Romblad; B Ruhfel; R Scott; C Sitter; M Smallwood; E Stewart; R Strong; E Suh; R Thomas; N N Tint; S Tse; C Vech; G Wang; J Wetter; S Williams; M Williams; S Windsor; E Winn-Deen; K Wolfe; J Zaveri; K Zaveri; J F Abril; R Guigó; M J Campbell; K V Sjolander; B Karlak; A Kejariwal; H Mi; B Lazareva; T Hatton; A Narechania; K Diemer; A Muruganujan; N Guo; S Sato; V Bafna; S Istrail; R Lippert; R Schwartz; B Walenz; S Yooseph; D Allen; A Basu; J Baxendale; L Blick; M Caminha; J Carnes-Stine; P Caulk; Y H Chiang; M Coyne; C Dahlke; A Deslattes Mays; M Dombroski; M Donnelly; D Ely; S Esparham; C Fosler; H Gire; S Glanowski; K Glasser; A Glodek; M Gorokhov; K Graham; B Gropman; M Harris; J Heil; S Henderson; J Hoover; D Jennings; C Jordan; J Jordan; J Kasha; L Kagan; C Kraft; A Levitsky; M Lewis; X Liu; J Lopez; D Ma; W Majoros; J McDaniel; S Murphy; M Newman; T Nguyen; N Nguyen; M Nodell; S Pan; J Peck; M Peterson; W Rowe; R Sanders; J Scott; M Simpson; T Smith; A Sprague; T Stockwell; R Turner; E Venter; M Wang; M Wen; D Wu; M Wu; A Xia; A Zandieh; X Zhu
Journal:  Science       Date:  2001-02-16       Impact factor: 47.728

7.  A proteome-scale map of the human interactome network.

Authors:  Thomas Rolland; Murat Taşan; Benoit Charloteaux; Samuel J Pevzner; Quan Zhong; Nidhi Sahni; Song Yi; Irma Lemmens; Celia Fontanillo; Roberto Mosca; Atanas Kamburov; Susan D Ghiassian; Xinping Yang; Lila Ghamsari; Dawit Balcha; Bridget E Begg; Pascal Braun; Marc Brehme; Martin P Broly; Anne-Ruxandra Carvunis; Dan Convery-Zupan; Roser Corominas; Jasmin Coulombe-Huntington; Elizabeth Dann; Matija Dreze; Amélie Dricot; Changyu Fan; Eric Franzosa; Fana Gebreab; Bryan J Gutierrez; Madeleine F Hardy; Mike Jin; Shuli Kang; Ruth Kiros; Guan Ning Lin; Katja Luck; Andrew MacWilliams; Jörg Menche; Ryan R Murray; Alexandre Palagi; Matthew M Poulin; Xavier Rambout; John Rasla; Patrick Reichert; Viviana Romero; Elien Ruyssinck; Julie M Sahalie; Annemarie Scholz; Akash A Shah; Amitabh Sharma; Yun Shen; Kerstin Spirohn; Stanley Tam; Alexander O Tejeda; Shelly A Trigg; Jean-Claude Twizere; Kerwin Vega; Jennifer Walsh; Michael E Cusick; Yu Xia; Albert-László Barabási; Lilia M Iakoucheva; Patrick Aloy; Javier De Las Rivas; Jan Tavernier; Michael A Calderwood; David E Hill; Tong Hao; Frederick P Roth; Marc Vidal
Journal:  Cell       Date:  2014-11-20       Impact factor: 41.582

8.  OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders.

Authors:  Joanna S Amberger; Carol A Bocchini; François Schiettecatte; Alan F Scott; Ada Hamosh
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 19.160

9.  The Human Cell Atlas.

Authors:  Aviv Regev; Sarah A Teichmann; Eric S Lander; Ido Amit; Christophe Benoist; Ewan Birney; Bernd Bodenmiller; Peter Campbell; Piero Carninci; Menna Clatworthy; Hans Clevers; Bart Deplancke; Ian Dunham; James Eberwine; Roland Eils; Wolfgang Enard; Andrew Farmer; Lars Fugger; Berthold Göttgens; Nir Hacohen; Muzlifah Haniffa; Martin Hemberg; Seung Kim; Paul Klenerman; Arnold Kriegstein; Ed Lein; Sten Linnarsson; Emma Lundberg; Joakim Lundeberg; Partha Majumder; John C Marioni; Miriam Merad; Musa Mhlanga; Martijn Nawijn; Mihai Netea; Garry Nolan; Dana Pe'er; Anthony Phillipakis; Chris P Ponting; Stephen Quake; Wolf Reik; Orit Rozenblatt-Rosen; Joshua Sanes; Rahul Satija; Ton N Schumacher; Alex Shalek; Ehud Shapiro; Padmanee Sharma; Jay W Shin; Oliver Stegle; Michael Stratton; Michael J T Stubbington; Fabian J Theis; Matthias Uhlen; Alexander van Oudenaarden; Allon Wagner; Fiona Watt; Jonathan Weissman; Barbara Wold; Ramnik Xavier; Nir Yosef
Journal:  Elife       Date:  2017-12-05       Impact factor: 8.140

10.  Panorama of ancient metazoan macromolecular complexes.

Authors:  Cuihong Wan; Blake Borgeson; Sadhna Phanse; Fan Tu; Kevin Drew; Greg Clark; Xuejian Xiong; Olga Kagan; Julian Kwan; Alexandr Bezginov; Kyle Chessman; Swati Pal; Graham Cromar; Ophelia Papoulas; Zuyao Ni; Daniel R Boutz; Snejana Stoilova; Pierre C Havugimana; Xinghua Guo; Ramy H Malty; Mihail Sarov; Jack Greenblatt; Mohan Babu; W Brent Derry; Elisabeth R Tillier; John B Wallingford; John Parkinson; Edward M Marcotte; Andrew Emili
Journal:  Nature       Date:  2015-09-07       Impact factor: 49.962

View more
  193 in total

1.  Piggybacking on Classical Import and Other Non-Classical Mechanisms of Nuclear Import Appear Highly Prevalent within the Human Proteome.

Authors:  Tanner M Tessier; Katelyn M MacNeil; Joe S Mymryk
Journal:  Biology (Basel)       Date:  2020-07-23

2.  A SARS-CoV-2 (COVID-19) biological network to find targets for drug repurposing.

Authors:  Mahnaz Habibi; Golnaz Taheri; Rosa Aghdam
Journal:  Sci Rep       Date:  2021-04-30       Impact factor: 4.379

3.  A functional screen identifies transcriptional networks that regulate HIV-1 and HIV-2.

Authors:  Kyle D Pedro; Luis M Agosto; Jared A Sewell; Kimberly A Eberenz; Xianbao He; Juan I Fuxman Bass; Andrew J Henderson
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-16       Impact factor: 11.205

Review 4.  Statistical mechanics meets single-cell biology.

Authors:  Andrew E Teschendorff; Andrew P Feinberg
Journal:  Nat Rev Genet       Date:  2021-04-19       Impact factor: 53.242

5.  Meta-analysis defines principles for the design and analysis of co-fractionation mass spectrometry experiments.

Authors:  Michael A Skinnider; Leonard J Foster
Journal:  Nat Methods       Date:  2021-07-01       Impact factor: 28.547

6.  mPPI: a database extension to visualize structural interactome in a one-to-many manner.

Authors:  Yekai Zhou; Hongjun Chen; Sida Li; Ming Chen
Journal:  Database (Oxford)       Date:  2021-06-22       Impact factor: 3.451

7.  Molecular architecture of postsynaptic Interactomes.

Authors:  Brent Wilkinson; Marcelo P Coba
Journal:  Cell Signal       Date:  2020-09-14       Impact factor: 4.315

8.  Comprehensive characterization of protein-protein interactions perturbed by disease mutations.

Authors:  Feixiong Cheng; Junfei Zhao; Yang Wang; Weiqiang Lu; Zehui Liu; Yadi Zhou; William R Martin; Ruisheng Wang; Jin Huang; Tong Hao; Hong Yue; Jing Ma; Yuan Hou; Jessica A Castrillon; Jiansong Fang; Justin D Lathia; Ruth A Keri; Felice C Lightstone; Elliott Marshall Antman; Raul Rabadan; David E Hill; Charis Eng; Marc Vidal; Joseph Loscalzo
Journal:  Nat Genet       Date:  2021-02-08       Impact factor: 38.330

9.  SECAT: Quantifying Protein Complex Dynamics across Cell States by Network-Centric Analysis of SEC-SWATH-MS Profiles.

Authors:  George Rosenberger; Moritz Heusel; Isabell Bludau; Ben C Collins; Claudia Martelli; Evan G Williams; Peng Xue; Yansheng Liu; Ruedi Aebersold; Andrea Califano
Journal:  Cell Syst       Date:  2020-12-16       Impact factor: 10.304

10.  Development of RNA binding proteins expression signature for prognosis prediction in gastric cancer patients.

Authors:  Liqiang Zhou; You Wu; Lin Xin; Qi Zhou; Shihao Li; Yiwu Yuan; Jinliang Wang; Dengzhong Wu
Journal:  Am J Transl Res       Date:  2020-10-15       Impact factor: 4.060

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.