
2C-BioID: An Advanced Two Component BioID System for Precision Mapping of Protein Interactomes.

Alexandre Chojnowski¹, Radoslaw M Sobota², Peh Fern Ong³, Wei Xie⁴, Xianrong Wong⁵, Oliver Dreesen³, Brian Burke⁶, Colin L Stewart⁷.

Abstract

The modulation of protein-protein interactions (PPIs) is an essential regulatory activity defining diverse cell functions in development and disease. BioID is an unbiased proximity-dependent biotinylation method making use of a biotin-protein ligase fused to a protein of interest and has become an important tool for mapping of PPIs within cellular contexts. We devised an advanced method, 2C-BioID, in which the biotin-protein ligase is kept separate from the protein of interest, until the two are induced to associate by the addition of a dimerizing agent. As proof of principle, we compared the interactomes of lamina-associated polypeptide 2β (LAP2β) with those of lamins A and C, using 2C- and conventional BioID. 2C-BioID greatly enhanced data robustness by facilitating the in silico elimination of non-specific interactors as well as overcoming the problems associated with aberrant protein localization. 2C-BioID therefore significantly strengthens the specificity and reliability of BioID-based interactome analysis, by the more stringent exclusion of false-positives and more efficient intracellular targeting.


Keywords: Methodology in Biological Sciences; Molecular Biology; Molecular Interaction

Year: 2018 PMID: 30500481 PMCID: PMC6263017 DOI: 10.1016/j.isci.2018.11.023

Source DB: PubMed Journal: iScience ISSN: 2589-0042

Introduction

BioID is a proximity-based assay developed by Roux and colleagues to define and map potential protein-protein interactions (PPIs) (Roux et al., 2012). The method relies upon the construction of a bait protein (or protein of interest [PoI]) fused to a promiscuous biotin-protein ligase (pBPL). In the original and most widely used version of BioID, the pBPL is a mutant form (R118G) of the E. coli BirA protein (BioID1). BioID1 was engineered to non-specifically biotinylate neighboring proteins containing exposed primary amines. The assay depends upon the expression of the BioID1-PoI fusion protein in an appropriate cell type, with the cellular location of the fusion protein being determined by the PoI. Subsequent addition of biotin to the culture medium promotes biotinylation of proteins within a 10- to 20-nm radius of the BioID1 moiety (Kim et al., 2014). The biotinylated proteins are recovered on streptavidin-coated magnetic beads and identified by mass spectrometry, or in some situations, by western blot. After in silico sifting, the remaining proteins represent either direct or indirect candidate interactors of the PoI. As a tool for PPI analysis, BioID provides a powerful complement to other established and widely used methods such as the yeast two-hybrid (Y2H) system, co-immunoprecipitation (IP), or pull-down approaches (Moosavi et al., 2017). The BioID method has several significant advantages. First, in contrast to Y2H, potential interactions are monitored in a more physiological cellular context employing, mostly, the full-length PoI. Moreover, BioID efficiently detects weak or transient interactions that may otherwise be missed. Importantly, BioID is unaffected by PoI or target protein solubility because the biotinylation step that flags potential interactors takes place before cell lysis. This allows for biotinylated proteins to be recovered under denaturing conditions, thereby reducing non-specific background interactions (Roux et al., 2012). 
On the other hand, because of their simplicity combined with the overall speed of implementation, pull-down or IP should represent methods of choice when first embarking on interactome analyses. The drawback associated with these approaches, however, is that they can be exquisitely sensitive to cell lysis conditions and rely on experimental designs that tread a fine line between the solubilization of protein complexes and retention of meaningful interactions. Although these issues can be overcome in part by employing cross-linking strategies, the use of chemical cross-linkers introduces an additional layer of complexity demanding additional controls, and with its own attendant artifacts. Cross-linking notwithstanding, the problem of solubility has, for instance, bedevilled investigations into the mammalian nuclear envelope (NE), as well as focal adhesions at cell surface-substrate contacts. The former contains a number of poorly soluble components such as the nuclear lamina, whereas the latter are highly labile. In both systems, BioID has provided valuable new insights into the interactomes of these cellular structures (Dong et al., 2016, Kim and Roux, 2016). There are, nonetheless, some significant shortcomings in the standard BioID system. BioID cannot provide a “snapshot” of changing PoI interactions with the original pBPL repertoire, because biotinylation occurs during an incubation period of several hours. Instead, the technique provides a complete history of possible functional associations for the PoI, from its site of synthesis to its final cellular location. For very-short-duration time-resolved analyses, the use of recently developed high-activity pBPLs may complement available techniques such as APEX (Branon et al., 2017, Hung et al., 2014) and extend the range of tools at our disposal. It should also be clear that in addition to more or less stable PPIs, BioID also detects stochastic interactions, thus increasing the non-specific background over time. 
This necessitates the recognition and subsequent filtering of such background as an essential aspect of the BioID workflow. As with all tagging methods, the construction and expression of a fusion protein to mimic the behavior and the physiological interactions of its endogenous counterpart, both qualitatively and quantitatively, has its own limitations. For instance, with a mass of 35.1 kDa (83 residues larger than GFP), BioID1 could potentially affect the localization and/or function of the PoI. As a particular example, the presence of BioID1 may restrict passage through nuclear pore complexes (NPCs) of integral inner nuclear membrane (INM) proteins, in this way interfering with their correct targeting. Worman and colleagues indeed demonstrated that the nucleoplasmic domain of INM proteins hinders their entry into the nucleus when extended beyond 60–70 kDa (Soullam and Worman, 1995). A case in point is the lamina-associated polypeptide 2β (LAP2β), a resident protein of the INM, for which the addition of BioID1 to its N-terminus would exceed the NPC exclusion limit. This size problem can be circumvented to some extent by the recent development of a smaller 26.6-kDa second-generation pBPL (BioID2) from Aquifex aeolicus (Kim et al., 2016). However, for many INM proteins, BioID2 will still compromise intracellular sorting. To minimize issues associated with both the size of the BioID-PoI fusion protein (BioID1-PoI and to a lesser extent BioID2-PoI) and the ever-present challenge of non-specific background subtraction, we devised a modified BioID system. This new, two-component BioID (2C-BioID) takes advantage of the well-established FKBP:FRB (FK506-binding protein:FKBP-rapamycin-binding domain of mammalian target of rapamycin [mTOR]) oligomerization system (Putyrski and Schultz, 2012). In 2C-BioID, the PoI and pBPL are fused to FRB and FKBP. 
The two components, pBPL and PoI, are only brought together following FKBP:FRB oligomerization induced by the biologically inactive rapamycin analog AP21967. To assess the efficacy of 2C-BioID, we compared the interactomes of LAP2β generated with either conventional BioID or 2C-BioID. In addition, we employed both techniques to interrogate the interactomes of lamins A and C, major components of the nuclear lamina and the subjects of previous BioID studies. The two methods yielded overlapping but non-identical sets of potential interactors. Consistent with this, whereas the BioID1 moiety interfered with efficient targeting of LAP2β to the INM, no such impairment was observed when LAP2β was fused to the much smaller (∼11.2 kDa) FRB domain. Furthermore, FKBP-BioID activity in the absence of the AP21967 dimerizer provides an internal baseline for non-specific biotinylation. Accordingly, 2C-BioID affords a more stringent appraisal of potential false-positives, thereby improving the overall robustness of the assay.

Results

2C-BioID Results in the Appropriate Localization of LAP2β to the Inner Nuclear Membrane

Human LAP2β (∼51 kDa) is the largest of four resident INM proteins encoded by the TMPO gene. LAP2β is typical of tail-anchored transmembrane proteins, with the bulk of the molecule (∼46 kDa) exposed on the nucleoplasmic side of the INM. Its amino-terminal region harbors a LEM (LAP2, emerin, MAN1) domain, mediating its interaction with a small chromatin-associated protein, barrier-to-autointegration factor (BAF). The LEM domain is also found in the other transmembrane isoforms of LAP2 (ε, δ, γ), in its two soluble isoforms (α, ζ), and in several other INM components, including emerin, MAN1 (LEMD3), and LEM2. Although the role of LAP2β in normal cell physiology has yet to be fully elucidated, it has been implicated in transcriptional repression through its association with histone deacetylase 3 (Nili et al., 2001). Moreover, the interaction of LEM domain proteins, including LAP2β, with BAF appears to have a role in NE reassembly following mitosis (Gant et al., 1999). Studies of functional divergence among LAP2 isoforms have focused to a large extent on LAP2α, which has roles in aspects of cell cycle progression via interactions with A-type lamins and pRb (Dorner et al., 2006, Naetar et al., 2008). LAP2α has also been implicated in telomere maintenance, a function that is impaired in the premature aging syndrome Hutchinson-Gilford progeria (Chojnowski et al., 2015, Dechat et al., 2004, Vidak et al., 2015). Comparing the interactomes of LAP2α and LAP2β would therefore be of particular interest to further define their functional specializations. However, this is less straightforward than might first appear. Given the size of its nucleoplasmic domain, about 46 kDa, LAP2β presents specific challenges when using conventional BioID approaches. Because of size limitations, linking either BioID1 or BioID2 to the LAP2β N terminus may restrict access of the fusion protein to the INM.
To explore this further, we fused the BioID1 cDNA to the 5′ end of the corresponding LAP2β-coding sequence and used a doxycycline (DOX)-inducible lentiviral system to express the BioID-LAP2β fusion protein in human fibroblasts (Chojnowski et al., 2015). Both western blot analysis and immunofluorescence microscopy revealed that BioID-LAP2β was duly expressed upon DOX addition as shown in Figures 1A and 1B. However, although some of the BioID-LAP2β fusion protein did localize to the NE, a significant fraction remained in the peripheral endoplasmic reticulum (ER) (Figures 1B, S1A, and S1B). This suggests that BioID-LAP2β is unable to efficiently localize to the INM, thus complicating the analysis of subsequent BioID results. In contrast, V5-tagged LAP2β localized exclusively to the NE, indicative of correct INM targeting (Figures 1C and S1B). These findings are consistent with a diffusion retention model of INM protein sorting, whereby increasing the size of the nucleoplasmic domain beyond 60–70 kDa restricts passage between the outer nuclear membrane (ONM) and INM at the level of the NPC membrane (Lusk et al., 2007, Soullam and Worman, 1995).
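The size argument can be made concrete with a short calculation. The Python sketch below simply adds the approximate masses quoted in the text (the ∼46 kDa nucleoplasmic domain of LAP2β, BioID1 at 35.1 kDa, BioID2 at 26.6 kDa, and FRB at ∼11.2 kDa) and compares each total with the 60–70 kDa window reported by Soullam and Worman (1995). The function name, the three-way classification, and the omission of epitope tags and linkers are illustrative assumptions and are not part of the published analysis.

```python
# Approximate masses (kDa) quoted in the text. Epitope tags and linkers are
# ignored here, which is a simplifying assumption.
LAP2B_NUCLEOPLASMIC_KDA = 46.0
TAG_KDA = {"BioID1": 35.1, "BioID2": 26.6, "FRB": 11.2}

# Nucleoplasmic-domain size window reported by Soullam and Worman (1995).
LIMIT_LOW_KDA, LIMIT_HIGH_KDA = 60.0, 70.0


def classify_fusion(tag_kda, domain_kda=LAP2B_NUCLEOPLASMIC_KDA):
    """Return (total mass, verdict) for a tag fused to the nucleoplasmic domain.

    The three verdict labels are hypothetical names chosen for this sketch.
    """
    total = round(domain_kda + tag_kda, 1)
    if total > LIMIT_HIGH_KDA:
        verdict = "above limit"
    elif total >= LIMIT_LOW_KDA:
        verdict = "borderline"
    else:
        verdict = "below limit"
    return total, verdict


results = {tag: classify_fusion(kda) for tag, kda in TAG_KDA.items()}
for tag, (total, verdict) in results.items():
    print(f"{tag}-LAP2β nucleoplasmic domain: {total} kDa ({verdict})")
```

Under these assumptions, only the FRB fusion stays below the reported window, which is in line with the observation that BioID-LAP2β is partly retained in the ER whereas the smaller fusion reaches the INM.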

Figure 1

2C-BioID Allows Proper Localization of FRB-LAP2β and FKBP-BioID to the Nuclear Lamina/Envelope

(A) Western blot showing doxycycline-dependent expression of myc-tagged BioID-LAP2β in primary human fibroblasts using LAP2 antibody (top panel) and myc-tagged antibody (middle panel) as indicated. LAP2, myc, and GAPDH are shown.

(B) Immunofluorescence microscopy using lamin B1 antibody (left panels, red) and myc-tagged antibody with DAPI counterstaining (right panels), showing doxycycline-dependent expression of myc-tagged BioID-LAP2β in human fibroblasts and subcellular localization at the nuclear periphery/ER. Scale bar, 30 μm.

(C) Immunofluorescence microscopy using lamin B1 antibody (left panels, red) and V5-tagged antibody with DAPI counterstaining (right panels) showing doxycycline-dependent expression and appropriate subcellular localization at the nuclear periphery of V5-tagged LAP2β in human fibroblasts. Scale bar, 30 μm.

(D) Western blot showing inducible expression of V5-tagged FRB-LAP2β and constitutive expression of myc-tagged FKBP-BioID in primary human fibroblasts, using V5 antibody (top panel) and myc-tagged antibody (middle panel). Cell culture conditions (+/− biotin, +/− doxycycline, +/− AP21967) are indicated; V5-FRB-LAP2β, myc-FKBP-BioID, and GAPDH are shown.

(E) Immunofluorescence microscopy showing (left panels) lamin B1 (green) with DAPI counterstaining and V5-tagged antibody (red) and (right panels) lamin B1 (green) with DAPI counterstaining and myc-tagged antibody (red). Relocalization of myc-tagged FKBP-BioID to V5-tagged FRB-LAP2β occurs only upon addition of AP21967 and doxycycline-dependent induction (bottom right panel). Scale bar, 30 μm.

(F) Illustration of BioID-LAP2β and 2C-BioID assays and respective localizations of the resulting fusions BioID/protein of interest/FKBP.

Comparison of Conventional BioID and 2C-BioID-LAP2β Interactomes

To gauge the potential advantages and disadvantages of 2C-BioID versus conventional BioID, we used each system to generate a LAP2β interactome. For both systems, we followed the basic protocol originally described by Roux et al. (2012). Additional sample pairs were, nevertheless, generated depending upon the presence or absence of the dimerizing agent AP21967. Both BioID protocols specifically identified proteins of the INM and nuclear lamina, in particular MAN1 (LEMD3), emerin (EMD), and lamin A/C (LMNA) (Figure 2). As expected for the 2C-BioID approach, these proteins were only detected upon induced dimerization of FKBP-BioID and FRB-LAP2β. The conventional BioID-LAP2β method identified the same targets after DOX induction of the fusion protein and addition of biotin. In this case, the same spectrum of biotinylated proteins was duly detected in both the presence and absence of the dimerization compound. The BirA ligases used in the conventional BioID (BioID1), in the 2C-BioID (BioID2), and the FRB domain of mTOR used in the 2C-BioID (identified by mass spectrometry as mTOR) were also detected (Figure 2). Significantly, other NE components such as LAP1 (TOR1AIP1) and lamin B receptor (LBR) were differentially identified by the two BioID methods. Conventional BioID identified both TOR1AIP1 and LBR as potential interactors of LAP2β, whereas the 2C-BioID method indicated that their detection was most likely non-specific, as revealed by their lower overall BioID score (Figure 2A). The BioID score used here represents a composite index based on a combination of prey absolute abundance, abundance ratio between samples and controls, and scores available from the CRAPome database (see Transparent Methods) (Mellacheruvu et al., 2013). The likelihood that both proteins were non-specifically detected is reflected by the fact that for both TOR1AIP1 and LBR, significant levels were detected in the absence of dimerizer.
Several other proteins show similar differences between the two methods. The 2C-BioID score classifies them as low ranked, whereas in the absence of a priori functional input the conventional BioID assay suggests otherwise. Figures S2A and S2B show a gene ontology (GO) term enrichment analysis of proteins belonging to the two largest clusters of proteins identified in the LAP2β interactome using ClueGO (Bindea et al., 2009). These are mostly ribosome-related proteins for cluster 1 and actin-related proteins for cluster 2. As LAP2β is unlikely to have any extensive role related to either of these two clusters, this would indicate that most of the cluster constituents should be classified as background signal. Presumably many of these protein preys were biotinylated either owing to stochastic encounters with the bait or by transient proximity-based biotinylation occurring during the translation or trafficking of LAP2β to its final destination, i.e., the INM/lamina (Liu et al., 2018). It is also likely that some proteins could be endogenously biotinylated, or be non-specifically adsorbed to the streptavidin-coated beads in a biotin-independent fashion. Conventional BioID highlighted 18 such proteins as being of possible interest based solely on their score (BioID scores above 1), whereas only 3 (PLS3, TES, and transforming growth factor β1-induced transcript 1 [TGFβ1I1]) would be classified as such using the 2C-BioID scoring, a decrease of ∼83% (see also Figures S3A and S3B). The complete and unfiltered list of identified proteins, quantification, and BioID/2C-BioID scores is shown in Table S1.
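The logic of using the no-dimerizer condition as an internal baseline can be sketched in a few lines of Python. This is not the composite BioID score described in Transparent Methods. It is a deliberately simplified, hypothetical filter that compares each prey's abundance with and without AP21967 and flags preys whose signal barely depends on the dimerizer. The prey names, abundance values, pseudocount, and twofold threshold are all invented for illustration. The last line reproduces the ∼83% figure quoted above from the counts 18 and 3.

```python
def dimerizer_fold_change(with_dimerizer, without_dimerizer, pseudocount=1.0):
    """Ratio of prey abundance with vs. without dimerizer.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for preys that are absent from the no-dimerizer control.
    """
    return (with_dimerizer + pseudocount) / (without_dimerizer + pseudocount)


def flag_background(abundances, min_fold=2.0):
    """Map each prey to True if it looks dimerizer-independent (background).

    `abundances` maps prey name -> (abundance with AP21967, abundance without).
    The twofold cut-off is hypothetical, not a published threshold.
    """
    return {
        prey: dimerizer_fold_change(plus, minus) < min_fold
        for prey, (plus, minus) in abundances.items()
    }


# Invented abundances (a.u.) for two placeholder preys.
example = {
    "dimerizer_dependent_prey": (900.0, 50.0),
    "constitutive_prey": (400.0, 350.0),
}
flags = flag_background(example)
print(flags)

# Reduction in preys scoring above 1: 18 (conventional) vs. 3 (2C-BioID).
percent_decrease = 100 * (18 - 3) / 18
print(f"decrease: {percent_decrease:.1f}%")
```

A prey that is recovered at similar levels with and without dimerizer, as described above for TOR1AIP1 and LBR, would be flagged by this kind of comparison, whereas a prey detected only after induced dimerization would not.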

Figure 2

Comparison between 2C-BioID and BioID Analysis for the Nuclear Lamina and Selected Proteins

(A) Graphic representation of the network of selected proteins identified by 2C- and conventional BioID. The edge color of the nodes is mapped to the BioID score, from high score (red) to low score (blue, see Transparent Methods). The bar graphs of each node represent the relative quantity of its stated protein compared to its maximum value, across each condition (+/− biotin, +/− doxycycline, +/− AP21967, see inset on the right) for either 2C- or conventional BioID.

(B) Quantification of protein abundance (a.u.) from mass spectrometric analysis of the BioID assay for LAP2β, LEMD3, mTOR (FRB domain), BioID2 (FKBP-BioID), BioID1 (BioID-LAP2β), EMD, LMNA, LBR, TOR1AIP1, and HP1BP3. Values are represented as mean ± SEM, *p < 0.05, two-way ANOVA with Sidak's post-hoc test.

See also Figures S2 and S3 and Table S1.

Among the highest ranked candidates in 2C-BioID, we selected potential but currently unknown LAP2β interactors: TGFβ1I1, NHP2, SWAP70, STRN3, CRBN, AGFG1, and C1Orf198. We then used a yellow fluorescent protein (YFP)-based bimolecular fluorescence complementation (BiFC) assay to assess their association with LAP2β. In this system, PPI is revealed by reconstitution of a fluorescent YFP protein from two separate YFP segments fused to two separate putative partners (Bischof et al., 2018, Kerppola, 2008, Ohashi et al., 2012). As shown in Figure S4A, the interaction between free FRB and FKBP was specifically detected after induction of their dimerization with AP21967: no YFP fluorescence could be observed without the addition of the dimerizer, despite both FRB and FKBP sharing similar intracellular localization. To assess the interaction of the selected candidates with LAP2β, each protein was fused at its N or C terminus to the N- or C-terminal segments of the split YFP and co-transfected with the corresponding LAP2β constructs (Figure 3A).
Two of the candidates, TGFβ1I1/ARA55 (androgen receptor-associated protein of 55 kDa) and NHP2, were found to support LAP2β-dependent YFP fluorescence complementation and were thus chosen for further analysis (Figures 3C and 3D). Significantly, TGFβ1I1 and NHP2 were specifically identified as potential interactors by the 2C-BioID LAP2β and were not identified as such in the subsequent 2C-BioID lamin A or C experiments (Table S4).

Figure 3

Interaction of LAP2β with TGFβ1I1 and NHP2

(A) Fusion constructs and transfection strategy for BiFC using N- or C-terminal fusions of the N- (yellow, numbered 1 and 2) and C-terminal (orange, numbered 3 and 4) fragments of split-YFP (YFP-f) to the bait (X, blue), or to LAP2β (red).

(B) Graphic representation of TGFβ1I1 and NHP2 as identified by 2C- and conventional BioID. The bar graphs of each node represent the min-max quantity of its stated protein identified across both 2C- and conventional BioID experiments for each condition (+/− biotin, +/− doxycycline, +/− AP21967).

(C and D) YFP fluorescence in live HEK293T cells transfected with selected combinations of YFP fusions for (C) TGFβ1I1 and (D) NHP2, and corresponding LAP2β-only constructs fused to split-YFP N- or C-terminal fragments without associated YFP-fused bait. Scale bar, 50 μm.

(E and F) 3D structured illumination immunofluorescence microscopy using hemagglutinin (HA)- (top panels, red) and V5-tagged (middle panels, green) antibodies showing HA-tagged TGFβ1I1 (E) and HA-tagged NHP2 (F) localization in human dermal fibroblasts expressing doxycycline-dependent V5-tagged FRB-LAP2β. Scale bar, 2 μm.

2C-BioID Reduces the Level of False-Positives in the BioID-LAP2β Interactome

The BioID score provides a quantitative ranking of proteins identified by BioID, by taking into account their abundance and CRAPome representation (Mellacheruvu et al., 2013). As shown in Figure S6A, scores from 2C-BioID and conventional BioID-LAP2β analyses share similar profiles across their respective protein ranks. Furthermore, in both datasets at least half of the top 10% of putative interactors were classified as “nuclear,” if only transiently, by either GO or the Human Protein Atlas database (Uhlén et al., 2015). This corresponded to 34/54 and 27/54 nuclear proteins in the respective 2C- and conventional BioID datasets (Table S2). Considering the physiological localization of LAP2β at the INM, such an outcome is not unexpected. However, a comparison of the same top 10% of putative interactors for each BioID strategy revealed that only 10 of 54 proteins were common to both lists (Figure S6B, Table S2). Thus, although the two strategies share most of their respective bottom 90% hits, they differ significantly in their ranking of the putative top 10% hits. The average abundance ratio of the top 10% ranked proteins for 2C- or conventional BioID reveals a major difference in the outcome using the two methods (Figures 4A and 4B). As shown in Figure 4A, the top 10% of proteins identified and scored using the 2C-BioID method (blue bars) show the expected significant abundance ratio differences between control and sample conditions. The same proteins also show a significant difference in abundance between control and sample conditions using detection and scoring by the conventional BioID method (Figure 4A, gray bars). This indicates that the top hits identified by the 2C-BioID would not readily be discarded if the same experiment were to be performed using BioID.
In contrast, although the top 10% of proteins identified and scored using conventional BioID display the expected abundance profile (Figure 4B, gray bars), the same proteins identified and scored using 2C-BioID do not show the expected difference between sample and control conditions (Figure 4B, blue bars). This suggests that a proportion of the putative interactors identified using conventional BioID may be false-positives. To confirm this, we submitted to GO terms enrichment analysis the list of proteins that were in the top 10% for one BioID method but were ranked low for the other BioID approach (ranked by their scores in the bottom 70%, Table S3). As shown in Figures S6C and S6D, a subset of proteins that ranked high in BioID but low in the 2C-BioID belonged to the vesicle organization/Golgi transport cluster. Conversely, five proteins ranked high in 2C-BioID were found clustered around cell adhesion/molecule-binding GO terms. This suggests that at least part of the difference between conventional and 2C-BioID within the top hits arises from the very different ways that the BirA ligase reaches its final cellular destination: either fused to the PoI from the start of its translation or separated from the PoI until the dimerization is induced in 2C-BioID. Obviously, the inefficient targeting of BioID-LAP2β to the INM also contributes to this discrepancy. Nevertheless, the two methods show similar profiles of abundance ratios for the lowest ranked proteins (Figures 4C and 4D).
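The top-decile comparison described above amounts to ranking preys by score in each dataset, taking the top fraction of each ranking, and intersecting the two sets. The Python sketch below illustrates that procedure on toy data. The function names, the toy prey labels and scores, and the use of a 20% cut-off for the ten-prey toy example are assumptions made for illustration, and the real scores are those in Tables S1 and S2. The final line only restates the reported overlap of 10 of 54 proteins as a fraction.

```python
import math


def top_fraction(scores, fraction=0.10):
    """Return the set of prey names in the top `fraction` by score.

    `scores` maps prey name -> score. Ties are broken by input order,
    which is an arbitrary choice of this sketch.
    """
    n_top = max(1, math.ceil(len(scores) * fraction))
    ranked = sorted(scores, key=scores.get, reverse=True)
    return set(ranked[:n_top])


def shared_top_hits(scores_a, scores_b, fraction=0.10):
    """Preys ranked in the top `fraction` of both datasets."""
    return top_fraction(scores_a, fraction) & top_fraction(scores_b, fraction)


# Toy scores for ten placeholder preys (not real data).
toy_2c = dict(zip("ABCDEFGHIJ", [9, 8, 7, 6, 5, 4, 3, 2, 1, 0]))
toy_conventional = dict(zip("ABCDEFGHIJ", [9, 1, 2, 3, 4, 5, 6, 7, 0, 8]))

shared = shared_top_hits(toy_2c, toy_conventional, fraction=0.20)
print(sorted(shared))

# Overlap reported in the text: 10 of the 54 top-ranked proteins.
reported_shared_fraction = 10 / 54
print(f"{reported_shared_fraction:.3f}")
```

With the reported numbers, fewer than one in five of the top-ranked candidates are shared between the two strategies, which is why the two methods diverge mainly in their top hits rather than in the bulk of low-ranked preys.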

Figure 4

2C-BioID Facilitates the Identification of False-Positives and Non-specific Interactors

(A) Average min-max abundance ratio of the top 10% ranks based on the 2C-BioID scores, from 2C-BioID quantifications (blue bars), or from conventional BioID quantifications (gray bars).

(B) Average min-max abundance ratio of the top 10% ranks based on the conventional BioID scores, from 2C-BioID quantifications (blue bars), or from conventional BioID quantifications (gray bars).

(C) Average min-max abundance ratio of the bottom 90% ranks based on the 2C-BioID scores, from 2C-BioID quantifications (blue bars), or from conventional BioID quantifications (gray bars).

(D) Average min-max abundance ratio of the bottom 90% ranks based on the conventional BioID scores, from 2C-BioID quantifications (blue bars), or from conventional BioID quantifications (gray bars). Cell culture conditions (+/− biotin, +/− doxycycline, +/− AP21967) are indicated below. Values are represented as mean ± SEM, ∗∗p < 0.01, one-way ANOVA with Tukey's post-hoc test.

See also Figures S6 and S7 and Tables S2, S3, and S4.

To further explore the advantages of 2C-BioID, particularly in the elimination of false-positives, we extended this assay to two additional proteins, lamins A and C (Table S4). This enabled us to compare 2C-BioID analyses with our previously published conventional lamins A and C interactomes (Xie et al., 2016). To accomplish this, we functionally grouped the proteins identified in the lamin A, lamin C, conventional, and 2C-BioID assays. After filtering out the proteins unambiguously belonging to the background (conservatively only keeping BioID scores above 0.1, see Figure S7C), the results show that the proportion of proteins belonging to each of the identified functional groups common to all four experiments is extremely similar between 2C- and conventional BioID (Figure S7A).
Together with our previous conventional and 2C-BioID LAP2β comparison, this confirms that 2C-BioID is, at minimum, performing similarly to the conventional BioID. This also demonstrates that the recruitment of FKBP-BioID to the bait results in a biotinylation profile as specific as that of the conventional BioID, in which the BioID is permanently fused to the PoI. The distribution of the scores from the 2C- and conventional BioID is similar for both lamin A and lamin C BioID assays (Figure S7C), echoing our LAP2β results. Of note is the reduction of the BioID scores for the preys in the intermediate ranks, which suggests that 2C-BioID should be able to more efficiently discard potential preys compared with the conventional system. As shown in Figure S7B, functional inspection of the first 100 preys identified in both lamin A and lamin C assays indeed shows a reduction in groups related to nuclear import, including passage through NPCs, as well as vesicular trafficking. Taken together, both the lamin A and lamin C data, as well as the LAP2β results, lend support to the notion that 2C-BioID provides a simple strategy to screen out stochastic encounters of the PoI during its translation and shuttling to its final destination. Such encounters are not easily recognizable in the conventional BioID system, where the biotin-protein ligase is permanently fused to the bait PoI. In summary, our results demonstrate that 2C-BioID is able to identify the majority of potential interactors that would have been detected using the original BioID method, but with the added benefit of providing an internal control that eliminates many false-positives.

Discussion

The use of BioID as an entry point into interactome analysis has been steadily increasing since its first description by Roux and colleagues in 2012. The principal attributes of BioID are that it identifies potential PPIs within a physiologically relevant cellular environment while at the same time being insensitive to the solubility properties of both the bait and prey molecules. The method, nevertheless, has certain quirks. First, and the most important, it requires the addition of a sizable polypeptide, consisting of a pBPL, to the PoI. This additional polypeptide, of 26–35 kDa, could interfere with PoI localization or function. Second, because the pBPL is constitutively active, biotinylation of vicinal proteins may commence as soon as the pBPL-PoI fusion protein is produced by the cellular translation machinery. As described in the current study, for N-terminal fusion proteins, this may occur even before synthesis is complete and would account for the almost unvarying representation of ribosomal proteins in BioID experiments. The situation is further complicated by the fact that the PoI may, of necessity, associate with the cellular machinery involved in delivery to its final destination. For instance, lamins A and C must engage with soluble nuclear import receptors to traverse the NE via NPCs. Similarly, LAP2β and other INM proteins, which are initially inserted either post- or co-translationally into the ER or ONM, must transit the NPC membrane domain to reach their final destination. It should come as no surprise then, that assorted components of the nuclear import machinery should be detected in these experiments. As the NPC imposes a size limit on the movement of membrane proteins between the ONM and INM (Lusk et al., 2007, Soullam and Worman, 1995), the most recent BioID studies on the NE and nuclear lamina interactome have made use of the smaller (26 kDa) second-generation pBPL (BioID2) (Birendra et al., 2017). 
Our 2C-BioID system, in which FRB (∼10 kDa) replaces the pBPL, was designed in part to further reduce interference with the bait protein's localization and physiological interactions. Only after the FRB-PoI fusion protein reaches its proper cellular localization does the FKBP counterpart of the oligomerization system couple the BioID2 moiety to the PoI. This provides additional flexibility in the use of BioID to map interactomes, because many proteins can be distributed between two or more cellular compartments. By controlling FKBP-BioID cellular localization, it would therefore be possible to map interactomes within specific cell compartments, using the very same FRB-PoI construct. As an example, the interactomes of nuclear versus cytoplasmic pools of a given PoI could be monitored in cell lines harboring FKBP-BioID bearing either a nuclear localization sequence or a nuclear export sequence. In addition, the FKBP:FRB oligomerization occurs in the presence of dimerizer (Putyrski and Schultz, 2012) over a timescale far shorter than the biotin labeling period, and therefore imposes few, if any, limits on time-constrained PPI studies using 2C-BioID. When generating the interactome of LAP2β, the 2C-BioID approach clearly sidestepped problems associated with inefficient targeting of conventional BioID-LAP2β fusion proteins. The localization of FRB-LAP2β was equivalent to that observed with V5-LAP2β, the widely used V5-tag being only 14 amino acids long. In the particular case of LAP2β, the replacement of BioID1 by the smaller and newer variant BioID2 should improve the overall localization of the PoI, if only partially. For bulkier proteins the use of FRB in 2C-BioID would clearly represent the most favorable strategy.
In the case where even minimal tagging of a PoI would interfere with its physiological activities, more global approaches such as protein correlation profiling or cellular thermal stability shift assay would be recommended, because both avoid protein fusions altogether (Dai et al., 2018, Larance et al., 2016). Size notwithstanding, the principal advantage of 2C-BioID is its capacity to filter out background noise and associated false-positives, a major complication of the conventional system. Here we have shown that this can be achieved without compromising or skewing the detection of candidate interactors. Indeed, interactors that most likely belong to the background noise inherent to this type of experiment (the lowest ranked identifications) were quite similar between 2C- and conventional BioID. The main difference was seen in the highest ranked candidate interactors: in 2C-BioID such interactors were also highly ranked in the conventional BioID. Strikingly, the converse was not true: the highest ranked interactors identified by the conventional BioID were not consistently highly ranked in the 2C-BioID strategy. In this case, the 2C-BioID results indicated that their identification was most probably due to non-specific detection rather than to actual interactions with the PoI. This is consistent with the fact that even in the lowest ranked potential interactors, conventional BioID still indicated a trend toward increased detection of false-positives when compared with 2C-BioID. This highlights the principal advantage of 2C-BioID: the constitutively active biotin ligase in the absence of dimerizer can be used as an effective internal control to identify non-specific interactors and background chatter. Furthermore, this is done in the same cellular context, in the presence of exactly the same PoI and pBPL fusion proteins. There is no requirement for the preparation of separate cell lines or controls, the only variable being the presence or absence of dimerizer. 
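The internal-control idea described above can be illustrated with a minimal sketch, which assumes that each protein has a single abundance value (for example, a spectral count) in the dimerizer-treated sample and in the matched no-dimerizer control. The function name, the pseudocount, the enrichment threshold, the placeholder protein names, and the counts are all hypothetical and chosen only for illustration; they are not the scoring scheme or data of this study.

```python
import math


def filter_interactors(plus_dimerizer, minus_dimerizer,
                       min_log2_ratio=1.0, pseudocount=1.0):
    """Rank proteins by enrichment in the +dimerizer sample over the
    -dimerizer control and drop those below a threshold.

    Both inputs map protein name -> abundance (e.g. spectral counts).
    The threshold and pseudocount are illustrative assumptions.
    """
    scored = []
    for protein, plus in plus_dimerizer.items():
        # Proteins absent from the control are treated as zero abundance.
        minus = minus_dimerizer.get(protein, 0.0)
        log2_ratio = math.log2((plus + pseudocount) / (minus + pseudocount))
        if log2_ratio >= min_log2_ratio:
            scored.append((protein, log2_ratio))
    # Highest enrichment first.
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored


# Made-up counts, for illustration only (not data from the study).
plus = {"bait_neighbor_1": 40, "bait_neighbor_2": 30,
        "ribosomal_protein": 22, "chaperone": 15}
minus = {"bait_neighbor_1": 4, "bait_neighbor_2": 2,
         "ribosomal_protein": 20, "chaperone": 16}

candidates = filter_interactors(plus, minus)
print([name for name, _ in candidates])
```

In this toy input, proteins labeled equally with or without dimerizer (the ribosomal and chaperone placeholders) fall below the threshold, whereas proteins whose labeling depends on dimerizer-induced recruitment of the ligase to the PoI are retained. A real analysis would additionally need replicates and a statistical model.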
For these reasons, the 2C strategy should be easily applicable to similar proximity labeling methods such as TurboID or APEX (Branon et al., 2017, Kim and Roux, 2016, Lam et al., 2015). In contrast to BioID and TurboID, APEX promotes biotinylation of vicinal proteins by means of an ascorbate peroxidase enzyme employing hydrogen peroxide and biotin phenol as substrates. APEX's extremely high activity makes it particularly suitable for time-resolved studies and provides an effective labeling method to analyze the entire proteome of membrane-limited compartments, mitochondria being a case in point (Rhee et al., 2013). TurboID and miniTurbo, evolved variants of the BioID enzyme, were similarly developed with improved proximity labeling efficiency in mind. However, targeting or localization of bait proteins in TurboID or APEX assays may equally be restricted by the bulk of the TurboID (35 kDa), miniTurbo (28 kDa), or peroxidase (28 kDa) enzymes. The former is akin to BioID1 in size, whereas the latter two more closely resemble BioID2. Our 2C-BioID system should readily accommodate the new-generation high-activity TurboID enzymes as well as the APEX ascorbate peroxidase tag. In each of these situations, this system should reduce problems associated with bait protein localization and functionality, while at the same time providing a built-in control for non-specific labeling. Taken as a whole, our 2C approach should significantly reduce the number of false-positives that bedevil these related proximity labeling techniques. In another recent development, Schopp et al. (2017) introduced a split-BioID system in which the BirA enzyme itself is divided into a pair of inactive N- and C-terminal fragments. The biotin ligase activity is restored only when the two fragments are brought together. 
This system provides an exemplary method to validate binary associations by fusing N- and C-terminal BirA domains to known interacting partners and can identify additional interactors within multiprotein complexes. Clearly, this may not be feasible in all molecular contexts because it requires the initial constructive modification of two discrete interacting partners. Notwithstanding this proviso, it has been employed successfully to probe new associations within microRNA processing complexes. We would therefore argue that 2C-BioID in all of its guises and split-BioID provide highly complementary tools in the identification and validation of PPIs in vivo. 2C-BioID is ideally suited to the efficient detection of new protein pairs while eliminating spurious interactions, whereas split-BioID will highlight the mutual associations of known partners. In this way, 2C-BioID will underpin subsequent split-BioID analyses. Taken together, our results show that 2C-BioID brings significant improvements to conventional BioID. In addition, generating stable lines or animal models constitutively expressing FKBP-BioID should be straightforward. This will facilitate subsequent 2C-BioID analyses that will have no more experimental complexity than conventional BioID. Together with other novel refinements designed for whole-proteome analysis or conversely for specific purposes, 2C-BioID should further extend and improve our current capabilities in refining both global and subcellular protein network analyses.

Limitations of the Study

BioID is a proximity labeling method that can be used to map PPIs. A significant advantage over conventional methods is that it can be applied to proteins in their normal physiological contexts. BioID is also insensitive to the solubility properties of both the PoI and its potential interactors. There are, however, limitations to the technique. First, attachment of a relatively large biotin-protein ligase to the PoI may interfere with its correct targeting. Second, BioID will detect all associations, stochastic and otherwise, that the PoI experiences during its entire lifetime, thereby generating significant background identifications. 2C-BioID was originally devised to circumvent the protein targeting problem, but its most critical feature is its ability to computationally reduce the non-specific background. This still leaves an important limitation of the 2C-BioID system: because it relies on proximity labeling, and hence detects near neighbors, the method cannot be used to formally demonstrate interactions. It is essential, therefore, that all candidate PPIs identified by 2C-BioID be validated using other independent approaches. To this end, we employed bimolecular fluorescence complementation and super-resolution microscopy; an alternative scheme could involve pull-downs from cell lysates, although this may miss weak interactions as well as interactions involving poorly soluble proteins. Ultimately, multi-faceted strategies may be required to confirm potential PPIs.

Methods

All methods can be found in the accompanying Transparent Methods supplemental file.

References (36 in total)

1. LAP2alpha and BAF transiently localize to telomeres and specific regions on chromatin during nuclear assembly.

Authors: Thomas Dechat; Andreas Gajewski; Barbara Korbei; Daniel Gerlich; Nathalie Daigle; Tokuko Haraguchi; Kazuhiro Furukawa; Jan Ellenberg; Roland Foisner
Journal: J Cell Sci Date: 2004-11-16 Impact factor: 5.285

Review 2. Highway to the inner nuclear membrane: rules for the road.

Authors: C Patrick Lusk; Günter Blobel; Megan C King
Journal: Nat Rev Mol Cell Biol Date: 2007-04-18 Impact factor: 94.444

3. Mutations in the telomerase component NHP2 cause the premature ageing syndrome dyskeratosis congenita.

Authors: Tom Vulliamy; Richard Beswick; Michael Kirwan; Anna Marrone; Martin Digweed; Amanda Walne; Inderjeet Dokal
Journal: Proc Natl Acad Sci U S A Date: 2008-06-03 Impact factor: 11.205

4. Loss of nucleoplasmic LAP2alpha-lamin A complexes causes erythroid and epidermal progenitor hyperproliferation.

Authors: Nana Naetar; Barbara Korbei; Serguei Kozlov; Marc A Kerenyi; Daniela Dorner; Rosana Kral; Ivana Gotic; Peter Fuchs; Tatiana V Cohen; Reginald Bittner; Colin L Stewart; Roland Foisner
Journal: Nat Cell Biol Date: 2008-10-12 Impact factor: 28.824

5. Nuclear membrane protein LAP2beta mediates transcriptional repression alone and together with its binding partner GCL (germ-cell-less).

Authors: E Nili; G S Cojocaru; Y Kalma; D Ginsberg; N G Copeland; D J Gilbert; N A Jenkins; R Berger; S Shaklai; N Amariglio; F Brok-Simoni; A J Simon; G Rechavi
Journal: J Cell Sci Date: 2001-09 Impact factor: 5.285

Review 6. Bimolecular fluorescence complementation (BiFC) analysis as a probe of protein interactions in living cells.

Authors: Tom K Kerppola
Journal: Annu Rev Biophys Date: 2008 Impact factor: 12.981

7. Hic-5 controls BMP4 responses in prostate cancer cells through interacting with Smads 1, 5 and 8.

Authors: D T N Shola; H Wang; R Wahdan-Alaswad; D Danielpour
Journal: Oncogene Date: 2011-09-26 Impact factor: 9.867

8. Roles of LAP2 proteins in nuclear assembly and DNA replication: truncated LAP2beta proteins alter lamina assembly, envelope formation, nuclear size, and DNA replication efficiency in Xenopus laevis extracts.

Authors: T M Gant; C A Harris; K L Wilson
Journal: J Cell Biol Date: 1999-03-22 Impact factor: 10.539

9. Lamina-associated polypeptide 2alpha regulates cell cycle progression and differentiation via the retinoblastoma-E2F pathway.

Authors: Daniela Dorner; Sylvia Vlcek; Nicole Foeger; Andreas Gajewski; Christian Makolm; Josef Gotzmann; Christopher J Hutchison; Roland Foisner
Journal: J Cell Biol Date: 2006-04-10 Impact factor: 10.539

10. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks.

Authors: Gabriela Bindea; Bernhard Mlecnik; Hubert Hackl; Pornpimol Charoentong; Marie Tosolini; Amos Kirilovsky; Wolf-Herman Fridman; Franck Pagès; Zlatko Trajanoski; Jérôme Galon
Journal: Bioinformatics Date: 2009-02-23 Impact factor: 6.937
