Literature DB >> 27834739

Systematic approaches to identify E3 ligase substrates.

Mary Iconomou1,2, Darren N Saunders3.   

Abstract

Protein ubiquitylation is a widespread post-translational modification, regulating cellular signalling with many outcomes, such as protein degradation, endocytosis, cell cycle progression, DNA repair and transcription. E3 ligases are a critical component of the ubiquitin proteasome system (UPS), determining the substrate specificity of the cascade by the covalent attachment of ubiquitin to substrate proteins. Currently, there are over 600 putative E3 ligases, but many are poorly characterized, particularly with respect to individual protein substrates. Here, we highlight systematic approaches to identify and validate UPS targets and discuss how they are underpinning rapid advances in our understanding of the biochemistry and biology of the UPS. The integration of novel tools, model systems and methods for target identification is driving significant interest in drug development, targeting various aspects of UPS function and advancing the understanding of a diverse range of disease processes.
© 2016 The Author(s).

Entities:  

Keywords:  cancer; functional genomics; neurodegeneration; proteomics; proteostasis; ubiquitin

Mesh:

Substances:

Year:  2016        PMID: 27834739      PMCID: PMC5103871          DOI: 10.1042/BCJ20160719

Source DB:  PubMed          Journal:  Biochem J        ISSN: 0264-6021            Impact factor:   3.857


Ubiquitin proteasome system

Conjugation of the small protein modifier ubiquitin (Ub) is one of the most abundant post-translational modifications in cellular signalling. Ubiquitylation regulates a wide range of cellular processes, including protein degradation [1,2], endocytosis [2-4], cell cycle progression [5,6], DNA repair [7,8], transcription [9,10], translation [11] and immune function [12]. The fate of individual substrates following ubiquitylation is determined by ubiquitin chain topology, which acts as a molecular code for substrate fate [13]. Deregulation of the tightly regulated ubiquitin proteasome system (UPS) has been implicated in various human diseases, including cancer, autoimmunity, neurodegeneration and infectious diseases [14-17]. Hence, there is significant interest in defining the biochemistry of this pathway — both to better understand pathophysiology and to underpin the development of novel therapies. While efforts to understand other post-translational modifications — such as phosphorylation and acetylation — and exploit them therapeutically — have progressed rapidly, identifying cellular targets/substrates of the UPS has been hindered by the complex hierarchical biochemistry of the system and significant technical challenges. The ubiquitylation cascade (Figure 1) requires the action of three enzymes — Ub-activating enzyme (E1), Ub-conjugating enzyme (E2) and Ub protein ligase (E3 ligase) — in an energy-dependent hierarchical cascade [18]. Initially, the C-terminus of Ub is activated by a two-step ATP-dependent reaction in which a thioester bond is formed with a cysteine residue of E1. Ub is subsequently transferred to a cysteine residue of E2 via a thioester transfer reaction. E3s recruit Ub-loaded E2 enzymes and then either directly catalyze the ubiquitylation of the substrate lysine residue or facilitate the transfer of Ub to the substrate from the E2 via HECT (homology to E6-Ap carboxyl terminus), RING (really interesting new gene) or RBR (RING-in-between-RING) domains [19-21]. Recently, the SidE effector protein family of Legionella pneumophila was shown to directly ubiquitylate substrates via an mART motif. This mechanism operates without the need for E1/E2 enzymes or ATP, and activates ubiquitin using nicotinamide adenine dinucleotide [22]. Interestingly, the mART motif is also present in a family of mammalian proteins, indicating the potential for this all-in-one enzyme activity to exist in mammals [22]. Ubiquitin-specific proteases (USPs) [also known as deubiquitinases (DUBs)] act as reciprocal regulators of ubiquitlyation by catalyzing the removal of Ub from substrate proteins. Substrate ubiquitylation can signal many cellular fates, depending on chain topology, but one of the best-characterized outcomes is degradation by the proteasome.
Figure 1.

Challenges in the identification of E3 ligase substrates.

1. The dynamic nature of protein ubiquitylation. 2. Weak and/or transient interactions between ligase and substrate. 3. Significant degrees of redundancy and multiplicity. 4. Rapid destruction of many ubiquitylated proteins.

Challenges in the identification of E3 ligase substrates.

1. The dynamic nature of protein ubiquitylation. 2. Weak and/or transient interactions between ligase and substrate. 3. Significant degrees of redundancy and multiplicity. 4. Rapid destruction of many ubiquitylated proteins. One E1 enzyme (UBA1) is encoded in the human genome, with two known isoforms (nuclear and cytoplasmic localizations) [23,24]. Thirty-eight E2s have been identified (reviewed by ref. [25]), and there are ∼100 known DUBs [26]. However, substrate specificity is largely determined by >600 putative E3s identified to date [19,27]. It should be noted that the catalytic activities of many putative E3s and DUBs are yet to be validated. A key feature of the UPS is a significant degree of redundancy and multiplicity, where individual protein substrates may be targeted by multiple E3s (or DUBs), and a single E3 (or DUB) may have multiple protein substrates [28]. An emerging theme in understanding the regulation of protein ubiquitylation comes from the observation that E3/DUB pairs can act in a highly co-ordinated manner to regulate ubiquitylation (and hence activity) of each other and downstream substrates, although only a few examples have been observed to date [29-31]. The E3 ligase:DUB pair of Rsp5:Ubp2 acts in a co-ordinated fashion, with Ubp2 antagonizing Rsp5 E3 ligase activity by forming a complex together with Rsp5 and Rup1 that deubiquitylates Rsp5 substrates [29]. The DUB USP9X has been shown to stabilize the E3 ligase SMURF1 in MDA-MB-231 cells [31], and another interesting example is the reciprocal regulation of abundance and activity of the E3 ligase:DUB pair of UBR5 and DUBA, which controls IL-17 production in T cells. UBR5 destabilizes DUBA through ubiquitylation, whereas DUBA stabilizes UBR5 in activated T cells by attenuating degradative auto-ubiquitylation, triggering UBR5-mediated ubiquitylation of the transcription factor RORyt in response to TGF-β [30]. Most components of the UPS — from E1 through to the proteasome — are under active investigation as potential therapeutic targets in cancer and other indications [32-35]. The best-known compound targeting the UPS is the dipeptide boronic bortezomib (velcade) [36,37], a 26S proteasome inhibitor approved for clinical use in multiple myeloma [38]. Thalidomide and its analogues (lenalidomide-CC-5013 and pomalidomide-CC-4047) are immunomodulatory drugs (IMiDs) approved for clinical use in the treatment of multiple myeloma (reviewed in ref. [39]). Although the precise molecular mechanism remains unclear, IMiDs are thought to exert their teratogenic and anti-tumour effects by targeting cereblon (CRBN) [40]. The interaction disrupts the function of the E3 ligase complex constituted by CRBN, DDB1 and Cul4, causing down-regulation of fibroblast growth factor genes [40,41]. Apcin (APC inhibitor) is a novel small molecule inhibitor of another E3 ligase complex: anaphase-promoting complex/cyclosome (APC/C) [42]. Apcin binds to and competively inhibits Cdc20, a substrate recognition co-receptor of APC/C [42]. This effect is amplified by the co-addition of another small molecule inhibitor — tosyl-l-argine methyl ester — which also blocks the APC/C–Cdc20 interaction [42]. Another strategy is to target the UPS by inhibiting DUB activity. Currently, there are several small molecule inhibitors in the pipeline, such as HBX 19818 and P050429, which selectively target USP7/HAUSP (reviewed in ref. [34]). More recently, ML323 has been reported as a potent inhibitor of the USP1UAF1 deubiquitinase complex, revealing USP1UAF1 to be an important regulator of the DNA damage response and a target for overcoming platinum-based chemotherapy resistance [43]. Several studies have shown that ubiquitin variants (UbVs) can also function as highly specific inhibitors or activators for various enzymes, including E3s and DUBs. For example, UbVs have been developed to either inhibit HECT E3s by binding to the E2-binding site or activate by targeting an ubiquitin-binding exosite [44]. Furthermore, exploitation of the low-affinity interactions between Ub and enzymes of the ubiquitin system by engineered optimization has produced potent and selective modulators of UPS components, including DUBs, E2s and E3s [45]. These UbVs will be very useful tools to elucidate the function of E3s by modulating their activity. As E3 ligases primarily determine the substrate specificity of the UPS they may represent more specific targets for inhibition [46,47]. The pursuit of inhibitors that target specific E3 ligase activity or block E3–substrate interactions led to the development of Nutlins (cis-imidazoline compounds), which are currently in clinical trials for various indications. Nutlins prevent binding of the E3 ligase MDM2 to its substrate, the tumour suppressor p53 [19,48-51]. Another compound currently in clinical trials for cancer is the Smac mimetic GDC-0152, which binds to anti-apoptotic proteins (IAPs), inducing IAP self-ubiquitylation and degradation, and consequently apoptosis [52]. Although E3 ligases make promising therapeutic targets, efforts into developing therapeutics targeting E3 ligases have so far been relatively ineffective — with no drugs currently approved for clinical use. This is partly due to limited understanding of ligase–substrate relationships and biological function. A key to the development of effective therapeutics is the identification of E3 ligase substrates and, conversely, the ligases are targeting a specific substrate. These will not only advance understanding of functional roles but also are necessary for the development of functional assays for drug screening and associated discovery and the validation of relevant biomarkers.

E3 ligase substrate identification

Many of the ∼600 E3 ligases identified in the human genome remain relatively uncharacterized [19] and identification of E3 ligase substrates is notoriously difficult. For a few well-characterized E3 ligases, conserved targeting sequences in substrates have been identified, allowing prediction and validation of putative substrates, for example, N-degrons recognized by UBR family ligases, and Nedd4 WW domains, which bind proline-rich domains in target proteins [53-56]. Ligase–substrate recognition can also be modulated by post-translational modifications. For example, the E3 ligase c-Cbl binds to phosphorylated tyrosine motifs in substrates, and ubiquitylation and degradation of phosphoenolpyruvate carboxykinase is mediated via acetylation-dependent recognition by UBR5 under conditions of high glucose [57]. However, in most cases, the identification and validation of substrates of an individual E3 ligase, or the specific ligase(s) targeting a particular substrate, have relied on relatively slow, low-throughput biochemical methods.

Challenges

Many factors present significant technical challenges in identifying E3 ligase substrates or identifying E3 ligases targeting a specific substrate (Figure 1). 1. The dynamic nature of protein ubiquitylation. Ubiquitylation is under tight control of E3 ligases and DUBs acting in a highly co-ordinated manner to edit or remove ubiquitin chains in response to changing cellular environments [58]. 2. Weak and/or transient interactions between ligase and substrate. The weak physical interaction and rapid dissociation rate between some E3–substrate complexes mean that identification by immunoprecipitation, followed by mass spectrometry, is challenging [59]. The low cellular abundance of substrates may also hinder identification [60]. This dynamic equilibrium between E3 ligase and substrate highlights the need for more sensitive techniques and is partly addressed by ‘trapping’ approaches (see below). 3. Significant degrees of redundancy and multiplicity. Any particular substrate may be targeted by multiple E3 ligases at different sites, and a single E3 ligase may target multiple substrates under different conditions or in different cellular compartments. This drives a huge diversity in spatial and temporal control of ubiquitylation (reviewed by ref. [61]). Cellular context is an important consideration, as substrate–ligase pairs identified by biochemical methods may not be expressed or interact in the same sub-cellular compartment. 4. Rapid destruction of many ubiquitylated proteins. The identification of ubiquitylated proteins destined for degradation by the proteasome can be difficult without the use of proteasome inhibitors, and these may produce other confounding biological effects [62]. A complicating factor in E3 ligase substrate identification may be the relative insensitivity of many methods to post-translational modifications of ubiquitin, E3 ligases or substrates — which have the potential to alter activity and substrate binding [63,64]. The relatively recent development of systematic approaches to identify E3 ligase substrates (shown in Figure 2) is providing an unprecedented mechanistic insight into ubiquitin signalling. In a very short timeframe, the defined cellular ubiquitome has expanded by orders of magnitude to now include a significant proportion of the total proteome — with >23 000 individual modification sites mapping to >4600 proteins [65,66]. In particular, integrated approaches combining genetic models (i.e. modulating E3 ligase expression or activity) with powerful functional genomics or proteomics have rapidly expanded our understanding of breadth, depth and diversity of the ubiquitin-modified proteome (ubiquitome) in various contexts [67-69]. The various platforms and techniques present their own advantages and disadvantages, depending on the study context and available model systems (Table 1). Differences in the catalytic mechanisms of HECT, RING and RBR classes of E3 ligases also present slightly different challenges to the various methods for the identification of substrates. These include different mechanisms of Ub binding and transfer, variations in regulatory mechanisms and the need for accessory/adaptor proteins. Unfortunately, there is little empirical evidence comparing the relative performance of each method against different E3 classes, making it difficult to draw definitive conclusions.
Figure 2.

Diversity of systematic approaches available to identify E3 ligase substrates.

Table 1.

Comparison of systematic approaches to identify E3 ligase substrates

MethodAdvantagesDisadvantages
Yeast two-hybrid (Y2H)

Relatively fast, simple, inexpensive.

Rapidly identifies essential interacting domains using deletion mapping.

Detects type I false-positives via segregation analysis. Detects type II and III false-positives with further subcloning.

Split-Ub membrane-based Y2H system detects PPIs in various sub-cellular localizations (e.g. cytoplasm or cell membranes).

Synthetic random peptide library can be used to rapidly screen (e.g. protein interaction domains that bind to the protein interaction or catalytic domain of the E3 ligase of interest).

High false-positive and false-negative rates — up to 70%.

Bait:prey fusion proteins may be toxic to yeast or Escherichia coli cells. May not transcribe/fold correctly without cofactors, adaptors or essential PTMs.

Validation of hits with an independent assay such as co-IP is necessary.

Mammalian proteins may be above the maximum size yeast or E.coli are able to produce. Gene fragments may need to be used.

True-positives need to be sequenced to identify/confirm the gene that interacts with bait gene.

In vitro ubiquitylation: phage display and luminescent

Relatively fast, simple, inexpensive.

Human (or other) cDNA library used.

Recombinant proteins required – may not be feasible for large proteins or proteins that need cofactors for correct folding.

Limited by the number of genes in library.

Not suitable for membrane-bound proteins.

Vector used has a limited amino acid range and thus truncated proteins often need to be used.

Protein microarrays

Human cDNA library used.

Recombinant proteins can be produced in a high-throughput manner.

Thousands of proteins can be screened at a time.

Recombinant proteins required — may not be feasible for large proteins or proteins that need cofactors for correct folding.

Polyubiquitylation or multi-monoubiquitylation may amplify fluorescent signal — may bias against the detection of monoubiquitylation.

Limited by the number of genes in library.

Printing of proteins onto array may affect structure, accessibility of lysines and/or interactions with E3 ligase.

Global protein stability (GPS) profiling

Human cDNA library used.

Performed in living cells.

Stability of proteins can be monitored over time.

Limited by the number of genes in library.

The N-terminal tag may disrupt protein folding and ubiquitylation.

Need to clone library into reporter expression construct — requires specialized technical skills.

High-throughput quantitative microscopy (HCA)

Compatible with ORFeome library.

Performed in living cells.

Limited by the number of genes in library.

N-terminal fusion tag may disrupt protein folding and ubiquitylation.

Need to clone each gene into the reporter construct.

Requires isogenic knockout model.

Differential expression (shotgun) proteomics

Unbiased screen, examine global changes in protein expression following the modulation of E3 activity/expression.

Can alter E3 activity/expression using genetic modulation or chemical inhibition.

Relies on genetic modulation or chemical inhibition of E3 activity/expression.

Identifies proteins based on their stability, therefore, biased towards substrates fated for degradation.

Substrates identified may not be direct targets of the E3 ligase.

Affinity purification proteomics: Ub, TUBEs, PAC-Compass

His-tagged substrates are efficiently purified.

Compatible with domain- or site-specific variants to determine Ub chain topology and interaction domains.

Relies on the overexpression of tagged bait proteins (Ub and E3).

Fusion tag may disrupt protein folding and/or ubiquitylation.

Not all interacting proteins identified are necessarily substrates.

Ubiquitin ligase trapping and proximity labelling

Can detect transient, low-affinity E3–substrate interactions and low-abundance substrates.

UBAITs capture proteins that interact with the E3 in an E1- and E2-dependent manner.

Using different TUBEs, is it possible to detect different polyubiquitin chain topologies.

TUBEs can isolate endogenous ubiquitylated proteins.

NEDDylator enriches for proteins that are specifically ubiquitylated by E3, not just interacting proteins in close proximity.

UBAITs suitable for both HECT and RING E3 ligases.

Relies on the overexpression of fusion proteins, tags may interfere with binding or activity.

Low-abundance monoubiquitylated proteins may not be detected if the main population of that same protein is polyubiquitylated.

Polyubiquitylated proteins may attract their unmodified or monoubiquitylated forms present in a heterotetramer (e.g. p53).

Many different types of UBA domains with different chain topology preference — potential source of bias.

BioID detects proteins in close proximity to E3, not necessarily substrates.

NEDDylator may be restricted to RING domain ligases

DiGly remnant affinity proteomics

Accurately identifies and quantifies specific endogenous ubiquitylation sites.

Can examine the global ubiquitylation site changes following the modulation of E3 activity/expression.

Low abundant sites can be enriched and identified.

Validation of ubiquitylation sites identified is difficult.

This approach does not differentiate between mono- or poly-ubiquitylated sites.

Expensive due to antibody and substantial amount of starting lysate needed.

Relatively fast, simple, inexpensive. Rapidly identifies essential interacting domains using deletion mapping. Detects type I false-positives via segregation analysis. Detects type II and III false-positives with further subcloning. Split-Ub membrane-based Y2H system detects PPIs in various sub-cellular localizations (e.g. cytoplasm or cell membranes). Synthetic random peptide library can be used to rapidly screen (e.g. protein interaction domains that bind to the protein interaction or catalytic domain of the E3 ligase of interest). High false-positive and false-negative rates — up to 70%. Bait:prey fusion proteins may be toxic to yeast or Escherichia coli cells. May not transcribe/fold correctly without cofactors, adaptors or essential PTMs. Validation of hits with an independent assay such as co-IP is necessary. Mammalian proteins may be above the maximum size yeast or E.coli are able to produce. Gene fragments may need to be used. True-positives need to be sequenced to identify/confirm the gene that interacts with bait gene. Relatively fast, simple, inexpensive. Human (or other) cDNA library used. Recombinant proteins required – may not be feasible for large proteins or proteins that need cofactors for correct folding. Limited by the number of genes in library. Not suitable for membrane-bound proteins. Vector used has a limited amino acid range and thus truncated proteins often need to be used. Human cDNA library used. Recombinant proteins can be produced in a high-throughput manner. Thousands of proteins can be screened at a time. Recombinant proteins required — may not be feasible for large proteins or proteins that need cofactors for correct folding. Polyubiquitylation or multi-monoubiquitylation may amplify fluorescent signal — may bias against the detection of monoubiquitylation. Limited by the number of genes in library. Printing of proteins onto array may affect structure, accessibility of lysines and/or interactions with E3 ligase. Human cDNA library used. Performed in living cells. Stability of proteins can be monitored over time. Limited by the number of genes in library. The N-terminal tag may disrupt protein folding and ubiquitylation. Need to clone library into reporter expression construct — requires specialized technical skills. Compatible with ORFeome library. Performed in living cells. Limited by the number of genes in library. N-terminal fusion tag may disrupt protein folding and ubiquitylation. Need to clone each gene into the reporter construct. Requires isogenic knockout model. Unbiased screen, examine global changes in protein expression following the modulation of E3 activity/expression. Can alter E3 activity/expression using genetic modulation or chemical inhibition. Relies on genetic modulation or chemical inhibition of E3 activity/expression. Identifies proteins based on their stability, therefore, biased towards substrates fated for degradation. Substrates identified may not be direct targets of the E3 ligase. His-tagged substrates are efficiently purified. Compatible with domain- or site-specific variants to determine Ub chain topology and interaction domains. Relies on the overexpression of tagged bait proteins (Ub and E3). Fusion tag may disrupt protein folding and/or ubiquitylation. Not all interacting proteins identified are necessarily substrates. Can detect transient, low-affinity E3–substrate interactions and low-abundance substrates. UBAITs capture proteins that interact with the E3 in an E1- and E2-dependent manner. Using different TUBEs, is it possible to detect different polyubiquitin chain topologies. TUBEs can isolate endogenous ubiquitylated proteins. NEDDylator enriches for proteins that are specifically ubiquitylated by E3, not just interacting proteins in close proximity. UBAITs suitable for both HECT and RING E3 ligases. Relies on the overexpression of fusion proteins, tags may interfere with binding or activity. Low-abundance monoubiquitylated proteins may not be detected if the main population of that same protein is polyubiquitylated. Polyubiquitylated proteins may attract their unmodified or monoubiquitylated forms present in a heterotetramer (e.g. p53). Many different types of UBA domains with different chain topology preference — potential source of bias. BioID detects proteins in close proximity to E3, not necessarily substrates. NEDDylator may be restricted to RING domain ligases Accurately identifies and quantifies specific endogenous ubiquitylation sites. Can examine the global ubiquitylation site changes following the modulation of E3 activity/expression. Low abundant sites can be enriched and identified. Validation of ubiquitylation sites identified is difficult. This approach does not differentiate between mono- or poly-ubiquitylated sites. Expensive due to antibody and substantial amount of starting lysate needed. A comprehensive summary of the application of these various strategies to identifying E3 ligase substrates is provided in Table 2. While interacting proteins of E3 ligases represent potential substrates, further validation by orthogonal methods is necessary to attribute status as bona fide substrates.
Table 2.

Examples of E3 ligase substrate identification with various approaches

MethodE3 ligaseNumber of substrates identifiedReference
Yeast two-hybrid (Y2H)LNX64 Interactors and 8 validated substrates[70]
VHL100 Interactors, 8 putative substrates and 2 known substrates[71]
Nedd44 Interactors. 2 substrates[72]
SCF (Cdc4)4 Putative substrates and 1 known substrate[73]
NKLAM (RNF19b)1 Substrate[74]
TRIM321 Substrate[75]
In vitro ubiquitylation — phage displayMDM24 Known substrates and 11 validated substrates[76]
In vitro ubiquitylation — luminescentRsp59 Validated substrates[77]
Protein microarrayRsp545 Putative substrates, 11 known substrates and 6 validated[78]
Rsp586 Candidates identified and 28 validated substrates[79]
Nedd4∼15 Validated substrates[80]
SMURF17 Validated substrates[81]
Praja114 Putative substrates[82]
Global protein stability (GPS) profilingSCF359 Putative substrates and 31 validated substrates[83]
Various CRLs47 Validated substrates[84]
High-throughput quantitative microscopy (HCA)SCFGrr1106 Putative substrates and 7 validated substrates[85]
Differential expression (shotgun) proteomicsASB22 Known substrates[86]
MARCH912 Candidate substrates and 6 validated[87]
Affinity purification proteomicsSCF (Saf1)18 Putative substrates, 4 validated substrates and 17 known substrates[88]
FBXL family88 Interacting proteins and 13 putative substrates[89]
FBXW1196 Interacting proteins, 23 putative substrates and 1 validated[90]
BARD114 Putative substrates and 2 validated substrates[91]
βTrCP221 Putative substrates[92]
MuRF120 Putative substrates, 3 validated substrates and 3 known substrates[93]
Ubiquitin ligase trapping and proximity labellingFBXL family17 Known substrates and 18 novel substrates[94]
Rsp5, Itch, Psh1, RNF126, RNF168Numerous[95]
βTrCP1/250 Candidate substrates and 12 validated[96]
XIAP50 Putative substrates[97]
diGly remnant affinity proteomicsCRL253 Ubiquitylation sites, 59 known Ub-sites or CRL components[65]
PARKIN1654 Putative substrates[98]
SPOP12 Ubiquitylation sites and 4 likely substrates fated for degradation[99]
FBXO2136 Putative substrates, 2 validated substrates — one novel ubiquitylation site and 5 previously reported sites identified[60]
Integrated approachesHRD19 High-confidence substrates and 1 validated substrate[100]
HUWE11 Substrate and 1 interactor confirmed[101]
Trim3219 High-confidence substrates and 1 validated substrate[102]
ASB2α2 Validated substrates[103]

Interacting proteins represent potential substrates, but require further validation by orthogonal methods.

Interacting proteins represent potential substrates, but require further validation by orthogonal methods.

Genetics and functional genomics

Genetic screens have been developed in various formats to facilitate the identification of E3 ligase protein–protein interaction (PPI) networks in eukaryotes and prokaryotes. These include cell-based [e.g. yeast two-hybrid (Y2H)] and in vitro methods using reconstituted ubiquitylation assays (e.g. phage display screens and protein microarrays). Each technique is scalable to screen large gene libraries from human, yeast and other species and can be paired with various reporters (e.g. β-galactosidase and luciferase) for rapid identification of interactors/substrates. A common general restraint across the various genetic approaches used is limitations on the prey libraries available. For example, cDNA libraries may be incomplete, biased towards shorter genes, or contain truncated genes. Phage display systems also place severe restrictions on the size of genes that can be expressed.

Yeast two-hybrid

Y2H screening identifies PPIs in live yeast cells via the reconstitution of a functional transcription factor (Gal4) and subsequent reporter gene expression following the interaction of bait and prey proteins fused to the N-terminal DNA-binding domain and the C-terminal (transcriptional) activation domain of Gal4, respectively. Commonly used reporter systems include a GAL1–lacZ fusion gene, encoding β-galactosidase enzyme that labels the yeast cell blue when cultured on indicator plates, or facilitates selection-based screens based on the ability of yeast cells to grow on galactose [104-107]. Y2H has been applied to identify putative substrates of many E3 ligases. For example, the PDZ protein interaction domains of the LNX family of E3s were used as baits to screen a random peptide library, identifying 64 putative interacting proteins [108]. Similarly, the pVHL substrate recognition subunit was used as a bait to screen a human testis cDNA library to identify substrates of the CBCVHL E3 ligase complex [62]. Eight novel interactors were identified, along with previously known substrates [71]. Y2H is a relatively quick and simple technique that can be adapted to high-throughput format. However, the technique appears to be particularly susceptible to high rates of false-positive and false-negative (24–51% false-positive rate and 45–96% false-negative rate) [109-111]. Furthermore, Y2H may not be compatible with the expression of non-yeast proteins, which may not fold correctly or are potentially toxic. Importantly, Y2H is limited to the number of genes in a library and to PPIs that occur in the nucleus.

In vitro ubiquitylation screens

Many techniques for screening ubiquitin substrates rely on reconstituted in vitro assays using recombinant components (Ub, E1, E2 and E3) applied to a library of potential substrates (e.g. phage display libraries, recombinant protein libraries or protein microarrays). While these assays can be adapted to high-throughput formats and provide a direct readout of substrate ubiquitlyation, their main drawback is that they cannot account for cellular context (e.g. sub-cellular localization, differential expression and co-regulation by DUBs). The requirement for active recombinant enzymes can also be problematic as these enzymes are not always readily available and their activity may be regulated by PTMs and/or cofactors. A further consideration with in vitro ubiquitylation assays is the issue of E2 specificity. While many E3 ligases exhibit activity with ‘promiscuous’ E2 enzymes, such as UBE2D, some E3 ligases only function effectively with a specific E2 enzyme [25]. Protein microarray sub-screens to identify optimal E2 partners for E3 ligases (discussed below) may be useful in this context, but the issue should be considered whether using in vitro assays in either screening or validation formats.

Phage display

An in vitro ubiquitylation assay coupled with a phage display library using His-tagged ubiquitin, a human brain cDNA library and recombinant GST-MDM2, has been used to identify MDM2 substrates. Following purification and amplification of ubiquitylated phages, individual clones were PCR-amplified and sequenced to identify 16 ubiquitylated MDM2 substrates [76]. This method is fast and simple but may suffer from limitations in the size of peptide that can be expressed by the phage display system, meaning that potential conformational determinants and allosteric regulatory regions of substrates may not be accommodated.

Luminescent in vitro ubiquitylation assay

Luminescent in vitro ubiquitylation assays use a collection of putative substrates as GST-tagged fusion proteins bound to anti-GST acceptor beads, while biotin–ubiquitin is bound to streptavidin donor beads [77]. Ubiquitylation of GST-tagged proteins with biotin–ubiquitin brings acceptor and donor beads (which also contain a photosensitizer) into close proximity. Upon excitation (680 nm), a series of luminescent energy transfers between chemiluminescent and fluorescent compounds in the acceptor beads are measured by a micro-plate reader [77,112]. This assay was used to screen for yeast Rsp5 E3 ligase substrates using 188 GST-tagged fusion proteins, identifying and validating 9 Rsp5 substrates [77]. A significant limitation of this approach is the need to generate tagged, recombinant proteins — which may fold incorrectly, lack essential PTMs or suffer interference by affinity tags.

Protein microarrays

Functional protein microarrays are gaining increasing popularity for high-throughput, systematic identification of E3 ligase substrates, in part because this assay is now offered as a commercial service. This method uses a library of putative substrates — in the form of recombinant proteins — arrayed to a carrier surface, such as glass slides [113,114], combined with a reconstituted in vitro ubiquitylation cascade (as above) and a sensitive method (radiological, fluorescence or chemiluminescence) for the detection of conjugated ubiquitin. This system has the advantage of also facilitating sub-screens to identify optimal E2 partners for E3 ligase activity using an arrayed panel of E2 enzymes, but still suffers from the usual restrictions of in vitro assays and limitations on the arrayed substrate library (see above). This method was first used with a 4000 protein yeast proteome microarray to identify 40 substrates of the HECT E3 ligase Rsp5, identifying 11 previously known substrates, with 6 further validated [78]. A similar technique to identify Rsp5 substrates with a different array containing 5800 proteins [79] included a dosage lethality/suppression screen that validated 28 substrates (out of 86 candidates). Interestingly, there was little overlap between the substrates identified in two screens for the same E3 ligase, highlighting the need for orthogonal approaches and validation. Many studies have used protein microarrays to profile substrates of mammalian E3 ligases. For example, a microarray containing 8222 human proteins was used to identify ∼50 putative substrates each for the human Rsp5 orthologues hNedd4-1, hNedd4-2 and the rat orthologue rNedd4-1 [80]. A smaller human protein microarray (containing ∼2000 proteins) was used to identify substrates of the human E3 ligase Praja1 [82]. This study used a commercially available E2 profiling kit to determine that UBE2D3 was the optimal E2 for Praja1. An interesting aspect of this study was the use of duplicate microarrays — one probed with an anti-ubiquitin antibody that detects both mono- and poly-ubiquitylated proteins and the other probed with biotinylated tandem ubiquitin-binding entity (TUBE) specific for ubiquitin chains. Fourteen putative Praja1 substrates were identified in common to both detection methods. A major shortcoming of this technique is that currently available commercial protein microarrays do not have coverage of the entire proteome [115] and do not account for cellular context.

Global protein stability profiling

Significant advances in functional genomic tools and models have made possible the systematic and global identification of E3 ligase substrates in live mammalian cells, which is a key consideration in understanding UPS function in the context of cell signalling. These approaches can identify differential degradation of proteins following the modulation of E3 ligase activity or expression, using reporters such as GFP to permit observation of spatio-temporal context and rapid validation using traditional biochemical techniques such as co-IP. Global protein stability (GPS) profiling is a fluorescence-based, high-throughput system to monitor protein turnover (but not protein ubiquitylation directly) in mammalian cells using a retroviral reporter construct containing a single promoter driving the bi-cistronic translation of two fluorescent proteins from one mRNA transcript [116]. Discosoma sp. red fluorescent protein (DsRed) acts as an internal control, whereas enhanced GFP (eGFP) is expressed fused to a protein of interest. As they are translated from a single mRNA, both fluorescent proteins are produced at approximately the same rate and DsRed expression should remain relatively constant, controlling for background protein stability. Therefore, changes in the green/red fluorescence (eGFP/DsRed) ratio of a cell under different conditions are a direct readout of eGFP-labelled fusion protein degradation [116]. This reporter system was coupled with the hORFeome library, fluorescence-activated cell sorting and microarray deconvolution to identify differential degraded proteins, following genetic alteration of Skp Cullin F-box (SCF) activity (i.e. putative substrates) [83]. This study identified 73% of known SCF substrates and 359 putative substrates, of which 31 were validated [83]. This method was further extended by integration with proteomics to identify CRL substrates following the treatment of cells with a CRL inhibitor (described in detail below — Integrated Approaches). GPS profiling provides significant advantages over other genetic screening methods, probably the most important being the ability to monitor dynamic changes in protein stability in live mammalian cells following pharmacological or genetic manipulation [83,116]. Even with limitations of the hORFeome library used (containing only ∼8000 of the estimated ∼20 000 proteins in the proteome), GPS profiling represents a significant expansion of screening coverage in mammalian cells. Potential disadvantages include the time-consuming cloning of each protein into the reporter construct. The EGFP tag may also interfere with protein folding, localization, PTMs and protein stability [116]. An important caveat of GPS profiling from the perspective of understanding UPS function is that the assay measures protein stability, and is not a direct measure of protein ubiquitylation. Hence, other factors influencing protein stability must be considered when interpreting results, and GPS profiling is not able to distinguish ubiquitylation events with signalling outcomes other than protein degradation.

High-throughput quantitative microscopy (high-content analysis)

High-throughput microscopy (also known as high-content analysis, HCA) was utilized to identify SCFGrr1 substrates by comparing isogenic wild-type GRR1 and GRR1 knockout (grr1Δ) yeast cells simultaneously. GRR1 was replaced with red fluorescent protein (RFP) in grr1Δ cells, and the grr1Δ::RFP strain was then crossed with a library of 4000 yeast strains that each express a GFP-tagged ORF and imaged in 96-well format using HCA [85]. GRR1 substrates were expected to have increased GFP intensity in grr1Δ (RFP-positive) cells compared with GRR1 cells (RFP-negative). Similar to GPS profiling, this technique does not provide a direct measure of protein ubiquitylation and similar caveats around limitations in library size and potential interference from GFP tag apply. However, the utility of this screening strategy is clear, identifying 106 putative GRR1 substrates, with 7 validated. Furthermore, the ability to screen in live cells provides clear advantages in understanding dynamic processes in relevant spatio-temporal context.

Proteomics

Proteomics has emerged as a powerful tool for identifying E3 ligase substrates, as continuing improvements in biochemical techniques and instrument sensitivity allow more precise qualitative and quantitative analyses of the E3 ligase-regulated proteome. Mass spectrometry (MS)-based proteomics has been coupled with a variety of model systems and enrichment strategies to identify E3 ligase substrates at both the protein and site level (>23 000 modified sites mapping to >4600 proteins) [65,66]. These strategies include defining PPI networks of E3 ligases and profiling differential changes in the global proteome — or ubiquitome specifically — following the modulation of E3 ligase expression or activity [86,98,101].

Shotgun proteomics

Label-free shotgun proteomics was used to identify ASB2 (ankyrin repeat-containing protein with a suppressor of cytokine signalling box 2) E3 ligase substrates. Proteins that had reduced abundance in ASB2 wild-type expressing cells, compared with ligase-inactive mutant-expressing cells, were likely to be ubiquitylated by ASB2 and degraded by the proteasome [86]. This technique identified two known ASB2 substrates [117]. Similarly, Hör et al. [87] used SILAC (stable isotope labelling by amino acids in cell culture)-MS to identify differentially expressed plasma membrane proteins in cells overexpressing GFP-tagged MARCH9 E3 ligase or GFP alone. Twelve down-regulated proteins were identified in GFP-MARCH9 overexpressing cells compared with GFP alone, and six were validated as substrates by flow cytometry. The major caveat of shotgun proteomics strategies (as with GPS profiling — above) is that they will identify Ub substrates fated for proteasomal degradation, whereas they will not detect substrates fated for other signalling events.

Affinity purification–tandem mass spectrometry

Various configurations of affinity purification coupled with mass spectrometry (AP–MS/MS) have been successfully applied to identify putative E3 ligase-interacting proteins and substrates. In their most basic format, these typically involve the expression of an E3 ligase or Ub bait fused to a small tag (such as HA) for enrichment, combined with various genetic or pharmacological manipulations. More sophisticated approaches to AP–MS/MS have recently been developed, which allow trapping and/or labelling of substrates, enrichment of specific poly-Ub chain types or site-specific identification of Ub modifications. There are numerous examples of co-purifying interacting proteins of individual E3 ligase baits to identify putative substrates, but the emergence of systematic approaches is proving very powerful. For example, parallel adaptor capture (PAC) proteomics coupled with CompPASS (Comparative Proteomics Analysis Software Suite) was developed to systematically identify substrates of 19 F-box proteins (FBXLs) [89]. Interactors of HA-tagged FBXL proteins, overexpressed in HEK293 cells, were identified by parallel α-HA affinity purification, MS/MS and CompPASS analysis. This study identified 22 high confidence-interacting proteins (HCIPs), and 13 were found to have increased stability in the presence of the Nedd8-activating enzyme inhibitor MLN4924, which inactivates FBXL-containing cullin–RING E3 ligases [89]. This strategy was repeated in a colon cancer cell line, which identified 88 HCIPs, 44 of which were also identified in the HEK293 experiment [89]. If known, the isolated substrate-binding region of an E3 can also be used to purify interacting proteins. For example, WNKs were co-purified and identified using the CUL3 substrate adaptor KLHL3 as a bait [118]. While this is an effective and efficient method to identify putative E3 ligase substrates, not all interacting proteins identified are necessarily substrates, making the use of inhibitors or functional mutants an essential aspect of these experiments. The great power of the PAC–CompPASS method is the ability to simultaneously interrogate substrates of multiple E3 ligases. However, the requirement for affinity purifications of multiple proteins performed in parallel and consequently high demand on instrument and analytical time may put this technique beyond the reach of many investigators. Furthermore, CompPASS can identify proteins based on a single peptide, increasing the potential for incorrect protein identifications or false-positives [89]. Importantly, PAC-based approaches are unlikely to enrich for monoubiquitylated substrates and substrates that are not fated to be degraded [90]. The PAC–CompPASS approach was recently applied to identify substrates of the FBXW11 ubiquitin ligase adaptor protein [90]. This study identified 96 interacting proteins, 23 of which are putative substrates, with 1 independently validated as a substrate [90]. Interestingly, a similar approach has also been applied to systematic identification of DUB substrates [119]. Another AP–MS/MS-based strategy to identifying E3 ligase substrates exploits the attachment of His-tagged Ub (and various mutants) to substrates following E3 ligase protein overexpression [91]. Using quantitative mass spectrometry analysis, this study identified 14 proteins with increased ubiquitylation following BRCA1/BARD1 overexpression [91]. A similar approach combined bioinformatics and AP–MS/MS to identify SCFβTrCP E3 ligase substrates. First, a consensus motif derived from reported βTrCP substrates was generated and used to search biological sequence databases for substrates [92]. Subsequently, using AP–MS/MS, βTrCP-interacting proteins were identified and the data were integrated with the bioinformatics prediction data [92]. This approach identified 221 βTrCP interactors that also harbour conserved sequences similar to the consensus motif of known substrates [92]. Alternative affinity purification strategies have been developed that not only identify substrates destined for proteasomal degradation but also identify substrates labelled with alternative ubiquitin chain topologies signalling various fates. For example, TUBEs, containing four tandem UBA domains and additional tags, such as 6xHis for detection, efficiently bind mono- and poly-ubiquitylated proteins [120]. This method does not rely on protein overexpression, tag fusion proteins, inhibitors or chemical and genetic manipulations and therefore is ideal for the detection of endogenous ubiquitylated proteins [121,122]. To identify substrates of the MuRF1 E3 ligase, TUBEs coupled with two-dimensional differential in-gel electrophoresis (2D-DIGE) and MS were used to compare cells overexpressing either MuRF1 or GFP [93]. This study identified 20 proteins with differential ubiquitylation following MuRF1 overexpression, 3 of which were previously known substrates and 3 that were validated as novel substrates. One limitation arises from the use of 2D-DIGE, as low-abundance protein bands may not be visible, very hydrophobic proteins are difficult to detect, and there may be sample loss during gel processing. More contemporary proteomic approaches have largely made the use of 2D-DIGE redundant and promise much more sensitive detection of proteins that underwent TUBE enrichment.

Ubiquitin ligase trapping and proximity labelling

Many techniques have recently been developed that use various formats of E3 ligase fusion proteins to either ‘trap’ putative ligase substrates or label proteins in close proximity to the E3 ligase active site for downstream identification using proteomics. The ubiquitin ligase trapping method changes the kinetics of the ligase–substrate interaction (i.e. decreases the off-rate), essentially increasing the affinity of a ligase for its polyubiquitylated substrates and providing a means for purification and identification of the substrate. The first iteration of this technique used a synthetic fusion protein, consisting of an UBA domain fused to the N-terminus of an E3 via a linker composed of three tandem FLAG epitopes, coupled with mass spectrometry, [94], to identify substrates of 8 FBXLs in yeast [88]. This study identified 17 known FBXL substrates and 18 novel substrates, but very little overlap was observed in the substrates identified compared with the PAC–CompPASS method described above [89], possibly reflecting differences between yeast and human FBXLs. A possible caveat of this approach is the potential for the UBA domain fused to the FBXL N-terminus to disrupt substrate binding. Furthermore, UBA domains have preference for different ubiquitin chain topologies [94,123,124] and hence, the specific UBA domain used may not detect all possible substrates of a given ligase. A more recently developed trapping approach uses E3-ubiquitin fusion proteins (ubiquitin-activated interaction traps, UBAITs) to covalently capture proteins that interact with the E3 in an E1- and E2-dependent manner. This technique was successfully identified binding partners of both HECT (Rsp5 and Itch) and RING (Psh1, RNF126 and RNF168) E3 ligases [95]. Proximity-dependent biotin labelling (BioID) uses a fusion protein of an E3 coupled with a biotin-conjugating enzyme (BirA*). BirA* activates biotin, which then reacts with nearby amine groups on lysine residues in interacting proteins. This assay was coupled with semi-quantitative mass spectrometry on cells treated with MG132 to identify 50 new SCFβ-TrCP1/2 interacting proteins, with 12 being validated as E3 ligase substrates [96]. A more sophisticated version of the proximity labelling strategy uses an engineered chimeric protein called the NEDDylator, which was generated by removing the RING domain of XIAP and then fusing the E2 for NEDD8 (Ubc12) to the substrate-binding domain of the ligase via a flexible linker. NEDDylator–XIAP maintains specific substrate-binding capacity, but substitutes conjugation of the ubiquitin-like protein NEDD8 to the substrate instead of Ub [97]. Compared with Ub, NEDD8 is a relatively rare PTM in cells and is generally thought to regulate protein activity, not stability [97]. Jurkat cells were incubated with NEDDylator–XIAP and HB–NEDD8 followed by purification of HB–NEDD8-tagged substrates and mass spectrometry analysis. More than 50 putative XIAP substrates were identified [97]. An advantage of this strategy is that it enriches for proteins that are specifically ubiquitylated by the chimeric protein, not just interacting proteins in close proximity. However, generation of the recombinant chimeric protein may be difficult for some larger multidomain E3s. In addition, the utility of the NEDDylator may be restricted to RING domain ligases, as unlike HECT domain ligases, these do not directly catalyze the ubiquitylation of their substrates.

Gly–Gly (diGly) remnant affinity purification

An important breakthrough in understanding cellular ubiquitin signalling was the development of monoclonal antibodies that specifically recognize the characteristic GlyGly (diGly) remnant present on the ε-amine of lysine following trypsin digestion of ubiquitylated proteins. This modification is detected by MS analysis as a distinct mass shift of 114.04 Da [125]. Several laboratories have developed di-Gly antibodies [65,125-127], driving a significant increase in the available resolution and coverage of ubiquitylation detection. The potential of diGly antibodies is underscored by the widespread utility and analytical power provided by the development of phospho-specific antibodies in understanding kinase signalling [128] or methyl-/acetyl-specific antibodies in characterizing histone modifications [129,130]. One important limitation of diGly remnant affinity purification for identifying ubiquitylation sites is that an identical remnant is left following trypsin digestion of proteins conjugated to ubiquitin-like modifiers, such as ISG15 and NEDD8, making them indistinguishable by tandem MS [125,131,132]. One way to overcome this limitation is a two-step enrichment strategy where ubiquitylated proteins are first purified from cells expressing 6xHis-tagged ubiquitin followed by trypsin digestion and diGly immuno-purification proteomics (diGly proteomics). This strategy was used to identify 374 diGly-modified peptides on 236 proteins, but nullifies one of the most powerful aspects of the diGly technique, namely the ability to detect endogenous ubiquitylation without the possible confounding factors introduced by the use of tagged protein overexpression [125]. Initially, global analysis of lysine ubiquitylation using diGly antibodies under normal physiological conditions or following pharmacological treatment (e.g. with proteasome inhibitor, MG132) was deployed to identify between 1664–20 000 diGly-modified sites [65,66,127,133]. However, this strategy could not match the ubiquitylated site with the responsible E3 ligase, nor reveal ubiquitin chain topology. DiGly proteomics is increasingly being used in conjunction with chemical inhibition or genetic manipulation of E3 ligase activity or expression to couple ubiquitylation sites with the responsible E3 ligase. For example, this approach was utilized to identify CRL substrates, identifying 253 diGly sites with reduced abundance following CRL and proteasome inhibition — of which 59 sites were on known CRL substrates or components of CRLs [65]. Another technique combined 73 control and diGly proteomic experiments with PARKIN overexpression and/or mitochondrial depolarization by carbonyl cyanide m-chlorophenyl hydrazone treatment to identify 4772 dynamically regulated ubiquitylation sites in 1654 proteins [98]. Several of these were subsequently validated by western blot analysis using PARKIN siRNA. A similar approach was used to identify cullin–RING ubiquitin ligase adaptor protein speckle-type POZ protein (SPOP) ubiquitylation sites by performing diGly proteomics on cells overexpressing either wild-type SPOP (SPOP-WT), prostate cancer relevant mutants (SPOP-MT) or empty vector [99]. Twelve diGly sites identified were more abundant in SPOP-MT cells compared with SPOP-WT or empty vector, and four proteins had an inverse correlation with protein expression [99]. A newer approach, combining trypsin-resistant tandem ubiquitin-binding entity(ies) (TR-TUBE) technology with diGly proteomics, was recently deployed to identify substrates of the uncharacterized F-box protein FBXO21. TR-TUBE is based on a modification of the previously described TUBE technology [120] to protect ubiquitin chains from trypsin digestion [60]. Overexpression of FLAG-TR-TUBE and E3 ligase followed by enrichment with anti-FLAG antibody and diGly proteomics identified putative substrates with less background compared with enrichment by TR-TUBE alone [60]. In addition, substantially less starting material is required than using diGly proteomics only [60]. This approach was applied to identify several putative substrates of FBXO21, of which two were independently validated [60].

Integrated approaches

Integrating orthogonal approaches is potentially a rapid means to identify high-confidence E3 ligase interactors and substrates, substantially reducing the number of hits required to validate using more traditional biochemical techniques. Aiming to provide a more comprehensive survey, Lee et al. used an integrated strategy combining diGly proteomics and AP–MS. Mono- and poly-ubiquitylated proteins were purified from cells expressing 6xHis-Ub and HRD1 siRNA, followed by mass spectrometry analysis [100]. This approach identified 29 ubiquitylated proteins that had decreased abundance in HRD1 siRNA samples compared with control siRNA [100]. The second strategy utilized diGly proteomics following attenuated expression of HRD1 by siRNA [100]. This study identified 117 ubiquitylated peptides with reduced abundance in HRD1 siRNA sample compared with control. Nine high-confidence substrates were identified by both methods, highlighting the advantage of using a combined strategy. To identify high-confidence E3 ligase substrates in a physiological setting, several studies have employed integrated approaches utilizing tissue from isogenic mouse models coupled with label-free quantitative (LFQ) mass spectrometry. This method was implemented in the search for substrates of Trim32 in skeletal muscle cells [102]. Comparing Trim32 knockout mouse (T32KO) with wild-type control, this study identified 19 proteins that had increased abundance in T32KO muscle samples [102]. Following independent validation of two putative substrates, NDRG2 was identified as a novel substrate of Trim32 [102]. Spinner et al. [103] utilized this strategy to identify substrates of the E3 ligase ASB2α in different dendritic cell (DC) subsets. This study identified FLNa and FLNb as the only proteins that significantly accumulated in ASB2α-deficient DCs compared with ASB2α-expressing DCs, indicating that FLNa and FLNb are highly specific targets of ASB2α in haematopoietic cells [103]. This approach highlights the power of isogenic mouse models coupled with LFQ mass spectrometry to identify E3 ligase substrates in a physiological setting, but validation using independent biochemical approaches is still required.

Validation methods

Even with advances in proteomics, genomics and in vitro assays that address many of the significant challenges outlined above, identifying E3 ligase substrates — or determining which E3 ligase targets a specific substrate — remains a significant challenge. Notably, these technological advances do not directly address a significant remaining bottleneck in the quest to identify genuine E3 ligase targets — validation of the often large number of putative substrates identified, particularly in a functional context. There are many ways to validate putative E3 ligase substrates and interacting proteins, including IP–western blot experiments, in vitro ubiquitylation assays and cell-based assays. The decision about which validation method(s) to adopt is heavily dependent on the ultimate aim of the study. While many consider in vitro ubiquitlyation assays using recombinant components to be the gold standard, theses cannot account for cellular context, regulation by PTMs or feedback regulation by DUBs. Confirming PPIs in the cellular context using Co-IP, IF or BiFC is an important consideration. Similarly, assays combining site-directed mutagenesis, si/shRNA or CRISPR with assays of cellular function (e.g. viability and proliferation) or more specific readouts of signal transduction pathway output may be desirable to confirm functional roles. Animal models of disease and/or gene function may also be used in this context. However, these are not generally suitable to validate large numbers of hits from high-throughput screening platforms. Instead, novel strategies are required to rapidly and efficiently validate a large number of putative E3 substrates, or at least render the large data set down to a more manageable list of high-confidence hits. Currently, integrating orthogonal approaches, such as diGly proteomics and AP–MS with genetic/chemical inhibition of E3 ligase activity or protein levels, seem to be the most effective strategy. DiGly proteomics requires putative sites to be independently validated, but western blot analysis is unable to detect differences in ubiquitylation at the site level [65,66]. While ORFeome libraries and high-content imaging technologies enable high-throughput validation of PPIs, validation of individual ubiquitin-modified sites requires time-consuming site-directed mutagenesis studies. Ideally, functional validation of substrates and ubiquitylation sites using knockout animal models and/or isogenic cell lines is necessary. For example, the identification of putative Nedd4-1 and Nedd4-2 substrates using human protein microarrays was functionally validated by showing that Nedd4-1 negatively regulates signalling via the fibroblast growth factor receptor 1 (FGFR1) [80]. Following on from this study, the Rotin laboratory identified the sequence motif necessary for Nedd4-1 to bind to and ubiquitylate FGFR1 [134]. Deletion of the sequence motif (FGFR1-Δ6) stimulated neuronal stimulation in human embryonic neuronal stem cells [134]. Importantly, FGFR1-Δ6 expression in Zebrafish embryos caused defective anterior neuronal patterning, indicative of excessive FGFR1 signalling [134]. This study exemplifies the utility of an integrated approach to systematically identify an E3 ligase substrate, from high-throughput screen (i.e. protein array) to in vitro and in vivo functional validation.

Conclusions

The recent development of highly innovative methods to identify substrates of the UPS has gone a long way to address a significant challenge in the field. These systematic approaches are underpinning rapid advances in our understanding of the biochemistry and biology of the UPS and are already having significant impact on our understanding of a diverse range of disease processes. Investigators need to consider several factors in choosing the most appropriate approach to identifying E3 ligase substrates in the particular context of interest. 1. Access to analytical platforms and expertise. 2. Speed and cost. 3. Availability and suitability of model systems for both discovery and validation (i.e. cell lines and animals). 4. Availability of reagents (e.g. recombinant proteins, inhibitors and expression libraries). Integrated approaches — combining genomics or proteomics with isogenic models or chemical inhibition — are emerging as the optimal strategy to identify E3 ligase substrates and should be favoured whenever possible. Spatial and temporal cellular context is also an important consideration. E3 ligase substrates identified using reconstituted in vitro assays may not necessarily be relevant in vivo if the ligase–substrate are not expressed in the same cellular compartment at the same time. Conversely, demonstrating in vitro ligase activity towards a particular substrate is still considered the gold standard by many. Functional validation of ubiquitylation sites and putative substrates in cellular and in vivo contexts still presents a major bottleneck, due to both the sheer number of putative substrates identified and a shortage of suitable high-throughput biochemical methods. The future development of improved model systems and research tools will be key to driving the gaining momentum in the field. In particular, many researchers would welcome the development of selective and potent pharmacological inhibitors across all classes of E3 ligases. The recent development of selective Ub variant probe libraries [44], targeting a panel of HECT ligases, is a significant advance in this respect and should provide a powerful toolkit for the discovery and functional validation of E3 ligase substrates. Rapid improvements in both the speed and precision of gene targeting approaches (e.g. using CRISPR) to generate isogenic knockout/knockin mouse and cell line models for in vivo discovery and validation of E3 ligase function and substrates will also be important. One significant outstanding hurdle to enabling a systematic and comprehensive characterization of UPS substrates is the lack of a central, co-ordinated database of ligase–substrate pairs. A practical and reliable knowledge base would provide a key resource — not only for researchers studying the UPS, but also more widely in pathophysiology and drug development in a range of disease contexts.
  133 in total

1.  Genetic and microbial factors modulating the ubiquitin proteasome system in inflammatory bowel disease.

Authors:  Isabelle Cleynen; Emilie Vazeille; Marta Artieda; Hein W Verspaget; Magdalena Szczypiorska; Marie-Agnès Bringer; Peter L Lakatos; Frank Seibold; Kirstie Parnell; Rinse K Weersma; Jestinah M Mahachie John; Rebecca Morgan-Walsh; Dominiek Staelens; Ingrid Arijs; Gert De Hertogh; Stefan Müller; Atilla Tordai; Daniel W Hommes; Tariq Ahmad; Cisca Wijmenga; Sylvia Pender; Paul Rutgeerts; Kristel Van Steen; Daniel Lottaz; Severine Vermeire; Arlette Darfeuille-Michaud
Journal:  Gut       Date:  2013-10-03       Impact factor: 23.059

2.  E2-EPF UCP targets pVHL for degradation and associates with tumor growth and metastasis.

Authors:  Cho-Rok Jung; Kyung-Sun Hwang; Jinsang Yoo; Won-Kyung Cho; Jin-Man Kim; Woo Ho Kim; Dong-Soo Im
Journal:  Nat Med       Date:  2006-07-02       Impact factor: 53.440

Review 3.  Structural insights into the catalysis and regulation of E3 ubiquitin ligases.

Authors:  Lori Buetow; Danny T Huang
Journal:  Nat Rev Mol Cell Biol       Date:  2016-08-03       Impact factor: 94.444

4.  The E3 ubiquitin ligase TRIM32 regulates myoblast proliferation by controlling turnover of NDRG2.

Authors:  Ekaterina I Mokhonova; Nuraly K Avliyakulov; Irina Kramerova; Elena Kudryashova; Michael J Haykinson; Melissa J Spencer
Journal:  Hum Mol Genet       Date:  2015-02-20       Impact factor: 6.150

5.  Protein microarrays for the identification of praja1 e3 ubiquitin ligase substrates.

Authors:  Christian M Loch; Michael J Eddins; James E Strickler
Journal:  Cell Biochem Biophys       Date:  2011-06       Impact factor: 2.194

6.  Acetylation regulates gluconeogenesis by promoting PEPCK1 degradation via recruiting the UBR5 ubiquitin ligase.

Authors:  Wenqing Jiang; Shiwen Wang; Mengtao Xiao; Yan Lin; Lisha Zhou; Qunying Lei; Yue Xiong; Kun-Liang Guan; Shimin Zhao
Journal:  Mol Cell       Date:  2011-07-08       Impact factor: 17.970

7.  A novel genetic system to detect protein-protein interactions.

Authors:  S Fields; O Song
Journal:  Nature       Date:  1989-07-20       Impact factor: 49.962

8.  Global analysis of lysine ubiquitination by ubiquitin remnant immunoaffinity profiling.

Authors:  Guoqiang Xu; Jeremy S Paige; Samie R Jaffrey
Journal:  Nat Biotechnol       Date:  2010-07-18       Impact factor: 54.908

9.  Mdm2 regulates p53 mRNA translation through inhibitory interactions with ribosomal protein L26.

Authors:  Yaara Ofir-Rosenfeld; Kristy Boggs; Dan Michael; Michael B Kastan; Moshe Oren
Journal:  Mol Cell       Date:  2008-10-24       Impact factor: 17.970

10.  Activation of ubiquitin-dependent DNA damage bypass is mediated by replication protein a.

Authors:  Adelina A Davies; Diana Huttner; Yasukazu Daigaku; Shuhua Chen; Helle D Ulrich
Journal:  Mol Cell       Date:  2008-03-14       Impact factor: 17.970

View more
  55 in total

1.  Decoys Untangle Complicated Redundancy and Reveal Targets of Circadian Clock F-Box Proteins.

Authors:  Chin-Mei Lee; Ann Feke; Man-Wah Li; Christopher Adamchek; Kristofor Webb; José Pruneda-Paz; Eric J Bennett; Steve A Kay; Joshua M Gendron
Journal:  Plant Physiol       Date:  2018-05-23       Impact factor: 8.340

Review 2.  Visualizing ubiquitination in mammalian cells.

Authors:  Sjoerd Jl van Wijk; Simone Fulda; Ivan Dikic; Mike Heilemann
Journal:  EMBO Rep       Date:  2019-01-21       Impact factor: 8.807

3.  Emerging Cancer Therapeutic Targets in Protein Homeostasis.

Authors:  Prabhakar Bastola; Derek B Oien; Megan Cooley; Jeremy Chien
Journal:  AAPS J       Date:  2018-08-27       Impact factor: 4.009

4.  Merkel Cell Polyomavirus Small Tumor Antigen Activates Matrix Metallopeptidase-9 Gene Expression for Cell Migration and Invasion.

Authors:  Nnenna Nwogu; Luz E Ortiz; Adrian Whitehouse; Hyun Jin Kwun
Journal:  J Virol       Date:  2020-09-15       Impact factor: 5.103

5.  TNF receptor-associated factor 6 interacts with ALS-linked misfolded superoxide dismutase 1 and promotes aggregation.

Authors:  Sabrina Semmler; Myriam Gagné; Pranav Garg; Sarah R Pickles; Charlotte Baudouin; Emeline Hamon-Keromen; Laurie Destroismaisons; Yousra Khalfallah; Mathilde Chaineau; Elise Caron; Andrew N Bayne; Jean-François Trempe; Neil R Cashman; Alexandra T Star; Arsalan S Haqqani; Thomas M Durcan; Elizabeth M Meiering; Janice Robertson; Nathalie Grandvaux; Steven S Plotkin; Heidi M McBride; Christine Vande Velde
Journal:  J Biol Chem       Date:  2020-02-06       Impact factor: 5.157

Review 6.  Functional analysis of Cullin 3 E3 ligases in tumorigenesis.

Authors:  Ji Cheng; Jianping Guo; Zhiwei Wang; Brian J North; Kaixiong Tao; Xiangpeng Dai; Wenyi Wei
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2017-11-08       Impact factor: 10.680

7.  Direct regulation of Chk1 protein stability by E3 ubiquitin ligase HUWE1.

Authors:  Katelyn B Cassidy; Scott Bang; Manabu Kurokawa; Scott A Gerber
Journal:  FEBS J       Date:  2019-11-29       Impact factor: 5.542

8.  The E3 Ligases Spsb1 and Spsb4 Regulate RevErbα Degradation and Circadian Period.

Authors:  Tsedey Mekbib; Ting-Chung Suen; Aisha Rollins-Hairston; Jason P DeBruyne
Journal:  J Biol Rhythms       Date:  2019-10-14       Impact factor: 3.182

9.  Proteome-wide Analysis Reveals Substrates of E3 Ligase RNF146 Targeted for Degradation.

Authors:  Litong Nie; Chao Wang; Nan Li; Xu Feng; Namsoo Lee; Dan Su; Mengfan Tang; Fan Yao; Junjie Chen
Journal:  Mol Cell Proteomics       Date:  2020-09-21       Impact factor: 5.911

10.  A ubiquitin variant-based affinity approach selectively identifies substrates of the ubiquitin ligase E6AP in complex with HPV-11 E6 or HPV-16 E6.

Authors:  Felix A Ebner; Carolin Sailer; Daniela Eichbichler; Jasmin Jansen; Anna Sladewska-Marquardt; Florian Stengel; Martin Scheffner
Journal:  J Biol Chem       Date:  2020-08-27       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.