Literature DB >> 28234446

Ubiquitin-like Protein Conjugation: Structures, Chemistry, and Mechanism.

Laurent Cappadocia1, Christopher D Lima1,2.   

Abstract

Ubiquitin-like proteins (Ubl's) are conjugated to target proteins or lipids to regulate their activity, stability, subcellular localization, or macromolecular interactions. Similar to ubiquitin, conjugation is achieved through a cascade of activities that are catalyzed by E1 activating enzymes, E2 conjugating enzymes, and E3 ligases. In this review, we will summarize structural and mechanistic details of enzymes and protein cofactors that participate in Ubl conjugation cascades. Precisely, we will focus on conjugation machinery in the SUMO, NEDD8, ATG8, ATG12, URM1, UFM1, FAT10, and ISG15 pathways while referring to the ubiquitin pathway to highlight common or contrasting themes. We will also review various strategies used to trap intermediates during Ubl activation and conjugation.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28234446      PMCID: PMC5815371          DOI: 10.1021/acs.chemrev.6b00737

Source DB:  PubMed          Journal:  Chem Rev        ISSN: 0009-2665            Impact factor:   60.622


Introduction

Ubiquitin was isolated in the 1970s as a ubiquitous protein that is conjugated to other proteins through a peptide bond between its C-terminal glycine and a primary amine on the substrate, most typically a lysine residue. Conjugation was subsequently shown to be dependent on the successive activities of enzymes named E1, E2, and E3. In the 1980s, biochemical studies elucidated the chemical reactions catalyzed by these enzymes. Mostly in the 1990s and 2000s, several protein families were discovered that are evolutionarily related to ubiquitin insofar as they share the ubiquitin fold and the capacity to be conjugated to substrates through the concerted action of evolutionarily related E1s, E2s, and E3s (Figure ). These proteins are now collectively referred to as Ubl’s, an acronym for ubiquitin-like proteins. Studies on various Ubl conjugation cascades have notably addressed: (i) Ubl recognition by cognate conjugation enzymes, (ii) chemical mechanisms used during conjugation, (iii) substrate specificity, (iv) determinants for substrate modification by one or more Ubl’s (chains), and (v) regulation of the conjugation machinery and cross-talk with other post-translational modifications.
Figure 1

Ubl conjugation cascade where E1, E2, E3, and Ubl designate an E1 conjugating enzyme, an E2 conjugation enzyme, an E3 ligase, and a ubiquitin-like protein, respectively.

Ubl conjugation cascade where E1, E2, E3, and Ubl designate an E1 conjugating enzyme, an E2 conjugation enzyme, an E3 ligase, and a ubiquitin-like protein, respectively. In this review, we will focus on structural and mechanistic studies that provided insight into reactions and specificities for the eukaryotic ubiquitin-like conjugation machinery in the past five years, going back further at times to highlight important contributions to the Ubl field. Ubiquitin conjugation cascades have been the subject of numerous recent reviews[1−8] and will not be covered in depth here, but we will refer to mechanisms in the ubiquitin pathway to highlight common or contrasting themes. We will also limit our review to eukaryotic proteins although proteins related to ubiquitin are present in bacteria and archaea (reviewed in Burroughs et al.[9]). In bacteria, the proteins MoaD and ThiS share structural similarity with ubiquitin while MoeB and ThiF share structural and mechanistic similarities with the ubiquitin E1.[10,11] However, these proteins are involved in sulfur metabolism rather than protein conjugation. In archaea, small archaeal modifier proteins (SAMPs) are related to ubiquitin and can be conjugated to proteins (this is also reviewed in Maupin-Furlow[13]).[12] While the conjugation machinery appears limited to SAMPs and proteins similar to eukaryotic E1 in most archaea, an intriguing operon-like cluster was identified in Candidatus Caldiarchaeum subterraneum that encodes a Ubl and proteins related to eukaryotic E1, E2, and E3.[14] Ubiquitin and components of its conjugation system predate the emergence of eukaryotes; however, it is clear that eukaryogenesis resulted in a massive increase in the number of Ub/Ubl genes and pathways,[15,16] the primary focus of this review. At the end of this review, we will describe strategies that enabled investigators to trap and characterize transient intermediates during conjugation. Concerning nomenclature, we employ a slash (/) to indicate noncovalent interactions, a tilde (∼) to indicate thioester bonds, and a dash (−) to indicate other covalent interactions. The names E1, E2, and E3, will be followed by the name of the protein in subscript (for example, E2UBC9). Finally, Ubl’s form families of different sizes and the names of the individual family members can vary according to the organism. For simplicity, we will be referring to the Ubl’s using generic names such as SUMO, NEDD8, ATG8, ATG12, URM1, UFM1, FAT10, and ISG15.

Ubiquitin-like Proteins

Ubl’s encompass a family of proteins that share structural and evolutionary relationships with ubiquitin. Each family member possesses a β-grasp fold composed of a five-stranded β-sheet that partially wraps around a central α-helix.[17,18] Ubl’s have historically been divided into two types based on whether they are conjugated to substrates: type I Ubl’s are conjugated and type II Ubl’s are not.[19] Type II Ubl’s are generally observed in the context of multidomain proteins, and while they can be found in proteins not directly related to Ubl conjugation cascades, many are observed in certain E1 activating enzymes, E3 ligases, and Ub/Ubl proteases. Hub1 and Esc2, proteins where autonomous Ubl folds are observed, may also be considered as type II Ubl’s as they have not been observed conjugated to substrates.[20−22] The protein FUBI is genetically fused with the 40S ribosomal protein S30 as part of the FAU protein.[23] Although FAU undergoes proteolytic processing to release FUBI with a C-terminal diglycine motif, a hallmark of type I Ubl’s, it is considered as a de facto type II Ubl because there is no evidence to date that FUBI undergoes conjugation. In contrast, type I Ubl’s are activated and conjugated to substrates and include the SUMO, NEDD8, ATG8, ATG12, URM1, UFM1, FAT10, and ISG15 protein families. These Ubl’s, and the enzymes required for their conjugation, are the primary subject of this review. The Ubl’s, E1, and E2 for each Ubl family are presented in Table for Homo sapiens and Saccharomyces cerevisiae.
Table 1

Ubl’s and Their E1 and E2 in Human and Budding Yeast

 proteins in H. sapiens
proteins in S. cerevisiae
familyUblE1E2UblE1E2
SUMOSUMO1, SUMO2, SUMO3, SUMO4aUBA2/SAE1UBC9Smt3Uba2/Aos1Ubc9
NEDD8NEDD8UBA3/NAE1UBC12, UBE2FRub1Uba3/Ula1Ubc12
ATG8LC3A, LC3B, LC3B2, LC3C, GABARAP, GABARAPL1, GATE-16aATG7ATG3Atg8Atg7Atg3
ATG12Atg12ATG7ATG10Atg12Atg7Atg10
URM1URM1UBA4Urm1Uba4
UFM1UFM1UBA5UFC1
FAT10FAT10UBA6UBE2Zb
ISG15ISG15UBA7UBCH8b

SUMO5 and GABARAPL3 were not included in this table as they are likely pseudogenes.

UBE2Z and UBCH8 can also work with ubiquitin.

SUMO5 and GABARAPL3 were not included in this table as they are likely pseudogenes. UBE2Z and UBCH8 can also work with ubiquitin.

Gene Number and Organization

While some Ubl families can be identified in all eukaryotes, others appear unique to certain phyla. For instance, the ISG15 and FAT10 families appear to function in the immune system of higher eukaryotes and are absent from lower eukaryotes such as fungi. In addition to possessing unique Ubl’s, some eukaryotic organisms have expanded the number of genes in a particular Ubl family. While budding yeast possesses one gene for SUMO and one for ATG8, human includes at least four genes for SUMO (SUMO1, SUMO2, SUMO3, and SUMO4) and seven for ATG8 (LC3A, LC3B, LC3B2, LC3C, GABARAP, GABARAPL1, and GATE-16). Inclusion of SUMO4 as a bona fide Ubl is controversial as this isoform only appears to be processed and conjugated under stress conditions.[24−26] Two additional Ubl’s, SUMO5[27] and GABARAPL3,[28] could represent pseudogenes as the proteins have yet to be detected. Genes encoding ubiquitin are often observed as head-to-tail concatemers or as fusions with ribosome proteins, and they must be processed by proteases to expose a C-terminal diglycine motif that is required for conjugation. Although encoded as standalone genes, Ubl family members such as UFM1, ISG15, NEDD8, SUMO, and ATG8 encode preproteins that must be processed by proteases to generate a mature form that includes a C-terminal glycine or diglycine motif that is suitable for activation and conjugation. In contrast, genes for ATG12, FAT10, and URM1 are genetically encoded and translated in their mature form. Although all of these Ubl’s can be conjugated to substrates, processing and conjugation is not always a prerequisite for function. For example, free ISG15 can function in humans to stabilize USP18 and prevent autoinflammation via noncovalent interactions in a complex that does not require conjugation.[29]

Structure

The mature form of the Ubl generally includes the β-grasp domain described above and a short flexible C-terminal tail that typically ends with at least one glycine residue. Variations exist, and some Ubl’s include additional elements that are conserved and sometimes structured within the particular Ubl family (Figure ). For example, FAT10 and ISG15 include tandem β-grasp domains separated by a short and sometimes flexible linker. NMR analysis of FAT10 suggests that there is no significant interaction between the two UBL domains,[30] whereas multiple ISG15 crystal structures suggest a defined interface between the two domains.[31−33] Members of the ATG8 family possess two additional α-helices near their N-terminus[34] while ATG12, UFM1, and members of the SUMO protein family include N-terminal extensions that are often missing in available structures, suggesting that they are disordered or at least highly flexible.
Figure 2

Structure of select Ubl’s. (A) Structure of ubiquitin (PDB 1UBQ). (B) Structure of ISG15 (PDB 1Z2M). The first β-grasp domain is colored gray. (C) Structure of ATG8 (GATE-16; PDB 1EO6). The N-terminal extension that contains two α-helices is colored gray. (D) Structure of SUMO (SUMO1; PDB 1A5R). The flexible N-terminal extension is colored gray. Root-mean-square deviations (RMSDs) are calculated between ubiquitin Cα and the Cα of ISG15, ATG8, or SUMO1 that are colored in yellow.

Structure of select Ubl’s. (A) Structure of ubiquitin (PDB 1UBQ). (B) Structure of ISG15 (PDB 1Z2M). The first β-grasp domain is colored gray. (C) Structure of ATG8 (GATE-16; PDB 1EO6). The N-terminal extension that contains two α-helices is colored gray. (D) Structure of SUMO (SUMO1; PDB 1A5R). The flexible N-terminal extension is colored gray. Root-mean-square deviations (RMSDs) are calculated between ubiquitin Cα and the Cα of ISG15, ATG8, or SUMO1 that are colored in yellow.

Substrates

Ubiquitin is mainly conjugated to substrates via a peptide bond between the ubiquitin C-terminal glycine and the Nζ of lysine residues although it is also possible to conjugate ubiquitin to serine, to threonine, to cysteine, or to the N-terminal amine in proteins (reviewed in Stewart et al.[7]). Most Ubl’s are ultimately conjugated to proteins with one notable exception being ATG8, which is conjugated to phosphatidylethanolamine (PE)[35] and possibly other lipid head groups.[36] The number of identified substrates conjugated by a particular Ubl varies considerably with tens of thousands of substrates identified for ubiquitin,[37] thousands for SUMO,[38] just a few for URM1,[39] and two for ATG12.[40,41] The capacity to form chains also varies between Ubl’s. Chains were first reported for ubiquitin,[42] and it was shown that all of its lysine residues[43] as well as its N-terminal amine group[44] can be used to build chains that differ in structure and function to diversify its signaling capacity.[45] Evidence suggests that some members of the SUMO family form chains in vivo,[46] while chains have also been detected in vivo for NEDD8 and UFM1.[47,48] Some other Ubl’s can form chains in vitro; however, convincing data for their existence and function in vivo is still lacking.

Ubl Binding Motifs

Noncovalent interactions with Ubl’s mediate a variety of functions before, during, and after conjugation. Proteins use domains and motifs of varying size and structure to interact with ubiquitin (these are reviewed in Husnjak and Dikic[49]); however, most bind ubiquitin via a hydrophobic patch centered around isoleucine 44 with affinities in the high micromolar range.[49] NEDD8 is closely related to ubiquitin by sequence and structure,[50] and motifs that form noncovalent interactions with NEDD8 are similar to those that contact ubiquitin.[51,52] FAT10 was also shown to interact with a protein containing UBA domains, perhaps consistent with its high sequence similarity to ubiquitin.[53,54] While noncovalent contacts to Ubl members can encompass similar surfaces, they are generally tailored to form specific Ubl/receptor complexes (Figure ).
Figure 3

Ubl binding motifs. (A) Structure of SUMO/RANBP2 (PDB 1Z5S). (B) Structure of ATG8/ATG19 (PDB 2ZPN). (C) Structure of UFM1/UBA5 (PDB 5HKH). (D) Structure of ATG12/ATG3 (PDB 4NAW). The Ubl’s are in cartoon representation colored yellow, while the Ubl binding motifs are in cartoon representation colored gray. Selected residues of the Ubl binding motifs are presented in stick representation. Backbone interactions that mediate β-strand complementation for SUMO/RANBP2 and ATG8/ATG19 are highlighted at the top.

Ubl binding motifs. (A) Structure of SUMO/RANBP2 (PDB 1Z5S). (B) Structure of ATG8/ATG19 (PDB 2ZPN). (C) Structure of UFM1/UBA5 (PDB 5HKH). (D) Structure of ATG12/ATG3 (PDB 4NAW). The Ubl’s are in cartoon representation colored yellow, while the Ubl binding motifs are in cartoon representation colored gray. Selected residues of the Ubl binding motifs are presented in stick representation. Backbone interactions that mediate β-strand complementation for SUMO/RANBP2 and ATG8/ATG19 are highlighted at the top.

SUMO Binding Domains

SUMO-interacting motifs (SIMs) are the most prevalent and best characterized motifs that bind SUMO proteins. SIMs are found in the SUMO activating enzyme UBA2, in all known SUMO E3s, in some ubiquitin E3s, as well as in some SUMO substrates and receptors. SIMs are typically composed of four hydrophobic residues that are flanked by acidic residues or residues that can be phosphorylated to generate negative charge, although variants include SIMs with an acidic or phosphorylatable residue at the second or third position. SIMs are presumably disordered in solution but adopt a β-strand that complements the SUMO β-sheet in either parallel or antiparallel orientations, thereby complementing a hydrophobic groove on SUMO formed by its central α-helix and second β-strand (Figure A). Structures suggest that SIMs can adopt a preferred orientation; however, the binding orientation may not always be an intrinsic characteristic of the SIM sequence as there is at least one case where both orientations were observed in dynamic exchange.[55] Molecular dynamics simulations suggest that the antiparallel arrangement results in a complex with higher stability compared to the parallel orientation.[56] SIMs that adopt antiparallel orientations also appear to tolerate more changes within their sequence,[56,57] possibly because the antiparallel orientation establishes more backbone-mediated interactions.[57] SIMs typically interact with SUMO with affinities in the micromolar range.[58−62] In higher eukaryotes that possess multiple SUMO isoforms, some SIMs appear to have evolved specificity for certain SUMO isoforms. The structural basis for this preference remains elusive, and the measured differences in affinity between SIMs and different SUMO isoforms rarely exceed 1 order of magnitude.[60,61,63] As mentioned earlier, SUMO–SIM interactions can be modulated by post-translational modifications such as phosphorylation at positions within or immediately adjacent to the hydrophobic portion of the SIM.[64] In most cases, phosphorylation increases the strength of SUMO–SIM interactions by at least an order of magnitude.[61,62,65] The structural bases for phosphorylation-mediated stabilization of these interactions came from three studies, highlighting contacts between the phosphorylated SIM residues and adjacent SUMO residues, albeit at differing positions.[61,62,65] Comparing binding modes for phosphoSIMs in DAXX and PML suggests some flexibility in recognition of acidic or phosphorylated residues.[65] In addition, differences between SUMO isoforms can result in different contacts to the acidic/phosphorylated residues in the SIM,[60−62,66] suggesting that a particular SIM might gain or lose specificity for a particular SUMO isoform based on its phosphorylated state. In addition to phosphorylation, SUMO–SIM interactions may be modulated by acetylation of certain SUMO lysine residues.[67] Interestingly, SUMO lysine acetylation could counter the effects of SIM phosphorylation as the lysine residues in question constitute those that mediate contacts to the phosphorylated residues in SIMs.[67] Finally, some proteins possess multiple SIMs that can bind multiple SUMO molecules and/or SUMO chains, presumably stabilizing interactions by increasing avidity. With that said, a concept of SIM dominance has emerged based on the observation that certain SIMs within a cluster appear essential for binding SUMO chains whereas others appear dispensable.[68−71] In addition to SIMs, two other types of SUMO-binding domains have been identified. The ZZ zinc finger domain of HERC2 interacts with SUMO in a zinc-dependent manner.[72] This domain, present in at least 20 human proteins, has a 20-fold preference for SUMO1 over SUMO2.[72] Analysis of the interaction between SUMO1 and the ZZ domain of CBP/p300 by NMR and through mutagenesis reveals that the ZZ domain binds a different surface of SUMO1 than the one employed to engage SIMs.[73] Finally, a MYM-type zinc finger present in at least four human proteins has been shown to interact with SUMO1 in a zinc-independent manner, although in this case MYM interactions appear to use the same SUMO surface for SIM interactions because mutants that destabilize SUMO–SIM interactions also destabilize MYM interactions.[74]

LC3 Interacting Regions (LIRs)

LIRs, also termed LC3 recognition sequence (LRS) or ATG8-interacting motif (AIM), mediate interactions between multiple proteins within the ATG8 family (reviewed in Birgisdottir et al.[75]). The LIR motif includes a core ΘxxΓ consensus sequence, where Θ is an aromatic residue (typically Trp, Tyr, or Phe) and Γ is an hydrophobic residue (typically Leu, Ile, or Val). An extended consensus called xLIR[76] accounts for the observation that the core LIR is often preceded or intervened by acidic residues or residues that can be phosphorylated. Numerous structures have been reported that delineate the requisite ATG8–LIR interactions, although it is worth noting that some were obtained through use of a linear fusion strategy where the LIR sequence is fused to the N-terminus of ATG8 resulting in structures that show the LIR from one molecule interacting with ATG8 from the neighboring ATG8 molecule in the crystal lattice.[77−80] LIR-containing proteins bind a region of ATG8 in a manner that shares some similarity to mechanisms employed in SIM–SUMO interactions as the LIR sequence adopts a β-strand conformation that complements ATG8 β-sheet in a parallel orientation (Figure B) with the aromatic and hydrophobic residues of the LIR motif accommodated in two hydrophobic pockets termed the HP1 and HP2 sites (sometimes called the W and L sites). While the HP2 site has a structural equivalent in SUMO, the HP1 site is unique to the ATG8 protein and its formation is partially dependent on residues and surfaces composed by the N-terminal helices that are unique to ATG8 proteins. A structural and kinetic study of the interaction between ATG13 and three LC3 isoforms suggests a sequential binding model.[78] In this model, binding of a hydrophobic residue to the preformed HP2 site tethers the LIR peptide to ATG8 prior to rearrangement of the HP1 site that includes movement of ATG8 Lys49 and Phe52 to allow binding of the aromatic residue. Acidic residues intervening or preceding the LIR core motif interact with arginine residues on the ATG8 surface and contribute to the specific binding to certain isoforms.[75] Serine/threonine residues are sometimes observed in or adjacent to the LIR core sequence, and their phosphorylation can increase[77,81,82] or decrease[83,84] the strength of the LIR–ATG8 interaction. FUNDC1 constitutes an example where phosphorylation of the LIR motif leads to a decreased interaction with ATG8. Indeed, FUNDC1 acts as a receptor during hypoxia-induced mitophagy that is constitutively phosphorylated on a serine residue close to the LIR motif and the tyrosine residue of the LIR.[83,84] Dephosphorylation of these residues upon hypoxia leads to increased association of FUNDC1 with ATG8.[83,84] The NMR structure of a nonphosphorylated FUNDC1/ATG8 complex suggests that tyrosine phosphorylation decreases ATG8 binding through an electrostatic repulsion, notably with an aspartate residue.[85] This structure further reveals an atypical binding where the HP1 site is simultaneously occupied by the LIR tyrosine residue and an adjacent valine residue.[85] Natural variations of the LIR motif exist, and extension of the LIR by a helix is observed in FYCO1. In this case, the LIR motif is succeeded by a short helix that positions a glutamate residue to interact with an arginine residue located on the α-helix of ATG8.[86,87] The capacity of ATG8 isoforms to accommodate extended LIR motifs has been proposed as another potential mechanism to explain isoform selectivity.[86,87] In support of this model, the extended LIR binds LC3A with a submicromolar affinity whereas binding of canonical LIRs typically occurs in the micromolar range. The presence of a noncanonical LIR has been hypothesized in the Ubl ATG12 where the aromatic and hydrophobic residues of the LIR are provided by residues that are close in space but not in sequence.[88] The structural details of this interaction remain unknown. Recently, a small motif in UBA5 capable of interacting with both ATG8 homologues and UFM1 was discovered[89] and termed LIR/UFIM (UFM1-interacting motif). Interaction between this motif and UFM1 is structurally reminiscent of the interaction between SUMO and an antiparallel SIM (Figure C).[89] Notably, a hydrophobic residue occupies a cavity equivalent to HP2 while a tryptophan residue folds over a small hydrophobic region at the start of the central α-helix of UFM1. Although this structure provides a compelling example of cross-reactivity where one sequence can bind more than one type of UBL, the biological consequences of this dual interaction awaits further investigation. Finally, the structure of an ATG3/ATG12–ATG5/ATG16 complex reveals that 13 residues of ATG3 can interact with the second β-strand and the α-helix of ATG12.[90] This sequence forms a short β-strand that runs parallel to the β-strand of ATG12 before continuing as an amphipathic α-helix that provides additional acidic and hydrophobic contacts with ATG12 that contributes to interactions in the midnanomolar range (Figure D).[90]

E1 Activating Enzymes

As the name implies, E1 activating enzymes are required to activate Ub/Ubl’s for subsequent steps along the conjugation cascade. E1s have historically been divided between canonical and noncanonical E1s (reviewed in Schulman and Harper[91]). Canonical E1s perform three distinct chemical reactions (Figure ). In the first reaction, E1s bind ATP, magnesium and the Ubl’s to catalyze formation of a high-energy acyl adenylate intermediate with subsequent release of pyrophosphate. In the second reaction, an E1 catalytic cysteine attacks the Ubl∼adenylate to catalyze formation of a high-energy thioester bond between the E1 and Ubl (E1∼Ubl) and release of the AMP product. Finally, the E1∼Ubl catalyzes transthioesterification to an E2 to form a high energy thioester-linked E2∼Ubl product. The first structural models for E1 mediated adenylation were gleaned from a MoeB–MoaD complex in 2001,[10] a noncanonical E1/Ubl pair. MoeB and MoaD are evolutionarily related to E1 and ubiquitin, respectively, and participate in the biosynthesis of the molybdenum cofactor in bacteria. While MoeB is capable of adenylating the Ubl MoaD, it lacks a catalytic cysteine that is present in canonical E1s. In 2003, the first structure of a canonical E1 was determined, that of the NEDD8 E1 alone[92] and in complex with NEDD8.[93] Since then, additional structural work has provided snapshots for many of the intermediates during E1 activation. Importantly, information obtained for E1s using different systems and different organisms can be extrapolated to other E1s due to their high sequence, structural, and functional similarities. As mentioned before, despite these similarities, E1s are often divided into canonical and noncanonical E1 families. Canonical E1s activate the SUMO protein family, NEDD8, ubiquitin, FAT10, and ISG15 and include two pseudosymmetric adenylation domains encoded by one or two genes. In contrast, noncanonical E1s form homodimers similar to that observed for MoeB. These include E1s for the ATG8 and ATG12 protein families, UFM1 and URM1.
Figure 4

Canonical E1 chemical reactions. (1) The E1 binds a Ubl and ATP and catalyzes adenylation of the Ubl. (2) The E1 catalytic cysteine attacks the Ubl∼AMP resulting in formation of an E1∼Ubl thioester. (3) The E1 adenylates a second Ubl. (4) The Ubl is transferred to an E2 through a transthioesterification reaction.

Canonical E1 chemical reactions. (1) The E1 binds a Ubl and ATP and catalyzes adenylation of the Ubl. (2) The E1 catalytic cysteine attacks the Ubl∼AMP resulting in formation of an E1∼Ubl thioester. (3) The E1 adenylates a second Ubl. (4) The Ubl is transferred to an E2 through a transthioesterification reaction.

Canonical E1s

Structural information for canonical E1s mainly derives from studies on E1s in the NEDD8, SUMO, and ubiquitin pathways. There is, to date, no structural information available for activation of ISG15 by E1UBA7 (also known as E1UBE1L) or FAT10 by E1UBA6 (also known as E1UBE1L2). Canonical E1 enzymes possess a common architecture (Figure A) composed of two pseudosymmetric adenylation domains: the inactive adenylation domain (IAD) and the active adenylation domain (AAD). Although some differences exist, each canonical E1 includes an insertion within the IAD that is called the first catalytic cysteine half-domain (FCCH) in the ubiquitin E1 and the NAE1 catalytic cysteine (CC) domain in the NEDD8 E1. The SUMO E1 also includes an insertion at this position, but it was not formally named because of its small size and apparent disorder in available crystal structures. In contrast to the variance observed among insertions in the IAD, each canonical E1 includes a conserved CYS domain inserted within the AAD that contains the E1 catalytic cysteine that varies in size through addition of unique structural elements. Most canonical E1s include a ubiquitin fold domain (UFD) at the C-terminal end of the AAD that assists in selecting cognate E2 ubiquitin conjugating enzymes (UBCs).[94−97]
Figure 5

Domain organization of canonical E1s. (A) Primary structure of canonical E1s for NEDD8, SUMO, and ubiquitin. (B) Structure of human NEDD8∼E1NAE1/UBA3/E2UBC12/ATP (PDB 2NVU). Adenylation domains are shown in a Gaussian surface representation, while other domains are in cartoon representation. The sulfur atom of the active site cysteine is in sphere representation colored yellow. Positions for ATP, the crossover loop, and the active site cysteine are highlighted by arrows.

Domain organization of canonical E1s. (A) Primary structure of canonical E1s for NEDD8, SUMO, and ubiquitin. (B) Structure of human NEDD8∼E1NAE1/UBA3/E2UBC12/ATP (PDB 2NVU). Adenylation domains are shown in a Gaussian surface representation, while other domains are in cartoon representation. The sulfur atom of the active site cysteine is in sphere representation colored yellow. Positions for ATP, the crossover loop, and the active site cysteine are highlighted by arrows. Structurally, canonical E1s resemble canyons (Figure B). The IAD and AAD form the base of the canyon, the UFD domain forms one wall while the FCCH/insertion and CYS domains form the opposing wall.[93] This structural organization is apparent in NEDD8, SUMO, and ubiquitin E1s although differences in the fold of the FCCH and size of the CYS domains exist as mentioned earlier. In all cases, the CYS domain is connected at its N-terminal end to the AAD through a crossover loop that passes over the Ubl C-terminal tail, with the catalytic cysteine located within a helix that immediately follows the crossover loop. The SUMO E1 possesses an α-helix termed the CYS cap that covers its catalytic cysteine prior to thioester bond formation.[98] The CYS cap contains acidic residues that could raise the catalytic cysteine pKa thereby reducing its reactivity.[98] The CYS cap becomes disordered when the CYS domain rotates to form the thioesterification active site. The IAD and AAD interact via an extended composite interface to form a single ATP binding site. The UFD, which serves to select cognate E2s, is connected to the AAD through a flexible hinge. In the SUMO and NEDD8 systems, this domain may undergo a transition from a partially disordered state to an ordered state upon E2 binding, and it must rotate to bring the E1 and E2 cysteine residues into proximity for thioester transfer.[99,100] Also in the SUMO and NEDD8 E1s, two residues in the UFD hinge and two residues in the crossover loop coordinate a structural zinc ion. Interestingly, the UFD domain of NEDD8 E1 is absent in certain yeast phyla (see below).

Adenylation Reaction

Structures of MoeB–MoaD complexes derived from Escherichia coli provided the first structural insights to the E1 adenylation reaction (Figure A).[10] Although MoeB lacks the ability to transfer its Ubl to E2s, we introduce it here because these structures illustrated the basic mechanism of adenylation, features of which are common to all E1 enzymes. MoeB forms a homodimer of AADs to form two active sites that can accommodate two molecules of MoaD and ATP. The C-terminal diglycine motif of MoaD is held in close proximity to the ATP-binding pocket to position the MoaD carboxylate terminal group between the AAD and the α-phosphate of ATP. A magnesium ion, held in place by an aspartate residue, is believed to decrease electrostatic repulsion between the phosphates and the C-terminal carboxylate of MoaD.[10] In NEDD8, SUMO, and ubiquitin E1, the ATP and the C-terminal tail of the Ub/Ubl are positioned next to each other in the AAD adenylation pocket in a manner reminiscent of MoaD and ATP (Figure B,C).[93,96,101] Despite being crystallized with ATP and magnesium and poised for adenylation, most E1/Ubl structures reveal the presence of ATP–magnesium and Ubl rather than the acyl adenylate intermediate.[93,97,101] This observation is consistent with prior studies showing that pyrophosphate release is the rate limiting step in adenylation[102] and that the back reaction occurs readily.[103] While most E1s do not undergo adenylation in the crystal, a Ub-adenylate intermediate was observed in a structure of a ubiquitin E1 where it adopts a conformation similar to the one of ubiquitin plus ATP.[104] Although generated artificially, the structure of a SUMO E1 in complex with a SUMO molecule linked to a nonhydrolyzable adenylate mimic revealed a similar configuration.[98]
Figure 6

Adenylation domains of E1 and E1-like proteins. (A) Structure of E. coli MoaD/MoeB/ATP (PDB 1JWA). (B) Structure of human E1SAE1/UBA2/SUMO/ATP (PDB 1Y8R). In both cases, Ubl and adenylation domains are depicted in cartoon and Gaussian surface representations. The ATP and last two residues of the Ubl’s are in stick representation. (C) Close-up view of interactions between ATP, the SUMO C-terminus, and the E1 adenylation pocket.

Adenylation domains of E1 and E1-like proteins. (A) Structure of E. coli MoaD/MoeB/ATP (PDB 1JWA). (B) Structure of human E1SAE1/UBA2/SUMO/ATP (PDB 1Y8R). In both cases, Ubl and adenylation domains are depicted in cartoon and Gaussian surface representations. The ATP and last two residues of the Ubl’s are in stick representation. (C) Close-up view of interactions between ATP, the SUMO C-terminus, and the E1 adenylation pocket.

Thioester Formation

The first E1 structures provided many details pertaining to the adenylation reaction, but the mechanism for pyrophosphate release and subsequent thioester bond formation remained elusive for many years as the E1 active site cysteine and Ubl∼adenylate were separated by as much as ∼35 Å. In 2010, a structure of the SUMO E1 illuminated conformational changes that were required to release pyrophosphate and to bring the active site cysteine into proximity of the Ub/Ubl∼adenylate (Figure ). This was achieved by linking SUMO and ubiquitin to a nonhydrolyzable adenylate mimic that harbored an electrophile at a position where the thioester bond is formed. Incubation of this mimic with an active E1 resulted in a stable thioether bond and tetrahedral intermediate mimic between the E1 active site cysteine and the Ub/Ubl adenylate mimic.[105] A structure of the resulting SUMO E1 complex and comparison to other SUMO E1 structures revealed several conformational changes that occur during thioester bond formation.[98] One change encompasses expulsion of the N-terminal helix of the IAD, a change that might explain how pyrophosphate is released because it forms part of the adenylation active site and contributes a critical arginine that contacts the ATP γ phosphate (Figure ). Another major conformational change includes rotation of the CYS domain which brings the catalytic cysteine into proximity of the Ubl∼adenylate and forms a new composite active site that is capable of thioester bond formation. It remains unclear how or if the E1 cysteine is specifically activated for catalysis by neighboring amino acid side chains, but the structure suggests that an α-helix in the AAD, positioned just below the adenylate, could stabilize the transition state during thioester bond formation via hydrogen bonding or perhaps the helix dipole. Following thioester bond formation, the CYS domain is thought to return to its original position through changes that restore the original conformation of the adenylation active site to permit binding of a second Ubl and ATP.
Figure 7

Thioester formation in the SUMO E1. (A) Structure of human E1SAE1/UBA2/SUMO–AMSN (PDB 3KYC). (B) Structure of human E1SAE1/UBA2/SUMO–AVSN (PDB 3KYD). For simplicity, only the CYS domain and SUMO–AMSN or SUMO–AVSN is presented. The sulfur atom of the active site cysteine is in sphere representation colored in yellow. A color gradient is applied on the CYS domain to highlight the different orientations observed in the two structures. As an additional landmark, Lys336, a lysine residue in the CYS domain, is presented in sphere representation. AMSN and AVSN are nonhydrolyzable AMP mimics with AVSN covalently linked to the E1 catalytic cysteine.

Thioester formation in the SUMO E1. (A) Structure of human E1SAE1/UBA2/SUMO–AMSN (PDB 3KYC). (B) Structure of human E1SAE1/UBA2/SUMO–AVSN (PDB 3KYD). For simplicity, only the CYS domain and SUMO–AMSN or SUMO–AVSN is presented. The sulfur atom of the active site cysteine is in sphere representation colored in yellow. A color gradient is applied on the CYS domain to highlight the different orientations observed in the two structures. As an additional landmark, Lys336, a lysine residue in the CYS domain, is presented in sphere representation. AMSN and AVSN are nonhydrolyzable AMP mimics with AVSN covalently linked to the E1 catalytic cysteine.

Ubl Transfer to E2s

Thioester transfer between the E1∼Ubl and E2 requires juxtaposition of the E1 and E2 active sites. In the case of the NEDD8 E1, formation of an E1∼NEDD8 thioester triggers a change in the conformation of the UFD that exposes an E2 binding surface.[97] The interaction between NEDD8 E1 and E2UBC12 or E2UBE2F is bipartite. One interaction that appears unique to NEDD8 E1/E2 complexes involves the stabilization of the N-terminal extensions of E2UBC12 or E2UBE2F that adopt extended conformations within a groove on NEDD8 E1 (Figure A).[106,107] The other interaction involves contacts between the UFD domain of NEDD8’s E1 and the core domains of E2UBC12 or E2UBE2F in a manner that structurally resembles other ubiquitin–UBD interactions[94] with the β-sheet of the UFD contacting the N-terminal helix and β1−β2 loop of E2UBC12 or E2UBE2F.[94,107] The structure of E2UBC12 in complex with NEDD8 E1 doubly loaded with NEDD8 (one NEDD8 in the adenylation site, another linked to the E1 via a thioester bond) confirmed bipartite recognition of E2UBC12 and revealed a conformation where the E1 and E2 active sites faced each other although they remained ∼20 Å apart (Figure A).[97] Although this distance is too far to promote transthioesterification, it was hypothesized that the distance could be reduced through a hinge movement of the loop linking the AAD and UFD.[97,108] The structure of a ubiquitin E1–E2 complex where both active sites are cross-linked through a disulfide bond provided insight into additional conformational changes required for transthioesterification.[96] Indeed, further movement in the hinge between the UFD and AAD allows the E2 active site to approach the E1 active site cysteine (Figure B). In this arrangement, E2UBC4 interacts with the UFD and CYS domain while also contacting the crossover loop and the molecule of ubiquitin that is located in the adenylation site.
Figure 8

E1–E2 interaction for canonical E1 proteins. (A) Structure of human E1NAE1/UBA3/E2UBC12/NEDD8/ATP (PDB 2NVU) showing the bipartite binding of the E2UBC12 to E1. In this structure, the catalytic cysteine residues of E1 and E2 are separated by ∼20 Å. (B) Structure of S. pombe E1UBA1/E2UBC4/ubiquitin/ATP (PDB 4II2) showing juxtaposition of E1 and E2 active sites. Adenylation domains are shown in Gaussian surface representation. Other domains are in cartoon representation. The sulfur atoms of the active site cysteine residues of E1 and E2 are in sphere representation colored yellow.

E1–E2 interaction for canonical E1 proteins. (A) Structure of human E1NAE1/UBA3/E2UBC12/NEDD8/ATP (PDB 2NVU) showing the bipartite binding of the E2UBC12 to E1. In this structure, the catalytic cysteine residues of E1 and E2 are separated by ∼20 Å. (B) Structure of S. pombe E1UBA1/E2UBC4/ubiquitin/ATP (PDB 4II2) showing juxtaposition of E1 and E2 active sites. Adenylation domains are shown in Gaussian surface representation. Other domains are in cartoon representation. The sulfur atoms of the active site cysteine residues of E1 and E2 are in sphere representation colored yellow. While there is currently no structural information available for a good mimic of the transthioesterification reaction, three studies elaborated models for a tetrameric assembly consisting of a doubly loaded ubiquitin E1 with E2 in conformations suitable for thioester transfer.[96,104,109] In each case, the thioester-linked ubiquitin is bordered by the E2, the FCCH domain, and ubiquitin in the adenylation site, although the models differ somewhat. In two cases the thioester-linked ubiquitin contacts the FCCH domain,[104,109] while in another case the thioester-linked ubiquitin contacts the E2.[96] In the later case, ubiquitin was positioned in a similar but not identical conformation to the closed E2∼ubiquitin conformation (see below). The validity of these models, and their generality, remains to be tested.

Directionality of the Reaction

Each chemical step catalyzed by E1 is reversible, so how does the E1 ensure directionality? First and foremost, ATP is highly abundant, so the reaction could progress by simple mass action. With that said, several mechanisms discussed earlier appear to confer directionality to the overall reaction. Following adenylation, the E1 undergoes large conformational changes that reshape the adenylation active site into one that promotes thioester bond formation. In doing so, residues that contact ATP and are required for adenylation are displaced from the active site to allow the CYS domain to rotate and project the active site cysteine toward the Ubl∼adenylate. These conformational changes would facilitate pyrophosphate release, thus inhibiting the back reaction.[98] After thioester bond formation, the CYS domain returns to its original position, allowing reformation of the adenylation active site, a process that is most certainly enhanced by binding another molecule of ATP and Ubl. The presence of ATP and a second Ubl would likely inhibit CYS domain conformational changes, thus preventing the thioester-linked Ubl from undergoing the reverse reaction. This model is consistent with the observation that doubly loaded ubiquitin E1 is most efficient at promoting thioester transfer to E2.[110] Toggling of affinities upon E1:E2 transthioesterification was also proposed as a driving force in directionality.[97] In this case, the Ubl moiety of a Ubl∼E1 complex contributes additional surfaces to promote interactions with E2 while Ubl transfer to the E2 reduces the number of contacts between the charged E2 and the E1, thereby facilitating rapid dissociation of the charged E2 and inhibiting attack of the charged E2 by the E1. This is followed in the NEDD8 system by an additional change in the conformation of the UFD that buries E2 binding surfaces to inhibit rebinding between the charged E2 and discharged E1.[97]

Ubl Specificity

E1s serve as selectivity filters to ensure faithful transfer of cognate Ubl’s to cognate E2s because they generally exhibit specificity for one or a few Ubl’s. Indeed, reports show that once a noncognate Ubl is loaded on E1, it can be transferred to an E2 without major impediments.[111,112] While many interactions contribute to specificity between cognate E1/Ubl pairings, the identity of the third residue preceding the diglycine motif (position 72 in NEDD8 or ubiquitin) constitutes a major determinant for faithful Ubl activation by its cognate E1.[50,92,93,113] This residue is an arginine in ubiquitin, an alanine in NEDD8, and a glutamine or glutamate in SUMO. Early biochemical work revealed that A72R substitution in NEDD8 increases NEDD8’s affinity for ubiquitin E1 by approximately 2 orders of magnitude[50] while the R72A substitution of ubiquitin permits its adenylation by NEDD8 E1.[93] Furthermore, R72L substitution of ubiquitin can be activated and passed to the E2 by NEDD8 E1 while this activity could not be detected for wild type ubiquitin.[111] Structures of cognate E1–Ubl complexes revealed how different residues at this position are recognized by E1. Ubiquitin Arg72 fits into a negatively charged cavity in its E1 forming contacts to multiple residues via hydrogen bonds and hydrophobic interactions.[108] In the case of SUMO, a glutamate residue is stabilized by contacts to an arginine and a tyrosine,[101] while for NEDD8, Ala72 benefits from hydrophobic interactions with adjacent leucine and tyrosine residues.[93] Interestingly, a single R190Q mutation in NEDD8 E1 allows the charging of ubiquitin.[93] Arg190 does not contact NEDD8 Ala72, but it appears that this residue is part of a gating mechanism that prevents NEDD8 E1 from binding ubiquitin.[113] Additional contacts to the Ubl could include contacts to residues from the FCCH or related insertion domains. Similarly, interactions between SUMO and a SIM-like sequence in the C-terminus of its E1[98] could assist in SUMO specificity although this region of the E1 appears dispensable in vivo and its contribution to Ubl specificity has not been assessed. Specificity was also addressed in the case of E1UBA6 that activates two different Ubl’s, ubiquitin or FAT10,[114−116] and transfers them to E2UBE2Z.[115,117] Ubiquitin and FAT10 were reported to undergo adenylation and thioester formation with similar kinetics although FAT10 displays a much tighter binding to E1UBA6 than ubiquitin.[118] A Cys-Tyr-Cys-Ile sequence immediately preceding the diglycine motif appears important for the selective activation of FAT10 by E1UBA6 and for its transfer to E2UBE2Z.[119] Intriguingly, a FAT10 variant with this motif replaced by the Leu-Arg-Leu-Arg motif of ubiquitin displays an increased rate of transfer to E2UBE2Z when compared with the wild type protein.[119]

E2 Specificity

In addition to E1 specificity for cognate Ubl’s, E1s must also select cognate E2s from a large pool of structurally similar proteins. In this regard, E1s appear to select for cognate E2s while also discriminating against noncognate E2s. The UFD appears to be the main specificity determinant for E1:E2 interactions. The structures of five UFD:E2 pairs have been solved,[96,97,107,120,121] and comparison of these structures suggests that conserved insertions and deletions are used to prevent noncognate pairing.[96] Consistent with the idea the UFD domain constitutes the main E2 binding platform for ubiquitin E1, its deletion leads to a sharp decrease in the rate of E2 transthioesterification.[95] For the interaction between NEDD8 E1 and its E2s, specificity also derives from a tight interaction between the N-termini of E2UBE2F or E2UBC12 and the residues of a groove in NEDD8 E1[106,107] and deletion of the NEDD8 E1 UFD domain was shown to have a limited effect on the rate of E2 transthioesterification,[95] consistent with the absence of this domain in certain yeast strains. This deletion however increases the rate of noncognate E2 charging in agreement with a role of the UFD domain as a selectivity filter.[95] A study aimed at understanding the bases of E2UBC12 specificity for NEDD8 further revealed that certain E2UBC12 residues oppose ubiquitin charging and, through alanine mutagenesis, the vestigial preference of E2UBC12 for ubiquitin can be partially restored.[112] Finally, structural comparison of E1UBA2/E2UBC9 pairs for human and yeast suggests a general conservation of the interaction interface between these species.[121]

Regulation

While regulation of E1s has not been a major focus of investigation, several lines of evidence suggest that E1s could be regulated. For instance, the CYS domain of the SUMO E1 was shown to be targeted by SUMO modification to decrease its interaction with E2UBC9, possibly though steric hindrance.[122] SUMO modification on the E1 C-terminal domain has also been shown to affect the subcellular localization of the E1.[123] Perhaps most intriguing are the observations that, under oxidative conditions, the SUMO E1 can form a disulfide bond with its E2 UBC9,[124] thereby inactivating both enzymes. A similar model was proposed for the ubiquitin E1.[125] There are also examples of proteins regulating E1 activity through direct interaction. LMO2 binds the UFD domain of E1UBA6 and acts as a competitive inhibitor of E1UBA6, thereby decreasing FAT10 loading on E2USE1 and global protein FATylation.[126] Another study reported that glycyl-tRNA synthase (GlyRS) uses its anticodon binding domain to target NEDD8 E1 where it captures NEDD8∼ E2UBC12 through an interaction that depends on the catalytic domain of GlyRS but not on its tRNA synthase activity.[127] GlyRS, but not other tested tRNA synthases, appears to stabilize NEDD8∼E2UBC12 and was posited as a global regulator of protein neddylation.

Noncanonical E1s

Noncanonical E1s form obligate homodimers and do not possess a CYS domain. Instead, their catalytic cysteine is positioned after a crossover loop that is close to the adenylation pocket. Work on E1ATG7 and E1UBA5 provided considerable structural insight into mechanisms utilized by noncanonical E1s. Comparatively, much less is known concerning E1UBA4, which plays a dual role in protein conjugation and sulfur metabolism.[128]

UBA4: URM1 E1

E1UBA4 is a homodimeric protein that includes an adenylation domain followed by a rhodanese-like domain (RLD) and functions as a sulfurtransferase. It is not yet clear how E1UBA4 activates URM1 for downstream protein conjugation or if this process even involves the formation of a thioester intermediate,[39,129,130] as is the case for other E1s. Protein modification by URM1 is dependent on a catalytic cysteine located in the RLD, supporting a mechanism where the adenylated URM1 is attacked by a persulfide group on the RLD’s catalytic cysteine to yield a disulfide-linked E1UBA4–S–S–URM1 intermediate.[130] Attack of this species by a cysteine in the adenylation domain would result in release of a URM1 thiocarboxylate, whereas attack by a substrate lysine residue would rather result in a URM1–protein conjugate.[130]

ATG7: Dual E1 for ATG8 and ATG12

E1ATG7 activates two different Ubl’s, ATG12 and ATG8, and respectively pairs them with E2ATG10 and E2ATG3. E1ATG7 includes two domains connected by a short flexible hinge.[131] The N-terminal domain, termed ATG7-NTD, is unique to the E1ATG7 family of proteins, whereas its C-terminal domain, termed ATG7-CTD, constitutes the AAD common to all E1s. The structure of full-length E1ATG7 resembles a gliding bird where the CTD dimer forms the body while the NTD constitutes the wings.[132] This bipartite organization is important for E1ATG7’s ability to pair two different Ubl’s with their cognate E2s. Structures of ATG7-CTD–ATG8 complexes show similarity with E1 and E1-like proteins with respect to the positions of the ATP[131] and ATG8 C-terminal tail.[131,133] Interestingly, deletion of the last 13 C-terminal residues of E1ATG7 affects ATG8 but not ATG12 conjugation[134] and a peptide encompassing those residues was shown to interact with ATG8 by NMR.[131] However, this interaction is incompatible with the E1ATG7/ATG8 conformation captured in the crystal structure where the visible part of the C-terminus of E1ATG7 forms a helical pad that interacts with a different region of ATG8. These results suggested a sequential recruitment model where ATG8 first interacts with the E1ATG7 C-terminus prior to engaging the adenylation domain.[131] Contrary to canonical E1s, thioester bond formation does not require extensive remodeling of E1ATG7 as structures of E1ATG7 bound to ATG8 show the catalytic cysteine already poised on a loop just 7 Å from ATG8’s C-terminal carboxylate. The conformation required for thioester bond formation has not yet been elucidated. E1ATG7 interacts with two E2s: E2ATG10 and E2ATG3. An insertion of about 80 residues in the core domain of E2ATG3, termed the flexible region, was initially shown to be important for contacting E1ATG7.[135] The binding site was later refined to 13 residues that form a helical structure that interacts with ATG7-NTD.[136] Mutation of certain E1ATG7 residues that interact with E2ATG3’s flexible region only affect E2ATG3 but not E2ATG10 interaction, suggesting differential binding requirements for each E2.[136] To better understand the bases for dual E2 specificity of E1ATG7, structures were determined with the E1ATG7 active site cysteine cross-linked to the active site cysteine of E2ATG10 and E2ATG3 (Figure ).[132] These structures showed that both E2s bind E1ATG7 between the NTD and CTD domain in a region called “under-wing” and that variations in the relative orientation of ATG7-NTD and ATG7-CTD enable unique accommodation of each E2. Furthermore, these structures showed that E2ATG10 and E2ATG3 use different structural elements to contact ATG7-NTD. While E2ATG3 uses an α-helix, E2ATG10 uses a β-hairpin. Importantly, these structures confirmed the trans mechanism as previously envisioned,[131,133,136] as the E2 bound by one subunit of the E1ATG7 dimer is cross-linked to the other subunit. These findings were also corroborated by a contemporaneous study that showed the same trans mechanism using structures of E2ATG3/ATG7-NTD and E2ATG10/ATG7-NTD.[137] These E1ATG7/E2 structures provided considerable insight into E2 selection, although the structural basis for selective pairing of ATG8 with E2ATG3 and ATG12 with E2ATG10 remains elusive.
Figure 9

E1–E2 interaction for ATG7. (A) Structure of S. cerevisiae E1ATG7–E2ATG3 (PDB 4GSL). (B) Structure of S. cerevisiae E1ATG7–E2ATG10 (PDB 4GSK). In both cases, E2 binding occurs between the NTD and CTD domains of E1ATG7. Dashed lines represent regions missing elements in the crystal structure. Sulfur atoms of the catalytic cysteine residues of E1ATG7 and E2ATG3 are in sphere representation colored yellow. The catalytic cysteine of E2ATG10 is not visible in the structure. The position of ATG8 or ATG12 on the adenylation domain is indicated by a yellow oval.

E1–E2 interaction for ATG7. (A) Structure of S. cerevisiae E1ATG7–E2ATG3 (PDB 4GSL). (B) Structure of S. cerevisiae E1ATG7–E2ATG10 (PDB 4GSK). In both cases, E2 binding occurs between the NTD and CTD domains of E1ATG7. Dashed lines represent regions missing elements in the crystal structure. Sulfur atoms of the catalytic cysteine residues of E1ATG7 and E2ATG3 are in sphere representation colored yellow. The catalytic cysteine of E2ATG10 is not visible in the structure. The position of ATG8 or ATG12 on the adenylation domain is indicated by a yellow oval.

UBA5: UFM1 E1

E1UBA5 represents a minimalistic E1 composed of a single domain[138] that is necessary and sufficient for UFM1 activation and thioesterification.[139] The structure of an E1UBA5/UFM1 complex shows a homodimeric arrangement similar to that observed for MoeB.[140] Comparison of structures of E1UBA5 and E1UBA5/UFM1 reveals a rearrangement of the crossover loop upon UFM1 binding that repositions the catalytic cysteine closer to the C-terminal end of UFM1.[89,139] This would facilitate the formation of an E1∼UFM1 thioester, a process that is also stimulated by binding of E2UFC1.[141] Contrary to canonical E1s, UBA5 does not undergo a second round of adenylation following thioester formation.[141] However, thioester transfer is accelerated by E1UBA5 binding of ATP and magnesium.[141] Similar to E1ATG7, thioester transfer of UFM1 from E1UBA5 to E2UFC1 was shown to occur via a trans mechanism.[140] By analogy to canonical E1s, UBA5 was postulated to possess a UFD that would bind E2UFC1. This is based on experiments showing interactions between the C-terminus of E1UBA5 and E2UFC1[142] and observations that deletion of the C-terminus of E1UBA5 abrogates UFM1 loading on E2UFC1.[139] A region of 23 residues with helical propensity in the E1UBA5 C-terminus however appears to be sufficient for E2UFC1 interaction, suggesting that the E1UBA5 C-terminus does not adopt a ubiquitin fold.[143] The C-terminus of E1UBA5 also contains an LIR/UFIM motif that interacts with UFM1. In the context of the homodimeric E1UBA5, this motif binds UFM1 in trans(140) to facilitate its activation.[89] Mutation or deletion of this motif does not abolish UFM1 activation, suggesting that this motif does not constitute the only determinant for UFM1 selection.[89,139] Furthermore, this motif was shown to interact with certain members from the ATG8 family although this interaction was insufficient to trigger their activation.[89] In addition to UFM1, E1UBA5 was also reported to activate SUMO2 in vitro;[144] however, details of this interaction remain unclear.

E2 Conjugating Enzymes

E2 conjugating enzymes can be divided into canonical and noncanonical E2s (Figure ). Canonical E2s include those that carry ubiquitin, SUMO, NEDD8, ISG15, and FAT10 as thioester adducts. They share a common architecture called the UBC fold comprised of approximately 150 residues that typically includes four α-helices and four β-strands with the catalytic cysteine located between β-strand 4 and α-helix 2. In addition to the UBC core, some E2s contain N- and/or C-terminal extensions or large insertions after the catalytic cysteine. As such, canonical E2s have historically been subdivided into four classes according to these criteria.[145] More recent phylogenic analyses have further subclassified canonical E2s into 17 classes.[146] Noncanonical E2s, which comprise those that carry ATG8, ATG12, and UFM1 as thioester adducts, were deemed as too divergent to be included within canonical E2 classifications.[135,146] While these E2s bear some structural resemblance to canonical E2s with respect to their topology, they include more structural variations including a notable lack of the last two C-terminal helices that are observed in canonical E2s. In addition, noncanonical E2s receive their respective Ubl from noncanonical E1s while canonical E2s receive their Ubl from canonical E1s. Canonical E1/E2 interactions are mutually exclusive with E2/E3 interactions,[147] implying that E2s must disengage from E3s to be recharged by E1s. This property appears conserved for noncanonical E2s as illustrated by analysis of E2ATG3 activation.[148,149] In this section we will review mechanistic and structural findings on E2s that mediate Ubl conjugation. Complementary information on the functions and structures of E2s involved in ubiquitin conjugation can be found in a recent review.[7]
Figure 10

Canonical and noncanonical E2s. (A) A structure of E2UBC9 (PDB 1Z5S) was chosen as a representative example of canonical E2s. (B) Structure of E2ATG10 (PDB 3VX7), a noncanonical E2. (C) Structure of E2UFC1 (PDB 3EVX), a second noncanonical E2 that differs from both E2UBC9 and E2ATG10. Proteins are in cartoon representation colored cyan except for divergent structural elements that are colored white. The sulfur atoms of the catalytic cysteine residues are in sphere representation to highlight the shift in position of this residue in E2ATG10 as compared to E2UBC9 and E2UFC1.

Canonical and noncanonical E2s. (A) A structure of E2UBC9 (PDB 1Z5S) was chosen as a representative example of canonical E2s. (B) Structure of E2ATG10 (PDB 3VX7), a noncanonical E2. (C) Structure of E2UFC1 (PDB 3EVX), a second noncanonical E2 that differs from both E2UBC9 and E2ATG10. Proteins are in cartoon representation colored cyan except for divergent structural elements that are colored white. The sulfur atoms of the catalytic cysteine residues are in sphere representation to highlight the shift in position of this residue in E2ATG10 as compared to E2UBC9 and E2UFC1.

Reactions

E1s transfer Ubl’s to E2s through a transthioesterification reaction. Following this step, many E2s can transfer their Ubl to a lysine residue through an aminolysis reaction where the primary amine of a deprotonated lysine residue acts as a nucleophile to attack the thioester bond that links the Ubl and E2 (Figure ). This reaction likely involves formation of a tetrahedral intermediate and ultimately resolves with the formation of an isopeptide (amide) bond between the lysine residue and the Ubl C-terminal glycine. A similar reaction is catalyzed by E2ATG3 to conjugate ATG8 to the primary amine of PE.[35]
Figure 11

E2 chemical reactions (A) Scheme illustrating how the Ubl moiety of a E2∼Ubl thioester can be transferred to the primary amine group of a lysine residue of a protein substrate (top), to the primary amine group of PE (middle), or to a HECT or RBR E3 for subsequent transfer to the lysine residue of a protein substrate (bottom). Transfer to PE is performed by E2ATG3. Ubl transfer to E3s of the HECT or RBR families are limited to E2UBCH8, and this only allows the ISGylation of a limited number of protein substrates. (B) Mechanism for the aminolysis reaction. In this case, the primary amines are presented in their deprotonated states.

E2 chemical reactions (A) Scheme illustrating how the Ubl moiety of a E2∼Ubl thioester can be transferred to the primary amine group of a lysine residue of a protein substrate (top), to the primary amine group of PE (middle), or to a HECT or RBR E3 for subsequent transfer to the lysine residue of a protein substrate (bottom). Transfer to PE is performed by E2ATG3. Ubl transfer to E3s of the HECT or RBR families are limited to E2UBCH8, and this only allows the ISGylation of a limited number of protein substrates. (B) Mechanism for the aminolysis reaction. In this case, the primary amines are presented in their deprotonated states. Several ubiquitin E2s can transfer their ubiquitin moieties to members of the homologous to E6AP C-terminus (HECT) or RING-between-RING (RBR) families of E3 ligases through a transthioesterification reaction that generates an E3 thioester adduct to ubiquitin.[7,150] Once loaded with ubiquitin, these E3s can modify the lysine residue of protein substrates through an aminolysis reaction in the absence of the E2 (reviewed in Buetow and Huang[8]). In Ubl conjugation, this appears to be limited to E2UBCH8 that transfers its Ubl ISG15 to HERC5[151] or HHARI.[152]

Active Site Organization

Canonical E2s share an active site architecture that includes two loops that are supported by multiple intra- or interloop interactions. The catalytic cysteine is located on one of these loops, and its position within the UBC fold is conserved among canonical E2s. While the position of the catalytic cysteine is conserved, the identity and location of other residues that participate in catalysis are often different. Furthermore, and unlike many other enzymes, E2 active sites appear devoid of residues that could participate in general acid/base catalysis suggesting that they must rely on alternative mechanisms to activate or deprotonate the incoming nucleophile and/or to increase the reactivity of the thioester bond. Although many E2s have been characterized at the biochemical and structural level, several characteristics of the E2 active site can be appreciated from studies in the SUMO system that identified E2UBC9 residues adjacent to the catalytic cysteine that contribute to a microenvironment that suppresses the pKa of the incoming lysine nucleophile while optimally positioning the thioester bond and incoming lysine residue for catalysis (Figure ).[153−155] One particular study highlighted the contribution of Asn85, Tyr87, and Asp127 to isopeptide bond formation using discharge assays, model substrates, and structural analysis of mutations at each position in complex with the substrate RANGAP1.[155] Importantly, mutation of individual residues to alanine resulted in a decreased rate of discharge with little effect on substrate binding. Asn85 lies within the HPN motif that is highly conserved among E2s,[156,157] and its side chain is within hydrogen bonding distance to the carbonyl oxygen of the C-terminal glycine in structures of E2 thioester mimics or product complexes.[154,158−166] As such, this residue appears optimally positioned to stabilize the thioester bond prior to catalysis and was in fact originally proposed to stabilize the oxyanion intermediate.[156] While Asn85 is certainly important for catalysis, later studies suggested that it was also important for stabilizing the loop that contains Asp127.[155,167] Unlike Asn85 that contacts the thioester, Tyr87 and Asp127 appear to contribute to catalysis by forming an environment that orients and desolvates the incoming lysine nucleophile.[155] Asp127 is often present in other E2s as a serine or aspartate residue,[155,168] and in some cases phosphorylation of the serine residue has been shown to increase catalytic efficiency[155,168,169] although downregulation has also been reported.[170] While Tyr87 is less well conserved, the hydrophobic property of amino acids at the analogous structural position appears widely conserved.[155]
Figure 12

E2 active site. (A) Overall view and (B) close-up view of E2UBC9/SUMO–RANGAP1 (PDB 1Z5S) illustrating how the RANGAP1 substrate and SUMO are positioned in the E2UBC9 active site. This state represents a product complex after conjugation where SUMO, colored yellow, has been transferred to a lysine of RANGAP1 colored gray. SUMOD designates a SUMO protein in donor (D) configuration. E2UBC9 is in cartoon representation colored cyan. Certain residues of the E2 active site are in stick representation. The consensus sequence for substrate recognition by E2UBC9 is indicated on top. (C) Close-up of E2UBC9/RANGAP1 (PDB 1KPS) representing RANGAP binding prior to catalysis in the absence of SUMO.

E2 active site. (A) Overall view and (B) close-up view of E2UBC9/SUMO–RANGAP1 (PDB 1Z5S) illustrating how the RANGAP1 substrate and SUMO are positioned in the E2UBC9 active site. This state represents a product complex after conjugation where SUMO, colored yellow, has been transferred to a lysine of RANGAP1 colored gray. SUMOD designates a SUMO protein in donor (D) configuration. E2UBC9 is in cartoon representation colored cyan. Certain residues of the E2 active site are in stick representation. The consensus sequence for substrate recognition by E2UBC9 is indicated on top. (C) Close-up of E2UBC9/RANGAP1 (PDB 1KPS) representing RANGAP binding prior to catalysis in the absence of SUMO. In cases where equivalents of Asp127 or Tyr87 are missing, structural equivalents appear to be contributed by the substrate itself. For instance, E2UBC12 lacks a residue equivalent to Tyr87 in E2UBC9. In a structure of a charged E2UBC12/CUL1 complex, a tyrosine residue in the CUL1 substrate acts as a structural mimetic of Tyr87.[171] While the equivalent of Asp127 in E2UBC9 exists in E2UBC12, mutation of Asp143 had little effect on NEDD8 conjugation. In this case, the backbone carbonyl of Asp143 appears closer to the incoming lysine and may functionally replace its carboxylic group.[171] Another example was illustrated for E2UBC1, an E2 that lacks a structural equivalent to Tyr87.[172] In this case, the authors suggested that ubiquitin Tyr59 contributes to the formation of a hydrophobic microenvironment that assists in activating the attacking lysine residue, in this case Lys48 of ubiquitin. E2UBE2S constitutes a case of substrate-assisted catalysis as Glu34, a glutamate residue in the ubiquitin substrate, is predicted to interact with the target lysine Lys11, thereby functionally mimicking Asp127 in E2UBC9.[173] The location of the active site is not strictly conserved for all E2s. In E2ATG3 and E2ATG10, despite being present in topologically equivalent locations, the position of the active site cysteine is shifted by ∼12 Å when compared to canonical E2s such as E2UBC9. Despite this difference, the catalytic cysteine of E2ATG10 is located adjacent to Tyr56 and Asn114, two residues that could function in manners similar to those reported for Tyr87 and Asp127 in E2UBC9. Consistent with this idea, mutation of these residues to alanine leads to a defect on kcat with little effect on Km.[174]

Target Selection

Most E2s do not exhibit specificity and consequently require an E3 to promote selective interactions with their substrates. In contrast, E2UBC9 can directly recognize substrates that contain a SUMO consensus motif comprised by Ψ-Lys-X-Asp/Glu where Ψ is a hydrophobic residue, Lys is the target lysine to which SUMO is attached, and X is any residue.[175] While SUMO can be conjugated to other sites, mass spectrometry analyses of SUMO targets show a clear enrichment of this motif or related motifs at sites of conjugation (see Hendriks and Vertegaal[38] for review). The structural basis for recognition of this motif was first illustrated by a structure of RANGAP1/E2UBC9 (Figure ).[153] In this complex, the hydrophobic residue contacts a somewhat featureless hydrophobic surface formed by residues Pro128-Ala129-Gln130, an observation consistent with accommodation of hydrophobic residues that differ in size. As discussed in section , the substrate lysine residue is positioned via aliphatic contacts to Tyr87 and hydrogen bonding interactions with Asp127 to place its primary amine next to the catalytic cysteine of E2UBC9. The X residue is located above Tyr87, enabling interactions between the Asp/Glu residue that includes aliphatic contacts to Tyr87 as well as hydrogen bonding contacts to Ser89 and Thr91. Two lysine residues (Lys74, Lys76) are also proximal to the Asp/Glu residues. Although other E2s may harbor intrinsic substrate specificities, the SUMO system appears to be the only one where in vitro and in vivo substrate specificities appear well reconciled by available structural and biochemical data. Other examples include the noncanonical E2s E2ATG10 and E2ATG3. Indeed, no E3 has been found for E2ATG10 for the modification of the ATG5 substrate on a specific lysine of its helical domain.[176,177] NMR and cross-linking analyses of the ATG5/E2ATG10 interaction is consistent with the idea that direct recognition of ATG5 by E2ATG10, notably through ATG5 C-terminal ubiquitin-like domain, is sufficient for mediating its conjugation to ATG12.[174] E2ATG3 was also reported to be sufficient for recognizing its PE substrate in vitro, although this process is stimulated by E3ATG12–ATG5.[178] Formation of Ubl chains can be considered as a special case of target selection as the target is the Ubl itself. In the ubiquitin system, multiple strategies are employed for chain formation. For example, the heterodimeric E2MMS2/E2UBC13 uses the inactive component E2MMS2 to bind ubiquitin and position Lys63 in the active site of E2UBC13[179] while the monomeric E2UBE2S exploits a noncovalent interaction with ubiquitin to promote synthesis of Lys11 chains.[173] In the case of SUMO, certain SUMO isoforms contain one or more SUMO consensus motifs within their N-terminal extensions that can be used to form chains.[46,180] It was also proposed that assembly of two or more E2UBC9 enzymes can favor formation of SUMO chains.[181] In this case, one E2UBC9 would scaffold its backside bound SUMO such that the SUMO consensus site could reach the active site of a second SUMO∼E2UBC9 complex. The association of at least two E2UBC9 molecules is needed as the SUMO consensus site of a backside bound SUMO cannot reach the active site of the same E2UBC9. The concept that E2/E2 interactions might promote chain formation was also proposed based on packing of E2s in certain crystal forms.[182−184] In yet another study, SUMO modification of E2UBC9 at Lys153 was shown to decrease E2 activity; however, SUMO-modified E2UBC9 increased chain formation by unmodified E2UBC9 suggesting that association of two or more E2s, directly or indirectly, can result in increased chain formation.[185] In the NEDD8 system, one study suggested that NEDD8 chains can be formed on the active site cysteine of the E2UBC12 and then transferred en bloc to the substrate.[186] In this case, NEDD8 chains appear to be facilitated by association of two or more E2s. Formation of mixed chains has been observed for ubiquitin and some Ubl’s with ubiquitination of SUMO chains by SUMO-targeted ubiquitin ligases being possibly the best understood process.[187,188] Evidence also exists for formation of mixed chains containing NEDD8 and ubiquitin under stress conditions, perhaps by misprocessing of NEDD8 by the ubiquitin conjugation machinery.[189,190] Finally, the ISGylation of Lys29 of ubiquitin was recently reported.[191] While multiple mechanisms have been proposed for formation of chains, the underlying structural bases for these activities remain unclear.

Dynamics

The dynamics of the E2∼Ubl thioester adduct are best understood in the ubiquitin system where comparisons between multiple E2∼ubiquitin structures or their mimics have revealed that ubiquitin can adopt a variety of orientations relative to the E2.[158,166,179,192] NMR studies have further suggested that different E2s can populate different conformational states.[193] One conformation with particular significance was termed the “closed” conformation. In this case, ubiquitin packs against the crossover helix of the E2, thereby positioning the ubiquitin C-terminus in a shallow groove that leads to the E2 active site cysteine where it is stabilized by multiple interactions. The first structure of an E2 thioester-linked to ubiquitin, determined by NMR, revealed ubiquitin in a state similar to the closed conformation.[158] A subsequent structure of E3RANBP2/E2UBC9/SUMO–RANGAP1 showed SUMO in a closed conformation (Figure A) in work that proposed that the closed conformation was required for activation of the thioester bond.[154] Closed conformations have since emerged as a hallmark for E2∼Ubl activation in the ubiquitin, SUMO, and NEDD8 pathways for aminolysis[160,161,171,194,195] because the closed conformation positions the thioester bond in the E2 active site in an orientation that enhances reactivity. As described in section , E3 ligases can enhance reactivity by stabilizing the closed conformation (see below).
Figure 13

E2∼Ubl complexes. (A) Structure of E2UBC9 in complex with SUMO in a closed conformation (PDB 1Z5S). (B) Structure of E2UBC9 in complex with SUMO that binds E2UBC9 on the E2 backside (PDB 2PE6). Proteins are in cartoon representation with catalytic cysteine residues in sphere representation. SUMOD and SUMOB represent SUMO proteins in donor (D) and backside (B) configurations, respectively.

E2∼Ubl complexes. (A) Structure of E2UBC9 in complex with SUMO in a closed conformation (PDB 1Z5S). (B) Structure of E2UBC9 in complex with SUMO that binds E2UBC9 on the E2 backside (PDB 2PE6). Proteins are in cartoon representation with catalytic cysteine residues in sphere representation. SUMOD and SUMOB represent SUMO proteins in donor (D) and backside (B) configurations, respectively.

Backside Binding by Ubl’s

Studies in the ubiquitin and SUMO pathways have revealed that proteins, including ubiquitin and SUMO, can interact with E2s on a surface termed the E2 “backside” that is opposite to the active site (Figure B). Backside binding between E2s and ubiquitin or SUMO contributes to chain building activities[181,196−198] or to regulation of E2 activities via allostery.[162] Although backside E2/Ubl complexes utilize similar surfaces in ubiquitin or SUMO pathways, the strength of these interactions can vary. Ubiquitin and certain ubiquitin E2s interact with affinities in the high micromolar range,[162,196] while interactions between SUMO and E2UBC9 occur in the nanomolar range.[164,181,199,200] Other proteins can also bind to E2s via the backside surface. For instance, a membrane-anchored ubiquitin-fold protein (MUB) that is targeted to membranes following prenylation of its C-terminus was shown to bind the backside of certain ubiquitin E2s; however, binding affinity in this case was in the nanomolar range due to the presence of a longer loop that extends the binding interface.[201] In addition to the canonical ubiquitin interaction on the backside of an E2, a second “noncanonical” binding site for ubiquitin has been found on E2RAD6.[202] There is limited data on the prevalence of this alternate ubiquitin interaction surface among other E2s. In addition to Ubl’s, several proteins can bind the backside of E2 to modulate E2 activities. The SUMO E3 ligase RANBP2 contacts E2UBC9 through its backside.[154] In the ubiquitin system, the protein CUE1 contacts the K48 chain-building E2UBC7 on its backside though an α-helical U7BR domain.[203] CUE1 also possesses a Cue domain that displays preference for binding the proximal penultimate ubiquitin moiety of a chain,[204] presumably to increase its chain extending activities. Collectively, these studies reveal that the E2 backside can act as a versatile platform to modulate E2 activities or localization.

E2 Modifications

Covalent modification of an E2 by its own Ubl has been suggested as a mechanism to regulate E2 activities, specificities, or levels of the respective E2. For example, E2UBE2Z is FATylated,[117] a modification that targets the E2 for proteasomal degradation.[205] In the case of metazoan E2UBC9, SUMO modification on the first α-helix has been shown to alter substrate specificity with little effect on enzyme activity.[206] E2ATG3 can be modified with ATG12 on a specific lysine, and this modification requires the E2ATG3 catalytic cysteine and was shown to occur in cis.[41] The resulting ATG12–E2ATG3 plays a role in mitochondrial homeostasis.[41]

E3 Ligases

E3 protein ligases are generally considered as factors that increase the rate of ubiquitin or Ubl conjugation to substrates. This is often accomplished by recruiting the E2∼Ubl thioester and substrate into a complex. While colocalization is critical, the E3 can also enhance the rate of isopeptide bond formation by templating the charged E2∼Ubl thioester into a “primed” conformation that positions or aligns the ubiquitin or Ubl thioester bond for nucleophilic attack. While the ubiquitin system includes more than 600 E3s and dozens of E2s that combine in unique configurations to dictate substrate specificity,[207] only a few E3s have been reported thus far for SUMO, NEDD8, and ISG15 Ubl’s. Furthermore, only one E3 has been identified to date for ATG8 and UFM1 and no E3s have been identified to date for FAT10, URM1, and ATG12. In the ubiquitin system, really interesting new gene (RING), HECT, and RBR proteins constitute the three main E3 families (recently reviewed in Buetow and Huang[8]). Those E3s that do not belong to one of these families are generally referred to as atypical. In this section, we will review RING E3s that facilitate Ubl conjugation, emphasizing common and contrasting properties with those of the ubiquitin E3 RING family. We will also describe mechanisms underlying atypical Ubl E3 ligase activity. While important, we will not describe HECT and RBR ligases or E3 ligases for ISG15 conjugation[151,152,208,209] as most insights gained for these E3s relied on characterizations with ubiquitin and not ISG15. We refer the reader to Buetow and Huang for their detailed review of ubiquitin E3s.[8]

RING Ligases

RING Domain and its Variants

The RING domain contains cysteine and histidine residues that coordinate two zinc ions with a cross-braced topology (reviewed in Deshaies and Joazeiro[210]). In a landmark study, Lorick et al.[211] noted that several RING domain-containing proteins that interact with E2 also displayed ubiquitin E3 ligase activity. U-boxes and SP-RING domains were later shown to be structurally and functionally related to the RING domain. Indeed, the U-box domain shares a similar topology as the RING domain except that the canonical RING cysteine and histidine residues that coordinate the two zinc ions are replaced by side chains that interact to stabilize the cross-braced architecture.[212] The SP-RING domain is hybrid between RING and U-box domains in that it possesses one zinc coordination site with the other replaced by side chains that interact to stabilize the cross-braced architecture.[213] Proteins with U-box domains are known to catalyze ubiquitin E3 ligase activity, whereas proteins with SP-RING domains catalyze SUMO E3 ligase activity. Collectively, the RING domain and its structural variants increase the rate of ubiquitin, SUMO, NEDD8, and ISG15 conjugation by binding the charged E2∼Ubl thioester adduct and activating it for chemistry.[211,214−216]

E2 Binding

The first atomic insights into E2 recognition by RING E3s came from the structure of E3CBL/E2UBCH7 in the absence of ubiquitin.[217] This structure showed how a surface on the RING domain composed of two E3 loops and an α-helix between the two respective zinc coordination sites contacts two loops and the N-terminal helix of the E2. While this structure showed how the RING binds E2, direct involvement of the RING domain in E2-mediated conjugation remained a puzzle because the E2 active site was ∼15 Å from the RING domain.[217] Similar interactions were later observed for several E3/E2 pairs (Figure ), including those for the NEDD8 and SUMO pathways.[171,200] In addition to canonical E3/E2 interactions, some RING-containing proteins use additional domains to increase affinity for a particular E2 or to alter E2 properties. For example, E3RNF125 includes a zinc finger that folds back on the RING domain to stabilize the RING and to extend interaction surfaces with the E2,[218] while other proteins include extensions that contact the backside of the E2, frequently through the formation of an α-helix.[203,219−222] In the case of E3RAD18/E2RAD6 interaction, the E3RAD18 C-terminal α-helix binds the backside of E2RAD6 to prevent ubiquitin from binding the same surface, thus decreasing the ability of this E3/E2 pair to form ubiquitin chains.[220] To illustrate yet another variation, the RING of E3FANCL uses a short N-terminal extension and additional hydrophilic contacts to achieve specific interaction with E2UBE2T.[223] Additional domains can also be used for allosteric regulation. For example, binding of an E3GP78 C-terminal α-helix on the backside of E2UBE2G2 was reported to stimulate E2 activity through an allosteric mechanism.[219] To provide an example of ligand induced regulation, binding of one unit of a poly(ADP-ribose) chain by the WWE domain of E3RNF146 triggers a conformational change in this RING-containing protein that increases E2 binding and stimulates E3 activity.[224] In the SUMO system, the SP-RING domain is frequently followed by a SIM with evidence pointing toward a role for this SIM in contacting SUMO conjugated substrates or a second molecule of SUMO that is bound to the backside of the E2UBC9∼SUMO thioester adduct.[200,225] In the NEDD8 system, DCN1 is often viewed as a co-E3 for RBX1 because it further stimulates E3RBX1-mediated Cullin neddylation.[226] In this case, DCN1 recognizes the Cullin substrates and the acetylated N-terminal extension of E2UBC12 to facilitate E2UBC12∼NEDD8 recruitment.[171,227,228]
Figure 14

Representative RING E3/E2 interaction. (A) Overall view and (B) close-up view of human E3TRIM25/E2UBCH5A (PDB 5FER). Both proteins are in cartoon representation. A white-to-green gradient running from the N- to C-terminus has been applied to E3TRIM25. Two zinc ions are depicted as gray spheres. Residues contributing to the E3TRIM25/E2UBCH5A interaction are in stick representation.

Representative RING E3/E2 interaction. (A) Overall view and (B) close-up view of human E3TRIM25/E2UBCH5A (PDB 5FER). Both proteins are in cartoon representation. A white-to-green gradient running from the N- to C-terminus has been applied to E3TRIM25. Two zinc ions are depicted as gray spheres. Residues contributing to the E3TRIM25/E2UBCH5A interaction are in stick representation.

E2∼Ubl Binding and Activation

While structures of multiple E2/E3 pairs provided clues as to how RING E3s might increase the rate of E2-mediated catalysis, these complexes represented product complexes because they lacked ubiquitin or the E2∼ubiquitin thioester (or mimics). Indeed, structural studies of RING E3/E2∼ubiquitin complexes revealed that catalytic activation was achieved because the RING domain stabilizes a closed E2∼ubiquitin conformation that aligns the thioester bond for nucleophilic attack and ubiquitin discharge.[160−163,165,194,195,229,230] This mechanism also appears prevalent in the NEDD8 and SUMO pathways where RING E3s stabilize the closed conformation of E2UBC9∼SUMO and E2UBC12∼NEDD8 (Figure ).[171,200] Formation of a closed conformation remains dependent on canonical E2/E3 interactions but now also depends on contacts between the RING or ancillary motifs and the ubiquitin or the Ubl moiety. In the case of dimeric RINGs, one RING binds the E2 while the other RING provides vital contacts to ubiquitin.[161,163,165,195,229,230] For monomeric RINGs, additional structural elements contribute interactions that are functionally analogous to those provided by dimeric RINGs. In the case of E3CBL-B, ubiquitin is contacted by an N-terminal helix that contains a phosphotyrosine residue,[160] interactions that increase E2 catalytic efficiency by more than 2 orders of magnitude.[160] The monomeric RING of E3ARK2C binds a second molecule of ubiquitin on its backside to form a complex that stabilizes E2∼ubiquitin in a closed conformation through ubiquitin–ubiquitin interactions.[231]
Figure 15

E3 stabilization of a closed E2∼Ubl conformation. (A) Structure of human NEDD8∼E2UBC12/E3RBX1 (PDB 4P5O). The position of the catalytic cysteine (a serine residue in the structure) is indicated by a yellow sphere. (B) Structure of yeast SUMO∼E2UBC9/E3SIZ1 (PDB 5JNE). (C) Structure of human SUMO∼E2UBC9/E3RANBP2 (PDB 1Z5S). (D) Structure of human SUMO∼E2UBC9/E3ZNF451 (PDB 5D2M). Zinc atoms are in gray sphere representations. SUMOD and SUMOB represent SUMO proteins in donor (D) and backside (B) configurations, respectively. NEDD8D designates a NEDD8 protein in donor (D) configuration.

E3 stabilization of a closed E2∼Ubl conformation. (A) Structure of human NEDD8∼E2UBC12/E3RBX1 (PDB 4P5O). The position of the catalytic cysteine (a serine residue in the structure) is indicated by a yellow sphere. (B) Structure of yeast SUMO∼E2UBC9/E3SIZ1 (PDB 5JNE). (C) Structure of human SUMO∼E2UBC9/E3RANBP2 (PDB 1Z5S). (D) Structure of human SUMO∼E2UBC9/E3ZNF451 (PDB 5D2M). Zinc atoms are in gray sphere representations. SUMOD and SUMOB represent SUMO proteins in donor (D) and backside (B) configurations, respectively. NEDD8D designates a NEDD8 protein in donor (D) configuration. Early studies in the SUMO pathway suggested the importance of interactions between an acidic region of E3SIZ1 and a basic patch on SUMO that was derived by docking E2UBC9∼SUMO in a closed conformation to the E3SIZ1 SP-RING domain,[213] but the true nature of contacts was not revealed until a structure of E3SIZ1/E2UBC9∼SUMO/PCNA was determined.[200] This structure showed that SUMO does interact with E3SIZ1; however, contacts were more extensive than anticipated as the E3SIZ1 contains an additional domain termed the SP-CTD that includes an embedded SIM-like motif that stabilizes donor SUMO in the closed conformation. Stabilization of the closed E2∼Ubl conformation is also observed in the NEDD8 pathway. In this system, the RING-containing E3RBX1 interacts with its obligate partner CUL1 via an N-terminal β-strand extension of E3RBX1 that intercalates in a β-sheet of CUL1.[232] A short linker between the N-terminal extension of E3RBX1 and its RING domain enables a hinge motion of the RING domain relative to CUL1. While E2UBC12 binds the RING domain of E3RBX1, the NEDD8 moiety of E2UBC12∼NEDD8 binds E3RBX1 through the linker region. This freezes the conformation of the linker and orients the E3RBX1 active site relative to the lysine substrate, in this case a lysine in CUL1 itself. E3RBX1 binding to NEDD8 thus fulfills multiple roles, it maintains NEDD8 in the closed conformation and it orients the entire complex for substrate recognition.[171] E3RBX1 also binds E2UBCH5 and E2CDC34 to promote substrate ubiquitination. An NMR study showed minimal interaction between isolated E3RBX1 and E2CDC34 while interactions with E2CDC34∼ubiquitin revealed a dissociation constant in the midmicromolar range[233] underscoring the importance of contacts to ubiquitin in a system that is likely optimized for rapid release of the E2 product upon ubiquitin discharge.[233] An unusual case of RING-mediated ubiquitin binding is seen in the anaphase promoting complex/cyclosome (APC/C) complex, a multisubunit E3. In this case, the RING-containing protein APC2 interacts with E2UBE2C through canonical E2/E3 interactions to prime the substrate by ubiquitination; however, efficient substrate polyubiquitination is dependent on replacement of E2UBE2C by E2UBE2S. In this latter step, E2UBE2S does not contact the canonical RING surface of APC2, but instead uses its C-terminal tail to contact other APC/C surfaces. This allows APC2 to interact with ubiquitin in the context of a ubiquitinated substrate to catalyze E2UBE2S-mediated chain elongation.[234] One noted hallmark of ubiquitin RING activation is a “linchpin” arginine within the RING that contacts both E2 and ubiquitin to lock ubiquitin in the closed conformation (reviewed in Buetow and Huang[8]).[194] While this residue is clearly important in a variety of E3s, it may not be a universal requirement as at least one case reported little effect on activity upon substitution to alanine.[229] Interestingly, this residue is notably absent in E3SIZ1[200] and, in E3RBX1, alanine substitution of the topologically equivalent residue (Asn98) does not affect conjugation while an N98R substitution impairs neddylation.[171] Instead, another arginine, Arg46, functions in an analogous manner by interacting with E2UBC12 and NEDD8, although it is present on a distinct surface of E3RBX1. While there are many ways in which RINGs or related domains interact with their E2∼Ubl thioester substrates, a common mechanism emerges, namely that RING and RING-like domains bind the E2∼Ubl thioester, activating it for conjugation by stabilizing the closed conformation.

Substrate Interaction

Structural studies on E3/E2∼Ubl/substrate complexes are scarce, but the ones that exist provide vital clues as to how E3/E2∼Ubl complexes target their substrates (Figure ). One example pertains to E2UBC12-mediated NEDD8 modification of residue Lys720 of CUL1, a subunit of the E3 ligase complex. The structure of a co-E3DCN1/E3RBX1/E2UBC12∼NEDD8/CUL1 complex[171] revealed few contacts between the NEDD8-bound E2UBC12 and its CUL1 substrate, suggesting substrate recognition relied on multiple binding interactions that collectively lead to correct positioning of the activated E2 relative to the target lysine. While the co-E3 DCN1 binds to and bridges the acetylated N-terminal extension of E2UBC12 and CUL1, the UBC domain of E2UBC12 is recognized by E3RBX1 through a classical RING E3/E2 interaction. NEDD8 also plays an important role in the active complex as the closed conformation of NEDD8 bends the E3RBX1 linker between the N-terminal extension of E3RBX1 and its RING domain to bring the E2UBC12 active site into proximity of CUL1 Lys720. Structural studies of ubiquitin modification of histone H2A at Lys119 provided another example where the E2 and substrate barely interact.[235] In this case, the structure of an E3RING1B–BMI1/E2UBCH5C/nucleosome complex revealed few contacts between the E2UBCH5C and the region surrounding Lys119. Instead, the E3 binds to a region centered on the acidic patch of the nucleosome and this interaction, combined with E2UBCH5C/DNA interactions, results in positioning of Lys119 close to the E2UBCH5C active site. Any role for ubiquitin in this complex remains unclear as it was absent from this structure. The last example draws from E3SIZ1 ligase catalyzed SUMO modification of yeast PCNA, a substrate that includes two SUMO modification sites: a primary site at Lys164 and a secondary site at Lys127.[236] While Lys127 modification can be enhanced by E3SIZ1, it lies within a SUMO consensus motif and can be modified by the SUMO E2UBC9 in the absence of E3SIZ1. In contrast, Lys164 modification appears strictly dependent on the E3SIZ1.[237] Subsequent structural and biochemical studies revealed that the SP-RING domain was important for modification of Lys127 and Lys164, but Lys164 modification relied on interactions between PCNA and the N-terminal PINIT domain of E3SIZ1.[213] The structure of an SUMO–E3SIZ1/SUMO∼E2UBC9–PCNA complex provided a rationale for Lys164 modification in that E3SIZ1/PCNA interactions position the substrate within the activated E3/E2∼Ubl complex to force-feed Lys164 into the E2UBC9 active site.[200]
Figure 16

E3/E2∼Ubl/substrate complexes. (A) Structure of human co-E3DCN1/E3RBX1/E2UBC12∼NEDD8/CUL1 (PDB 4P5O). The target residue 720 (an arginine in the structure) is in stick representation. NEDD8D designates a NEDD8 protein in donor (D) configuration. (B) Structure of yeast SUMO–E3SIZ1/SUMO∼ E2UBC9–PCNA (PDB 5JNE). The target residue 164 (a cysteine in the structure) and its linkage to the E2 catalytic cysteine via ethanedithiol are presented in stick representation. SUMOD and SUMOB represent SUMO proteins in donor (D) and backside (B) configurations, respectively. (C) Structure of human SUMO–RANGAP1/E2UBC9/E3RANBP2 (PDB 1Z5S). The isopeptide linkage between the target residue 524 and the C-terminal glycine of SUMO is in stick representation.

E3/E2∼Ubl/substrate complexes. (A) Structure of human co-E3DCN1/E3RBX1/E2UBC12∼NEDD8/CUL1 (PDB 4P5O). The target residue 720 (an arginine in the structure) is in stick representation. NEDD8D designates a NEDD8 protein in donor (D) configuration. (B) Structure of yeast SUMO–E3SIZ1/SUMO∼ E2UBC9–PCNA (PDB 5JNE). The target residue 164 (a cysteine in the structure) and its linkage to the E2 catalytic cysteine via ethanedithiol are presented in stick representation. SUMOD and SUMOB represent SUMO proteins in donor (D) and backside (B) configurations, respectively. (C) Structure of human SUMO–RANGAP1/E2UBC9/E3RANBP2 (PDB 1Z5S). The isopeptide linkage between the target residue 524 and the C-terminal glycine of SUMO is in stick representation.

Atypical E3 Ligases

SIM-Based SUMO E3 Ligases

E3RANBP2 is an atypical SUMO E3 ligase[238] whose catalytic domain(s) reside in a 30 kDa fragment called the IR1-M-IR2 repeat.[239] The structure of an E3RANBP2/E2UBC9/SUMO–RANGAP1 complex that corresponds to a product complex after conjugation revealed that the E3RANBP2 IR1-M domain uses a combination of loops and helices to contact the E2UBC9 backside while a SIM binds the donor SUMO to position it in a closed conformation analogous to that described above for RING-mediated E2∼Ubl activation (Figure C).[154] This early study suggested that E3 ligase activity was due to coordination of the Ubl∼E2 thioester through stabilization of the Ubl∼E2 closed conformation. E3ZNF451 is another recently identified SUMO E3 whose catalytic module includes two SIMs that are separated by an intervening Pro-Leu-Arg-Pro sequence.[164,240] The structure of an E3ZNF451/E2UBC9/RANGAP1–SUMO complex revealed that E3ZNF451 uses its N-terminal SIM to maintain the donor SUMO in a closed conformation[164] while its C-terminal SIM engages a second SUMO molecule that is bound on the backside of E2UBC9 (Figure D). This places the Pro-Leu-Arg-Pro sequence under the E2 to enable direct contacts between the arginine residue and E2UBC9. E3ZNF451 is a target of extensive SUMO modification,[38] and SUMO modification of an E3ZNF451 fragment at a site close to the catalytic module can increase SUMO E3 activity, presumably because SUMO modification provides a second SUMO in cis for backside interactions with E2∼SUMO. This mechanism was also proposed in the context of E3RNF38/E2UBCH5B interactions where ubiquitination of the E3 or a substrate results in an increased catalytic activity.[162] In this case, however, the authors proposed that ubiquitin binding in cis on the backside of the E2 can also trigger allosteric activation of the E2.[162] In summary, E3RANBP2 and E3ZNF451 use SIMs to maintain a donor SUMO in the closed conformation while employing unique strategies to recognize the E2. The idea that SIMs are necessary, although not sufficient, for SUMO E3 ligase activity appears to be a unifying theme among SUMO E3 ligases, including proteins with a SP-RING domain. While SIMs can play a role in SUMO E3 ligases, their ability to interact with SUMO can result in spurious in vitro artifacts and misidentification of SIM-containing proteins as bona fide SUMO E3 ligases as discussed in Parker and Ulrich.[241]

UFL1

E3UFL1 is the only protein identified thus far for which UFM1 E3 activity has been reported.[242,243] E3UFL1 increases the rate of UFM1 conjugation to UfBP1[242] and ASC1[48] although it shares no sequence similarity with E3s from other systems and its mode of action remains unclear. It has been suggested that E3UFL1 could bridge E2UFC1 and its substrate UfBP1 to promote UFM1 conjugation.[242] Interestingly, modification of ASC1 is dependent on prior UFMylation of UfBP1 and likely involves formation of an ASC1/UFM1–UfBP1/E3UFL1 complex.[48] This suggests that UFM1–UfBP1 might act as a co-E3, and phenotypic similarities between UfBP1, UFL1, and UBA5 knockouts in mice is consistent with this idea.[244,245]

ATG12–ATG5

The ATG12–ATG5 conjugate functions in at least two ways in vivo. First, it acts as an E3 for E2ATG3-mediated conjugation of ATG8 to PE.[178,246] Second, through its association with ATG16 and ATG8–PE, it forms a two-dimensional mesh that organizes associated membranes.[88] ATG12–ATG5 is formed by conjugation of the Ubl ATG12 to a specific lysine residue of ATG5.[176] Conjugation appears irreversible, as proteases capable of cleaving ATG12–ATG5 have not been identified to date. Structural analyses of the ATG12–ATG5 complex reveals an extended interface between ATG12 and ATG5 that extends beyond the isopeptide linkage, presumably to maintain ATG12 in a fixed orientation relative to ATG5 (Figure ).[90,247,248] This interface appears important for the E3 activity of the ATG12–ATG5 conjugate as E3 activity is compromised by mutations that disrupt the ATG12/ATG5 interface or a conserved composite surface that includes elements from both ATG5 and ATG12.[247] Also, strategies that bring ATG12 and ATG5 together by means other than native isopeptide linkage failed to reconstitute E3 activity.[247]
Figure 17

E3/E2 complex in the ATG8 system. Structure of E3ATG12–ATG5/E2ATG3 (PDB 4NAW). Proteins are in cartoon representation with the isopeptide linkage between ATG12 and ATG5 in stick representation.

E3/E2 complex in the ATG8 system. Structure of E3ATG12–ATG5/E2ATG3 (PDB 4NAW). Proteins are in cartoon representation with the isopeptide linkage between ATG12 and ATG5 in stick representation. ATG8 conjugation to PE can be recapitulated in vitro using E1ATG7, E2ATG3, mature ATG8, ATP, and liposomes containing a high proportion of PE, suggesting that E2ATG3∼ATG8 can recognize its PE substrate in an E3-independent manner.[249] While this reaction is inefficient in the context of liposomes with low PE content,[249] the ATG12–ATG5 conjugate, but not ATG12 and/or ATG5 alone, increases E2ATG3-mediated ATG8 conjugation.[178] Since ATG12 binds E2ATG3 with nanomolar affinity without substantive contributions by ATG5,[90,247] it appears that the ATG12–ATG5 conjugate might be required to activate the ATG8∼E2ATG3 thioester. Consistent with this model, Sakoh-Nakatogawa et al.[250] observed differences in available E2ATG3 structures around its catalytic cysteine that correlated with active and inactive conformations. Precisely, they showed that E3ATG12–ATG5, or mutation of a phenylalanine residue that physically supports the catalytic loop in some structures, induces a conformational change in E2ATG3 that results in its activation. How the E3ATG12–ATG5 conjugate binds E2ATG3 to modulate its conformation remains unknown. It also remains unclear if the E3ATG12–ATG5 conjugate maintains a stable scaffold[90,247] or if it undergoes conformational changes[248] upon ATG8∼E2ATG3 binding. There is no evidence to date to suggest that E3ATG12–ATG5 promotes formation of a closed E2∼Ubl conformation via contacts to ATG8. With that said, ATG12 possesses a noncanonical LIR motif formed by two residues that are distant in sequence but close in space[88] that when mutated reduce ATG8 conjugation.[88] Thus, it is tempting to speculate that the ATG12 LIR may contact ATG8 moiety within ATG8∼E2ATG3 to activate the ATG8∼E2ATG3 thioester. Finally, evidence suggests that E3ATG12–ATG5 and E2ATG3 activities can be spatially regulated through a series of protein–protein and protein–lipid interactions. The ATG5 moiety of a ATG12–ATG5 conjugate associates with the N-terminus of ATG16,[177] a protein that dimerizes through a coiled coil region,[251] interactions that contribute to membrane targeting of E3ATG12–ATG5.[177,252] Proteins from the PIWI family also contribute to targeting ATG8 and ATG12–ATG5/ATG16 to PI3P-containing membranes where ATG8 conjugation occurs.[253,254] Sakoh-Nakatogawa et al.[255] further showed that E2ATG3 is recruited to preautophagosomal structures in a LIR-dependent manner, consistent with the idea that ATG3/ATG8–PE interactions serve as a positive feedback loop to increase ATG8–PE production and membrane expansion. Membrane curvature may also act as a positive feedback loop as it is induced by ATG8–PE and sensed by an amphipathic helix in the N-terminus of E2ATG3.[256,257] Taken together, these studies depict E3ATG12–ATG5 as an atypical E3 under multiple layers of regulation.

Trapping Intermediates in Conjugation Cascades

The activation and conjugation cascades for ubiquitin or Ubl’s often employ interactions that are sometimes weak, often transient, and nearly always chemically labile. As such, structural studies often rely on stabilization of intermediates through chemical or artificial means. In this section we will review some of the strategies used to trap and structurally characterize various intermediates during Ubl activation and conjugation. As many of these methods are general and can be applied to ubiquitin and Ubl proteins, we will describe examples from various systems.

E1 Activation Intermediates

Most structures of canonical E1s in complex with ubiquitin or Ubl proteins revealed similar E1 conformations, with the C-terminal Ubl tail locked in place under the α-phosphate of the bound ATP–magnesium complex, and the catalytic cysteine far from the adenylation active site.[92,93,96,97,101,104,106,108] To trap complexes after adenylation or during thioester bond formation, Lu et al.[105] synthesized tripeptides containing 5′-(sulfonylaminodeoxy)adenosine (AMSN) or a 5′-(vinylsulfonylaminodeoxy)adenosine (AVSN) that could be ligated to the C-terminus of SUMO or ubiquitin using native chemical ligation. While both result in nonhydrolyzable adenylate Ubl intermediates, the AVSN moiety possesses an eletrophilic center that reacts with the cognate E1 catalytic cysteine to generate a stable thioether bond (Figure ). This strategy was used to determine the structure of an E1∼AVSN–SUMO complex with the catalytic cysteine covalently attached to the AVSN within the adenylation pocket.[98] An analogous chemical strategy was recently described to trap E1 activation intermediates where a nonhydrolyzable AMP analogue was ligated to the C-terminus of a Ubl through native chemical ligation resulting in the formation of a cysteine residue that can be converted to dehydroalanine using 2,5-dibromohexanediamide.[258] In this case, the dehydroalanine residue contains the electrophilic center that reacts with the E1 catalytic cysteine. This strategy was employed to trap E1UBA1∼5′-(dehydroalanine-aminodeoxy)adenosine–ubiquitin and E1ATG7∼5′-(dehydroalanine-aminodeoxy)adenosine–LC3 complexes.[258]
Figure 18

Chemical structures representing methods for trapping the E1 thioesterification step. Non-native linkages are colored red.

Chemical structures representing methods for trapping the E1 thioesterification step. Non-native linkages are colored red.

Transthiolation Intermediates for E1/E2

Huang et al.[97] obtained a structure of a NEDD8∼E1/E2 complex by mutating the E2 catalytic cysteine to alanine to prevent the transfer of NEDD8 from the E1 to the E2. This structure provided some of the first important details on how the E2 is recognized by E1, but it was unable to provide a mechanism for transthiolation as the two active sites remained separated by an ∼20 Å. To trap conformations that would allow the E1 and E2 active sites to come into close proximity, two sulfhydryl-to-sulfhydryl cross-linking strategies were used (Figure ). In the first case, Kaiser et al.[132] used bismaleimidoethane, a bifunctional cross-linker, to bring together the E1 and E2 active sites. Although this strategy successfully trapped both E1ATG7–E2ATG3 and E1ATG7–E2ATG10 complexes, the E2 loops containing the catalytic cysteine were partially disordered suggesting conformational flexibility,[132] perhaps because the maleimide-based cross-linker is longer and more complex when compared to the predicted tetrahedral intermediate. An alternative strategy utilized the S. pombe ubiquitin E1 and E2UBC4 and activation of the E2 cysteine by 2,2′-dipyridyldisulfide to catalyze formation of a disulfide bond between the E1 and E2 active site cysteine residues.[96] While a disulfide also fails to mimic a bona fide tetrahedral intermediate, the disulfide bond reduces the distance between E1 and E2 when compared to the maleimide-based strategy, and the structure of the resulting complex revealed new surfaces between the E1 and E2 that are important for transthiolation activity. Furthermore, the E1–E2 disulfide complex was shown to occur in vivo when cells are placed under oxidizing conditions.[124,125] Additional strategies will be required to capture better mimics of Ubl∼E1/E2 transthiolation intermediates.
Figure 19

Chemical structures representing methods for trapping E1/E2 complexes. Non-native linkages are colored red.

Chemical structures representing methods for trapping E1/E2 complexes. Non-native linkages are colored red.

E2∼Ubl Mimics

E2∼Ubl is an intermediate in the conjugation cascade and an essential component of intact E3 ligase complexes; however, the thioester bond between the E2 active site cysteine and Ubl C-terminus is labile and short-lived, especially in the presence of E3s. A pioneering NMR study first observed the E2UBC1∼ubiquitin thioester in situ by having unlabeled E1, ATP, and magnesium chloride present in the NMR tube to reiteratively generate the E2∼ubiquitin species.[158] This method was subsequently employed to study E2UBC13∼ubiquitin, E2MMS2/E2UBC13∼ubiquitin, and E2UBCH5C∼ubiquitin complexes by NMR,[196,259] where it was estimated that thioester bond formation proceeded to ∼90% completion.[158,196] This method has clear advantages in that it results in native thioester bonds, but its use is limited to select NMR studies. Therefore, other strategies were required to overcome the labile nature of the E2∼Ubl thioester bond to generate stable E2∼Ub mimics (Figure ). Mimics using the oxyester, disulfide, or isopeptide strategies were detailed in a recent methods article.[260]
Figure 20

E2∼Ubl mimics. Non-native linkages or amino acid residues are colored red.

E2∼Ubl mimics. Non-native linkages or amino acid residues are colored red.

Oxyester

Early studies using E2RAD6 and E2UBC1 showed that mutation of the E2 catalytic cysteine to serine allowed for replacement of the thioester bond by a more stable oxyester bond.[261,262] This strategy enabled the first NMR study of an E2∼ubiquitin complex,[263] and it has been used since to structurally investigate many different E2∼ubiquitin complexes, sometimes in the presence of E3s.[166,173,179,184,193−195,264−269] This strategy was also used to study E2UBC12∼NEDD8[171] and closed conformations of E2∼ubiquitin and E2∼NEDD8.[171,173,193−195,266,269] The main advantage of the oxyester mimic is that this single atom substitution results in a bond that is structurally similar to the native thioester. However, the single atom substitution can also change chemical reactivity with some surprising results. Scott et al.[171] reported that an E2∼NEDD8 oxyester conjugate has a higher propensity to undergo hydrolysis versus aminolysis when compared to a native thioester-linked conjugate. Although the oxyester mimic is more stable, it still undergoes hydrolysis or aminolysis[179,264] and half-lives of 5–20 h have been reported for complexes between ubiquitin and E2UBE2G1 and E2UBE2R1, respectively.[269] Furthermore, E3s can decrease the half-life of the oxyester conjugate. For example, an E2UBCH5C∼ubiquitin complex has a half-life of 58 h in the absence of an E3 and a half-life of 10 h in the presence of E3E4BU.[194] Therefore, additional mutations are often required to trap complexes that contain E3s and E2∼Ubl oxyester-based conjugates. In an E3BIRC7/E2UBCH5B∼ubiquitin complex, the oxyester bond is cleaved after 1–3 days in the absence of a stabilizing E2UBCH5B N77A mutation.[195] In the case of E3NEDD4L/E2UBCH5B∼ubiquitin complexes, the HECT E3 catalytic cysteine was mutated to serine or alanine to prevent decomposition of the complex.[264] For the E3RBX1/E2UBC12∼NEDD8/CUL1/DCN1 complex where E2UBC12∼NEDD8 is linked by an oxyester bond, Asn103 was mutated to a serine and the target lysine was mutated to arginine.[171] Efforts have also been made to identify experimental conditions that stabilize the oxyester bond. The use of a citrate buffer at pH 5.75 was reported to increase the lifetime of an E2UBCH5C∼ubiquitin oxyester in the presence of E3E4BU from a few hours to several days.[266]

Disulfide

A disulfide bond can be formed between the E2 catalytic cysteine and a ubiquitin variant where the last glycine residue is replaced by a cysteine residue.[192,269−272] This often requires nonreducing conditions and mutation of other E2 cysteine residues to prevent formation of disulfides at other sites. This strategy was used to isolate E2UBC1∼ubiquitin and E2UBCH8∼ubiquitin conjugates that were reported to be stable for weeks[270] or months,[192] respectively. Adding to the utility of this approach, NMR studies observed few differences between the thioester and disulfide-linked E2UBC1∼ubiquitin complexes, suggesting that the disulfide-linked E2UBC1∼ubiquitin was a good mimic for its thioester linked counterpart.[270] In contrast, another NMR report suggested that a disulfide-linked E2∼Ubl was not as good a mimic as an oxyester linked E2∼Ubl as there was less evidence for interactions between E2UBE2R1 and ubiquitin in the case of a disulfide conjugate.[269] This discrepancy could be due to interference between the carboxylate group introduced by the disulfide strategy, or because E2UBE2R1 contains an extended acidic loop close to the active site that is absent in E2UBC1. Taken together, these studies suggest that the disulfide strategy may not work for all E2s. A variation of the disulfide strategy employs dichloroacetone, a bifunctional cross-linker, to bridge the E2 catalytic cysteine and the last residue of G75C or G76C ubiquitin variants.[273]

Isopeptide

Plechanovová et al.[161] reported the successful isolation an E2∼ubiquitin mimic by replacing the catalytic cysteine with a lysine residue that can be conjugated to ubiquitin at high pH directly from E1∼ubiquitin thioester resulting in a stable peptide bond between the E2 and ubiquitin. Isolation of the E2–Lys–ubiquitin adduct enabled structure determination of an E3RNF4/E2UBCH5A∼ubiquitin complex, and isolation of a complex that mimics the activated state prior to conjugation with E2–Lys–ubiquitin in the closed conformation. Since then, multiple studies have used the same strategy to characterize various E2∼ubiquitin complexes.[160,162,163,165,202,229−231,274−276] A variation of this strategy was recently reported where a residue proximal to the E2 catalytic cysteine was mutated to lysine.[200] In this case, the E2 is charged by E1 to generate an E2∼SUMO thioester with subsequent attack by the engineered lysine to generate a stable peptide bond that leaves the E2 catalytic cysteine available for additional modifications (see below).

Thioether

Mulder et al.[277] formed a thioether bond between an E2 and a ubiquitin variant where the last glycine residue is replaced by an electrophilic dehydroalanine residue. This variant requires activation by E1, a reaction that also generates thioether-linked E1–ubiquitin as a byproduct. This technique was used to isolate E2∼ubiquitin and E2∼NEDD8 complexes resulting in the structure determination of an E2UBCH5C∼ubiquitin complex. Comparison of this structure to a previously determined complex containing an oxyester revealed quasi-identical E2 active site organization with the exception of increased disorder of Arg90, presumably to accommodate the carboxylate group introduced by this strategy.

Substrate and Product Complexes

Substrates or conjugated products that fail to dissociate from the E2 can sometimes be used to trap complexes that closely resemble conformations for the substrates prior to peptide bond formation. As an example, the C-terminal domain of mammalian RANGAP1 interacts with E2UBC9 prior to and after conjugation to SUMO via an extended interface that includes a canonical SUMO modification motif.[153] Structural and biochemical analyses of several RANGAP1/E2UBC9 complexes suggested that the substrate lysine was positioned close to where it would be in an E2UBC9∼SUMO thioester complex and that residues surrounding the E2 active site were positioned to facilitate the reaction.[154,159,164] Subsequent isolation of product inhibited complexes enabled purification and structural characterization of two E3 ligase domains from RANBP2 and ZNF451 by combining the E3 domains with E2UBC9, and SUMO conjugated RANGAP1 to isolate E3RANBP2/E2UBC9∼SUMO complex mimics[154,159] and an E3ZNF451/E2UBC9∼SUMO complex mimic.[164] In each of these complexes, the RANGAP1 lysine was conjugated to SUMO via an isopeptide bond, SUMO was observed in the closed and activated conformation, and the C-terminal SUMO glycine reside was positioned just above the active site cysteine with its C-terminal carbonyl oxygen pointing toward the conserved E2 asparagine residue suggesting that the product complexes resemble the substrate complexes immediately after conjugation.

Trapping E2/E3 Complexes

Unlike the examples described above for RANGAP1, most E2/E3 complexes are unstable and are not easily isolated for structural or biochemical studies. Stabilization of transient E2/E3 interactions can be achieved by linear fusion of these proteins. For example, a weak interaction between E2UBCH5C and E3RING1B was overcome by using an E3–E2 linear fusion and this fusion was instrumental in forming a complex between a nucleosome, E2UBCH5C, E3RING1B, and E3BMI1.[235] Although successful, this approach prevented loading of the E2 with ubiquitin because the fused E3 masks E2 regions important for E1 interaction.[235] In a second example, fusing a SUMO molecule to the C-terminus of E3SIZ1 enhanced interactions between the SUMO E3SIZ1 and E2UBC9 by positioning the fused SUMO for interaction with the backside of E2UBC9.[200] In a last example, genetically fusing E2UBE2T at the C-terminus of the E3FANCL RING domain was reported to be critical for obtaining a high-resolution structure of the complex.[223]

Trapping Ubl∼E2/Substrate Complexes

Successful strategies for trapping Ubl∼E2/substrate structures include use of a three-way cross-linking strategy that was developed to trap a complex consisting of the HECT E3RSP5, ubiquitin, and a substrate (Figure ).[278] A similar approach was then used to tether E2UBCH10, ubiquitin, and a substrate within an APC complex.[279] In this case, the substrate was synthesized with an azidohomoalanine in lieu of a target lysine. This allowed the addition of a bifunctional maleimide-based cross-linker via click chemistry. This cross-linker was in turn reacted with E2UBCH10 and a ubiquitin variant possessing a cysteine residue at position 75. A second method for cross-linking a Ubl∼E2/substrate complex involved use of an E2UBC9 variant where a residue proximal to E2 catalytic cysteine was mutated to lysine so that it could be conjugated to SUMO while leaving the E2 active site cysteine intact.[200] By combining the conjugated E2 variant with a substrate that substituted the target lysine to cysteine, the authors were able to cross-link the E2 and substrate cysteine residues using ethanedithiol, a molecule that closely resembles the lysine side chain with respect to the number of bridging atoms when compared to the predicted tetrahedral intermediate.[200]
Figure 21

Chemical structures representing methods for trapping Ubl∼E2/substrate complexes. Non-native linkages or residues are colored red.

Chemical structures representing methods for trapping Ubl∼E2/substrate complexes. Non-native linkages or residues are colored red.

Conclusion and Future Challenges

Structural and mechanistic studies depict Ubl conjugation as a highly dynamic process under multiple layers of regulation. Indeed, many of the proteins involved in conjugation pathways can undergo conformational changes that are often integral to their function. Multiple studies laid the groundwork for understanding basic mechanisms underlying conjugation enzymes; however, recent work has highlighted the importance of continued investigation as we continue to uncover important contributions of noncanonical adaptations in enzymes and factors involved in Ub/Ubl conjugation cascades. Strategies that take advantage of cross-linking and/or fusions to isolate and structurally characterize mimics for intermediates during conjugation cascade have been instrumental to our understanding of the Ubl conjugation machinery. However, these strategies can sometimes lead to deformations, as they are not always isosteric to their native counterparts and often require extensive structure–function analysis to verify the structural contacts in these complexes. Future challenges should be focused on isolation of chemical adducts that more closely resemble the true intermediates or development of methods that extend the lifetime of these intermediates so that better mimics for the transient states can be isolated for investigation.
  279 in total

Review 1.  The LIR motif - crucial for selective autophagy.

Authors:  Åsa Birna Birgisdottir; Trond Lamark; Terje Johansen
Journal:  J Cell Sci       Date:  2013-08-01       Impact factor: 5.285

2.  Structural and functional roles of Daxx SIM phosphorylation in SUMO paralog-selective binding and apoptosis modulation.

Authors:  Che-Chang Chang; Mandar T Naik; Yen-Sung Huang; Jen-Chong Jeng; Pei-Hsin Liao; Hong-Yi Kuo; Chun-Chen Ho; Yung-Lin Hsieh; Chiou-Hong Lin; Nai-Jia Huang; Nandita M Naik; Camy C-H Kung; Shu-Yu Lin; Ruey-Hwa Chen; Kun-Sang Chang; Tai-Huang Huang; Hsiu-Ming Shih
Journal:  Mol Cell       Date:  2011-04-08       Impact factor: 17.970

3.  Structural basis of FYCO1 and MAP1LC3A interaction reveals a novel binding mode for Atg8-family proteins.

Authors:  Xiaofang Cheng; Yingli Wang; Yukang Gong; Faxiang Li; Yujiao Guo; Shichen Hu; Jianping Liu; Lifeng Pan
Journal:  Autophagy       Date:  2016-05-31       Impact factor: 16.016

4.  Trans-Binding Mechanism of Ubiquitin-like Protein Activation Revealed by a UBA5-UFM1 Complex.

Authors:  Walaa Oweis; Prasanth Padala; Fouad Hassouna; Einav Cohen-Kfir; Dalton R Gibbs; Emily A Todd; Christopher E Berndsen; Reuven Wiener
Journal:  Cell Rep       Date:  2016-09-20       Impact factor: 9.423

5.  The Atg12-Atg5 conjugate has a novel E3-like activity for protein lipidation in autophagy.

Authors:  Takao Hanada; Nobuo N Noda; Yoshinori Satomi; Yoshinobu Ichimura; Yuko Fujioka; Toshifumi Takao; Fuyuhiko Inagaki; Yoshinori Ohsumi
Journal:  J Biol Chem       Date:  2007-11-06       Impact factor: 5.157

6.  Insights into the evolution of Archaea and eukaryotic protein modifier systems revealed by the genome of a novel archaeal group.

Authors:  Takuro Nunoura; Yoshihiro Takaki; Jungo Kakuta; Shinro Nishi; Junichi Sugahara; Hiromi Kazama; Gab-Joo Chee; Masahira Hattori; Akio Kanai; Haruyuki Atomi; Ken Takai; Hideto Takami
Journal:  Nucleic Acids Res       Date:  2010-12-15       Impact factor: 16.971

7.  Conservation in the mechanism of Nedd8 activation by the human AppBp1-Uba3 heterodimer.

Authors:  Richard N Bohnsack; Arthur L Haas
Journal:  J Biol Chem       Date:  2003-05-10       Impact factor: 5.157

8.  E1-L2 activates both ubiquitin and FAT10.

Authors:  Yu-Hsin Chiu; Qinmiao Sun; Zhijian J Chen
Journal:  Mol Cell       Date:  2007-09-21       Impact factor: 17.970

9.  The sulfurtransferase activity of Uba4 presents a link between ubiquitin-like protein conjugation and activation of sulfur carrier proteins.

Authors:  Jennifer Schmitz; Mita Mullick Chowdhury; Petra Hänzelmann; Manfred Nimtz; Eun-Young Lee; Hermann Schindelin; Silke Leimkühler
Journal:  Biochemistry       Date:  2008-05-21       Impact factor: 3.162

10.  Activation of UbcH5c~Ub is the result of a shift in interdomain motions of the conjugate bound to U-box E3 ligase E4B.

Authors:  Sarah E Soss; Rachel E Klevit; Walter J Chazin
Journal:  Biochemistry       Date:  2013-04-15       Impact factor: 3.162

View more
  129 in total

1.  PML hyposumoylation is responsible for the resistance of pancreatic cancer.

Authors:  Mirna Swayden; George Alzeeb; Rawand Masoud; Yolande Berthois; Stéphane Audebert; Luc Camoin; Laurent Hannouche; Hortense Vachon; Odile Gayet; Martin Bigonnet; Julie Roques; Françoise Silvy; Alice Carrier; Nelson Dusetti; Juan L Iovanna; Philippe Soubeyran
Journal:  FASEB J       Date:  2019-09-16       Impact factor: 5.191

2.  Closing the gap of ubiquitin activation.

Authors:  Manoj Kumar; Reuven Wiener
Journal:  Proc Natl Acad Sci U S A       Date:  2019-07-05       Impact factor: 11.205

3.  How DNA vicinity controls SUMO E3 ligase activity.

Authors:  Andrea Pichler
Journal:  EMBO J       Date:  2018-05-24       Impact factor: 11.598

4.  Geminivirus Replication Protein Impairs SUMO Conjugation of Proliferating Cellular Nuclear Antigen at Two Acceptor Sites.

Authors:  Manuel Arroyo-Mateos; Blanca Sabarit; Francesca Maio; Miguel A Sánchez-Durán; Tabata Rosas-Díaz; Marcel Prins; Javier Ruiz-Albert; Ana P Luna; Harrold A van den Burg; Eduardo R Bejarano
Journal:  J Virol       Date:  2018-08-29       Impact factor: 5.103

5.  Acetylation of SUMO2 at lysine 11 favors the formation of non-canonical SUMO chains.

Authors:  Anne Gärtner; Kristina Wagner; Soraya Hölper; Kathrin Kunz; Manuel S Rodriguez; Stefan Müller
Journal:  EMBO Rep       Date:  2018-09-10       Impact factor: 8.807

6.  SUMOylation Targets Adeno-associated Virus Capsids but Mainly Restricts Transduction by Cellular Mechanisms.

Authors:  Qingxin Chen; Robin Njenga; Barbara Leuchs; Susanna Chiocca; Jürgen Kleinschmidt; Martin Müller
Journal:  J Virol       Date:  2020-09-15       Impact factor: 5.103

Review 7.  The role of ubiquitination in tumorigenesis and targeted drug discovery.

Authors:  Lu Deng; Tong Meng; Lei Chen; Wenyi Wei; Ping Wang
Journal:  Signal Transduct Target Ther       Date:  2020-02-29

8.  SUMO conjugation regulates immune signalling.

Authors:  Sushmitha Hegde; Amarendranath Soory; Bhagyashree Kaduskar; Girish S Ratnaparkhi
Journal:  Fly (Austin)       Date:  2020-08-31       Impact factor: 2.160

9.  Crystal structure of a human ubiquitin E1-ubiquitin complex reveals conserved functional elements essential for activity.

Authors:  Zongyang Lv; Katelyn M Williams; Lingmin Yuan; James H Atkison; Shaun K Olsen
Journal:  J Biol Chem       Date:  2018-10-02       Impact factor: 5.157

10.  UFM1-Activating Enzyme 5 (Uba5) Requires an Extension to Get the Job Done Right.

Authors:  Zongyang Lv; Shaun K Olsen
Journal:  J Mol Biol       Date:  2018-11-17       Impact factor: 5.469

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.