Literature DB >> 35552740

The metaphorical swiss army knife: The multitude and diverse roles of HEAT domains in eukaryotic translation initiation.

Daniel Friedrich^1,2,3, Assen Marintchev⁴, Haribabu Arthanari^2,3.

Abstract

Biomolecular associations forged by specific interaction among structural scaffolds are fundamental to the control and regulation of cell processes. One such structural architecture, characterized by HEAT repeats, is involved in a multitude of cellular processes, including intracellular transport, signaling, and protein synthesis. Here, we review the multitude and versatility of HEAT domains in the regulation of mRNA translation initiation. Structural and cellular biology approaches, as well as several biophysical studies, have revealed that a number of HEAT domain-mediated interactions with a host of protein factors and RNAs coordinate translation initiation. We describe the basic structural architecture of HEAT domains and briefly introduce examples of the cellular processes they dictate, including nuclear transport by importin and RNA degradation. We then focus on proteins in the translation initiation system featuring HEAT domains, specifically the HEAT domains of eIF4G, DAP5, eIF5, and eIF2Bϵ. Comparative analysis of their remarkably versatile interactions, including protein-protein and protein-RNA recognition, reveal the functional importance of flexible regions within these HEAT domains. Here we outline how HEAT domains orchestrate fundamental aspects of translation initiation and highlight open mechanistic questions in the area.

Entities: Chemical

Mesh：

Substances：

Year: 2022 PMID： 35552740 PMCID： PMC9177959 DOI： 10.1093/nar/gkac342

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 19.160

INTRODUCTION

The characteristic architecture of the HEAT domains and the built-in flexibility

In 1995, Andrade and Bork used systematic sequence analyses to identify a specific, repeating motif found in multi-domain proteins (1). This motif was detected in four distinct proteins, leading to a designation derived from those protein names, the ‘HEAT’ repeat: untingtin, eukaryotic translation longation factor 3 (eEF3), the regulatory subunit of protein phosphatase 2A (PP2A), and mechanistic arget of rapamycin (mTOR). The number of these repeating motifs in individual HEAT domains can vary enormously from 3 to 50 and each motif is comprised of approximately 50 residues (2). The conservation of amino acid composition amongst HEAT domains tends to be relatively low as evidenced by a weak consensus sequence (Figure 1A) (3); however, structural conservation is high with the fundamental HEAT motif formed by two anti-parallel, amphiphilic α-helices connected by one turn, arranged about a common axis (Figure 1B) (2). The turn features a conserved aspartate residue that is often found to interact with an arginine side chain in the second helix via a salt bridge. Consensus sequence residues in the two α-helices typically face each other. These basic helix-turn-helix motifs are further connected by linking inter-motif loops. The overall arrangement can result in a curved architecture (Figure 1C), placing HEAT domains in the alpha solenoid protein family (also known as alpha horseshoes), which also includes armadillo, ankyrin and leucine-rich repeats (4) (Figure 1D). This protein family has been found to be involved in many intracellular signaling and transport processes. Examples for leucine-rich repeat-containing proteins are the toll-like receptor, the ribonuclease inhibitor, and the tropomyosin regulator tropomodulin; in these proteins, leucine-rich repeats mediate protein–protein interactions with binding partners (5–7). Armadillo repeats are found in various proteins as well, e.g. in β-catenin and α-importin; prominent examples of proteins with ankyrin repeats are Notch and NFκB1 (8–13).

Figure 1.

HEAT domains have a characteristic structure. (A) The alignment of HEAT repeat sequences of mouse importin-β highlights a consensus sequence - - -LLP-L- - -ϕ-…- -D- -…- -VR- -A- - -L- -L- - - (ϕ being a hydrophobic residue). (B) The characteristic fold of HEAT domains consists of a helix-turn-helix motif arranged about a common axis (PDB code 1gcj); consensus sequence residues are colored in blue according to the alignment shown in (A). (C) Twelve repeats (colored in alternating blue and grey) of the yeast importin-β HEAT domain feature an alpha solenoid arrangement (PDB code 3nd2). (D) Other members of the alpha solenoid family include armadillo (as example the armadillo repeat region of murine β-catenin, PDB code 3bct), ankyrin (as example the ankyrin domain of the Notch receptor, PDB code 1ot3) and leucine-rich repeats (as example the leucine-rich repeat variant, PDB code 1lrv). (E) HEAT domains exhibit high elasticity, facilitating the formation of multiple global structures. HEAT domains are found in a large variety of proteins, often enabling distinct protein–protein interactions and one of the ways this can be achieved is by structural plasticity, adopting multiple tertiary conformations (1,14–16). The inherent flexibility in the HEAT domain has been experimentally demonstrated using fluorescence spectroscopy and small angle X-ray scattering (17,18). Molecular dynamics simulations further suggested that HEAT domains are highly elastic upon the external application of force (14,19). It is conceivable that such elastic properties provide the structural flexibility required for binding to various interaction partners in vivo (Figure 1E). Even though the fundamental unit, the helix-turn-helix motif (Figure 1B), is structurally highly conserved, HEAT domains are capable of adopting multiple, structurally heterogeneous global architectures (Figure 1E). This diversity is reflected in the plethora of processes in which HEAT domains are involved, ranging from nucleocytoplasmic transport to chromatin remodeling (3,20–22), thus establishing HEAT domains as a fundamental building block deployed by the cell to accomplish a variety of essential processes.

HEAT domains orchestrate vital cellular processes in eukaryotes

HEAT repeats form versatile higher order architecture that can mediate both protein–protein and protein–nucleic acid interactions (23–25). In this section we will highlight a few illustrative examples of HEAT domain-containing proteins that regulate key cellular functions, including mediators of nuclear transport, chromatin remodeling, protein dephosphorylation and RNA degradation. The archetypal examples are the four eponymous proteins: huntingtin, eEF3, PP2A and mTOR. Mutations in huntingtin that introduce a poly-glutamine sequence lead to aggregation, resulting in neuron degeneration (26,27). eEF3 is an essential factor for translation elongation found in fungi (28–30). In eukaryotic cells, the structural subunit of the ubiquitously expressed phosphatase PP2A, a classic example of an alpha solenoid structure consisting of HEAT repeats (1,31–32), contributes significantly to cellular phosphatase activity, dephosphorylating important signaling factors such as RAF, MEK or AKT (33–35). Lastly, the function of mTOR involves HEAT domains as well (36–38). mTOR forms two complexes (mTORC1 and mTORC2), both of which serve as central hubs for multiple signaling networks within the cell (39,40), involving them in the sensing of nutrients, energy and redox potentials and the regulating of the actin cytoskeleton, protein synthesis, and survival and metabolism (41,42). The diversity of roles and pathways for these four proteins is a testament to the versatility of HEAT domains. In addition to these four classic examples, there are other notable cases that highlight how these repeats critically contribute to cellular function. The ataxia-telangiectasia mutated serine/threonine kinase (ATM) and the DNA-dependent protein kinase catalytic subunit (DNA-PKcs) are both chromosome-associated factors that harbor large, flexible HEAT scaffolds (3,43). Nuclear transport mediated by importin is another well-studied case of a fundamental cellular process involving HEAT repeats (44,45). Belonging to the karyopherin family, importin recognizes proteins as cargo through their nuclear localization sequences (NLS) and directs them into the nucleus through the nuclear pore complex (46–48). Importin consists of two subunits, α and β, and it is the β-subunit that has a characteristic HEAT domain, containing nineteen repeats of the basic helix-turn-helix motif (44,49). Finally, HEAT domains are also directly involved in mediating protein–nucleic acid interactions. The Ro protein and its HEAT repeat-mediated binding to RNA, for example, have been studied extensively over the past few decades (25,50–54). Structural studies have shown that Ro consists of two domains, one adopting a Rossmann fold, while the other is a HEAT domain arranged in a toroid-like shape. These examples outline the remarkable versatility of HEAT domains in mediating protein–protein and protein–nucleic acid interactions. In the following sections, we will focus exclusively on HEAT domains in proteins essential for eukaryotic translation initiation.

DISCUSSION

The multifaceted role of HEAT domains in eukaryotic translation initiation

HEAT domains feature prominently among proteins involved in eukaryotic translation. To our knowledge, more than ten different proteins which are translation initiation factors or regulators of protein synthesis, ranging in size from about 50 kDa (basic leucine zipper and W2 domain-containing protein (BZW)) to 300 kDa (eIF2α kinase activator GCN1), harbor HEAT domains. Some of them are schematically shown in Figure 2. Notably, the HEAT domains in most of these proteins share a common ancestor and can be classified into three groups, based on sequence homology and structure (55–57).

Figure 2.

HEAT domain-containing proteins involved in protein synthesis. In most of the proteins discussed in the main text that contribute to translation control, HEAT domains can be grouped in three classes (based on their sequences): middle domain of translation initiation factor 4G (MIF4G, also termed HEAT1 (blue)), MA3 (also termed HEAT2 (brown)) and W2 (also termed HEAT3 (red)) domains. Non-HEAT domains and binding motifs are shown in light grey and HEAT domains in colors and black. eIF4G = eukaryotic initiation factor 4G (UniProtKB Q04637); DAP5 = death-associated protein 5 (UniProtKB P78344); eIF5 = eukaryotic initiation factor 5 (UniProtKB P55010); eIF2Bϵ = eukaryotic initiation factor 2Bϵ (UniProtKB Q13144); NCBP1 = nuclear cap-binding protein subunit 1 (UniProtKB Q09161); PDCD4 = programmed cell death protein 4 (UniProtKB Q53EL6); BZW = basic leucine zipper and W2 domain-containing protein (UniProtKB Q9Y6E2); PAIP1 = polyadenylate-binding protein interacting protein 1 (UniProtKB Q9H074); NOM1 = nucleolar MIF4G domain-containing protein 1 (UniProtKB Q5C9Z4); eEF3 = eukaryotic translation elongation factor 3 (UniProtKB P16521); GCN1 = eIF2α kinase activator general control nonderepressible 1 (UniProtKB Q92616); PAM = polyadenylate-binding protein interaction motif; SNT = sugar-nucleotidyl transferase-like; AT = acyl transferase-like; 4HB = four helical bundle domain; ABC = ATP-binding domain of ABC transporters. All proteins illustrated here are derived from H. sapiens, except eEF3 which is from S. cerevisiae. The first group, defined by the so-called middle domain of eukaryotic translation initiation factor 4G (MIF4G), contains five to seven repeats of the basic helix-turn-helix motif (Figure 2) (55,58). As the most N-terminally located HEAT domain in both isoforms of eukaryotic initiation factor 4G (eIF4G, the isoforms are commonly termed eIF4GI and eIF4GII; the latter, however, is also abbreviated as eIF4G3 in online sequence databases), MIF4G is also often referred to as HEAT1 in the literature (58–61). In addition to being located in eIF4G, the MIF4G/HEAT1 domain is found in a number of other translation factors and regulators, including the eIF4G homolog death-associated protein 5 (DAP5; also known as p97, NAT1 or, in online sequence databases, eIF4G2), nuclear cap-binding protein subunit 1 (NCBP1), the CBP80/20-dependent translation initiation factor (CTIF) (62), the SLBP-interacting protein 1 (SLIP1), required for translation initiation of histone mRNAs that have a 3′ stem-loop but no poly-A tail (63), the Drosophila protein Mextli, which appears to function as an eIF4E-binding protein (64), the Up-frameshift suppressor 2 (UPF2), which has three MIF4G domains, involved in nonsense-mediated decay (NMD) of mRNAs containing premature stop codons by associating with the nuclear exon junction complex (EJC) (65), the polyadenylate-binding protein-interacting protein 1 (PAIP1), and the nucleolar MIF4G domain-containing protein 1 (NOM1) (Figure 2). NOM1 is involved in ribosome biogenesis and interacts with eIF4A3, a DEAD box ATPase involved in alteration of RNA secondary structure (66,67). The second group is represented by the MA3 domain (Figure 2) (55,56), also known as the HEAT2 domain (61,68–71), and is typically formed by three to four HEAT repeats. This class of HEAT domains is also found in eIF4G, DAP5, programmed cell death protein 4 (PDCD4), and NOM1 (Figure 2) (56,58,68,72–78). BZW has also been suggested to feature a MA3/HEAT2 domain (79), but this has not yet been experimentally confirmed. The third group constitute the W2 domains, termed after two invariant tryptophan residues (Figure 2) (56,68,80). These W2 HEAT domains are found at the C-termini of eIF4G, DAP5, eIF5, eIF2Bϵ, and BZW (57,68,81–86). In line with the nomenclature for the MIF4G/HEAT1 and MA3/HEAT2 domains and due to its C-terminal location in eIF4G, the W2 domain is also termed HEAT3 (56). MA3-like and W2-like HEAT domains are also found in NCBP1, C-terminal to its MIF4G/HEAT1 domain (79). In addition to the proteins containing MIF4G/HEAT1, MA3/HEAT2 and W2/HEAT3 domains, characteristic HEAT repeats can also be found in other proteins involved in the regulation of translation. These include the fungal protein eEF3 and GCN1 that features an eEF3-like HEAT domain (Figure 2). There is evidence that GCN1 is composed almost exclusively of HEAT repeats, which extend both N- and C-terminal of its eEF3-like domain as indicated by secondary structure predictions and recent cryo-electron microscopy-derived structural information (87–89). The HEAT domains found in these proteins are distinct, showing no sequence homology to the above three HEAT domain classes. Other HEAT domain-containing proteins, such as the La-related protein 1 (LARP1) and Listerin, are structurally most closely related to importin-β. LARP1 is a regulator of translation of 5′ terminal oligopyrimidine (TOP) mRNAs. Listerin is an E3 ubiquitin-protein ligase component of the ribosome quality control complex (RQC). In addition, there are many other proteins that have helical hairpin repeat domains structurally related to the HEAT domains. For example, six of the 13 subunits of mammalian eIF3 are members of the proteasome, COP9 signalosome, eIF3 (PCI) family of proteins, structurally closely related to the HEAT family (90). All the proteins described in the section above are involved in mRNA translation. In the critical translation initiation phase, both 5′ 7-methylguanosine (m7G) cap-dependent and cap-independent mechanisms are known (91). The latter has been shown to contribute to regulation of cell fate decisions in the context of cancer and cellular differentiation, and plays an important role in viral replication (92).

Initiation of cap-dependent translation

A number of eukaryotic initiation factors have been thoroughly characterized providing a detailed mechanistic understanding of canonical, i.e. cap-dependent translation initiation (93). In this process, the initial step is assembly of the cap binding trimeric complex eIF4F, in which eIF4E recognizes the mRNA 5′cap, and the scaffolding initiation factor eIF4G binds to eIF4E and recruits the RNA helicase eIF4A, an interaction mediated by the MIF4G/HEAT1 and MA3/HEAT2 domains of eIF4G. Initially, eIF4A unwinds secondary structures in the 5′-untranslated mRNA region (5′UTR) of the mRNA in the vicinity of the cap, with the help of eIF4G and eIF4B. Subsequently, eIF4G recruits the 43S pre-initiation complex which scans the 5′UTR with continued unwinding of the mRNA by eIF4A. When the start codon (AUG) in a proper sequence context is identified, the 48S pre-initiation complex is formed and a subset of the eIFs are released upon GTP hydrolysis by eIF2. eIF2 is replaced at the Met-tRNAiMet by eIF5B, another GTPase, which promotes engagement with the 60S ribosomal subunit to form the 80S ribosome, competent to proceed with the elongation step. It is well established that assembly of eIF4F is inhibited by sequestering eIF4E under cellular stress conditions, thereby suppressing cap-dependent and presumably promoting cap-independent translation initiation mechanisms (93). Of the various HEAT domain-featuring proteins involved in protein synthesis, eIF4G and its homolog DAP5, eIF5, and eIF2Bϵ play particularly crucial roles in translation initiation. Since high-resolution structures of the HEAT domains of these proteins have been published, we analyzed their structural properties to highlight the multiple functions of HEAT domains in translation initiation.

Functions of HEAT domain-containing translation initiation factors

The translation initiation factors eIF4G, DAP5, eIF5, and eIF2Bϵ have specific and distinct functions in the first phase of protein synthesis. As introduced in the previous section, eIF4G plays an essential role in recruiting the pre-initiation complex to mRNA in cap-dependent translation by binding eIF4E (Figure 3A), which recognizes the m7G 5′cap, eIF4A, ensuring mRNA unwinding, and polyadenylate-binding protein (PABP), which induces mRNA circularization (94–98). As the major scaffolding initiation factor, eIF4G also promotes binding of the 40S ribosomal subunit to the mRNA by interacting with eIF3 and the 40S subunit directly (59,99–102). eIF4G, which has three HEAT domains in total (Figure 2), exists as two isoforms, eIF4GI and eIF4GII (also known as eIF4G3), with the former being more abundant (93,103–104). The isoforms are encoded by separate genes, and although their functions are similar, there is evidence that they may exhibit some selectivity bias towards different mRNAs (93). Playing a distinct role to eIF4G, its homolog DAP5 has the same HEAT domain organization and homology with the C-terminal two thirds of eIF4G (Figure 2). However, as DAP5 lacks the ability to bind PABP and eIF4E, it promotes non-canonical translation initiation mechanisms by recruiting the ribosome, including via internal ribosome entry sites (IRESs) in 5′UTRs of mRNAs (Figure 3A) (105–109). It has been suggested that DAP5 can also promote an alternative cap-dependent mechanism by utilizing eIF3d (110).

Figure 3.

eIF4G, DAP5, eIF5 and eIF2Bϵ mediate their functionality through structurally highly similar HEAT domains. (A) (i) In canonical cap-dependent translation initiation, eIF4G (green) binds to cap-binding eIF4E and the RNA helicase eIF4A. (ii) DAP5 (dark grey) has been shown to promote cap-independent translation initiation events, such as binding of internal ribosome entry sites (IRESs) in 5′ untranslated regions (5′UTRs) in mRNAs with subsequent recruitment of the ribosome. (iii) eIF5 (purple) binds to eIF3 and eIF2-GTP, associated with the Met-tRNAiMet, to form the 43S pre-initiation complex. (iv) Formation of the 43S translation pre-initiation complex depends on regeneration of the eIF2-GTP:Met-tRNAiMet ternary complex. Upon GTP hydrolysis, eIF2 dissociates and the guanine nucleotide exchange factor eIF2B (yellow) promotes GDP–GTP exchange on eIF2 and Met-tRNAiMet binding to form a new complex. (B) HEAT domain-mediated interactions of eIF4G, DAP5, eIF5 and eIF2Bϵ with RNA, other translation initiation factors, and MNK1. GDI = GDP dissociation inhibitor; GEF = guanine nucleotide exchange factor. (C) Overlays of MIF4G/HEAT1 (left panel), MA3/HEAT2 (middle panel) and W2/HEAT3 (right panel) domains of DAP5 (grey), eIF4GII/eIF4G (green), the ϵ-subunit of eIF2B (yellow), and eIF5 (purple). PDB codes: eIF4GII MIF4G/HEAT1 1hu3; DAP5 MIF4G/HEAT1 4iul; eIF4G MA3/HEAT2 1ug3; DAP5 MA3/HEAT2 3l6a; eIF4G W2/HEAT3 1ug3; DAP5 W2/HEAT3 3d3m; eIF5 W2/HEAT3 2iu1; eIF2Bϵ W2/HEAT3 3jui. eIF5 is another HEAT domain-containing factor directly involved in translation initiation. Similar to eIF4G, DAP5, and eIF2Bϵ, it has a C-terminal W2/HEAT3 domain (Figure 2) (83,84). eIF5 joins the 43S pre-initiation complex through specific binding to eIF2-GTP and eIF3c (Figure 3A), as well as eIF1 and eIF1A (83,93,111–113). eIF5 acts as the GTPase-activating protein of eIF2, which brings the Met-tRNAiMet to the pre-initiation complex (114,115). Upon start codon recognition, GTP hydrolysis, and phosphate release, eIF5 dissociates together with eIF2-GDP from the pre-initiation complex (116–120). eIF2-GDP, which is released after every round of translation initiation, is then recycled to form a new eIF2-GTP:Met-tRNAiMet ternary complex (Figure 3A). This recycling function is performed by the guanine nucleotide exchange factor eIF2B, which substitutes GTP for GDP on eIF2 and promotes Met-tRNAiMet binding to the eIF2-GTP complex (121–123). Structurally, eIF2B consists of two copies each of five subunits, α, β, γ, δ, and ϵ, the last of which features a W2/HEAT3 domain at its C-terminus that actually functions as the catalytic domain of the protein, again highlighting the functionality of HEAT domains (Figure 2) (124,125).

The interaction network nucleated by HEAT domains in translation initiation

The wide diversity of interactions mediated by the HEAT domains of eIF4G, DAP5, eIF5 and eIF2Bϵ is remarkable (Figure 3B). The MIF4G/HEAT1 domains of eIF4G and DAP5 are involved in binding to eIF4A and eIF3 (Figure 3B) (58,69,73,83,102,126–128); some contribution to eIF3 interaction is mediated by the intrinsically disordered region (IDR) in eIF4G, located C-terminally of the MIF4G/HEAT1 domain (129–131). In the case of eIF4G, the MIF4G/HEAT1 domain has also been shown to bind to RNA as well as protein, as demonstrated by studies on the encephalomyocarditis virus IRES (59). The DAP5 MIF4G/HEAT1 domain is capable of binding RNA as well (109,132). Whereas the MA3/HEAT2 domain of eIF4G serves as a second interaction site for eIF4A binding, interaction partners for the MA3/HEAT2 domain of DAP5 have not yet been identified (Figure 3B) (73,76,128,133). The W2/HEAT3 domains of both eIF4G and DAP5 recruit mitogen-activated protein (MAP) kinase-interacting serine/threonine-protein kinase 1/2 (MNK1/2) via a direct interaction between the N-terminal disordered region of MNK1/2 and the W2/HEAT3 domain. MNK1/2 interacts with MAP kinase (MAPK) 1, thus establishing a link between the regulation of translation initiation and the MAPK/extracellular signal-regulated kinase (ERK) pathway (134,135). MNK1/2 phosphorylates the cap-binding protein eIF4E at serine 209, thereby promoting canonical translation initiation of select mRNAs (134,136–137). In the case of DAP5, the same W2/HEAT3 domain has also been shown to interact with the N-terminal tail (NTT) of eIF2β, yet another interaction with a disordered region of a protein (133). The W2/HEAT3 domains of eIF5 and eIF2Bϵ also bind to the eIF2β−NTT, but form interactions distinct from those of the eIF4G and DAP5 W2/HEAT3 domains (Figure 3B). The eIF2Bϵ W2/HEAT3 domain interacts with the γ subunit of eIF2 (138–140), while the eIF5 W2/HEAT3 domain mediates binding to eIF1, eIF1A, and eIF3c (83,120,141–144). In contrast, through their MIF4G/HEAT1 domains, eIF4G and DAP5 interact with a different portion of eIF3, which is located on the solvent surface of the 40S ribosomal subunit (Figure 3B) (102,110,145–146). Most notably, the W2/HEAT3 domains of eIF5 and eIF2Bϵ have additional, stand-alone functionality in translation initiation. In eIF5, the W2/HEAT3 domain contributes to the GDP dissociation inhibitor (GDI) activity by blocking eIF2B access to eIF2–GDP. In eIF2Bϵ, the guanine nucleotide exchange factor for eIF2, the W2/HEAT3 domain is the catalytic domain mediating nucleotide exchange (Figure 3A and B) (147–150). When eIF2–GDP is released together with eIF5 from the 48S complex, the eIF2Bϵ W2/HEAT3 domain is partially responsible for displacement of the eIF5 W2/HEAT3 domain from eIF2β–NTT, thus contributing to the GDI dissociating factor activity of eIF2B (147,151–152). This demonstrates how HEAT domains can contribute functionality in different ways within the same eukaryotic translation initiation pathway.

A view at the HEAT domains through a structural lens

Molecular structures of the HEAT domains in eIF4G, DAP5, eIF5 and eIF2Bϵ have been determined by X-ray crystallography (Figure 3C) (58,68,84,86,127,153–154). Comparison of structures of the DAP5 and eIF4GII MIF4G/HEAT1 domains reveals a high degree of structural similarity between them, with only minor differences as detailed in the following paragraphs. The overall fold resembles the characteristic HEAT repeat-based shape with most of the α-helices overlaying well, even though they share only 43% sequence identity in structure-based sequence alignments (127). The strongest differences are found in orientations for some of the α-helices and, even more pronounced, two loops that are longer in DAP5 than in eIF4GII. Structural differences between the MA3/HEAT2 domains of DAP5 and eIF4G are even more subtle (68,153). Superposition of the two shows in both cases a remarkably compact structure with well-aligning α-helices (Figure 3C). Loops connecting the α-helices are rather short in both structures, presumably limiting the degree of freedom to achieve larger structural variations. However, differences in orientation of the α-helices in the first repeat may mediate distinct functionalities through modulating the interaction with eIF4A as the eIF4G MA3/HEAT2 domain contributes to eIF4A binding, which has not been reported for the MA3/HEAT2 domain of DAP5 (see above and Figure 3B). Comparison of W2/HEAT3 domain structures of eIF4G, DAP5, eIF5 and eIF2Bϵ highlights similar characteristic α-helical organization, as expected (Figure 3C). Some differences are observed in length and orientation of α-helices in the first and last HEAT repeats of the domains. Since no significant structural differences between the W2/HEAT3 domains of DAP5 and eIF4G are found, both are expected to bind similarly to MNK1/2. In fact, similar binding affinities for both have been indicated by comparative co-immunoprecipitation experiments of endogenous MNK1/2 (134). As, unlike eIF4G, DAP5 lacks the eIF4E-binding site (Figure 2), it is not immediately obvious why DAP5 would recruit MNK1/2. It is conceivable that DAP5 inhibits cap-dependent translation by competing with MNK1/2–eIF4G binding through sequestering MNK1/2 and thereby decreasing eIF4E phosphorylation levels. While most kinases phosphorylate specific residues in defined consensus sequences, no such consensus sequence is known for MNK1/2. Instead, MNK1/2 binds to eIF4G and phosphorylates eIF4E in the eIF4F complex. Thus, MNK1/2 specificity appears to be driven by its site of recruitment. Another reason for MNK1/2–DAP5 interaction would be the possibility of MNK1/2-mediated phosphorylation of DAP5. Only a few of the putative phosphorylation sites in DAP5 have been studied experimentally (155). Alternatively, other proteins that bind to DAP5 are possible targets for MNK1/2-phosphorylation, similar to eIF4G-aided eIF4E phosphorylation by MNK1/2. The functional relevance and the underlying mechanism of DAP5–MNK1/2 binding remain to be explored. The reported binding of DAP5 to eIF2β–NTT is also puzzling. As in the case of eIF4G, if DAP5 interacts with eIF3 and eIF4A at the solvent surface of the 40S ribosomal subunit, it would be positioned far away from eIF2, which is on the opposite, interface surface of the 40S ribosome subunit. Compared to the differences between eIF4G and DAP5 HEAT domains, more pronounced structural variation is found in the eIF2Bϵ and eIF5 W2/HEAT3 domains, in particular in the first HEAT repeat (Figure 3C). This is not surprising as eIF2Bϵ and eIF5 perform very different functions than eIF4G and DAP5, which additionally have MIF4G/HEAT1 and MA3/HEAT2 domains (156). Taken together, comparing the global structures and folds of MIF4G/HEAT1, MA3/HEAT2 and W2/HEAT3 domains provides only limited insight into mechanistic and functional differences and similarities among eIF4G, DAP5, eIF5, and eIF2Bϵ. For example, it is not directly obvious from this structural analysis how MIF4G/HEAT1 domains recognize RNA structures and interact with them. The role of MNK1/2 binding to the W2/HEAT3 domain of DAP5 remains unresolved and needs to be studied in more detail. To address the question how the HEAT domains interact with RNA and proteins and what factors dictate specificity, we analyzed electrostatic potentials, hydrophobicity and evolutionary conservation of their surfaces and binding studies below, focusing on specific examples of MIF4G/HEAT1 interactions with eIF4A and RNA, and W2/HEAT3 binding to eIF2β and MNK1/2.

The interactions of eIF4A with both the eIF4G and the DAP5 MIF4G/HEAT1 domains are based on similar binding mechanisms

As discussed in the previous section, the HEAT domain structures of DAP5, eIF4G, eIF2Bϵ and eIF5 are very similar with minor deviations. This is not surprising as HEAT domains generally share high structural similarities, albeit having low sequence homology (1,4,16). Thus, comparative analyses of MIF4G/HEAT1 and W2/HEAT3 domain interactions might provide mechanistic insight by revealing binding modes. We therefore studied the interaction sites on eIF4G and DAP5 MIF4G/HEAT1 domains that would engage eIF4A and RNA in more detail (Figure 4). To this end, we employed the S. cerevisiae complex structure of eIF4G MIF4G/HEAT1 and eIF4A determined by X-ray crystallography (Figure 4A) (126). eIF4A features an N-terminal and a C-terminal domain, both of which interact with eIF4G MIF4G/HEAT1. Here we separate the potential interaction sites into three distinct interfaces on the surface of the eIF4A domains (Sites 1–3) (Figure 4A). The N-terminal domain interacts with MIF4G/HEAT1 through one specific site (Site 1) and the C-terminal domain has two interaction sites (Site 2 and Site 3). Residues participating in the interaction have been identified by mutagenesis earlier (58). A recent cryo-electron microscopy-based structural model indicates that eIF4G MIF4G/HEAT1 and eIF4A interact through these sites in the context of a reconstituted human 48S translation initiation complex (157). At Site 3, eIF4G binds to eIF4A using a short motif that is linked via Loop to the eIF4G MIF4G/HEAT1 domain (Figure 4A). In this short motif, a highly conserved tryptophan residue (579 in S. cerevisiae eIF4G) has been shown by mutational studies to play a significant role in the interaction; however, this is not necessarily the case for DAP5 (126,127). Loop is not resolved in the crystal structure, suggesting that it may remain flexible in the complex. Similarly, one can assume that eIF4G/DAP5 residues that interact with eIF4A at Site 3 are also flexible when eIF4G/DAP5 are not bound to eIF4A. The two domains of eIF4A are continuously moving with respect to each other as the helicase unwinds the mRNA. Therefore, structural flexibility of Loop may provide a mechanistic basis for maintaining high affinity binding between eIF4A and eIF4G as the conformation of eIF4A changes. Leveraging the similarity between the MIF4G/HEAT1 domains of yeast eIF4G and human DAP5, we modeled the DAP5 MIF4G/HEAT1 domain into the complex structure, similar to what has been done earlier by Virgili et al. (127). A large number of residues in the eIF4G/eIF4A binding interfaces are well-conserved between eIF4G and DAP5 (127). This model thus enabled us to discuss MIF4G/HEAT1–eIF4A complex formation and strongly suggests that eIF4G and DAP5 have very similar modes of binding eIF4A.

Figure 4.

The MIF4G/HEAT1 domains of eIF4G and DAP5 have similar interactions to eIF4A, but putatively different ones to IRESs. (A) Crystal structure of the S. cerevisiae complex of eIF4G MIF4G/HEAT1 domain (green) and eIF4A (C-terminal domain in dark blue and N-terminal domain in light blue) (PDB code 2vso). The three interaction sites are indicated in magenta; Site 3 contains a highly conserved W579 residue that is further discussed in the main text. As indicated by the dashed ribbon representation, no electron density is available for Loop that links residues participating in Site 3 to the MIF4G/HEAT1 domain. The human DAP5 MIF4G/HEAT1 domain (grey, PDB code 4iul) is modeled into the complex, superimposing the yeast eIF4G MIF4G/HEAT1 domain. (B) Crystal structure of human eIF4A in complex with polypurine RNA (ten nucleotides, orange), Rocaglamide A (RocA, red), and AMPPNP (not shown) (PDB code 5zc9). The interaction sites for the eIF4G MIF4G/HEAT1 domain are indicated in magenta as revealed by the complex structure in (A). (C) eIF4A interaction sites for the yeast eIF4G MIF4G/HEAT1 domain and the analogous putative ones for the human DAP5 MIF4G/HEAT1 domain are shown in magenta; for both proteins potential IRES interaction sites are shown in orange. IRES binding has been shown for human eIF4G and DAP5 MIF4G/HEAT1 domains and the indication of potential IRES binding sites is based on studies of human eIF4G. Loop and Site 3 are illustrated as in (A). Residues of eIF4A interaction Site 3 are not resolved in the DAP5 MIF4G/HEAT1 domain crystal structure. In the potential IRES binding site of eIF4G, three residues (686–688) in Loop are not resolved as indicated by the dashed line. In DAP5, this loop is fully resolved (Loop), where it adopts a different conformation than in eIF4G. (D) Structural surface characteristics of the MIF4G/HEAT1 domains of S. cerevisiae eIF4G (top six representations, PDB code 2vso) and human DAP5 (bottom six representations, PDB code 4iul). For both cases, the electrostatic surface potential, the surface hydrophobicity according to Kyte-Doolittle, and evolutionary conservation of surface residues are shown. The structures are shown in the same orientations as in (C).

eIF4G and DAP5 MIF4G/HEAT1 domains engage RNA presumably through different binding modes

Beyond MIF4G/HEAT1–eIF4A binding, we also sought to find indications for MIF4G/HEAT1–RNA interaction sites. As discussed above, both eIF4G and DAP5 MIF4G/HEAT1 domains bind to RNA. Since no high-resolution molecular structures for such interaction are available, we investigated how RNA binding sites of eIF4A relate to the ones of eIF4G/DAP5 interaction to eIF4A. A structure of eIF4A bound to RNA has been solved recently, which includes the eIF4A inhibitor Rocaglamide A (RocA) and the non-hydrolyzable ATP analogue AMP-PNP (Figure 4B) (158). The short, ten nucleotide long RNA in this complex binds to eIF4A at the opposite side of the eIF4G/DAP5–eIF4A interaction sites, suggesting that a longer RNA would have to fold back along the eIF4A surface to the eIF4G or DAP5 MIF4G/HEAT1 domains to enable their interaction with RNA (59,109,132,159–160). Since eIF4G and DAP5 orchestrate different mechanisms of translation initiation, we analyzed structural signatures embedded in the surface of their MIF4G/HEAT1 domain. To this end, we compared the binding sites to eIF4A based on the model in Figure 4A, and the ones to the binding to IRESs as suggested by mutational studies (Figure 4C) (58). As revealed by our and earlier models, the sites for eIF4A binding are located in the same areas on eIF4G and DAP5 MIF4G/HEAT1 (127); Site 3 is not resolved in the DAP5 MIF4G/HEAT1 structure. Hence, differences and similarities regarding flexibility and length of Loop and the sequence contributing to Site 3 remain elusive. Tryptophan 579 is conserved in DAP5, indicating a similar contribution of Site 3 to the mode of binding to eIF4A, even though such interaction has not been reported yet (127). Restrictively, we also would like to note that experimental evidence for eIF4A interaction sites in DAP5 MIF4G/HEAT1 is still missing. The proposed residues involved in IRES binding are based on mutational studies of eIF4G (58). These experiments also suggested a specific loop, 13 amino acids in length (residues 819–832 in human eIF4G), to contribute to IRES binding (specifically residues 823–831). However, this loop is not fully resolved in available eIF4G MIF4G/HEAT1 crystal structures. In human eIF4G, residues 823–831 are not resolved, but electron density is missing only for three residues in the yeast crystal structure (residues 686–688 in Loop) (Figure 4C). In contrast, sufficient electron density is observed for the complete loop in DAP5 MIF4G/HEAT1 (residues 142–161), where the loop is also seven residues longer than in eIF4G. Therefore, the proposed IRES interaction site is larger in the case of DAP5 compared to eIF4G in Figure 4C. Since the loop adopts different conformations in the eIF4G and DAP5 structures, the respective surface area varies. More recently, mutational studies of eIF4G MIF4G/HEAT1 further suggested the involvement of two specific aromatic residues in IRES binding, phenylalanines 812 and 978 (59). Aromatic amino acid side chains have been shown to play an important role in protein–nucleic acid interactions, specifically through π–π stacking with adenine (161,162). The role of F812 in engaging the IRES agrees well with earlier studies (58). There is a critical distinction between the nonspecific interactions of eIF4G with mRNA and the stronger and specific interactions with various IRESs. Single-molecule fluorescence assays and RIP-seq experiments indicate that eIF4G orthologs bind nonspecifically and dynamically with mRNA (163–165). In contrast, the interactions of eIF4G and DAP5 with IRESs are structure- and/or sequence-specific, suggesting that common structural folds in RNA could dictate the interactions (59,132,160,166–171). Thus, a protein with high affinity for a specific RNA sequence or structure motif may tend to have lower, albeit significant affinity for any RNA, and vice versa (56,172). The same is true for other eIFs. For example, eIF3 has high affinity for specific motifs in certain mRNAs, which plays an important role in translation reinitiation (173). Important for the present discussion, the helicase eIF4A also has high affinity for structures in some IRESs, where synergistic binding of eIF4A and eIF4G is critical for the high affinity and specificity of their binding to the IRES (168,174–176). Nonspecific RNA-binding proteins involved in translation initiation can thus have high affinity for specific RNA motifs, which may be used in the cell for regulation of translation (59,177–178). Deciphering the mechanisms of recognition of specific IRESs by DAP5 and eIF4G isoforms from different species, and the exact contact interfaces involved, would require solving the structures of a number of such complexes at high resolution. To further tease apart the similarities and differences between the eIF4A and IRES interaction sites we analyzed protein surface features of the eIF4G and DAP5 MIF4G/HEAT1 domains (Figure 4D). We calculated electrostatic surface potentials by the Chimera Coulombic Electrostatics Plugin, surface hydrophobicity according to the Kyte-Doolittle scale, and analyzed evolutionary surface conservation employing the ConSurf algorithm (179–181). Our analyses of evolutionary surface conservation are based on alignments among 30 homologues, including the ones from S. cerevisiae, C. albicans, C. elegans, A. thaliana, D. melanogaster, D. rerio, X. laevis, G. gallus, M. musculus, R. norvegicus, B. taurus, M. mulatta and H. sapiens. At eIF4A interaction Sites 1 and 2, eIF4G and DAP5 exhibit similar surface signatures in charge distributions and hydrophobicity. Our analyses thus suggest that eIF4A binding may occur at these sites in DAP5. In particular, residues at both sites are evolutionarily highly conserved in eIF4G and DAP5. In contrast, the proposed IRES interaction site features different pattern in the two proteins. The negatively charged phosphates of the nucleic acid backbone are often involved in pairing with positively charged amino acid side chains of nucleic acid-binding proteins, even small differences of the electrostatic MIF4G/HEAT1 surface potentials are thus of particular relevance (182–186). Potentially, the observed variations provide a mechanistic basis for distinguishing between different IRESs, as both proteins have been shown to bind such RNA elements via their MIF4G/HEAT1 domains (59,109,132). Most interestingly, the proposed IRES interaction site appears to be less evolutionarily conserved in DAP5 than in eIF4G. We postulate that the afore-mentioned flexible loop at this site (Loop in yeast eIF4G and Loop in DAP5, see above and Figure 4C and D) may play an important role for RNA binding mechanisms, for example by providing structural flexibility that enables recognition of different IRESs. Overall, the differences in surface features between the eIF4G and DAP5 MIF4G/HEAT1 domains, both from the vantage of electrostatics and evolutionary conservation are rather moderate. Achieving a better understanding how MIF4G/HEAT1 domains are involved in cap-dependent and IRES-dependent translation initiation and how they gain selectivity, demands further interaction studies especially with the RNA.

Interaction of W2/HEAT3 domains with eIF2β and MNK1/2

There are a number of occurrences of W2/HEAT3 domains among proteins involved in translation initiation and they mediate a multitude of interactions. The W2/HEAT3 domains of eIF4G and DAP5 bind to the MNK1/2 kinase while those of eIF5, eIF2Bϵ, and DAP5 mediate binding to eIF2β (Figure 3B). Multiple studies have suggested that the interaction of W2/HEAT3 with MNK1/2 is dictated by two specific and highly conserved motifs at the C-terminus of W2/HEAT3, which are composed of aromatic and acidic residues (AA box 1 and AA box 2) (Figure 5A) (68,187); AA box 2 is particularly important for functional interaction. MNK1/2 features a lysine/arginine-rich (KR) stretch in its N-terminal region that is proposed to mediate the interaction (Figure 5A) (188). This N-terminal segment of MNK1/2 is predicted to be unstructured and is distant from the catalytic domain, the nuclear export signal and the MAPK binding site, which are all located towards the C-terminus of MNK1/2. It is intriguing that a basic unstructured region serves as anchor to bind to the AA boxes on W2/HEAT3. Interestingly, the AA boxes are involved in binding of W2/HEAT3 to eIF2β as well, as shown by X-ray crystallography and nuclear magnetic resonance (NMR) binding studies (Figure 5A) (141,189). In its intrinsically disordered N-terminal tail, eIF2β features three lysine-rich sites (referred to as K-boxes; K1, K2 and K3), of which the K2 and K3 are suggested to have greater contribution to recognition of the AA boxes on W2/HEAT3. All three lysine motifs appear to be functionally important as demonstrated by in vivo studies, while K2 was reported to have a stronger contribution to binding to eIF5 in vitro (82,190). The K3 motif is sandwiched by segments that fold into α-helices (referred to as α-HN and α-HC), when bound to eIF5 W2/HEAT3 (Figure 5A) (189).

Figure 5.

Interactions of eIF4G, DAP5, eIF5 and eIF2Bϵ W2/HEAT3 domains with eIF2β and MNK. (A) In the C-terminal region of W2/HEAT3, two sites composed of aromatic and acidic residues (AA box 1 (blue) and AA box 2 (green)) are proposed to mediate interactions to eIF2β and MNK stretches rich in lysine (K2 (yellow) and K3 (orange) motifs) and lysine/arginine (KR) residues, respectively. K1–K3 are located in the intrinsically disordered eIF2β N-terminal tail (NTT, residues 1–192). When bound to eIF5 W2/HEAT3, the sequences N-terminal and C-terminal of the K3 motif fold into α-helices (α-HN (dark green) and α-HC (dark purple), respectively). In MNK, the KR stretch is close to the N-terminus; the catalytic domain, nuclear export signal (NES) and the MAPK binding site are located towards the MNK C-terminus. (B) AA box 1 and box 2 are shown in blue and green, respectively, for W2/HEAT3 domains of eIF4G (PDB code 1ug3), DAP5 (PDB code 3dm3), eIF5 (PDB code 2iu1) and eIF2Bϵ (PDB code 3jui). A structure of AA box 2 (shown in ribbon representation) is only fully resolved in the eIF5 crystal structure, where it adopts an α-helix at the N-terminus of AA box 2 and an extended loop. A second α-helix at the very C-terminus is not part of AA box 2, and the corresponding sequence is only present in metazoan eIF5. No electron density is observed for eIF4G AA box 2, and only partial structural information (for the N-terminal α-helix) is available for AA box 2 in DAP5 and eIF2Bϵ. Interaction sites with eIF2β K2 and K3 motifs, and the surrounding α-helices α-HN and α-HC in eIF2β are indicated by color-coded (according to (A)) surface representations on the W2/HEAT3 domain structures; experimental evidence at high resolution is only available for the interaction interfaces of eIF5 and eIF2β. (C) Structural surface characteristics of the W2/HEAT3 domains of eIF4G (PDB code 1ug3), DAP5 (PDB code 3dm3), eIF5 (PDB code 2iu1) and eIF2Bϵ (PDB code 3jui). The electrostatic surface potentials, the surface Kyte-Doolittle hydrophobicity, and evolutionary conservation of surface residues are shown. In these analyses, the loop and C-terminal α-helix in human eIF5 (depicted in ribbon representation) are not included as they are highly flexible in solution (191). All structures are shown in the same orientations as in (B). Molecular structures resolving interactions of either eIF4G, DAP5, eIF5 or eIF2Bϵ W2/HEAT3 with eIF2β or MNK1/2 are not available in databases. In the case of eIF4G–MNK1/2 binding, the KR region interacts predominantly with AA box 2, in particular with the intrinsically disordered portion of it (Arthanari, unpublished data). In a recent study, the yeast eIF2β−NTT K2-K3 region has been co-crystallized with eIF5 W2/HEAT3 (189); however, atomic coordinates have not been deposited in the Protein Data Bank. In this complex, K3 forms a loop contacting the eIF5 W2/HEAT3 domain via interactions of lysine side chains in K3 and residues in AA box 1 (Figure 5B), whereas K2 is disordered or dynamic as indicated by missing electron density in this study. Even in the folded K3 motif, only a subset of the lysine side chains is resolved, suggesting they remain dynamic while still in the vicinity of the negatively charged AA boxes on the W2/HEAT3 surface. The strength of electrostatic interactions is dependent on the first power of distance, compared to the third or higher power of distance in other types of interactions; therefore, such dynamic contacts could be entropically favorable. Remarkably, the crystal structure of the eIF2β-NTT:eIF5 W2/HEAT3 complex shows that the eIF2β segments surrounding K3 fold into the two α-helices α-HN and α-HC, and play a major role by contributing to the binding free energy through hydrophobic interactions (189). In fact, α-HN and α-HC position K3 (and possibly K2) in the vicinity of the AA boxes on the eIF5 surface. These α-helical segments are as conserved as the K motifs but have not received as much attention so far. While eIF2β K2 was not resolved in the crystal structure, the authors proposed that K2 could be interacting with the negatively charged surface formed by AA boxes 1 and 2 of eIF5 (189). In good agreement with this study, mutational analysis (82,83) and NMR studies (141,191) have shown that AA box 2 plays a central role in the eIF5–eIF2β interaction. In mammals, phosphorylation of the AA box 2 in eIF5 by Casein Kinase 2 (CK2) promotes protein synthesis and cell proliferation (192), mediated by increasing the eIF5 affinity for eIF2β (191). Recently, the complex of eIF2 and eIF2B has been studied by cryo-electron microscopy, but the eIF2β−NTT could not be resolved (125,140). Based on the crystallography and NMR results, we charted the interaction sites for eIF2β binding through K2, K3, and the adjoining α-helices α-HN and α-HC onto the surface of the DAP5, eIF5 and eIF2Bϵ W2/HEAT3 domains (Figure 5B) (eIF4G does not interact with eIF2β, see Figure 3B). This highlights involvement of W2/HEAT3 AA box 1 residues in binding of the K3 motif and to a smaller extent of K2, α-HN and α-HC. Presumably, AA box 2 is mainly involved in an interaction with K2. It is worth noting that experimental evidence for eIF2β−NTT binding sites is available only for eIF5; however, a number of W2/HEAT3 residues at these sites are conserved among DAP5, eIF5, and eIF2Bϵ. Among the crystal structures available for the eIF4G, DAP5, eIF5, and eIF2Bϵ W2/HEAT3 domains, AA box 2, located at the very C-terminus, is visible only in the case of eIF5 (Figure 5B) (84). AA box 2 establishes an extended loop that folds back on the eIF5 W2/HEAT3 surface. However, NMR data indicate that even in eIF5, AA box 2 is disordered in solution and only dynamically contacts the W2/HEAT3 surface (191). Combined with the missing electron density in the other three W2/HEAT3 domain structures, this suggests high flexibility of this region in AA box 2. The short α-helix of AA box 2 resolved in DAP5, eIF5, and eIF2Bϵ W2/HEAT3 overlaps well with the proposed eIF2β K2 motif binding site and participates in the binding interface to α-HN, whereas, as described above, eIF2β K3 contacts a portion of AA box 1 (189). To evaluate if surface characteristics of the W2/HEAT3 domains help to understand why eIF4G W2/HEAT3 does not bind eIF2β, why eIF5 and eIF2Bϵ do not bind MNK1/2, and why DAP5 W2/HEAT3 binds both eIF2β and MNK1/2, we analyzed again the surface electrostatic potentials, hydrophobicity, and evolutionary conservation of the four W2/HEAT3 domains as we have done earlier for the case of MIF4G/HEAT1 (Figure 5C). First, AA box 1 constitutes a pronounced negatively charged area in all four W2/HEAT3 domains; however, to a lesser extent in the case of eIF2Bϵ. This may be compensated by negatively charged residues in AA box 2, which is only partially resolved in eIF2Bϵ. However, we did not include AA box 2 in surface characteristics analyses and comparison among the proteins since electron density for this region is only observed for eIF5 (a large portion of the eIF5 AA box 2 surface is negatively charged). No major variations in hydrophobicity are observed between the four W2/HEAT3 domain surfaces. As expected, the α-HN and α-HC binding sites in DAP5, eIF5, and eIF2Bϵ feature hydrophobic residues. These residues might be functionally relevant as protein–protein interactions are often mainly driven by hydrophobic interactions (185). The distinctive electrostatic signature in AA box 1 is clearly noticeable, and a significant part of it is well-conserved in all four proteins (Figure 5C). A second negative patch on eIF4G W2/HEAT3 is conserved as well, and the W2/HEAT3 domains of DAP5 and eIF2Bϵ have additional conserved sites. In the case of DAP5, it is rather positively charged, and in eIF2Bϵ this second conserved area is of rather hydrophobic nature, consistent with it being the contact surface for eIF2γ (139,140). While the AA boxes are well-conserved among eIF4G, DAP5, eIF5, and eIF2Bϵ, other surface areas are not as universally conserved. Therefore, a likely reason for why eIF4G does not interact with eIF2β is that the surface area, which in eIF5 used to engage the α-helices surrounding eIF2β-K3, is not conserved between eIF5 and eIF4G. This idea is supported by the observation that eIF4G binding to MNK1/2 involves predominantly the AA box 2 (188). The C-terminal AA box 2 may only transiently fold back on W2/HEAT3 (as seen in eIF5) and may populate multiple conformational states of which each can promote binding to specific interaction partners (191). Future investigations are required to decipher if this transient nature is key for how the eIF4G, DAP5, eIF5 and eIF2Bϵ W2/HEAT3 domains selectively interact with their binding partners. Here, NMR can be used to identify transient and minor conformations. Especially as AA box 2 has been suggested to be mainly responsible for eIF4G/DAP5–MNK1/2 binding (188), analyses of structural plasticity, including detailed studies of the interactions, are expected to provide important insight into molecular mechanisms of W2/HEAT3 domains in translation initiation.

OPEN QUESTIONS AND CONCLUDING THOUGHTS

Though HEAT domains are known to orchestrate a diverse array of cellular functions, the abundance of HEAT domains in translation initiation is striking. As discussed above, the HEAT domains of translation initiation factors can be classified into three distinct categories, MIF4G/HEAT1, MA3/HEAT2 and W2/HEAT3, based on a combination of structural, sequence, and functional properties. By analyzing four specific initiation factors of eukaryotic translation, eIF4G, DAP5, eIF5 and eIF2Bϵ, we highlight the importance of these HEAT domains in translation initiation. Even though a structure comparison provides only limited insight, investigating structural signatures through surface analyses — electrostatic potential, hydrophobicity, evolutionary conservation, and binding studies — reveals some potentially important differences among the HEAT domains of these four proteins. As HEAT domains have high structural but low sequence similarities, variations in biochemical properties, in particular in flexible regions, may conceptualize basic principles to mediate the multitude of protein–protein and protein–nucleic acid interactions. The large amount of work on HEAT domains accomplished since their discovery in 1995 impressively revealed their fundamental role in cellular functions including protein synthesis (1,3,17,43,58). Despite a large body of available structural information and biophysical and interaction data on the HEAT domains, there are several open questions. To further understand protein–protein interactions involving eIF4G, DAP5, eIF5 and eIF2Bϵ mediated by their HEAT domains, we suggest that detailed studies of flexible regions, i.e. Loop in MIF4G/HEAT1 and AA box 2 in W2/HEAT3, have to be performed. Of the three types of HEAT domains, the AA boxes are exclusive to the W2/HEAT3 domain. These AA boxes often engage in interactions with intrinsically disordered regions; in fact, W2/HEAT3 domains often interact with IDRs containing basic residues. In addition, high-resolution studies of MIF4G/HEAT1–RNA interactions are required to better understand non-canonical translation initiation mechanisms. A molecular structure of a viral IRES and interaction studies with the MIF4G/HEAT1 domain of eIF4G indicate binding mediated by π–π stacking involving adenosine and aromatic amino acid side chains (59). Are there specific RNA structures with common characteristics that are recognized by HEAT domains? Towards establishing general concepts of how HEAT domains bind RNA, more of such interaction studies at atomic resolution are needed. Most of the reported interactions involving the HEAT domains in translation initiation are in the low micromolar range, including binding of MIF4G/HEAT1 domains to viral components such as the encephalomyocarditis virus IRES or the murine norovirus VPg protein, which contribute to regulation of translation in the context of viral replication (Table 1). This begs the question whether low affinity interaction is a distinct feature of HEAT domains involved in translation initiation which requires transient, yet critical interaction, or whether the interactions measured in isolated contexts do not capture the actual binding affinity. In case of interactions involving IDRs, it has been shown for other systems that avidity-based interactions increase the binding affinity, and it is well conceivable that using full-length constructs and additional components of the translation initiation machinery are necessary to observe more physiologically relevant affinities.

Table 1.

HEAT domains of translation initiation factors typically interact in the low μM affinity range with binding partners

Domain	Protein	Interaction partner	K_d [μM]	Method	Reference
MIF4G/HEAT1	eIF4GI	EMCV IRES (RNA)	1.3	ITC	(59)
	eIF4GI	eIF4A	0.14	ITC	(127)
	DAP5	eIF4A	1.1	ITC	(127)
	eIF4G	eIF4A	12.1	SPR	(61)
	eIF4GII	MNV VPg (1–124)	5.2	ITC	(60)
	eIF4GI	MNV VPg (104–124)	3.0	FA	(60)
	DAP5	MNV VPg (104–124)	20.0	FA	(60)
MA3/HEAT2	eIF4GI	eIF4A	1.1	SPR	(61)
	eIF4GI	eIF4A	0.9	ITC	(61)
W2/HEAT3	eIF5	eIF2β-K2K3	∼ 4	ITC	(141)
	eIF5	eIF2β-NTD	∼ 17	ITC	(141)

For the three classes MIF4G/HEAT1, MA3/HEAT2 and W2/HEAT3, affinities for a number of interactions with binding partners have been determined by isothermal titration calorimetry (ITC), surface plasmon resonance (SPR) and fluorescence anisotropy (FA). EMCV = Encephalomyocarditis Virus; MNV = Murine Norovirus; VPg = Virus Protein, genome-linked; NTD = N-terminal domain.

HEAT domains of translation initiation factors typically interact in the low μM affinity range with binding partners For the three classes MIF4G/HEAT1, MA3/HEAT2 and W2/HEAT3, affinities for a number of interactions with binding partners have been determined by isothermal titration calorimetry (ITC), surface plasmon resonance (SPR) and fluorescence anisotropy (FA). EMCV = Encephalomyocarditis Virus; MNV = Murine Norovirus; VPg = Virus Protein, genome-linked; NTD = N-terminal domain. Beyond the idea of analyzing differences in HEAT domains to identify molecular mechanisms, more work is needed to further understand the remarkable functional versatility of HEAT domains and the proteins in which they are encountered. Specifically, it is intriguing that several HEAT domain-containing proteins feature IDRs around these domains. As suggested by Yoshimura and Hirano, proteins with HEAT domains can be classified into three groups: one, ‘HEAT only’, two, ‘HEAT with IDRs’, and three, ‘HEAT with other functional domain(s)’ (2). Following this idea, it is unconditionally required to study the impact of IDRs on the various functions of HEAT domain-containing proteins in the second group of this classification, which includes eIF4G, DAP5, eIF5, and eIF2Bϵ. As we show, short, disordered regions flanking the MIF4G/HEAT1 and W2/HEAT3 domains are functionally highly relevant in these four proteins. To further understand their complex protein–protein and protein–nucleic acid interactions, we think comprehensive studies of the extended IDRs that link HEAT domains are required. Given the remarkable structural flexibility HEAT domains exhibit, the even more flexible IDRs may establish an additional layer of complexity to accomplish protein function and control of the respective cellular processes (193–195). For example, these IDRs could mediate secondary interactions, especially with RNA, and potentially play a role as spacer providing separation between docking modules and functional sites. These would include avidity-based interactions driven by binding of the structured part of the HEAT domain and the IDRs would provide additional engagement. These IDRs harbor several phosphorylation sites and the contribution to the binding affinity by the IDRs can be further modulated by phosphorylation events. This is especially significant in the case of HEAT domain interactions with RNA, where electrostatics play an important role. IDRs could also act as conduits connecting multiple proteins and bringing proteins and RNA together, which is a common theme in translation initiation. IDRs are also known to undergo disorder-order transitions upon engaging an interaction partner, another modality that a scaffold protein such as the HEAT domain proteins in translation initiation can employ to nucleate interaction hubs. The role of IDRs in the HEAT domain-containing translation proteins demands further studies. Evolutionary sequence conservation correlation can provide good starting points for such investigation. Employing the powerful methods that have been established in recent years to study IDRs, most importantly NMR spectroscopy, small angle X-ray scattering and (single molecule) fluorescence spectroscopy techniques, could shed light onto structural determinants of the IDR-mediated interactions (196–199). With the recent developments in de novo structure prediction/determination, we were curious to find any possible interaction between HEAT domains and between the IDRs and the HEAT domain in the structures of full-length proteins. We analyzed the structures of full-length eIF4G, DAP5, eIF5 and eIF2Be predicted by the recently introduced artificial intelligence-based program AlphaFold (200,201). For the above proteins, there was no additional information or novel interactions in the AlphaFold structures that can be gleaned with high confidence. This does not mean that there are no interactions, especially considering the fact that some of the interactions between the IDRs and structured domain, though vital, are weak in terms of interaction thermodynamics but are often stabilized by avidity and transient in nature. It is well-known that IDRs sample a large conformational space contradicting the notion of a single globular structure of these proteins. Thus, the question of the interplay between IDRs and HEAT domains still remains at large. Recently, IDRs have been implicated in activity-dependent translation, especially in neurons facilitating synaptic plasticity and long-term memory (202). Here, the mRNA is maintained in a translationally dormant form in membrane-less protein assemblies known as neuronal granules formed by a phenomenon referred to as liquid-liquid phase separation. This phase separation is driven by the C-terminal IDR of the RNA-binding protein Fragile X Mental Retardation Protein (FMRP), abundantly found in neuronal granules (202). Post-translational modifications seem to regulate the IDR-mediated phase separation, thus enabling translation in response to a stimulus. The IDRs sandwiching the HEAT domains could be involved in similar mechanisms providing an additional regulatory mechanism to on-demand translation of select mRNAs. Phase separation has also been linked to cation–π interactions, which may play a role in recognition of RNAs by IDRs of HEAT domain-containing proteins in addition to π–π interactions (203). To better understand the structural and mechanistic underpinnings of HEAT domain-containing protein function, focus should further reside in investigating interactions and macromolecular complexes both with proteins and nucleic acids. Understanding structural plasticity and/or dynamics of these HEAT domains at atomic level and the interplay with post-translational modifications (for the sites found in IDRs flanking the HEAT domains) will lead to a better understanding of the multitude of functions orchestrated by the HEAT domains. It is established that phosphorylation can critically regulate the disorder–structure relationship in proteins (194,204–206). As an example, studying the IDR of approximately 240 residues between the MIF4G/HEAT1 and MA3/HEAT2 domains of DAP5 seems to be a promising starting point for such investigations (Figure 3A). In this region of DAP5, four different phosphorylation sites have been identified and verified experimentally (207–209), and we expect that characterizing how these sites affect DAP5 structure and function through biophysical methods will exemplify the relevance of phosphorylation in IDRs that flank HEAT domains of eukaryotic translation initiation factors. Though HEAT domains are structurally similar, through their meticulously evolved sequence composition and structural plasticity they function as swiss army knife in translation initiation, forging multiple specific interactions. Even though binding affinities of these interactions are moderate, in the low micromolar range, they are critical in control of translation initiation, which requires several transient interactions, remodeling of the complex and frequent dissociation of proteins. A well-documented example of such an interaction is the binding of MIF4G/HEAT1 domains with eIF4A and RNA as discussed above. Thus, some of the protein–protein interactions involving HEAT domains represent promising therapeutic targets.

DATA AVAILABILITY

All structures that were analyzed in this article can be accessed via the Protein Data Bank. Relevant PDB IDs are mentioned in the respective figure legends. Any data from our analyses described in this article is available from the authors upon request.

204 in total

1. Structural insights into RNA quality control: the Ro autoantigen binds misfolded RNAs via its central cavity.

Authors: Adam J Stein; Gabriele Fuchs; Chunmei Fu; Sandra L Wolin; Karin M Reinisch
Journal: Cell Date: 2005-05-20 Impact factor: 41.582

2. DAP-5, a novel homolog of eukaryotic translation initiation factor 4G isolated as a putative modulator of gamma interferon-induced programmed cell death.

Authors: N Levy-Strumpf; L P Deiss; H Berissi; A Kimchi
Journal: Mol Cell Biol Date: 1997-03 Impact factor: 4.272

3. The translation initiation factor DAP5 promotes IRES-driven translation of p53 mRNA.

Authors: S Weingarten-Gabbay; D Khan; N Liberman; Y Yoffe; S Bialik; S Das; M Oren; A Kimchi
Journal: Oncogene Date: 2013-01-14 Impact factor: 9.867

4. Dynamic interaction network involving the conserved intrinsically disordered regions in human eIF5.

Authors: Eleanor Elise Paul; Kay Ying Lin; Nathan Gamble; Amy Wei-Lun Tsai; Simon H K Swan; Yu Yang; Matthew Doran; Assen Marintchev
Journal: Biophys Chem Date: 2021-12-10 Impact factor: 2.352

5. DAP5 associates with eIF2β and eIF4AI to promote Internal Ribosome Entry Site driven translation.

Authors: Noa Liberman; Valentina Gandin; Yuri V Svitkin; Maya David; Geneviève Virgili; Maritza Jaramillo; Martin Holcik; Bhushan Nagar; Adi Kimchi; Nahum Sonenberg
Journal: Nucleic Acids Res Date: 2015-03-16 Impact factor: 16.971

6. eIF2B promotes eIF5 dissociation from eIF2*GDP to facilitate guanine nucleotide exchange for translation initiation.

Authors: Martin D Jennings; Yu Zhou; Sarah S Mohammad-Qureshi; David Bennett; Graham D Pavitt
Journal: Genes Dev Date: 2013-12-15 Impact factor: 11.361

7. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules.

Authors: Haim Ashkenazy; Shiran Abadi; Eric Martz; Ofer Chay; Itay Mayrose; Tal Pupko; Nir Ben-Tal
Journal: Nucleic Acids Res Date: 2016-05-10 Impact factor: 16.971

8. Binding of eIF3 in complex with eIF5 and eIF1 to the 40S ribosomal subunit is accompanied by dramatic structural changes.

Authors: Jakub Zeman; Yuzuru Itoh; Zdeněk Kukačka; Michal Rosůlek; Daniel Kavan; Tomáš Kouba; Myrte E Jansen; Mahabub P Mohammad; Petr Novák; Leoš S Valášek
Journal: Nucleic Acids Res Date: 2019-09-05 Impact factor: 16.971

9. eIF2β is critical for eIF5-mediated GDP-dissociation inhibitor activity and translational control.

Authors: Martin D Jennings; Christopher J Kershaw; Christopher White; Danielle Hoyle; Jonathan P Richardson; Joseph L Costello; Ian J Donaldson; Yu Zhou; Graham D Pavitt
Journal: Nucleic Acids Res Date: 2016-07-25 Impact factor: 16.971

Review 10. How Do We Study the Dynamic Structure of Unstructured Proteins: A Case Study on Nopp140 as an Example of a Large, Intrinsically Disordered Protein.

Authors: Jung-Hyun Na; Won-Kyu Lee; Yeon Gyu Yu
Journal: Int J Mol Sci Date: 2018-01-27 Impact factor: 5.923