Literature DB >> 26779403

Why is start codon selection so precise in eukaryotes?

Katsura Asano1.   

Abstract

Translation generally initiates with the AUG codon. While initiation at GUG and UUG is permitted in prokaryotes (Archaea and Bacteria), cases of CUG initiation were recently reported in human cells. The varying stringency in translation initiation between eukaryotic and prokaryotic domains largely stems from a fundamental problem for the ribosome in recognizing a codon at the peptidyl-tRNA binding site. Initiation factors specific to each domain of life evolved to confer stringent initiation by the ribosome. The mechanistic basis for high accuracy in eukaryotic initiation is described based on recent findings concerning the role of the multifactor complex (MFC) in this process. Also discussed are whether non-AUG initiation plays any role in translational control and whether start codon accuracy is regulated in eukaryotes.

Entities:  

Keywords:  archaeal translation; bacterial translation; eukaryotic translation; initiation factors; multifactor complex; non-AUG initiation; ribosome; start codon fidelity; start codon selection; translation initiation

Year:  2014        PMID: 26779403      PMCID: PMC4705826          DOI: 10.4161/trla.28387

Source DB:  PubMed          Journal:  Translation (Austin)        ISSN: 2169-0731


Translation or mRNA-dependent protein synthesis is a crucial process of gene expression that determines the abundance of the cellular proteome. The ribosome that catalyzes this reaction has three tRNA-binding sites, A-site, P-site and E-site, for aminoacyl-, peptidyl- and exit tRNA-binding sites, respectively. During translation initiation, the ribosome binds initiator methionyl-tRNA (Met-tRNAi) and mRNA with the initiation codon (start codon) base-paired to the anticodon of the former in the P-site. During the elongation phase, the ribosome binds aminoacyl-tRNA matching the next codon at the A-site, transfers the methionyl or peptidyl moiety of the P-site tRNA to the N-terminus of the aminoacyl-tRNA in the A-site, and then translocates the A-site tRNA carrying the elongated polypeptide to the P-site, releasing the deacylated tRNA from the E-site. The fidelity of protein translation is very high during the elongation phase, because the ribosome possesses special mechanisms to achieve this.- Among them is the stabilization of codon:anticodon pairing through two universally conserved adenine residues (A-1492 and A-1493 in E. coli) of the small subunit (SSU) rRNA as well as other rRNA residues and ribosomal protein side chains. However, the fidelity of translation initiation varies widely between different organisms. While eukaryotic translation generally starts with the AUG codon, prokaryotic translation permits frequent GUG and UUG initiation besides AUG. In E. coli (Gram negative), AUG, GUG and UUG start translation of 83, 14 and 3% of proteins encoded by the genome, respectively. In B. subtilis (Gram positive), these codons start translation of 78, 9 and 13% of proteins encoded by the genome. Archaea display similar levels of UUG and GUG initiation. This inaccuracy largely stems from a fundamental problem for the ribosome to precisely recognize codon:anticodon pairing at the P-site, where the initiator methionyl-tRNA is bound. This review provides an overview concerning how initiation factors specific to each domain of life evolved to confer stringent initiation by the ribosome with varying accuracies. The mechanistic basis for high accuracy in eukaryotic initiation is described based on recent findings concerning the role of the Eukarya-specific translation initiation multifactor complex (MFC) in this process.- Inspired by recent reports of non-AUG initiation found in the 5`-untranslated region (UTR) of many eukaryotic genes,, conceptual frameworks will be provided to understand how the set of start codons is chosen for each domain of life and how initiation accuracy can be regulated in eukaryotes, thereby changing the cellular proteome translation.

Start Codon Selection in Bacteria

Although bacterial initiation permits initiation from UUG and GUG in addition to AUG, a simpler initiation mechanism operates to achieve this level of fidelity, distinguishing these three codons from other codons. There are examples of complex translational control mechanisms involving GUG codons, raising the possibility that the GUG start codon is maintained to achieve efficient translational control mechanisms.

IF3 is the stand-alone fidelity factor in bacterial translation initiation

As shown in Table 1, there are three bacterial initiation factors, IF1, IF2 and IF3. After ribosome dissociation into the 30S small subunit (SSU) and 50S large subunit (LSU), the former binds GTP-bound IF2 and IF3, priming the subsequent loading of IF1 and formyl-methionyl initiator tRNA (fMet-tRNAifMet) onto the SSU A-site and P-site, respectively. mRNA binds to the SSU through base-pairing between the Shine-Dalgarno (SD) sequence of the mRNA located 5′ of the start codon and the anti-SD sequence located at the 3′-end of the 16S SSU rRNA, forming a 30S pre-initiation complex (PIC).- The rate of mRNA binding to the PIC varies substantially depending on the complementarity of the SD sequence to the rRNA anti-SD sequence and the stability of inhibitory secondary structures in the mRNA. Prior to start codon selection, IF3 binds to the subunit-interface side of the SSU, thereby preventing re-association of the SSU with LSU (anti-association function). However, the start codon base-pairing to the tRNAifMet anticodon triggers transitioning of the 30S PIC into the 30S initiation complex (IC). This is accompanied by stabilization of fMet-tRNAfMet, mRNA, IF2 and IF1, and strong destabilization of IF3. The 30S IC then joins the 50S LSU., After subunit joining, the LSU stimulates the GTP hydrolysis for IF2, promoting the release of the remaining IFs. As the consequence, the 70S IC forms with tRNAifMet bound to the start codon in the P-site, which is ready for the polypeptide elongation phase.

Table 1. Translation initiation factors found in bacteria, archaea, and eukaryotes#

BacteriaArchaeaEukaryotes# of subunits (name)Function
IF11aIF1A1eIF1A11Binds SSU A site, blocking misloading of Met-tRNAi/fMet-tRNAf
IF21aIF5B1eIF5B11Binds GTP. GTP-bound form interacts with tRNAi acceptor stem and the SSU subunit interface to mediate the LSU joining. GTP hydrolysis promoted by LSU factor binding center.
IF321Anti-subunit re-association. Monitors correct tRNAf pairing with start codon. Bacteria-specific
aIF13eIF1*41Anti-subunit reassociation. Monitors correct tRNAi pairing with start codon
aIF23eIF2*43 (α, β, γ)Binds GTP and Met-tRNAi. GTP hydrolysis before start codon recognition. This is promoted by eIF5 in eukaryotes.
eIF5*41Activates eIF2 GTPase (GAP) to control start codon recognition. Binds eIF1, eIF2 and eIF3
eIF3*4Yeast, 6 (a-c, g, i, j)Mammals, 13 (a-m)Scaffold of assembly, to bind eIF1, eIF5, eIF4F and the 40S SSU
eIF2B5 (α−ε)Guanine nucleotide exchange factor (GEF) for eIF2
eIF4F3 (eIF4E, eIF4G, eIF4A)Binds mRNA at its 5′ m7G cap structure and at its poly(A) tail. Contains eIF4A.
eIF4A1ATP-dependent mRNA helicase
eIF4B1Co-activator of eIF4A

Universally conserved factors and domain-specific stringency factors (Bacteria, Archaea, and Eukaryotes) are indicated. * signifies a part of the multi-initiation factor complex (MFC)

Universally conserved factors and domain-specific stringency factors (Bacteria, Archaea, and Eukaryotes) are indicated. * signifies a part of the multi-initiation factor complex (MFC) Base-pairing between the start codon and the tRNAifMet anticodon serves as a crucial checkpoint for the translation initiation reaction. IF3 plays a central role in this process. IF3 matures the 30S PIC into the 30S IC, only when the anticodon pairs with AUG, GUG or UUG. When it pairs with other codons, IF3 is believed to promote dissociation of the 30S PIC. The precise mechanism by which IF3 discriminates against codons other than AUG, GUG and UUG is currently unknown. GUG and UUG start codons are recognized by the fMet-tRNAifMet anticodon (5′-CAU-3′) through “wobble” base-pairing at the 1st position of the start codon (NUG). A likely explanation for why Bacteria allow initiation from these codons may concern a unique feature in the anticodon loop of fMet-tRNAifMet. With a single exception, all bacterial and eukaryotic tRNAs that recognize codons with A in the 1st position (ANN codons) have N6-threonylcarbamoyladenosine (t6A) located 3′ of the anticodon U., This modification has been proposed to enhance the decoding capacity of ANN codons at the ribosomal A-site through strong stacking interactions imposed by this bulky hydrophobic modification. The unique exception among the ANN-decoding tRNAs is bacterial initiator tRNA, fMet-tRNAifMet, which lacks this modification. The paucity of this modification is believed to explain, at least in part, why “wobble” in decoding the 1st position of the start codon can occur at moderately high frequency in Bacteria. It would be intriguing to understand how the decreased initiation accuracy has evolved in Bacteria in light of the molecular mechanism by which protein translation factors, fMet-tRNAifMet and the ribosome cooperate to specify a start codon in a different atomic environment at the P-site.

The role for non-AUG initiation in bacterial translational control

The classical example of the use of a non-AUG start codon in bacterial translational control is that of the AUU start codon in IF3 mRNA translation. When the IF3 abundance decreases, the cellular initiation stringency goes down, thereby allowing AUU initiation of IF3 translation. The fact that the frequency of AUU initiation is increased when the IF3 level is low provides strong (genetic) evidence for the central role that IF3 plays in initiation fidelity. Although UUG and GUG codons are considered to be normal start codons in bacteria, these sequences are viewed as weaker initiation signals., Provided that certain mechanistic constraints including the lack of t6A modification in fMet-tRNAifMet explain the translation initiation from these codons, is there any selective advantage associated with starting translation from them? A hint for answering this question might reside in their property that, due to being an intrinsically weak initiation signal, these codons can be completely masked by a relatively weak secondary structure when accompanied with a weak SD sequence. The GUG codon is particularly suited for regulation by secondary structures, because two G:C pairs (rather than one G:C pair in the case of AUG and UUG) can directly mask it. Since Bacteria lack mRNA helicases that act on translation start sites, the rate of PIC binding by a given mRNA is directly proportional to the fraction of the mRNA with the unstructured (vs. structured or masked) SD sequence. Thus, translation of bacterial mRNA can be regulated in a wide range – up to 10,000 fold – through folding and unfolding of the secondary structure that masks translation initiation signals. The quantitative and theoretical framework of this regulation was established by de Smit and van Duin. Using a bacteriophage MS2 coat protein mRNA as a model, they demonstrated that the stability of the secondary structure masking the SD sequence (ΔGfo) linearly correlates with the log of expression from a particular start codon. A mere increase of ΔGfo by 1.4 kcal/mol increases MS2 coat gene expression by 10-fold. In other words, the disruption of a hydrogen bond (ΔGfo ~1 kcal/mol) in Watson-Crick base pairs can theoretically lead to as much as a 5-fold translational induction. This allows for a wide range of gene regulation executed at the translational level in Bacteria. However, the linear correlation between ΔGfo and the log of gene expression does not hold at higher ΔGfo for the inhibitory structure, where the SD interaction can directly disrupt it. In other words, the weak initiation signal, including the GUG codon, can be masked substantially by relatively weak structures, that are more suited for regulation by mRNA structure changes.

The GUG start codon is a crucial element in the translational control of plasmid replication

GUG start codons are found in some of the well-characterized mRNAs whose translation is regulated by dynamic structural changes.- One of the best described, instructive examples is found in the translational control of the replication initiator (Rep) protein of the IncIα plasmid Colicin Ib-P9 (ColIb-P9) and related low-copy number plasmids,- (Fig. 1). In these plasmids, rep translation initiates with the GUG start codon, which is masked by a stable secondary structure along with its weak SD sequence (Fig. 1B). Remarkably, the inhibited GUG initiation can be strongly induced by the formation of a mRNA pseudoknot, thereby opening the rep initiation signal (Fig. 1C). By the intricate mechanism described below, the plasmids can achieve Rep translational induction of as much as 3000-fold. This translational induction is crucial for plasmid transformation or transfer, and stable copy-number maintenance through the action of the antisense Inc RNA. Inc RNA was identified as a genetic element responsible for incompatibility of like plasmids and is encoded ~100-bp upstream of the Rep initiation site (Fig. 1A). Inc RNA efficiently shuts down rep translation, after the plasmid establishes itself in a new host cell or when the plasmid copy number is too high during the host cell cycles.

Figure 1. Control of GUG-initiated translation through dynamic mRNA conformational changes. (A) The replication region of ColIb-P9 plasmid. The horizontal line represents the DNA of the ColIb-P9 replication region. Boxes denote coding regions for Inc, RLP and Rep and the origin of replication (ori). The short leftward arrow below denotes Inc RNA (antisense), while the long rightward arrow denotes Rep mRNA, starting from its promoter, Prep. Green circles on the transcripts denote the GGCG (filled) and CGCC (open) sequences, which interact together during rep gene regulation. The base-pairing between these sequences in Rep mRNA forms a pseudoknot with the Inc target stem-loop, which then allows rep translation (+ sign). The CGCC sequence (green open circle) on Inc RNA binds the GGCG sequence (green filled circle) on Rep mRNA, inhibiting pseudoknot formation (- sign). Inc RNA also represses RLP translation (- sign). (B) Inhibitory secondary structure found in the translation initiation region of the rep genes in ColIb-P9. Oragne boxes, GUG start codon and SD sequence (also with asteriska). Underline, RLP stop codon. Green box, the CGCC sequence forming a pseudoknot. (C) Models for the control of ColIb-P9 rep translation by competition between pseudoknot formation and antisense RNA binding. Panels a-d denote a conformation state of Rep mRNA, with green boxes denoting the GGCG (filled box) and CGCC (open box) sequences with the potential to form a pseudoknot. Bracket indicates the Rep mRNA region complementary to Inc RNA. Orange boxes are the SD and the GUG start codon for rep. Thin arrows on the mRNA indicate the regions that get translated in each state. Black filled and open circles denote the start and stop codons of RLP.

Figure 1. Control of GUG-initiated translation through dynamic mRNA conformational changes. (A) The replication region of ColIb-P9 plasmid. The horizontal line represents the DNA of the ColIb-P9 replication region. Boxes denote coding regions for Inc, RLP and Rep and the origin of replication (ori). The short leftward arrow below denotes Inc RNA (antisense), while the long rightward arrow denotes Rep mRNA, starting from its promoter, Prep. Green circles on the transcripts denote the GGCG (filled) and CGCC (open) sequences, which interact together during rep gene regulation. The base-pairing between these sequences in Rep mRNA forms a pseudoknot with the Inc target stem-loop, which then allows rep translation (+ sign). The CGCC sequence (green open circle) on Inc RNA binds the GGCG sequence (green filled circle) on Rep mRNA, inhibiting pseudoknot formation (- sign). Inc RNA also represses RLP translation (- sign). (B) Inhibitory secondary structure found in the translation initiation region of the rep genes in ColIb-P9. Oragne boxes, GUG start codon and SD sequence (also with asteriska). Underline, RLP stop codon. Green box, the CGCC sequence forming a pseudoknot. (C) Models for the control of ColIb-P9 rep translation by competition between pseudoknot formation and antisense RNA binding. Panels a-d denote a conformation state of Rep mRNA, with green boxes denoting the GGCG (filled box) and CGCC (open box) sequences with the potential to form a pseudoknot. Bracket indicates the Rep mRNA region complementary to Inc RNA. Orange boxes are the SD and the GUG start codon for rep. Thin arrows on the mRNA indicate the regions that get translated in each state. Black filled and open circles denote the start and stop codons of RLP. An important trigger of the mRNA conformational change and subsequent Rep translation is the translation and termination of an upstream ORF (uORF) termed the rep leader peptide (RLP) (Fig. 1A). The ribosome stalled at the RLP stop codon is proposed to expose the 5′-CGCC-3′ sequence immediately upstream of the Rep SD sequence, allowing the pseudoknot to form through a long-rage base-paring with the 5′-GGCG-3′ sequence in the loop of the Inc target stem-loop (Fig. 1B and C, panel a)., The pseudoknot thus formed is able to keep the Rep initiation signal (SD and GUG, orange squares in Figure 1B and C) unfolded long enough for the stalled ribosome to bind it and re-initiate at Rep (Fig. 1C, panel b). Inc RNA initially binds the Inc target stem-loop on Rep mRNA through the loop-loop contact termed “kissing interaction” (Fig. 1C, panel c). Because this kissing interaction is competitive with pseudoknot formation, Inc RNA is able to inhibit Rep translation directly and quickly (Fig. 1C, panel c)., Subsequently, Inc RNA forms a more stable complex capable of inhibiting RLP translation through blocking the RLP SD sequence (Fig. 1C, panel d). Mutations in the Rep initiation signal, increasing the complementarity to anti-SD or changing the GUG start codon to AUG, generally make Rep expression less dependent on the pseudoknot and less repressible by Inc.,- Therefore, the weak Rep initiation signal, including the GUG start codon, is an essential component of the sophisticated translational control mechanism involving dynamic RNA structural changes. Plasmids play a very important role in the bacterial life cycle, by providing sexual gene transfer, multi-drug resistance, and bacterial toxins including colicins. If the GUG start codon is a crucial element for replication control of certain plasmids, it is entirely possible that the bacterial hosts receive a selective pressure to maintain GUG initiation for their own survival or fitness. Since nearly ~10% of bacterial proteins initiate from GUG, it also is anticipated to find cases of GUG-initiated bacterially coded genes regulated by complex mRNA structural changes, for example, among those regulated by riboswitches.

Start Codon Selection in Archaea

Archaea form a group (domain) of prokaryotes, previously called archaeabacteria. Phylogenetic analysis of SSU rRNA sequences established that Archaea are more closely related to eukaryotes (Eukarya) than to Bacteria (previously called eubacteria) (three-domain classification). In agreement with this idea, archaeal translation initiation factors are more similar to eukaryotic factors than to bacterial factors, as described below. However, archaeal initiation is less stringent than eukaryotic initiation, permitting UUG and GUG initiation besides AUG. This fact suggests that the complexity of the eukaryotic initiation system at least in part evolved to confer stringent AUG initiation.

aIF1 and aIF2 evolved to confer initiation accuracy in Archaea

Archaea contain four initiation factors, aIF1, aIF1A, aIF2, and aIF5B (Table 1). Of these, aIF1A and aIF5B are the orthologs of bacterial IF1 and IF2, respectively, being involved in Met-tRNAiMet loading to the P-site and the 50S LSU joining. aIF1 and aIF2 are homologous to eIF1 and eIF2 found in eukaryotes. In the crenarchaeon Sulfolobus solfataricus, aIF1A and aIF1 bind the 30S SSU, preparing it for binding aIF2-GTP. Met-tRNAiMet then binds to the 30S:aIF1A:aIF1:aIF2 complex through aIF2. Archaeal mRNAs fall into two distinct classes: leaderless mRNAs starting immediately from the start codon and bacterial-type (and often polycistronic) mRNAs with typical SD sequences.,, Bacterial-type mRNAs bind the 30S SSU, depending on the SD:Anti-SD interactions. In the absence of aIF1, the 30S:aIF1A:aIF2:Met-tRNAiMet complex is able to bind both AUG and AUU model mRNAs. But it binds only AUG mRNA in its presence, demonstrating the ability of aIF1 to discriminate against AUU. It is unclear whether GTP hydrolysis for aIF2 is coupled to start codon selection by the archaeal 30S PIC. However, after the 30S initiation complex is formed, aIF5B most likely promotes subunit joining, as proposed previously, because aIF5B is demonstrated to be a ribosome-dependent GTPase and binds the 30S SSU competitively with aIF2. Leaderless mRNA binds the 30S SSU, depending on Met-tRNAiMet bound to the 30S SSU, similar to bacterial leaderless mRNAs. Other than this, the precise pathway for leaderless mRNA translation in Archaea has not been determined. IF3 and aIF1 appear to bind to similar locations in the SSU decoding site, but they are distinct proteins with different folds and domain organizations. Furthermore, the GTP-dependent tRNAi-binding factors, IF2/aIF5B and aIF2, are evolutionarily distinct proteins, because IF2/aIF5B is an ancient translational GTPase superfamily protein that belongs to neither the EF-G or the EF-Tu family. In contrast, aIF2γ (the GTP and tRNAi-binding subunit) clearly belongs to the EF-Tu family., Thus, aIF1 and aIF2 can be considered as evolutionarily independent entities to confer accuracy of initiation in this group of life (Table 1).

Evolution of non-AUG initiation in Archaea

It is unclear if archaeal Met-tRNAiMet carries the t6A modification in the anticodon loop that is suited for precisely decoding ANN codons during the elongation phase. However, genomics analyses indicate that Archaea permit frequent initiation from UUG and GUG codons,,, suggesting a decreased accuracy of translation initiation. Since Archaea contain aIF1, whose eukaryotic homolog, eIF1, plays the central role in discriminating against non-AUG codons including UUG and GUG (see below), it seems clear that the interaction between aIF1 and the SSU decoding site alone is not sufficient to confer the level of accuracy found in eukaryotic initiation. It remains to be determined whether the GUG or UUG start codon has any selective advantage toward archaeal translational control, as suggested in some bacterial translational control systems with GUG codons. Because translation of leaderless mRNAs more strongly depends on the start codon: tRNAiMet anticodon interaction, it would be important to determine whether AUG initiation (with complete matching to the anticodon) is favored in leaderless mRNA translation over UUG or GUG initiation. It would be intriguing to learn whether the initiation stringency operates equally well on both types of archaeal mRNAs.

Start Codon Selection in Eukaryotes

The eukaryotic 18S rRNA in the SSU has lost the anti-SD sequence., Translation generally initiates at the 5′-proximal AUG, irrespective of the length or sequence of the 5′-UTR, implying selection by a process of scanning from the 5′-end. The efficiency of recognition of the 5′-proximal AUG is influenced by a Kozak consensus sequence ([A/G]xxAUGG) in mammals or a similar initiation context (AA[A/G]AUG) in fungi including yeast. If the context of the 5′-proximal AUG is poor, those ribosomes, which fail to initiate at this site, continue scanning and initiate at the next AUG (leaky scanning). Despite the stringent AUG selection mechanism even discriminating against AUG in a poor context, it has been known that the translation of metazoan translational regulator protein, p97/NAT1/DAP5, or the longer form of mammalian oncogenic transcription factor c-Myc starts from the GUG and CUG codons, respectively (reviewed in Ref). Recent ribosome profiling studies also suggest that translation of as much as ~30% of genes may start from non-AUG codons in mouse embryonic stem cells. In plants, in-frame non-AUG start codons add mitochondria or chloroplast localization signals to change the protein’s location. Likewise in humans, an in-frame CUG codon adds an ER signal sequence to the phosphatase PTEN, allowing it to be secreted outside the cell. After briefly overviewing the eukaryotic initiation pathway, here I describe the mechanism by which eukaryotic translation initiates strictly from the AUG codon, focusing on the role of the translation initiation multifactor complex (MFC) in this process. Then I discuss whether, or if so how, alterations in MFC formation or its activity in stringent initiation can affect cellular proteome translation.

The MFC represents the evolutionarily distinct entity to confer initiation accuracy

Eukaryotic translation initiation is promoted by as many as 10 eukaryotic initiation factors (eIF)- (Table 1). Like in prokaryotes, the orthologs of IF1 and IF2, termed eIF1A and eIF5B, bind to the conserved sites in the 40S SSU, promoting Met-tRNAiMet loading to the P-site and 60S LSU joining, respectively., Of the remaining factors, four of them, eIF1, eIF2, eIF3 and eIF5 are considered as an evolutionarily independent entity to confer the accuracy of eukaryotic initiation, because (i) their mutual physical interactions allow them to form a multifactor complex (MFC)- and (ii) earlier yeast genetic data indicate that mutations altering each of the MFC components impair stringent initiation, allowing initiation from UUG- (suppressor of initiation codon mutation or Sui phenotype) (Table 1) (Also see Box 1). In addition to the MFC, the universally conserved factor, eIF1A, evolved to possess N- and C-terminal extensions, playing crucial roles in controlling PIC conformation in response to AUG recognition.,-

Is UUG initiation in yeast Sui- mutants analogous to bacterial UUG initiation?

“Interestingly, yeast is but one amino acid substitution (in one gene…..) away from being able to allow initiation at UUG codons, an interaction facilitated by IF3” – Hartz et al., 1990

In their genetic analyses, the Donahue group used five start codon mutations, AUU, ACG, CUG, ACC and GUG, as parental mutations to screen for extragenic suppressors of the initiation codon mutations (Sui). Sui- mutants isolated from each parent were able to suppress all other start codon mutations. Because the reporter gene (HIS4) was translated from the third codon UUG in these mutants in all the cases examined, the UUG codon, but not GUG or CUG, was determined to be the second optimal start codon used in the Sui- mutants., As quoted above, these beautiful data tempt us to speculate that UUG is a second “preferred” start codon for a somewhat less stringent system found in Bacteria or yeast Sui- mutants and that this property reflects the fundamental structural requirement or constraint for decoding in the P-site. However, it should be noted that nearly all the studies on non-AUG initiation in the Sui- mutants have used the HIS4 mRNA as a model. The HIS4 3rd codon UUG is the first UUG as found in the HIS4 mRNA leader region. All other potential (near-cognate) non-AUG start codons appear more than once in the HIS4 5′ UTR and most are out of frame to HIS4. Thus, if a suppressor were to be isolated and allowed HIS4 translation from the first non-AUG codon, the mutant probably had to utilize the 3rd codon UUG. Nevertheless, the frequency of in-frame GUG initiation of HIS4 was compared with that of the UUG initiation, by disrupting the sole natural GUG codon in the 5′ UTR. In a SUI4 or SUI5 mutant altering eIF2γ and eIF5, respectively, the frequency of GUG initiation was 39% and 1% compared with that of UUG initiation. Thus, UUG is still a preferred start codon even compared with GUG. In addition, the site-directed mutations altering the critical interfaces of eIF1 within MFC in the open PIC allow initiation from UUG-initiated HIS4 allele,, suggesting that, in general, the MFC impairment can cause UUG initiation by destabilizing the open PIC, as far as it is found in the HIS4 mRNA leader. More works using different reporter constructs are needed to determine whether the strong UUG initiation in the suppressor mutants has any general relevance. Of the MFC factors, eIF2 binds Met-tRNAiMet in a GTP dependent manner, thereby recruiting it to the 40S SSU in a ternary complex (TC). The C-terminal domain of eIF5 bridges eIF2 and eIF3. Thus, MFC-mediated mutual cooperativity between the 40S SSU interactions with eIF2 and eIF3 would promote the formation of the 43S PIC, which is made of the 40S SSU, eIF1A, MFC and Met-tRNAiMet 71,72 (Fig. 2).

Figure 2. Eukaryotic 48S PIC. Schematics on the bottom depict the eukaryotic 48S PIC formed at the 5` end of mRNA (horizontal line) with the AUG codon (box) ahead. The m7G cap, shown by an orange small circle, is bound by eIF4F in teal. The translation initiation multifactor complex (MFC) bridges eIF4F and the 40S SSU, the large pale green circle with while letters indicating E, P, and A-sites. Met-tRNAiMet, shown as a plug, is attached through the eIF2 component of the MFC, but not firmly linked to the P-site during scanning (Pout). Arrow indicates the direction of scanning. The 43S PIC is made of the 40S SSU, MFC, eIF1A, and Met-tRNAiMet. On top, eIF1 (pyramid) bound near the P-site blocks Met-tRNAiMet (plug) binding to the P-site together with other MFC partners, until it base-pairs to the AUG codon. The thick stopped bar indicates the physical block of the P-site formed by the presented part of the MFC. Dotted lines denote the interactions of eIF1 with other MFC partners shown by circles.

Figure 2. Eukaryotic 48S PIC. Schematics on the bottom depict the eukaryotic 48S PIC formed at the 5` end of mRNA (horizontal line) with the AUG codon (box) ahead. The m7G cap, shown by an orange small circle, is bound by eIF4F in teal. The translation initiation multifactor complex (MFC) bridges eIF4F and the 40S SSU, the large pale green circle with while letters indicating E, P, and A-sites. Met-tRNAiMet, shown as a plug, is attached through the eIF2 component of the MFC, but not firmly linked to the P-site during scanning (Pout). Arrow indicates the direction of scanning. The 43S PIC is made of the 40S SSU, MFC, eIF1A, and Met-tRNAiMet. On top, eIF1 (pyramid) bound near the P-site blocks Met-tRNAiMet (plug) binding to the P-site together with other MFC partners, until it base-pairs to the AUG codon. The thick stopped bar indicates the physical block of the P-site formed by the presented part of the MFC. Dotted lines denote the interactions of eIF1 with other MFC partners shown by circles. eIF4F binds the 5′ cap directly through its eIF4E subunit and the poly(A) tail through an interaction between its eIF4G subunit and the poly(A) binding protein (PABP). The RNA helicase activity carried by the eIF4A subunit unfolds the 5′ proximal region of the mRNA, thereby allowing the 43S PIC to bind to the 5′ end of the mRNA, (Fig. 2). Besides eIF4A, various mRNA helicases unwind mRNA structures along the path of the scanning PIC, assisting mRNA base triplet pairing to the tRNAiMet anticodon. The MFC bound to the 40S SSU bridges the eIF4G subunit and the SSU (Fig. 2). The 48S PIC thus formed scans for the first AUG codon of the mRNA, likely through a 5′-to-3′ migration of the PIC along the mRNA (mRNA scanning). Similar to all ANN-decoding tRNAs, eukaryotic Met-tRNAiMet carries t6A 3′ of the anticodon. However, this modification is not necessary for stringent initiation, because in vitro transcribed Met-tRNAiMet can be used to reconstitute the translation initiation reaction capable of discriminating against non-AUG codons., Instead, initiation factor proteins that couple GTP hydrolysis to AUG recognition by the PIC play the major role in producing the 80S initiation complex precisely at the AUG codon, as outlined briefly next and described in detail later in this review. On the 48S PIC formation, eIF5 starts to promote GTP hydrolysis for eIF2, but the products, GDP and Pi, remain bound to eIF2, keeping a scanning-competent conformation. Once the tRNAi anticodon base-pairs to AUG, the Pi is released and the PIC stops scanning. As a consequence, eIF2 adopts the GDP-bound conformation, releasing itself from the PIC. As described below, eIF1 leaves the PIC before Pi release and eIF5 leaves the PIC together with eIF2-GDP, allowing the 40S IC to form. eIF5B then binds the 40S IC and promotes 60S LSU joining., eIF2-GDP is recycled into eIF2-GTP by the action of the guanine nucleotide exchange factor (GEF), eIF2B (Table 1). In yeast, a substantial fraction of eIF5 associates with eIF2 but lacks Met-tRNAiMet, suggesting that the clearance or recycling of the eIF2-GDP:eIF5 complex is rate-limiting for efficient MFC formation., eIF5 inhibits GDP dissociation from eIF2 in this complex, thereby serving as a GDP dissociation inhibitor (GDI). Importantly, eIF2 phosphorylation increases the abundance of this complex, thereby enhancing the inhibition of guanine nucleotide exchange by eIF2B. The significance of the eIF5:eIF2:GDP complex as the sequester of both the factors was recently enhanced by the discovery of eIF5 dissociation function carried by eIF2B (GDI dissociation factor or GDF function).

Conservation and diversity in the mechanism of translation initiation throughout eukaryotes

All factors that interact mainly with the subunit interface of the SSU—eIF1, eIF1A, eIF2, eIF5 and eIF5B—are highly conserved in structure and function throughout eukaryotes. However, much diversity is seen with molecular compositions of eIF3 and eIF4F involved in mRNA recruitment to the ribosome. Remarkably, the subunit composition of the yeast Saccharomyces cerevisiae eIF3 is simpler than mammalian eIF3 (Table 1). Nearly all the 13 subunits of eIF3 are generally well conserved in plants, fungi and metazoans, and the simplified eIF3 is found only in a group of closely related yeasts. However, the anaerobic protozoan Giardia intestinalis (lamblia), the most distantly related and perhaps primitive eukaryote, possesses only four identifiable eIF3 subunits (b, c, g, i) also found in S. cerevisiae eIF3 (KA, personal observations) – an example of convergent evolution. Thus, yeast eIF3 most likely carries a minimal function in various aspects of translation initiation. Likewise, the structure of eIF4G is diverse among eukaryotes, except for the Y(X)4LΦ motif responsible for eIF4E-binding and the MIF4G/HEAT domain responsible for eIF4A-binding. There is no conservation in PABP-binding sites between plant, yeast and mammalian homologs.,- In line with the difference in eIF3 structures, the nature of the interactions within the MFC differ somewhat, at least in strength, between yeast and mammals. The eIF4G-binding sites in the MFC are also different between yeast (via eIF5 and eIF1),, and mammals (via eIF3).- In accordance with the diversity in factors involved in mRNA recruitment, Metazoa have evolved many non-redundant mRNA helicases involved in translation initiation, including Vasa, a DEAD-box helicase, that binds the 3` UTR, remodeling mRNA for PIC recruitment. Mammalian DHX29 and DDX3 (yeast homolog, Ded1p), which unwind secondary structures in the 5′ UTR, also belong to distinct subfamilies of mRNA helicases, and hence arose independently during eukaryotic evolution.-

Molecular mechanistics of stringent AUG initiation in eukaryotes

Careful studies of the yeast translation initiation system, coupling in vitro biochemical assays with in vivo phenotypic observations, have reveled the following, intricate mechanism of stringent AUG initiation. The protein motifs and three-dimensional structures involved in this process are highly conserved across the eukaryotic kingdoms, and, in particular, between yeast and mammals. The key features of the mechanistics, including the SSU ribosomal conformational changes, are also being verified in the mammalian system.

eIF1 plays the central role in maintaining the scanning competent state.

eIF1, the smallest component (12 kDa) of the MFC, plays an important role in stringent AUG initiation in eukaryotes. Accumulating evidence from yeast and mammalian cells indicates that this is the key regulator of ribosome conformation from the “open” scanning-competent state to the “closed” scanning-incompetent state. , During scanning, the PIC allows mRNA sliding with a wider mRNA-binding channel. Met-tRNAiMet is not locked into the P-site at this point (Pout), allowing the tRNAi anticodon to base-pair with base triplets in the mRNA. However, when the anticodon base-pairs to the AUG codon, the PIC stops scanning, accommodating the Met-tRNAi:AUG pair in the P-site (Pin) (Fig. 3A).

Figure 3. Translation factor control of the PIC conformational change, coupled to AUG recognition. (A) The open (left) and closed (right) states of the 48S PIC are depicted with the 40S SSU (large circle with E, P and A-sites, different colors indicating distinct states) bound to initiation factors (small circles with numbers corresponding to each eIF). In the open state (left), the decoding site is “open,” allowing mRNA (curved thick line with 5′ and 3′ ends indicated) to slide along the mRNA-binding channel. Met-tRNAiMet (plug) is tilted to imply that it is out of the P-site. eIF1 (circle numbered 1) prevents Met-tRNAiMet paired with the non-AUG codon from binding the P-site. In the closed state (right), eIF1 is released (thick horizontal arrow), allowing Met-tRNAiMet paired with the AUG codon to bind the P-site. The Met-tRNAiMet is positioned vertically to imply the Pin state (near the P/I configuration,). (B) The conformational change is explained by the competition between eIF1 (circle) and the Met-tRNAiMet (plug):base triplet pair (UUG or AUG) for the P-site (labeled white on a part of green circles representing the 40S SSU). (C) MFC partners and eIF1A tails serve to stabilize each state, tightly coupling AUG recognition and commitment to initiation. In the open state (left), indicated MFC partners (circles) and eIF1A-NTT (thick red line) assist eIF1 to block tRNAiMet paired to UUG (representing a non-AUG codon) from binding the P-site. eIF1A-CTT (thick red line) destabilizes tRNAiMet binding to the P-site, contributing to the maintenance of the open state. In right, eIF2β-NTT (thick maroon line) plays the major role in the shift to the closed state by binding eIF5-CTD and disrupting the eIF1 linking to the PIC. These interactions promote eIF1 release. eIF1A-NTT (thick red line) stabilizes tRNAiMet binding to the P-site by direct interaction with tRNAiMet and the 40S P-site.

Figure 3. Translation factor control of the PIC conformational change, coupled to AUG recognition. (A) The open (left) and closed (right) states of the 48S PIC are depicted with the 40S SSU (large circle with E, P and A-sites, different colors indicating distinct states) bound to initiation factors (small circles with numbers corresponding to each eIF). In the open state (left), the decoding site is “open,” allowing mRNA (curved thick line with 5′ and 3′ ends indicated) to slide along the mRNA-binding channel. Met-tRNAiMet (plug) is tilted to imply that it is out of the P-site. eIF1 (circle numbered 1) prevents Met-tRNAiMet paired with the non-AUG codon from binding the P-site. In the closed state (right), eIF1 is released (thick horizontal arrow), allowing Met-tRNAiMet paired with the AUG codon to bind the P-site. The Met-tRNAiMet is positioned vertically to imply the Pin state (near the P/I configuration,). (B) The conformational change is explained by the competition between eIF1 (circle) and the Met-tRNAiMet (plug):base triplet pair (UUG or AUG) for the P-site (labeled white on a part of green circles representing the 40S SSU). (C) MFC partners and eIF1A tails serve to stabilize each state, tightly coupling AUG recognition and commitment to initiation. In the open state (left), indicated MFC partners (circles) and eIF1A-NTT (thick red line) assist eIF1 to block tRNAiMet paired to UUG (representing a non-AUG codon) from binding the P-site. eIF1A-CTT (thick red line) destabilizes tRNAiMet binding to the P-site, contributing to the maintenance of the open state. In right, eIF2β-NTT (thick maroon line) plays the major role in the shift to the closed state by binding eIF5-CTD and disrupting the eIF1 linking to the PIC. These interactions promote eIF1 release. eIF1A-NTT (thick red line) stabilizes tRNAiMet binding to the P-site by direct interaction with tRNAiMet and the 40S P-site. This “conformational change” model was first proposed as the result of analyses of the in vitro reconstituted initiation system using mammalian factors, identifying eIF1 and eIF1A as crucial elements for mRNA scanning and eIF1 for discriminating against non-AUG initiation.- The cryo-EM structure of the yeast eIF1/eIF1A/40S SSU complex (the open state) displayed a wider mRNA-binding space located between the “head” and “body” of the SSU than the 40S SSU alone (the closed state), in agreement with the mRNA-sliding capacity of the open conformation. More recently, comparison between Tetrahymena eIF1/eIF1A/40S (open) complex and the 40S SSU alone (closed) defined conformational changes rather locally restricted to eIF1- and eIF1A-binding sites. More work is needed to delineate structural features in the open PIC required for mRNA sliding, including the role of a “latch” formed by interactions of an rRNA helix of the body with another rRNA helix and Rps3 of the head, which clamp around the mRNA. Experiments with a partial yeast initiation system (composed of eIF1, eIF1A, eIF2, eIF5, Met-tRNAiMet, mRNA and the 40S SSU) demonstrated that eIF1 release is the key event associated with the conformational change, as detected in the FRET (fluorescence resonance energy transfer) assay,, and precedes Pi release (Fig. 3A). Thus, the presence of eIF1 defines the open state of the PIC. Recent studies highlighted the role of eIF1 in maintaining the Pout state of the open PIC (Fig. 3B), as described next. In a TC binding assay with the yeast eIF1:eIF1A:mRNA:40S complex, eIF1 dramatically increases the off rate of the TC, leading to the idea that eIF1 antagonizes tRNAi binding to the P site before start codon selection (Pout). This idea is strongly supported by genetic and biochemical studies, indicating that many yeast eIF1 mutations impairing the interaction with the 40S SSU or MFC partners (also see below) allow faster release of eIF1, thereby increasing the frequency of UUG initiation (Sui or suppressor of initiation codon mutation phenotype) – a premature change to the closed state at the non-AUG codon., Recently solved crystal structures of rabbit 40S SSU complexes revealed how eIF1 maintains the Pout conformation: The tRNAi located in the 40S:mRNA:tRNAi:eIF1A (closed) complex clashed with eIF1 in the 40S:eIF1 or 40S:eIF1:eIF1A (open) complex. Thus, eIF1 binds near the P-site and physically blocks tRNAi binding to the P-site, until its release caused by tRNAiMet binding to the AUG codon (Fig. 2 and Fig. 3B).

The MFC and eIF1A-terminal tails control initiation accuracy by dynamic rearrangement.

An excessive amount of eIF5 stimulates premature eIF1 release, thereby allowing initiation at UUG. , Interestingly, mutational studies delimited the part of eIF5 responsible for this activity to the C-terminal domain (CTD), not the N-terminal domain responsible for GTPase activation for eIF2. NMR studies revealed an evolutionarily conserved, overlapping surface of eIF5-CTD responsible for binding to eIF2β and eIF1. A mutation altering the eIF2β-binding site of eIF5-CTD alleviated the ability of eIF5 to promote eIF1 release, indicating that the eIF2β:eIF5-CTD interaction is involved in the shift to the closed state. eIF5-binding sites in eIF2β were located in three lysine-rich segments (K-boxes) in its N-terminal tail (NTT). Mutations altering the mutual binding sites prevented UUG initiation enforced by an eIF Sui- mutation (Ssu, or suppressor of SUI, phenotype), ,, in agreement with the role for this interaction in stabilizing the closed state of the PIC ( Fig. 3C , right). In contrast, the eIF5-CTD interaction with eIF1 contributes to eIF1 anchoring to the open PIC (Fig. 3C , left). Thus, the eIF2β:eIF5-CTD interaction promotes eIF1 release by disrupting the eIF1 link to the PIC ( Fig. 3C ). The interaction between the eIF5-CTD and the eIF2β-K-boxes is the major driver of MFC assembly in yeast.,, How can this interaction also trigger the PIC shift from the open to closed conformation? The eIF2β-K boxes bind RNA in a manner competitive with eIF5-CTD binding, and evidence suggests that the K-boxes mediate mRNA binding to the PIC. Thus, it is proposed that the interaction between the eIF5-CTD and eIF2β-K-boxes gets resolved upon 48S PIC formation, such that the K-boxes can stabilize mRNA binding during scanning. Once AUG is recognized, the K-boxes are somehow disengaged from mRNA binding and instead gets committed to close the PIC conformation. The eukaryote-specific extensions of eIF1A, its NTT and CTT (C-terminal tail), play opposing roles in regulating PIC conformation. The eIF1A-NTT stabilizes the closed conformation through a direct interaction with tRNAi in the P-site,,, while the eIF1A-CTT stabilizes the open conformation by linking tRNAi to the open PIC directly or indirectly through eIF2- (Fig. 3C). A recent study also revealed a new interaction between the eIF1A-NTT and eIF5-CTD. Similar to eIF2β K-boxes, the eIF1A-NTT is rich in positively charged residues. NMR studies showed that the eIF1A-binding site on eIF5-CTD almost completely overlaps with its eIF2β-binding site. In vivo mutational studies on the charged residues in eIF1A-NTT suggest that the eIF1A-NTT interaction with eIF5 contributes to eIF1 retention of the open PIC. Thus, eIF1A-NTT masks the eIF2β-binding site of eIF5-CTD during the course of scanning and contributes to the retention of eIF1 through eIF5-CTD (Fig. 3C).

Translational Control through Inhibition of MFC Formation

uORF-mediated control

Virtually all eukaryotic mRNAs are structurally and functionally monocistronic. If a bicistronic mRNA coding for two full-length proteins is generated and tested in a laboratory, only the 5′-proximal ORF is expressed, because ribosomes are released from the mRNA at the stop codon of this ORF. However, ~45–50% of mammalian mRNAs and ~13% of yeast mRNAs have at least one short upstream ORF (uORF) that either is bypassed by leaky scanning, prevents translation of the downstream ORF, or, alternatively, allows its re-initiation after the translation of the short uORF. An uORF can become permissive to downstream re-initiation, if the uORF is very short and somehow prevents ribosome dissociation at the stop codon, allowing the ribosomes that had anchored to mRNA following uORF translation to resume scanning. Intriguingly, combination of a short, re-initiation-permissive uORF and an inhibitory uORF (paired uORFs) allows translational control in response to eIF2 inhibition and resulting delayed re-initiation at the main ORF. Examples of regulation by paired uORFs include mammalian ATF4 and yeast GCN4 whose translation is induced by eIF2α phosphorylation. Various stress-activated protein kinases, such as GCN2, PKR, PERK and HRI, phosphorylate the conserved Ser-51 residue of the eIF2α subunit. This renders eIF2 a competitive inhibitor of eIF2B, thereby decreasing the eIF2:GTP:Met-tRNAiMet TC level. As a result, TC binding to the PIC that is about to re-initiate at the inhibitor uORF is delayed, allowing it to bypass the uORF and re-initiate at ATF4 or GCN4 instead. In yeast, eIF5 overxpression decreases TC abundance through eIF2B inhibition (the GDI activity) and slows down TC binding to the PIC through disturbing MFC formation. These effects mimic the effect of eIF2 phosphorylation, thereby allowing the post-uORF PIC to bypass the downstream uORFs and initiate translation of GCN4. In line with this observation, overexpression of a new translational inhibitor protein, eIF5-mimic protein 1 (5MP1, also known as BZW2), can inhibit eIF2 through a direct competition with eIF5, thereby inducing ATF4 translation in mouse embryonic fibroblasts with eIF2α Ser 51-to-Ala mutation. Thus, GCN4 or ATF4 can be translationally induced through eIF2 inhibition independent of eIF2 phosphorylation. Interestingly, the structural requirement for re-initiation following translation at an uORF is different between yeast and mammals or perhaps the whole Metazoa. In yeast, an uORF is generally non-permissive or inhibitory to downstream initiation. An mRNA cis element that binds eIF3 (via the a subunit) is responsible for rendering GCN4 uORF1 to being permissive. In mammals, the permissiveness of an uORF is primarily determined by its size. A short uORF (of ~2–3 amino acids) is generally permissive, likely because eIF3 and/or eIF4G remain(s) anchored more stably to the ribosome that has just finished its translation. uORFs longer than 20 amino acids are inhibitory to downstream initiation, likely because these factors dissociate from the ribosome during the time it takes to complete the uORF translation. Longer uORFs are even more inhibitory if they overlap with the main ORF, because the ribosome which has translated the uORF does not migrate backward (in a 3′-to-5′ direction) and therefore is effectively prevented from re-initiating at the main ORF. This difference leads to a marked difference between yeasts and Metazoa in the paired uORF organization required for translational control through eIF2 inhibition.

Competition with cap-independent translation.

Besides the cap-dependent mode, eukaryotic translation can initiate from an internal ribosome-entry site (IRES) often found in RNA viruses, such as poliovirus (PV), encephalomyocarditis virus (EMCV), hepatitis C virus (HCV) and cricket paralysis virus (CrPV) (cap-independent translation). , The MFC-driven cap-dependent pathway competes with cap-independent translation of IRES-carrying mRNAs. Accordingly, virus infection often inactivates cap-dependent translation through eIF4F cleavage and induces eIF2α phosphorylation, resulting in a decrease in eIF2:GTP:Met-tRNAiMet TC levels. This effect renders the IRES to compete more effectively for the MFC-loaded 43S PIC (EMCV), allows other MFC components or the 40S SSU available for IRES-mediated translation (HCV or CrPV), or, alternatively, allows non-canonical initiation factors (eIF2D/LGTN or MCT-1:DNER) promoting eIF2-indendnent Met-tRNAiMet recruitment , to bind the 40S SSU in place of the MFC (HCV). If eIF5 associates with eIF2:GDP and inhibits GDP dissociation from phosphorylated eIF2 (GDI activity) during virus infection, , eIF5 is sequestered effectively from the MFC, thereby enhancing the effect of eIF2 phosphorylation in favor of MFC inhibition and IRES translation.

Translational control through altered initiation stringency in eukaryotes

In mammals, eIF1 also works to discriminate against AUG codons in a poor Kozak context (consensus). The AUG start codon of mammalian eIF1 carries a poor Kozak context. When the eIF1 level goes down, a larger proportion of the PIC lacks eIF1, thereby decreasing its ability to discriminate against the AUG codon in a poor Kozak context. As a consequence, eIF1 translation goes up, correcting for the low eIF1 abundance – an autoregulatory mechanism similar to AUU-initiated bacterial IF3 regulation. Similarly, the yeast gene encoding eIF1 is translated from an AUG codon in a poor context and is auto-regulated by eIF1 abundance. However, this auto-regulation is only modulatory, since the expression of eIF1 from a high copy plasmid permits its substantial overexpression. Ribosome profiling, based on deep sequencing of ribosome-protected mRNA fragments, has been used to identify ribosome “footprints” on all the mRNAs in the cells. The drug harringtonine causes the ribosome to accumulate precisely at start codons. Thus, the ribosome profiling of harringtonine-treated cells can potentially reveal all the start codons used in the cells. Analysis of mouse embryonic stem cells revealed frequent non-AUG initiations within 5′ UTRs, which were suppressed by ~25% after differentiation. Based on this fact and that the genes displaying frequent non-AUG initiation at their 5′ UTR include transcription factors governing pluripotency, such as c-Myc and Nanog, it was proposed that non-AUG initiation allows translational control characteristic of pluripotency. The use of harringtonine to identify functional start codons in mouse ES cells revealed numerous translated uORFs with near-cognate start codons, which outnumbered AUG-initiated uORFs by ~4:1, with CUG the most frequent. In yeast the similar ribosome profiling data (but done without harringtonine that cannot diffuse into yeast) suggest that uORFs with near-cognate start codons are roughly equal in number to AUG-initiated uORFs. It should be noted that, similar to the prokaryotic cases, the frequencies of initiation from these non-AUG codons might be much more lower than that of initiation from the AUG codon. In a classical work on the yeast CYC7 initiation site, the strongest non-AUG initiation codons were GUG, ACG, AUU and UUG, but their frequency of initiation was 0.4~0.5% compared with that of AUG initiation. Provided that non-AUG initiation occurs at a sufficiently high frequency, non-AUG initiation in a 5′ UTR can affect gene expression through generating new uORFs or in-frame start codons (Fig. 4). If non-AUG start codons initiate translation of short uORFs (of up to 3~5 amino acids), they are expected to be permissive to downstream initiation (Fig. 4B, panel 1). On the other hand, if they initiate translation of long uORFs – especially those overlapping with the main ORF, they are inhibitory to initiation at the main ORF (Fig. 4B, panel 2). In addition, if non-AUG start codons generate a pair of permissive and inhibitory uORFs with appropriate locations (such as the ones found in the ATF4 leader region,), 5` UTR translation may allow regulated initiation of the main ORF in response to changes in eIF2 TC levels (Fig. 4B, panel 3). In-frame non-AUG start codons add N-terminal extensions to the protein encoded by the main ORF, which may regulate its enzyme or transcription factor activity (Fig. 4B, panel 4). Alternatively, these N-terminal extensions may contain an intracellular localization signal (ER, mitochondrial or plastid localization signal) (Fig. 4B, panel 4), regulating protein transport.,

Figure 4. Possible mechanisms of translational control by regulated initiation stringency. Typical coding structures of mRNA (horizontal lines) are depicted with the main ORF starting with the canonical AUG codon (longer box to the right). Black and red arrows indicate AUG- and non-AUG-initiated translation, respectively. mRNA translation patterns in the normal stringent mode (A) and the hypothetical non-stringent mode (B) are depicted. In (A), dotted boxes depict non-AUG-initiated uORFs that are not translated due to the normal, stringent mode of initiation. The main ORF is translated predominantly. In (B), panels 1–4 describe four patterns of regulation achieved by non-AUG-initiation. In panel 1, non-AUG-initiated uORF is permissive to downstream re-initiation, allowing the main ORF translation. In panel 2, non-AUG-initiated uORF is non-permissive to downstream re-initiation. The translation of the main ORF is only possible by the ribosome that had leaky-scanned the uORF (dotted line). In panel 3, the upstream permissive uORF allows downstream re-initiation of the second non-permissive uORF “or” the main ORF. When eIF2 TC abundance and recruitment are normal, the second uORF is re-initiated, inhibiting the main ORF translation. When eIF2 TC is limited or its ribosome binding is inhibited, the re-initiation is delayed and occurs at the main ORF. In panel 4, thick lines below mRNA schematics denote proteins encoding by the mRNA, with boxes indicating N-terminal functional segments added by the in-frame non-AUG initiation. Though not depicted here, there are myriads of mRNAs with AUG-initiated uORFs that are translated in the stringent mode, playing important roles in translational control. See text for details.

Figure 4. Possible mechanisms of translational control by regulated initiation stringency. Typical coding structures of mRNA (horizontal lines) are depicted with the main ORF starting with the canonical AUG codon (longer box to the right). Black and red arrows indicate AUG- and non-AUG-initiated translation, respectively. mRNA translation patterns in the normal stringent mode (A) and the hypothetical non-stringent mode (B) are depicted. In (A), dotted boxes depict non-AUG-initiated uORFs that are not translated due to the normal, stringent mode of initiation. The main ORF is translated predominantly. In (B), panels 1–4 describe four patterns of regulation achieved by non-AUG-initiation. In panel 1, non-AUG-initiated uORF is permissive to downstream re-initiation, allowing the main ORF translation. In panel 2, non-AUG-initiated uORF is non-permissive to downstream re-initiation. The translation of the main ORF is only possible by the ribosome that had leaky-scanned the uORF (dotted line). In panel 3, the upstream permissive uORF allows downstream re-initiation of the second non-permissive uORF “or” the main ORF. When eIF2 TC abundance and recruitment are normal, the second uORF is re-initiated, inhibiting the main ORF translation. When eIF2 TC is limited or its ribosome binding is inhibited, the re-initiation is delayed and occurs at the main ORF. In panel 4, thick lines below mRNA schematics denote proteins encoding by the mRNA, with boxes indicating N-terminal functional segments added by the in-frame non-AUG initiation. Though not depicted here, there are myriads of mRNAs with AUG-initiated uORFs that are translated in the stringent mode, playing important roles in translational control. See text for details.

Is CUG initiation mediated by a Leu-tRNA initiator?

Much of the non-AUG initiation observed in the ribosome profiling studies mentioned above starts with CUG or GUG.Interestingly, CUG initiation was reported to play a specialized role in MHC class I antigen protein synthesis and presentation.The ribosome engaged in the CUG initiation in rabbit reticulocyte lysates was shown to contain a Leu-tRNA. Gene silencing studies also showed that the CUG initiation is promoted by a non-canonical initiation factor, eIF2A. Evidence is also presented that Leu-tRNA initiates the translation of a CUG-initiated uORF found in the c-Myc leader region. This study poses important questions concerning the mechanism of non-AUG initiation found in higher eukaryotes. Is it a modification of the canonical cap-dependent, MFC-mediated pathway, a totally new Leu-tRNA-driven pathway, or a combination thereof?

Altered initiation stringency and disease.

If we consider non-AUG initiation in a 5′ “UTR” as the consequence of regulation of the MFC-mediated stringency mechanism, the finding that 5′ UTR translation goes up in embryonic stem cells and gets suppressed upon differentiation is very intriguing. Considering the MFC-mediated mechanism mentioned above, there are two ways to control the frequency of non-AUG initiation without introducing mutations into MFC components. One is to change expression of eIF1 or eIF5, the key regulators of initiation stringency, as demonstrated in mammalian cells. , The other is to modulate interactions between MFC components through phosphorylation or protein inhibitors. In plants, the MFC is the target of frequent protein phosphorylation during active protein synthesis. Similar MFC components are also phosphorylated in mammals and yeasts. - Recently, the eIF5-mimic protein (5MP, also known as BZW or eIF5C) was reported to inhibit general protein synthesis by competing with eIF5 for eIF2. These findings raise an interesting possibility that MFC regulation leads not only to regulation of general translation, but also to controlled stringency of translation initiation. The cellular pluripotency, whose translation status may be characterized by non-AUG initiation, plays an important role in the generation of cancers. While overexpression or truncation of initiation factors is associated with cancers,, there is no report indicating that the modulation of eIF1 or eIF5 levels is associated with them. However, human 5MP2 (BZW1) was shown to be overexpressed in distinct types of cancers including mucoepidermoid carcinoma (MEC). Since MEC cell lines knocked down for 5MP2 generated smaller tumors in nude mice, 5MP2 is proposed to be an oncogenic protein. As mentioned above, 5MP1 (BZW2) was also reported to promote translation of ATF4, the oncogenic transcription factor, in mouse embryonic fibroblast cell lines with the homozygous eIF2α-S51A mutation defective in eIF2α phosphorylation. It would be intriguing to determine whether MFC regulation through 5MP can affect cellular stringency of translation initiation and whether mis-programming of this regulation can lead to tumorigenesis. Another category of disease caused by changes in translation components include “ribosomopathy,” whose founding members are Diamond-Blackfan anemia and 5q- syndromes, caused by mutations in ribosomal proteins., While most of the symptoms, including macrocytic anemia and cogenital disorders, may be explained by the lack of sufficient protein synthesis required for tissue development due to inappropriate ribosome synthesis, there are many other symptoms associated with ribosomopathies that are not explained by this model. One of them is an increased risk of cancers,, which is also observed in zebrafish lines deficient in ribosomal proteins. It is puzzling that that the ribosomal protein mutations cause common phenotypes despite the fact that the altered proteins locate everywhere on the ribosome and that no common functional defect is found. Recently, it was reported that a common phenotype of rather subtle, small and large rRNA mutations in yeast is to increase the accuracy of initiation (Ssu phenotype).,, An interesting possibility emerges that the ribosomal protein mutations reduce non-AUG initiation in stem cells (even to a small extent), thereby compromising the gene regulation program that sets the stage for later differentiation while suppressing tumorigenesis.

Conclusions and Perspectives

Although the role of the MFC in rapid PIC assembly may not yet have been established in higher eukaryotes, recent studies highlight the mechanism by which eIF2 phosphorylation limits both eIF2 TC and free eIF5 available for efficient MFC formation., In addition, it seems clear that the MFC is the evolutionarily conserved unit responsible for stringent AUG initiation specific to eukaryotes. Careful NMR interaction studies using both yeast and human proteins have identified conserved mutual interfaces in MFC, and its partner, eIF1A-NTT, responsible for stringent AUG initiation., The immediate next goal of this line of study is to delineate the precise molecular role of eIF3, the major component of the MFC, and eIF4G, the major mRNA-binding partner of the MFC, in stringent AUG initiation, since the binding sites for both eIF1 and eIF5 were identified in the unstructured segments of the c subunit of eIF3 and eIF4G., The discovery of non-AUG initiation that is prevalent in the pluripotent cell stage has raised the very important question – how can cell’s stringency in translation initiation be regulated? Whether this involves the control of MFC activity, the new Leu-tRNA-driven pathway or both, the quest for the answer to this question requires the efforts of coming decades, combining all the cutting edge technologies to study the mechanism and regulation of translation initiation. The results of such studies are expected to change the paradigm concerning how we view translational control, diseases caused by its perturbation, and evolution of life on Earth.
  132 in total

1.  Analysis of elements involved in pseudoknot-dependent expression and regulation of the repA gene of an IncL/M plasmid.

Authors:  V Athanasopoulos; J Praszkier; A J Pittard
Journal:  J Bacteriol       Date:  1999-03       Impact factor: 3.490

Review 2.  Gene regulation by riboswitches.

Authors:  Maumita Mandal; Ronald R Breaker
Journal:  Nat Rev Mol Cell Biol       Date:  2004-06       Impact factor: 94.444

Review 3.  Regulation of translation via mRNA structure in prokaryotes and eukaryotes.

Authors:  Marilyn Kozak
Journal:  Gene       Date:  2005-10-05       Impact factor: 3.688

4.  Translation factor control of ribosome conformation during start codon selection.

Authors:  Katsura Asano; Matthew S Sachs
Journal:  Genes Dev       Date:  2007-06-01       Impact factor: 11.361

5.  Regulatory elements in eIF1A control the fidelity of start codon selection by modulating tRNA(i)(Met) binding to the ribosome.

Authors:  Adesh K Saini; Jagpreet S Nanda; Jon R Lorsch; Alan G Hinnebusch
Journal:  Genes Dev       Date:  2010-01-01       Impact factor: 11.361

Review 6.  Diamond Blackfan anemia: ribosomal proteins going rogue.

Authors:  Steven R Ellis; Pierre-Emmanuel Gleizes
Journal:  Semin Hematol       Date:  2011-04       Impact factor: 3.851

7.  Interactions of eukaryotic translation initiation factor 3 (eIF3) subunit NIP1/c with eIF1 and eIF5 promote preinitiation complex assembly and regulate start codon selection.

Authors:  Leos Valásek; Klaus H Nielsen; Fan Zhang; Christie A Fekete; Alan G Hinnebusch
Journal:  Mol Cell Biol       Date:  2004-11       Impact factor: 4.272

8.  Casein kinase II mediates multiple phosphorylation of Saccharomyces cerevisiae eIF-2 alpha (encoded by SUI2), which is required for optimal eIF-2 function in S. cerevisiae.

Authors:  L Feng; H Yoon; T F Donahue
Journal:  Mol Cell Biol       Date:  1994-08       Impact factor: 4.272

9.  Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling.

Authors:  Nicholas T Ingolia; Sina Ghaemmaghami; John R S Newman; Jonathan S Weissman
Journal:  Science       Date:  2009-02-12       Impact factor: 47.728

10.  The Cryo-EM structure of a complete 30S translation initiation complex from Escherichia coli.

Authors:  Patricia Julián; Pohl Milon; Xabier Agirrezabala; Gorka Lasso; David Gil; Marina V Rodnina; Mikel Valle
Journal:  PLoS Biol       Date:  2011-07-05       Impact factor: 8.029

View more
  33 in total

1.  Gcn2 eIF2α kinase mediates combinatorial translational regulation through nucleotide motifs and uORFs in target mRNAs.

Authors:  Yuji Chikashige; Hiroaki Kato; Mackenzie Thornton; Whitney Pepper; Madelyn Hilgers; Ariana Cecil; Izumi Asano; Haana Yamada; Chie Mori; Cheyenne Brunkow; Carter Moravek; Takeshi Urano; Chingakham Ranjit Singh; Katsura Asano
Journal:  Nucleic Acids Res       Date:  2020-09-18       Impact factor: 16.971

2.  Human eIF5 and eIF1A Compete for Binding to eIF5B.

Authors:  Kai Ying Lin; Nabanita Nag; Tatyana V Pestova; Assen Marintchev
Journal:  Biochemistry       Date:  2018-09-26       Impact factor: 3.162

3.  The Interaction between the Ribosomal Stalk Proteins and Translation Initiation Factor 5B Promotes Translation Initiation.

Authors:  Ryo Murakami; Chingakham Ranjit Singh; Jacob Morris; Leiming Tang; Ian Harmon; Azuma Takasu; Tomohiro Miyoshi; Kosuke Ito; Katsura Asano; Toshio Uchiumi
Journal:  Mol Cell Biol       Date:  2018-07-30       Impact factor: 4.272

4.  Overexpression of eIF5 or its protein mimic 5MP perturbs eIF2 function and induces ATF4 translation through delayed re-initiation.

Authors:  Caitlin Kozel; Brytteny Thompson; Samantha Hustak; Chelsea Moore; Akio Nakashima; Chingakham Ranjit Singh; Megan Reid; Christian Cox; Evangelos Papadopoulos; Rafael E Luna; Abbey Anderson; Hideaki Tagami; Hiroyuki Hiraishi; Emily Archer Slone; Ken-Ichi Yoshino; Masayo Asano; Sarah Gillaspie; Jerome Nietfeld; Jean-Pierre Perchellet; Stefan Rothenburg; Hisao Masai; Gerhard Wagner; Alexander Beeser; Ushio Kikkawa; Sherry D Fleming; Katsura Asano
Journal:  Nucleic Acids Res       Date:  2016-06-20       Impact factor: 16.971

5.  Targeted mRNA Therapy for Ornithine Transcarbamylase Deficiency.

Authors:  Mary G Prieve; Pierrot Harvie; Sean D Monahan; Debashish Roy; Allen G Li; Teri L Blevins; Amber E Paschal; Matt Waldheim; Eric C Bell; Anna Galperin; Jean-Rene Ella-Menye; Michael E Houston
Journal:  Mol Ther       Date:  2018-01-04       Impact factor: 11.454

6.  Molecular Landscape of the Ribosome Pre-initiation Complex during mRNA Scanning: Structural Role for eIF3c and Its Control by eIF5.

Authors:  Eiji Obayashi; Rafael E Luna; Takashi Nagata; Pilar Martin-Marcos; Hiroyuki Hiraishi; Chingakham Ranjit Singh; Jan Peter Erzberger; Fan Zhang; Haribabu Arthanari; Jacob Morris; Riccardo Pellarin; Chelsea Moore; Ian Harmon; Evangelos Papadopoulos; Hisashi Yoshida; Mahmoud L Nasr; Satoru Unzai; Brytteny Thompson; Eric Aube; Samantha Hustak; Florian Stengel; Eddie Dagraca; Asokan Ananbandam; Philip Gao; Takeshi Urano; Alan G Hinnebusch; Gerhard Wagner; Katsura Asano
Journal:  Cell Rep       Date:  2017-03-14       Impact factor: 9.423

7.  Efficient and Accurate Translation Initiation Directed by TISU Involves RPS3 and RPS10e Binding and Differential Eukaryotic Initiation Factor 1A Regulation.

Authors:  Ora Haimov; Hadar Sinvani; Franck Martin; Igor Ulitsky; Rafi Emmanuel; Ana Tamarkin-Ben-Harush; Assaf Vardy; Rivka Dikstein
Journal:  Mol Cell Biol       Date:  2017-07-14       Impact factor: 4.272

8.  Selection of start codon during mRNA scanning in eukaryotic translation initiation.

Authors:  Biswajit Gorai; Thyageshwar Chandran; Ipsita Basu; Prabal K Maiti; Tanweer Hussain
Journal:  Commun Biol       Date:  2022-06-15

Review 9.  Origin of translational control by eIF2α phosphorylation: insights from genome-wide translational profiling studies in fission yeast.

Authors:  Katsura Asano
Journal:  Curr Genet       Date:  2021-01-09       Impact factor: 2.695

10.  Human oncoprotein 5MP suppresses general and repeat-associated non-AUG translation via eIF3 by a common mechanism.

Authors:  Chingakham Ranjit Singh; M Rebecca Glineburg; Chelsea Moore; Naoki Tani; Rahul Jaiswal; Ye Zou; Eric Aube; Sarah Gillaspie; Mackenzie Thornton; Ariana Cecil; Madelyn Hilgers; Azuma Takasu; Izumi Asano; Masayo Asano; Carlos R Escalante; Akira Nakamura; Peter K Todd; Katsura Asano
Journal:  Cell Rep       Date:  2021-07-13       Impact factor: 9.423

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.