Literature DB >> 28982758

Non-AUG translation: a new start for protein synthesis in eukaryotes.

Michael G Kearse1, Jeremy E Wilusz1.   

Abstract

Although it was long thought that eukaryotic translation almost always initiates at an AUG start codon, recent advancements in ribosome footprint mapping have revealed that non-AUG start codons are used at an astonishing frequency. These non-AUG initiation events are not simply errors but instead are used to generate or regulate proteins with key cellular functions; for example, during development or stress. Misregulation of non-AUG initiation events contributes to multiple human diseases, including cancer and neurodegeneration, and modulation of non-AUG usage may represent a novel therapeutic strategy. It is thus becoming increasingly clear that start codon selection is regulated by many trans-acting initiation factors as well as sequence/structural elements within messenger RNAs and that non-AUG translation has a profound impact on cellular states.
© 2017 Kearse and Wilusz; Published by Cold Spring Harbor Laboratory Press.

Entities:  

Keywords:  RAN translation; eIF2A; eIF2D; near-cognate; start codon; translation initiation

Mesh:

Substances:

Year:  2017        PMID: 28982758      PMCID: PMC5666671          DOI: 10.1101/gad.305250.117

Source DB:  PubMed          Journal:  Genes Dev        ISSN: 0890-9369            Impact factor:   11.361


Eukaryotic genomes encode thousands of proteins with important structural and regulatory roles, and a significant amount of cellular energy is dedicated to transcription of messenger RNAs (mRNAs) and their subsequent translation into protein. Errors in translation can result in wasteful production of inactive or deleterious proteins that misfold, aggregate, lack regulation, or otherwise disrupt cellular fitness (Drummond and Wilke 2009). It is thus critical that ribosomes initiate at the appropriate codon, incorporate the appropriate amino acids into the growing polypeptide chain, and terminate only at the appropriate stop codon (Zaher and Green 2009; Rozov et al. 2016). Protein-coding sequences have traditionally been defined as uninterrupted ORFs that begin with the universal AUG start codon and end with one of three stop codons (UAA, UGA, and UAG). However, it has been known since the 1980s that translation can initiate at codons other than AUG, albeit at a much lower efficiency (Zitomer et al. 1984; Peabody 1987, 1989; Clements et al. 1988; Hann et al. 1988). In most of these cases, near-cognate codons that differ from AUG by only one nucleotide (e.g., CUG, GUG, and UUG) are used. Given the strong evolutionary pressure that is observed across all organisms to use AUG start codons, it may appear at first glance that initiation events from non-AUG codons represent intrinsic errors of the translation machinery. This idea is directly challenged by the fact that a number of endogenous and viral proteins with important functions are derived solely from non-AUG start codons (Curran and Kolakofsky 1988; Dorn et al. 1990; Xiao et al. 1991; Chang and Wang 2004; Tang et al. 2004; Beerman and Jongens 2011; Ivanov et al. 2011). For example, DAP5 (also called eukaryotic initiation factor 4G2 [eIF4G2] or NAT1) plays a critical role in internal ribosome entry site (IRES)-mediated translation and is initiated solely from a GUG start codon in mouse and human cells (Imataka et al. 1997; Takahashi et al. 2005; Lewis et al. 2008; Marash et al. 2008; Liberman et al. 2015). Likewise, in yeast, UUG and ACG start codons are used to initiate translation of the GRS1 and ALA1 transfer RNA (tRNA) synthetases, respectively (Chang and Wang 2004; Tang et al. 2004). These non-AUG translation events represent only the tip of the iceberg, as ribosome profiling has recently revealed thousands of novel initiation events at non-AUG codons (Ingolia et al. 2009, 2011). Interestingly, not all near-cognate start codons are used with equal efficiency, with CUG generally being most efficient, followed by GUG, ACG, and AUU (Table 1). It should be noted that there is significant variation in these efficiency measurements across assays. This is likely because in vitro translation assays are strongly influenced by how the lysates are prepared (e.g., those prepared by gel filtration may lack low-molecular-weight translation factors) and the ionic concentrations used (Kozak 1989, 1990b).
Table 1.

Efficiency of non-AUG start codons in various assays

Efficiency of non-AUG start codons in various assays While endogenous functional non-AUG start codons may be more prevalent than previously appreciated, it should not be dismissed that such codons typically perform at a markedly reduced efficiency compared with AUG codons (Table 1). Thus, the purpose of this review is not to suggest that most proteins within cells are derived from non-AUG start codons but to highlight how alternative initiation codons can be used to increase protein isoform diversity and impact cellular processes (Touriol et al. 2003). Besides highlighting the widespread nature of non-AUG translation events in eukaryotic cells, we discuss the trans-acting proteins and the features within mRNAs that dictate start codon recognition. In some cases, a canonical scanning mechanism of translation initiation appears to be used, but there are also a number of alternative factors (which can be induced by various stresses) that alter start codon preferences. As aberrant non-AUG translation events are associated with, and likely drive, multiple human diseases, including cancer and neurodegeneration (Zu et al. 2011; Sendoel et al. 2017), there exists the intriguing possibility that modulating non-AUG translation events (e.g., using small molecule inhibitors) may have profound therapeutic effects.

Thousands of non-AUG codons can be used for translation initiation

For many years, the identification of non-AUG start codons was often fortuitous and resulted from efforts aimed at cloning genes of interest. For example, Xiao et al. (1991) observed that endogenous TEAD1 (also called TEF-1) in HeLa cells did not comigrate in SDS-PAGE with in vitro translated TEAD1 that was initiated from the predicted AUG start codon. By pursuing this observation and using mutational analysis, it was revealed that endogenous TEAD1 uses solely an upstream AUU start codon. Approaches like these are largely limited to single genes. However, a genome-wide view of translation can now be provided by ribosome profiling, in which short mRNA fragments that are protected by 80S ribosomes are purified and subjected to high-throughput sequencing (Ingolia et al. 2009, 2011; Ingolia 2010). Ribosome profiling, sometimes referred to as Ribo-seq, has revolutionized our understanding of where translating ribosomes are present across the transcriptome as well as revealed key features that influence translation patterns (for review, see Ingolia 2014; Brar and Weissman 2015; Andreev et al. 2017). For example, specific combinations of codons (often involving proline) can slow translation in both mammalian cells and yeast (Ingolia et al. 2011; Gamble et al. 2016). By treating cells with early elongation inhibitors that block 80S ribosomes after initiation but before the first translocation cycle, ribosome profiling can be used to define translation start sites (Fig. 1). Lactimidomycin (which binds the exit site [E site] of the 60S subunit) (Schneider-Poetsch et al. 2010) or harringtonine (which binds the aminoacyl site [A site] of the 60S subunit) (Fresno et al. 1977) are used for exactly this purpose, as they have little, if any, effect on 80S ribosomes beyond the first translocation step. Thousands of previously unannotated initiation events have now been identified in mouse embryonic stem cells, ∼60% of which initiate at a non-AUG start codon (Ingolia et al. 2011). This includes >75% of upstream ORFs (uORFs) that are present in transcript leader regions (Ingolia et al. 2011). This number should not be misinterpreted to mean that most proteins in cells are derived from non-AUG translation events or that translation initiation at a non-AUG start codon is more efficient than at a canonical AUG. Instead, these data simply indicate that if one tallies up initiation sites across the transcriptome, ignoring the relative efficiency of each event, translation initiation at non-AUG start codons outweighs initiation at AUG codons.
Figure 1.

Ribosome profiling can be used to reveal translation initiation sites across the transcriptome. (Left) Treatment with cycloheximide (blue), which binds to the E site of the 60S subunit, pauses all elongating 80S ribosomes. The mapped ribosome footprints thus typically cover the entire ORF. (Right) In contrast, harringtonine (red) binds to the A site of the 60S subunit and only inhibits 80S ribosomes just after subunit joining at the start codon. Mapped ribosome footprints from harringtonine treatment are thus enriched for the start codon.

Ribosome profiling can be used to reveal translation initiation sites across the transcriptome. (Left) Treatment with cycloheximide (blue), which binds to the E site of the 60S subunit, pauses all elongating 80S ribosomes. The mapped ribosome footprints thus typically cover the entire ORF. (Right) In contrast, harringtonine (red) binds to the A site of the 60S subunit and only inhibits 80S ribosomes just after subunit joining at the start codon. Mapped ribosome footprints from harringtonine treatment are thus enriched for the start codon. As these data challenge the prevailing model of eukaryotic initiation, there has naturally been debate about whether all the footprints obtained from ribosome profiling experiments represent true translation signals (Guttman et al. 2013). For example, it is now clear that the inclusion of translation inhibitors (such as cycloheximide, which helps trap 80S ribosomes on mRNAs) during the preparation and lysis of the cells can sometimes adversely influence the data (Gerashchenko and Gladyshev 2014; Lareau et al. 2014; Hussmann et al. 2015; Andreev et al. 2017). Furthermore, the addition of early elongation inhibitors such as lactimidomycin or harringtonine causes large increases in free ribosomal subunits, which likely influence initiation to some degree. Indeed, some putative initiation events identified using ribosome profiling have not been able to be experimentally confirmed and thus likely represent false positives (Zhang and Hinnebusch 2011). Nevertheless, a growing number of functional peptides generated from non-AUG start codons have been documented (Starck et al. 2008, 2012; Slavoff et al. 2013), and continuing improvements to the methods provide increasingly high-confidence data sets. For example, computational approaches now take into account both fragment size and 3-nucleotide (nt) periodicity (indicative of decoding the genetic code) (Ingolia 2014; Brar and Weissman 2015; Fields et al. 2015), and orthogonal approaches, such as mass spectrometry and the insertion of small epitope tags into endogenous genes (Menschaert et al. 2013; Slavoff et al. 2013; Ingolia et al. 2014), are increasingly being used for validation.

Start codon recognition is tightly controlled by multiple initiation factors

The mechanism of canonical eukaryotic translation initiation and the role of individual eIFs have been reviewed extensively (Sonenberg and Hinnebusch 2009; Jackson et al. 2010; Lorsch and Dever 2010; Hinnebusch 2017). Briefly, for the ribosome to identify a suitable start codon, most eukaryotic translation is thought to follow the scanning model of initiation (Fig. 2). The ternary complex (TC) is first formed when GTP-bound eIF2 (a heterotrimer of α, β, and γ subunits) binds the initiator methionyl-tRNA (Met-tRNAiMet). The TC then interacts with the 40S ribosomal subunit and a number of eIFs, including eIF1, eIF1A, eIF5, and the eIF3 complex (13 subunits [a–m] in humans), to form the 43S preinitiation complex (PIC). Once assembled, the 43S PIC is recruited to the 7-methylguanosine (m7G) cap at the 5′ ends of mRNAs by the eIF4F complex, which consists of the cap-binding protein eIF4E, the DEAD-box helicase eIF4A, and the scaffolding protein eIF4G that binds PABP at the mRNA 3′ end, thereby generating a 48S-activated mRNA. In an ATP-dependent manner, the PIC then scans 5′ to 3′ in search of a start codon, which is most often the first AUG. Once the anti-codon of Met-tRNAiMet (bound by eIF2) base-pairs with the start codon in the peptidyl site (P site) of the scanning PIC, GTP is hydrolyzed by eIF2, and Pi is released. This step is promoted by eIF5, the GTPase-activating protein for eIF2. Subsequent displacement of eIF1, eIF1A, eIF2•GDP, eIF3, and eIF5 along with an additional GTP hydrolysis event by eIF5B allows for joining of the large 60S ribosomal subunit, thereby forming the complete 80S ribosome and allowing the second codon to be decoded in the A site.
Figure 2.

Canonical eukaryotic translation uses a scanning model of initiation. The 43S PIC consists of the 40S small ribosomal subunit, the TC (eIF2•GTP•Met-tRNAiMet), eIF1, eIF1A, eIF3, and eIF5. The PIC is recruited to the 5′ ends of mRNAs by the eIF4F complex, which consists of the cap-binding protein eIF4E, the DEAD-box helicase eIF4A, and the scaffolding protein eIF4G that binds PABP at the mRNA 3′ end. The PIC then scans in an ATP-dependent manner in a 5′-to-3′ direction in search of a start codon. Base pairing between the anti-codon of the Met-tRNAiMet and the start codon causes GTP hydrolysis by eIF2 and translation initiation. (1) eIF1; (1A) eIF1A; (4A) eIF4A; (4E) eIF4E; (4G) eIF4G.

Canonical eukaryotic translation uses a scanning model of initiation. The 43S PIC consists of the 40S small ribosomal subunit, the TC (eIF2•GTP•Met-tRNAiMet), eIF1, eIF1A, eIF3, and eIF5. The PIC is recruited to the 5′ ends of mRNAs by the eIF4F complex, which consists of the cap-binding protein eIF4E, the DEAD-box helicase eIF4A, and the scaffolding protein eIF4G that binds PABP at the mRNA 3′ end. The PIC then scans in an ATP-dependent manner in a 5′-to-3′ direction in search of a start codon. Base pairing between the anti-codon of the Met-tRNAiMet and the start codon causes GTP hydrolysis by eIF2 and translation initiation. (1) eIF1; (1A) eIF1A; (4A) eIF4A; (4E) eIF4E; (4G) eIF4G. The eIFs that are responsible for start codon recognition were first identified in Saccharomyces cerevisiae using a strain that harbored a start codon mutation (AUG to ACG) in the HIS4 gene (Donahue and Cigan 1988; Castilho-Valavicius et al. 1990). When cultured in medium lacking the essential amino acid histidine, these cells survived only if they used an in-frame near-cognate UUG codon that is present at the third codon of the mutant ORF. This is because initiation from the UUG codon generates a slightly truncated but functional His4 protein that supports histidine biosynthesis. By characterizing spontaneous revertants that grew despite the absence of histidine in the medium, five suppressor of initiation (SUI) codon mutation genes were identified: eIF1 (SUI1), all three (α, β, and γ) subunits of eIF2 (SUI2, SUI3, and SUI4, respectively), and eIF5 (SUI5) (Donahue and Cigan 1988; Castilho-Valavicius et al. 1990; Huang et al. 1997). Subsequent experiments identified further mutations (termed suppressor of Sui phenotype [Ssu]) in these same eIFs that caused decreased initiation from a UUG start codon, showcasing how tightly start codon recognition is regulated (Martin-Marcos et al. 2011, 2014; Singh et al. 2012; Saini et al. 2014). In fact, all of the eIFs found in the PIC (Fig. 2), including eIF1A and eIF3, have important roles in start codon recognition (Valasek et al. 2004; Fekete et al. 2005; Saini et al. 2010; Hinnebusch 2017). Consistent with GTP hydrolysis by eIF2 and subsequent Pi release serving as key steps in start codon recognition, mutations in eIF2 and eIF5 that stimulate premature GTPase activity and Pi release result in increased initiation at non-AUG codons, such as UUG (Huang et al. 1997). It was thus proposed early on that mutations in eIF1, eIF1A, and eIF3 likely influence start codon recognition by forcing conformational changes in eIF2 and/or eIF5 that drive premature GTP hydrolysis and/or Pi release. This is certainly still plausible; however, recent high-resolution structural data have now made it clear that many of these factors also contact the context nucleotides (e.g., “Kozak sequence”) (discussed below) immediately upstream of and downstream from the start codon. These initiation factors thus influence the overall conformation of the PIC, causing the stringency of start codon recognition to be strengthened or weakened depending on the sequence/structure of the mRNA (Hussain et al. 2014; Llacer et al. 2015; Hinnebusch 2017). It should also be highlighted that eIFs, including eIF1 and eIF5, can be post-translationally modified, and, at least for eIF1, these modifications affect start codon fidelity (Majumdar et al. 2002; Homma et al. 2005). Additional trans-acting factors, such as ABC50 (also known as ABCF1), can also bind eIF2 to regulate start codon recognition (Stewart et al. 2015). Alterations in eIF stoichiometry are further known to influence start codon recognition patterns. For example, addition of extra eIFs caused shifts in the relative amounts of AUG versus non-AUG initiation in rabbit reticulocyte lysates (Barth-Baus et al. 2013). Using a reporter mRNA that could initiate at either an AUG or an upstream CUG start codon, these in vitro experiments showed that the addition of extra eIF1 and eIF1A proteins strengthened start codon fidelity, thereby resulting in less initiation at the CUG codon and more at the AUG. In contrast, when additional eIF5 and eIF5B was added, start codon fidelity was loosened, resulting in more initiation at the CUG codon and less at the AUG (Barth-Baus et al. 2013). Overexpression of eIF5 similarly increases non-AUG translation in human HEK293T cells (Nanda et al. 2009; Loughran et al. 2012). In yeast, haploinsufficiency of eIF1, but not eIF1A or eIF5, is known to increase non-AUG initiation (Takacs et al. 2011). Consistent with these data, overexpression of eIF1 in mammalian cells inhibited initiation from non-AUG codons (Ivanov et al. 2010), highlighting the importance of proper eIF1 expression for start codon recognition.

RNA sequences and structures affect start codon utilization

Landmark experiments by Kozak's laboratory (Kozak 1984, 1986a,b) in the 1980s provided some of the earliest clues about how sequences and secondary structures surrounding the start codon can influence initiation. Focusing first on the preproinsulin gene, Kozak identified a functional role for the nucleotide at position −3 (relative to the A of the AUG start codon) (Kozak 1984) and then further defined additional nucleotides upstream of and downstream from the start codon that influence initiation across a variety of vertebrate mRNAs (Kozak 1986b, 1987, 1989). These critical nucleotides are now commonly referred to as the “Kozak sequence.” In mammals, the consensus Kozak sequence is C(A/G)CCAUGG, where the AUG start codon is underlined (Kozak 1989), and the flanking nucleotides have a larger influence on recognition of non-AUG start codons than on AUG start codons. In particular, the purine at position −3 and the guanosine at position +4 provide stabilizing interactions with the PIC (Pisarev et al. 2006; Hussain et al. 2014; Hinnebusch 2017). The yeast consensus Kozak sequence (AAAAAUG) (Hamilton et al. 1987; Shabalina et al. 2004) likewise primarily only affects initiation at non-AUG start codons, not at AUG start codons (Zitomer et al. 1984; Donahue and Cigan 1988; Chen et al. 2008). Consistent with the scanning model (Fig. 2), mRNA structures either upstream of or immediately downstream from start codons can influence initiation efficiency by affecting the movement of the PIC. For example, initiation from a non-AUG start codon or an AUG in a poor context can become more efficient when a strong thermostable hairpin is placed downstream from the start codon (Kozak 1990a). Importantly, this stimulatory effect is sensitive to the placement of the hairpin: When a hairpin was placed 14 nt downstream from a reporter mRNA start codon, thereby stalling the start codon in the P site of the PIC, translation initiation from this codon increased in vitro (Kozak 1990a). This structure-dependent influence on initiation has now been confirmed to affect non-AUG start codons on endogenous mRNAs (Kearse et al. 2016; Liang et al. 2017) as well as AUG start codons on viral mRNAs (Ventoso et al. 2006; Toribio et al. 2016). For example, the PTEN gene, a tumor suppressor gene that is frequently mutated in many cancers, generates an alternative protein isoform, PTENβ, by using an upstream AUU start codon (Liang et al. 2017). In addition to requiring a favorable Kozak sequence, efficient PTENβ translation requires an evolutionarily conserved hairpin that begins 18 nt downstream from the AUU. Removal of this hairpin from reporter mRNAs eliminated initiation from the PTENβ start codon but had no effect on the downstream canonical AUG start codon (Liang et al. 2017). Non-AUG translation at ACG, GUG, and GCG start codons in the FMR1 5′ leader likewise strongly increases when the downstream CGG repeat forms highly stable hairpins and G-quadruplexes (Kearse et al. 2016). A recent genome-wide analysis shows that mRNA secondary structures are commonly present downstream from non-AUG start codons and AUG codons in poor contexts, suggesting that structure may commonly be used to bolster translation initiation from weak initiation sites (Lee et al. 2012).

Met-tRNAiMet is generally, but not always, used for initiation

If a non-AUG start codon is used for initiation, what aminoacyl-tRNA (aa-tRNA) is used? This appears to depend on both the start codon and the type of initiation mechanism used. The canonical initiation pathway used by most genes uses eIF2 to deliver Met-tRNAiMet to the P site of the ribosome (Fig. 2). Given that eIF2 has high specificity for Met-tRNAiMet and does not significantly bind other tRNAs (Kolitz and Lorsch 2010), initiation using eIF2 should use Met-tRNAiMet exclusively. Indeed, early in vitro work using [35S]-labeled Met-tRNAiMet demonstrated that reporter mRNAs harboring a CUG, GUG, ACG, UUG, AGG, AAG, AUA, AUC, or AUU start codon all gave rise to full-length proteins containing [35S]-labeled methionine, strongly suggesting that non-AUG translation can use Met-tRNAiMet for initiation (Peabody 1989). Recent mass spectrometry data further support Met-tRNAiMet usage for initiation at ACG and AUU codons (Liang et al. 2017; Sellier et al. 2017). Although translation initiation generally requires eIF2 to deliver Met-tRNAiMet, it is now recognized that at least two other eIFs, eIF2A and eIF2D, can also be used for initiation at non-AUG codons (Fig. 3). Interestingly, eIF2A (not to be confused with eIF2α) can initiate translation in a GTP-independent manner and has a relaxed stringency for binding tRNAs, as it can directly bind the charged and uncharged forms of tRNAiMet with equal affinity in vitro (Merrick and Anderson 1975; Zoll et al. 2002; Komar et al. 2005; Kim et al. 2011; Starck et al. 2012). eIF2A also may be able to bind and deliver Leu-tRNA for initiation at CUG and UUG codons, although the exact affinity of eIF2A for charged tRNALeu has not yet been measured (Starck et al. 2012, 2016; Liang et al. 2014; Sendoel et al. 2017). It thus remains possible that eIF2A may simply promote the recruitment of Leu-tRNA to the 40S ribosomal subunit rather than directly bind it for delivery. Using an ELISA (enzyme-linked immunosorbent assay)-based assay to detect small reporter peptides encoded by either an AUG or CUG start codon, Starck et al. (2012) surprisingly showed that a CUG start codon can be initiated, most likely by eIF2A, as much as ∼40% more efficiently with Leu-tRNACUG than with Met-tRNAiMet. Indeed, depletion of eIF2A reduced initiation at the CUG start codon but had no effect on initiation at AUG codons (Starck et al. 2012). Nevertheless, eIF2A appears to have only a highly specialized role in translation, as the eIF2A mouse knockout has no apparent phenotype (Golovko et al. 2016). Considering that eIF2A is up-regulated upon stress and, in some cases, can function in IRES-mediated initiation (Komar et al. 2005; Kim et al. 2011; Starck et al. 2016; Kwon et al. 2017), it is possible that eIF2A may help enable non-AUG translation to become favored when canonical translation (which normally outcompetes non-AUG translation) is down-regulated.
Figure 3.

Canonical and alternative initiator tRNAi-binding eIFs are differentially regulated and have different tRNAi-binding stringency. (Left) Canonical translation uses eIF2 in a GTP-dependent manner to deliver the canonical Met-tRNAiMet to the P site of the 40S ribosomal subunit. This allows initiation at AUG start codons as well as at non-AUG start codons, albeit at a reduced efficiency. (Right) eIF2A and eIF2D are somewhat similar to eIF2 but are GTP-independent and have the capability to bind both charged and uncharged forms of tRNAiMet. eIF2A can also use Leu-tRNACUG and initiate at CUG and UUG start codons in some mRNAs in vitro and in vivo. eIF2D, which regulates reinitiation, can initiate at AUG codons in vitro as well as use other aa-tRNAs for initiation in a selective manner.

Canonical and alternative initiator tRNAi-binding eIFs are differentially regulated and have different tRNAi-binding stringency. (Left) Canonical translation uses eIF2 in a GTP-dependent manner to deliver the canonical Met-tRNAiMet to the P site of the 40S ribosomal subunit. This allows initiation at AUG start codons as well as at non-AUG start codons, albeit at a reduced efficiency. (Right) eIF2A and eIF2D are somewhat similar to eIF2 but are GTP-independent and have the capability to bind both charged and uncharged forms of tRNAiMet. eIF2A can also use Leu-tRNACUG and initiate at CUG and UUG start codons in some mRNAs in vitro and in vivo. eIF2D, which regulates reinitiation, can initiate at AUG codons in vitro as well as use other aa-tRNAs for initiation in a selective manner. Like eIF2A, eIF2D (sometimes referred to as ligatin [LGTN]) can also deliver Met-tRNAiMet (as well as the uncharged form) in a GTP-independent manner to the P site of the 40S ribosomal subunit (Fig. 3; Dmitriev et al. 2010; Skabkin et al. 2013; Zinoviev et al. 2015; Weisser et al. 2017). eIF2D has a large role in reinitiation and ribosome recycling in vivo but can also function in initiation at non-AUG start codons in vitro in a somewhat selective manner. For example, it was shown that eIF2D can initiate at a GUG codon using Val-tRNAGUG on the Sindbis virus 26S mRNA but not on the hepatitis C virus (HCV) IRES (Dmitriev et al. 2010; Skabkin et al. 2010). Interestingly, eIF2D was identified as an autoantigen in human hepatocellular carcinoma patients (Wang et al. 2002), suggesting a possible functional role for this initiation factor in tumor development. It should also be noted that the MCT-1/DENR heterodimer (which is homologous to the N-terminal and C-terminal domains of eIF2D, respectively) has also been shown to regulate reinitiation and ribosome recycling (Reinert et al. 2006; Schleich et al. 2014, 2017; Janich et al. 2015; Haas et al. 2016). While in vitro experiments have shown that MCT-1/DENR can promote eIF2-independent recruitment of Met-tRNAiMet to the P site of the 40S ribosomal subunit on viral mRNAs (Skabkin et al. 2010, 2013), it is not yet clear whether it can bind/recruit other aa-tRNAs and/or initiate at non-AUG start codons. In some cases, such as the cricket paralysis virus (CrPV) IRES (Jan and Sarnow 2002; Jan et al. 2003; Pestova and Hellen 2003), Taura syndrome virus (TSV) IRES (Cevallos and Sarnow 2005; Koh et al. 2014), and the Plautia stali intestine virus (PSIV) IRES (Sasaki and Nakashima 2000), an aa-tRNA in the P site is not required for initiation. These viral RNA sequences instead fold into structures that directly bind the 40S ribosomal subunit independent of any eIFs or an aa-tRNA in the P site. Upon formation of the 80S•IRES complex, a pseudoknot of the IRES is localized in the P site on the small subunit in a manner mimicking Met-tRNAiMet and the AUG codon. This conformation allows the cognate aa-tRNA to be delivered into the A site, thus forming the first amino acid of the nascent polypeptide and initiating translation (Fernandez et al. 2014; Koh et al. 2014). So far, this mode of initiation has been reported only for this class of viral IRES elements, but these results clearly demonstrate that translation initiation can occur without the canonical Met-tRNAiMet.

Usage of non-AUG initiation codons changes during development and upon stress

Ribosome profiling enables snapshots of translation to be taken, thereby revealing insights into how endogenous genes, including those containing non-AUG start codons, are regulated in response to developmental or external cues. Highlighting this approach, Brar et al. (2012) generated >25 ribosome profiling libraries to reveal temporal changes in translation as S. cerevisiae sporulate and transition from exponential vegetative growth to meiotic growth. As expected, there were broad changes in gene expression at both the mRNA and translation levels, with some genes displaying stage-specific meiotic translation. Furthermore, a number of previously unannotated translation events, including from antisense RNAs, intergenic regions, and uORFs, were found to be specific to meiotic cells. Only 5% of ribosome footprints mapped outside annotated ORFs in vegetative cells, but this number increased to a staggering ∼30% in meiotic cells (Brar et al. 2012). This suggests that there may be a profound shift in meiotic cells away from canonical translation, although functional roles (if any) for the majority of these novel ORFs remain unknown. Many (but certainly not all) of the translated uORFs in meiotic cells use non-AUG start codons. uORFs have generally been thought to play an inhibitory role and act competitively to prevent translation of the downstream AUG-encoded ORF (Sachs and Geballe 2006; Chew et al. 2016; Johnstone et al. 2016). Indeed, about half of the AUG-encoded uORFs were associated with translational repression of their downstream ORFs in meiosis (Brar et al. 2012). In stark contrast, translation of non-AUG-encoded uORFs was correlated primarily with increased translation of their downstream ORFs. The underlying mechanism responsible for this positive correlation remains unknown, but it is possible that these non-AUG uORFs may directly prime translation of their downstream ORF, or, alternatively, the uORF-encoded polypeptides may have functional roles (Brar et al. 2012). Given that nearly 50% of RNA polymerase II transcription is devoted to production of ribosomal proteins when yeast are grown under vegetative conditions (Warner 1999), it is tempting to speculate that changes in ribosomal protein (or eIF) stoichiometry and/or post-translational modifications may be responsible for the drastic changes in translation patterns observed in meiotic cells. Consistent with this idea, it has been shown recently that ribosome composition and ribosome protein stoichiometry can influence translation of particular classes of mRNAs (Xue et al. 2015; Shi et al. 2017; Simsek et al. 2017). Upon experiencing stress, eukaryotic cells generally respond by down-regulating translation initiation, thereby saving their metabolic energy and limiting the production of what could be deleterious or toxic proteins (Fulda et al. 2010; Pakos-Zebrucka et al. 2016). Multiple stress conditions (e.g., ER or oxidative stress) converge on the inhibitory phosphorylation of eIF2α. Under these conditions, eIF2 cannot exchange GDP for GTP and thus cannot bind Met-tRNAiMet for a subsequent round of initiation. This causes nearly all translation initiation events to become strongly inhibited, but there are some mRNAs whose translation is increased—most notably genes with regulatory uORFs such as GCN4 (which has multiple AUG uORFs) (Hinnebusch 2005), ATF4 (which also has multiple AUG uORFs) (Wek et al. 2006), and BiP (Kaufman 1999; Starck et al. 2016). BiP is an ER chaperone protein that senses misfolded proteins, and thus induction of its expression is critical for helping restore ER homeostasis. Starck et al. (2016) showed recently that BiP translation is regulated by two non-AUG uORFs (UUG and CUG) that are up-regulated during stress and dependent on eIF2A. Knocking down eIF2A prior to ER stress led to decreased levels of BiP, which were further reduced when the UUG-encoded uORF was deleted. These results suggest that both eIF2A and the non-AUG uORFs are required for the increased expression of BiP during ER stress (Starck et al. 2016). As various stresses beyond ER stress lead to increases in eIF2A protein levels (Kim et al. 2011; Starck et al. 2016; Sendoel et al. 2017), induction of non-AUG translation appears to be a common stress response mechanism. In mammalian cells, heat shock causes translation of MRPL18 (which encodes a mitochondrial large ribosomal subunit protein) to no longer initiate at the annotated AUG start codon but instead initiate at a downstream CUG codon (Zhang et al. 2015) (Fig. 4). The truncated MRPL18 protein that is generated lacks the N-terminal mitochondrial targeting signal and thus is not incorporated into mitochondrial ribosomes. Instead, the truncated MRPL18 protein is incorporated into cytoplasmic ribosomes. These newly defined “hybrid” ribosomes are functional and are required for increased synthesis of the Hsp70 chaperone protein during heat shock (Zhang et al. 2015). This is perhaps because the presence of MRPL18 permits directed ribosome recruitment to Hsp70 mRNA, similar to what has been reported previously for RpL38 and the Hox genes (Xue et al. 2015). Regardless of the exact underlying mechanism, the MRPL18 locus beautifully demonstrates how changes in the efficiency of non-AUG start codon usage can have a profound impact on cell survival and homeostasis.
Figure 4.

Heat shock causes the production of a CUG-initiated MRPL18 protein that becomes incorporated into cytoplasmic ribosomes. (Top) Under normal growth conditions, the MRPL18 protein is synthesized from a canonical AUG start codon and includes the mitochondrial targeting signal (MTS; pink), which enables transport into mitochondria and subsequent incorporation into mitochondrial ribosomes. (Bottom) Heat shock causes a switch in the preferred start codon, resulting in initiation at a downstream CUG in the MRPL18 mRNA. The truncated MRPL18 (cyto) protein isoform lacks the MTS and is instead incorporated into cytoplasmic ribosomes, creating “hybrid ribosomes” that are required for increased Hsp70 mRNA translation during heat shock.

Heat shock causes the production of a CUG-initiated MRPL18 protein that becomes incorporated into cytoplasmic ribosomes. (Top) Under normal growth conditions, the MRPL18 protein is synthesized from a canonical AUG start codon and includes the mitochondrial targeting signal (MTS; pink), which enables transport into mitochondria and subsequent incorporation into mitochondrial ribosomes. (Bottom) Heat shock causes a switch in the preferred start codon, resulting in initiation at a downstream CUG in the MRPL18 mRNA. The truncated MRPL18 (cyto) protein isoform lacks the MTS and is instead incorporated into cytoplasmic ribosomes, creating “hybrid ribosomes” that are required for increased Hsp70 mRNA translation during heat shock.

Non-AUG translation events can drive cancer progression

Alternative protein isoforms derived from non-AUG start codons have also been found to be generated from numerous cancer-relevant genes and act to either promote (e.g., fibroblast growth factor 2 [FGF2]) or inhibit (e.g., c-myc) cancer growth (Hann et al. 1988; Hann 1994; Touriol et al. 2003; Ivanov et al. 2011). For example, FGF2 controls cell proliferation, differentiation, and angiogenesis (Turner and Grose 2010), and multiple protein isoforms can be generated from this gene via the use of alternative start codons. Whereas the canonical AUG start codon generates an 18-kDa protein isoform that is mostly cytoplasmic or secreted, at least four upstream in-frame CUG codons can also be used to generate longer isoforms that localize to the nucleus (Renko et al. 1990; Bugler et al. 1991; Arnaud et al. 1999). Intriguingly, the CUG-encoded protein isoforms are unique to transformed cells (Vagner et al. 1996). Supporting their functional relevance, ectopic expression of these CUG-encoded isoforms caused cell immortalization in culture and increased the tumorigenic properties of these cells when they were injected into mice (Couderc et al. 1991; Quarto et al. 1991). The c-myc proto-oncogene likewise regulates cell proliferation and transformation. Two protein isoforms can be generated from c-myc through the usage of different start codons: c-myc 2 (p64) uses a canonical AUG start codon, whereas c-myc 1 (p67) uses an upstream in-frame CUG codon (Hann et al. 1988). At low cell density, the CUG-encoded c-myc 1 protein is only 10%–15% as abundant as the AUG-encoded c-myc 2. However, as cell density increases, the availability of amino acids, specifically methionine, becomes limiting, and the CUG-encoded c-myc 1 isoform becomes preferentially translated (Hann et al. 1992). The underlying mechanism is not fully clear, but this reduction in methionine availability may cause activation of the GCN2 kinase, reduction in TC levels, and/or a reduction in charged Met-tRNA levels, leading to a change in start codon usage. Once generated, the unique N terminus of c-myc 1 causes this isoform to have additional DNA-binding capabilities that allow it to regulate genes that inhibit cell growth (Hann et al. 1994). Indeed, overexpression of this CUG-encoded isoform (but not the AUG-encoded isoform) inhibits growth of cultured cells. Inactivation of c-myc 1 is observed in some tumor cells (e.g., in human Burkitt's lymphomas) (Hann and Eisenman 1984; Hann et al. 1988), suggesting that the inability to generate this CUG-encoded protein may provide a selective growth advantage. The mechanisms by which individual non-AUG translation events become activated are still poorly understood, but there are likely global changes in start codon fidelity in cancer cells. Many eIFs, including the cap-binding protein eIF4E (Mamane et al. 2004), are often misregulated in cancers (for review, see Spilka et al. 2013; Chu et al. 2016), and imbalances of eIFs are known to influence overall initiation fidelity and start codon choice in various systems (as discussed above). Perhaps the strongest correlation between non-AUG translation and cancer comes from recent work in squamous cell carcinomas (SCCs) (Sendoel et al. 2017). Sendoel et al. (2017) identified eIF2A as an essential factor in tumor initiation that promotes the translation of many non-AUG uORFs that act to positively regulate expression of their downstream cancer-related ORFs (Fig. 5). Remarkably, deleting eIF2A protected mice from tumors. In stark contrast, depletion of the canonical eIF2 initiation factor largely had no effect on oncogenic growth but did affect the growth of normal cells (Sendoel et al. 2017). As discussed earlier (Fig. 3), eIF2A can initiate at CUG and UUG start codons (both encode Leu) and deliver/promote the recruitment of Leu-tRNACUG for initiation (Starck et al. 2012, 2016; Liang et al. 2014). Considering that the eIF2A locus is amplified in 29% of patients with lung SCC, 15% of patients with head and neck SCC, and 15% of patients with oesophageal carcinoma (Sendoel et al. 2017), it appears likely that misregulation of eIF2A and non-AUG uORFs may play a more general role in cancer development.
Figure 5.

eIF2A regulates translation of non-AUG uORFs in SCCs. (Top) In normal cells, translation of non-AUG uORFs is limited, and downstream ORFs are translated at basal levels. (Bottom) In some SCCs, cell stress and/or gene duplication events cause elevated eIF2A levels. This results in increased expression of non-AUG uORFs in oncogenic mRNAs (e.g., Ctnnb1) and subsequent up-regulation of the downstream oncoprotein ORFs. Upon depleting eIF2A in SCC models, premalignant and malignant progression is greatly reduced.

eIF2A regulates translation of non-AUG uORFs in SCCs. (Top) In normal cells, translation of non-AUG uORFs is limited, and downstream ORFs are translated at basal levels. (Bottom) In some SCCs, cell stress and/or gene duplication events cause elevated eIF2A levels. This results in increased expression of non-AUG uORFs in oncogenic mRNAs (e.g., Ctnnb1) and subsequent up-regulation of the downstream oncoprotein ORFs. Upon depleting eIF2A in SCC models, premalignant and malignant progression is greatly reduced.

Repeat-associated non-AUG (RAN) translation is associated with many neurodegenerative diseases

A number of neurological diseases, so-called nucleotide repeat disorders, are caused by the expansion of nucleotide repeats (Orr and Zoghbi 2007; Almeida et al. 2013). This includes Huntington's disease, which is caused by expansion of a CAG repeat sequence in the coding sequence of the Huntingtin (HTT) gene, and the fragile X disorders, which are due to CGG repeat expansions in the 5′ leader (untranslated region) of the FMR1 gene. Expansion of these repeats can cause RAN translation (Green et al. 2016; Cleary and Ranum 2017) as well as other effects, such as protein loss of function, protein gain of function, and/or RNA gain of function (Orr and Zoghbi 2007; Todd and Paulson 2010; Almeida et al. 2013). In all known cases of RAN translation, non-AUG start codons are used to initiate translation of repeat proteins in multiple reading frames from the expanded nucleotide repeat itself. Once generated, these translation products appear to be toxic to cells in multiple ways, as they impair the ubiquitin-proteasome system, alter ribosomal RNA synthesis, and block nuclear import (Kwon et al. 2014; Oh et al. 2015; Yamakawa et al. 2015; Zhang et al. 2016; Cleary and Ranum 2017). Toxic polypeptides generated by RAN translation were first identified from expanded CAG repeats in ATXN8 and DMPK as well as CUG repeats in ATXN8OS (Zu et al. 2011). Subsequent work from many laboratories has now shown that RAN translation produces neurotoxic proteins from expanded GGGGCC and CCCCGG repeats from the C9orf72 locus (associated with amyotrophic lateral sclerosis and frontotemporal dementia [ALS/FTD]), CGG and CCG repeats from the FMR1 locus (associated with fragile X-associated tremor/ataxia syndrome [FXTAS]), CAG repeats from the HTT locus (associated with Huntington's disease), and UGGAA repeats from the SCA31 locus (associated with spinocerebellar ataxia type 31) (Ash et al. 2013; Mori et al. 2013; Todd et al. 2013; Zu et al. 2013; Banez-Coronel et al. 2015; Krans et al. 2016; Ishiguro et al. 2017). A better understanding of how RAN translation occurs has come from recent work focused on the CGG repeats in the 5′ leader of FMR1 (Fig. 6). A series of reporters was generated with the luciferase ORF placed downstream from a FMR1 5′ leader that contained either normal or disease-associated lengths of CGG repeats (Kearse et al. 2016). Upon mutating the start codon of luciferase to eliminate canonical translation of the luciferase ORF, only the reporters with expanded repeats produced polypeptides through RAN translation. These RAN-translated proteins were generated from multiple reading frames in vitro, in HeLa cells, and in cultured primary neurons, although at orders of magnitude less efficiently compared with a canonical luciferase reporter. In the case of FMR1 and CGG repeats, RAN translation is 5′ cap-, eIF4E-, and eIF4A-dependent and therefore resembles the scanning mechanism used in canonical translation except that either an ACG, GUG, or GCG codon is used for initiation (Kearse et al. 2016). Subsequent mass spectrometry confirmed that this ACG codon is used for initiation (incorporating methionine) when the endogenous FMR1 mRNA contains an expanded repeat (Sellier et al. 2017). RAN translation at expanded CAG repeats from the SCA3, SCA8, DM1, HD, and HDL2 disease loci is likewise initiated at upstream AUU or AUC codons in vitro, and, at least for AUU, [35S]-labeled Met-tRNAiMet was incorporated (Zu et al. 2011).
Figure 6.

RAN translation at CGG repeats in FMR1 produces neurotoxic proteins from multiple reading frames. The fragile X disorders are caused by a CGG nucleotide repeat expansion in the 5′ leader of the FMR1 gene, which encodes FMRP. Unaffected patients have a mean repeat length of ∼30 (top), whereas fragile X disorder patients have expanded repeats of >55 (bottom), and there is a correlation between repeat size and disease severity. CGG repeat expansion stimulates a noncanonical form of protein synthesis (called RAN translation) in multiple reading frames that uses a 5′ cap-, eIF4E-, and eIF4A-dependent scanning mode of initiation. RAN translation of the CGG repeat produces two toxic homopolymeric proteins from separate reading frames: (1) a polyglycine protein (FMRpolyG) that initiates at ACG and GUG codons upstream of the repeat in the +1 reading frame and (2) a polyalanine protein (FMRpolyA) that most likely initiates within the repeat at a GCG codon from the +2 reading frame.

RAN translation at CGG repeats in FMR1 produces neurotoxic proteins from multiple reading frames. The fragile X disorders are caused by a CGG nucleotide repeat expansion in the 5′ leader of the FMR1 gene, which encodes FMRP. Unaffected patients have a mean repeat length of ∼30 (top), whereas fragile X disorder patients have expanded repeats of >55 (bottom), and there is a correlation between repeat size and disease severity. CGG repeat expansion stimulates a noncanonical form of protein synthesis (called RAN translation) in multiple reading frames that uses a 5′ cap-, eIF4E-, and eIF4A-dependent scanning mode of initiation. RAN translation of the CGG repeat produces two toxic homopolymeric proteins from separate reading frames: (1) a polyglycine protein (FMRpolyG) that initiates at ACG and GUG codons upstream of the repeat in the +1 reading frame and (2) a polyalanine protein (FMRpolyA) that most likely initiates within the repeat at a GCG codon from the +2 reading frame. It is still largely unclear why expansion of nucleotide repeats beyond a critical threshold is required for RAN translation. It may be that the expanded repeat RNA forms a specific structure that binds trans-acting factors. Indeed, expanded repeats are known to be able to sequester RNA-binding proteins (Todd and Paulson 2010). It is also possible that the complex secondary structure of the expanded repeat (Cammas and Millevoi 2017) is directing initiation by altering ribosome scanning or recruitment. It is noteworthy that IRES-mediated mechanisms do not appear to account for RAN translation.

Targeting non-AUG translation may represent a promising therapeutic strategy

Small molecule compounds that inhibit protein synthesis not only have provided important insights into how the translation machinery functions but have great potential in the treatment of human diseases that are marked by dysregulated translation, including cancer (Lindqvist and Pelletier 2009; Blanchard et al. 2010; Garreau de Loubresse et al. 2014; Bhat et al. 2015). A key issue, however, is the need for sufficient specificity so that only deleterious translation events are targeted. Most common protein synthesis inhibitors target either the ribosome itself (e.g., cycloheximide, emetine, or anisomycin) or the eIFs (e.g., hippuristanol and Rocaglamide A [RocA]) (Bordeleau et al. 2006; Lindqvist et al. 2008; Iwasaki et al. 2016). Drugs that bind the ribosome itself should inhibit all forms of cytoplasmic translation, whereas hippuristanol and RocA should affect only eIF4A- and scanning-dependent initiation events (note that RocA further requires GA motifs for activity). Nevertheless, most cellular translation is scanning-dependent, and thus these drugs have rather limited specificity. Considering that non-AUG translation events can be regulated quite differently from canonical AUG start codons and that decreasing non-AUG translation by knocking out eIF2A reduced cancer progression (Sendoel et al. 2017), it has been proposed that pharmacologically modulating noncanonical initiation codons may represent a novel therapeutic strategy. In an effort to identify such inhibitors, Takacs et al. (2011) screened >55,000 natural compounds in yeast for effects on a dual-luciferase reporter that expressed UUG-encoded firefly luciferase and AUG-encoded renilla luciferase. This effort resulted in the identification of two structurally similar quinolone-containing compounds (NSC218351 and NSC92218), which increased translation of the UUG-encoded reporter and concomitantly decreased translation of the AUG-encoded reporter (Takacs et al. 2011). Using the same dual-luciferase reporter but with other non-AUG start codons, it was determined that both compounds also result in increased initiation from CUG, GUG, AUA, AUC, and ACG (but not AUU) start codons (Takacs et al. 2011). The specific mechanisms of action of NSC218351 or NSC92218 are not yet fully understood, but genetic and biochemical experiments have provided important clues (Takacs et al. 2011). Overexpression of eIF1 suppressed the effects of NSC218351 and NSC92218, whereas haploinsufficiency of eIF1, eIF1A, or eIF5 in diploid yeast caused increased sensitivity. These results strongly suggest that the compounds interact with the PIC. However, there was no change in eIF1 affinity for the 40S subunit or in the rate of release of eIF1 upon start codon recognition when either compound was present. Further work is required to clarify the detailed mechanism as well as determine whether these compounds are effective in mammalian cells. As translation initiation at CUG codons can use either Met-tRNAiMet (bound by eIF2) or Leu-tRNACUG (bound by eIF2A), it is perhaps not surprising that CUG initiation is selectively resistant against a few inhibitors (Starck et al. 2008, 2012). For example, NSC119893, which inhibits Met-tRNAiMet association with eIF2, inhibited only AUG, but not CUG, initiation in vivo (Starck et al. 2012). Likewise, bruceantin, which binds the P site of the 60S subunit where Met-tRNAiMet binds, was shown in vitro to inhibit AUG, but not CUG, initiation from a short reporter mRNA (Starck et al. 2008). Edeine, which binds the E site of the eukaryotic 40S subunit to interfere with start codon recognition and subunit joining, inhibited AUG initiation by ∼90% but CUG initiation by only ∼25% in vitro (Starck et al. 2008). This suggests that CUG initiation, at least in vitro, may be partially independent of the canonical scanning mechanism and that it may be possible to specifically modulate these events while not affecting AUG translation. The underlying mechanism of edeine is not clear but could be due to Leu-tRNACUG binding in a unique conformation within the ribosome. The nucleic acid intercalating agent acriflavine appears to inhibit CUG initiation more efficiently than AUG initiation in vitro and in vivo (Starck et al. 2008, 2012). Unfortunately, the exact mechanism of action is not known. Acriflavine has been reported to inhibit the aminoacylation of tRNAs in vitro (Birkmayer and Balda 1970; Novac et al. 2004), but it is not immediately obvious how such an activity could make the drug more specific toward CUG initiation. Whether acriflavine inhibits initiation at other non-AUG start codons has not been tested, but this molecule could be a valuable starting point for developing a more potent CUG/non-AUG-specific inhibitor. Inhibitors to RAN translation at GGGGCC (causes ALS/FTD) and CGG (causes FXTAS) nucleotide repeats are perhaps the most medically promising inhibitors of non-AUG translation because of their noted selectivity. Both of these expanded repeats form very stable hairpins and G quadruplexes (Cammas and Millevoi 2017), and small molecule compounds that bind the repeats caused marked improvements in a number of molecular and cellular phenotypes (Disney et al. 2012; Su et al. 2014). For example, these compounds inhibited production of RAN translation proteins, although the effects were not equal across all reading frames of the GGGGCC repeat or antisense CCCCGG repeat (Su et al. 2014). Most importantly, canonical translation of reporter mRNAs in cells appeared unaffected by these compounds (Su et al. 2014). As these compounds increase the melting temperature of the expanded repeats in vitro, RAN translation may be inhibited because the ribosome cannot elongate through the stabilized G-rich repeat secondary structures. However, it remains equally plausible that the effects may occur at the initiation stage, as the stabilized secondary structure could sterically prevent recognition of the non-AUG start codons (e.g., by placing the P site of the PIC over a start codon in a poor context).

Perspectives and concluding remarks

It is now clear that multiple alternative initiation mechanisms can be used in eukaryotic cells. These include IRES-mediated initiation, eIF3-directed cap binding, reinitiation, the use of alternative eIFs to bind tRNAi (Fig. 3), the recruitment of the ribosome via N6-methyladenosine (m6A) mRNA modifications, and the usage of non-AUG start codons (Hellen and Sarnow 2001; Touriol et al. 2003; Ivanov et al. 2011; Starck et al. 2012; Skabkin et al. 2013; Wang et al. 2015; Zhou et al. 2015; Zinoviev et al. 2015; Lee et al. 2016; Johnson et al. 2017). Especially in certain cell states (e.g., during meiosis or cell stress), ribosomes often initiate from near-cognate codons to give rise to functional proteins that regulate key cellular processes, thereby contributing to protein diversity (Fig. 4; Ivanov et al. 2011; Crosby et al. 2015). Thus, it is not surprising that misregulation of non-AUG translation can stimulate cancer malignancy and neurodegeneration. Nevertheless, one should keep in mind that only a handful of non-AUG translation events have yet to be orthogonally validated or characterized in detail. It still remains very possible that some initiation events that have been identified in genome-wide approaches, such as ribosome profiling, may be very rare or inconsequential. Western blot analysis and/or mass spectrometry experiments can provide strong confirmatory support for novel ORFs, but proteins that are expressed at very low levels may not be detected with current technologies. This is not to say that such proteins are not potentially biologically relevant, as some cellular factors (e.g., some microRNA and transcription factors) can be found at only a few molecules per cell and still have important functional roles (Zenklusen et al. 2008; Vaquerizas et al. 2009; Parasramka et al. 2012; Marinov et al. 2014; Denzler et al. 2016). As non-AUG initiation events are influenced by both the translation machinery and the mRNA sequence/structure, an intriguing question is whether we can accurately predict non-AUG translation events in any given transcript. In silico tools using algorithms that are based on ribosome profiling data sets are now available to help predict true non-AUG start codons (Reuter et al. 2016), and the inclusion of additional data, including secondary structure predictions (Kochetov et al. 2007) and evolutionary conservation (Ivanov et al. 2011, 2017), likely can improve accuracy. However, as noted in the multiple examples that were highlighted throughout this review, cellular conditions (e.g., stress) and disease mutations significantly influence start codon usage. Unfortunately, it is difficult to account for these features in prediction software, but this may become more feasible as additional new insights into the underlying molecular mechanisms of non-AUG translation are revealed. In summary, it is now clear that translation can initiate at non-AUG codons across eukaryotes to impact both normal and disease biology. Recent work has revealed many surprises, including the ability of Leu-tRNACUG to be used for initiation and the critical role that RAN translation plays in neurodegenerative diseases. Going forward, it will be of great interest to further define exactly how the various alternative initiation pathways become specifically induced or repressed, depending on cell state. This will likely reveal novel insights into how non-AUG translation is used to expand the functional coding potential of our genome. Furthermore, such studies may also suggest novel therapeutic strategies that can be used to treat a variety of diseases that are marked by dysregulated non-AUG translation events.
  182 in total

1.  Factor-independent assembly of elongation-competent ribosomes by an internal ribosome entry site located in an RNA virus that infects penaeid shrimp.

Authors:  Randal C Cevallos; Peter Sarnow
Journal:  J Virol       Date:  2005-01       Impact factor: 5.103

2.  Regulatory elements in eIF1A control the fidelity of start codon selection by modulating tRNA(i)(Met) binding to the ribosome.

Authors:  Adesh K Saini; Jagpreet S Nanda; Jon R Lorsch; Alan G Hinnebusch
Journal:  Genes Dev       Date:  2010-01-01       Impact factor: 11.361

3.  N(6)-methyladenosine Modulates Messenger RNA Translation Efficiency.

Authors:  Xiao Wang; Boxuan Simen Zhao; Ian A Roundtree; Zhike Lu; Dali Han; Honghui Ma; Xiaocheng Weng; Kai Chen; Hailing Shi; Chuan He
Journal:  Cell       Date:  2015-06-04       Impact factor: 41.582

4.  Interactions of eukaryotic translation initiation factor 3 (eIF3) subunit NIP1/c with eIF1 and eIF5 promote preinitiation complex assembly and regulate start codon selection.

Authors:  Leos Valásek; Klaus H Nielsen; Fan Zhang; Christie A Fekete; Alan G Hinnebusch
Journal:  Mol Cell Biol       Date:  2004-11       Impact factor: 4.272

Review 5.  Regulation and function of non-AUG-initiated proto-oncogenes.

Authors:  S R Hann
Journal:  Biochimie       Date:  1994       Impact factor: 4.079

6.  Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling.

Authors:  Nicholas T Ingolia; Sina Ghaemmaghami; John R S Newman; Jonathan S Weissman
Journal:  Science       Date:  2009-02-12       Impact factor: 47.728

7.  Initiation of translation by cricket paralysis virus IRES requires its translocation in the ribosome.

Authors:  Israel S Fernández; Xiao-Chen Bai; Garib Murshudov; Sjors H W Scheres; V Ramakrishnan
Journal:  Cell       Date:  2014-05-01       Impact factor: 41.582

8.  Rocaglates convert DEAD-box protein eIF4A into a sequence-selective translational repressor.

Authors:  Shintaro Iwasaki; Stephen N Floor; Nicholas T Ingolia
Journal:  Nature       Date:  2016-06-15       Impact factor: 49.962

9.  Impact of MicroRNA Levels, Target-Site Complementarity, and Cooperativity on Competing Endogenous RNA-Regulated Gene Expression.

Authors:  Rémy Denzler; Sean E McGeary; Alexandra C Title; Vikram Agarwal; David P Bartel; Markus Stoffel
Journal:  Mol Cell       Date:  2016-10-27       Impact factor: 17.970

10.  A distinct translation initiation mechanism generates cryptic peptides for immune surveillance.

Authors:  Shelley R Starck; Yongkai Ow; Vivian Jiang; Maria Tokuyama; Mark Rivera; Xin Qi; Richard W Roberts; Nilabh Shastri
Journal:  PLoS One       Date:  2008-10-21       Impact factor: 3.240

View more
  109 in total

1.  Translation Initiation Site Profiling Reveals Widespread Synthesis of Non-AUG-Initiated Protein Isoforms in Yeast.

Authors:  Amy R Eisenberg; Andrea L Higdon; Ina Hollerer; Alexander P Fields; Irwin Jungreis; Paige D Diamond; Manolis Kellis; Marko Jovanovic; Gloria A Brar
Journal:  Cell Syst       Date:  2020-07-24       Impact factor: 10.304

Review 2.  uORF-mediated translational control: recently elucidated mechanisms and implications in cancer.

Authors:  Hung-Hsi Chen; Woan-Yuh Tarn
Journal:  RNA Biol       Date:  2019-06-24       Impact factor: 4.652

3.  A purely quantitative form of partial recessive IFN-γR2 deficiency caused by mutations of the initiation or second codon.

Authors:  Carmen Oleaga-Quintas; Caroline Deswarte; Marcela Moncada-Vélez; Ayse Metin; Indumathi Krishna Rao; Saliha Kanık-Yüksek; Alejandro Nieto-Patlán; Antoine Guérin; Belgin Gülhan; Savita Murthy; Aslınur Özkaya-Parlakay; Laurent Abel; Rubén Martínez-Barricarte; Rebeca Pérez de Diego; Stéphanie Boisson-Dupuis; Xiao-Fei Kong; Jean-Laurent Casanova; Jacinta Bustamante
Journal:  Hum Mol Genet       Date:  2018-11-15       Impact factor: 6.150

4.  Identification of residues critical for topology inversion of the transmembrane protein TM4SF20 through regulated alternative translocation.

Authors:  Jingcheng Wang; Lisa N Kinch; Bray Denard; Ching-En Lee; Elina Esmaeilzadeh Gharehdaghi; Nick Grishin; Jin Ye
Journal:  J Biol Chem       Date:  2019-02-26       Impact factor: 5.157

Review 5.  Understanding the Genetic Code.

Authors:  Milton H Saier
Journal:  J Bacteriol       Date:  2019-07-10       Impact factor: 3.490

6.  uORF-seqr: A Machine Learning-Based Approach to the Identification of Upstream Open Reading Frames in Yeast.

Authors:  Pieter Spealman; Armaghan Naik; Joel McManus
Journal:  Methods Mol Biol       Date:  2021

Review 7.  Peptides encoded by noncoding genes: challenges and perspectives.

Authors:  Shuo Wang; Chuanbin Mao; Shanrong Liu
Journal:  Signal Transduct Target Ther       Date:  2019-12-13

8.  Interrogation of Eukaryotic Stop Codon Readthrough Signals by in Vitro RNA Selection.

Authors:  Andrew V Anzalone; Sakellarios Zairis; Annie J Lin; Raul Rabadan; Virginia W Cornish
Journal:  Biochemistry       Date:  2019-02-13       Impact factor: 3.162

Review 9.  Molecular regulation of human skeletal muscle protein synthesis in response to exercise and nutrients: a compass for overcoming age-related anabolic resistance.

Authors:  Nathan Hodson; Daniel W D West; Andrew Philp; Nicholas A Burd; Daniel R Moore
Journal:  Am J Physiol Cell Physiol       Date:  2019-08-28       Impact factor: 4.249

Review 10.  A helicase links upstream ORFs and RNA structure.

Authors:  Eckhard Jankowsky; Ulf-Peter Guenther
Journal:  Curr Genet       Date:  2018-11-27       Impact factor: 3.886

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.