Literature DB >> 16186132

Evolutionary conservation suggests a regulatory function of AUG triplets in 5'-UTRs of eukaryotic genes.

Alexander Churbanov1, Igor B Rogozin, Vladimir N Babenko, Hesham Ali, Eugene V Koonin.   

Abstract

By comparing sequences of human, mouse and rat orthologous genes, we show that in 5'-untranslated regions (5'-UTRs) of mammalian cDNAs but not in 3'-UTRs or coding sequences, AUG is conserved to a significantly greater extent than any of the other 63 nt triplets. This effect is likely to reflect, primarily, bona fide evolutionary conservation, rather than cDNA annotation artifacts, because the excess of conserved upstream AUGs (uAUGs) is seen in 5'-UTRs containing stop codons in-frame with the start AUG and many of the conserved AUGs are found in different frames, consistent with the location in authentic non-coding sequences. Altogether, conserved uAUGs are present in at least 20-30% of mammalian genes. Qualitatively similar results were obtained by comparison of orthologous genes from different species of the yeast genus Saccharomyces. Together with the observation that mammalian and yeast 5'-UTRs are significantly depleted in overall AUG content, these findings suggest that AUG triplets in 5'-UTRs are subject to the pressure of purifying selection in two opposite directions: the uAUGs that have no specific function tend to be deleterious and get eliminated during evolution, whereas those uAUGs that do serve a function are conserved. Most probably, the principal role of the conserved uAUGs is attenuation of translation at the initiation stage, which is often additionally regulated by alternative splicing in the mammalian 5'-UTRs. Consistent with this hypothesis, we found that open reading frames starting from conserved uAUGs are significantly shorter than those starting from non-conserved uAUGs, possibly, owing to selection for optimization of the level of attenuation.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16186132      PMCID: PMC1236974          DOI: 10.1093/nar/gki847

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Translational efficiency of eukaryotic mRNAs depends on the structural features of the 5′-untranslated region (5′-UTR, or leader sequence) and the nucleotide sequence flanking the translation start codon (start codon context). Both experimental (1–5) and statistical (6–17) approaches have been used to reveal the characteristics of the 5′-UTRs that influence translation. In particular, 5′-UTR length (18), secondary structure (19) and the presence of AUG triplets upstream of the true translation start in mRNA, known as upstream AUGs (20) (hereinafter uAUGs; for the sake of uniformity, we designate start codons AUG disregarding the fact that it should be rendered as ATG when DNA sequences are considered), have been shown to affect the efficiency of translation. Although, in most cases, translation of eukaryotic mRNAs is initiated at the first AUG of the 5′-UTR, according to the scanning model of Kozak, many exceptions to this rule have been described (2,3,19–25). The context of the start codon is another important regulatory factor. A consensus sequence, (GCC)GCCRCCAUGG (where R = G or A), has been derived for the start codon context of mammalian mRNAs (6,8). The most critical sites near the start AUG codon (sAUG) are −3 (the A of the sAUG is position +1), usually occupied by a purine, and +4, typically occupied by a guanine (19,26). Other sites near the start codon seem to be less important, although the influence of nucleotides in different positions may vary from species to species (27–30). Generally, uAUGs decrease mRNA translation efficiency and may be considered strong negative translational regulatory signals (3,31,32), since the major mechanism of translation initiation in eukaryotes appears to be the scanning model, according to which eukaryotic ribosomes initiate translation at the 5′-proximal AUG codon (1,6). This model is supported by the observation that mammalian 5′-UTRs contain significantly fewer uAUGs than expected by chance, suggesting that purifying selection caused elimination of uAUGs in many 5′-UTRs (13,17). Nevertheless, a substantial fraction of 5′-UTRs contain uAUGs (13,33–35). The presence of uAUGs correlates with a ‘weak’ start codon context (low information content) and conversely, high information content of the start codon context is typical of short 5′-UTRs containing no uAUGs (13). This finding seems to be compatible with the possibility that uAUGs function as attenuators of translation initiation by engaging ribosomes in futile initiation and precluding them from reaching the sAUG. However, the alternative possibility remains that cDNAs with AUG-containing 5′-UTRs might represent mainly non-functional sequences, such as incorrectly processed transcripts and transcripts of pseudogenes, artifacts of cloning and coding region annotation, such that the purported uAUGs actually belong to misannotated 5′-portions of coding regions, or mRNAs of genes with multiple transcription start sites (2,13,19,31,32). Indeed, follow-up studies have shown that some cDNA sequences containing many AUGs in the 5′-UTR do not correspond to functional mRNAs (31,32,36). A natural approach to address the issue of the functional significance of uAUGs is to examine the evolutionary conservation of these triplets on genome scale. By comparing sequences of human, mouse and rat orthologous genes, we show here that in 5′-UTRs of mammalian cDNAs but not in 3′-UTRs or coding sequences, AUG is conserved to a significantly greater extent than any of the other 63 nt triplets. This observation can be hardly explained by annotation errors because pronounced excess of conserved uAUGs was detected in 5′-UTRs containing an in-frame stop codon and because many of the conserved AUGs are found in different frames, which is consistent with the location in non-coding sequences. A similar excess of conserved uAUGs was detected in yeast 5′-UTRs. These observations strongly suggest that at least some of the conserved uAUGs are functional elements of translation initiation regulation.

MATERIALS AND METHODS

Mammalian cDNA sequences were extracted from the Refseq database (16 773 human cDNAs, 19 777 mouse cDNAs and 21 178 rat cDNAs). All 5′-UTRs that showed statistically significant BLASTX (37) hits (E-value < 10−5) to the non-redundant protein database (National Center for Biotechnology Information, NIH, Bethesda) and, accordingly, were probable to represent that undetected coding sequences were removed from the datasets. In addition, the identity between the first 50 amino acids of the Refseq proteins encoded in the analyzed genes and the corresponding Swissprot entries were checked. All redundant 5′-UTR sequences were removed from the datasets. Two 5′-UTRs were considered redundant if their masked sequences (RepeatView, ) (38) produced a bidirectional BLAST hit with an E-value < 10−6. Of all redundant 5′-UTRs, only the shortest versions were retained to exclude cloning artifacts and unspliced introns. Probable one-to-one orthologs were identified by comparing symmetrical best hits (39) between proteins from the respective genomes using the BLASTP program (40). The nucleotide sequence alignments for identified orthologous pairs of cDNAs were produced using the BLASTN program (37). Only long pairwise alignments (high-scoring segment pairs) with length ≥50 bases and with a low expectation value (E-value < 10−10) were retained for analysis. This ensures the reliability of nucleotide sequence alignments since BLASTN is known to produce conservative alignments, at least when applied with restrictive cut-off values as done here (41,42). The sequence lists and alignments from this dataset are available at . Aligned sequences of orthologous genes from four yeast species, Saccharomyces cerevisiae, Saccharomyces paradoxus, Saccharomyces mikatae and Saccharomyces bayanus were extracted from the SGD database () (43). The alignment of start and stop codons was fixed in order to ensure the correct partitioning of 5′-UTR, the CDS (protein-coding sequence) and the 3′-UTR. Saccharomyces cDNA sequences were extracted from GenBank, and the reliability of annotation of the 5′-UTRs and CDS was assessed as described above. The evolutionary conservation was calculated as the fraction of triplets that are conserved in pairwise (for comparisons of mammalian sequences) or multiple (for comparisons of yeast sequences) alignments of 5′-UTRs.

RESULTS

We analyzed evolutionary conservation of uAUGs in pairwise alignments of humanmouse and mouserat orthologs. Conservative criteria were adopted to select reliable alignments for further analysis (Materials and Methods). It was found that 33% of the 2241 selected humanmouse 5′-UTR alignments and 38% of the 2312 mouserat 5′-UTR alignments contained at least one conserved uAUG. These fractions of uAUG-containing 5′-UTRs are consistent with previous observations (13,33–35). Examination of the evolutionary conservation of each of the 64 nt triplets in the aligned 5′-UTRs showed that AUG was the most conserved triplet in both humanmouse and mouserat comparisons (Figure 1A and B). Tables 1 and 2 compare the frequencies of conserved uAUGs with those of the five triplets that are permutations of AUG, in order to exclude any possible effect of nucleotide composition; in both cases, the fraction of conserved AUGs was markedly greater than those of the other triplets (P < 10−42 by the Fisher's exact test). In contrast, no excessive conservation of AUG compared with the other 63 triplets was detected in 3′-UTRs and CDSs (excluding the start AUG) of mammalian mRNAs (Figure 1C–F), indicating that AUG conservation was specific for 5′-UTRs. The conservation of uAUGs seemed to be spread throughout the 5′-UTRs: a significant excess of conserved uAUGs was observed both in the proximal (region −100: −1 nt) and the distal (region −200: −101 nt) parts of 5′-UTR alignments, without a significant enrichment in either (Supplementary Tables S1–S4).
Figure 1

Plots of nucleotide triplet conservation in mammalian cDNAs. (A) Human–mouse 5′-UTRs. (B) Mouse–rat 5′-UTRs. (C) Human–mouse CDS. (D) Mouse–rat CDS. (E) Human–mouse 3′-UTR. (F) Mouse–rat 3′-UTR.

Table 1

Preferential conservation of uAUGs in orthologous human and mouse 5′-UTRs

TrinucleotideTriplet present in human but not in mouse (%)Triplet conserved in human and mouse (%)Triplet present in mouse but not in human (%)
AUG8857
AGU196120
GUA245423
GAU176815
UAG215920
UGA166915

Fisher's exact test: 1266 conserved uAUGs, 218 non-conserved uAUGs, 5690 conserved AGU/GUA/GAU/UAG/UGAs, 3219 non-conserved AGU/GUA/GAU/UAG/UGAs and P = 1.4 × 10−66.

Table 2

Preferential conservation of uAUGs in mouse and rat orthologous 5′-UTRs

TrinucleotideTriplet present in mouse but not in rat (%)Triplet conserved in mouse and rat (%)Triplet present in rat but not in mouse (%)
AUG98110
AGU176518
GUA205723
GAU156916
UAG186220
UGA137215

Fishers's exact test: 1539 conserved uAUGs, 359 non-conserved uAUGs, 11 218 conserved AGU/GUA/GAU/UAG/UGAs, 5703 non-conserved AGU/GUA/GAU/UAG/UGAs and P = 2.9 × 10−42.

An issue of concern with this observation is that the excess of conserved uAUGs could be an artifact of erroneous annotation of coding regions, with the sequences annotated as 5′-UTR actually containing the upstream portion of the CDS, including the typically conserved authentic start AUG and, possibly, conserved methionine codons as well (36). This did not seem to be a major factor because the observed excess of conserved uAUG did not depend on whether we used Refseq annotations (Tables 1 and 2) or only those Refseq annotations that were consistent with the corresponding SwissProt annotations (Supplementary Tables S5 and S6). Nevertheless, to control such an artifact in a more direct fashion, we examined 5′-UTRs containing an upstream conserved stop codon(s) in the same frame with the sAUG (Supplementary Figure S1); 5′-UTRs containing in-phase stop codons are unlikely to represent undetected parts of the CDS. Figure 2 shows an example of a conserved in-frame stop codon in a humanmouse 5′-UTR alignment with two conserved uAUGs. The main open reading frame (ORF) encoding the α1 type III collagen proprotein cannot be extended in the 5′ direction because, in both human and mouse 5′-UTRs, there is an in-frame stop codon UGA. Thus, the start codon of this gene apparently is annotated correctly, and the conserved uAUGs are, indeed, located in the 5′-UTR (Figure 2). Analysis of the portions of 5′-UTRs located upstream of conserved in-frame stop codons (5′-UTR< stop) yielded results that were fully compatible with those for complete 5′-UTRs. Specifically, 19 and 22% of the humanmouse and mouserat alignments, respectively, contained conserved uAUGs, and there was a highly significant excess of conserved uAUGs compared with the five shuffled triplets (Tables 3 and 4).
Figure 2

A mammalian 5′-UTR with conserved uAUGs and an in-frame stop codon. The alignment of the 5′UTRs of human and mouse α1 type III collagen proprotein (GI numbers: 15 149 480 and 33 859 525, respectively). uAUGs are colored yellow, the collagen starting codon is colored green, and the open reading frames are colored grey. The protein-coding region cannot be extended in the 5′ direction because of the presence of a conserved in-frame UGA stop codon (dark gray).

Table 3

Preferential conservation of uAUGs in human–mouse stop-codon-bounded 5′-UTR alignments (5′-UTR

TrinucleotideTriplet present in human, but not in mouse (%)Triplet conserved in human and mouse (%)Triplet present in mouse, but not in human (%)
AUG9847
AGU176914
GUA206218
GAU177013
UAG157015
UGA167212

Fishers's exact test: 255 conserved uAUGs, 49 non-conserved uAUGs, 1124 conserved AGU/GUA/GAU/UAG/UGAs, 491 non-conserved AGU/GUA/GAU/UAG/UGAs and P = 1.52 × 10−7.

Table 4

Preferential conservation of uAUGs in mouse–rat stop-codon-bounded 5′-UTR alignments (5′-UTR

TrinucleotideTriplet present in mouse, but not in rat (%)Triplet conserved in mouse and rat (%)Triplet present in rat, but not in mouse (%)
AUG11809
AGU156718
GUA185725
GAU166816
UAG176221
UGA147016

Fishers's exact test: 499 conserved uAUGs, 124 non-conserved uAUGs, 3262 conserved AGU/GUA/GAU/UAG/UGAs, 1675 non-conserved AGU/GUA/GAU/UAG/UGAs and P = 3.53 × 10−13.

Further evidence of bona fide evolutionary conservation of uAUGs was obtained by analysis of the phase distribution of the conserved uAUGs (phase 0 is the frame of the start codon, and phases 1 and 2 are frames shifted by 1 or 2 nt, respectively) (Supplementary Figure S2). We found that humanmouse and mouserat alignments have a significant excess of conserved uAUGs in the same phase (0–0, 1–1 or 2–2 in Table 5). However, the fraction of conserved uAUGs in different phases was also substantial (0–1, 1–2 and so on; Table 5) which is not the expectation for protein-coding regions where frameshift mutations are, generally, not tolerated. It should be noted that BLASTN alignments are extremely conservative with respect to deletions/insertions, so the actual fraction of conserved uAUGs in different phases might be even greater. The phase distribution of the conserved uAUGs showed a significant excess of phase 1–1 and phase 2–2 uAUGs compared with phase 0–0 uAUGs (Table 5). This may reflect the greater chance for uORFs in phase 0–0 to merge with the main ORFs, thereby extending the main ORFs in the 5′ direction, which could be a mechanism of evolution of protein-coding regions yielding alternative translation starts (44,45). However, the possibility cannot be ruled out that, at least in part, the deficit of phase 0–0 is caused by the erroneous annotation of the 5′-terminal AUG as the start codon in a subset of mRNAs, resulting in depletion of phase 0 uAUGs. Indeed, analysis of the 5′-UTRuAUGs in the 5′-UTRuAUGs (compare the data in Tables 5 and 6) is probably owing to the lower sequence conservation and greater prevalence of indels in the distal parts of the 5′-UTRs (16). Thus, the phase distribution of the conserved uAUGs is best compatible with their localization in non-coding regions. Taken together, these results show that the observed exceptional conservation of uAUGs in the 5′-UTRs of mammalian genes is not an artifact and strongly suggests that the uAUGs are subject to purifying selection and hence have a function(s), most probably, related to the regulation of translation initiation.
Table 5

Phase distribution of conserved uAUGs in alignments of mammalian 5′-UTRs

Phase 0Phase 1Phase 2
Human/mouse
    Phase 02216072
    Phase 19428294
    Phase 25489350
Mouse/rat
    Phase 026788100
    Phase 16635093
    Phase 28199395
Table 6

Phase distribution of conserved uAUGs in stop-codon-bounded alignments of mammalian 5′-UTRs (5′-UTR

Phase 0Phase 1Phase 2
Human/mouse
    Phase 0502334
    Phase 1364222
    Phase 2303236
Mouse/rat
    Phase 0646056
    Phase 1596554
    Phase 2583772
Obviously, each uAUG starts an ORF, typically, a short one. It has been reported recently that uORFs are, on average, slightly but significantly shorter than random ORFs (17). Figure 3 compares the length distributions of uORFs starting from conserved uAUGs, non-conserved uAUGs and uGAUs (GAU triplet, a permutation of AUG, located in the 5′-UTRs).
Figure 3

Length distributions of human uORFs starting with conserved uAUGs, non-conserved uAUGs and pseudo-ORFs starting with uGAUs. The ORF length is represented by bins, each including 10 codons (i.e. bin 1 includes ORFs from 0 to 10 codons, bin 2 ORFs from 11 to 20 codons and so on).

A noticeable and statistically highly significant excess of very short uORFs was observed for conserved uAUGs: the length distribution of uORFs starting with conserved uAUGs differed from the other two distributions with P < 10−4 (χ2 test), and the excess of the shortest uORFs (0–20 codons) was significant at P < 10−6 (Fisher's exact test). The length distributions of uORF starting with conserved uAUG were very similar for all three phases (Supplementary Figure S3). In contrast, the distributions of uORF lengths starting from non-conserved uAUGs and GAUs were statistically indistinguishable (similar results were obtained with other permutations of AUG; data not shown). Conceivably, the uORFs that start from conserved uAUGs and may be engaged in the regulation of translation initiation were selected for short length optimization of the level of attenuation and prevent complete shutdown of initiation from the sAUG. There was no significant difference between the nucleotide contexts of the conserved and non-conserved uAUGs (data not shown); in both cases, the information content of the uAUG context was lower than that of the sAUG context (13,17). Thus, uORFs do not appear to be optimized for translation efficiency. Having shown that uAUGs are exceptionally conserved in 5′-UTRs of mammalian mRNAs, we sought to investigate how general this pattern might be. To this end, we examined the conservation of uAUGs in orthologous gene sets from four species of yeasts. Since precise mRNA mapping is unavailable for most yeast genes, we had to analyze, as a proxy for 5′-UTRs, genomic sequences (50 nt) located upstream of sAUGs codons of yeast genes. This analysis revealed a highly significant excess of conserved uAUGs compared with the permuted triplets (Table 7). This result was confirmed also with genomic alignments of 5′-UTRyeast cDNA sequences (Table 7). Furthermore, 11% of the 253 genomic alignments of 5′-UTRs for known yeast cDNA sequences contained uAUGs conserved in all four yeast species for which genome sequences are available (data not shown).
Table 7

Preferential conservation of uAUGs in yeast orthologous 5′-UTRs

TrinucleotideConserved triplets (%)Non-conserved triplets (%)
AUG3070
AUG (5′-UTR<stop)2872
AUG (5′-UTR<stop + cDNA)3664
AGU991
GUA793
GAU991
UAG1090
UGA1090

Triplets were considered when present in the aligned sequences from all four yeast species. The AUG data set consisted of alignments of genomic sequences located upstream (50 nt) of sAUGs. The 5′-UTR

DISCUSSION AND CONCLUSIONS

The comparative genomic analysis reported here shows that the 5′-UTRs of at least 20–30% of mammalian mRNAs contain conserved uAUGs; qualitatively similar results were obtained with yeast 5′-UTRs. Strikingly, in both mammals and yeasts, AUG is by far the most conserved nucleotide triplet in 5′-UTR but not in 3′-UTRs or coding regions. These findings suggest that at least a fraction of the conserved uAUGs perform a function related to the regulation of translation initiation. Compatible with this conclusion, we found that uORFs starting from conserved (and, potentially, functional) uAUGs are, on average, significantly shorter than those that start from non-conserved (probably, non-functional) AUGs. There was no single dominant functional trend among the genes containing multiple conserved uAUGs but many of these genes encode transcription factors, receptors and other proteins involved in various forms of signal transduction (Supplementary Table S7 and data not shown). These genes are subject to complex regulation which seems to be compatible with the proposed role of uAUGs in modulation of translation initiation. The conclusion of the present work on the probably functional importance of conserved uAUGs has to be considered in conjunction with the more or less orthogonal previous observation that mammalian 5′-UTRs are significantly depleted of AUGs, a trend that was confirmed in this study and is even stronger in yeast 5′-UTRs (Table 8). Apparently, purifying selection acts on the uAUGs in two opposite directions: the uAUGs that have no specific function tend to be deleterious and are eliminated during evolution; in contrast, those uAUGs that do serve a function are maintained. More specifically, what could be the regulatory functions of uAUGs? Probably, the leading hypothesis is that the uAUGs attenuate translation by diverting scanning ribosomes from the authentic sAUG. Kozak (2,19) noticed that mRNAs coding for regulatory proteins often contain multiple uAUGs in the 5′-UTRs and proposed that this is an adaptation to prevent excessive and deleterious production of proto-oncogenes, transcription and growth factors, and other proteins that require tight regulation of expression. Double suppression of mRNA translational activity by uAUGs and weak start codon context also might be employed for this purpose (12,13,46). Apparently, selection also operates at the level of the regulatory uORFs by shortening their length, possibly, to optimize the level of translation attenuation. An important role for down-regulation signals affecting translation initiation (uAUGs in 5′-UTR and weak sAUG context) is supported by experimental data: deletion of AUG-containing fragments of 5′-UTR has been shown to greatly increase the mRNA translation rate and protein expression levels (47,48).
Table 8

Expected and observed numbers of uAUG triplets and shuffled triplets per 1000 nt in mammalian and yeast 5′-UTRs

Species, tripletsExpectedObserved
Human uAUGs Human12.67.4
AGU/GUA/GAU/UAG/UGAs63.047.7
Mouse uAUGs12.66.9
Mouse AGU/GUA/GAU/UAG/UGAs63.048.2
Rat uAUGs12.77.6
Rat AGU/GUA/GAU/UAG/UGAs63.549.4
Yeast uAUGs17.710.6
Yeast AGU/GUA/GAU/UAG/UGAs88.573.6

The statistical significance of the differences between expected and observed frequencies of uAUG and shuffled triplets were estimated using the χ2 test (2 × 2 tables). In all cases, the difference for uAUGs was significantly greater than that for the combined shuffled triplets (P < 0.001).

An attractive hypothesis adding another level of complexity to the regulation of translation initiation is that, in eukaryotic cells, the AUG-containing portions of 5′-UTRs are similarly removed by alternative splicing, thus converting poorly translatable (in some cases, virtually inactive) mRNAs into active ones. Mironov et al. (49) found that the majority of alternative splicing events in human genes occurred in 5′-UTR regions. This observation suggests that AUG-containing regions are removed by alternative splicing in many human mRNAs and that the structures of cDNAs, which reflect the hypothetical major forms of mRNAs, and the corresponding functional mRNAs (hypothetical minor forms) for certain genes (especially those expressed at low levels) are substantially different. If, indeed, alternative splicing is a common mechanism for the activation of AUG-containing 5′-UTRs, then such 5′-UTRs are expected to be typical of weakly expressed proteins. In accord with this hypothesis, the density of uAUGs is notably lower in yeast genes (Table 8) where alternative splicing does not seem to be a possibility, given the scarcity of introns and partial degradation of the splicing machinery (50,51). Conceivably, yeast genes are subject to stronger purifying selection against uAUGs compared with mammalian genes, owing to the lack of the alternative splicing option in the former. However, the pronounced conservation of uAUGs in yeast genes (Table 7) suggests the existence of a different regulatory mechanism, which is common to a broad variety if not all eukaryotes, possibly, sequestration of ribosomes by the uAUGs. The conventional scanning mechanism (6) is likely to be completely or at least partially suppressed by uAUGs. Apart from the possibility of alternative splicing, several molecular mechanisms might provide for efficient translation of mRNAs containing uAUGs including leaky scanning, reinitiation (1) or internal initiation of translation (52–55). The relative contributions of these mechanisms remain uncertain (31,56,57) but several recent studies suggest that the impact of at least some of them might be substantial (3,5,17,23,24,32,58–66). Additional functional signals, such as specific secondary structure elements within the 5′-UTR, might compensate for the negative effects of uAUGs on translation initiation. Such hypothetical signals, which might interact with still unknown regulatory proteins, could be used for regulation of gene expression in response to various stimuli. Under this hypothesis, the uAUGs may be regarded as regulatory elements that maintain the low constitutive level of expression of proteins that need to be sharply up-regulated in response to specific cues. It has been shown that the efficiency of translation of certain mRNAs depends on unknown 5′-UTR elements other than the −3 or +4 positions of the start codon context (30) although there is no clear evidence of a wide distribution of such regulatory signals (2,4,19,31,56). A recent genome-wide analysis of uAUGs and uORFs in a curated set of human and rodent cDNAs has suggested that the ribosome-shunt mechanism, in which the small ribosome subunit binds to the mRNA in a cap-dependent manner but then jumps over a large region of the 5′-UTR containing secondary structures, uORFs and uAUGs to land near the sAUG, might be a widespread mode of translation regulation (17). Generally, the results of our analysis are compatible with ribosome scanning being the major mechanism of translation initiation in eukaryotes but they also indicate that, in a substantial fraction of eukaryotic mRNAs, uAUGs and uORFs might confer an additional level of translation regulation. The comparative-genomic results presented here suggest that there are thousands of genes in mammalian genomes that might be subject to translation initiation regulation involving uAUGs. This finding is expected to stimulate experimental study of the repertoire and impact of such regulatory mechanisms.

SUPPLEMENTARY DATA

Supplementary Data is available at NAR Online.
  66 in total

Review 1.  New ways of initiating translation in eukaryotes?

Authors:  M Kozak
Journal:  Mol Cell Biol       Date:  2001-03       Impact factor: 4.272

Review 2.  Upstream open reading frames as regulators of mRNA translation.

Authors:  D R Morris; A P Geballe
Journal:  Mol Cell Biol       Date:  2000-12       Impact factor: 4.272

3.  New ways of initiating translation in eukaryotes.

Authors:  R Schneider; V I Agol; R Andino; F Bayard; D R Cavener; S A Chappell; J J Chen; J L Darlix; A Dasgupta; O Donzé; R Duncan; O Elroy-Stein; P J Farabaugh; W Filipowicz; M Gale; L Gehrke; E Goldman; Y Groner; J B Harford; M Hatzglou; B He; C U Hellen; M W Hentze; J Hershey; P Hershey; T Hohn; M Holcik; C P Hunter; K Igarashi; R Jackson; R Jagus; L S Jefferson; B Joshi; R Kaempfer; M Katze; R J Kaufman; M Kiledjian; S R Kimball; A Kimchi; K Kirkegaard; A E Koromilas; R M Krug; V Kruys; B J Lamphear; S Lemon; R E Lloyd; L E Maquat; E Martinez-Salas; M B Mathews; V P Mauro; S Miyamoto; I Mohr; D R Morris; E G Moss; N Nakashima; A Palmenberg; N T Parkin; T Pe'ery; J Pelletier; S Peltz; T V Pestova; E V Pilipenko; A C Prats; V Racaniello; G S Read; R E Rhoads; J D Richter; R Rivera-Pomar; T Rouault; A Sachs; P Sarnow; G C Scheper; L Schiff; D R Schoenberg; B L Semler; A Siddiqui; T Skern; N Sonenberg; W Sossin; N Standart; S M Tahara; A A Thomas; J J Toulmé; J Wilusz; E Wimmer; G Witherell; M Wormington
Journal:  Mol Cell Biol       Date:  2001-12       Impact factor: 4.272

4.  Presence of ATG triplets in 5' untranslated regions of eukaryotic cDNAs correlates with a 'weak' context of the start codon.

Authors:  I B Rogozin; A V Kochetov; F A Kondrashov; E V Koonin; L Milanesi
Journal:  Bioinformatics       Date:  2001-10       Impact factor: 6.937

Review 5.  Internal ribosome entry sites in eukaryotic mRNA molecules.

Authors:  C U Hellen; P Sarnow
Journal:  Genes Dev       Date:  2001-07-01       Impact factor: 11.361

Review 6.  Prediction and phylogenetic analysis of mammalian short interspersed elements (SINEs).

Authors:  I B Rogozin; V I Mayorov; M V Lavrentieva; L Milanesi; L R Adkison
Journal:  Brief Bioinform       Date:  2000-09       Impact factor: 11.622

Review 7.  Do the 5'untranslated domains of human cDNAs challenge the rules for initiation of translation (or is it vice versa)?

Authors:  M Kozak
Journal:  Genomics       Date:  2000-12-15       Impact factor: 5.736

8.  Analysis of oligonucleotide AUG start codon context in eukariotic mRNAs.

Authors:  G Pesole; C Gissi; G Grillo; F Licciulli; S Liuni; C Saccone
Journal:  Gene       Date:  2000-12-30       Impact factor: 3.688

Review 9.  Molecular mechanisms of translation initiation in eukaryotes.

Authors:  T V Pestova; V G Kolupaeva; I B Lomakin; E V Pilipenko; I N Shatsky; V I Agol; C U Hellen
Journal:  Proc Natl Acad Sci U S A       Date:  2001-06-19       Impact factor: 11.205

10.  CART classification of human 5' UTR sequences.

Authors:  R V Davuluri; Y Suzuki; S Sugano; M Q Zhang
Journal:  Genome Res       Date:  2000-11       Impact factor: 9.043

View more
  57 in total

1.  Known and novel post-transcriptional regulatory sequences are conserved across plant families.

Authors:  Justin N Vaughn; Sally R Ellingson; Flavio Mignone; Albrecht von Arnim
Journal:  RNA       Date:  2012-01-11       Impact factor: 4.942

2.  Identification of Psk2, Skp1, and Tub4 as trans-acting factors for uORF-containing ROK1 mRNA in Saccharomyces cerevisiae.

Authors:  Soonmee Jeon; Suran Lim; Jeemin Ha; Jinmi Kim
Journal:  J Microbiol       Date:  2015-08-27       Impact factor: 3.422

3.  The structure of the 5'-untranslated region of mammalian poly(A) polymerase-alpha mRNA suggests a mechanism of translational regulation.

Authors:  Aikaterini Rapti; Theoni Trangas; Martina Samiotaki; Panayotis Ioannidis; Euthymios Dimitriadis; Christos Meristoudis; Stavroula Veletza; Nelly Courtis
Journal:  Mol Cell Biochem       Date:  2010-02-21       Impact factor: 3.396

4.  Close spacing of AUG initiation codons confers dicistronic character on a eukaryotic mRNA.

Authors:  Daiki Matsuda; Theo W Dreher
Journal:  RNA       Date:  2006-05-08       Impact factor: 4.942

5.  Expression of human Hemojuvelin (HJV) is tightly regulated by two upstream open reading frames in HJV mRNA that respond to iron overload in hepatic cells.

Authors:  Cláudia Onofre; Filipa Tomé; Cristina Barbosa; Ana Luísa Silva; Luísa Romão
Journal:  Mol Cell Biol       Date:  2015-02-09       Impact factor: 4.272

6.  uORFs with unusual translational start codons autoregulate expression of eukaryotic ornithine decarboxylase homologs.

Authors:  Ivaylo P Ivanov; Gary Loughran; John F Atkins
Journal:  Proc Natl Acad Sci U S A       Date:  2008-07-14       Impact factor: 11.205

7.  Deciphering the rules by which 5'-UTR sequences affect protein expression in yeast.

Authors:  Shlomi Dvir; Lars Velten; Eilon Sharon; Danny Zeevi; Lucas B Carey; Adina Weinberger; Eran Segal
Journal:  Proc Natl Acad Sci U S A       Date:  2013-07-05       Impact factor: 11.205

8.  Plant upstream ORFs can trigger nonsense-mediated mRNA decay in a size-dependent manner.

Authors:  Tünde Nyikó; Boglárka Sonkoly; Zsuzsanna Mérai; Anna Hangyáné Benkovics; Dániel Silhavy
Journal:  Plant Mol Biol       Date:  2009-08-04       Impact factor: 4.076

9.  Transcription of the rat testis-specific Rtdpoz-T1 and -T2 retrogenes during embryo development: co-transcription and frequent exonisation of transposable element sequences.

Authors:  Chiu-Jung Huang; Wan-Yi Lin; Che-Ming Chang; Kong-Bung Choo
Journal:  BMC Mol Biol       Date:  2009-07-25       Impact factor: 2.946

10.  Autogenous translational regulation of the Borna disease virus negative control factor X from polycistronic mRNA using host RNA helicases.

Authors:  Yohei Watanabe; Naohiro Ohtaki; Yohei Hayashi; Kazuyoshi Ikuta; Keizo Tomonaga
Journal:  PLoS Pathog       Date:  2009-11-06       Impact factor: 6.823

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.