| Literature DB >> 17686169 |
Marija Cvijović1, Daniel Dalevi, Elizabeth Bilsland, Graham J L Kemp, Per Sunnerhagen.
Abstract
BACKGROUND: The translational efficiency of an mRNA can be modulated by upstream open reading frames (uORFs) present in certain genes. A uORF can attenuate translation of the main ORF by interfering with translational reinitiation at the main start codon. uORFs also occur by chance in the genome, in which case they do not have a regulatory role. Since the sequence determinants for functional uORFs are not understood, it is difficult to discriminate functional from spurious uORFs by sequence analysis.Entities:
Mesh:
Year: 2007 PMID: 17686169 PMCID: PMC1964767 DOI: 10.1186/1471-2105-8-295
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Conservation of uORFs in the GCN4 locus of S. cerevisiae and homologues in 18 fungal species. The species are ordered approximately according to evolutionary distance from S. cerevisiae [13]. uORFs which are conserved with respect to sequence and position within the 5' flanking region are connected by dotted lines. The start codon of the GCN4 coding sequence is located at position 0.
Figure 2Flowchart of the steps in defining criteria to find novel uORFs that share characteristics with known functional uORFs. Solid arrows denote partition of a gene set into subsets; dotted arrows denote that a gene set or an algorithm is influenced by or operates on something. Letters within brackets identify the different subsets referred to in the text. Set A was the initial training set; set A + B was the training set for the refined rule set.
Evolutionary conservation of uORFs highlighted by Vilela and McCarthy [3]. Genes with conserved uORFs are shown in bold.
| Gene | uORF conservation1 | If predicted not to be functional, reason for this | Evidence about functional role |
| yes (1/1; 4/6) | [26] | ||
| yes (4/4; 7/7) | [6] | ||
| no (0/1; 0/6) | uORF too long | ||
| no (0/1; 0/6) | uORF too close to main AUG | ||
| no (0/1; 0/5) | uORF too close to main AUG | [32]2 | |
| yes (1/1; 5/5) | [4] | ||
| yes (2/2; 4/4) | [43]3 | ||
| no (0/1; 0/7) | uORF too close to main AUG | ||
| yes (4/6; 4/6) | [31]3 | ||
| yes (1/1; 3/5) | [27] | ||
| yes (2/2; 3/3) | [27] | ||
| no (0/1; 0/5) | uORF too close to main AUG | [32]2 | |
| no (0/1; 0/7) | uORF too close to main AUG | ||
| yes (1/1; 4/4) | [29] | ||
| yes (3/4; 3/4) | [30]4 | ||
| no (0/1; 0/6) | uORF too long (55 codons) |
The STA1-3 genes mentioned by Vilela and McCarthy are not present in the standard S288c genome sequence and were not included in this analysis.
1 Numbers between parentheses denote: (number of uORFs conserved/total number of uORFs; number of species where uORFs are conserved/total number of species where orthologue could be identified)
2 Evidence against translational control by uORFs
3 Evidence for translation using an IRES mechanism
4 Pet111 controls translation of another mRNA, but no evidence for uORF control of PET111 expression
Figure 3Three major classes of organisation of uORFs found in the S. cerevisiae genome. Not drawn to scale.
Figure 4Alignment of a region containing uORF1 (closest to the start codon of the main ORF) from S. cerevisiae YJL139c (YUR1) with the orthologous sequences from four other Saccharomyces species. A, sequence alignment. The start and stop codons of the uORF are marked in yellow. B, DNA sequence similarity profile of uORF1. C, DNA sequence similarity profile of the entire 5'-UTR of YUR1 and its homologues.
Properties of uORFs found in 294 previously identified 5'-UTRs [18], after classification as evolutionarily conserved or non-conserved.
| Conserved | Non-conserved | |
| Total number | ||
| Average length (codons) | ||
| Average distance from start codon of main ORF |
Figure 5Schematic of the arrangement of uORFs in the 5' flank of S. cerevisiae YJL139c (YUR1) and its homologues in other Saccharomyces species. This type of diagram is produced automatically for each gene, showing the intergenic sequence as a numbered axis; the coding sequence of the gene starts at the position one of the intergenic sequence. uORFs are shown as boxes. The box colours show S. cerevisiae uORFs predicted to be functional (red), or not functional (blue). uORFs from other species are represented by black boxes, since we do not predict their functionality. The rightmost uORF (uORF1) is identical to the one shown in Fig. 4.
32 newly identified genes with highly conserved uORFs strongly predicted by the rule set to be functional (marked in bold), with an optimal spacing to the main ORF and other uORFs. Numbering of uORFs is 3' to 5', as uORFs were found from intergenic sequences.
| 489 | -53 | ||||
| 376 | -104 | ||||
| 860 | -130 | ||||
| 509 | -109 | ||||
| 757 | -245 | ||||
| 387 | -144 | ||||
| 505 | -55 | ||||
| 1030 | -142 | ||||
| 825 | -230 | ||||
| 315 | -71 | ||||
| 1006 | -165 | ||||
| 358 | -104 | ||||
| 771 | -155 | ||||
| 1539 | -342 | ||||
| 1317 | -338 | ||||
| 1042 | -103 | ||||
| 371 | -103 | ||||
| 388 | -211 | ||||
| 628 | -367 | ||||
| 488 | -72 | ||||
| 248 | -82 | ||||
| 373 | -145 | ||||
| 821 | -65 | ||||
| 441 | -212 | 206 | |||
| 541 | -364 | 252 | |||
| 692 | -206 | 170 | |||
| 454 | -220 | 171 | |||
| Uncharacterised | 320 | -111 | 80 | ||
| 241 | -99 | 21 | |||
| 313 | -147 | 62 | |||
| 457 | uORF2(11) | -384 | 217 | ||
| 393 | -142 | 123 |
a Genes where predicted functional uORFs are located within the estimated 5'-UTR are marked in bold.
Calculated minimum free folding energy of the 200 nt immediately upstream of the start codon of different sets of uORF-containing genes [21].
| Set | Minimum free energy (kcal/mol) |
| 9 genes in initial set with conserved uORFs (Table 1; Fig. 2 set C) | -25.8 |
| 32 genes with highly conserved uORFs with optimal spacing (Table 3; Fig. 2 set H) | -32.8 |
| 252 genes with highly conserved uORFs (additional file | -35.4 |
| All genes in genome | -36.6 |
Major functional classes for genes that harbour conserved uORFs predicted to play a regulatory role (Fig. 2, set G).
| BDF1 | |||