| Literature DB >> 20042076 |
Christopher H Bryant, Graham J L Kemp, Janeli Sarv, Erik Kristiansson, Per Sunnerhagen.
Abstract
BACKGROUND: Some upstream open reading frames (uORFs) regulate gene expression (i.e., they are functional) and can play key roles in keeping organisms healthy. However, how uORFs are involved in gene regulation is not yet fully understood. In order to get a complete view of how uORFs are involved in gene regulation, it is expected that a large number of experimentally verified functional uORFs are needed. Unfortunately, wet-experiments to verify that uORFs are functional are expensive.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20042076 PMCID: PMC2813248 DOI: 10.1186/1471-2105-10-451
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schematic representation of mRNA primary structure. AUG is start codon. A stop codon can be UAA, UAG, or UGA. A 5' UTR may have zero or more uORFs.
Detailed composition of uORFs obtained using getorf of the EMBOSS package.
| Number of Genes | uORFs | |||
|---|---|---|---|---|
| Functional | Non-functional | Unlabelled | ||
| 18 studied | 9 | 21 | - | - |
| genes | 2 | - | 2 | - |
| 7 | - | - | 8 | |
| 1,475 other genes | - | - | 3,616 | |
| 1,493 genes | 21 | 2 | 3,624 | |
Detailed uORF composition from 18 studied genes within the collection obtained using getorf of the EMBOSS package.
| Gene Name | Systematic Name | uORFs | ||
|---|---|---|---|---|
| Functional | Non-functional | Unlabelled | ||
| YAL040C | 1 | - | - | |
| YEL009C | 4 | - | - | |
| YKL109W | 2 | - | - | |
| YGR162W | 5 | - | - | |
| YML007W | 1 | - | - | |
| YDR423C | 2 | - | - | |
| YNR055C | 1 | - | - | |
| YMR257C | 4 | - | - | |
| YOR303W | 1 | - | - | |
| YBR037C | - | 1 | - | |
| YDL069C | - | 1 | - | |
| YDR123C | - | - | 1 | |
| YLR014C | - | - | 1 | |
| YKL216W | - | - | 1 | |
| YNL104C | - | - | 1 | |
| YGL158W | - | - | 2 | |
| YHR144C | - | - | 1 | |
| YHR205W | - | - | 1 | |
| 18 Genes | 21 | 2 | 8 | |
Number of nodes in the first five levels in each of the GO categories.
| Level | |||||
|---|---|---|---|---|---|
| GO category | 1 | 2 | 3 | 4 | 5 |
| Molecular Function | 1 | 20 | 730 | 737 | 1531 |
| Biological Process | 1 | 20 | 677 | 1892 | 6149 |
| Cellular Component | 1 | 17 | 279 | 902 | 2125 |
The English translations of the hypotheses generated from the set of 21 positive examples.
| A uORF has functional role if it satisfies at least one of the following rules. | |
| 1. | the main gene is regulated under butanol stress and the product of the main gene is involved in nucleic acid binding; |
| 2. | the uORF is conserved in two other species, the main gene is localised in intracellular (or protoplasm), and the UTR length > = 463; |
| 3. | the uORF is conserved in three other species and the main gene is localised in intracellular (or protoplasm); |
| 4. | its length < = 7 and the product of the main gene is involved in nucleic acid binding; |
| 5. | the base in position +4 relative to the uORF's start codon is 'G' and the main gene is involved in regulation of biological process; |
| 6. | its length < = 6, the main gene is localised in intracellular (or protoplasm), and the main gene is involved in regulation of biological process; |
| 7. | the uORF is conserved in two other species, the main gene is not regulated under low concentration of H2O2, the main gene is localised in intracellular (or protoplasm), and the UTR length > = 244; |
| 8. | the product of the main gene is involved in translation regulator activity. |
Comparison between predictions made by our hypotheses (Table 4) and by [32] for the seven genes that they wet-experimentally tested.
| Gene Name | Systematic Name | uORF's Position | uORF's Length | Predicted as functional in | |
|---|---|---|---|---|---|
| Z&D | This study | ||||
| YDR045C | -60 | 4 | Yes | Yes | |
| YJL164C | -42 | 5 | Yes | Yes | |
| YNL256W | -65 | 4 | Yes | Yes | |
| YOL105C | -50 | 7 | Yes | ||
| YOR231W | -71 | 10 | Yes | No | |
| YLR443W | -15 | 5 | No | No | |
| YML056C | -99 | 14 | No | No | |
5' UTR length was predicted to be well over 1000 bases and thus this gene is not included in this study.
Zhang and Dietrich found these uORFs have little effect on translation; we consider them as non-functional.