| Literature DB >> 18237443 |
Celine A Hayden1, Giovanni Bosco.
Abstract
BACKGROUND: Upstream open reading frames (uORFs) are elements found in the 5'-region of an mRNA transcript, capable of regulating protein production of the largest, or major ORF (mORF), and impacting organismal development and growth in fungi, plants, and animals. In Drosophila, approximately 40% of transcripts contain upstream start codons (uAUGs) but there is little evidence that these are translated and affect their associated mORF.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18237443 PMCID: PMC2276209 DOI: 10.1186/1471-2164-9-61
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
K/Kvalues of uORF and associated mORFs correlated to most distantly related organism containing uORF-mORF association in an EST
| CG18624 | 0.11**** | 0.06**** | |
| CG12664 | 0.26** | 0.15**** | |
| CG12788/CG17767 | 0.32**** | 0.28**** | |
| CG33713/CG33714 | 0.06**** | 0.13*** | |
| CG3240 | 0.11**** | 0.11**** | |
| CG9960/CG9958 | 0.02**** | 0.07**** | |
| CG31917 | 0.00**** | 0.09**** | |
| CG31919/CG33995 | 0.10** | 0.31* | |
| CG18042 | 0.01**** | 0.29* | |
| CG7400 | 0.10* | 0.06**** | |
| CG16974 | 0.00** | 0.14**** | |
| CG4824 | 0.04*** | 0.17*** | |
| CG17325 | 0.08**** | 0.07**** | |
| CG10570 | 0.28** | 0.19**** | |
| CG11508 | 0.13**** | 0.54** | |
| CG8026 | 0.31* | 0.04**** | |
| CG17759 (uORF2) | 0.33* | 0.02**** | |
| CG33671/CG33672 | 0.07**** | 0.14**** | |
| CG6191 | 0.13** | 0.05*** | |
| CG30100 | 0.08**** | 0.09**** | |
| CG17725 | 0.00* | 0.06**** | |
| CG5469 | 0.10**** | 0.07**** | |
| CG33786/CG33785 | 0.03** | 0.16**** | |
| CG9865 (uORF1) | 0.12**** | 0.30**** | |
| CG9865 (uORF2) | 0.07**** | 0.30**** | |
| CG9865 (uORF3) | 0.04**** | 0.30**** | |
| CG9878 | 0.30** | 0.04**** | |
| CG30290 | 0.00**** | 0.05* | |
| CG12016 | 0.12**** | 0.12**** | |
| CG32573 | 0.42*** | 0.19**** | |
| CG11989 | 0.04**** | 0.01**** | |
| CG7869 | 0.09**** | 0.12**** | |
| CG7628 | 0.10** | 0.03**** | |
| CG9666 | 0.29**** | 0.04**** | |
| CG2128 | 0.24**** | 0.00**** | |
| CG9288 | 0.16**** | 0.17**** | |
| CG9924 | 0.08* | 0.12**** | |
| CG31241 | 0.23**** | 0.00* | |
| CG31178 | 0.31*** | 0.33** | |
| CG7071/CG34131 | 0.08**** | 0.20**** | |
| CG10238 | 0.29**** | 0.12**** | |
| CG5116 | 0.15** | 0.16**** | |
| CG14550 | 0.13* | 0.21**** | |
| CG7950 | 0.35**** | 0.04**** |
a D. melanogaster taxonomic classification as described by NCBI
b Abbreviations: Boomic, Boophilus microplus; Drovir, Drosophila virilis; Anogam, Anopheles gambiae; Carmae, Carcinus maenas; Dromoj, Drosophila mojavensis; Dapmag, Daphnia magna; Bommor, Bombyx mori; Glomor, Glossina morsitans; Dropse, Drosophila pseudoobscura; Drovir, Drosophila virilis; Drogri, Drosophila grimshawi; Apimel, Apis mellifera; Ixosca, Ixodes scapularis; Drowil, Drosophila willistoni; Aedaeg, Aedes aegypti; Acypis, Acyrthosiphon pisum; Myzper, Myzus persicae; Artfra, Artemia franciscana; Lutlon, Lutzomyia longipalpis; Taepyg, Taeniopygia guttata
* p-value < 0.05; H0: K/K= 1, H: K/K< 1
** p-value < 0.01
*** p-value < 0.001
****p-value < 0.0001
Cytological distribution and peptide length of putative CPuORFs in Drosophila melanogaster
| FBtr0071140_1 | CG18624 | 7C2-7C2 | 54 |
| FBtr0071349_3 | CG12664 | 8C11-8C13 | 41 |
| FBtr0074767_3 | CG12788/CG17767b | 18D3-18D7 | 117 |
| FBtr0077227_1 | CG33713/CG33714b | 19F4-19F4 | 90 |
| FBtr0077747_1 | CG3240 | 23A1-23A1 | 179 |
| FBtr0077737_2 | CG9960/CG9958b | 23A3-23A3 | 134 |
| FBtr0079037_2 | CG31917 | 25C1-25C1 | 73 |
| FBtr0079006_1 | CG31919/CG33995b | 25C1-25C1 | 44 |
| FBtr0079695_3 | CG18042 | 29D4-29D5 | 85 |
| FBtr0080133_1 | CG7400 | 31F4-31F5 | 20 |
| FBtr0080489_1 | CG16974 | 34A8-34A8 | 21 |
| FBtr0080803_5 | CG4824 | 35E2-35E2 | 44 |
| FBtr0081102_1 | CG17325 | 37A4-37A5 | 48 |
| FBtr0081122_2 | CG10570 | 37A4 | 50 |
| FBtr0088817_5 | CG11508 | 44B3-44B3 | 150 |
| FBtr0088610_3 | CG8026 | 45B3-45B3 | 48 |
| FBtr0087829_3 | CG17759 | 49B8-49B9 | 31 |
| FBtr0091650_2 | CG33671/CG33672b | 49B10-49B10 | 86 |
| FBtr0087678_3 | CG6191 | 50B3-50B4 | 21 |
| FBtr0087140_1 | CG30100 | 53B1-53B1 | 70 |
| FBtr0086701_1 | CG17725 | 55D3-55D3 | 27 |
| FBtr0086654_7 | CG5469 | 55E5-55E5 | 121 |
| FBtr0091786_1 | CG33786/CG33785b | 57A8-57A9 | 108 |
| FBtr0071680_7 | CG9865a (uORF1) | 57F7-57F7 | 65 |
| FBtr0071680_5 | CG9865a (uORF2) | 57F7-57F7 | 84 |
| FBtr0071680_4 | CG9865a (uORF3) | 57F7-57F7 | 76 |
| FBtr0071676_1 | CG9878 | 57F8-57F8 | 65 |
| FBtr0071672_1 | CG30290 | 57F8-57F9 | 94 |
| FBtr0073063_4 | CG12016 | 63D1-63D1 | 81 |
| FBtr0074315_3 | CG32573 | 14F5-14F5 | 109 |
| FBtr0076348_2 | CG11989 | 67D2-67D2 | 50 |
| FBtr0076203_3 | CG7869 | 68A4-68A4 | 68 |
| FBtr0076213_1 | CG7628 | 68A7-68A8 | 18 |
| FBtr0074991_5 | CG9666 | 76A3-76A3 | 129 |
| FBtr0078767_1 | CG2128 | 83A4-83A4 | 38 |
| FBtr0082829_3 | CG9288 | 87F13-87F13 | 80 |
| FBtr0082871_2 | CG9924 | 88A3-88A4 | 25 |
| FBtr0083570_4 | CG31241 | 90F11-90F11 | 178 |
| FBtr0084138_3 | CG31178 | 93F14-93F14 | 40 |
| FBtr0084211_1 | CG7071/CG34131b | 94A6-94A6 | 157 |
| FBtr0084782_2 | CG10238 | 96C1-96C1 | 90 |
| FBtr0084877_1 | CG5116 | 96E2-96E2 | 15 |
| FBtr0084974_2 | CG14550 | 96F10-96F10 | 111 |
| FBtr0085563_1 | CG7950 | 99D3-99D3 | 111 |
a Gene with multiple CPuORFs in the same 5'UTR
b Different gene identifiers annotated as producing the same transcript; the first CG identifier predicts the translation of the mORF and the second CG identifier predicts the translation of the uORF.
Figure 1Conserved peptide uORF length distribution. A. A total of 44 CPuORFs identified in Drosophila melanogaster, B. CPuORFs in Arabidopsis thaliana as described by Hayden and Jorgensen [19], C. CPuORFs conserved between D. melanogaster and non-Brachycera species.
Gene Ontology term and InterPro domain overrepresentation in uORF dataset as determined by Genemerge
| GO:0008170 (MF) | N-methyltransferase activity | 10/14601 | 2/481 | 0.015 |
| GO:0045039 (BP) | protein import into mitochondrial inner membrane | 6/14601 | 2/482 | 0.008 |
| IPR002296 | N6 adenine-specific DNA methyltransferase, N12 class | 4/14040 | 2/481 | 0.004 |
| IPR000241 | Putative RNA methylase | 3/14040 | 2/481 | 0.002 |
| IPR004217 | Zinc finger, Tim10/DDP-type | 5/14040 | 2/482 | 0.006 |
MF, molecular function; BP, biological process
1 GO term or Interpro domain observed in CG9666 and CG9960
2 GO term or Interpro domain observed in CG9878 (Tim10) and CG17767 (Tim9b)
Predicted function and biological processes of uORF-mORF pairs in Drosophila
| CG18624 | Putative NADH dehydrogenase | Mitochondrial electron transport | Pfam domain; GO term designation | |
| CG12664 | Unknown | Neuromuscular development | [61, 62] | |
| CG12788/CG17767c | Mitochondrial inner membrane translocase subunit (uORF) | Transport across mitochondrial inner membrane (uORF) | Interpro domain | |
| CG33713/CG33714c | Acyl-CoA binding (mORF) RNA binding (uORF) | Unknown | Interpro domain | |
| CG3240 | Putative 3'->5' exonuclease activity | DNA repair | [63, 64] | |
| CG9960/CG9958c | Putative methyltransferase (mORF) Putative Biogenesis of Lysosome-related Organelles Complex-1-like (BLOC-1-like) subunit (uORF) | Biogenesis of lysosome-related organelles (eg. melanosomes and platelet dense granules; uORF) | [65] (uORF) Interpro domain (mORF) | |
| CG31917 | Putative TFIIH subunit (uORF) | Transcription and DNA repair (uORF) | [66, 67]; Interpro domain | |
| CG31919/CG33995c | Ankyrin repeat, protein-protein interactions | Target of transcription factor Glial cells missing (Gcm), involved in neuronal development and function | Interpro domain; [68] | |
| CG18042 | Putative component of Anaphase Promoting Complex (uORF) | Mitosis; Neural development (unclear whether it is the uORF, mORF or both) | [69, 70]; Flybase personal communication FBrf0125046; [71]; NCBI Conserved Domain Search | |
| CG7400 | Putative very-long-chain fatty acyl-CoA synthetase | Fatty acid metabolism | [72] | |
| CG16974 | Member of | Leucine-rich repeat and Immunoglobulin domain-containing protein | Unknown | [73, 74] |
| CG4824 | RNA binding protein | Anterior-Posterior patterning | [75–77] | |
| CG17325 | Unknown | Unknown | ||
| CG10570 | Unknown | Unknown | ||
| CG11508 | Subunit of an snRNA transcriptional activator protein | Transcription of splicing factors | [78] | |
| CG8026 | Mitochondrial carrier protein | Mitochondrial folate transport | [37, 38] | |
| CG17759b (uORF2) | G-protein subunit | Photoreceptor signal transduction; Axonal guidance | [79–81] | |
| CG33671/CG33672c | Mevalonate kinase (mORF); BolA-like protein, putative nucleic acid binding protein (uORF) | Isoprenoid production (mORF) | [82, 83] | |
| CG6191 | Unknown | Unknown | ||
| CG30100 | Translation release factor | Translation termination | GO term designation | |
| CG17725 | Putative phosphoenolpyruvate carboxykinase | Gluconeogenesis; Starvation; Glyceroneogenesis | [84–86] | |
| CG5469 | Ubiquitin regulatory X domain (UBX), putative RNA binding | Unknown | [87]; FBrf0189302 | |
| CG33786/CG33785c | Unknown | Translation (mORF) Transcription (uORF) | Interpro domain | |
| CG9865b (uORF1) | Putative mannosyl transferase | Unknown | Interpro domain | |
| CG9865b (uORF2) | Putative mannosyl transferase | Unknown | Interpro domain | |
| CG9865b (uORF3) | Putative mannosyl transferase | Unknown | Interpro domain | |
| CG9878 | Putative inner mitochondrial membrane translocase | Protein transport across mitochondrial membrane | [88] | |
| CG30290 | Putative flavoprotein enzyme | Unknown | Interpro domain | |
| CG12016 | Unknown | Unknown | ||
| CG32573 | Unknown | Unkown | ||
| CG11989 | Putative N-Acetyltransferase catalytic subunit | Unknown | [89]; Interpro domain | |
| CG7869 | DNA binding | Endoreplication | [90, 91] | |
| CG7628 | Phosphate transporter | Phosphate transport | Interpro domain | |
| CG9666 | Putative methyltransferase | Unknown | Interpro domain | |
| CG2128 | Histone deacetylase | Wing development; Chromatin remodeling | [92, 93] | |
| CG9288 | Pyruvate kinase | Unknown | Interpro domain | |
| CG9924 | Unknown | Regulator of Hedgehog response (growth and development) | [94, 95] | |
| CG31241 | Putative RNA methylase | Late larval development | [24]; Interpro domain | |
| CG31178 | Unknown | Unknown | ||
| CG7071/CG34131c | Unknown | Unknown | ||
| CG10238 | Molybdopterin synthase large subunit (mORF) and small subunit (uORF) | Production of molybdopterin; Implicated in mammalian neurological damage | [23, 96] | |
| CG5116 | Putative GTP-binding protein | Unknown | Interpro domain | |
| CG14550 | Putative phosphatidylinositol N-acetylglucosaminyltransferase subunit P (mORF); Pcc1-like transcription factor (uORF) | Unknown | Interpro domains | |
| CG7950 | Putative tRNA processing enzyme subunit (uORF) | tRNA processing (uORF) | Interpro domain |
a refers to mORF unless otherwise noted
bfend, Forked end; Tim9b, Translocase of inner membrane 9b; Rad1, Radiation insensitive 1; lmg, Lemming;Fatp, Fatty acid transport protein; LIG, Leucine-Rich Repeat and Immunoglobulin-containing protein (MacLaren et al, 2004); BicC, Bicaudal C; DmSNAP50/DmPBP49, snRNA activator protein 50/Proximal Sequence Element-Binding Protein 49;Galpha49B, G-protein alpha49B; Pepck, Phosphoenolpyruvate carboxykinase; Gint3, GDI interacting protein 3; Tim10, Translocase of inner membrane; SuUR, Suppressor of underreplication; Ard1, Arrest defective 1; Hdac3, Histone deacetylase 3; Rdx, Roadkill (Kent et al, 2006); DTL, Drosophila Tat-like; MOCS2, molybdopterin synthase 2