| Literature DB >> 24484385 |
Adam Skarshewski, Mitchell Stanton-Cook, Thomas Huber, Sumaya Al Mansoori, Ross Smith, Scott A Beatson, Joseph A Rothnagel1.
Abstract
BACKGROUND: Several small open reading frames located within the 5' untranslated regions of mRNAs have recently been shown to be translated. In humans, about 50% of mRNAs contain at least one upstream open reading frame representing a large resource of coding potential. We propose that some upstream open reading frames encode peptides that are functional and contribute to proteome complexity in humans and other organisms. We use the term uPEPs to describe peptides encoded by upstream open reading frames.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24484385 PMCID: PMC3914846 DOI: 10.1186/1471-2105-15-36
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Screenshots of the search, alignment and help pages of uPEPperoni. (A) The conserved uPEP search page showing the user-selectable settings for the RefSeq database, Ka/Ks ratio, reference heatmaps, alignment parameters and heatmap generation. (B) The heatmap alignment page showing the user-selectable settings for visual representation of the main coding sequence (CDS) and uORFs and the search parameters for uORF-length, the extent of uORF overlap into the CDS and the region of the transcript to be searched. (C) The help page. uPEPperoni is hosted on an Apache server on a Linux platform and is publically accessible free of charge at http://upep-scmb.biosci.uq.edu.au. Full documentation of uPEPperoni is also accessible via links on the website. The uORF reference database is automatically rebuilt on the server shortly after each major RefSeq release. We archive previous uORF reference databases. The RefSeq release version number from which the reference database is derived is shown on the web page.
Figure 2Example output showing the heatmaps produced by querying the mRNA sequence of the Hairless () transcript (NM_005144) against Hairless () (NM_021877). The solid bars above the heatmap indicate the ORFs on the transcript. The output lists the most conserved uPEPs first. The heatmap generated by the query sequence is shown first; in this case human HR aligned with mouse Hr transcript. The reciprocal heatmap generated using the reference sequence is shown below (mouse Hr transcript versus human HR). The inclusion of the Reference Alignment is selectable by the user. The unformatted aligned sequence can be viewed using a hyperlink shown above the heatmap.
List of species with one or more conserved uPEPs using the uORFs identified in Crowe [2]
| Human, mouse, rat, cow, chicken, frog, monkey, horse, chimpanzee, zebra fish, salmon | 1 |
| Human, mouse, rat, orangutan, chicken, frog, zebra fish, salmon | 1 |
| Human, mouse, rat, cow, monkey, chicken, rabbit, chimpanzee | 1 |
| Human, mouse, rat, pig, chicken, cat, horse | 1 |
| Human, mouse, rat, cow, orangutan, monkey | 1 |
| Human, mouse, rat, cow, orangutan, pig | 1 |
| Human, mouse, rat, cow, orangutan, frog | 1 |
| Human, mouse, rat, cow, chicken, frog | 1 |
| Human, mouse, rat, cow, orangutan | 13 |
| Human, mouse, rat, orangutan, chicken | 1 |
| Human, mouse, rat, zebra fish, frog | 1 |
| Human, mouse, rat, orangutan, pig | 1 |
| Human, mouse, rat, pig, monkey | 1 |
| Human, mouse, cow, pig, orangutan | 1 |
| Human, mouse, rat, orangutan | 10 |
| Human, mouse, rat, cow, monkey | 2 |
| Human, mouse, rat, cow, pig | 1 |
| Human, mouse, rat, cow, chicken | 1 |
| Human, mouse, rat, cow, frog | 1 |
| Human, mouse, rat, cow | 27 |
| Human, mouse, cow, orangutan | 7 |
| Human, mouse, cow, pig | 3 |
| Human, mouse, rat, pig | 2 |
| Human, mouse, rat, monkey | 2 |
| Human, mouse, cow, monkey | 1 |
| Human, mouse, orangutan, chimpanzee | 1 |
| Human, mouse, orangutan, hamster | 1 |
| Human, mouse, rat, horse | 1 |
| Human, mouse, rat, chicken | 1 |
| Human, mouse, rat | 36 |
| Human, mouse, cow | 15 |
| Human, mouse, orangutan | 5 |
| Human, mouse, pig | 2 |
| Human, mouse, monkey | 1 |
| Human, mouse | 55 |
aSpecifies the total number of individual uPEPs that show sequence conservation across the group of species indicated.
Figure 3Several heatmaps of aligned transcript-pairs can be combined to provide a visual snapshot of sequence conservation. (A) Heatmaps for each pair-wise analysis of the human transcript encoding protein tyrosine phosphatase type IVA, member 1 (Ptp4a1) (NM_003463) with the othologous non-human transcript are shown. Black lines above each heatmap mark the position of the conserved uPEP and CDS for that species. Note the conservation of this uPEP even as the phylogenetic distance between the comparison species (on the right) widens. (B) ClustalW alignment of the Ptp4a1 uPEP, translated in silico from the conserved uORF. The numbers below the bar graph represent the conservation of each individual amino acid, where 10 (shown as an asterisk (*)) indicates identity across all species.