| Literature DB >> 26578573 |
Hélène Mouilleron1, Vivian Delcourt2, Xavier Roucou3.
Abstract
mRNAs carry the genetic information that is translated by ribosomes. The traditional view of a mature eukaryotic mRNA is a molecule with three main regions, the 5' UTR, the protein coding open reading frame (ORF) or coding sequence (CDS), and the 3' UTR. This concept assumes that ribosomes translate one ORF only, generally the longest one, and produce one protein. As a result, in the early days of genomics and bioinformatics, one CDS was associated with each protein-coding gene. This fundamental concept of a single CDS is being challenged by increasing experimental evidence indicating that annotated proteins are not the only proteins translated from mRNAs. In particular, mass spectrometry (MS)-based proteomics and ribosome profiling have detected productive translation of alternative open reading frames. In several cases, the alternative and annotated proteins interact. Thus, the expression of two or more proteins translated from the same mRNA may offer a mechanism to ensure the co-expression of proteins which have functional interactions. Translational mechanisms already described in eukaryotic cells indicate that the cellular machinery is able to translate different CDSs from a single viral or cellular mRNA. In addition to summarizing data showing that the protein coding potential of eukaryotic mRNAs has been underestimated, this review aims to challenge the single translated CDS dogma.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26578573 PMCID: PMC4705651 DOI: 10.1093/nar/gkv1218
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The typical tripartite structure of a eukaryotic mRNA with a single annotated or reference coding sequence or refCDS (A, single CDS dogma), or with possible alternative ORFs with an initiation codon located in the 5′ UTR, the CDS, or the 3′ UTR. (B). AltORFs5′UTR may overlap refCDSs in a different reading frame. AltORFsCDS may extend into 3′ UTRs. In general, refCDSs are longer than altORFs. Generally, all annotated mRNAs have a refCDS. An mRNA may have no, one or several altORFs. Abbreviations: AAAAA, polyA tail; altORF, alternative open reading frame; AUG, translation initiation codon; refCDS, annotated or reference protein coding sequence; m7G, 7-methyl-guanosine cap; UAA, stop codon (only one possible stop codon is shown for simplicity).
Examples of detected alternative proteins in the literature
| altORF5′UTR | altORFCDS | altORF3′UTR |
|---|---|---|
| AltSMCR7L ( | AltCHTF8 ( | HsHTD2 ( |
| AltSLC35A4 ( | AltHNRPUL1 ( | AltSF1 ( |
| uORF5 ( | AltATXN1 ( | |
| PEP7 ( | AltPrP ( | |
| uMKKS1 and 2 ( |
Figure 2.Ribosome profiling and MS are common techniques to detect alternative proteins in large scale studies. (A) AltSLC35A4, altCHTF8 and altRPP14(HTD2) are examples of alternative proteins detected by independent laboratories and at least by two different techniques. A schematic view of the genomic structures for the three human genes is shown with the chromosomal coordinates for the three alternative TISs. Exons (gray), refCDSs (green), altORFs (red), and the three reading frames are indicated. The genomic bases around the alternative TISs and the amino acids translated in the three reading frames are also indicated (image from the online genome browser GWIPS-viz (93)). Ribosome profiling data retrieved from GWIPS-viz indicate the position of initiating ribosome profiles from 4 profiling studies (62,63,90,136). Initiating ribosomes are clearly detected around altORFs TISs. The MS data show the peptide sequence of peptides detected in two independent studies, with peptides detected in both studies shown in bold (60,107). (B) Proposed nomenclature for altORFs. For example, altSLC35A45′UTRE1–10 indicates an altORF in the SLC35A4 gene; it is located in the 5′ UTR, and the TIS starts at the 10th base of exon 1.