| Literature DB >> 29162641 |
Chan Hyun Na1,2,3, Mustafa A Barbhuiya1,2, Min-Sik Kim4,5, Steven Verbruggen6, Stephen M Eacker2,3, Olga Pletnikova7, Juan C Troncoso2,7, Marc K Halushka7, Gerben Menschaert5, Christopher M Overall8, Akhilesh Pandey1,3,4,7,9.
Abstract
Translation initiation generally occurs at AUG codons in eukaryotes, although it has been shown that non-AUG or noncanonical translation initiation can also occur. However, the evidence for noncanonical translation initiation sites (TISs) is largely indirect and based on ribosome profiling (Ribo-seq) studies. Here, using a strategy specifically designed to enrich N termini of proteins, we demonstrate that many human proteins are translated at noncanonical TISs. The large majority of TISs that mapped to 5' untranslated regions were noncanonical and led to N-terminal extension of annotated proteins or translation of upstream small open reading frames (uORF). It has been controversial whether the amino acid corresponding to the start codon is incorporated at the TIS or methionine is still incorporated. We found that methionine was incorporated at almost all noncanonical TISs identified in this study. Comparison of the TISs determined through mass spectrometry with ribosome profiling data revealed that about two-thirds of the novel annotations were indeed supported by the available ribosome profiling data. Sequence conservation across species and a higher abundance of noncanonical TISs than canonical ones in some cases suggests that the noncanonical TISs can have biological functions. Overall, this study provides evidence of protein translation initiation at noncanonical TISs and argues that further studies are required for elucidation of functional implications of such noncanonical translation initiation.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29162641 PMCID: PMC5749180 DOI: 10.1101/gr.226050.117
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Identification of translation initiation sites. (A) A schematic diagram for enrichment of protein N-terminal peptides using the TAILS method. All primary amine groups (α- and ε-) on proteins were blocked by dimethylation followed by trypsin digestion. Non-N-terminal peptides displaying a free, trypsin-generated α-amine group were depleted using HPG-ALD polymer, and the remaining N-terminal peptides were analyzed in mass spectrometry after bRPLC fractionation. (B) The acetylated peptides mapping to annotated translation initiation sites were identified from HEK293T cells, HUVEC, human colon, and human substantia nigra using the TAILS method. (C) Overlap of the acetylated peptides mapping to annotated translation initiation sites from HEK293T cells, HUVEC, human colon, and human substantia nigra.
List of acetylated peptides mapped to 5′ UTRs identified from TAILS experiments
Figure 2.Translation initiation sites mapping to 5′ UTRs. (A) Peptides acetylated at their N termini mapping to 5′ UTRs that were identified from HEK293T cells, HUVEC, human colon, and human substantia nigra as shown. The colors indicate whether the peptides correspond to uORFs or were N-terminal extensions of annotated proteins. (B) Codons for TISs corresponding to acetylated peptides mapping to 5′ UTRs which led to N-terminal extension of annotated proteins. (C) Codons for TISs corresponding to acetylated peptides mapping to 5′ UTRs which encode an uORF. (D) Sequence logo of nucleotides surrounding the TIS in cases where the TISs lead to N-terminal extension of annotated proteins. (E) Sequence logo of nucleotides surrounding the TIS in cases where the TISs are located in an uORF.
Figure 3.Translation initiation sites mapping to 5′ UTRs identified by a unique database search strategy. (A) Substitution of near-cognate initiation codons to AUG enables identification of acetylated peptides that begin with methionine by searching against a customized database as shown. (B) Peptides acetylated at their N termini mapping to 5′ UTRs that were identified from HEK293T cells, HUVEC, human colon, and human substantia nigra through a database search against a customized database that incorporates substitution of near-cognate initiation codons to AUGs. The colors indicate whether the peptides correspond to upstream open reading frames (ORFs) or were N-terminal extensions of annotated proteins. (C) Codons for TISs corresponding to acetylated peptides mapping to 5′ UTRs identified through a search against a customized database that incorporates substitution of near-cognate initiation codons to AUG. (D) Sequence logo of nucleotides surrounding the TISs identified by searching against the database with the substitution of near-cognate initiation codons to AUG.
List of acetylated peptides with the first amino acid substitution mapped to the 5′ UTR identified from TAILS experiments
Figure 4.Acetylated peptides identified in 5′ UTRs map to TISs inferred using ribosome profiling data and were validated by synthetic peptides. (A) The acetylated peptide identified from HEK293T cells that was derived from the 5′ UTR of STARD10 along with the annotated TIS (the methionine “M” is marked in red) were aligned with ribosome profiling data from HEK293 cells. (B) The acetylated peptide identified from HEK293T cells that was derived from the 5′ UTR of ZNF281 was aligned with the ribosome profiling data from HEK293 cells. The methionine of the annotated TIS is marked as “M” in red. (C) The acetylated peptide positioned in the 5′ UTR of STARD10 was validated with synthetic peptides. The annotated mass spectrum derived from the sample (top) is aligned with the mass spectrum derived from a synthetic peptide (bottom). (D) The acetylated peptide positioned in the 5′ UTR of ZNF281 was validated with synthetic peptides. The annotated mass spectrum derived from the sample (top) is aligned with the mass spectrum derived from a synthetic peptide (bottom).
Figure 5.Sequence conservation of acetylated peptides identified in 5′ UTRs across species. (A) Sequence conservation analysis for the 5′ UTR of the STARD10 gene between different species. (B) Sequence conservation analysis for the 5′ UTR of the ZNF281 gene between different species.
Figure 6.Relative abundance of noncanonical and canonical TISs. (A) The relative abundance was calculated by dividing the number of peptide spectrum matches (PSMs) corresponding to the noncanonical TIS of a gene by the number of PSMs of the corresponding canonical TIS observed in a sample. (B) The relative abundance is shown for the indicated genes across multiple samples.