| Literature DB >> 29529017 |
Florian Erhard1,2, Anne Halenius3,4, Cosima Zimmermann3,4, Anne L'Hernault5, Daniel J Kowalewski6,7, Michael P Weekes8, Stefan Stevanovic6, Ralf Zimmer1, Lars Dölken2.
Abstract
Ribosome profiling has been used to predict thousands of short open reading frames (sORFs) in eukaryotic cells, but it suffers from substantial levels of noise. PRICE (https://github.com/erhard-lab/price) is a computational method that models experimental noise to enable researchers to accurately resolve overlapping sORFs and noncanonical translation initiation. We experimentally validated translation using major histocompatibility complex class I (MHC I) peptidomics and observed that sORF-derived peptides efficiently enter the MHC I presentation pathway and thus constitute a substantial fraction of the antigen repertoire.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29529017 PMCID: PMC6152898 DOI: 10.1038/nmeth.4631
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547
Figure 1The PRICE approach.
(a) Schematic of the approach. Left: Bars represent parameters of the probabilistic model. Center: Translated codons are identified by solving the inverse problem of the model. Right: calling actively translated ORFs based on the generalized binomial distribution (for details see Online Methods).
(b) Approaches to map reads to codons are compared with respect to signal (total number of reads mapped in-frame) and the signal to noise ratio (noise: reads mapped out-of-frame to annotated ORFs). Colors represent deterministic mapping of read classes defined by length and 5’ mismatch state (red, grey), of combinations of read classes (blue; Basic: ignoring 5’ mismatches; Extended: considering 5’ mismatches; Top 4: combining the best read classes; see also Supplementary Fig. 4) and probabilistic mapping by PRICE (green).
(c) Total amount of peptides detected in proteome and MHC-I peptidome mass spectrometry experiments (MHC-I peptidome data set 1; see Supplementary Fig. 9a for the other experiment). The 1% peptide identification FDR is indicated. Grey bars show the peptides from ORFs also identified by ORF-RATER or Rp-Bp (for PRICE) or ORFs also identified by PRICE (for ORF-RATER and Rp-Bp).
d) Validation rates of peptides from predicted ORFs with a minimal number of reads per codon (MHC-I peptidome data set 2; see Supplementary Fig. 9b for the other experiment). Rates for all ORFs (solid lines) identified by the indicated methods and for ORFs predicted de-novo (dashed lines) are shown.
Figure 2Re-decoding human cytomegalovirus
(a) Venn diagram comparing the indicated datasets. We merged N-terminal variants of ORFs ending at the same stop codon.
(b) Comparison of the start codon distribution of 528 novel ORFs detected only by PRICE (light green) to the distribution of the 248 confirmed ORFs (turquois) (p ≈ 0.42, Fisher's combined probability test based on indicated one-sided binomial tests). Note that an ORF may have more than one start codon.
(c) The start codon distribution of the 232 not confirmed ORFs (blue) and confirmed ORFs (turquoise) (p ≈ 3.2 × 10−5, Fisher's combined probability test based on indicated one-sided binomial tests).