| Literature DB >> 33072102 |
Wayne Paes1, German Leonov2, Thomas Partridge1, Annalisa Nicastri3, Nicola Ternette3, Persephone Borrow1.
Abstract
Proteasomes catalyze the degradation of endogenous proteins into oligopeptides, but can concurrently create spliced oligopeptides through ligation of previously non-contiguous peptide fragments. Recent studies have uncovered a formerly unappreciated role for proteasome-catalyzed peptide splicing (PCPS) in the generation of non-genomically templated human leukocyte antigen class I (HLA-I)-bound cis-spliced peptides that can be targeted by CD8+ T cells in cancer and infection. However, the mechanisms defining PCPS reactions are poorly understood. Here, we experimentally define the biochemical constraints of proteasome-catalyzed cis-splicing reactions by examination of in vitro proteasomal digests of a panel of viral- and self-derived polypeptide substrates using a tailored mass-spectrometry-based de novo sequencing workflow. We show that forward and reverse PCPS reactions display unique splicing signatures, defined by preferential fusion of distinct amino acid residues with stringent peptide length distributions, suggesting sequence- and size-dependent accessibility of splice reactants for proteasomal substrate binding pockets. Our data provide the basis for a more informed mechanistic understanding of PCPS that will facilitate future prediction of spliced peptides from protein sequences.Entities:
Keywords: antigen processing; peptide epitopes; peptide splicing; proteasome; splicing mechanism
Mesh:
Substances:
Year: 2020 PMID: 33072102 PMCID: PMC7541919 DOI: 10.3389/fimmu.2020.563800
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Synthetic polypeptides subjected to in vitro proteasomal digestion.
|
|
|
|
|
|---|---|---|---|
| PP1 | sORF-encoded polypeptide: APITD1 | SSCLPCPLSFEKFK | 14 |
| PP2 | Q92879: CUGBP Elav-like family member 1 | EGCSSPMVVKFADTQK | 16 |
| PP3 | Q5CZC0: Fibrous sheath-interacting protein 2 | LVSIQKSIVSRSPIMIDQ | 18 |
| PP4 | P15882: N-chimerin | LTSLVRRATLKENEQIPK | 18 |
| PP5 | sORF-encoded polypeptide: CD81 | LPRFESRVCGHSLPSCTCP | 19 |
| PP6 | HIV-1 CH529 enr: Vif | DQLIHLYYFDCFSESAIRK | 19 |
| PP7 | P37275: Zinc finger E-box-binding homeobox 1 | SLIPVNGRPRTGLKTSQCS | 19 |
| PP8 | Q8TDU5: Putative vomeronasal receptor-like protein 4 | HLPLIHILLLFTQAILVSS | 19 |
| PP9 | HIV-1 IIIB: Vif | ALIKPKQIKPPLPSVRKLTE | 20 |
| PP10 | HIV-1 CH390 enr: Vif | TADQLIHLYYFDCFSESAIRK | 21 |
| PP11 | Q9NW13: RNA-binding protein 28 | IRNLSFKCSEDDLKTVFAQFGA | 22 |
| PP12 | HIV-1 CH945 enr: Vif | LADQLIHLYHFDCFTESAIRNA | 22 |
| PP13 | HIV-1 CH945 enr: Vif mutant | LQPQLIHLYYFDCFSESAIRK | 22 |
| PP14 | HIV-1 NL4-3: Gag | FGEETTTPSQKQEPIDKELYPLA | 23 |
| PP15 | HIV-1 NL4-3: Gag | AAMQMLKETINEEAAEWDRLHPVHA | 25 |
| PP16 | P46013: Antigen KI-67 | KSWADVVKLGAKQTQTKVIKHGPQR | 25 |
| PP17 | Q00887:Pregnancy-specific beta-1-glycoprotein 9 | EMTDLYHYIISYIVDGKIIIYGPAY | 25 |
| PP18 | HIV-1 IIIB: Cryptic ORF | VAAPRLLPCALQQAESCVERSPLALLS | 27 |
| PP19 | P47989: Xanthine dehydrogenase/oxidase | PRKQLRFEGERVTWIQASTLKELLDLK | 27 |
| PP20 | Q92608: Dedicator of cytokinesis protein 2 | YLDTSSRGEQCEPILRTLKALEYVFKFI | 28 |
| PP21 | Q5T7P8: Synaptotagmin-6 | KLKDPSTLGFLEAAVKISHTSPDIPAEVQM | 30 |
| PP22 | HIV-1 NL4-3: Pol | AELELAENREILKEPVHGAYYDPSKDLIAEL | 31 |
| PP23 | Q14894: Thiomorphine-carboxylate dehydrogenase | ALTTKLVTFYEDRGITSVVPSHQATVLLFEPSNG | 34 |
| PP24 | HIV-1 NL4-3: Pol | IRKVLFLDGIDKAQEEHEKYHSNWRAMASDFNLPPVVAKEIVAS | 44 |
| PP25 | Q9H8V3: Protein ECT2 | KALKTIKIMEVPVIKIKESCPGKSDEKLIKSVINMDIKVGFVKMESV | 47 |
Amino acid frequencies within synthetic polypeptides and combined HIV-1 and UniProt human proteomes.
|
|
|
|
|---|---|---|
| A | 7.16 | 8.25 |
| C | 2.65 | 1.37 |
| D | 4.56 | 5.45 |
| E | 7.49 | 6.75 |
| F | 4.23 | 3.86 |
| G | 3.09 | 7.07 |
| H | 2.76 | 2.27 |
| I | 7.65 | 5.96 |
| K | 8.31 | 5.84 |
| L | 11.24 | 9.66 |
| M | 1.63 | 2.42 |
| N | 1.63 | 4.06 |
| P | 6.19 | 4.7 |
| Q | 4.40 | 3.93 |
| R | 4.23 | 5.53 |
| S | 8.31 | 6.56 |
| T | 4.89 | 5.34 |
| V | 5.86 | 6.87 |
| W | 0.65 | 1.08 |
| Y | 3.09 | 2.92 |
Amino acids are denoted using single letter code.
Figure 1Diversity, abundance, and peptide length distribution of proteasome-derived spliced and non-spliced peptides. (A) Proportion of unique spliced and non-spliced peptides following a 2 h in vitro digestion of 25 self- and HIV-1-derived polypeptides (Table 1) by the constitutive proteasome. Proportions of spliced and non-spliced peptides within all unique peptides (n = 1,646), unique peptides originating from only the 13 longest polypeptide substrates (n = 1,331) and unique peptides originating from only the 12 shortest polypeptide precursors (n = 315) are shown. (B) Violin plots showing abundance of all unique spliced and non-spliced peptides as measured by LC–MS/MS intensity values. Median and quartile abundance values are indicated. A non-parametric unpaired Mann–Whitney t-test was used to determine whether abundance values differed between groups. ****P < 0.0001. (C) Length distributions of unique spliced (n = 135) and non-spliced (n = 900) peptides generated from within polypeptide substrates following a 2 h proteasomal digest.
Figure 2Physical properties of forward and reverse PCPS reactions. (A) Schematic depicting forward and reverse PCPS and indicating the N- and C-terminal fusion partners. (B) Relative proportions of forward and reverse spliced peptides among all unique spliced peptides (n = 135) identified following 2 h in vitro digestion of 25 self- and HIV-1-derived polypeptide sequences by the constitutive proteasome. (C) Length distributions of forward (n = 65) and reverse (n = 70) spliced peptides. (D) Relative abundance of forward and reverse spliced peptides. Median and quartile abundance values are indicated. A non-parametric unpaired Mann–Whitney t-test was used to determine whether abundance values differed between groups, with P < 0.05 as the significance threshold. (E) Fragment length distributions of N-terminal and C-terminal splice partners involved in forward and reverse PCPS reactions. (F) Cis-splicing distances in forward and reverse PCPS reactions.
Figure 3Proteasomal cleavage preferences differ between canonical hydrolysis and PCPS reactions. (A) Schematic depicting cleavage sites employed to generate non-spliced and forward and reverse spliced peptides from precursor polypeptides. X1 and Z1 denote residues at cleavage sites acting as O-acyl enzyme intermediates and nucleophilic free amine groups (respectively) in forward PCPS reactions, while X2 and Z2 denote the corresponding residues in reverse PCPS reactions. (B) Comparison of global amino acid cleavage site usage for spliced and non-spliced peptides. Amino acid frequencies are normalized relative to the overall frequency of that amino acid within all 25 precursor polypeptides. (C) Normalized frequencies of amino acid residues acting as acyl-enzyme intermediates in forward and reverse PCPS reactions. (D) Normalized frequencies of amino acid residues acting as nucleophilic free amines in forward and reverse PCPS reactions. In (B–D), amino acids are defined using single letter code.
Figure 4Forward and reverse PCPS reactions display unique splice site signatures. (A) Schematic depicting forward and reverse PCPS. Red arrows indicate fusion sites. Amino acid pairings proximal to the fusion site are labeled P1/P1′, P2/P2′ and P3/P3′. (B) Forward PCPS splice site signature. (C) Reverse PCPS splice site signature. In (B) and (C), heat maps show frequencies of amino acid residue pairings at ±1, 2, or 3 aa adjacent to the fusion site. Scale represents the frequency of each amino acid pairing as a percentage of the overall pairings, and amino acids are defined using single letter code.
Figure 5Schematics depicting characteristics of proteasome-catalyzed peptide splicing. (A) Diagram illustrating the characteristics of forward PCPS events. (B) Diagram illustrating the characteristics of reverse PCPS events occurring over cis-splicing distances >0 aa. (C) Diagram illustrating the characteristics of reverse PCPS events occurring with no intervening cis-splicing distance. In (A–C), Red arrows indicate splicing events. Amino acids typically acting as O-acyl enzyme intermediates or nucleophilic free amines as defined in splice site signatures (Figure 4) are shown in single letter code at the end and start of N- and C-terminal splice partners (respectively).