| Literature DB >> 34925382 |
Soumya Mukherjee1,2, Alvaro Sanchez-Bernabeu1,2, Laura C Demmers1,2, Wei Wu1,2, Albert J R Heck1,2.
Abstract
Mass-spectrometry based immunopeptidomics has provided unprecedented insights into antigen presentation, not only charting an enormous ligandome of self-antigens, but also cancer neoantigens and peptide antigens harbouring post-translational modifications. Here we concentrate on the latter, focusing on the small subset of HLA Class I peptides (less than 1%) that has been observed to be post-translationally modified (PTM) by a O-linked N-acetylglucosamine (GlcNAc). Just like neoantigens these modified antigens may have specific immunomodulatory functions. Here we compiled from literature, and a new dataset originating from the JY B cell lymphoblastoid cell line, a concise albeit comprehensive list of O-GlcNAcylated HLA class I peptides. This cumulative list of O-GlcNAcylated HLA peptides were derived from normal and cancerous origin, as well as tissue specimen. Remarkably, the overlap in detected O-GlcNAcylated HLA peptides as well as their source proteins is strikingly high. Most of the O-GlcNAcylated HLA peptides originate from nuclear proteins, notably transcription factors. From this list, we extract that O-GlcNAcylated HLA Class I peptides are preferentially presented by the HLA-B*07:02 allele. This allele loads peptides with a Proline residue anchor at position 2, and features a binding groove that can accommodate well the recently proposed consensus sequence for O-GlcNAcylation, P(V/A/T/S)g(S/T), essentially explaining why HLA-B*07:02 is a favoured binding allele. The observations drawn from the compiled list, may assist in the prediction of novel O-GlcNAcylated HLA antigens, which will be best presented by patients harbouring HLA-B*07:02 or related alleles that use Proline as anchoring residue.Entities:
Keywords: HLA; HLA-B*07:02; MHC; O-GlcNAcylation modification; immunopeptidome; neo-antigen
Mesh:
Substances:
Year: 2021 PMID: 34925382 PMCID: PMC8671986 DOI: 10.3389/fimmu.2021.796584
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Illustrative EThcD mass spectrum of the peptide TPASgSRAQTL, originating from Spectrin, in its O-GlcNAcylated form, modified at P5. Highlighted in green are the oxonium ions. Fragment ions that elucidate the amino acid sequence and the glycosylation site are labelled as b, c, y and z. The fragment ions without modification are indicated with a wave dash.
Figure 2Distribution of the identified unmodified (left) and O-GlcNAcylated (right) HLA class I peptides over (A) the JY HLA alleles in the current study, and (B) all HLA alleles in the GR-LCL cell line (C) Overlap in identified peptides in the current study (purple), Malaker et al. (yellow) and Marino et al. (green). (D) Overlap in identified proteins in the current study (purple), Malaker et al. (yellow) and Marino et al. (green).
Compilation of O-GlcNAcylated HLA class I peptides identified in the current study by Marino et al. and Malaker et al.
| # | Sequence | Length | UniProt ID | Source protein | Subcellular localization | Predicted HLA allele | Predicted Affinity (IC50 nM) | Start-Stop | Described at. |
|---|---|---|---|---|---|---|---|---|---|
| 1 | RPLSKgTVRF | 9 | P62280 | 40S ribosomal protein S11 | Nucleus, Cytosol | HLA-B*07:02 | 35 | 132-140 | CS |
| 2 | IVQAgTRTSL | 8 | Q6VMQ6 | Activating transcription factor 7-interacting protein 1 | Nucleus | HLA-B*07:02 | 992 | 831–838 | ( |
| 3 | NPVgSLPSL | 9 | Q6VMQ6 | Activating transcription factor 7-interacting protein 1 | Nucleus | HLA-B*07:02 | 69 | 881-889 | ( |
| 4 | EPSSTVVSL | 9 | O75129 | Astrotactin-2 | Membrane | HLA-B*07:02 | 445 | 1076–1085 | ( |
| 5 | HPMgSTASQV | 9 | Q13492 | Clathrin assembly lymphoid myeloid leukemia | Nucleus, Golgi apparatus | HLA-B*07:02 | 25 | 345–353 | ( |
| 6 | VPTgTSSSL | 8 | Q14004 | Cyclin dependent kinase 13 | Nucleus | HLA-B*07:02 | 168 | 1284–1291 | ( |
| 7 | VPASSTSTL | 9 | Q9NYV4 | Cyclin-dependent kinase 12 | Nucleus | HLA-B*07:02 | 8 | 576-584 | CS, ( |
| 8 | LPKPANTSAL | 10 | Q2KHR2 | DNA-binding protein RFX7 | Nucleus | HLA-B*07:02 | 7 | 336-345 | CS, ( |
| 9 | TPIgSQAQKLa | 9 | Q96L91 | E1A-binding protein p400 | Nucleus | HLA-B*07:02 | 363 | 3024–3032 | ( |
| 10 | VPRSSSMVL | 9 | Q9H992 | E3 ubiquitin-protein ligase | Nucleus, Cytosol | HLA-B*07:02 | 3 | 123-131 | CS |
| 11 | APPSTSAAAL | 10 | Q86TM6 | E3 Ubiquitin-protein ligase synoviolin | Endoplasmic | HLA-B*07:02 | 26 | 405–414 | ( |
| 12 | PPSTSAAAL | 9 | Q86TM6 | E3 Ubiquitin-protein ligase synoviolin | Endoplasmic reticulum | HLA-B*07:02 | 742 | 405–414 | ( |
| 13 | APTSASNVM | 9 | P28324 | ETS domain-containing protein Elk-4 | Nucleus | HLA-B*07:02 | 22 | 238-246 | CS |
| 14 | VLTSNVQTI | 9 | P32519 | ETS-related transcription factor Elf-1 | Nucleus | HLA-A*02:01 | 376 | 507-515 | CS |
| 15 | RPPSSSQQL | 9 | Q8WYB5 | Histone acetyltransferase KAT6B | Nucleus | HLA-B*07:02 | 70 | 1758–1766 | ( |
| 16 | LPRGSSPSVL | 10 | Q9GZN2 | Homeobox protein TGIF2 | Nucleus | HLA-B*07:02 | 5 | 105-114 | CS |
| 17 | APTgSAAAL | 8 | Q86Z02 | Homeodomain-interacting protein kinase 1 | Nucleus, Cytoplasm | HLA-B*07:02 | 90 | 1116–1123 | CS, ( |
| 18 | APVgSSKSSLa | 9 | Q86Z02 | Homeodomain-interacting protein kinase 1 | Nucleus, Cytoplasm | HLA-B*07:02 | 22 | 1157–1164 | CS, ( |
| 19 | VPVgSVGPSL | 9 | Q86Z02 | Homeodomain-interacting protein kinase 1 | Nucleus, Cytoplasm | HLA-B*07:02 | 7 | 850–858 | CS, ( |
| 20 | APFgSCRTEL | 9 | P32942 | Intercellular adhesion molecule 3 | Membrane | HLA-B*07:02 | 5 | 186-194 | ( |
| 21 | IPVgSKPLSL | 9 | Q16621 | Leucine zipper protein 1 | Cytoplasm | HLA-B*07:02 | 22 | 104–112 | ( |
| 22 | LPRNSTMM | 8 | Q9NPI6 | mRNA-decapping enzyme 1A | Nucleus | HLA-B*07:02 | 61 | 335–342 | ( |
| 23 | HPSSTASTAL | 10 | Q96T58 | Msx2-interacting protein | Nucleus | HLA-B*07:02 | 15 | 3041–3050 | ( |
| 24 | RPVgTASITTMa | 10 | Q9ULH7 | Myocardin-related transcription factor B | Nucleus | HLA-B*07:02 | 19 | 927–936 | ( |
| 25 | IPVgSSHNSLa | 9 | Q06413 | Myocyte-specific enhancer factor 2C | Nucleus | HLA-B*07:02 | 10 | 147–155 | CS, ( |
| 26 | VPVgSNQSSL | 9 | Q14814 | Myocyte-specific enhancer factor 2D | Nucleus | HLA-B*07:02 | 25 | 146–154 | ( |
| 27 | LPTgSLPSSL | 9 | P46531 | Neurogenic locus notch homolog protein 1 | Cell membrane | HLA-B*07:02 | 20 | 2464–2472 | ( |
| 28 | VPVSGgTQGL | 9 | P23511 | Nuclear transcription factor Y subunit alpha | Nucleus | HLA-B*07:02 | 56 | 93-101 | CS, ( |
| 29 | MPVRPTgTNTF | 10 | Q7Z3K3 | Pogo transposable element with ZNF domain | Nucleus, Cytoplasm | HLA-B*07:02 | 13 | 218–227 | ( |
| 30 | HPSSTAAVL | 9 | Q86XN7 | Proline and serine-rich protein 1 | – | HLA-B*07:02 | 13 | 740–748 | CS, ( |
| 31 | APRgTNGVAM | 9 | Q92567 | Protein FAM168A | – | HLA-B*07:02 | 3 | 187–195 | ( |
| 32 | IPAVgTRSTI | 9 | Q96RT1 | Protein LAP2 | Nucleus, Plasma membrane | HLA-B*07:02 | 8 | 1066-1074 | ( |
| 33 | IPIgSLHTSLa | 9 | Q5JSZ5 | Protein PRRC2B | – | HLA-B*07:02 | 11 | 1959–1967 | ( |
| 34 | APVgSASASV | 9 | Q9Y520 | Protein PRRC2C | – | HLA-B*07:02 | 20 | 1807–1815 | CS, ( |
| 35 | MPSSSHGSM | 9 | Q15532 | Protein SSXT | Nucleus | HLA-B*07:02 | 5 | 152-160 | CS |
| 36 | IPTgSSVLSL | 9 | O15027 | Protein transport protein Sec 16A | Endoplasmic reticulum | HLA-B*07:02 | 31 | 710–718 | ( |
| 37 | RPPQgSSSVSL | 10 | O15027 | Protein transport protein Sec 16A | Nucleus | HLA-B*07:02 | 9 | 937–946 | ( |
| 38 | gSPRVTQTIAL | 10 | Q9BZA7 | Protocadherin-11 X-linked | Plasma membrane | HLA-B*07:02 | 3 | 1186-1194 | CS |
| 39 | RPPVTKASSF | 10 | Q9Y2K5 | R3H domain-containing protein 2 | Nucleus | HLA-B*07:02 | 26 | 341-350 | CS, ( |
| 40 | RgSPTKSSL | 8 | Q96PK6 | RNA-binding protein 14 | Nucleus, Cytoplasm | – | – | 243-250 | CS |
| 41 | IPRPPIgTQSSL | 9 | Q9P2N5 | RNA-binding protein 27 | Nucleus, Cytoplasm | HLA-B*07:02 | 20 | 382–390 | ( |
| 42 | RPPIgTQSSLa,b | 11 | Q9P2N5 | RNA-binding protein 27 | Nucleus, Cytoplasm | HLA-B*07:02 | 11 | 380-390 | ( |
| 43 | RPgTPRGITL | 9 | Q7Z614 | Sorting nexin-20 | Nucleus, Cytoplasm | HLA-B*07:02 | 4 | 298-306 | ( |
| 44 | TPASSRAQTL | 10 | Q01082 | Spectrin beta chain, non-erythrocytic 1 | Cytoskeleton, Plasma membrane | HLA-B*07:02 | 9 | 2320–2329 | CS, ( |
| 45 | APVgSPSSQKL | 10 | Q9NYB0 | Telomeric repeat-binding factor 2-Interacting protein 1 | Nucleus, Cytoplasm | HLA-B*07:02 | 70 | 200-209 | ( |
| 46 | KPPVgSFFSL | 9 | Q6PKC3 | Thioredoxin domain containing protein 11 | Endoplasmic reticulum | HLA-B*07:02 | 91 | 95–103 | ( |
| 47 | IPTgSARSML | 9 | Q8WXI9 | Transcriptional repressor p66-beta | Nucleus | HLA-B*07:02 | 9 | 522-530 | ( |
| 48 | TPARSRgSKE | 9 | P62995 | Transformer-2 protein homolog beta | Nucleus | HLA-B*07:02 | 2172 | 32-40 | CS |
| 49 | VPEVgTKPSL | 9 | Q9UPQ9 | Trinucleotide repeat-containing gene 6B protein | P-bodies | HLA-B*07:02 | 65 | 39-47 | ( |
| 50 | TPASSSSAL | 9 | Q9NPG3 | Ubinuclein-1 | Nucleus | HLA-B*07:02 | 5 | 875-883 | CS, ( |
| 51 | KPPTSQSSVL | 10 | Q5T6F2 | Ubiquitin-associated protein 2 | Nucleus, Cytoplasm | HLA-B*07:02 | 32 | 411–420 | CS, ( |
| 52 | VPVgSSASEL | 9 | Q7Z2W4-3 | Zinc finger CCCH-type, antiviral 1 | Nucleus, Cytoplasm | HLA-B*07:02 | 36 | 596–603 | ( |
| 53 | SVVTTVWGV | 9 | Q8NF64 | Zinc finger MIZ domain-containing protein 2 | Nucleus | HLA-A*02:01 | 12 | 40-48 | CS |
| 54 | RVKTPTgSQSY | 10 | Q9Y2X9 | Zinc finger protein 281 | Nucleus | HLA-A*03:01 | 631 | 885-894 | ( |
| 55 | RVKTPTgSQSYR | 11 | Q9Y2X9 | Zinc finger protein 281 | Nucleus | HLA-A*03:01 | 439 | 885-895 | ( |
The O-GlcNAcylated Thr or Ser is tagged with a “g” when unambiguously assigned by tandem MS data. †Each of these two pairs of overlapping peptides were considered as a unique sequence. Peptides described in the current study are assigned as CS. aThese peptides were tested by Malaker et al. for immunogenicity. bMethylated and O-GlcNAcylated peptidoform of this peptide was also tested for immunogenicity by Malaker et al.
Figure 3(A) Sequence motif (pLogo) of the in the current study detected unmodified HLA-B*07:02 peptides (9-mers) predicted as binders (% Rank < 2), with the clear anchoring positions Pro at P2 and Leu at P9. (B) Sequence motif of the O-GlcNAcylated HLA peptides (9-mers), with next to Pro at P2, Val at P3 and Ser at P4 and P5. Semi-consensus sequences based on the 7002 human O-GlcNAcylation sites for (C) Threonine and (D) Serine O-GlcNAcylation. (adapted from Wuff-Fuentes et al. (https://creativecommons.org/licenses/by/4.0/legalcode).
Figure 4Schematic of detected co-occurring peptidoforms for the two overlapping unique HLA peptides detected, RPPITQSSL and IPRPPITQSSL, originating from RNA-binding protein 27. In the datasets of both Marino et al. and Malaker et al. several O-GlcNAcylated HLA peptides originating from the RNA binding protein 27 were reported. In Marino et al. the backbone sequence covered aa 380-390, whereas Malaker et al. observed the alike peptide covering aa 382-390. These peptides are likely HLA-B*07:02 binders, due to the underlined anchoring positions Pro at 2 and Leu at C-terminal position. In both datasets peptides were found with the Arg at 1 (3) being non-methylated, mono-methylated or di-methylated. The Thr at P5 (P7) was detected in several different glycoforms: non-O-GlcNAc, one GlcNAc, hexose-GlcNAc and even more extended glycans with mass shifts of 730 and 1021 Da.