| Literature DB >> 35740681 |
Haitao Xiang1,2, Le Zhang3, Fanyu Bu2, Xiangyu Guan1,2, Lei Chen2, Haibo Zhang2, Yuntong Zhao2, Huanyi Chen2, Weicong Zhang1,2, Yijian Li1,2,4, Leo Jingyu Lee3, Zhanlong Mei5, Yuan Rao5, Ying Gu2,6, Yong Hou5, Feng Mu5, Xuan Dong2,4.
Abstract
Tumor-specific antigens can activate T cell-based antitumor immune responses and are ideal targets for cancer immunotherapy. However, their identification is still challenging. Although mass spectrometry can directly identify human leukocyte antigen (HLA) binding peptides in tumor cells, it focuses on tumor-specific antigens derived from annotated protein-coding regions constituting only 1.5% of the genome. We developed a novel proteogenomic integration strategy to expand the breadth of tumor-specific epitopes derived from all genomic regions. Using the colorectal cancer cell line HCT116 as a model, we accurately identified 10,737 HLA-presented peptides, 1293 of which were non-canonical peptides that traditional database searches could not identify. Moreover, we found eight tumor neo-epitopes derived from somatic mutations, four of which were not previously reported. Our findings suggest that this new proteogenomic approach holds great promise for increasing the number of tumor-specific antigen candidates, potentially enlarging the tumor target pool and improving cancer immunotherapy.Entities:
Keywords: immunopeptidome; immunotherapy; mass spectrometry; neo-epitope
Year: 2022 PMID: 35740681 PMCID: PMC9220843 DOI: 10.3390/cancers14123016
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Proteogenomic-based identification of the human leukocyte antigen (HLA) immunopeptidome: (A) Schematic representation of experimental and bioinformatic workflow for HLA-I-presented peptide identification; (B) Distribution of peptide-spectrum match (PSM) scores reported by pNovo3 for peptides identified by all three items of software; (C) Venn diagrams showing the reproducibility of HLA-I-presented peptides identified using the three items of software; (D) Length distribution of HLA-I-presented peptides; (E) Precursor charge distribution of the HCT116 immunopeptidome; and (F) Peptide-spectrum match (PSM) count distribution of the HCT116 immunopeptidome.
Figure 2Common features of human leukocyte antigen (HLA)-I-presented peptides of the HC116 cell line: (A) Comparison of measured versus predicted retention times of HLA-I-presented peptides showed high correlation (Pearson R2 = 0.969). Solid red lines mark the difference between measured and predicted retention times encompassing 90% of all peptides. Solid blue line indicates the fitted linear regression line; (B) Proportion of peptides identified by the three items of software under different binder levels. (%Rank ≤ 0.5% for Strong Binder, %Rank > 0.5% and ≤ 2% for Weak Binder, %Rank > 2% for Non Binder)”; (C) Distribution of the proportions of peptides with predicted affinity for specific HLA alleles; and (D) Distribution of the dissociation constant (IC50) predicted by NetMHCpan version 4.1 (Department of Bio and Health Informatics, Lyngby, Denmark).
Figure 3Comparison of features between canonical and non-canonical human leukocyte antigen (HLA) class I immunopeptidomes: (A) Comparison of the distributions of canonical and non-canonical peptide lengths; (B) Comparison of precursor charge distributions of canonical and non-canonical peptides; (C) Comparison of the percentages of peptides bound to HLA class I molecule predicted by NetMHCpan 4.1; (D) Comparison of predicted IC50 (nM) distributions for canonical and non-canonical peptides; (E) Comparison of the distributions of retention time differences between predicted and experimental spectra of canonical and non-canonical peptides; and (F) Comparison of six binding motifs in canonical and non-canonical peptides.
Figure 4Features of non-canonical peptides identified using de novo identification: (A) Upset diagram illustrates the intersection of the non-canonical peptides and the matrix layout for all interaction patterns of the non-canonical peptides at various omics levels, ordered by size. The black circles in the matrix indicate the sets that are part of the intersection; (B) Distribution of the lengths of sequences between two spliced fragments (i.e., the intervening sequence) of normal and reverse cis-spliced peptides; (C) Distribution of the lengths of N- and C-terminal spliced fragments of cis-spliced peptides; (D) Heatmap showing frequencies of the amino acids at each residue of the 9-mer cis-spliced peptide (left) and amino acid preferences at N- terminal, C-terminal, and the boundary of the potential splice site of cis-spliced peptides (right); and (E) Top 10 enriched GO terms at the “biological process” level for the parent proteins of canonical peptides identified by MaxQuant (left) and cis-spliced peptides candidates (right), respectively.
Figure 5Features of tumor-specific neo-epitopes identified using an integrated proteogenomic approach: (A) Venn diagrams showing the overlap in tumor-specific mutant peptides identified by three items of software; (B) Distribution of peptide-spectrum match (PSM) counts of eight tumor-specific mutant peptides; (C) Comparison of measured versus predicted retention times of eight mutant peptides; (D) Mirror diagram comparing the experimentally obtained mutant peptide spectrum (bottom) with its corresponding synthetic peptide spectrum (top) for mutant peptide QTDQMVFNTY; (E) Parallel reaction monitoring verifies the consistency of mutant peptides (right panel) with their corresponding synthetic equivalents (left panel) for mutant peptide QTDQMVFNTY. The intensity graph displays parallel reaction monitoring (PRM) peak signals of fragment ions. The bar graph shows the normalized peak areas of all fragment ions for the peptide. The contribution from each fragment ion is shown in a different color in the bar graph.
List of mutant peptides identified in HCT116 cells.
| Sequence | HLA | IC50 (nM) | Mutation Locus | Gene | Protein | AA Change | Wild Peptides Identified | Reference |
|---|---|---|---|---|---|---|---|---|
| QTDQMVFNTY | HLA-A*01:01 | 15.85 | chr8:23258741 | CHMP7 | Q8WUX9 | p.A324T | Yes | [ |
| RLDPGEPKSY | HLA-A*01:01 | 1193.67 | chr9:35750732 | RGP1 | Q92546 | p.S110P | No | [ |
| AAAPVVPQV | HLA-A*02:01 | 175.14 | chr6:149796499 | PCMT1 | P22061 | p.A168V | No | [ |
| DEQQVDVL | HLA-B*18:01 | 771.4 | chr20:31605574 | ID1 | P41134 | p.N63D | Yes | - |
| EEEYPGVTA | HLA-B*45:01 | 106.68 | chr22:17726722 | BCL2L13 | Q9BXK5 | p.I216V | Yes | [ |
| EEYPGVTA | HLA-B*45:01 | 448.13 | chr22:17726722 | BCL2L13 | Q9BXK5 | p.I216V | No | - |
| SLFNKYPAL | HLA-A*02:01 | 11.61 | chrX:115643440 | PLS3 | P13797 | p.N372S | No | - |
| FLDNQQHGM | HLA-C*05:01 | 20.86 | chr1:88804485 | PKN2 | Q16513 | p.R459Q | No | - |
HLA, human leukocyte antigen; AA, amino acid.