| Literature DB >> 29653567 |
Mary A Wood1,2, Mayur Paralkar1,3, Mihir P Paralkar1,3, Austin Nguyen1,4, Adam J Struck1, Kyle Ellrott1,5, Adam Margolin1,5, Abhinav Nellore1,5,6, Reid F Thompson7,8,9,10.
Abstract
BACKGROUND: Tumor neoantigens are drivers of cancer immunotherapy response; however, current prediction tools produce many candidates requiring further prioritization. Additional filtration criteria and population-level understanding may assist with prioritization. Herein, we show neoepitope immunogenicity is related to measures of peptide novelty and report population-level behavior of these and other metrics.Entities:
Keywords: Immunogenicity; Immunotherapy; Neoantigens; Neoepitopes; TCGA
Mesh:
Substances:
Year: 2018 PMID: 29653567 PMCID: PMC5899330 DOI: 10.1186/s12885-018-4325-6
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Illustration of proposed neoepitope prioritization metrics. a. Tumor vs. paired normal peptide binding affinity difference addresses the difference in MHC Class I binding affinity between the paired tumor and normal epitopes, and a novel binding change occurs when a tumor epitope binds readily to a patient’s HLA allele while its paired normal epitope does not. Examples are shown of a neoepitope which displayed a novel binding change (left) and a neoepitope which did not (right). Mutated residues are shown in blue underline. b. Tumor vs. paired normal peptide sequence similarity addresses the similarity in sequence between the paired tumor-normal epitopes at non-anchor residues based on a BLOSUM62 matrix, normalized by the tumor epitope’s similarity with itself. Examples are shown of a neoepitope with low similarity to its paired normal epitope (left) and a neoepitope with high similarity to its paired normal epitope (right). Anchor residue positions are shown faded, and mutated residues are shown in blue and underlined. c. Tumor vs. closest human peptide sequence similarity addresses how similar the neoepitope is to all human proteins based on a blastp search. Examples are shown of a neoepitope which matched to a peptide from a gene other than its gene of origin (left) and a neoepitope which matched to a peptide from its gene of origin (right). Anchor residue positions are shown faded, and mutated residues are shown in blue and underlined. d. Tumor vs. closest microbial peptide sequence similarity addresses how similar the neoepitope is to all bacterial and viral proteins based on a blastp search. Examples are shown of a neoepitope that matches closer to a microbial peptide than any human peptide (left) and a neoepitope which matches closer to a human peptide than any microbial peptide (right). Anchor residue positions are shown faded, and mutated residues are shown in blue and underlined
Fig. 2Neoepitope predictions in TCGA across disease sites and HLA alleles. a. Number of total and putatively novel-binding predicted neoepitopes in each disease group from TCGA. The total number of neoepitopes (gray) for each patient in each disease group, shown in order of decreasing median neoepitope burden, was determined using pVAC-Seq. Novel binding (red) are the subset of neoepitopes which displayed a putatively novel binding change (see Methods). Vertical lines separate TCGA disease sites. Outliers have been removed for clarity. On average, 20.3% of a patient’s neoepitopes were novel-binding. A TCGA disease abbreviation key is available in Additional file 1: Table S1. b. Putatively novel binding predicted neoepitopes across HLA alleles in TCGA. Top: number of neoepitopes with a novel binding change in TCGA for each HLA allele studied, colored green, blue, and yellow according to HLA allele types (A, B, and C, respectively). Bottom: average population frequency for each HLA allele studied, colored as per Top pane. Alleles with > 10% frequency in the population are labeled. There was no relationship between allele frequency and number of putatively novel binding neoepitopes (Pearson’s product-moment correlation of 0.1, p = 0.1)
Fig. 3Establishment of putatively novel binding criteria in melanoma patients. a. Distribution of paired tumor (x-axis) and normal (y-axis) epitope binding affinities for all neoepitopes analyzed in the Hugo et al. [48] melanoma cohort. The vertical blue line divides epitopes into groups with (left) and without (right) strong tumor epitope binding (MHC affinity < 500 nM), while the horizontal blue line divides epitopes into groups with (bottom) and without (top) strong normal epitope binding (MHC affinity < 500 nM). The diagonal blue line divides epitopes into groups where the normal epitope binding affinity is at least 5× poorer than the tumor epitope (above) and where the normal epitope binding affinity is less than 5× poorer than the tumor epitope (below). Epitopes colored red are those that have strong tumor epitope binding affinities (< 500 nM), weak normal epitope binding affinities (> 500 nM), and normal epitope binding affinity at least 5× poorer than tumor epitope binding affinity (constituting a putatively novel binding change). Both axes are shown as log scale. b. Number of total (gray) and putatively novel binding (red) neoepitopes for each melanoma patient among the Hugo et al. cohort. Patients 15 and 21 each have a single putatively novel binding neoepitope. Y axis is shown as log scale
Fig. 4Similarity of neoepitopes to human peptides. a. Matching status of top blastp hit(s) to a neoepitope’s gene of origin (see Methods) for patients across TCGA disease groups (gray = matching, red = non-matching). Vertical lines separate TCGA disease sites. Outliers have been removed for clarity. A disease abbreviation key is available in Additional file 1 Table S1. b. Distribution of the proportion of neoepitopes that matched to the gene of origin for each patient’s neoepitopes. On average, 77.3% of neoepitopes for each patient had a top blastp hit that matched their gene of origin
Fig. 5Species of origin of bacterial peptide matches to neoepitopes. Top 10 most frequent bacterial genera with peptides matching more closely to a neoepitope than either its paired normal epitope or its top blastp hit are shown
Significance of prioritization metrics in predicting immune response. Based on a linear model, each prioritization metric, along with tumor and paired normal epitope binding affinities and the number of sequence mismatches between neoepitopes and paired normal epitopes, were tested for the ability to predict immune response to a predicted neoepitope
| Predictor of immune response | Adjusted R2 | Significance ( |
|---|---|---|
| Neoepitope binding affinity | −0.002 | 0.7 |
| Paired normal epitope binding affinity | −0.002 | 0.7 |
| Difference in binding affinity between neoepitope and paired normal epitope | −0.002 | 0.7 |
| Difference in binding affinity between neoepitope and closest human protein | −0.002 | 0.3 |
| Number of mismatches between neoepitope and paired normal epitope | 0.01 | 0.03 |
| Percent protein sequence similarity between neoepitope and paired normal epitope | 0.004 | 0.09 |
| Percent protein sequence similarity between neoepitope and closest human protein | −0.001 | 0.5 |
| Percent protein sequence similarity between neoepitope and closest bacterial protein | −0.002 | 0.8 |
| Percent protein sequence similarity between neoepitope and closest viral protein | 0.007 | 0.046 |
Fig. 6Neoepitope prioritization metric scores and linear modeling in the cohort of peptides with neoepitope-specific immune response data. Score distributions are shown for peptides with (red) and without (gray) neoepitope-specific immune response. Panes in light blue indicate metrics which are included in the model shown in part G. R = “response”, NR = “nonresponse”. a. Neoepitope binding affinity, * p < 0.001 in a Wilcoxon rank sum test. b. Difference in binding affinity between the neoepitope and its paired normal epitope (normal affinity – tumor affinity). c. Percent protein sequence similarity (see Methods) between the neoepitope and its paired normal epitope. d. Percent protein sequence similarity between the neoepitope and its closest human peptide. e. Percent protein sequence similarity between the neoepitope and its closest bacterial peptide. f. Percent protein sequence similarity between the neoepitope and its closest viral peptide. g. ROC curve for prediction of immunogenicity from prioritization criteria. A linear model incorporating neoepitope binding affinity, protein sequence similarity between neoepitopes and their closest viral peptides, and difference in binding affinity between neoepitopes and their closest human peptides was used to predict immune response. Within a limited cohort of 419 peptides with immune response data, our model was able to predict peptide immunogenicity with an AUROC of 0.66. Dashed gray line represents the line y = x for comparison