| Literature DB >> 32256484 |
Mark Yarmarkovich1,2, Alvin Farrel1, Artemio Sison3, Moreno di Marco4, Pichai Raman3,5, Joshua L Parris2, Dimitrios Monos2,6, Hongzhe Lee2, Stefan Stevanovic4, John M Maris1,2.
Abstract
Despite recent advances in cancer immunotherapy, the process of immunoediting early in tumorigenesis remains obscure. Here, we employ a mathematical model that utilizes the Cancer Genome Atlas (TCGA) data to elucidate the contribution of individual mutations and HLA alleles to the immunoediting process. We find that common cancer mutations including BRAF-V600E and KRAS-G12D are predicted to bind none of the common HLA alleles, and are thus "immunogenically silent" in the human population. We identify regions of proteins that are not presented by HLA at a population scale, coinciding with frequently mutated hotspots in cancer, and other protein regions broadly presented across the population in which few mutations occur. We also find that 9/29 common HLA alleles contribute disproportionately to the immunoediting of early oncogenic mutations. These data provide insights into immune evasion of common driver mutations and a molecular basis for the association of particular HLA genotypes with cancer susceptibility.Entities:
Keywords: BRAF; HLA; KRAS; TP53; cancer susceptibility; immune evasion; immunoediting; neoantigens
Mesh:
Substances:
Year: 2020 PMID: 32256484 PMCID: PMC7092187 DOI: 10.3389/fimmu.2020.00069
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1Pipeline for generating immunogenicity map. TCGA mutations imputed by MuSE and Somatic Sniper were filtered for those arising from 125 characterized driver mutations (16). 17 mer peptides sequences were generated for all possible 26,361 variants arising from (covering nine potential epitopes for each variant) each variant/WT pair (~237,000 pairs) and analyzed for MHC binding affinity using NetMHC 4.0 across 84 HLA alleles resulting in 20e6 binding affinities. Peptide affinities were aggregated for all peptides arising from each variant and filtered for those that are strong binders (<0.5% rank), revealing 24,555 strong binders and 1,806 immunogenically silent mutations.
Figure 2Mutation frequency in TCGA inversely correlates with population-scale HLA binding. (A) Mutation frequency for 26,361 TCGA variants compared to the proportion of the population predicted to bind a given neoantigens derived from that mutation reveals immunogenically silent mutations in 1,806 variants (6.85% of mutation in the TCGA, including BRAF V600E and KRAS G12D highlighted) and enrichment of common driver mutations in those which are predicted to not bind any HLA alleles (p = 0.0004). (B) TP53 mutations are enriched in those that are immunogenically silent or bind HLA alleles in a small portion of the population, and mutations with high probability of binding at least one allele in a given patient are significantly underrepresented in the TCGA (p = 0.008). (C) Proportion of characterized neoantigens bound by individual HLA alleles reveals broad range in diversity of binding from 3.5% of neoantigens arising from driver genes bound by HLA-B*37:01 to 18.6% bound by HLA-B*83:01.
Figure 3HLA presentation score by protein region reveals unprotected domains and protein regions broadly applicable as cancer vaccine candidates. (A–C) Regional HLA presentation scores determined by calculating the fraction of the population capable of presenting a 9 mer epitope centered at each amino acid location in p53, PI3K, and BRAF (gray bars). Each protein contains domains that are presented on HLA in the majority of the population and other domains that are unprotected. Mutation frequency at each location overlaid (green lollipop plot) reveals that many common mutations are found in unprotected regions of the protein. (D) Overlay of mutated neoantigen presentation score with regional presentation score for p53 shows strong correlation between regional protection of WT protein (gray bars) with protection against mutated neoantigens (black lollipop plot). (E) Twenty nine peptides detected by ligandomics in 16 neuroblastoma tumors were mapped onto the HLA population presentation scores. Empirically detected peptides were highly enriched in high-scoring regions of the protein (p = 0.000011). (F) Analysis of MYCN HLA presentation across the span of the protein. Analysis of individual peptides (top) reveals the most highly presented peptide derived from MYCN, TVRPKNAAL, to be presented on 9 HLA alleles, representing 58.1% of the population. KATEYVHSL peptide, detected by ligandomics is predicted to be presented on 10 total HLA alleles (31.9% of the population). Analysis of 17 mer regions (middle) reveals a peptide LERQRRNDLRSSFLTLR generating peptides predicted to bind 19 HLA alleles (73.1% of the population). Analysis of 33 mer regions reveals the highest scoring peptide TVRPKNAALGPGRAQSSELILKRCLPIHQQHNY presented on 18 HLA alleles in 85.4% of the population, suggesting these as promising regions of the MYCN protein for broadly applicable vaccination.
Figure 4Immunoediting across HLA alleles. (A) Expected TCGA frequency for each allele as calculated from the Bone Marrow Registry and adjusted for ethnicity (red) compared to observed HLA allele frequency in subset of the TCGA population harboring strong neoantigen/HLA binding pairs (blue). To calculate observed HLA frequencies, predicted mutant binders are used to filter TCGA data and HLA frequencies are deduced from the subset of unique individuals. Decreased frequency in TCGA binders compared to expected frequency is a surrogate metric for early immunoediting events that are responsible for having cleared early tumors and are thus underrepresented in TCGA data. (B) Ratio of observed:predicted binders. Area below dotted line represents immunoediting frequency for each allele. Zero represents complete immunoediting while 1 represents no contribution of immunoediting by a particular HLA allele. (C) Binding affinity between HLA alleles found to participate in immunoediting of cancer neoantigens is stronger than non-contributing alleles (p = 0.015).