| Literature DB >> 31775766 |
Eric Blanc1,2, Manuel Holtgrewe1,2, Arunraj Dhamodaran3, Clemens Messerschmidt1,2, Gerald Willimsky4,5,6, Thomas Blankenstein4,3,5, Dieter Beule7,8.
Abstract
BACKGROUND: Immune escape is one of the hallmarks of cancer and several new treatment approaches attempt to modulate and restore the immune system's capability to target cancer cells. At the heart of the immune recognition process lies antigen presentation from somatic mutations. These neo-epitopes are emerging as attractive targets for cancer immunotherapy and new strategies for rapid identification of relevant candidates have become a priority.Entities:
Keywords: Cancer; Immunotherapy; Neo-antigen; Neo-epitope; Precision treatment
Mesh:
Substances:
Year: 2019 PMID: 31775766 PMCID: PMC6882202 DOI: 10.1186/s12920-019-0611-7
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Workflow overview a Overview of the recurrent neo-epitope candidates generation process: TCGA studies are selected for at least 100 donors with clinical annotations. For each of these studies, recurrent strongly supported missense Single-Nucleotide Variants are collected. Neo-epitopes binding to 11 HLA-1 types are predicted, redundancy is removed from that set (see B) and strong binders are retained. b Example of epitope redundancy: the 18 amino-acids long sequence surrounding recurrent variant GLRA3:S274L generates 7 binding neo-epitopes for the type HLA-A*02:01. Our pipeline retains only the strongest predicted binder for a given variant and HLA-1 type pair (the first, with an IC50 of 8.8 nM in the example). c Number of SNVs occuring in genes classified as Oncogenes or Tumor Suppressors by Vogelstein et al. [28], at various point of the variant selection and neo-epitope selection process
Overview of the 33 TCGA studies used in this analysis
| Project name | Number of patients | Variants per patients | Missense variant per patient | Recurrent variants | Strong binders | |||
|---|---|---|---|---|---|---|---|---|
| Total | With clinical data | Average | Median | Average | Median | |||
| TCGA-BLCA | 412 | 412 | 326 | 226 | 157 | 109 | 22 | 14 |
| TCGA-BRCA | 986 | 986 | 123 | 62 | 50 | 25 | 8 | 10 |
| TCGA-CESC | 289 | 289 | 358 | 157 | 143 | 62 | 17 | 16 |
| TCGA-COAD | 399 | 397 | 666 | 176 | 288 | 82 | 41 | 34 |
| TCGA-ESCA | 184 | 184 | 246 | 187 | 95 | 73 | 80 | 72 |
| TCGA-GBM | 393 | 390 | 212 | 70 | 93 | 36 | 15 | 5 |
| TCGA-HNSC | 508 | 508 | 201 | 139 | 97 | 66 | 14 | 3 |
| TCGA-KIRC | 336 | 336 | 79 | 69 | 33 | 31 | 0 | 0 |
| TCGA-KIRP | 281 | 281 | 85 | 82 | 39 | 38 | 5 | 2 |
| TCGA-LAML | 143 | 143 | 69 | 15 | 16 | 6 | 14 | 7 |
| TCGA-LGG | 508 | 507 | 70 | 36 | 33 | 16 | 14 | 0 |
| TCGA-LIHC | 364 | 364 | 149 | 120 | 70 | 58 | 11 | 16 |
| TCGA-LUAD | 567 | 515 | 367 | 242 | 180 | 113 | 7 | 0 |
| TCGA-LUSC | 492 | 492 | 368 | 301 | 187 | 153 | 20 | 19 |
| TCGA-OV | 436 | 435 | 173 | 121 | 58 | 47 | 10 | 7 |
| TCGA-PAAD | 178 | 178 | 168 | 50 | 77 | 19 | 24 | 12 |
| TCGA-PCPG | 179 | 179 | 13 | 12 | 5 | 4 | 8 | 5 |
| TCGA-PRAD | 495 | 495 | 59 | 35 | 27 | 15 | 3 | 7 |
| TCGA-READ | 137 | 136 | 475 | 148 | 232 | 70 | 320 | 186 |
| TCGA-SARC | 237 | 237 | 119 | 70 | 45 | 26 | 2 | 0 |
| TCGA-SKCM | 467 | 467 | 841 | 472 | 413 | 229 | 266 | 220 |
| TCGA-STAD | 437 | 437 | 488 | 157 | 211 | 74 | 17 | 14 |
| TCGA-TGCT | 144 | 128 | 23 | 21 | 9 | 8 | 9 | 6 |
| TCGA-THCA | 492 | 492 | 22 | 12 | 6 | 5 | 4 | 3 |
| TCGA-THYM | 123 | 123 | 39 | 24 | 10 | 4 | 6 | 2 |
| TCGA-UCEC | 530 | 530 | 1672 | 149 | 708 | 54 | 118 | 109 |
| TCGA-ACC | 92 | 92 | 117 | 36 | 0 | 0 | 0 | 0 |
| TCGA-CHOL | 51 | 45 | 110 | 62 | 0 | 0 | 0 | 0 |
| TCGA-DLBC | 37 | 37 | 173 | 157 | 0 | 0 | 0 | 0 |
| TCGA-KICH | 66 | 66 | 44 | 25 | 0 | 0 | 0 | 0 |
| TCGA-MESO | 82 | 82 | 47 | 44 | 0 | 0 | 0 | 0 |
| TCGA-UCS | 57 | 57 | 183 | 67 | 0 | 0 | 0 | 0 |
| TCGA-UVM | 80 | 80 | 23 | 16 | 0 | 0 | 0 | 0 |
| Total | 10182 | 10100 | Total number: 3155183 | Total number: 1384531 | 1055 | 769 | ||
The 7 studies displayed at the bottom have not been used for the determination of recurrent vairants, as the number of patients is less than 100. The number of strong binders includes all occurrences of neo-epitopes candidates, so a candidate may be counted multiple times when it is predicted to be binding several HLA-1 types
Expected number of newly diagnosed U.S. patients by HLA-1 type and cancer entity
| A) | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cancer entity | Study | Number of patients | HLA-A*01:01 (7.61%) | HLA-A*02:01 (20.36%) | HLA-A*03:01 (6.60%) | HLA-A*11:01 (4.37%) | HLA-B*07:02 (6.51%) | HLA-B*08:01 (4.80%) | HLA-B*15:01 (4.46%) | HLA-C*04:01 (16.69%) | HLA-C*06:02 (5.72%) | HLA-C*07:01 (9.28%) | HLA-C*07:02 (15.39%) |
| Bladder Urothelial Carcinoma | BLCA | 69300 | 340 | 638 | 197 | 152 | 142 | 112 | 89 | 223 | 376 | 596 | 383 |
| Invasive Breast Carcinoma | BRCA | 204800 | 110 | 1090 | 461 | 199 | 108 | 375 | 120 | 482 | 1411 | 2305 | 1234 |
| Cervical Squamous Cell Carcinoma | CESC | 14000 | 58 | 135 | 16 | 17 | 13 | 14 | 23 | 142 | 27 | 66 | 152 |
| Colon Adenocarcinoma | COAD | 154840 | 861 | 1628 | 770 | 760 | 450 | 222 | 375 | 2961 | 1019 | 1620 | 1800 |
| Esophageal Adenocarcinoma | ESCA | 4750 | 23 | 151 | 26 | 16 | 20 | 5 | 13 | 101 | 23 | 26 | 83 |
| Glioblastoma Multiforme | GBM | 3204 | 7 | 17 | 3 | 7 | 0 | 0 | 3 | 8 | 5 | 3 | 3 |
| Head & Neck Squamous Cell Carcinoma | HNSC | 58000 | 43 | 208 | 112 | 45 | 81 | 11 | 81 | 301 | 136 | 220 | 175 |
| Renal Clear Cell Carcinoma | KIRC | 57600 | 13 | 70 | 23 | 0 | 0 | 8 | 0 | 0 | 10 | 0 | 26 |
| Papilliary Renal Cell Carcinoma | KIRP | 8064 | 0 | 6 | 0 | 3 | 0 | 0 | 0 | 29 | 0 | 0 | 0 |
| Acute Myeloid Leukemia | LAML | 13500 | 77 | 115 | 0 | 8 | 49 | 0 | 0 | 0 | 48 | 9 | 73 |
| Hepatocellular Carcinoma | LIHC | 29700 | 12 | 496 | 131 | 84 | 69 | 12 | 7 | 68 | 69 | 112 | 174 |
| Lung Squamous Cell Carcinoma | LUSC | 66000 | 181 | 642 | 328 | 168 | 282 | 13 | 77 | 593 | 151 | 172 | 366 |
| Serous Ovarian Cancer | OV | 16800 | 26 | 24 | 33 | 10 | 15 | 2 | 14 | 39 | 9 | 14 | 36 |
| Prostate Adenocarcinoma | PRAD | 260000 | 120 | 852 | 69 | 69 | 34 | 50 | 70 | 175 | 387 | 628 | 1202 |
| Melanoma | SKCM | 75000 | 2649 | 7890 | 1817 | 936 | 861 | 530 | 203 | 2186 | 438 | 1000 | 2457 |
| Stomach Adenocarcinoma | STAD | 25000 | 47 | 172 | 56 | 66 | 26 | 8 | 30 | 206 | 114 | 166 | 130 |
| Thyroid Cancer | THCA | 46400 | 394 | 0 | 0 | 0 | 0 | 14 | 0 | 0 | 0 | 0 | 44 |
| Endometrial Carcinoma | UCEC | 55000 | 942 | 2804 | 817 | 501 | 369 | 290 | 493 | 2222 | 856 | 1602 | 1779 |
| Total | 1161958 | 5904 | 16936 | 4858 | 3033 | 2517 | 1666 | 1598 | 9736 | 5080 | 8539 | 10116 | |
| B) | |||||||||||||
| Number of candidates in diseases | 55 | 91 | 68 | 64 | 33 | 24 | 24 | 48 | 48 | 50 | 55 |
A) Expected number of patients of a given HLA-1 type who harbor at least one potentially immunogenic neo-epitope candidate for that HLA-1 type. Both the cancer incidence and the allele frequency are estimated for the U.S. population. The probability that a patient carries at least one variant from the set of neo-epitope candidates is computed under the assumption that the occurrence of variants in a cancer patient stems from statistically independent events. B) Number of neo-epitope candidates identified in the 18 studies shown in A, which are predicted to be strong binders to the corresponding HLA-1 type
Fig. 250 most frequent candidates in patients for which strong MHC I binding is predicted. For each candidate, the expected number of patients is obtained by summing over the 18 cancer entities for which the number of newly diagnosed patients in the US is available, and for which a corresponding TCGA study has been included in our analysis
Fig. 3Expected influence of the proportion of false positive neo-epitope candidates on the patient population. Proportion of the patients that carry at least one neo-epitope candidate mutation, and whose HLA-I allele set contains the candidate HLA type, when a limited percentage of the neo-epitope candidates is considered. The patient cohort considered here consists of 6868 patients from the 18 TCGA cohorts for whom the HLA types are known. For each false positive proportion, the false positive candidates have been selected 1000 times at random
Fig. 4Recognition of predicted epitopes by CD8 + T cells. Epitopes for recurrent mutations that have been identified in silico to bind to HLA-A*02:01 using our pipeline were synthesized and used for immunization of human TCR transgenic ABabDII mice. Examples (RAC1:P29S and TRRAP:S722F) of ex vivo ICS analysis of mutant peptide immunized ABabDII mice 7 days after the last immunization are shown. Polyclonal stimulation with CD3/CD28 dynabeads was used as positive control, stimulation with an irrelevant peptide served as negative control (data not shown)