| Literature DB >> 31118084 |
Hanan Besser1,2, Sharon Yunger3, Efrat Merhavi-Shoham3, Cyrille J Cohen4, Yoram Louzoun5,6.
Abstract
BACKGROUND: Targeting epitopes derived from neo-antigens (or "neo-epitopes") represents a promising immunotherapy approach with limited off-target effects. However, most peptides predicted using MHC binding prediction algorithms do not induce a CD8 + T cell response, and there is a crucial need to refine the predictions to readily identify the best antigens that could mediate T-cell responses. Such a response requires a high enough number of epitopes bound to the target MHC. This number is correlated with both the peptide-MHC binding affinity and the number of peptides reaching the ER. Beyond this, the response may be affected by the properties of the neo-epitope mutated residues.Entities:
Keywords: MHC-binding peptides; Machine Learning; Neoantigen; T cell activation
Year: 2019 PMID: 31118084 PMCID: PMC6532181 DOI: 10.1186/s40425-019-0595-z
Source DB: PubMed Journal: J Immunother Cancer ISSN: 2051-1426 Impact factor: 13.751
Fig. 1Existing methods and proposed new classifier (a) Current approaches for neo-antigen detection involve three main stages: RNA sequencing, detection of mutations in tumor cells and the computation of MHC binding peptides in such mutated regions. We propose a new stage (b) the detection among the MHC binding peptides of those that manage to induce a T cell response
Summary of the datasets used
| Source | HLA | Total no. of predicted samples | Confirmed Positive samples | Confirmed Negative samples |
|---|---|---|---|---|
| Melanoma (Me.) | A*02:01 | 485 | 35 | 450 |
| Melanoma Patient 1 | A*02 B*18, B*35 C*07, C*05 | 187 | 7 | 180 |
| Melanoma Patient 2 | A*11, A*23 B*14, B*41 | 56 | 3 | 53 |
| Melanoma Patient 3 | A*02, A*24 B*15, B*38 | 68 | 3 | 65 |
| Tantigen [ | mix | 24 | 0 |
For each dataset the name of the dataset, the number of positives and negatives epitopes in the data, and the HLA composition of the data are presented. The Melanoma patients were used for validation of the model and the results
First column is the score name, second column is the description of the score
| Feature name | Description | Notes |
|---|---|---|
| Expression level | The average expression level by cell line in melanoma tissue |
|
| Size difference | The absolute difference in size between the W.T. amino acid and the mutant | By Dalton units |
| Hydrophobicity | The absolute difference in hydrophobicity index between the W.T. amino acid and the mutant | Kyte J, Doolittle RF |
| Charge difference | The absolute difference in charge between the W.T. amino acid and the mutant | Values at ph = 7.4 |
| Polar change | Categorical variable for the polarity change between the W.T. amino acid the mutant | Values at ph = 7.4 |
| Cleavage score | Estimated cleavage probability of a full peptide. | Vider et al. |
| Tap score | Estimated TAP binding energy | Peters et al. |
Third column is a description of the score and the reference for the score
Fig. 2a. -log 10 of p value for Kolmogorov Smirnov test for similarity between distribution of positive and negative peptides (peptides inducing and not inducing a T cell response). b. Average values for positive and negative groups of all measures with significant differences between groups. c. Histogram of sum of log expression, TAP binding score and cleavage score. One can clearly see a difference between the groups. d. Correlation heatmap of positive and negative groups for all measures. Only correlations with a p value below 0.005 were plotted Rows with no significant correlations were removed. The row and columns are the same properties
Fig. 3Subplots of ROC curves (a) Leave one out test for each one of the datasets. The AUC for the test on melanoma dataset is 0.86. b-d In the ROC curve for three different patients, the prediction was with the classifiers used to generate the test in (a). The horizonal dashed line in (a) indicates the threshold of 90% of the data to be true positive
Fig. 4Experimental validation of T cell response. TIL culture of patient 1 recognized 3 neoantigens, but not the corresponding wildtype peptides. Following pulsing with 10 μg/ml of 25-mer mutant or wt peptide overnight, EBV-transformed autologous B cells B-LCL were co-cultured with T-cells from TIL culture from patient 1. 16 h after the beginning of the co-culture, these cells were co-stained for CD137 (41BB) and CD8+ and analyzed by flow cytometry. The double positive population is indicated in quadrant Q2
Classifier properties
| ME | M.T. | Pat 1 | Pat. 2 | Pat. 3 | |
|---|---|---|---|---|---|
| AUC | 0.809 | 0.868 | 0.657 | 0.679 | 0.733 |
| Fraction of Negatives kept when loosing 50% of positives | 0.101 | 0.051 | 0.322 | 0.000 | 0.246 |
| Fraction of positives kept when loosing 50% of negatives | 0.865 | 0.914 | 0.714 | 0.660 | 1.000 |
| Accuracy | 0.753 | 0.810 | 0.662 | 0.830 | 0.831 |
| F1 | 0.391 | 0.331 | 0.207 | 0.795 | 0.214 |
For each dataset we provide, the AUC and F1 value as well as the max accuracy on the test set. Similarly, we provide the fraction of negatives maintained when keeping 50% of negative and the fraction of positives when removing 50% of negatives