| Literature DB >> 32615972 |
Johanna Vielhaben1, Markus Wenzel1, Wojciech Samek1, Nils Strodthoff2.
Abstract
BACKGROUND: Immunotherapy is a promising route towards personalized cancer treatment. A key algorithmic challenge in this process is to decide if a given peptide (neoepitope) binds with the major histocompatibility complex (MHC). This is an active area of research and there are many MHC binding prediction algorithms that can predict the MHC binding affinity for a given peptide to a high degree of accuracy. However, most of the state-of-the-art approaches make use of complicated training and model selection procedures, are restricted to peptides of a certain length and/or rely on heuristics.Entities:
Keywords: Binding affinity prediction; Language modeling; Major histocompatibility complex; Peptide data; Recurrent neural networks
Mesh:
Substances:
Year: 2020 PMID: 32615972 PMCID: PMC7330990 DOI: 10.1186/s12859-020-03631-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison of MHC I prediction tools
| SMMPMBEC [ | One-hot encoding, linear model (scoring matrix) |
| consensus [ | Linear model (scoring matrix), median rank as prediction |
| NetMHC4 [ | Input: 9mer fixed length blocks substitution matrix (BLOSUM) encoding plus additional features; multilayer perceptron with one hidden layer |
| NetMHCpan4 [ | Input: 9mer fixed length BLOSUM encoding for peptide, pseudo-sequence for MHC molecule plus additional features; multilayer perceptron with one hidden layer |
| MHCFlurry [ | Input: 15mer fixed length BLOSUM62 encoding, missing residues filled with wildcard amino acid (AA); feedforward neural network (NN) with 0 to 2 locally connected and one fully connected hidden layer |
| Learned embedding layer; AWD LSTM with one hidden layer | |
| SMMPMBEC | Ridge regression with modified regularization, peptide MHC binding energy covariance (PMBEC) similarity matrix as Bayesian prior |
| consensus | Four scoring matrices from existing algorithms |
| NetMHC4 | Training on non 9mer peptides by insertion of wildcard AA or deletion at all possible positions; augmented training set with natural peptides for each length assumed to be negative |
| NetMHCpan4 | Same insertion/ deletion procedure as NetMHC4; augmented training set with random artificial negatives |
| MHCFlurry | Pretraining on BLOSUM62 similar allele for alleles with little training data; augmented training set with artificial negative peptides |
| Optional: language model pretraining on unlabeled sequences | |
| SMMPMBEC | Single model |
| consensus | Single model |
| NetMHC4 | Ensemble of 4 NNs |
| NetMHCpan4 | Ensemble of 100 NNs |
| MHCFlurry | Ensemble of 8-16 NNs selected from 320 models on a validation set |
| Optional: ensemble of 10 NNs with identical architectures and hyperparameters | |
Fig. 1Schematic representation of the model architecture
Details of training and test datasets
| Dataset | Usage | Total size | Share of binders | # alleles | Median size | Share of quant. meas. | Sequence length |
|---|---|---|---|---|---|---|---|
| MHC class I | |||||||
| train | 117326 | 0.25 | 53 | 1971 | 0.58 | 8–11 | |
| test | 27680 | 0.33 | 53 | 470 | 0.58 | 8–11 | |
| train | 120720 | 0.25 | 32 | 3659 | 0.68 | 8–15 | |
| test | 2827 | 0.54 | 32 | 73 | 1.0 | 9 | |
| train | 68117 | 0.26 | 7 | 6884 | 0.64 | 8–15 | |
| test | 743 | 0.34 | 7 | 125 | 0.37 | 8–11 | |
| MHC class II | |||||||
| train | 23203 | 0.37 | 24 | 999 | 1.0 | 15–37 | |
| test | 15691 | 0.33 | 24 | 641 | 1.0 | 15 | |
The threshold for MHC class I binders is 500nM, except for the HPV dataset, where the threshold is 100 000nM. For MHC class II binders, the threshold is 1000nM
Fig. 2Comparison of MHC class I predictors. AUC ROC and Spearman r are evaluated on predictions for the IEDB16_I test set (AUC ROC could not be evaluated for alleles HLA-B-2704, HLA-B-1503 and HLA-B-1501, whereas Spearman r could not be computed for alleles HLA-B-1503 and HLA-B-1501. These alleles are therefore not included in the scores.)
Benchmarking MHC class I predictors on recently published binding affinity data (HPV16), see also Table 4 for allele-wise scores
| allele | ||
|---|---|---|
| USMPep_FS_ens | 0.824(3) | 0.814(3) |
| USMPep_FS_sng | 0.818(4) | 0.808(4) |
| USMPep_LM_ens | ||
| USMPep_LM_sng | 0.813(5) | 0.802(4) |
| MHCFlurry | 0.817(4) | 0.809(4) |
| NetMHC 3.4 | 0.794(3) | |
| NetMHC 4.0 | 0.803(4) | 0.780(3) |
| NetMHCpan 2.8 | 0.818(4) | 0.792(3) |
| NetMHCpan 3.0 | 0.815(4) | 0.787(3) |
| NetMHCpan 4.0 | 0.820(3) | 0.792(4) |
| SMM | 0.684(5) | 0.695(4) |
| SMMPMBEC | 0.722(6) | 0.723(4) |
| Pickpocket 1.1 | 0.760(5) | 0.708(4) |
| consensus | 0.751(5) | 0.766(4) |
| IEDB recommended | 0.756(5) | 0.772(4) |
| NetMHCcons 1.1 | 0.827(4) | 0.799(3) |
Predictive performance is evaluated by AUC ROC (threshold for binders < 100 000nM) on single alleles and across all alleles (mean and overall). The scores for literature approaches were calculated based on peptide-wise predictions provided in [12]. Numbers in brackets in the table concisely denote the corresponding bootstrap confidence intervals. For instance, 0.824(3) stands for a mean AUC ROC of 0.824 ± 0.003
Allele-wise results on (HPV16), see also Table 3 for mean and overall scores
| Allele | HLAA1 | HLAA11 | HLAA2 | HLAA24 | HLAA3 | HLAB15 | HLAB7 |
|---|---|---|---|---|---|---|---|
| USMPep_FS_ens | 0.793 | 0.830 | 0.807 | 0.768 | 0.803 | 0.884 | |
| USMPep_FS_sng | 0.785 | 0.883 | 0.822 | 0.798 | 0.764 | 0.799 | 0.883 |
| USMPep_LM_ens | 0.880 | 0.809 | 0.766 | 0.824 | 0.871 | ||
| USMPep_LM_sng | 0.813 | 0.869 | 0.805 | 0.802 | 0.755 | 0.805 | 0.854 |
| MHCFlurry | 0.816 | 0.850 | 0.755 | 0.793 | 0.797 | 0.867 | |
| NetMHC 3.4 | 0.841 | 0.867 | 0.793 | 0.765 | 0.825 | 0.884 | |
| NetMHC 4.0 | 0.823 | 0.855 | 0.792 | 0.730 | 0.779 | 0.825 | 0.801 |
| NetMHCpan 2.8 | 0.756 | 0.863 | 0.787 | 0.778 | 0.794 | 0.857 | 0.880 |
| NetMHCpan 3.0 | 0.841 | 0.848 | 0.781 | 0.739 | 0.778 | 0.876 | 0.825 |
| NetMHCpan 4.0 | 0.839 | 0.854 | 0.805 | 0.742 | 0.784 | 0.836 | |
| SMM | 0.476 | 0.828 | 0.730 | 0.643 | 0.788 | 0.704 | 0.646 |
| SMMPMBEC | 0.593 | 0.846 | 0.777 | 0.639 | 0.799 | 0.716 | 0.670 |
| Pickpocket 1.1 | 0.744 | 0.773 | 0.757 | 0.709 | 0.731 | 0.808 | 0.802 |
| consensus | 0.570 | 0.870 | 0.772 | 0.687 | 0.767 | 0.832 | 0.756 |
| IEDB recommended | 0.566 | 0.877 | 0.769 | 0.702 | 0.772 | 0.852 | 0.755 |
| NetMHCcons 1.1 | 0.807 | 0.872 | 0.797 | 0.777 | 0.819 | 0.847 |
Fig. 3Evaluating MHC class I predictors on recently published binding affinity data (HPV16) grouped by peptide length. Predictive performance is evaluated by mean AUC ROC. For allele-wise and overall performance comparisons see Tables 4 and 3. Percentages in brackets indicate the proportion of peptides of that particular length. For example, bars at 8(15%) show results for peptides that are eight amino acids long, which had a share of 15% of all peptides in total
Fig. 4Performance of USMPep and MHCFlurry on MHC class I binding prediction. Both models were trained on the Kim14BD2009 data AUC ROC and Spearman r were evaluated on the predictions for the Blind test set (AUC ROC could not be evaluated for allele HLA-B-4601, whereas Spearman r could not be computed for allele HLA-B-4601 and HLA-B-2703. These alleles are therefore not included in the scores.)
Fig. 5Performance of our MHC I prediction tools compared to MHCFlurry on single alleles. Spearman r was calculated for predictions on the Kim14Blind data for alleles with more than 25 quantitative measurements. The predictors were trained on Kim14BD2009. The alleles are ranked by the size of the corresponding training set (9528 peptides for rank 0 to 136 peptides with rank 52). No MHCFlurry predictors were provided for alleles HLA-B-2703, HLA-B-0803 and HLA-B-3801 with rank 45, 49 and 52
Fig. 6Comparison of MHC class II predictors. AUC ROC and Spearman r were evaluated on predictions for the IEDB16_II test set
Language model and MHC class I binding affinity prediction performance
| Model | LM | Downstream ( | ||
|---|---|---|---|---|
| perpl. | acc. | AUC ROC | Spearman | |
| LM (protein) | 39.3 | 0.083 | 0.90(2) | 0.55(4) |
| LM (peptide) | 13.4 | 0.206 | 0.89(2) | 0.57(4) |
| From scratch | – | – | 0.89(2) | 0.55(3) |
Language model metrics perplexity (perpl.) and accuracy (acc.) were in all cases evaluated on peptide data. The downstream performance corresponds to an ensemble of 10 predictors trained on the MHCFlurry18 and evaluated on the IEDB16_I test set
Performance summary: Rank of USMPep compared to competitors across the different datasets
| Dataset | ||||
|---|---|---|---|---|
| AUC ROC | Spearman | AUC ROC | Spearman | |
| MHC class I | ||||
| IEDB16_I | 3 rd | 3 rd | ||
| HPV | – | – | ||
| Kim14 | ||||
| MHC class II | ||||
| IEDB16_II | 4th | 4th | 3 rd | 4th |
Scores marked in bold face are best-performing or consistent with the best-performing result within error bars