| Literature DB >> 33235258 |
Marek Prachar1,2,3, Sune Justesen3, Daniel Bisgaard Steen-Jensen3, Stephan Thorgrimsen3, Erik Jurgons4, Ole Winther1,2,5, Frederik Otzen Bagger6,7,8.
Abstract
The outbreak of SARS-CoV-2 (2019-nCoV) virus has highlighted the need for fast and efficacious vaccine development. Stimulation of a proper immune response that leads to protection is highly dependent on presentation of epitopes to circulating T-cells via the HLA complex. SARS-CoV-2 is a large RNA virus and testing of all of its overlapping peptides in vitro to deconvolute an immune response is not feasible. Therefore HLA-binding prediction tools are often used to narrow down the number of peptides to test. We tested NetMHC suite tools' predictions by using an in vitro peptide-MHC stability assay. We assessed 777 peptides that were predicted to be good binders across 11 MHC alleles in a complex-stability assay and tested a selection of 19 epitope-HLA-binding prediction tools against the assay. In this investigation of potential SARS-CoV-2 epitopes we found that current prediction tools vary in performance when assessing binding stability, and they are highly dependent on the MHC allele in question. Designing a COVID-19 vaccine where only a few epitope targets are included is therefore a very challenging task. Here, we present 174 SARS-CoV-2 epitopes with high prediction binding scores, validated to bind stably to 11 HLA alleles. Our findings may contribute to the design of an efficacious vaccine against COVID-19.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33235258 PMCID: PMC7686376 DOI: 10.1038/s41598-020-77466-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Current best-performing or novel HLA prediction tools[10].
| HLA | Tool | Alleles available* | Year | Algorithm | Web server | References |
|---|---|---|---|---|---|---|
| Class I | NetMHC 4.0 | 9/10 | 2003 | NN | Yes | [ |
| IEDB-AR Consensus | 9/10 | 2006 | Cons | Yes | [ | |
| ConvMHC | 6/10 | 2017 | NN | Yes | [ | |
| DeepHLAPan | 10/10 | 2019 | NN | Yes | [ | |
| HLAthena | 10/10 | 2020 | NN | Yes | [ | |
| MixMHCpred 2.0.2 | 10/10 | 2017 | PSSM | No | [ | |
| MHCFlurry 1.3.0 | 8/10 | 2018 | NN | No | [ | |
| NetMHCcons 1.1 | 10/10 | 2012 | Cons | Yes | [ | |
| NetMHCpan_BA 4.0 | 10/10 | 2017 | NN | Yes | [ | |
| NetMHCpan_EL 4.0 | 10/10 | 2017 | NN | Yes | [ | |
| NetMHCstab 1.0 | 10/10 | 2014 | NN | Yes | [ | |
| PickPocket 1.1 | 10/10 | 2009 | PSSM | Yes | [ | |
| PSSMHCpan 1.0 | 10/10 | 2017 | PSSM | No | [ | |
| SMM 1.0 | 8/10 | 2005 | PSSM | Yes | [ | |
| SMMPMBEC 1.0 | 8/10 | 2009 | PSSM | Yes | [ | |
| Class II | IEDB-AR Consensus | 1/1 | 2008 | Cons | Yes | [ |
| NetMHCIIpan 3.2 | 1/1 | 2018 | NN | Yes | [ | |
| NN_Align 2.3 | 1/1 | 2018 | NN | Yes | [ | |
| SMM-align 1.1 | 1/1 | 2007 | PSSM | Yes | [ | |
| Sturniolo 1.0 | 1/1 | 1999 | PSSM | Yes | [ |
Webservers checked on 2 March 2020, NN: Neural network, Cons: Consensus, PSSM: Position specific scoring matrix.
*Availability as a fraction of alleles included in study (10 HLA class I and 1 of HLA class II).
Figure 1ROC curves for each allele that bound more than 10 peptides stably (subplots A, B, C, D, E, F, G), (H) tools used in the benchmark, upper box—HLA class I, lower box—HLA class II (IEDB-AR Consensus is available for both), (I) precision-recall curves for HLA-A*02:01. Corresponding area under curve (AUC) values are listed in Table 2.
AUC values for ROC curves from Fig. 1 for alleles with more than 10 stable complexes.
| Tool/allele | A*01:01 | A*02:01 | A*03:01 | A*11:01 | A*24:02 | B*40:01 | DRB1*04:01 |
|---|---|---|---|---|---|---|---|
| NetMHC 4.0 | 70.3 | 77.06 | 83.81 | 82.96 | 89.76 | – | |
| IEDB-AR Consensus | 97.06 | 69.44 | 77.36 | 87.05 | 83.89 | 90.03 | |
| ConvMHC | 85.13 | 47.71 | 72.53 | 76.33 | 66.88 | 79.80 | – |
| DeepHLAPan | 91.82 | 62.90 | 52.49 | 80.84 | 69.87 | 80.95 | – |
| HLAthena | 89.74 | 65.41 | 66.54 | 87.41 | 76.48 | 83.08 | – |
| MixMHCpred 2.0.2 | 92.54 | 70.04 | 75.29 | 80.82 | 78.68 | 76.29 | – |
| MHCFlurry 1.3.0 | 94.48 | 66.88 | 75.52 | 88.46 | 76.24 | – | |
| NetMHCcons 1.1 | 97.42 | 65.93 | 86.21 | 79.52 | 90.25 | – | |
| NetMHCpan_BA 4.0 | 97.15 | 65.84 | 76.32 | 86.76 | 85.40 | 88.79 | – |
| NetMHCpan_EL 4.0 | 93.40 | 75.89 | 75.84 | 80.11 | 78.36 | 84.27 | – |
| NetMHCstab 1.0 | 89.15 | 76.75 | 77.98 | 86.13 | – | ||
| PickPocket 1.1 | 88.65 | 57.32 | 65.53 | 75.03 | 80.93 | 88.44 | – |
| PrdX 1.0 | – | – | – | – | – | – | |
| PSSMHCpan 1.0 | 90.33 | 67.97 | 75.75 | 76.62 | 82.12 | 77.94 | – |
| SMM 1.0 | 95.25 | 60.09 | 75.15 | 87.26 | 80.26 | 88.82 | – |
| SMMPMBEC 1.0 | 92.13 | 60.48 | 75.36 | 87.26 | 80.13 | 86.34 | – |
| NetMHCIIpan 3.2 | – | – | – | – | – | – | 76.63 |
| NN_Align 2.3 | – | – | – | – | – | – | 78.14 |
| SMM-align 1.1 | – | – | – | – | – | – | 74.42 |
| Sturniolo 1.0 | – | – | – | – | – | – | 75.19 |
First 6 six columns contain HLA class I, last column contains HLA class II. Only four tools were tested for HLA class II. PrdX 1.0 is only available for allele A*02:01. Highest value for each allele is marked in bold.
Figure 2Plot of Spearman correlation coefficient between predicted values and results of NeoScreen stability assay for each available allele. Each colour represents an individual allele. Boxplot whiskers accord for 1.5 distance between the median and quartile hinges.