| Literature DB >> 32087680 |
Daniel Lule Bugembe1, Andrew Obuku Ekii2, Nicaise Ndembi3, Jennifer Serwanga2,4, Pontiano Kaleebu2,4, Pietro Pala2.
Abstract
BACKGROUND: Identifying immunogens that induce HIV-1-specific immune responses is a lengthy process that can benefit from computational methods, which predict T-cell epitopes for various HLA types.Entities:
Keywords: Artificial neural network; Epitope mapping; HIV-1; In-silico; MHCflurry1.2.0 and NetCTL1.2; NetMHCpan4.0.; T-cell
Mesh:
Substances:
Year: 2020 PMID: 32087680 PMCID: PMC7036183 DOI: 10.1186/s12879-020-4876-4
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.090
Participant characteristics, HIV-1 infecting clade, Fiebig stage and HLA class I haplotypes
| Subject | Sex | Age range (years) | HIV-1 subtype | Class-I HLA | Early Time Point (Days) | Fiebig Staging | Late Time Point (Days) |
|---|---|---|---|---|---|---|---|
| 91 | M | 31–40 | A | A*0201,*0301;B*5301,*5802;Cw*0401,*0602 | 121 | VI | 841 |
| 92 | F | 21–30 | D | A*0201,*3002;B*4403,*1402Cw*0401,*0802 | 52 | VI | 743 |
| 94 | M | 51–60 | A | A*3402,*7401;B*4403,*5802;Cw*0401,*0602 | 28 | V | 358 |
| 95 | M | 21–30 | A | A*2301,*7401;B*4403,*1510;Cw*0401,*1601 | 30 | VI | 570 |
| 913 | F | 11–20 | D | A*0201,*3402;B*4501,*4701;Cw*0602,*1601 | 61 | VI | 211 |
| 914 | F | 21–30 | D | A*0101,*0201;B*0702,*4415;Cw*0407,*0702 | 31 | IV | 181 |
Fig. 1ELISPOT peptide consort; the experimental peptide mapping data was generated by culture ELISPOT of multiple peptide pools tested in duplicate wells per time point, followed by ex-vivo ELISPOT of potential candidate epitopes. To experimentally map a single time point required at least 541 assay wells
Fig. 2NetMHCpan Binder Predictions. a Using our experimental peptide sequences as inputs into NetMHCpan4.0 to predict epitopes for 22 HLA types represented in the 6 HIV-1 Infected people, a heatmap showing absolute counts of computationally predicted 9-mer binders against HIV-1 genes was constructed. The dendrogram shows the nearest similarity for the number of predicted counts across HLA types; b the length of the HIV-1 protein sequence plotted against the absolute number of NetMHCpan4.0 predicted 9mer binders showing a positive correlation (Spearman’s correlation coefficient, rs = 0.88). The number of distinct predictions is dependent on the length of the HIV-1 sequence; c comparison of HIV-1 clade A and D absolute number of NetMHCpan4.0 predicted 9mer binders per HIV-1 gene for the wet experiment test peptide sequences. The algorithm predicted more binders for clade D than clade A
Experimentally Mapped Peptides and Computationally Predicted Epitopes
| ID | Participant’s HLA Types | Hit No | Screening Peptide | Screening Peptide HIV-1 Clade | NetMHCpan4.0 9-mer Epitope Prediction | NetMHCpan4.0 9-mer HLA Prediction | NetMHCpan4.0% Rank | MHCflurry 9-mer Epitope prediction | MHCflurry 9-mer HLA prediction | MHCflurry affinity (μm) | NetCTL 9-mer Epitope prediction |
|---|---|---|---|---|---|---|---|---|---|---|---|
| E91 | A*02:01 | 1 | FITKGLGISYGRKKRRQR | D | GLGISYGRK | A*0301 | 0.50 | GLGISYGRK | A*0201 | 0.47 | |
| A*03:01 | 2 | HPKVSSEVHIPLGDARLV | A1 | IPLGDARLV | B*53:01 | 0.70 | IPLGDARLV | A*0302 | 24.63 | ||
| B*53:01 | HPKVSSEVHIPLGDARLV | A1 | KVSSEVHIP | B*58:02 | 0.60 | KVSSEVHIP | B*53:01 | 26.18 | |||
| B*58:02 | HPKVSSEVHIPLGDARLV | A1 | SSEVHIPLG | B*58:02 | 0.60 | ||||||
| Cw*04:01 | HPKVSSEVHIPLGDARLV | A1 | VSSEVHIPL | Cw*04:01 | 0.90 | VSSEVHIPL | Cw*04:01 | 18.48 | |||
| Cw*06:02 | HPKVSSEVHIPLGDARLV | A1 | HPKVSSEVH | B*53:01 | 1.20 | HPKVSSEVH | Cw*06:02 | 28.58 | |||
| E92 | A*02:01 | 3 | RKQNPEIVIYQYMDDLYV | D | YQYMDDLYV | A*02:01 | 0.15 | YQYMDDLYV | A*02:01 | 0.25 | |
| A*30:02 | RKQNPEIVIYQYMDDLYV | D | NPEIVIYQY | B*44:03 | 0.60 | NPEIVIYQY | A*30:02 | 4.94 | |||
| B*44:03 | RKQNPEIVIYQYMDDLYV | D | YQYMDDLYV | A*02:01 | 1.80 | YQYMDDLYV | B*44:03 | 8.44 | |||
| B*14:02 | RKQNPEIVIYQYMDDLYV | D | YQYMDDLYV | Cw*04:01 | 0.90 | YQYMDDLYV | Cw*04:01 | 1.52 | |||
| Cw*04:01 | RKQNPEIVIYQYMDDLYV | D | YQYMDDLYV | Cw*08:02 | 1.20 | YQYMDDLYV | Cw*08:02 | 13.26 | |||
| Cw*08:02 | RKQNPEIVIYQYMDDLYV | D | VIYQYMDDL | A*30:02 | 0.60 | VIYQYMDDL | A*30:02 | 2.52 | |||
| 4 | ELNKRTQDFWEVQLGIPH | A1 | TQDFWEVQL | Cw*08:02 | 0.40 | YQYMDDLYV | Cw*08:02 | 3.09 | |||
| ELNKRTQDFWEVQLGIPH | A1 | TQDFWEVQL | A*02:01 | 0.03 | TQDFWEVQL | A*02:01 | 19.72 | ||||
| ELNKRTQDFWEVQLGIPH | A1 | TQDFWEVQL | Cw*08:02 | 1.50 | TQDFWEVQL | Cw*08:02 | 3.09 | ||||
| ELNKRTQDFWEVQLGIPH | A1 | ELNKRTQDF | B*44:03 | 0.60 | ELNKRTQDF | B*44:03 | 26.56 | ||||
| 5 | NDIQKLVGKLNWASQIYP | D | KLNWASQIY | A*30:02 | 0.50 | KLNWASQIY | A*30:02 | 0.02 | |||
| NDIQKLVGKLNWASQIYP | D | KLVGKLNWA | A*02:01 | 0.47 | KLVGKLNWA | A*02:01 | 0.32 | ||||
| NDIQKLVGKLNWASQIYP | D | KLNWASQIY | Cw*04:01 | 1.80 | KLNWASQIY | Cw*04:01 | 2.34 | ||||
| 6 | PAIQTGSEELRSLYNTVA | D | GSEELRSLY | A*30:02 | 0.40 | GSEELRSLY | A*30:02 | 0.12 | |||
| PAIQTGSEELRSLYNTVA | D | SEELRSLY | B*44:03 | 0.50 | SEELRSLY | B*44:03 | 16.25 | ||||
| 7 | PYNTPIFAIKKKDSTKWR | A1 | PYNTPIFAI | Cw*04:01 | 1.14 | PYNTPIFAI | Cw*04:01 | 32.80 | |||
| 8 | GANNSHNETFRPGGGDMR | D | TFRPGGGDM | Cw*04:01 | 1.60 | TFRPGGGDM | Cw*04:01 | 1.15 | |||
| 9 | GMDGPKVKQWPLTEEKIK | A1 | MDGPKVKQW | B*44:03 | 0.50 | MDGPKVKQW | B*44:03 | 1.37 | |||
| 10 | PLTSLKSLFGNDPLSQ | D | KSLFGNDPL | A*02:01 | 1.40 | KSLFGNDPL | A*02:01 | 31.04 | |||
| PLTSLKSLFGNDPLSQ | D | KSLFGNDPL | Cw*08:02 | 1.20 | |||||||
| D | |||||||||||
| 11 | HERIEVKDTKEALEKI | D | EVKDTKEAL | B*14:02 | 1.40 | ||||||
| HERIEVKDTKEALEKI | D | IEVKDTKEA | B*44:03 | 1.99 | IEVKDTKEA | B*44:03 | 13.12 | ||||
| E94 | A*34:02 | 12 | KIEEIQNKSKQKTQQAAA | A1 | EIQNKSKQK | A*34:02 | 1.03 | EIQNKSKQK | B*44:03 | 35.84 | |
| A*74:01 | 13 | NHPSCVWLEAQEEEEVGF | A1 | LEAQEEEEV | B*44:15 | 1.70 | LEAQEEEEV | B*44:04 | 12.09 | LEAQEEEEV | |
| B*44:03 | 14 | HQDPIPKQPSSQPRGD | D | HQDPIPKQP | Cw*04:01 | 0.60 | HQDPIPKQP | Cw*04:01 | 7.89 | ||
| B*58:02 | LEAQEEEEV | Cw*06:02 | 8.72 | ||||||||
| Cw*04:01 | |||||||||||
| Cw*06:02 | |||||||||||
| E95 | A*23:01 | 15 | VAVHVASGYIEAEVIPA | A1 | VAVHVASGY | Cw*16:01 | 1.50 | VAVHVASGY | A*23:01 | 30.27 | |
| A*74:01 | 17 | KRWIILGLNKIVRMYSPV | A1 | WIILGLNKI | A*23:01 | 0.60 | WIILGLNKI | A*23:01 | 6.99 | ||
| B*44:03 | 16 | KRWIILGLNKIVRMYSPV | A1 | IILGLNKIV | A*74:01 | 0.60 | IILGLNKIV | B*44:03 | 28.76 | ||
| B*15:10 | 17 | NMMLNIVGGHQAAMQMLK | A1 | HQAAMQMLK | B*15:10 | 0.17 | HQAAMQMLK | ||||
| Cw*04:01 | NMMLNIVGGHQAAMQMLK | A1 | HQAAMQMLK | A*74:01 | 0.90 | ||||||
| Cw*16:01 | NMMLNIVGGHQAAMQMLK | A1 | HQAAMQMLK | Cw*04:01 | 1.30 | HQAAMQMLK | Cw*04:01 | 26.63 | |||
| 18 | KNWMTETLLVQNANPDCK | A1 | TETLLVQNA | B*44:15 | 0.09 | TETLLVQNA | B*44:03 | 5.85 | |||
| KNWMTETLLVQNANPDCK | A1 | KNWMTETLL | A*23:01 | 0.80 | KNWMTETLL | A*23:01 | 25.75 | ||||
| 19 | FRDYVDRFFKTLRAEQA | A1 | FRDYVDRFF | Cw*04:01 | 0.03 | FRDYVDRFF | Cw*04:01 | 4.56 | |||
| FRDYVDRFFKTLRAEQA | A1 | FRDYVDRFF | A*23:01 | 0.60 | FRDYVDRFF | A*23:01 | 4.56 | ||||
| FRDYVDRFFKTLRAEQA | A1 | FRDYVDRFF | Cw*04:01 | 1.10 | FRDYVDRFF | Cw*04:01 | 0.68 | ||||
| 20 | GATLEEMMTACQGVGGPGH | A1 | EEMMTACQG | B*44:03 | 0.25 | EEMMTACQG | B*44:03 | 0.97 | |||
| 21 | LRALGPGATLEEMMTA | A1 | RALGPGATL | B*15:10 | 1.80 | RALGPGATL | B*44:01 | 0.56 | |||
| LRALGPGATLEEMMTA | A1 | RALGPGATL | Cw*04:01 | 0.60 | RALGPGATL | Cw*04:01 | 0.56 | ||||
| 22 | FFKTLRAEQATQEVKNWM | A1 | AEQATQEVK | B*44:03 | 0.15 | AEQATQEVK | B*44:03 | 8.59 | |||
| 23 | MEKEGKISKIGPENPY | A1 | SKIGPENPY | B*15:03 | 0.50 | SKIGPENPY | A*23:01 | 35.68 | |||
| MEKEGKISKIGPENPY | A1 | SKIGPENPY | B*15:10 | 0.50 | SKIGPENPY | B*44:03 | 9.49 | ||||
| 25 | WVKVIEEKAFSPEVIPMF | A1 | AFSPEVIPMF | A*23:01 | 0.40 | AFSPEVIPM | A*23:01 | 5.95 | |||
| WVKVIEEKAFSPEVIPMF | A1 | WVKVIEEKA | A*23:01 | 1.70 | WVKVIEEKA | A*23:01 | 17.53 | ||||
| WVKVIEEKAFSPEVIPMF | A1 | EEKAFSPEV | B*44:03 | 0.80 | EEKAFSPEV | B*44:03 | 3.38 | ||||
| WVKVIEEKAFSPEVIPMF | A1 | EEKAFSPEV | B*44:15 | 0.03 | EEKAFSPEV | ||||||
| WVKVIEEKAFSPEVIPMF | A1 | FSPEVIPMF | Cw*04:01 | 0.50 | FSPEVIPMF | Cw*04:01 | 1.63 | FSPEVIPMF | |||
| WVKVIEEKAFSPEVIPMF | A1 | FSPEVIPMF | Cw*16:01 | 0.50 | FSPEVIPMF | A*23:01 | 0.46 | ||||
| WVKVIEEKAFSPEVIPMF | A1 | KAFSPEVIP | Cw*16:01 | 1.20 | KAFSPEVIP | B*4403 | 35.68 | ||||
| 25 | HQMKDCTERQANFLGKIW | A1 | RQANFLGKI | B*4403 | 1.00 | RQANFLGKI | B*4403 | 11.67 | |||
| 26 | PMFSALSEGATPQDLNMM | A1 | SEGATPQDL | B*44:03 | 0.80 | SEGATPQDL | B*4403 | 0.33 | |||
| 27 | HLARNCRAPRKKGCWK | A1 | HLARNCRAP | A*74:01 | 0.60 | HLARNCRAP | A*23:01 | 35.46 | |||
| A1 | |||||||||||
| 28 | VATLYCVHQRIDVKDTK | A1 | ATLYCVHQR | A*74:01 | 0.90 | ATLYCVHQR | A*23:01 | 21.03 | |||
| 29 | KIEEIQNKSKQKTQQAAA | A1 | EIQNKSKQK | A*74:01 | 1.03 | EIQNKSKQK | C*04:01 | 10.98 | |||
| 30 | AGPIPPGQMREPRGSDIA | A1 | AGPIPPGQM | B*15:10 | 0.60 | AGPIPPGQM | C*04:01 | 2.02 | |||
| A1 | |||||||||||
| E913 | A*02:01 | 31 | LWQRPLVTIKIGGQLKEA | D | LWQRPLVTI | A*02:01 | 1.60 | LWQRPLVTI | A*02:01 | 11.25 | |
| A*34:02 | LWQRPLVTIKIGGQLKEA | D | QRPLVTIKI | Cw*06:02 | 0.70 | QRPLVTIKI | Cw*06:02 | 16.29 | |||
| B*45:01 | LWQRPLVTIKIGGQLKEA | D | WQRPLVTIK | B*47:01 | 1.90 | WQRPLVTIK | B*45:01 | 19.93 | |||
| B*47:01 | 32 | TVPVKLKPGMDGPKVKQW | A1 | LKPGMDGPK | A*34:02 | 0.90 | LKPGMDGPK | Cw*06:02 | 26.76 | ||
| Cw*06:02 | |||||||||||
| Cw*16:01 | |||||||||||
| E914 | A*01:01 | 33 | DKWASLWNWFSITQWLWY | D | FSITQWLWY | A*01:01 | 0.06 | FSITQWLWY | B*07:02 | 24.05 | FSITQWLWY |
| A*02:01 | DKWASLWNWFSITQWLWY | D | KWASLWNWF | Cw*04:07 | 1.20 | ||||||
| B*07:02 | DKWASLWNWFSITQWLWY | D | SLWNWFSIT | A*02:01 | 1.66 | ||||||
| B*44:15 | 34 | PVDPDEVEKATEGENNSL | A1 | ATEGENNSL | A*01:01 | 1.74 | |||||
| Cw*04:07 | |||||||||||
| Cw*07:02 | |||||||||||
| L91 | A*02:01 | 35 | EQMHTDIISLWDQSLK | A1 | IISLWDQSLK | A*03:01 | 1.90 | IISLWDQSL | A*03:01 | 20.55 | ISLWDQSLK |
| A*03:01 | EQMHTDIISLWDQSLK | A1 | MHTDIISLW | B*58:02 | 0.90 | MHTDIISLW | A*03:01 | 23.95 | |||
| B*53:01 | EQMHTDIISLWDQSLK | A1 | MHTDIISLW | Cw*06:02 | 1.30 | MHTDIISLW | Cw*06:02 | 3.72 | |||
| B*58:02 | EQMHTDIISLWDQSLK | A1 | QMHTDIISL | A*02:01 | 1.90 | ||||||
| Cw*04:01 | EQMHTDIISLWDQSLK | A1 | QMHTDIISL | B*5301 | 1.20 | QMHTDIISL | B*5301 | 18.52 | |||
| Cw*06:02 | 36 | LETSEGCKQIIGQLQPAI | D | ILAQLQPAI | A*02:01 | 0.40 | |||||
| 37 | SGGKLDAWEKIRLRPGGK | A1 | KIRLRPGGK | A*03:01 | 0.25 | KIRLRPGGK | A*03:01 | 0.06 | KIRLRPGGK | ||
| 38 | LETTEGCQQIMEQLQPAL | A1 | IMEQLQPAL | A*03:01 | 0.40 | IMEQLQPAL | A*03:01 | 21.06 | |||
| LETTEGCQQIMEQLQPAL | A1 | IMEQLQPAL | Cw*04:01 | 0.80 | IMEQLQPAL | Cw*04:01 | 0.19 | ||||
| LETTEGCQQIMEQLQPAL | A1 | QIMEQLQPA | A*02:01 | 0.70 | |||||||
| 39 | ERILSTCLGRSAEPVPL | A1 | RSAEPVPL | B*58:02 | 0.12 | ||||||
| ERILSTCLGRSAEPVPL | A1 | RILSTCLGR | A*03:01 | 0.90 | RILSTCLGR | A*03:01 | 0.11 | ||||
| ERILSTCLGRSAEPVPL | A1 | CLGRSAEPV | A*02:01 | 2.00 | |||||||
| 40 | LVGPTPVNIIGRNMLTQI | A1 | LVGPTPVNI | A*02:01 | 1.61 | LVGPTPVNI | :01 | 4.69 | |||
| 41 | CKQIIGQLQPAIQTGSEEL | D | QIIGQLQPA | A*02:01 | 1.80 | ||||||
| CKQIIGQLQPAIQTGSEEL | D | AIQTGSEEL | A*03:01 | 1.50 | AIQTGSEEL | A*03:01 | 21.04 | ||||
| CKQIIGQLQPAIQTGSEEL | D | IIGQLQPAI | A*03:01 | 1.10 | |||||||
| 42 | PAIQTGSEELRSLYNTVA | D | AIQTGSEEL | A*03:01 | 1.50 | AIQTGSEEL | Cw*06:02 | 3.74 | |||
| PAIQTGSEELRSLYNTVA | D | LRSLYNTVA | Cw*06:02 | 0.70 | LRSLYNTVA | Cw*06:02 | 5.48 | ||||
| L92 | A*02:01 | 43 | NDIQKLVGKLNWASQIYP | D | KLNWASQIY | A*30:02 | 0.50 | KLNWASQIY | A*30:02 | 5.48 | KLNWASQIY |
| A*30:02 | NDIQKLVGKLNWASQIYP | D | KLVGKLNWA | A*02:01 | 0.60 | KLVGKLNWA | A*02:01 | 1.22 | |||
| B*44:03 | NDIQKLVGKLNWASQIYP | D | KLNWASQIY | A*02:01 | 0.90 | KLNWASQIY | A*02:01 | 11.10 | |||
| B*14:02 | NDIQKLVGKLNWASQIYP | D | KLNWASQIY | Cw*04:01 | 1.80 | KLNWASQIY | Cw*04:01 | 13.24 | |||
| Cw*04:01 | 44 | LVVKTYWGLHTGEREWHL | D | LVVKTYWGL | A*02:01 | 1.70 | LVVKTYWGL | A*02:01 | 0.95 | ||
| Cw*08:02 | LVVKTYWGLHTGEREWHL | D | VVKTYWGLH | A*30:02 | 1.50 | ||||||
| 45 | SLVNRVRQGYSPLSFQTL | D | NRVRQGYSPL | B*14:02 | 0.12 | ||||||
| SLVNRVRQGYSPLSFQTL | D | YSPLSFQTL | Cw*04:01 | 0.70 | YSPLSFQTL | Cw*04:01 | 2.47 | ||||
| SLVNRVRQGYSPLSFQTL | D | RQGYSPLSF | A*30:02 | 1.20 | RQGYSPLSF | A*30:02 | 4.51 | RQGYSPLSF | |||
| SLVNRVRQGYSPLSFQTL | D | RQGYSPLSF | Cw*04:01 | 1.40 | RQGYSPLSF | Cw*04:01 | 3.12 | ||||
| 46 | TLPCRIKQIINMWQGV | D | CRIKQIINM | A*02:02 | 0.40 | CRIKQIINM | A*30:02 | 28.36 | CRIKQIINM | ||
| 47 | MRVRGIQRNYQHLWRW | D | RNYQHLWRW | B*44:03 | 0.40 | RNYQHLWRW | B*44:03 | 3.17 | |||
| 48 | GEMKNCSFNITTEIRDKK | D | EMKNCSFNI | B*44:03 | 0.30 | EMKNCSFNI | B*44:03 | 32.11 | |||
| 49 | NVTENFNMWKNNMVEQMH | D | NFNMWKNNM | Cw*04:01 | 1.06 | NFNMWKNNM | Cw*04:02 | 4.53 | |||
| NVTENFNMWKNNMVEQMH | D | TENFNMWKNNM | B*44:03 | 1.81 | TENFNMWKN | B*44:03 | 7.71 | ||||
| 50 | WLIDRIRERAEDSGNESE | D | WLIDRIRER | A*02:01 | 2.00 | WLIDRIRER | A*02:01 | 4.68 | |||
| L94 | A*3402 | 51 | LIHLHYFDCFSDSAIRKA | A1 | YFDCFSDSA | Cw*04:01 | 0.90 | YFDCFSDSA | Cw*04:01 | 2.52 | |
| A*7401 | LIHLHYFDCFSDSAIRKA | A1 | YFDCFSDSA | Cw*06:02 | 1.60 | YFDCFSDSA | Cw*06:02 | 14.91 | |||
| B*4403 | LIHLHYFDCFSDSAIRKA | A1 | HLHYFDCFSDSAIR | A*7401 | 1.40 | ||||||
| B*5802 | LIHLHYFDCFSDSAIRKA | A1 | FSDSAIRKA | Cw*04:01 | 1.10 | FSDSAIRKA | Cw*04:01 | 0.78 | |||
| Cw*0401 | 52 | HLARNCRAPRKKGCWK | A1 | ARNCRAPRK | A*3402 | 1.10 | |||||
| Cw*0602 | HLARNCRAPRKKGCWK | A1 | HLARNCRAP | A*3402 | 1.50 | ||||||
| HLARNCRAPRKKGCWK | A1 | HLARNCRAP | A*74:01 | 0.60 | |||||||
| 53 | SKQKTQQAAADTGNSSKV | A1 | AADTGNSSK | A*3402 | 1.15 | ||||||
| 54 | HQDPIPKQPSSQPRGD | D | HQDPIPKQP | Cw*04:01 | 0.60 | HQDPIPKQP | Cw*04:01 | 7.89 | |||
| L95 | A*23:01 | 55 | KRWIILGLNKIVRMYSPV | A1 | WIILGLNKI | A*23:01 | 0.60 | WIILGLNKI | A*23:01 | 6.99 | |
| A*74:01 | KRWIILGLNKIVRMYSPV | A1 | IILGLNKIV | A*74:01 | 0.60 | IILGLNKIV | Cw*04:01 | 2.81 | |||
| B*44:03 | 56 | NMMLNIVGGHQAAMQMLK | A1 | GHQAAMQML | B*15:10 | 0.40 | |||||
| B*15:10 | NMMLNIVGGHQAAMQMLK | A1 | HQAAMQMLK | A*74:01 | 0.90 | HQAAMQMLK | |||||
| Cw*04:01 | NMMLNIVGGHQAAMQMLK | A1 | HQAAMQMLK | Cw*04:01 | 1.30 | HQAAMQMLK | Cw*04:01 | 26.63 | |||
| Cw*16:01 | 57 | EVNIVTDSQYALGIIQA | A1 | EVNIVTDSQ | B*44:03 | 0.50 | EVNIVTDSQ | B*44:03 | 19.78 | ||
| 58 | AYETEMHNVWATHACV | A1 | TEMHNVWAT | B*44:03 | 0.40 | TEMHNVWAT | B*44:03 | 0.88 | YETEMHNVW | ||
| AYETEMHNVWATHACV | A1 | MHNVWATHA | B*15:10 | 0.90 | MHNVWATHA | Cw*04:01 | 11.32 | ||||
| AYETEMHNVWATHACV | A1 | MHNVWATHA | B*44:03 | 0.90 | MHNVWATHA | B*44:04 | 22.06 | ||||
| 59 | AAEWDRLHPVHAGPI | A1 | AAEWDRLHP | B*44:03 | 0.50 | AAEWDRLHP | B*44:03 | 34.74 | AEWDRLHPV | ||
| AAEWDRLHPVHAGPI | A1 | LHPVHAGPI | B*15:10 | 0.15 | AAEWDRLHP | A*23:01 | 34.79 | ||||
| AAEWDRLHPVHAGPI | A1 | RLHPVHAGP | A*74:01 | 1.50 | AAEWDRLHP | Cw*04:01 | 6.59 | ||||
| 60 | LRALGPGATLEEMMTA | A1 | RALGPGATL | B*15:10 | 0.60 | RALGPGATL | A*23:01 | 8.98 | |||
| LRALGPGATLEEMMTA | A1 | RALGPGATL | Cw*04:01 | 0.60 | RALGPGATL | Cw*04:01 | 0.56 | ||||
| 61 | FFKTLRAEQATQEVKNWM | A1 | AEQATQEVK | B*44:03 | 0.15 | AEQATQEVK | B*44:03 | 8.59 | |||
| 62 | GTTSTPQEQIGWMTGNPPI | A1 | QEQIGWMTG | B*44:15 | 0.65 | QEQIGWMTG | B*44:03 | 1.85 | |||
| GTTSTPQEQIGWMTGNPPI | A1 | GWMTGNPPI | A*23:01 | 1.40 | GWMTGNPPI | A*23:01 | 0.75 | ||||
| 63 | WVKVIEEKAFSPEVIPMF | A1 | EEKAFSPEV | B*44:15 | 0.62 | ||||||
| WVKVIEEKAFSPEVIPMF | A1 | EEKAFSPEV | B*44:03 | 0.80 | EEKAFSPEV | B*44:03 | 3.38 | EEKAFSPEV | |||
| WVKVIEEKAFSPEVIPMF | A1 | WVKVIEEKA | A*23:01 | 1.70 | WVKVIEEKA | A*23:01 | 34.76 | ||||
| WVKVIEEKAFSPEVIPMF | A1 | FSPEVIPMF | Cw*04:01 | 0.50 | FSPEVIPMF | Cw*04:01 | 1.63 | ||||
| WVKVIEEKAFSPEVIPMF | A1 | AFSPEVIPM | Cw*16:01 | 0.70 | AFSPEVIPM | Cw*04:01 | 0.28 | ||||
| WVKVIEEKAFSPEVIPMF | A1 | KAFSPEVIP | Cw*16:01 | 1.20 | KAFSPEVIP | Cw*04:01 | 7.47 | ||||
| 64 | TVYYGVPVWKDAETTLF | A1 | TVYYGVPVW | A*74:01 | 0.17 | TVYYGVPVW | B*44:03 | 7.96 | VYYGVPVWK | ||
| TVYYGVPVWKDAETTLF | A1 | TVYYGVPVW | Cw*16:01 | 1.10 | TVYYGVPVW | Cw*04:01 | 1.25 | ||||
| TVYYGVPVWKDAETTLF | A1 | WKDAETTLF | B*15:10 | 0.90 | |||||||
| TVYYGVPVWKDAETTLF | A1 | VWKDAETTL | A*23:01 | 0.60 | WKDAETTLF | A*23:01 | 4.36 | ||||
| TVYYGVPVWKDAETTLF | A1 | VWKDAETTL | Cw*04:01 | 0.60 | VWKDAETTL | Cw*04:01 | 1.97 | ||||
| 65 | LRWGTMILGMIIICSAA | A1 | RWGTMILGM | A*23:01 | 0.90 | RWGTMILGM | A*23:01 | 3.97 | |||
| LRWGTMILGMIIICSAA | A1 | RWGTMILGM | Cw*04:01 | 0.94 | RWGTMILGM | Cw*04:01 | 1.34 | ||||
| 66 | GHQAAMQMLKDTINEEAA | A1 | HQAAMQMLK | A*74:01 | 0.90 | HQAAMQMLK | A*23:01 | 32.48 | |||
| 67 | IKQGPKEPFRDYVDRFFK | A1 | FRDYVDRFF | A*23:01 | 0.60 | FRDYVDRFF | A*23:01 | 10.14 | |||
| IKQGPKEPFRDYVDRFFK | A1 | FRDYVDRFF | Cw*04:01 | 0.60 | FRDYVDRFF | Cw*04:01 | 0.68 | ||||
| 68 | FRDYVDRFFKTLRAEQA | A1 | FRDYVDRFF | A*23:01 | 0.60 | FRDYVDRFF | A*23:01 | 10.14 | |||
| A1 | |||||||||||
| 69 | MREPRGSDIAGTTSTPQEQI | A1 | MREPRGSDI | B*15:10 | 2.00 | MREPRGSDI | Cw*04:01 | 1.95 | |||
| 70 | EKIRLRPGGKKKYRLKHL | A1 | RLRPGGKKK | A*74:01 | 0.28 | RLRPGGKKK | A*23:01 | 28.99 | |||
| 71 | VATLYCVHQRIDVKDTK | A1 | ATLYCVHQR | A*74:01 | 0.90 | ATLYCVHQR | B*44:03 | 28.12 | |||
| 72 | LFCASDAKAYETEMHNVW | A1 | SDAKAYETEMHNVW | B*44:03 | 0.31 | KAYETEMHN | B*44:03 | 35.92 | |||
| L913 | A*02:01 | 73 | PPLVKLWYQLEKEPIIGA | D | LVKLWYQLE | A*34:01 | 0.50 | LVKLWYQLE | A*02:01 | 18.93 | |
| A*34:02 | PPLVKLWYQLEKEPIIGA | D | KLWYQLEKEPIIGA | A*02:01 | 1.47 | WYQLEKEPI | A*02:01 | 18.24 | |||
| B*45:01 | PPLVKLWYQLEKEPIIGA | D | QLEKEPIIG | B*45:01 | 0.90 | QLEKEPIIG | B*45:01 | 17.25 | |||
| B*47:01 | PPLVKLWYQLEKEPIIGA | D | YQLEKEPII | A*02:01 | 0.80 | YQLEKEPII | A*02:01 | 0.25 | |||
| Cw*06:02 | PPLVKLWYQLEKEPIIGA | D | YQLEKEPII | B*47:01 | 1.20 | YQLEKEPII | Cw*06:02 | 2.05 | |||
| Cw*16:01 | 74 | KWKPKMIGGIGGFIKVR | D | MIGGIGGFIK | A*34:02 | 0.20 | |||||
| KWKPKMIGGIGGFIKVR | D | KMIGGIGGF | A*02:01 | 1.00 | KMIGGIGGF | A*02:01 | 1.55 | ||||
| KWKPKMIGGIGGFIKVR | D | KMIGGIGGF | B*47:01 | 1.10 | KMIGGIGGF | Cw*06:02 | 5.37 | ||||
| 75 | VIWGKTPKFRLPIQKETW | D | IVIWGKTPK | A*34:02 | 0.15 | ||||||
| VIWGKTPKFRLPIQKETW | D | KTPKFRLPI | Cw*16:01 | 1.10 | KTPKFRLPI | A*02:01 | 4.43 | ||||
| VIWGKTPKFRLPIQKETW | D | VIWGKTPKF | A*34:02 | 1.00 | VIWGKTPKF | Cw*06:02 | 7.82 | ||||
| 76 | RQANFLGKIWPSHKGR | D | RQANFLGKI | B*47:01 | 0.40 | ||||||
| RQANFLGKIWPSHKGR | D | RQANFLGKI | Cw*06:02 | 2.00 | RQANFLGKI | Cw*06:02 | 9.23 | ||||
| RQANFLGKIWPSHKGR | D | FLGKIWPSH | A*34:02 | 1.10 | RQANFLGKI | B*45:01 | 7.03 | ||||
| 77 | KIEELREHLLRWGFTTPDK | D | REHLLRWGF | B*47:01 | 0.03 | REHLLRWGF | A*02:01 | 20.74 | |||
| KIEELREHLLRWGFTTPDK | D | REHLLRWGF | B*45:01 | 0.70 | REHLLRWGF | B*45:01 | 1.11 | ||||
| KIEELREHLLRWGFTTPDK | D | LREHLLRWG | Cw*06:02 | 1.40 | LREHLLRWG | Cw*06:02 | 19.44 | ||||
| KIEELREHLLRWGFTTPDK | D | HLLRWGFTT | A*02:01 | 1.20 | HLLRWGFTT | A*02:01 | 0.20 | ||||
| 78 | GFAILKCKDKEFNGTGPCK | A1 | KEFNGTGPC | B*45:01 | 1.50 | KEFNGTGPC | B*45:01 | 1.25 | |||
| 79 | AILNIPTRIRQGLERALL | D | IRQGLERAL | Cw*06:02 | 0.60 | IRQGLERAL | Cw*06:02 | 0.71 | |||
| AILNIPTRIRQGLERALL | D | AILNIPTRI | A*02:01 | 0.60 | |||||||
| AILNIPTRIRQGLERALL | D | RQGLERALL | B*47:01 | 1.70 | RQGLERALL | B*45:01 | 13.50 | ||||
| 80 | QKTELQAINLALQDSGLEV | D | LALQDSGLE | A*02:01 | 1.50 | LALQDSGLE | A*02:01 | 22.65 | |||
| QKTELQAINLALQDSGLEV | D | TELQAINLA | B*47:01 | 0.60 | TELQAINLA | B*45:01 | 0.15 | ||||
| QKTELQAINLALQDSGLEV | D | QKTELQAIN | B*45:01 | 1.00 | QKTELQAIN | B*45:01 | 12.04 | ||||
| QKTELQAINLALQDSGLEV | D | NLALQDSGL | A*34:02 | 1.50 | NLALQDSGL | A*02:01 | 4.72 | ||||
| 81 | IIGRNLLTQIGCTLNFPI | D | IGCTLNFPI | A*02:01 | 0.90 | IGCTLNFPI | A*02:01 | 4.79 | |||
| IIGRNLLTQIGCTLNFPI | D | NLLTQIGCTLNFPI | A*02:01 | 1.63 | |||||||
| IIGRNLLTQIGCTLNFPI | D | LLTQIGCTL | A*02:01 | 1.90 | LLTQIGCTL | A*02:01 | 0.53 | ||||
| IIGRNLLTQIGCTLNFPI | D | TQIGCTLNF | Cw*16:01 | 1.40 | TQIGCTLNF | Cw*06:02 | 2.26 | ||||
| IIGRNLLTQIGCTLNFPI | D | LLTQIGCTL | Cw*16:01 | 0.90 | |||||||
| 82 | KWKPKMIGGIGGFIKVR | D | KMIGGIGGF | A*02:01 | 1.00 | KMIGGIGGF | A*02:01 | 1.55 | |||
| 83 | LWQRPLVTIKIGGQLKEA | D | LWQRPLVTI | A*02:01 | 1.60 | LWQRPLVTI | A*02:01 | 11.25 | |||
| LWQRPLVTIKIGGQLKEA | D | QRPLVTIKI | Cw*06:02 | 0.70 | LWQRPLVTI | Cw*06:02 | 0.74 | ||||
| 84 | LKEALLDTGADDTVLEEI | D | LKEALLDTG | B*45:01 | 1.20 | LKEALLDTG | B*45:01 | 12.15 | |||
| 85 | KRQEILDLWVYHTQGYF | A1 | QEILDLWVY | B*45:01 | 1.70 | QEILDLWVY | B*45:01 | 1.02 | |||
| KRQEILDLWVYHTQGYF | A1 | QEILDLWVY | B*47:01 | 0.90 | |||||||
| KRQEILDLWVYHTQGYF | A1 | RQEILDLWV | Cw*06:02 | 1.10 | RQEILDLWV | Cw*06:02 | 21.81 | ||||
| KRQEILDLWVYHTQGYF | A1 | ILDLWVYHT | A*02:01 | 0.70 | ILDLWVYHT | B*45:01 | 13.80 | ||||
| D | |||||||||||
| L914 | A*01:01 | 86 | SFNCGGEFFYCNTSGLF | A1 | SFNCGGEFFY | A*01:01 | 0.25 | ||||
| A*02:01 | SFNCGGEFFYCNTSGLF | A1 | SFNCGGEFF | Cw*04:07 | 1.30 | SFNCGGEFF | B*44:03 | 7.72 | |||
| B*07:02 | SFNCGGEFFYCNTSGLF | A1 | SFNCGGEFF | Cw*07:02 | 1.10 | SFNCGGEFF | B*07:02 | 22.45 | |||
| B*44:03 | SFNCGGEFFYCNTSGLF | A1 | GEFFYCNTS | B*44:03 | 1.30 | GEFFYCNTS | B*44:03 | 1.80 | |||
| Cw*04:07 | 87 | MEKEGKISKIGPENPY | A1 | KEGKISKIGPENPY | B*44:03 | 1.30 | KISKIGPEN | B*44:03 | 39.08 | ||
| Cw*07:02 | MEKEGKISKIGPENPY | A1 | ISKIGPENP | A*01:01 | 1.80 | ISKIGPENP | B*07:02 | 27.27 | |||
| 88 | ARKNRRRRWRARQRQI | A1 | RRWRARQRQ | Cw*07:02 | 0.60 | RRWRARQRQ | Cw*07:02 | 19.17 |
Experimentally mapped peptides for all participants and their cognate computational core 9-mer and a single 14-mer epitope sequence with scores. Peptides shown in italic text were not algorithmically predicted as binders. Multiple computational predictions contained in a single experimental peptide were counted as a single hit. Participant’s identifiers (ID) beginning with E or L represent early or late time sampling points respectively
Peptides not predicted
| Participant’s Identification | Participant’s HLA Alleles | Experimental Peptide Sequence |
|---|---|---|
| E92 | A*02:01 | FKGPRKIIKCFNCGKEGHI |
| A*30:02 | ||
| B*44:03 | ||
| B*14:02 | ||
| Cw*04:01 | ||
| Cw*08:02 | ||
| E95 | A*23:01 | LVQNANPDCKSILRAL (both time points) |
| A*74:01 | SKQKTQQAAADTGNSSKV | |
| B*44:03 | ||
| B*15:10 | ||
| Cw*04:01 | ||
| Cw*16:01 | ||
| L913 | A*02:01 | IYSLIEESQNQQEKNEQEL |
| A*34:02 | ||
| B*45:01 | ||
| B*47:01 | ||
| Cw*06:02 | ||
| Cw*16:01 |
Experimentally mapped peptides that were not predicted by NetMHCpan4.0 as binders. Participant’s identifiers beginning with E or L represent early or late time sampling points respectively
Fig. 3Computational epitope prediction. NetMHCpan4.0 set length plotted against the number of predicted binders per HLA type shows that the number of predictions reduces as the input set length increases. The dotted line is the trend line, whereas the solid line is the line of best fit. The core 9mer epitope sequence was similar across 9mer through 14mer set length except for one 14-mer peptide (hit 72 in Table 2)
Experimental and computational 9mer peptide confusion matrix
| Experimental Positive | Experimental Negative | |
|---|---|---|
(≥1 epitope(s) contained in a single experimental peptide sequence) | (Hits in table 2) | |
The total number of peptides experimentally tested were 757 and these are broken down to show the fractions from both the experimental testing and NetMHCpan4.0 computational predictions
Fig. 4Early versus Late Peptides. Experimentally mapped peptides at baseline (n = 34) and at least 12 months later (n = 34) were compared using the 9-mer computational NetMHCpan4.0 scores of the hits. The lower the computational score the stronger the predicted binding. Late peptides were significantly stronger binders than early peptides (Wilcoxon signed rank test, p = 0.0000005)
Fig. 5ROC plot. False versus true positive rate for all 9-mer and a single 14-mer test peptides across the 22 test HLA class I types. The diagonal line shows the random guess whereas the red curve shows the observed experimentally mapped epitopes versus the NetMHCpan4.0 expected predictions