| Literature DB >> 32511339 |
Ang Gao1,2, Zhilin Chen3, Florencia Pereyra Segal4, Mary Carrington3,5, Hendrik Streeck6, Arup K Chakraborty1,3,2,7,8, Boris Julg3.
Abstract
We describe a physics-based learning model for predicting the immunogenicity of Cytotoxic T Lymphocyte (CTL) epitopes derived from diverse pathogens, given a Human Leukocyte Antigen (HLA) genotype. The model was trained and tested on experimental data on the relative immunodominance of CTL epitopes in Human Immunodeficiency Virus infection. The method is more accurate than publicly available models. Our model predicts that only a fraction of SARS-CoV-2 epitopes that have been predicted to bind to HLA molecules is immunogenic. The immunogenic CTL epitopes across all SARS-CoV-2 proteins are predicted to provide broad population coverage, but the immunogenic epitopes in the SARS-CoV-2 spike protein alone are unlikely to do so. Our model predicts that several immunogenic SARS-CoV-2 CTL epitopes are identical to those contained in low-pathogenicity coronaviruses circulating in the population. Thus, we suggest that some level of CTL immunity against COVID-19 may be present in some individuals prior to SARS-CoV-2 infection.Entities:
Year: 2020 PMID: 32511339 PMCID: PMC7241102 DOI: 10.1101/2020.05.14.095885
Source DB: PubMed Journal: bioRxiv
Fig. 1.The ROC curve of the binary classifier based on our model (red), compared with the model developed by Calis et al (Calis et al., 2013) (green). (A) shows the ROC curves for the acute HIV infection group. The AUC of the red curve is 0.71. The AUC of the green curve is 0.57. (B) shows the ROC curves for the chronic HIV infection group. The AUC of the red curve is 0.66. The AUC of the green curve is 0.34.
The top most immunogenic SARS-CoV-2 peptides predicted by our model. (A) shows the immunogenic peptides that are filtered from the peptide pool reported by Ahmed et al (Ahmed, Quadeer and McKay, 2020). (B) shows the immunogenic peptides that are filtered from those reported by Prachar et al (Prachar et al., 2020).
| HLA | Peptide |
|---|---|
| HLA-A*02:01 | FLWLLWPVTL; SMWSFNPET; WLLWPVTLA; LLLDRLNQL; FLLNKEMYL; FLLPSLATV; FLNRFTTTL; FLPRVFSAV; ILGTVSWNL; ALWEIQQVV; KLWAQCVQL; FVDGVPFVV; KLNVGDYFV; LLDDFVEII; LLLDDFVEI; LVLSVNPYV; NLWNTFTRL; TLVPQEHYV; TMADLVYAL; VLWAHGFEL; YLDAYNMMI; YLNTLTLAV; HLVDFQVTI; RLDKVEAEV; VLNDILSRL |
| HLA-B*44:03 | LEQWNLVIGF; YEQYIKWPWY |
| HLA-A*23:01 | LWLLWPVTL; SFNPETNIL; GYQPYRVVVL |
| HLA-A*24:02 | LWLLWPVTL; SFNPETNIL; KQFDTYNLW; GYQPYRVVVL |
| HLA-A*30:02 | ATSRTLSYY; GTTLPKGFY; LSPRWYFYY; VTPSGTWLTY; ISDYDYYRY; RVDFCGKGY |
| HLA-A*11:01 | ATSRTLSYYK; KTFPPTEPK; KTFPPTEPKK; AVLQSGFRK; GVAMPNLYK; RLYYDSMSY; VVYRGTTTY; SVLNDILSR |
| HLA-B*07:02 | FPRGQGVPI; SPRWYFYYL; MPASWVMRI; FPPTSFGPL; IPRRNVATL; HPLADNKFAL |
| HLA-B*54:01 | FPRGQGVPI; SPRWYFYYL; FPPTSFGPL; APHGVVFLHV |
| HLA-B*08:01 | FPRGQGVPI |
| HLA-B*35:01 | FPRGQGVPI; MPASWVMRI; VVYRGTTTY; HPLADNKFAL; FPNITNLCPF |
| HLA-B*51:01 | FPRGQGVPI; FPPTSFGPL |
| HLA-B*53:01 | FPRGQGVPI; HPLADNKFAL; FPNITNLCPF |
| HLA-B*40:01 | MEVTPSGTWL; YEGNSPFHPL |
| HLA-A*29:02 | GTTLPKGFY; LSPRWYFYY; TPSGTWLTY; VTPSGTWLTY; ISDYDYYRY |
| HLA-A*03:01 | KTFPPTEPK; KTFPPTEPKK; AVLQSGFRK; GVAMPNLYK; KLFAAETLK; RLYYDSMSY; VVYRGTTTY |
| HLA-A*31:01 | KTFPPTEPK; KTFPPTEPKK; SVSPKLFIR |
| HLA-A*68:01 | KTFPPTEPK; KTFPPTEPKK |
| HLA-A*01:01 | LSPRWYFYY; VTPSGTWLTY; ISDYDYYRY; VVDKYFDCY; LIDLQELGKY; RVDFCGKGY |
| HLA-A*26:01 | VTPSGTWLTY |
| HLA-B*45:01 | AEGSRGGSQA |
| HLA-A*02:02 | FLGRYMSAL; FLLNKEMYL; FLLPSLATV; FLNRFTTTL; FLPRVFSAV; ALWEIQQVV; FVDGVPFVV; KLNVGDYFV; KLSYGIATV; LLDDFVEII; LLLDDFVEI; TMADLVYAL; VLWAHGFEL; YLDAYNMMI; YLNTLTLAV; HLVDFQVTI |
| HLA-A*02:03 | FLGRYMSAL; FLLNKEMYL; FLLPSLATV; FLNGSCGSV; FLNRFTTTL; FLPRVFSAV; ALWEIQQVV; FVDGVPFVV; KLSYGIATV; LLLDDFVEI; TLIGDCATV; TMADLVYAL; VLWAHGFEL; YLDAYNMMI; YLNTLTLAV; HLVDFQVTI |
| HLA-A*02:06 | FLLNKEMYL; FLLPSLATV; FLPRVFSAV; GVYDYLVST; IQPGQTFSV; ALWEIQQVV; FVDGVPFVV; LLDDFVEII; LLLDDFVEI; LVLSVNPYV; TMADLVYAL; VLWAHGFEL; YLDAYNMMI; YTMADLVYA; HLVDFQVTI |
| HLA-C*06:02 | FRYMNSQGL |
| HLA-A*68:02 | FVDGVPFVV; MVMCGGSLYV; YTMADLVYA |
| HLA-B*15:01 | KQFDTYNLW; RLYYDSMSY; TTLPVNVAF; VVYRGTTTY; YQKVGMQKY |
| HLA-B*58:01 | KQFDTYNLW; TTLPVNVAF; VVYRGTTTY |
| HLA-A*69:01 | TLVPQEHYV; VLWAHGFEL |
| HLA-B*40:02 | YEGNSPFHPL |
| HLA-B*27:05 | GRLQSLQTY; VRFPNITNL |
| HLA-C*14:02 | VRFPNITNL |
| HLA-B*44:02 | YEQYIKWPWY |
| HLA-B*18:01 | YEQYIKWPWY |
Shared immunogenic peptides between SARS-CoV-2 and four common low pathogenicity human coronaviruses. The first column shows the number of shared immunogenic peptides between SARS-CoV-2 and each of the four viruses. The second column shows the number of HLA alleles associated with those peptides. The third and fourth column show the population coverage of those HLAs for the US and World, respectively. The fifth column shows the average number of immunogenic peptides associated with each HLA. The last row of the table shows the average of all these quantities.
| N_peptide | N_HLA | HLA_coverage(US) | HLA_coverage (world) | Npep_per_HLA | |
|---|---|---|---|---|---|
| in229E | 8 | 24 | 93.60% | 90.20% | 1.67 |
| inHKUl | 31 | 29 | 97.80% | 96.60% | 5.24 |
| inHL63 | 17 | 28 | 96.70% | 95.00% | 2.86 |
| inOC43 | 20 | 25 | 97.50% | 96.30% | 2.32 |