| Literature DB >> 33469510 |
Mohammad Mostafa Pourseif1, Sepideh Parvizpour1, Behzad Jafari2,1, Jaber Dehghani1, Behrouz Naghili3, Yadollah Omidi4.
Abstract
Introduction: Coronavirus disease 2019 (COVID-19) is undoubtedly the most challenging pandemic in the current century with more than 293,241 deaths worldwide since its emergence in late 2019 (updated May 13, 2020). COVID-19 is caused by a novel emerged coronavirus named severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Today, the world needs crucially to develop a prophylactic vaccine scheme for such emerged and emerging infectious pathogens.Entities:
Keywords: COVID-19; Emerging virus; Epitope; Pandemic; SARS-CoV-2; Self-amplifying mRNA vaccine; Spike glycoprotein; Structural modeling
Year: 2020 PMID: 33469510 PMCID: PMC7803919 DOI: 10.34172/bi.2021.11
Source DB: PubMed Journal: Bioimpacts ISSN: 2228-5652
Fig. 1
Fig. 2
Fig. 3
Fig. 4Predicted B-cell epitopes from SARS-CoV-2 S protein
|
|
|
|
|
|
| AYTNSF | 27–32 | 0.545 | 0.346 | 0.67 |
| HAIHVSGTNGTKRFDNP* | 66–82 | 0.381 | 0.489 | 0.79 |
| TEKSNI | 95–100 | 0.050 | 0.115 | 0.66 |
| DSK | 111–113 | 0.115 | 0.231 | 0.64 |
| QFCNDPFLGVYYHKNNKSWMESEFRVYSSANNC* | 134–166 | 0.400 | 0.567 | 0.71 |
| VSQ | 171–173 | 0.0 | 0.0 | 0.62 |
| LMDLEGKQGNFKNLR* | 176–190 | 0.320 | 0.370 | 0.78 |
| SKHTPINLVRDLPQGFS* | 205–221 | 0.299 | 0.367 | 0.74 |
| LHRSYLTPGDSSSGWTA* | 244–260 | 0.435 | 0.652 | 0.81 |
| YNEN | 279–282 | 0.075 | 0.173 | 0.62 |
| NATRFASVYAWNRKRI | 343–358 | 0.091 | 0.130 | 0.63 |
| VIRGDEVRQIAPGQTGKIADYNYKLPD | 401–427 | 0.071 | 0.102 | 0.66 |
|
WNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEI | 436–508 | 0.319 | 0.351 | 0.76 |
| CGPKKSTNLVKNKCVNFNFNG | 525–545 | 0.064 | 0.132 | 0.65 |
| TESNKKFLPFQQFGRDIADTTDAVRDPQTLEI* | 553–584 | 0.107 | 0.195 | 0.68 |
| TNTS | 602–605 | 0.0 | 0.0 | 0.62 |
| HVNNSYEC | 655–662 | 0.116 | 0.173 | 0.66 |
| YQTQTNSPRRARSVASQ* | 674–690 | 0.210 | 0.367 | 0.68 |
| SLGAENSVAYSNNSIAIPTN* | 698–717 | 0.080 | 0.138 | 0.69 |
| DSTECS | 745–750 | 0.0 | 0.0 | 0.65 |
| VEQDKNTQ | 772–779 | 0.110 | 0.173 | 0.64 |
| KQIYKTPPIKDFGG | 786–799 | 0.114 | 0.148 | 0.67 |
| LPDPSKPSKRSF* | 806–817 | 0.074 | 0.115 | 0.73 |
| QYGDCLGDIAA | 836–846 | 0.068 | 0.126 | 0.63 |
| QNVLYENQK | 913–921 | 0.0 | 0.0 | 0.64 |
| RLDKVEA | 983–989 | 0.008 | 0.0 | 0.64 |
| GQSKR | 1035–1039 | 0.0 | 0.0 | 0.60 |
|
PAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQR | 1069–1146 | 0.053 | 0.089 | 0.68 |
Abbreviation:CBPS, Consensus B-cell epitope prediction score.
aThe average Shannon entropy score calculated using the multiple sequence alignment of the reference sequence of SARS-CoV-2 S protein (accession no. YP_009724390.1) and the hundred different isolates of spike glycoprotein of CoVs. b The average Shannon entropy score calculated using pairwise sequence alignment of the reference sequence of the S glycoproteins of SARS-CoV (accession no. NP_828851.1) and SARS-CoV-2. Final selected B-cell epitopes are indicated by *.
Fig. 5List of dominant SARS-CoV-2-derived cytotoxic T-cell peptides, their consensus rank (CR) scores, and the population coverage results in the most-affected countries with COVID-19
|
|
|
|
|
|
|
|
| IKWPWYIWLGFI# | 1210–1221 | 5.94 | A*02:06, A*23:01, A*24:02, A*32:01, B*35:01, B*51:01, B*53:01 | 41.88±14.94 | 0.50±0.20 | 0.18±0.04 |
| LQIPFAMQMAYRF# | 894–906 | 6.55 | A*02:06, A*23:01, A*24:02, A*26:01, A*33:01, A*68:01, B*08:01, B*15:01, B*35:01, B*53:01, B*58:01 | 54.66±17.07 | 0.72±0.25 | 0.24±0.06 |
| CEFQFCNDPFL# | 131–141 | 6.61 | A*02:06, A*02:01, A*23:01, B*44:03, B*40:01, B*44:02 | 55.88±10.14 | 0.68±0.16 | 0.24±0.05 |
| GVFVSNGTHWFV# | 1093–1104 | 7.48 | A*02:01, A*02:03, A*02:06, A*23:01, A*24:02, A*26:01, A*68:02, B*35:01, B*58:01 | 66.93±9.42 | 0.84±0.18 | 0.33±0.1 |
| FPNITNLCPF# | 329–338 | 8.64 | B*07:02, B*35:01, B*51:01, B*53:01 | 35.56±11.58 | 0.38±0.13 | 0.16±0.03 |
| GFIAGLIAIVM# | 1219–1229 | 9.01 | A*02:01, A*02:03, A*02:0, A*26:01, A*68:02, B*15:01 | 51.59±9.07 | 0.58±0.14 | 0.22±0.05 |
| EVFNATRFASVYAW# | 340–353 | 9.18 | A*30:01, A*68:02, B*08:01, B*15:01, B*35:01, B*57:01, B*58:01 | 39.95±10.45 | 0.45±0.14 | 0.17±0.03 |
| SGWTAGAAAYYV# | 256–267 | 9.42 | A*01:01, A*02:06, A*30:02, A*26:01, A*68:01, A*68:02, B*15:01 | 36.48±15.59 | 0.42±0.19 | 0.17±0.04 |
| LYNSASFSTFKCY# | 368–380 | 9.99 | A*03:01, A*11:01, A*23:01, A*24:02, A*68:01, B*15:01, B*58:01 | 58.69±14.05 | 0.73±0.21 | 0.27±0.09 |
| NFTISVTTEILPV# | 717–729 | 10.27 | A*02:01, A*02:03, A*02:06, A*26:01, A*68:02, B*51:01, B*58:01 | 54.44±8.9 | 0.64±0.14 | 0.23±0.05 |
| GYLQPRTFLLKY# | 268–279 | 10.5 | A*02:01, A*02:03, A*02:06, A*03:01, A*11:01, A*23:01, A*24:02, B*08:01, B*15:01 | 79.05±12.03 | 1.14±0.29 | 0.59±0.26 |
| YTNSFTRGVYY | 28–38 | 11.8 | A*01:01, A*02:03, A*26:01, A*30:02, A*68:02, B*15:01 | 31.69±14.91 | 0.35±0.18 | 0.15±0.03 |
| FLPFFSNVTWF | 55–65 | 12.19 | B*35:01, B*51:01, B*53:01, B*57:01 | 25.09±11.79 | 0.27±0.13 | 0.14±0.02 |
| EQYIKWPWYIW | 1207–1217 | 14.3 | A*23:01, A*24:02, B*44:02, B*44:03 | 35.34±9.59 | 0.4±0.12 | 0.16±0.02 |
| VYSSANNCTFEY | 159–170 | 15.8 | A*30:02, A*23:01, A*24:02, B*58:01, B*35:01, B*15:01 | 40.5±12.11 | 0.48±0.16 | 0.18±0.04 |
| CTLKSFTVEKGIY | 301–313 | 16.6 | A*03:01, A*11:01, A*30:02, A*68:01, B*57:01, B*58:01 | 42.77±8.86 | 0.48±0.1 | 0.18±0.03 |
a Average (±SD) projected population coverage. b Average number of epitope hits/HLA combinations recognized by the population. c Minimum number of epitope hits/HLA combinations recognized by 90% of the population. Final selected CD8+ T-cell epitopes are indicated by #.
List of top-scoring SARS-CoV-2-derived helper T-cell epitopes, their average adjusted ranks, and population coverage results in the most-affected countries with COVID-19
|
|
|
|
|
|
|
|
| TLDSKTQSLLIVNNATNVVIKVCEFQF# | 109–135 | 0.19 | DRB1*04:01, DRB1*13:02, DRB3*02:02 | 14.24±9.16 | 0.15±0.1 | 0.12±0.01 |
| YRVVVLSFELLHAPATVCGPKKS# | 508–530 | 0.25 | DRB1*01:01 | 10.74±6.4 | 0.11±0.07 | 0.11±0.01 |
| FKNLREFVFKNIDGYFKIYSKHTPI# | 186–210 | 0.39 | DRB5*01:01 | NA | NA | NA |
| IGINITRFQTLLALHRSYLTP# | 231–251 | 0.50 | DRB1*01:01, DRB1*15:01, DRB5*01:01 | 23.81±12.87 | 0.25±0.14 | 0.14±0.02 |
| MFVFLVLLPLVSSQCVNLT | 1–19 | 0.52 | DRB1*01:01, DRB1*11:01 | 20.44±10.42 | 0.21±0.11 | 0.13±0.02 |
| KVGGNYNYLYRLFRKSNLKPFER# | 444–466 | 0.57 | DRB1*11:01 | 11.09±4.95 | 0.11±0.05 | 0.11±0.01 |
| IAIPTNFTISVTTEILPVSMT# | 712–732 | 0.58 | DRB1*07:01 | 17.35±7.49 | 0.17±0.07 | 0.12±0.01 |
| TITSGWTFGAGAALQIPFAMQ# | 881–901 | 0.58 | DRB1*01:01, DRB1*09:01 | 15.2±9.3 | 0.15±0.1 | 0.12±0.02 |
| HFPREGVFVSNGTHWFVTQRNF# | 1088–1109 | 0.59 | DRB1*13:02, DRB3*01:01, DRB3*02:02 | 6.98±4.07 | 0.07±0.04 | 0.11±0.01 |
| VYADSFVIRGDEVRQIAPGQTGK | 395–417 | 0.64 | DRB3*01:01 | NA | NA | NA |
| SKHTPINLVRDLPQGFSALEP# | 205–225 | 0.64 | DRB1*03:01, DRB3*01:01 | 19.12±12.55 | 0.19±0.13 | 0.13±0.03 |
| KCVNFNFNGLTGTGVLTES | 537–555 | 0.69 | DRB1*09:01 | 6.21±7.34 | 0.06±0.07 | 0.11±0.01 |
| ADYSVLYNSASFSTFKC | 363–379 | 0.70 | DRB3*02:02 | NA | NA | NA |
| NATRFASVYAWNRKRISN | 343–360 | 0.71 | DRB5*01:01 | NA | NA | NA |
| ECSNLLLQYGSFCTQLNR | 748–765 | 0.71 | DRB1*15:01 | 15.01±8.07 | 0.15±0.08 | 0.12±0.01 |
| ENQKLIANQFNSAIGKI | 918–934 | 0.72 | DRB3*02:02 | NA | NA | NA |
| GNCDVVIGIVNNTVYDPL | 1124–1141 | 0.72 | DRB1*13:02 | 6.98±4.07 | 0.07±0.04 | 0.11±0.01 |
| AALQIPFAMQMAYRFNGI | 892–909 | 0.74 | DRB4*01:01 | NA | NA | NA |
| VQPTESIVRFPNITNLCPFG | 320–339 | 0.78 | DRB1*04:05, DRB1*15:01 | 18.42±8.21 | 0.19±0.08 | 0.12±0.01 |
| FGGFNFSQILPDPSK | 797–811 | 0.81 | DRB1*04:05 | 4.81±6.13 | 0.05±0.06 | 0.1±0.01 |
| ALNTLVKQLSSNFGAIS | 958–974 | 0.81 | DRB1*04:01 | 8.0±7.17 | 0.08±0.07 | 0.11±0.01 |
| DLFLPFFSNVTWFHAI | 53–68 | 0.91 | DRB1*04:01, DRB3*02:02 | 8.0±7.17 | 0.08±0.07 | 0.11±0.01 |
| RAAEIRASANLAATKM | 1014–1029 | 0.93 | DRB3*02:02 | NA | NA | NA |
| LTDEMIAQYTSALLAGT | 865–881 | 0.94 | DRB1*15:01 | 15.01±8.07 | 0.15±0.08 | 0.12±0.01 |
Abbreviation: APR,Average percentile rank.
a Average (±SD) projected population coverage. b Average number of epitope hits/HLA combinations recognized by the population. c Minimum number of epitope hits/HLA combinations recognized by 90% of the population. Four HLA-II alleles (DRB5*01:01, DRB3*01:01, DRB4*01:01, and DRB3*02:02) were not available in population coverage calculation. NA: not available. Final selected CD4+ T-cell epitopes are indicated by #.
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10