| Literature DB >> 33082015 |
Ashkan Safavi1, Amirhosein Kefayat2, Elham Mahdevar3, Ardavan Abiri4, Fatemeh Ghahremani5.
Abstract
SARS-CoV-2 causes a severe respiratory disease called COVID-19. Currently, global health is facing its devastating outbreak. However, there is no vaccine available against this virus up to now. In this study, a novel multi-epitope vaccine against SARS-CoV-2 was designed to provoke both innate and adaptive immune responses. The immunodominant regions of six non-structural proteins (nsp7, nsp8, nsp9, nsp10, nsp12 and nsp14) of SARS-CoV-2 were selected by multiple immunoinformatic tools to provoke T cell immune response. Also, immunodominant fragment of the functional region of SARS-CoV-2 spike (400-510 residues) protein was selected for inducing neutralizing antibodies production. The selected regions' sequences were connected to each other by furin-sensitive linker (RVRR). Moreover, the functional region of β-defensin as a well-known agonist for the TLR-4/MD complex was added at the N-terminus of the vaccine using (EAAAK)3 linker. Also, a CD4 + T-helper epitope, PADRE, was used at the C-terminal of the vaccine by GPGPG and A(EAAAK)2A linkers to form the final vaccine construct. The physicochemical properties, allergenicity, antigenicity, functionality and population coverage of the final vaccine construct were analyzed. The final vaccine construct was an immunogenic, non-allergen and unfunctional protein which contained multiple CD8 + and CD4 + overlapping epitopes, IFN-γ inducing epitopes, linear and conformational B cell epitopes. It could form stable and significant interactions with TLR-4/MD according to molecular docking and dynamics simulations. Global population coverage of the vaccine for HLA-I and II were estimated 96.2% and 97.1%, respectively. At last, the final vaccine construct was reverse translated to design the DNA vaccine. Although the designed vaccine exhibited high efficacy in silico, further experimental validation is necessary.Entities:
Keywords: COVID-19; Immunoinformatics; In silico; SARS-CoV-2; Vaccine
Mesh:
Substances:
Year: 2020 PMID: 33082015 PMCID: PMC7546226 DOI: 10.1016/j.vaccine.2020.10.016
Source DB: PubMed Journal: Vaccine ISSN: 0264-410X Impact factor: 3.641
Fig. 1The flow chart of this study. Designing this vaccine is consisted of six main steps including (1) Antigen screening; (2) Selection of the immunodominant regions of the selected antigens; (3) Designing the final vaccine construct; (4) Tertiary structure prediction and validation; (5) Molecular docking with TLR-4/MD; (6) Designing the DNA vaccine.
Comparison of the SARS-CoV-2 proteins according to number of HLA-I high-binding-affinity epitopes (across 9-mer peptides) with representative alleles from different HLA supertypes based on the TepiTool server predictions.
| SARS-CoV-2 proteins | Reference sequences | Number of 9-mer overlapping peptides | Number of HLA-I binding epitopes |
|---|---|---|---|
| Orf1ab polyprotein | YP_009724389.1 | 6280 | 1303 |
| Spike glycoprotein | YP_009724390.1 | 1221 | 208 |
| Membrane glycoprotein | YP_009724393.1 | 190 | 72 |
| Envelope protein | YP_009724392.1 | 62 | 23 |
| Nucleocapsid phosphoprotein | YP_009724397.2 | 371 | 35 |
The used HLA-I alleles: A*01:01, A*02:01, A*03:01, A*24:02, A*26:01, B*07:02, B*08:01, B*27:05, B*39:01, B*40:0, B*58:01, B*15:01.
The obtained overlapping epitopes from the TepiTool server results.
| # | Peptide | IC50 | Alleles |
|---|---|---|---|
| 1 | AMQTMLFTM | 44.80 | A0201, A0206, B1525, A0301, A1101, A3101, A6801, A0203, A3202 |
| 24.04 | DRB11101, DPA10201DPB10501, DPA10201DPB14001, DRB10701, DRB10801, DPA10301/DPB10402, | ||
| 2 | VLGSLAATV | 50.70 | A0201, A0202, A0206, A0203, A0301, A1101, A0204, A0207, A6802, B5701 |
| RGM | 14.88 | DQA10501/DQB10301, DRB10101, DRB10701, DQA10102DQB10602, DQA10103DQB10302, DQA10106DQB10301, DRB10901, DRB11302, DRB50101 | |
| 3 | MLSDTLKNL | 54.10 | A0201, A0202, A0203, A0204, A0205, A0206, A0207, A1101 |
| RIKIVQ | 16.38 | DRB10101, DRB10401, DRB10404, DRB40101, DRB40103, DRB40104, DRB40106, DRB40107 | |
| 4 | MLFTMLRKL | 69.50 | A0201, A0202, A0203, A0206, , A0301, A1101, A6801 |
| QT | 18.73 | DRB11101, DRB10801, DPA10201DPB10501 | |
| 5 | LMYKGLPWNV | 7.20 | A0201, A0203, A0204, A0206 |
| IP | 13.42 | DRB10101, DRB10901, DRB10103, DRB10109, DRB10110, DRB10114, DRB10115, DRB50101 | |
| 6 | MVLGSLAATV | 8.30 | A0201, A0206, A6802 |
| RG | 14.88 | DRB10101, DRB10701, DRB10901, DRB11302, DRB50101, DQA10102/DQB10602, DQA10103/DQB10302, DQA10106/DQB10301, DQA10501/DQB10301 | |
| 7 | AMQTMLFTML | 78.80 | A0201 |
| 24.04 | DRB11101, DRB10701, DRB10801, DRB50101, DPA10301/DPB10402, DPA10201/DPB10501, DPA10201/DPB14001 | ||
| 8 | HLIPLMYKGL | 85.20 | A0201 |
| 25.95 | DRB10101, DRB10901, DRB11501 | ||
| 9 | YLYFIKGLNNL | 63.50 | A0201, A0202, A0203 |
| 9.69 | DRB10101, DRB10401, DRB50101, DRB10104, DRB10105, DRB10107, DRB10108, DRB10109, DRB10110, DRB10111, DRB10112, DRB10113, DRB10114, DRB10115, DRB30201, DRB30204, DRB30202, DRB50101, DRB10405DRB11101, DRB11501, | ||
| 10 | LMYKGLPWNVV | 83.90 | A0201 |
| IP | 13.42 | DRB10101, DRB10901, DRB10103, DRB10110, DRB10114, DRB10115, DRB50101, DRB11501, DRB10901 | |
| 11 | HLIPLMYKGLPWNV | 59.20 | A0201 |
| 25.95 | DRB10101, DRB10901 | ||
| 12 | MQTMLFTML | 29.90 | A0206, B3901, A0203, A1101 |
| 21.82 | DRB11101, DPA10201DPB10501, DPA10201DPB14001, DRB10801, DRB50101, DPA10301/DPB10402 | ||
| 13 | MVLGSLAAT | 37.70 | A0206, A1101, A6802 |
| RG | 14.88 | DRB10101, DRB10901, DRB11302, DRB50101, DQA10501/DQB10301, DRB10701, DQA10102/DQB10602, DQA10103/DQB10302, DQA10106/DQB10301 | |
| 14 | TMLFTMLRK | 9.50 | A0203, A0301, A1101 |
| Q | 18.73 | DRB11101, DRB10801, DPA10201/DPB10501 | |
| 15 | RQFHQKLLK | 17.00 | A0301, A0302, A0304, A0305, A0306, A0307, A1101, A1102, A1103, A1104, A1105, A1106, A1107, A3008, A3102, A3103, A3104, A3105, A3106, , A3001, A3101, A6801, |
| 16.68 | DRB10101, DRB11101, DRB10102, DRB50104, DRB50101 | ||
| KVKYLYFIK | 7.30 | A0206, A0301, A1101, A1104, A3001, A3008, A3101 | |
| 16 | P | 16.00 | DRB10101, DRB11101, DRB11501, DRB40101, DPA10103/DPB10201, DPA10201/DPB10501, DPA10201/DPB14001 |
| 17 | MLFTMLRK | 64.00 | A0301 |
| QT | 18.73 | DRB11101, DRB10801, DPA10201/DPB10501 | |
| 18 | HLIPLMYK | 94.20 | A0301, A0302, A0304, A0305, A0306, A0307, A1103, A1104 |
| 25.95 | DRB10101 | ||
| 19 | QTMLFTMLR | 3.30 | A0302, A0307, A1101, A1102, A1103, A1104, A1105, A1106, A1107, A3101, , A3102, A3103, A3104, A3105, A3301, A3303, A3106, A6801 |
| 18.73 | DRB11101, DPA10201DPB10501 | ||
| 20 | ATVVIGTSK | 26.60 | A0301, A0302, A0304, A0305, A0306, A0307, A1101, A1102, A1103, A1104, A1105, A1106, A1107, A3001, A3008, |
| IAATRG | 38.37 | DQA10501/DQB10301, DQA10102/DQB10602, DQA10106DQB10301 | |
| 21 | MQTMLFTMLR | 3.80 | A0301, A1101, A3301, A3303, A6801, A3101 |
| 21.82 | DRB11101, DPA10201DPB10501, DPA10201DPB14001, DRB50101, DPA10301/DPB10402, DRB10801 | ||
| 22 | AMQTMLFTMLR | 39.60 | A1101, A3101, A6801 |
| 24.04 | DRB10701, DRB10801, DRB11101, DRB50101, DPA10201/DPB10501, DPA10201/DPB14001, DPA10301/DPB10402 | ||
| 23 | IVQMLSDTLK | 45.70 | A1101, A6801 |
| RIK | 16.38 | DRB10101, DRB40101, DRB40103, DRB40104, DRB40106, DRB40107 | |
| 24 | ATRGATVVI | 22.10 | A0203, A1101, A6802, A3001, A3008, A3201, A3202, A3206, B0704, B1504 |
| LLKSIA | 10.97 | DRB10101, DQA10501/DQB10301, DRB10701, DRB10901, DRB11302, DRB50101, DRB10103, DRB10104, DRB10105, DRB10107, DRB10108, DRB10110, DRB10111, DRB10112, DRB10113, DRB10115, DRB50104, DRB10901 | |
| 25 | RIKIVQMLS | 22.50 | A0206, A0301, A1101, A3001, A3008 |
| 16.38 | DRB10101, DRB40101, DRB40104, DRB40106, DRB40107, DRB40101 | ||
| 26 | KSIAATRGA | 24.60 | A3001, A1101, A3008 |
| QKLL | 9.09 | DRB10102, DRB10103, DRB10104, DRB10105, DRB10106, DRB10107, DRB10108, DRB10109, DRB10110, DRB10112, DRB10404, DRB10901, DRB11101, DRB11302, DRB30201, DRB30204, DRB50101, DRB50102, DRB50103, DRB50104, DRB50105, DRB11501, DRB50101, DQA10501/DQB10301 | |
| 27 | MYKGLPWNVVR | 37.50 | A3301, A3101 |
| IPL | 13.42 | DRB10101, DRB10103, DRB10109, DRB10110, DRB10114, DRB10115, DRB10901, DRB11501, DRB50101 | |
| 28 | LLKSIAATR | 46.90 | A3101, A6801, A0203, A0206, A3102, A3103, A3104, A3105, A3106 |
| QK | 9.09 | DRB10101, DRB10102, DRB10103, DRB10104, DRB10105, DRB10106, DRB10107, DRB10108, DRB10109, DRB10110, DRB10111, DRB10112, DRB10113, DRB11101, DRB11302, DRB11501, DRB30201, DRB30204DRB50101, DRB50102, DRB50103, DRB50104, DRB50105, DQA10501/DQB10301 | |
| 29 | EAFEKMVSL | 33.50 | A0202, A0203, A1101, A2501, A2502, A2503, A2504, A2505, A2506, A2507, A2601, A2602, A2603, A2604, A2605, A2606, A2608, A2609, A6801, A6802, B0710, B0801, B0802, B0803, B0804, B0805, B0807, B1401, B1402, B1403, B1404, B1405, B1406, B3501, B3502, B3503, B3504, B3505, B3506, B3507, B3509, B3903, B3904, B3909, B3910, B4007, B4008, B4201, B4202, B4204, B4205, C0102, C0103, C0303, C0304, C1203 |
| DTT | 19.68 | DRB10101, DRB10401, DRB10405, DRB11101 | |
| 30 | FEKMVSLLSV | 36.00 | B4002 |
| DTTEA | 19.68 | DRB10101, DRB10401, DRB10405, DRB11101 | |
| 31 | TEAFEKMVSL | 57.60 | B0804, B1802, B1803, B3701, B3704, B3705, B3706, B3803, B3902, B3908, B4001, B4002, B4003, B4004, B4005, B4006, B4007, B4008, B4009, B4010, B4101, B4102, B4103, B4104 |
| DT | 19.68 | DRB10101, DRB10401, DRB10405, DRB11101 | |
| 32 | IPLMYKGLPW | 35.00 | B0702, B5301 |
| 13.42 | DRB10101, DRB10103, DRB10109, DRB10110, DRB10114, DRB10115, DRB11501, DRB50101 | ||
| 33 | IAATRGATV | 18.80 | A0203, A0301, A6801, B0801, B0802, B0803, B0805, B0807, B4202, C0102, C0103, C0303, C0304, C1601, C1203 |
| QKLLKS | 9.09 | DRB10102, DRB10103, DRB10104, DRB10105, DRB10106, DRB10107, DRB10108, DRB10109, DRB10110, DRB10111, DRB10112, DRB10113, DRB11101, DRB11302, DRB11501, DRB30201, DRB30204, DRB50101, DRB50102, DRB50103, DRB50104, DRB50105, DQA10501/DQB10301, DRB10101, DRB10701, DRB10901 | |
Percentage of population coverage.
| Area | HLA-I coverage (%) | HLA-II coverage (%) |
|---|---|---|
| East Asia | 96.5 | 96.4 |
| Europe | 97.1 | 97.9 |
| North Africa | 91.8 | 91.8 |
| North America | 97.4 | 98.1 |
| World | 96.2 | 97.1 |
| Average | 95.8 | 96.26 |
| Standard deviation | 2.28582589 | 2.583215051 |
Fig. 2(A) Schematic diagram of the final vaccine construct. (B) 3D model of the final vaccine construct and its main consisting domains.
Comparative investigation of the physicochemical and antigenicity properties of the positive vaccine controls (C1, C2, C3) and SARS-CoV-2 candidate vaccine.
| Properties | Parameters/Tools | Value/Score/Probability | |||
|---|---|---|---|---|---|
| C1 | C2 | C3 | Candidate vaccine | ||
| Physicochemical | Molecular weight | 2.44 kDa | 5.05 kDa | 6.38 kDa | 51.64 kDa |
| Isoelectric point (pI) | 6.24 | 8.67 | 9.61 | 10 | |
| Instability index (II) | 28 | 22.78 | 33.38 | 27.09 | |
| GRAVY | −0.88 | −0.32 | −1.127 | −0.354 | |
| Aliphatic index | 65.94 | – | 53.53 | 79 | |
| Antigenicity | ANTIGENpro | 0.67 | 0.94 | 0.62 | 0.74 |
| Secret-AAR | 42.6 | 27.59 | 35.4 | 39.8 | |
| VaxiJen | 0.65 | 0.67 | 0.61 | 0.59 | |
Fig. 3(A) The root-mean-square deviation (RMSD) and (B) root-mean-square fluctuation (RMSF) of atomic positions in the “last” molecular dynamics simulation of the vaccine construct, depicting stable conformations during the simulation.
Fig. 4(A) Ramachandran plot of the refined 3D structure. The Ramachandran plot revealed that 94.4%, 5.6%, and 0% and 0% of the refined model residues are located in the most-favored (red), favored (yellow), generously allowed (pale yellow), and disallowed regions (white), respectively. The vaccine residues are shown as black squares. (B) ProSA-web server analysis of the refined model. The Z-score of the refined models was − 6.57. The plot contains Z-scores of all protein chains in the protein data bank (PDB) determined by X-ray crystallography (light blue) and NMR spectroscopy (dark blue). The Z-score of the refined model is denoted by a large black dot. (C) The ERRAT plot of the refined model. The regions of the refined model that could be rejected at the levels of 95% and 99% are indicated in gray and black lines, respectively. In general, an appropriate high-resolution structure produces a value of ≥ 95%. (D) Verify 3D analyses plot. 84.20% of the residues of the refined model have average 3D-1D score ≥ 0.2 which is pass. At least 80% of the amino acids should score ≥ 0.2 in the 3D-1D profile to get passed. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 5The predicted linear and conformational B cell epitopes of the final vaccine construct and functional region of S protein by the ElliPro server in the ball and stick model. Yellow balls represent the predicted epitopes’ residues. Non-epitope and core residues are shown in white sticks. Also, ribbon representation of the epitopes’ residues was separately displayed. (A-E) The conformational B cell epitope of the vaccine construct. (F) The linear B cell epitope of the vaccine construct. (G-H) The linear epitopes of the functional region of S protein. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
The common linear B cell epitopes between the final vaccine construct and the S protein of SARS-CoV-2.
| Peptide | Start position | Score |
|---|---|---|
| YQAGSTPCNGVE | 473 | 0.72 |
| PFERDISTEIYQ | 463 | 0.61 |
| GKIADYNYKLPDDF | 416 | 0.90 |
| QAGSTPCNGVEGFN | 474 | 0.87 |
| YRLFRKSNLKPFER | 453 | 0.77 |
| GSTPCNGVEGFNCYFP | 476 | 0.91 |
| TEIYQAGSTPCNGVEG | 470 | 0.89 |
| FERDISTEIYQAGSTP | 464 | 0.86 |
| TGKIADYNYKLPDDFT | 415 | 0.84 |
| EGFNCYFPLQSYGFQP | 484 | 0.73 |
| TGCVIAWNSNNLDSKV | 430 | 0.71 |
| YKLPDDFTGCVIAWNS | 423 | 0.71 |
| GCVIAWNSNNLDSKVGGNYN | 431 | 0.81 |
Fig. 6Molecular docking of the subunit vaccine with TLR-4. (A) Interaction of the subunit vaccine (ribbon representation) with TLR-4/MD monomeric complex (molecular surface representation). (B) Ribbon representation of the interaction pattern of the subunit vaccine (black) with heterodimer TLR-4/MD complex (colored). (C) Illustration of the interacting residues between docked vaccine (black) and TLR-4 (colored) complex. (D) Interaction pattern of the C4 with heterodimer TLR-4/MD (ribbon representation).
Fig. 7The docked structures of one of the most immunodominant overlapping epitopes of the vaccine in the binding cleft of (A) MHC class I and (B) MHC class II.
Comparative evaluation of the candidate vaccine’s sequence with the experimentally validated epitopes of SARS-CoV-2 (nsp7, nsp8, nsp9, nsp10, nsp 12, and nsp14 are subunits of Orf1ab polyprotein).
| # | SARS-CoV-2 proteins | Experimentally validated epitopes | Location in Orf1ab (residue) | Location in the vaccine (residue) | IEDB epitope ID |
|---|---|---|---|---|---|
| 1 | S glycoprotein | RQIAPGQTGKIADYNYKL | 408–425 | 74–91 | 1,075,039 |
| 2 | YNYKLPDDFTGCVIA | 421–435 | 87–101 | 1,074,214 | |
| 3 | KLPDDFTGCV | 424–435 | 90–100 | 1,074,952 | |
| 4 | LPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERD | 425–467 | 91–133 | 1,074,979 | |
| 5 | KLPDDFTGCV | 424–433 | 90–99 | 1,074,952 | |
| 6 | DDFTGCVIAWNSNNL | 427–441 | 94–107 | 1,069,293 | |
| 7 | NLDSKVGGNY | 440–449 | 106–115 | 1,075,002 | |
| 8 | VGGNYNYLYRLFRKS | 445–459 | 111–125 | 1,073,698 | |
| 9 | NYNYLYRLFRK | 448–458 | 114–124 | 1,075,012 | |
| 10 | RLFRKSNLK | 454–462 | 120–128 | 1,075,031 | |
| 11 | YLYRLFRKSNLKPFE | 451–465 | 117–131 | 1,074,201 | |
| 12 | RKSNLKPFERDISTE | 457–471 | 123–137 | 1,072,366 | |
| 13 | KPFERDISTEIY | 462–473 | 128–139 | 1,074,954 | |
| 14 | AGSTPCNGVEGFNCY | 475–486 | 141–155 | 1,069,064 | |
| 15 | NGVEGFNCY | 481–489 | 147–155 | 1,075,001 | |
| 16 | NGVEGFNCYFPLQSY | 481–498 | 147–161 | 1,071,575 | |
| 17 | YFPLQSYGF | 489–497 | 155–163 | 1,075,121 | |
| 18 | NCYFPLQSYGFQPTN | 487–501 | 153–167 | 1,071,518 | |
| 19 | Non-structural protein 7 | FEKMVSLLSV | 3908–3918 | 201–210 | 15,578 |
| 20 | Non-structural protein 8 | AMQTMLFTM | 4028–4036 | 225–233 | 3179 |
| 21 | MQTMLFTMLR | 4029–4038 | 226–234 | 42,417 | |
| 22 | QTMLFTMLR | 4030–4038 | 227–234 | 52,573 | |
| 23 | TMLFTMLRK | 4031–4039 | 228–236 | 65,222 | |
| 24 | Non-structural protein 9 | KVKYLYFIK | 4224–4232 | 250–258 | 34,083 |
| 25 | LYFIKGLNNL | 4228–4237 | 254–263 | 40,753 | |
| 26 | LNNLNRGMVLGSLAA | 4234–4248 | 260–274 | 38,283 | |
| 27 | NRGMVLGSLAATVRL | 4238–4252 | 264–278 | 45,682 | |
| 28 | RGMVLGSLAATVRLQ | 4239–4253 | 265–279 | 53,926 | |
| 29 | Non-structural protein 10 | SFGGASCCLY | 4320–4327 | 282–291 | 57,776 |
| 30 | TVCTVCGMWK | 4368–4377 | 330–339 | 66,958 | |
| 31 | RNA-dependent RNA polymerase (Non-structural protein 12) | SICSTMTNR | 4953–4961 | 254–362 | 58,467 |
| 32 | MTNRQFHQK | 4958–4966 | 359–367 | 42,843 | |
| 33 | RQFHQKLLK | 4961–4969 | 362–370 | 55,413 | |
| 34 | NRQFHQKLLKSIAAT | 4960–4973 | 361–375 | 45,723 | |
| 35 | QKLLKSIAATRGATV | 4965–4979 | 366–380 | 51,217 | |
| 36 | ATVVIGTSK | 4973–4981 | 378–386 | 5209 | |
| 37 | 3′-to-5′ exonuclease | IPLMYKGLPWNVVRI | 6075–6089 | 396–400 | 27,933 |
| 38 | LMYKGLPWNV | 6077–6086 | 398–407 | 38,165 | |
| 39 | IVQMLSDTLK | 6091–6100 | 412–421 | 70,610 | |
| 40 | VQMLSDTLK | 6092–6100 | 413–421 | 29,419 |
Fig. 8(A) The codon adaptation index (CAI) and (B) codon frequency distribution of the DNA vaccine sequence. The CAI value is 0.99 for the mammalian host. Also, percentage of codon having a frequency distribution of 91–100 in mammalian host gene is 100%. (C) The schematic diagram of the DNA vaccine sequence. It contains NheI and NcoI restriction enzymes sequences at 5́ end and an XhoI sequence at the 3́ end. Also, it contains Kozak sequence at its 5́ end.
Fig. 9Simulated immune responses against (A) the final vaccine construct and (B) the C5 positive control (SAPN-MPER of HIV-1 gp41) by the C-ImmSim server. (i) Antigen (Ag) count along with antibody titers with specific subclasses, (ii) B cells population, (iii) Cytokines responses, (iv) Th (helper) cells population, (v) Tc (cytotoxic) cells population, (vi) Tc cells population per state.