INTRODUCTION: The ongoing life-threatening pandemic of coronavirus disease 2019 (COVID-19) has extensively affected the world. During this global health crisis, it is fundamentally crucial to find strategies to combat SARS-CoV-2. Despite several efforts in this direction and continuing clinical trials, no vaccine has been approved for it yet. METHODS: To find a preventive measure, we have computationally designed a multi-epitopic subunit vaccine using immuno-informatic approaches. RESULTS: The structural proteins of SARS-CoV-2 involved in its survival and pathogenicity were used to predict antigenic epitopes. The antigenic epitopes were capable of eliciting a strong humoral as well as cell-mediated immune response, our predictions suggest. The final vaccine was constructed by joining the all epitopes with specific linkers and to enhance their stability and immunogenicity. The physicochemical property of the vaccine was assessed. The vaccine 3D structure prediction and validation were done and docked with the human TLR-3 receptor. Furthermore, molecular dynamics simulations of the vaccine-TLR-3 receptor complex are employed to assess its dynamic motions and binding stability in-silico. CONCLUSION: Based on this study, we strongly suggest synthesizing this vaccine, which further can be tested in-vitro and in-vivo to check its potency in a cure for COVID-19.
INTRODUCTION: The ongoing life-threatening pandemic of coronavirus disease 2019 (COVID-19) has extensively affected the world. During this global health crisis, it is fundamentally crucial to find strategies to combat SARS-CoV-2. Despite several efforts in this direction and continuing clinical trials, no vaccine has been approved for it yet. METHODS: To find a preventivemeasure, we have computationally designed a multi-epitopic subunit vaccine using immuno-informatic approaches. RESULTS: The structural proteins of SARS-CoV-2 involved in its survival and pathogenicity were used to predict antigenic epitopes. The antigenic epitopes were capable of eliciting a strong humoral as well as cell-mediated immune response, our predictions suggest. The final vaccine was constructed by joining the all epitopes with specific linkers and to enhance their stability and immunogenicity. The physicochemical property of the vaccine was assessed. The vaccine 3D structure prediction and validation were done and docked with thehumanTLR-3 receptor. Furthermore, molecular dynamics simulations of the vaccine-TLR-3 receptor complex areemployed to assess its dynamic motions and binding stability in-silico. CONCLUSION: Based on this study, we strongly suggest synthesizing this vaccine, which further can be tested in-vitro and in-vivo to check its potency in a cure for COVID-19.
Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2) is the causative agent for the associated disease named as COVID-19, which had spread throughout the world and declared as a pandemic [1,2]. SARS-CoV-2 has been demonstrated to have a strong affinity for human respiratory receptors (hACE2), which makes it more vulnerable to humans globally [3]. SARS-coronavirus infections are commonly found to be associated with multiple disorders, such as respiratory, enteric, hepatic, and neurological complications [4]. In general, coronaviruses (CoV) belong to a family Coronaviridae, which consists of enveloped positive-sense, single-stranded RNA viruses [5,6]. Among the four genera of CoVs, SARS-CoV, MERS-CoV, and SARS-CoV-2 belong to Betacoronaviruses [7]. Inside host cells, SARS-CoV-2 genome is translated into two large polyproteins: replicase polyprotein 1a (pp1a) and 1ab (pp1ab) [8]. The larger pp1ab comprises 16 non-structural proteins (Nsp1–Nsp16), which are responsible for constructing a viral replication-transcription complex [9]. Alongside the viral RNA transcribes subgenomic RNAs, which encode four structural proteins; spike (S), envelope (E), membrane (M), and nucleocapsid (N) and several accessory proteins [10]. While accessory proteins areengaged in host immune suppression, including downregulation of the interferon pathway, structural proteins shape the virion, facilitating genomeencapsulation, viral particle assembly, and release [11,12]. Recently, one health concept is used and hypothesized that the prior exposure of humans with pet animals lowers the adverseeffect of SARS-CoV-2. TheN and S protein epitopes in comparison with taxonomically related coronaviruses’ epitopes share high sequence homology, which has high implications for vaccine designing against SARS-CoV-2 [13,14]. Additionally, there are repurposing of drugs are also important to cobat this disease [15,16].Since the beginning of the 21st century, two outbreaks of SARS-CoV and MERS-CoV have led the researchers into designing vaccines against SARS coronaviruses. The third outbreak, wherein SARS-CoV-2 infects promiscuously to humans, has given theneed for speedy research into developing a broad-range vaccine against thesecoronaviruses. Evidence shows that humoral and cell-mediated immune response plays a protective role against coronavirus infections, specifically against S and N proteins [17-19]. Although theeffective antibody response is short-lived, the T cell responses are found to provide long-term protection [20,21]. Similarly, TC cell response against the structural proteins S and N are long-lasting [22]. Previously, immunoinformatics approaches have been used to construct the vaccines against theZika virusNipah virus, and Human papillomavirus, where in-vivo studies of theHuman papillomavirus showed promising results [23-25]. Hence, in this study, we have used immunoinformatic approaches to predict highly antigenic epitopes fromSARS-CoV-2 structural proteins that would evoke a strong immune response in humans. For this purpose, we have used the structural proteins: Spike, Envelope, and nucleocapsid to predict B-cell, cytotoxic T lymphocyte (CTL) and helper T lymphocyte (HTL) epitopes for construction of vaccine. Thus, themulti-epitopic subunit vaccine would be capable of eliciting humoral as well as cell-mediated immune responses. We have also performed the docking and molecular dynamic simulations (MDS) between the vaccine and humanToll-like Receptor-3 (TLR-3) to study their binding stability. The outcome of the present study will result in a novel and immunogenic vaccine, which may be further accessed against SARS-CoV-2.
Material and methods
Protein sequence retrieval
Translated sequences of three structural proteins of SARS-CoV-2: Spike glycoprotein [Protein ID: QHD43416.1], Envelope [Protein ID: QHD43418.1], and Nucleocapsid protein [Protein ID: QHD43423.2] were obtained from the GenBank database.
B lymphocytes epitopes prediction
The IEDB tools (http://tools.iedb.org/bcell/) were used to identify the B cell epitope and for verifying antigenicity, BepiPred linear epitope analysis was performed. According to the BepiPred linear epitope prediction method, residues with scores above the threshold (default value 0.35) are predicted to be part of an epitope having 49% sensitivity and 75% specificity [26].
Helper T – cells epitopes prediction
The IEDB server (http://tools.iedb.org/mhcii/) was used for the prediction of MHC II antigenic binders in translated sequences. Top 10 HTL epitopes were sorted based on their percentile rank and IC50 value. Epitopes with the lowest percentile rank were considered an excellent binding affinity, whereas, peptides with IC50 values in lower (<50 nM), middle (<500 nM), and higher (<5000 nM) range corresponds to the highest, intermediate, and lowest binding affinity, for the T-cell, respectively [27].
Cytotoxic T- cells epitopes prediction
MHC class I binding/CTLepitopes for supertype A3 were predicted for all three proteins by using an online server NetCTL 1.2 (http://www.cbs.dtu.dk/services/NetCTL/) [28]. Default threshold of 0.75 was chosen for vaccine construction in this study (>1.25 corresponds to 54% sensitivity and 99.30% specificity; >1.00 shows 70% sensitivity and 98.5% specificity; >0.90 shows 74% sensitivity and 98% specificity). MHC class I binding and proteasomal cleavage prediction was performed using artificial neural networks. Accordingly, TAP transport efficiency was predicted using a weight matrix.
IFN‐γ inducing epitopes prediction
MHC class II binder epitopes/HTL, with the ability to induce cell‐mediated immunity, were identified using the IFNepitope server (http://crdd.osdd.net/raghava/ifnepitope/). The prediction was performed by a motif and support vector machine (SVM) hybrid approach [29]. All the positiveepitopes able to induce interferon‐γ (IFN‐γ) were selected for vaccine construct.
Multi-epitopic vaccine construction
Selected high-scoring CTLs, high-affinity HTLs, and B-cell epitopes were used to generate the vaccine sequence. The different epitopes were linked together using linkers: CTL linker (AAY), HTL linker (GPGPG), and B epitope linker (KK). To improve the immunogenicity of the vaccine, β-defensin 1 (Uniprot Id: P60022) as an adjuvant was added through an EAAAK linker at theN-terminal of the construct.
Prediction of the antigenicity and allergenicity of the vaccine candidate
Antigenicity prediction
The servers ANTIGENpro (http://scratch.proteomics.ics.uci.edu/) and VaxiJen v2.0 (http://www.ddg‐pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) were used for the antigenicity evaluation of the final vaccine construct [30,31]. The VaxiJen 2.0 server predicts the antigenicity of themulti-epitope vaccine peptide based on the physicochemical properties of the input protein. Whereas, ANTIGENpro server predicts the antigenicity of themulti-epitopic vaccine based on the protein microarray data analysis of the target organism.
Allergen prediction
The server AllerTOP v2.0 (http://www.ddg-pharmfac.net/AllerTOP) and AllergenFP were used to predict the allergenicity of multi-epitopic vaccine [32,33]. AllerTOP v2.0 classifies the vaccine protein sequence by machine learning methods for the classification of allergens. AllergenFP is an alignment-free online server that uses SVMmodule to predict the allergenic and non-allergenic nature of proteins.
Vaccine features
Physiochemical properties of the constructed vaccine
Vaccine construct was accessed for its physicochemical properties using the ProtParam server (http://web.expasy.org/protparam/) that predicts the theoretical pI, instability index, half‐life, stability profiling, aliphatic index, and Grand Average of Hydropathy of the sequence [34].
Secondary structure prediction
Protein secondary structure was determined using PSIPRED, a web-based freely accessible online server (http://bioinf.cs.ucl.ac.uk/psipred/) [35] which also gives information of contact analysis, fold recognition, structuremodeling, function prediction, protein intrinsic disorder prediction and domain prediction of a query sequence [36].
Disorder profile generation
The disorder profile of the vaccine construct was generated using PONDR pool of four predictors (http://original.disprot.org/metapredictor.php) [37-39] and IUPRED 2A predictor [40] as described in our previous reports [41-43].
Tertiary structure prediction and validation
The tertiary structure of the vaccine construct was generated using the I-TASSER web server [44]. It provides an energy minimized model through iterative template-based fragment assembly simulations. The structure was further optimized upon adding hydrogen and missing side-chain atoms followed by minimization in Schrodinger’s protein preparation wizard using OPLS 2005 forcefield [45]. This produced a high-quality model for protein-protein docking and molecular simulations. RAMPAGE [Mordred.bioc.cam.ac.uk/~rapper/rampage.php] and PROCHECK [46] web servers were used for evaluation of the vaccinemodel based on the distribution of amino acids in the Ramachandran plot.
Molecular docking of the final vaccine with immune receptors
PIPER programembedded in the BioLuminatemodule of Schrodinger for protein-protein docking was implemented for docking of vaccinemodel and TLR-3 (PDB ID: 20AZ) and TLR-5 (PDB ID: 3J0A) receptors [47,48]. It is an efficient tool that reduces false positive poses and performs a global search with Fast-Fourier Transform (FFT) approach. Using 1000 conformations of input structures, the top 50 clusters were selected with cluster radius 9 Å, which then centralized, and the outcomes of the docking based on cluster size wereevaluated. With the largest cluster size, the docked complex out of 10 complexes was selected for molecular dynamics simulation.
Molecular dynamics simulation of vaccine-immune receptors
The ~86kDa vaccinemodel in complex with TLR-3 receptor was evaluated for its binding stability in an aqueous environment for 20ns. With a total of 6,66,607 molecules of TIP3P watermodel, 60 neutralizing Cl− ions, and 0.15 M concentration of salt, the complex was incorporated in a cubic box. Recently updated forcefield CHARMM36 was implemented to generate the topologies of the protein-protein complex. The generated system was minimized for 50,000 steps of the steepest descent algorithm. Further, theequilibration process under NPT and NVT conditions for 1ns was done with Parrinello-Rahman and V-rescalemethods for coupling of pressure and temperature, respectively. Lastly, production MD run for all three systems was performed in periodic boundary conditions for 20ns. LINCS algorithm was used for calculating bond parameters. ParticleMesh Ewald (PME) for long-rangeelectrostatics with fourier spacing of 0.16 was used for production MD. An analysis of MD trajectory, Root Mean Square Deviation (RMSD), and Root Mean Square Fluctuation (RMSF) was calculated with gmx rms, and rmsf commands.
In-silico cloning of final vaccine
For the production of final vaccine in large quantity, Java Codon Adaptation Tool (http://www.jcat.de/) was used for codon optimization and attached with expression vector. The optimized codon sequence includes Codon Adaption Index (value 1.0) and %GC content (30–70%), which denotes the high level of protein expression and DNA stability. Finally, restriction cloning was performed using tool SnapGene v3.3.4. The restriction sites NdeI and XhoI were added to theN and C terminals of the optimized complementary DNA sequences followed by their insertion within the pET‐21a(+) vector to ensure the polyprotein synthesis within theE coli expression system.
Results
Prediction of B cell and T cell epitope
HTL epitopes for all structural proteins were predicted using IEDB server for humanMHC-II alleles. Among these, top 10 epitopes based on their least percentile rank depicting their high affinity were selected. The final selection of epitopes was made on the basis of overlapping sequences, which were further assessed for their IFN inducing potential. Epitopes with positive IFN inducing potential were finally selected for vaccine construction (Table 1). For each structural protein, CTL epitopes for supertype A3 were predicted using an online server NetCTL 1.2. A total of 40, 5, and 4 epitopes were predicted for S, N, and E proteins, respectively (Table 2). Linear B-cell epitopes in all three structural proteins were predicted using another online server BCPREDS. Epitopes selected on the basis of prediction scores were further screened for their non-allergic and antigenicity potential. Two epitopes each for S and N with high antigenicity scores were selected for final vaccine construction are tabulated in Table 3. However, no B cell epitopes were found for theE protein.
Table 1.
Predicted HTL specific epitopes obtained from IEDB.
S. No.
SARS-CoV-2 protein
Position
Allele
HTL epitopes
Prediction result for the IFN
1.
Spike Glycoprotein
113–131
HLA-DRB1
KTQSLLIVNNATNVVIKVC
Negative
691–707
HLA-DRB1
SIIAYTMSLGAENSVAY
Positive
115–130
HLA-DRB3
QSLLIVNNATNVVIKV
Negative
2.
Nucleocapsid
301–320
HLA-DRB1
WPQIAQFAPSASAFFGMSRI
Positive
310–326
HLA-DRB1
SASAFFGMSRIGMEVTP
Positive
3.
Envelope
34–50
HLA-DRB1
LTALRLCAYCCNIVNVS
Positive
45–60
HLA-DRB1
NIVNVSLVKPSFYVYS
Negative
55–71
HLA-DRB1
SFYVYSRVKNLNSSRVP
Negative
Table 2.
Predicted CTL epitopes for A3 super type obtained from NetCTL 1.2 server.
S. No.
SARS-CoV-2 protein
Position
Allele supertype
CTL epitopes
Score
1.
Spike Glycoprotein
30
A3
NSFTRGVYY
0.9914
41
A3
KVFRSSVLH
1.3419
69
A3
HVSGTNGTK
1.1575
89
A3
GVYFASTEK
1.4615
142
A3
GVYYHKNNK
1.3335
162
A3
SANNCTFEY
0.8377
187
A3
KNLREFVFK
0.7504
270
A3
LQPRTFLLK
1.0511
296
A3
LSETKCTLK
0.8213
302
A3
TLKSFTVEK
1.3483
311
A3
GIYQTSNFR
1.1119
349
A3
SVYAWNRKR
1.1423
357
A3
RISNCVADY
1.1734
361
A3
CVADYSVLY
1.1344
370
A3
NSASFSTFK
1.3454
372
A3
ASFSTFKCY
0.8899
378
A3
KCYGVSPTK
1.2722
409
A3
QIAPGQTGK
1.0483
454
A3
RLFRKSNLK
1.7563
458
A3
KSNLKPFER
0.9792
529
A3
KSTNLVKNK
0.9809
550
A3
GVLTESNKK
0.9827
559
A3
FLPFQQFGR
0.8186
604
A3
TSNQVAVLY
1.1959
725
A3
EILPVSMTK
1.2271
733
A3
KTSVDCTMY
0.8544
757
A3
GSFCTQLNR
1.0783
787
A3
QIYKTPPIK
1.4526
817
A3
FIEDLLFNK
0.8027
827
A3
TLADAGFIK
1.2451
925
A3
NQFNSAIGK
1.0796
939
A3
SSTASALGK
1.0678
956
A3
AQALNTLVK
1.0252
975
A3
SVLNDILSR
0.9214
1020
A3
ASANLAATK
1.2193
1039
A3
RVDFCGKGY
0.7565
1065
A3
VTYVPAQEK
1.3960
1099
A3
GTHWFVTQR
1.0342
1237
A3
MTSCCSCLK
1.3360
1264
A3
VLKGVKLHY
0.9774
2.
Nucleocapsid
229
A3
QLESKMSGK
1.1019
249
A3
KSAAEASKK
1.4421
311
A3
ASAFFGMSR
1.0610
361
A3
KTFPPTEPK
1.4314
379
A3
TQALPQRQK
0.9357
3.
Envelope
30
A3
TLAILTALR
1.1790
45
A3
NIVNVSLVK
0.9921
55
A3
SFYVYSRVK
1.0275
61
A3
RVKNLNSSR
1.0581
Table 3.
Predicted B-cell specific epitopes obtained from server BCPREDS.
S. No.
SARS-CoV-2 protein
B-cell epitopes
Antigenic score
1.
Spike Glycoprotein
VRQIAPGQTGKIAD
0.68
LTPGDSSSGWTAG
0.90
2.
Nucleocapsid
AFGRRGPEQTQGNFG
0.56
TGPEAGLPYGANK
0.81
Predicted HTL specific epitopes obtained from IEDB.Predicted CTL epitopes for A3 super type obtained fromNetCTL 1.2 server.Predicted B-cell specific epitopes obtained from server BCPREDS.
Multi-epitope vaccine candidate
The designed construct comprising of 48 CTL epitopes, 4 HTL epitopes, and 4 B-cell epitopes were then joined using linkers. Additionally, in order to improve the immunogenicity of multi-epitope vaccine, the construct was tagged with an adjuvant (β-defensin-TLR-3 agonist) at N terminal region, which enhances the natural immune response.
Prediction of antigenicity and allergenicity of constructed vaccine
The vaccinemust be antigenic and non-allergic in nature and also induce humoral as well as cell-mediated immune responses against the targeted virus. Our vaccine was observed to be antigenic with a respective probability score of 0.566 and 0.845 predicted by VaxiJen v2.0 and ANTIGENPro servers. The construct was also detected as a non-allergen, checked using the AllerTOP v2.0 server.
Physiochemical characterization of designed vaccine
The vaccine construct is composed of 799 amino acids and has a molecular weight of 86.35 kDa. It is basic in nature (theoretical pI 9.70) and has a half-life of 30 hrs in mammalian reticulocytes (in vitro). The instability index is estimated to be 32.21, which denotes the stability of the protein. The aliphatic index (68.42) showed that the protein is thermostable, whereas grand average of hydropathicity is observed to benegative (−0.084), indicating the hydrophilic nature of vaccine.
Prediction of secondary structure and disordered residues
The secondary structure was predicted using PESIPRED 4.0 server. As shown in Figure 1, the vaccine contains several α-helices together with many the β-strands. Six long helices from residues 3–19, 65–102, 119–142, 433–448, 475–541, and 597–614 with a long β-strands of 14 amino acids from residues 401–414 can be observed.
Figure 1.
PSIPRED graphical result of secondary structure prediction of vaccine is represented by different colors: α-helix (pink), β-sheet (yellow), and coil (gray).
PSIPRED graphical result of secondary structure prediction of vaccine is represented by different colors: α-helix (pink), β-sheet (yellow), and coil (gray).
Evaluation of tertiary structure of vaccine construct and docking with TLR-3 and TLR-5 receptors
The 799-residue long vaccine construct was modeled using I-TASSER. The lowest energy predicted model was selected for further refinement and docking studies. However, the validation on Ramachandran plot by PROCHECK server suggested that 88% residues of the constructed vaccinemodel lies in the favored and allowed regions while the rest 12% are a part of generously allowed and disallowed regions (Figure 2(b)). Similarly, RAMPAGE server also mapped nearly 90.7% residues in favored and allowed regions, while 9.3% residues were present in coordinates portraying the outlier region (Figure 2(a)). As predicted by secondary structure and disorder-based structure analysis, the tertiary structure of the construct has a high number of unstructured regions. Based on disorder prediction data, it is largely disordered at its terminals.
Figure 2.
Ramachandran plot evaluation by RAMPAGE (a) and PROCHECK (b) servers.
Ramachandran plot evaluation by RAMPAGE (a) and PROCHECK (b) servers.From the PIPER program, with highest cluster size 28, out of 10 complexes ranging till 15, the complex 1, also with greatest minimal local energy, was chosen for further analysis. The vaccinemodel interacts with theTLR-3 receptor (PDB: 20AZ) through its N-terminal and themiddle regions (Figure 3). The resultant interacting residues of vaccine-TLR-3 complex are shown in Table 4. At the interface of docked complex, Asn617 of vaccine is observed to form a H-bond with Asp153 residue of the receptor. Additionally, a few more aromatic H-bonds betweenVal621, Leu620 residues of vaccine, and His60 of TLR-3 are identified. The complex if further stabilized by a pi-cation interaction betweenLys595 of vaccine and Tyr283 of TLR-3 (Table 4). Figure 4 depicts a close view of vaccine-TLR-3 interface. Several amino acids (red) of the receptor (green) are found engaged with vaccine residues (blue).
Figure 3.
Molecular docking of vaccine construct with human TLR-3 receptor: The 799 residues long vaccine construct (blue) is found interacting with TLR-3 receptor (PDB: 20AZ) (green) through its residues at N-terminal and middle regions.
Table 4.
A list of Interacting residues of docked vaccine-TLR-3 complex.
Vaccine model
TLR-3 receptor
Distance
Buried SASA
Vaccine model
TLR-3 receptor
Distance
Buried SASA
B:13:Leu
A:287:AsnA:288:Val
2.1 A3.5 A
60.70%
B:572:Thr
A:278:Met
3.9 A
76.10%
B:38:Val
A:247:Asn
3.0 A
8.20%
B:574:Arg
A:227:PheA:229:AsnA:254:SerA:203:Glu
1.5 A1.8 A2.7 A3.9 A
89.30%
B:39:Ser
A:247:Asn
3.2 A
24.80%
B:582:Cys
A:39:His
3.9 A
28.10%
B:40:Ser
A:273:TrpA:247:AsnA:272:Lys
2.7 A3.0 A3.7 A
73.80%
B:594:Val
A:283:Tyr
3.4 A
59.30%
B:41:Gly
A:272:Lys
3.5 A
0.10%
B:595:Lys
A:230:AsnA:229:AsnA:256:SerA:257:Asn
2.0 A3.3 A3.5 A3.9 A
82.70%
B:42:Gly
A:272:Lys
1.5 A
56.70%
B:596:Leu
A:206:SerA:230:AsnA:180:Asn
2.5 A3.2 A3.4 A
59.50%
B:43:Gln
A:272:Lys
2.9 A
15.90%
B:606:Leu
A:60:His
2.9 A
71.30%
B:44:Cys
A:273:Trp
3.5 A
57.60%
B:608:Ala
A:108:His
3.0 A
21.90%
B:45:Leu
A:296:TrpA:269:Leu
3.1 A3.4 A
52.50%
B:612:Ala
A:156:His
3.0 A
69.00%
B:46:Tyr
A:269:Leu
3.5 A
1.50%
B:613:Tyr
A:180:AsnA:156:His
3.0 A3.5 A
77.30%
B:47:Ser
A:269:Leu
3.3 A
59.50%
B:614:Asn
A:156:HisA:180:AsnA:179:SerA:203:Glu
2.7 A3.0 A3.4 A4.0 A
100.00%
B:48:Ala
A:273:Trp
2.6 A
97.90%
B:615:Ile
A:203:GluA:201:LysA:227:Phe
1.4 A2.6 A3.8 A
94.80%
B:49:Cys
A:243:Leu
3.5 A
31.90%
B:617:Asn
A:177:LeuA:153:Asp
3.3 A3.5 A
53.90%
B:50:Pro
A:243:LeuA:246:Ala
1.8 A3.4 A
94.20%
B:619:Ser
A:131:MetA:107:GlnA:156:His
3.3 A3.5 A3.9 A
99.00%
B:51:Ile
A:243:LeuA:244:Glu
3.6 A3.9 A
41.10%
B:620:Leu
A:84:PheA:108:HisA:107:Gln
1.1 A3.1 A3.8 A
87.80%
B:52:Phe
A:244:GluA:218:HisA:243:LeuA:215:Gly
2.7 A2.8 A3.6 A3.8 A
98.30%
B:621:Val
A:84:PheA:60:HisA:107:Gln
2.2 A2.3 A3.4 A
55.90%
B:55:Ile
A:215:Gly
3.9 A
49.50%
B:622:Lys
A:60:His
3.6 A
0.00%
B:59:Cys
A:273:Trp
3.7 A
59.00%
B:623:Ala
A:39:HisA:60:His
1.6 A1.6 A
99.60%
B:524:Leu
A:325:Arg
3.5 A
32.50%
B:624:Ala
A:39:His
1.7 A
59.90%
B:550:Tyr
A:358:GluA:382:Lys
1.7 A2.1 A
64.20%
B:625:Tyr
A:41:LysA:39:His
1.9 A3.2 A
41.90%
B:563:Ala
A:380:Asn
3.6 A
66.40%
B:626:Ser
A:41:Lys
2.9 A
5.90%
B:564:Ala
A:356:Cys
3.9 A
85.50%
B:627:Phe
A:41:Lys
3.6 A
0.00%
B:566:Gly
A:325:ArgA:358:Glu
2.9 A4.0 A
90.00%
B:654:Ile
A:41:LysA:62:Gln
0.9 A3.0 A
68.40%
B:567:Thr
A:325:Arg
4.0 A
0.00%
B:655:Ala
A:40:LeuA:41:Lys
2.5 A2.5 A
92.90%
B:568:His
A:302:TyrA:301:Glu
2.2 A2.3 A
50.50%
B:664:Phe
A:27:Lys
1.4 A
12.60%
B:569:Trp
A:302:Tyr
2.7 A
22.90%
B:665:Phe
A:27:LysA:26:ThrA:28:Cys
1.3 A2.6 A2.7 A
77.60%
B:570:Phe
A:302:Tyr
3.2 A
27.10%
B:667:Met
A:26:Thr
3.1 A
67.00%
Figure 4.
Detailed molecular interaction between epitopes vaccine and TLR-3 receptor. The residues at the interface are represented as lines and sticks. H-bonds (wine color), pi-cation (green), and non-covalent interactions (aromatic H-bond in cyan) between two molecules are displayed using dashed arrows.
Molecular docking of vaccine construct with humanTLR-3 receptor: The 799 residues long vaccine construct (blue) is found interacting with TLR-3 receptor (PDB: 20AZ) (green) through its residues at N-terminal and middle regions.Detailed molecular interaction betweenepitopes vaccine and TLR-3 receptor. The residues at the interface are represented as lines and sticks. H-bonds (wine color), pi-cation (green), and non-covalent interactions (aromatic H-bond in cyan) between two molecules are displayed using dashed arrows.A list of Interacting residues of docked vaccine-TLR-3 complex.Additionally, we have also performed docking with another toll-like receptor (TLR-5) which is known to recognize the flagellin of bacteria. Similar to TLR-3, the vaccinemodel has shown significant interactions with TLR-5 through its N-terminal as well as residues of middle region where theN-terminal folds in the constructed model, as shown in Figure 5. Residues such as Tyr5, Glu17 fromN-terminal, and Thr511, Lys514 frommiddle regions of vaccine construct interacts with residues Asp607, Ala674, Cys585, and Glu584 of TLR-5, respectively, through aromatic hydrogen bonds, salt bridge, and hydrogen bonds.
Figure 5.
Vaccine-TLR-5 interaction identified through docking. As observed in the interacting complex, the vaccine construct interacts through its N-terminal region with TLR-5.
Vaccine-TLR-5 interaction identified through docking. As observed in the interacting complex, the vaccine construct interacts through its N-terminal region with TLR-5.
Molecular dynamics simulations
To analyze the stability and dynamic motions of docked complex, we performed a 20ns long MD simulation using Gromacs v5.1. The simulation system comprised ~7 lacs atoms in a cubic box with proteins, solvent, and ions. Based on the RMSD data, the complex was found to have an upward trend ranging from ~1 to 2.5 nm till 20ns with small fractions of stability in between (Figure 6(a)). However, according to RMSF values depicted in the graph of Figure 6(b), the fluctuation in residues of TLR-3 receptor was very less as compared to the docked structure of the vaccine, which had fluctuations in terminal regions. The interacting part of vaccine (i.e. N-terminal and middle regions) with TLR had minimal fluctuations in comparison with the other residues. Overall, the average fluctuation in the residues of the vaccinemodel was near ~1.2 nm.
Figure 6.
Illustration of docked vaccine model and TLR-3 complex through molecular dynamics simulations till 20ns. (a) RMSD, (b) RMSF, and (c) Hydrogen bonds. (d) Represents the predisposition of intrinsic disorder in residues of vaccine construct as predicted by PONDR® pool of predictors: PONDR-FIT, PONDR-VSL2B, PONDR-VL3, PONDR-VLXT, and IUPRED 2A. The mean of all predictors is shown in short dotted lines in yellow, and the shadowed area over the mean plot represents its standard error.
Illustration of docked vaccinemodel and TLR-3 complex through molecular dynamics simulations till 20ns. (a) RMSD, (b) RMSF, and (c) Hydrogen bonds. (d) Represents the predisposition of intrinsic disorder in residues of vaccine construct as predicted by PONDR® pool of predictors: PONDR-FIT, PONDR-VSL2B, PONDR-VL3, PONDR-VLXT, and IUPRED 2A. Themean of all predictors is shown in short dotted lines in yellow, and the shadowed area over themean plot represents its standard error.Moreover, the regions interacting with TLR-3 receptor are not disordered as per our analysis in Figure 1(b) and had stability throughout the simulation. As calculated in visual molecular dynamics, the average H-bonds were found to be 200 in vaccine-TLR-3 complex during MD simulations (Figure 6(c)). Additionally, disordered regions based on the sequence of vaccine were also predicted. The graph in Figure 6(d) represents themoderate amount of disorder (~20% calculated as themean of all predictors) within residues 50–100 and 650–799 near N- and C-terminal regions. The regions interacting with TLR-3 receptor are not disordered as per our analysis in Figure 6(d) and had stability throughout the simulation.The CAI improved value for codon optimized sequence was 1.0, while GC content obtained for our vaccine construct was 49.56%. Overall, the vaccine sequence lies in a suitable category for its expression in E coli. Finally, the vaccine construct was cloned with restriction sites NdeI and XhoI in theE coli pET‐21a(+) vectors. Finally, a cloned construct was generated, having a sequence length of 7767 base pairs (Figure 7).
Figure 7.
In-silico cloning to express final vaccine construct representing pET‐21a(+) vector. The optimized codon of final vaccine construct is in Red.
In-silico cloning to express final vaccine construct representing pET‐21a(+) vector. The optimized codon of final vaccine construct is in Red.
Discussion
SARS-CoV-2 is highly pathogenic, which has caused more than 4 lakh deaths globally until the writing of this manuscript. Particularly among all, Coronaviridae structural proteins S, M, E, and N are themain structural proteins and can beemployed for protein/peptide-based vaccines. Different human infecting coronaviruses likeOC43, HKU1, and MERS-CoV use a diverse set of lipids and receptors to enter into target cells [49,50]. In SARS-CoV, thespike glycoprotein interacts with ACE-2 receptor and facilitates the fusion of viral membrane with the lungs membrane [11,51,52]. E is an integral membrane protein with N-terminal region, a transmembrane domain (TMD), and a flexible hydrophilic C-terminal tail containing a PZD-binding motif in the last four amino acids [53,54]. Envelope protein forms pentamer to lead viroporin like structure that exhibits a membrane-destabilizing ion-channel activity on the host membrane [55]. CoVenvelope, which plays a vital role during virion assembly, consists mainly of M protein and only a small fraction of E protein [56]. Whereas N protein shows interaction with M protein in lipid membrane of infected cells forming a double-layered vesicular structure in a secreted form that further interacts with host proteins [57,58].Multiple vaccines are in clinical trial, but till date, no vaccine is approved against theSARS-CoV-2. In a recent in-silico report, the vaccinemodel using spike glycoprotein has been constructed but limited to the docking with TLR 5 receptor [59,60]. It was reported that HumancoronavirusesOC43 recognize theMHC class I supertype A3 [61]. Based on these reports, we have chosen the A3 super type for epitope prediction, which will help the vaccine construct to act significantly. According to information obtained from the World Health Organization (WHO), one vaccine is in Phase 2, and two vaccines (DNA and RNA based) are in Phase 12 clinical trial. Moreover, 67 vaccines are in pre-clinical trials [62]. Previously, studies showed that the whole virus vaccine and virus-like particle vaccine against SARS-CoV are able to cureinfection but leads to thepulmonary immunopathologic type lung disease [63]. Therefore, it is needed that the vaccine is selected on the basis of multi factors, i.e., antigenicity, immunogenicity, toxicity, allergenicity. In the present study, we have designed the vaccine construct by using immune-informatics approaches. Three structural proteins (spike glycoprotein, nucleocapsid, and envelope) were selected to construct a multi-epitope vaccine, which is capable of eliciting the humoral and cell-mediated immune response. Furthermore, the vaccine is antigenic, non-allergen, nontoxic, immunogenic in nature, and capable of IFN-γ production. The vaccineepitopes were joined with the help of specific linkers to enhance stability and immunogenicity. The vaccine has a molecular weight of 86.35 kDa, basic in nature, and half-life of 30 hrs in mammalian reticulocytes (in-vitro). Moreover, vaccine was found to be highly stable and hydrophilic in nature. Finally, the build vaccine construct was found to bemoderately disordered, containing few disordered residues at N-terminal and relatively a vast region at C-terminus. As previously known, the vaccine candidates often have an ample amount of protein intrinsic disorder as they feature in the immune system [64]. Further, we also checked the interaction of vaccine construct through a modeled structure docked with two Toll-like Receptors (TLR-3 and TLR-5) structure. The adjuvant β-defensin 1 showed interaction with TLR-3 receptor and few residues from the same region also showed interaction with TLR-5. Further, the stability of the vaccine-TLR-3 complex was monitored by implementing molecular dynamics simulations till 20ns. Several residues at theN-terminal and middle region of vaccine construct were found to form stable interactions. Finally, the cloned sequence vector was generated in-silico for theexpression of the vaccine.
Conclusion and expert opinion
Newly emerged SARS-CoV-2 is declared as pandemic and scientists all over the world running a race against the time to find vaccine/drugs against COVID-19. Despite that, many obstacles like allergenicity, immunogenicity, toxicity, etc. limit the progress of vaccine development. In recent times, advancement in immunoinformatics approaches led researchers to speed up vaccine development. In this context, by precise prediction and selection of multi-epitope, we strongly suggest to experimentaly validate this multi-subunit vaccine, in-vitro and in-vivo.
Authors: Ruben J G Hulswit; Yifei Lang; Mark J G Bakkers; Wentao Li; Zeshi Li; Arie Schouten; Bram Ophorst; Frank J M van Kuppeveld; Geert-Jan Boons; Berend-Jan Bosch; Eric G Huizinga; Raoul J de Groot Journal: Proc Natl Acad Sci U S A Date: 2019-01-24 Impact factor: 11.205
Authors: Runtao He; Frederick Dobie; Melissa Ballantine; Andrew Leeson; Yan Li; Nathalie Bastien; Todd Cutts; Anton Andonov; Jingxin Cao; Timothy F Booth; Frank A Plummer; Shaun Tyler; Lindsay Baker; Xuguang Li Journal: Biochem Biophys Res Commun Date: 2004-04-02 Impact factor: 3.575