Literature DB >> 32815406

Exploring the SARS-CoV-2 structural proteins for multi-epitope vaccine development: an in-silico approach.

Amit Kumar1, Prateek Kumar1, Kumar Udit Saumya1, Shivani K Kapuganti1, Taniya Bhardwaj1, Rajanish Giri1.   

Abstract

INTRODUCTION: The ongoing life-threatening pandemic of coronavirus disease 2019 (COVID-19) has extensively affected the world. During this global health crisis, it is fundamentally crucial to find strategies to combat SARS-CoV-2. Despite several efforts in this direction and continuing clinical trials, no vaccine has been approved for it yet.
METHODS: To find a preventive measure, we have computationally designed a multi-epitopic subunit vaccine using immuno-informatic approaches.
RESULTS: The structural proteins of SARS-CoV-2 involved in its survival and pathogenicity were used to predict antigenic epitopes. The antigenic epitopes were capable of eliciting a strong humoral as well as cell-mediated immune response, our predictions suggest. The final vaccine was constructed by joining the all epitopes with specific linkers and to enhance their stability and immunogenicity. The physicochemical property of the vaccine was assessed. The vaccine 3D structure prediction and validation were done and docked with the human TLR-3 receptor. Furthermore, molecular dynamics simulations of the vaccine-TLR-3 receptor complex are employed to assess its dynamic motions and binding stability in-silico.
CONCLUSION: Based on this study, we strongly suggest synthesizing this vaccine, which further can be tested in-vitro and in-vivo to check its potency in a cure for COVID-19.

Entities:  

Keywords:  SARS-CoV-2; envelope; epitopes; nucleocapsid; spike glycoprotein; vaccine

Mesh:

Substances:

Year:  2020        PMID: 32815406      PMCID: PMC7544969          DOI: 10.1080/14760584.2020.1813576

Source DB:  PubMed          Journal:  Expert Rev Vaccines        ISSN: 1476-0584            Impact factor:   5.217


Introduction

Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2) is the causative agent for the associated disease named as COVID-19, which had spread throughout the world and declared as a pandemic [1,2]. SARS-CoV-2 has been demonstrated to have a strong affinity for human respiratory receptors (hACE2), which makes it more vulnerable to humans globally [3]. SARS-coronavirus infections are commonly found to be associated with multiple disorders, such as respiratory, enteric, hepatic, and neurological complications [4]. In general, coronaviruses (CoV) belong to a family Coronaviridae, which consists of enveloped positive-sense, single-stranded RNA viruses [5,6]. Among the four genera of CoVs, SARS-CoV, MERS-CoV, and SARS-CoV-2 belong to Betacoronaviruses [7]. Inside host cells, SARS-CoV-2 genome is translated into two large polyproteins: replicase polyprotein 1a (pp1a) and 1ab (pp1ab) [8]. The larger pp1ab comprises 16 non-structural proteins (Nsp1Nsp16), which are responsible for constructing a viral replication-transcription complex [9]. Alongside the viral RNA transcribes subgenomic RNAs, which encode four structural proteins; spike (S), envelope (E), membrane (M), and nucleocapsid (N) and several accessory proteins [10]. While accessory proteins are engaged in host immune suppression, including downregulation of the interferon pathway, structural proteins shape the virion, facilitating genome encapsulation, viral particle assembly, and release [11,12]. Recently, one health concept is used and hypothesized that the prior exposure of humans with pet animals lowers the adverse effect of SARS-CoV-2. The N and S protein epitopes in comparison with taxonomically related coronavirusesepitopes share high sequence homology, which has high implications for vaccine designing against SARS-CoV-2 [13,14]. Additionally, there are repurposing of drugs are also important to cobat this disease [15,16]. Since the beginning of the 21st century, two outbreaks of SARS-CoV and MERS-CoV have led the researchers into designing vaccines against SARS coronaviruses. The third outbreak, wherein SARS-CoV-2 infects promiscuously to humans, has given the need for speedy research into developing a broad-range vaccine against these coronaviruses. Evidence shows that humoral and cell-mediated immune response plays a protective role against coronavirus infections, specifically against S and N proteins [17-19]. Although the effective antibody response is short-lived, the T cell responses are found to provide long-term protection [20,21]. Similarly, TC cell response against the structural proteins S and N are long-lasting [22]. Previously, immunoinformatics approaches have been used to construct the vaccines against the Zika virus Nipah virus, and Human papillomavirus, where in-vivo studies of the Human papillomavirus showed promising results [23-25]. Hence, in this study, we have used immunoinformatic approaches to predict highly antigenic epitopes from SARS-CoV-2 structural proteins that would evoke a strong immune response in humans. For this purpose, we have used the structural proteins: Spike, Envelope, and nucleocapsid to predict B-cell, cytotoxic T lymphocyte (CTL) and helper T lymphocyte (HTL) epitopes for construction of vaccine. Thus, the multi-epitopic subunit vaccine would be capable of eliciting humoral as well as cell-mediated immune responses. We have also performed the docking and molecular dynamic simulations (MDS) between the vaccine and human Toll-like Receptor-3 (TLR-3) to study their binding stability. The outcome of the present study will result in a novel and immunogenic vaccine, which may be further accessed against SARS-CoV-2.

Material and methods

Protein sequence retrieval

Translated sequences of three structural proteins of SARS-CoV-2: Spike glycoprotein [Protein ID: QHD43416.1], Envelope [Protein ID: QHD43418.1], and Nucleocapsid protein [Protein ID: QHD43423.2] were obtained from the GenBank database.

B lymphocytes epitopes prediction

The IEDB tools (http://tools.iedb.org/bcell/) were used to identify the B cell epitope and for verifying antigenicity, BepiPred linear epitope analysis was performed. According to the BepiPred linear epitope prediction method, residues with scores above the threshold (default value 0.35) are predicted to be part of an epitope having 49% sensitivity and 75% specificity [26].

Helper T – cells epitopes prediction

The IEDB server (http://tools.iedb.org/mhcii/) was used for the prediction of MHC II antigenic binders in translated sequences. Top 10 HTL epitopes were sorted based on their percentile rank and IC50 value. Epitopes with the lowest percentile rank were considered an excellent binding affinity, whereas, peptides with IC50 values in lower (<50 nM), middle (<500 nM), and higher (<5000 nM) range corresponds to the highest, intermediate, and lowest binding affinity, for the T-cell, respectively [27].

Cytotoxic T- cells epitopes prediction

MHC class I binding/CTL epitopes for supertype A3 were predicted for all three proteins by using an online server NetCTL 1.2 (http://www.cbs.dtu.dk/services/NetCTL/) [28]. Default threshold of 0.75 was chosen for vaccine construction in this study (>1.25 corresponds to 54% sensitivity and 99.30% specificity; >1.00 shows 70% sensitivity and 98.5% specificity; >0.90 shows 74% sensitivity and 98% specificity). MHC class I binding and proteasomal cleavage prediction was performed using artificial neural networks. Accordingly, TAP transport efficiency was predicted using a weight matrix.

IFN‐γ inducing epitopes prediction

MHC class II binder epitopes/HTL, with the ability to induce cell‐mediated immunity, were identified using the IFNepitope server (http://crdd.osdd.net/raghava/ifnepitope/). The prediction was performed by a motif and support vector machine (SVM) hybrid approach [29]. All the positive epitopes able to induce interferon‐γ (IFN‐γ) were selected for vaccine construct.

Multi-epitopic vaccine construction

Selected high-scoring CTLs, high-affinity HTLs, and B-cell epitopes were used to generate the vaccine sequence. The different epitopes were linked together using linkers: CTL linker (AAY), HTL linker (GPGPG), and B epitope linker (KK). To improve the immunogenicity of the vaccine, β-defensin 1 (Uniprot Id: P60022) as an adjuvant was added through an EAAAK linker at the N-terminal of the construct.

Prediction of the antigenicity and allergenicity of the vaccine candidate

Antigenicity prediction

The servers ANTIGENpro (http://scratch.proteomics.ics.uci.edu/) and VaxiJen v2.0 (http://www.ddg‐pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) were used for the antigenicity evaluation of the final vaccine construct [30,31]. The VaxiJen 2.0 server predicts the antigenicity of the multi-epitope vaccine peptide based on the physicochemical properties of the input protein. Whereas, ANTIGENpro server predicts the antigenicity of the multi-epitopic vaccine based on the protein microarray data analysis of the target organism.

Allergen prediction

The server AllerTOP v2.0 (http://www.ddg-pharmfac.net/AllerTOP) and AllergenFP were used to predict the allergenicity of multi-epitopic vaccine [32,33]. AllerTOP v2.0 classifies the vaccine protein sequence by machine learning methods for the classification of allergens. AllergenFP is an alignment-free online server that uses SVM module to predict the allergenic and non-allergenic nature of proteins.

Vaccine features

Physiochemical properties of the constructed vaccine

Vaccine construct was accessed for its physicochemical properties using the ProtParam server (http://web.expasy.org/protparam/) that predicts the theoretical pI, instability index, half‐life, stability profiling, aliphatic index, and Grand Average of Hydropathy of the sequence [34].

Secondary structure prediction

Protein secondary structure was determined using PSIPRED, a web-based freely accessible online server (http://bioinf.cs.ucl.ac.uk/psipred/) [35] which also gives information of contact analysis, fold recognition, structure modeling, function prediction, protein intrinsic disorder prediction and domain prediction of a query sequence [36].

Disorder profile generation

The disorder profile of the vaccine construct was generated using PONDR pool of four predictors (http://original.disprot.org/metapredictor.php) [37-39] and IUPRED 2A predictor [40] as described in our previous reports [41-43].

Tertiary structure prediction and validation

The tertiary structure of the vaccine construct was generated using the I-TASSER web server [44]. It provides an energy minimized model through iterative template-based fragment assembly simulations. The structure was further optimized upon adding hydrogen and missing side-chain atoms followed by minimization in Schrodinger’s protein preparation wizard using OPLS 2005 forcefield [45]. This produced a high-quality model for protein-protein docking and molecular simulations. RAMPAGE [Mordred.bioc.cam.ac.uk/~rapper/rampage.php] and PROCHECK [46] web servers were used for evaluation of the vaccine model based on the distribution of amino acids in the Ramachandran plot.

Molecular docking of the final vaccine with immune receptors

PIPER program embedded in the BioLuminate module of Schrodinger for protein-protein docking was implemented for docking of vaccine model and TLR-3 (PDB ID: 20AZ) and TLR-5 (PDB ID: 3J0A) receptors [47,48]. It is an efficient tool that reduces false positive poses and performs a global search with Fast-Fourier Transform (FFT) approach. Using 1000 conformations of input structures, the top 50 clusters were selected with cluster radius 9 Å, which then centralized, and the outcomes of the docking based on cluster size were evaluated. With the largest cluster size, the docked complex out of 10 complexes was selected for molecular dynamics simulation.

Molecular dynamics simulation of vaccine-immune receptors

The ~86kDa vaccine model in complex with TLR-3 receptor was evaluated for its binding stability in an aqueous environment for 20ns. With a total of 6,66,607 molecules of TIP3P water model, 60 neutralizing Cl− ions, and 0.15 M concentration of salt, the complex was incorporated in a cubic box. Recently updated forcefield CHARMM36 was implemented to generate the topologies of the protein-protein complex. The generated system was minimized for 50,000 steps of the steepest descent algorithm. Further, the equilibration process under NPT and NVT conditions for 1ns was done with Parrinello-Rahman and V-rescale methods for coupling of pressure and temperature, respectively. Lastly, production MD run for all three systems was performed in periodic boundary conditions for 20ns. LINCS algorithm was used for calculating bond parameters. Particle Mesh Ewald (PME) for long-range electrostatics with fourier spacing of 0.16 was used for production MD. An analysis of MD trajectory, Root Mean Square Deviation (RMSD), and Root Mean Square Fluctuation (RMSF) was calculated with gmx rms, and rmsf commands.

In-silico cloning of final vaccine

For the production of final vaccine in large quantity, Java Codon Adaptation Tool (http://www.jcat.de/) was used for codon optimization and attached with expression vector. The optimized codon sequence includes Codon Adaption Index (value 1.0) and %GC content (30–70%), which denotes the high level of protein expression and DNA stability. Finally, restriction cloning was performed using tool SnapGene v3.3.4. The restriction sites NdeI and XhoI were added to the N and C terminals of the optimized complementary DNA sequences followed by their insertion within the pET‐21a(+) vector to ensure the polyprotein synthesis within the E coli expression system.

Results

Prediction of B cell and T cell epitope

HTL epitopes for all structural proteins were predicted using IEDB server for human MHC-II alleles. Among these, top 10 epitopes based on their least percentile rank depicting their high affinity were selected. The final selection of epitopes was made on the basis of overlapping sequences, which were further assessed for their IFN inducing potential. Epitopes with positive IFN inducing potential were finally selected for vaccine construction (Table 1). For each structural protein, CTL epitopes for supertype A3 were predicted using an online server NetCTL 1.2. A total of 40, 5, and 4 epitopes were predicted for S, N, and E proteins, respectively (Table 2). Linear B-cell epitopes in all three structural proteins were predicted using another online server BCPREDS. Epitopes selected on the basis of prediction scores were further screened for their non-allergic and antigenicity potential. Two epitopes each for S and N with high antigenicity scores were selected for final vaccine construction are tabulated in Table 3. However, no B cell epitopes were found for the E protein.
Table 1.

Predicted HTL specific epitopes obtained from IEDB.

S. No.SARS-CoV-2 proteinPositionAlleleHTL epitopesPrediction result for the IFN
1.Spike Glycoprotein113–131HLA-DRB1KTQSLLIVNNATNVVIKVCNegative
691–707HLA-DRB1SIIAYTMSLGAENSVAYPositive
115–130HLA-DRB3QSLLIVNNATNVVIKVNegative
2.Nucleocapsid301–320HLA-DRB1WPQIAQFAPSASAFFGMSRIPositive
310–326HLA-DRB1SASAFFGMSRIGMEVTPPositive
3.Envelope34–50HLA-DRB1LTALRLCAYCCNIVNVSPositive
45–60HLA-DRB1NIVNVSLVKPSFYVYSNegative
55–71HLA-DRB1SFYVYSRVKNLNSSRVPNegative
Table 2.

Predicted CTL epitopes for A3 super type obtained from NetCTL 1.2 server.

S. No.SARS-CoV-2 proteinPositionAllele supertypeCTL epitopesScore
1.Spike Glycoprotein30A3NSFTRGVYY0.9914
41A3KVFRSSVLH1.3419
69A3HVSGTNGTK1.1575
89A3GVYFASTEK1.4615
142A3GVYYHKNNK1.3335
162A3SANNCTFEY0.8377
187A3KNLREFVFK0.7504
270A3LQPRTFLLK1.0511
296A3LSETKCTLK0.8213
302A3TLKSFTVEK1.3483
311A3GIYQTSNFR1.1119
349A3SVYAWNRKR1.1423
357A3RISNCVADY1.1734
361A3CVADYSVLY1.1344
370A3NSASFSTFK1.3454
372A3ASFSTFKCY0.8899
378A3KCYGVSPTK1.2722
409A3QIAPGQTGK1.0483
454A3RLFRKSNLK1.7563
458A3KSNLKPFER0.9792
529A3KSTNLVKNK0.9809
550A3GVLTESNKK0.9827
559A3FLPFQQFGR0.8186
604A3TSNQVAVLY1.1959
725A3EILPVSMTK1.2271
733A3KTSVDCTMY0.8544
  757A3GSFCTQLNR1.0783
787A3QIYKTPPIK1.4526
817A3FIEDLLFNK0.8027
827A3TLADAGFIK1.2451
925A3NQFNSAIGK1.0796
939A3SSTASALGK1.0678
956A3AQALNTLVK1.0252
975A3SVLNDILSR0.9214
1020A3ASANLAATK1.2193
1039A3RVDFCGKGY0.7565
1065A3VTYVPAQEK1.3960
1099A3GTHWFVTQR1.0342
1237A3MTSCCSCLK1.3360
1264A3VLKGVKLHY0.9774
2.Nucleocapsid229A3QLESKMSGK1.1019
249A3KSAAEASKK1.4421
311A3ASAFFGMSR1.0610
361A3KTFPPTEPK1.4314
379A3TQALPQRQK0.9357
3.Envelope30A3TLAILTALR1.1790
45A3NIVNVSLVK0.9921
55A3SFYVYSRVK1.0275
61A3RVKNLNSSR1.0581
Table 3.

Predicted B-cell specific epitopes obtained from server BCPREDS.

S. No.SARS-CoV-2 proteinB-cell epitopesAntigenic score
1.Spike GlycoproteinVRQIAPGQTGKIAD0.68
LTPGDSSSGWTAG0.90
2.NucleocapsidAFGRRGPEQTQGNFG0.56
TGPEAGLPYGANK0.81
Predicted HTL specific epitopes obtained from IEDB. Predicted CTL epitopes for A3 super type obtained from NetCTL 1.2 server. Predicted B-cell specific epitopes obtained from server BCPREDS.

Multi-epitope vaccine candidate

The designed construct comprising of 48 CTL epitopes, 4 HTL epitopes, and 4 B-cell epitopes were then joined using linkers. Additionally, in order to improve the immunogenicity of multi-epitope vaccine, the construct was tagged with an adjuvant (β-defensin-TLR-3 agonist) at N terminal region, which enhances the natural immune response.

Prediction of antigenicity and allergenicity of constructed vaccine

The vaccine must be antigenic and non-allergic in nature and also induce humoral as well as cell-mediated immune responses against the targeted virus. Our vaccine was observed to be antigenic with a respective probability score of 0.566 and 0.845 predicted by VaxiJen v2.0 and ANTIGENPro servers. The construct was also detected as a non-allergen, checked using the AllerTOP v2.0 server.

Physiochemical characterization of designed vaccine

The vaccine construct is composed of 799 amino acids and has a molecular weight of 86.35 kDa. It is basic in nature (theoretical pI 9.70) and has a half-life of 30 hrs in mammalian reticulocytes (in vitro). The instability index is estimated to be 32.21, which denotes the stability of the protein. The aliphatic index (68.42) showed that the protein is thermostable, whereas grand average of hydropathicity is observed to be negative (−0.084), indicating the hydrophilic nature of vaccine.

Prediction of secondary structure and disordered residues

The secondary structure was predicted using PESIPRED 4.0 server. As shown in Figure 1, the vaccine contains several α-helices together with many the β-strands. Six long helices from residues 3–19, 65–102, 119–142, 433–448, 475–541, and 597–614 with a long β-strands of 14 amino acids from residues 401–414 can be observed.
Figure 1.

PSIPRED graphical result of secondary structure prediction of vaccine is represented by different colors: α-helix (pink), β-sheet (yellow), and coil (gray).

PSIPRED graphical result of secondary structure prediction of vaccine is represented by different colors: α-helix (pink), β-sheet (yellow), and coil (gray).

Evaluation of tertiary structure of vaccine construct and docking with TLR-3 and TLR-5 receptors

The 799-residue long vaccine construct was modeled using I-TASSER. The lowest energy predicted model was selected for further refinement and docking studies. However, the validation on Ramachandran plot by PROCHECK server suggested that 88% residues of the constructed vaccine model lies in the favored and allowed regions while the rest 12% are a part of generously allowed and disallowed regions (Figure 2(b)). Similarly, RAMPAGE server also mapped nearly 90.7% residues in favored and allowed regions, while 9.3% residues were present in coordinates portraying the outlier region (Figure 2(a)). As predicted by secondary structure and disorder-based structure analysis, the tertiary structure of the construct has a high number of unstructured regions. Based on disorder prediction data, it is largely disordered at its terminals.
Figure 2.

Ramachandran plot evaluation by RAMPAGE (a) and PROCHECK (b) servers.

Ramachandran plot evaluation by RAMPAGE (a) and PROCHECK (b) servers. From the PIPER program, with highest cluster size 28, out of 10 complexes ranging till 15, the complex 1, also with greatest minimal local energy, was chosen for further analysis. The vaccine model interacts with the TLR-3 receptor (PDB: 20AZ) through its N-terminal and the middle regions (Figure 3). The resultant interacting residues of vaccine-TLR-3 complex are shown in Table 4. At the interface of docked complex, Asn617 of vaccine is observed to form a H-bond with Asp153 residue of the receptor. Additionally, a few more aromatic H-bonds between Val621, Leu620 residues of vaccine, and His60 of TLR-3 are identified. The complex if further stabilized by a pi-cation interaction between Lys595 of vaccine and Tyr283 of TLR-3 (Table 4). Figure 4 depicts a close view of vaccine-TLR-3 interface. Several amino acids (red) of the receptor (green) are found engaged with vaccine residues (blue).
Figure 3.

Molecular docking of vaccine construct with human TLR-3 receptor: The 799 residues long vaccine construct (blue) is found interacting with TLR-3 receptor (PDB: 20AZ) (green) through its residues at N-terminal and middle regions.

Table 4.

A list of Interacting residues of docked vaccine-TLR-3 complex.

Vaccine modelTLR-3 receptorDistanceBuried SASAVaccine modelTLR-3 receptorDistanceBuried SASA
B:13:LeuA:287:AsnA:288:Val2.1 A3.5 A60.70%B:572:ThrA:278:Met3.9 A76.10%
B:38:ValA:247:Asn3.0 A8.20%B:574:ArgA:227:PheA:229:AsnA:254:SerA:203:Glu1.5 A1.8 A2.7 A3.9 A89.30%
B:39:SerA:247:Asn3.2 A24.80%B:582:CysA:39:His3.9 A28.10%
B:40:SerA:273:TrpA:247:AsnA:272:Lys2.7 A3.0 A3.7 A73.80%B:594:ValA:283:Tyr3.4 A59.30%
B:41:GlyA:272:Lys3.5 A0.10%B:595:LysA:230:AsnA:229:AsnA:256:SerA:257:Asn2.0 A3.3 A3.5 A3.9 A82.70%
B:42:GlyA:272:Lys1.5 A56.70%B:596:LeuA:206:SerA:230:AsnA:180:Asn2.5 A3.2 A3.4 A59.50%
B:43:GlnA:272:Lys2.9 A15.90%B:606:LeuA:60:His2.9 A71.30%
B:44:CysA:273:Trp3.5 A57.60%B:608:AlaA:108:His3.0 A21.90%
B:45:LeuA:296:TrpA:269:Leu3.1 A3.4 A52.50%B:612:AlaA:156:His3.0 A69.00%
B:46:TyrA:269:Leu3.5 A1.50%B:613:TyrA:180:AsnA:156:His3.0 A3.5 A77.30%
B:47:SerA:269:Leu3.3 A59.50%B:614:AsnA:156:HisA:180:AsnA:179:SerA:203:Glu2.7 A3.0 A3.4 A4.0 A100.00%
B:48:AlaA:273:Trp2.6 A97.90%B:615:IleA:203:GluA:201:LysA:227:Phe1.4 A2.6 A3.8 A94.80%
B:49:CysA:243:Leu3.5 A31.90%B:617:AsnA:177:LeuA:153:Asp3.3 A3.5 A53.90%
B:50:ProA:243:LeuA:246:Ala1.8 A3.4 A94.20%B:619:SerA:131:MetA:107:GlnA:156:His3.3 A3.5 A3.9 A99.00%
B:51:IleA:243:LeuA:244:Glu3.6 A3.9 A41.10%B:620:LeuA:84:PheA:108:HisA:107:Gln1.1 A3.1 A3.8 A87.80%
B:52:PheA:244:GluA:218:HisA:243:LeuA:215:Gly2.7 A2.8 A3.6 A3.8 A98.30%B:621:ValA:84:PheA:60:HisA:107:Gln2.2 A2.3 A3.4 A55.90%
B:55:IleA:215:Gly3.9 A49.50%B:622:LysA:60:His3.6 A0.00%
B:59:CysA:273:Trp3.7 A59.00%B:623:AlaA:39:HisA:60:His1.6 A1.6 A99.60%
B:524:LeuA:325:Arg3.5 A32.50%B:624:AlaA:39:His1.7 A59.90%
B:550:TyrA:358:GluA:382:Lys1.7 A2.1 A64.20%B:625:TyrA:41:LysA:39:His1.9 A3.2 A41.90%
B:563:AlaA:380:Asn3.6 A66.40%B:626:SerA:41:Lys2.9 A5.90%
B:564:AlaA:356:Cys3.9 A85.50%B:627:PheA:41:Lys3.6 A0.00%
B:566:GlyA:325:ArgA:358:Glu2.9 A4.0 A90.00%B:654:IleA:41:LysA:62:Gln0.9 A3.0 A68.40%
B:567:ThrA:325:Arg4.0 A0.00%B:655:AlaA:40:LeuA:41:Lys2.5 A2.5 A92.90%
B:568:HisA:302:TyrA:301:Glu2.2 A2.3 A50.50%B:664:PheA:27:Lys1.4 A12.60%
B:569:TrpA:302:Tyr2.7 A22.90%B:665:PheA:27:LysA:26:ThrA:28:Cys1.3 A2.6 A2.7 A77.60%
B:570:PheA:302:Tyr3.2 A27.10%B:667:MetA:26:Thr3.1 A67.00%
Figure 4.

Detailed molecular interaction between epitopes vaccine and TLR-3 receptor. The residues at the interface are represented as lines and sticks. H-bonds (wine color), pi-cation (green), and non-covalent interactions (aromatic H-bond in cyan) between two molecules are displayed using dashed arrows.

Molecular docking of vaccine construct with human TLR-3 receptor: The 799 residues long vaccine construct (blue) is found interacting with TLR-3 receptor (PDB: 20AZ) (green) through its residues at N-terminal and middle regions. Detailed molecular interaction between epitopes vaccine and TLR-3 receptor. The residues at the interface are represented as lines and sticks. H-bonds (wine color), pi-cation (green), and non-covalent interactions (aromatic H-bond in cyan) between two molecules are displayed using dashed arrows. A list of Interacting residues of docked vaccine-TLR-3 complex. Additionally, we have also performed docking with another toll-like receptor (TLR-5) which is known to recognize the flagellin of bacteria. Similar to TLR-3, the vaccine model has shown significant interactions with TLR-5 through its N-terminal as well as residues of middle region where the N-terminal folds in the constructed model, as shown in Figure 5. Residues such as Tyr5, Glu17 from N-terminal, and Thr511, Lys514 from middle regions of vaccine construct interacts with residues Asp607, Ala674, Cys585, and Glu584 of TLR-5, respectively, through aromatic hydrogen bonds, salt bridge, and hydrogen bonds.
Figure 5.

Vaccine-TLR-5 interaction identified through docking. As observed in the interacting complex, the vaccine construct interacts through its N-terminal region with TLR-5.

Vaccine-TLR-5 interaction identified through docking. As observed in the interacting complex, the vaccine construct interacts through its N-terminal region with TLR-5.

Molecular dynamics simulations

To analyze the stability and dynamic motions of docked complex, we performed a 20ns long MD simulation using Gromacs v5.1. The simulation system comprised ~7 lacs atoms in a cubic box with proteins, solvent, and ions. Based on the RMSD data, the complex was found to have an upward trend ranging from ~1 to 2.5 nm till 20ns with small fractions of stability in between (Figure 6(a)). However, according to RMSF values depicted in the graph of Figure 6(b), the fluctuation in residues of TLR-3 receptor was very less as compared to the docked structure of the vaccine, which had fluctuations in terminal regions. The interacting part of vaccine (i.e. N-terminal and middle regions) with TLR had minimal fluctuations in comparison with the other residues. Overall, the average fluctuation in the residues of the vaccine model was near ~1.2 nm.
Figure 6.

Illustration of docked vaccine model and TLR-3 complex through molecular dynamics simulations till 20ns. (a) RMSD, (b) RMSF, and (c) Hydrogen bonds. (d) Represents the predisposition of intrinsic disorder in residues of vaccine construct as predicted by PONDR® pool of predictors: PONDR-FIT, PONDR-VSL2B, PONDR-VL3, PONDR-VLXT, and IUPRED 2A. The mean of all predictors is shown in short dotted lines in yellow, and the shadowed area over the mean plot represents its standard error.

Illustration of docked vaccine model and TLR-3 complex through molecular dynamics simulations till 20ns. (a) RMSD, (b) RMSF, and (c) Hydrogen bonds. (d) Represents the predisposition of intrinsic disorder in residues of vaccine construct as predicted by PONDR® pool of predictors: PONDR-FIT, PONDR-VSL2B, PONDR-VL3, PONDR-VLXT, and IUPRED 2A. The mean of all predictors is shown in short dotted lines in yellow, and the shadowed area over the mean plot represents its standard error. Moreover, the regions interacting with TLR-3 receptor are not disordered as per our analysis in Figure 1(b) and had stability throughout the simulation. As calculated in visual molecular dynamics, the average H-bonds were found to be 200 in vaccine-TLR-3 complex during MD simulations (Figure 6(c)). Additionally, disordered regions based on the sequence of vaccine were also predicted. The graph in Figure 6(d) represents the moderate amount of disorder (~20% calculated as the mean of all predictors) within residues 50–100 and 650–799 near N- and C-terminal regions. The regions interacting with TLR-3 receptor are not disordered as per our analysis in Figure 6(d) and had stability throughout the simulation. The CAI improved value for codon optimized sequence was 1.0, while GC content obtained for our vaccine construct was 49.56%. Overall, the vaccine sequence lies in a suitable category for its expression in E coli. Finally, the vaccine construct was cloned with restriction sites NdeI and XhoI in the E coli pET‐21a(+) vectors. Finally, a cloned construct was generated, having a sequence length of 7767 base pairs (Figure 7).
Figure 7.

In-silico cloning to express final vaccine construct representing pET‐21a(+) vector. The optimized codon of final vaccine construct is in Red.

In-silico cloning to express final vaccine construct representing pET‐21a(+) vector. The optimized codon of final vaccine construct is in Red.

Discussion

SARS-CoV-2 is highly pathogenic, which has caused more than 4 lakh deaths globally until the writing of this manuscript. Particularly among all, Coronaviridae structural proteins S, M, E, and N are the main structural proteins and can be employed for protein/peptide-based vaccines. Different human infecting coronaviruses like OC43, HKU1, and MERS-CoV use a diverse set of lipids and receptors to enter into target cells [49,50]. In SARS-CoV, the spike glycoprotein interacts with ACE-2 receptor and facilitates the fusion of viral membrane with the lungs membrane [11,51,52]. E is an integral membrane protein with N-terminal region, a transmembrane domain (TMD), and a flexible hydrophilic C-terminal tail containing a PZD-binding motif in the last four amino acids [53,54]. Envelope protein forms pentamer to lead viroporin like structure that exhibits a membrane-destabilizing ion-channel activity on the host membrane [55]. CoV envelope, which plays a vital role during virion assembly, consists mainly of M protein and only a small fraction of E protein [56]. Whereas N protein shows interaction with M protein in lipid membrane of infected cells forming a double-layered vesicular structure in a secreted form that further interacts with host proteins [57,58]. Multiple vaccines are in clinical trial, but till date, no vaccine is approved against the SARS-CoV-2. In a recent in-silico report, the vaccine model using spike glycoprotein has been constructed but limited to the docking with TLR 5 receptor [59,60]. It was reported that Human coronaviruses OC43 recognize the MHC class I supertype A3 [61]. Based on these reports, we have chosen the A3 super type for epitope prediction, which will help the vaccine construct to act significantly. According to information obtained from the World Health Organization (WHO), one vaccine is in Phase 2, and two vaccines (DNA and RNA based) are in Phase 12 clinical trial. Moreover, 67 vaccines are in pre-clinical trials [62]. Previously, studies showed that the whole virus vaccine and virus-like particle vaccine against SARS-CoV are able to cure infection but leads to the pulmonary immunopathologic type lung disease [63]. Therefore, it is needed that the vaccine is selected on the basis of multi factors, i.e., antigenicity, immunogenicity, toxicity, allergenicity. In the present study, we have designed the vaccine construct by using immune-informatics approaches. Three structural proteins (spike glycoprotein, nucleocapsid, and envelope) were selected to construct a multi-epitope vaccine, which is capable of eliciting the humoral and cell-mediated immune response. Furthermore, the vaccine is antigenic, non-allergen, nontoxic, immunogenic in nature, and capable of IFN-γ production. The vaccine epitopes were joined with the help of specific linkers to enhance stability and immunogenicity. The vaccine has a molecular weight of 86.35 kDa, basic in nature, and half-life of 30 hrs in mammalian reticulocytes (in-vitro). Moreover, vaccine was found to be highly stable and hydrophilic in nature. Finally, the build vaccine construct was found to be moderately disordered, containing few disordered residues at N-terminal and relatively a vast region at C-terminus. As previously known, the vaccine candidates often have an ample amount of protein intrinsic disorder as they feature in the immune system [64]. Further, we also checked the interaction of vaccine construct through a modeled structure docked with two Toll-like Receptors (TLR-3 and TLR-5) structure. The adjuvant β-defensin 1 showed interaction with TLR-3 receptor and few residues from the same region also showed interaction with TLR-5. Further, the stability of the vaccine-TLR-3 complex was monitored by implementing molecular dynamics simulations till 20ns. Several residues at the N-terminal and middle region of vaccine construct were found to form stable interactions. Finally, the cloned sequence vector was generated in-silico for the expression of the vaccine.

Conclusion and expert opinion

Newly emerged SARS-CoV-2 is declared as pandemic and scientists all over the world running a race against the time to find vaccine/drugs against COVID-19. Despite that, many obstacles like allergenicity, immunogenicity, toxicity, etc. limit the progress of vaccine development. In recent times, advancement in immunoinformatics approaches led researchers to speed up vaccine development. In this context, by precise prediction and selection of multi-epitope, we strongly suggest to experimentaly validate this multi-subunit vaccine, in-vitro and in-vivo.
  45 in total

1.  Protein secondary structure prediction based on position-specific scoring matrices.

Authors:  D T Jones
Journal:  J Mol Biol       Date:  1999-09-17       Impact factor: 5.469

2.  Human coronaviruses OC43 and HKU1 bind to 9-O-acetylated sialic acids via a conserved receptor-binding site in spike protein domain A.

Authors:  Ruben J G Hulswit; Yifei Lang; Mark J G Bakkers; Wentao Li; Zeshi Li; Arie Schouten; Bram Ophorst; Frank J M van Kuppeveld; Geert-Jan Boons; Berend-Jan Bosch; Eric G Huizinga; Raoul J de Groot
Journal:  Proc Natl Acad Sci U S A       Date:  2019-01-24       Impact factor: 11.205

3.  Folding and structural polymorphism of p53 C-terminal domain: One peptide with many conformations.

Authors:  Amit Kumar; Prateek Kumar; Shobha Kumari; Vladimir N Uversky; Rajanish Giri
Journal:  Arch Biochem Biophys       Date:  2020-03-14       Impact factor: 4.013

4.  IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding.

Authors:  Bálint Mészáros; Gábor Erdos; Zsuzsanna Dosztányi
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

Review 5.  Mechanisms of coronavirus cell entry mediated by the viral spike protein.

Authors:  Sandrine Belouzard; Jean K Millet; Beth N Licitra; Gary R Whittaker
Journal:  Viruses       Date:  2012-06-20       Impact factor: 5.048

6.  Strategic Development of a Next-Generation Multi-Epitope Vaccine To Prevent Nipah Virus Zoonotic Infection.

Authors:  Rupal Ojha; Aditi Pareek; Rajan K Pandey; Dhaneswar Prusty; Vijay K Prajapati
Journal:  ACS Omega       Date:  2019-08-07

7.  In silico/In vivo analysis of high-risk papillomavirus L1 and L2 conserved sequences for development of cross-subtype prophylactic vaccine.

Authors:  Ali Namvar; Azam Bolhassani; Gholamreza Javadi; Zahra Noormohammadi
Journal:  Sci Rep       Date:  2019-10-23       Impact factor: 4.379

8.  Development of epitope-based peptide vaccine against novel coronavirus 2019 (SARS-COV-2): Immunoinformatics approach.

Authors:  Manojit Bhattacharya; Ashish R Sharma; Prasanta Patra; Pratik Ghosh; Garima Sharma; Bidhan C Patra; Sang-Soo Lee; Chiranjib Chakraborty
Journal:  J Med Virol       Date:  2020-03-05       Impact factor: 20.693

9.  Analysis of multimerization of the SARS coronavirus nucleocapsid protein.

Authors:  Runtao He; Frederick Dobie; Melissa Ballantine; Andrew Leeson; Yan Li; Nathalie Bastien; Todd Cutts; Anton Andonov; Jingxin Cao; Timothy F Booth; Frank A Plummer; Shaun Tyler; Lindsay Baker; Xuguang Li
Journal:  Biochem Biophys Res Commun       Date:  2004-04-02       Impact factor: 3.575

Review 10.  Emerging coronaviruses: Genome structure, replication, and pathogenesis.

Authors:  Yu Chen; Qianyun Liu; Deyin Guo
Journal:  J Med Virol       Date:  2020-02-07       Impact factor: 2.327

View more
  5 in total

1.  Exploring SARS-COV-2 structural proteins to design a multi-epitope vaccine using immunoinformatics approach: An in silico study.

Authors:  Samira Sanami; Morteza Alizadeh; Masoud Nosrati; Korosh Ashrafi Dehkordi; Fatemeh Azadegan-Dehkordi; Shahram Tahmasebian; Hamed Nosrati; Mohammad-Hassan Arjmand; Maryam Ghasemi-Dehnoo; Ali Rafiei; Nader Bagheri
Journal:  Comput Biol Med       Date:  2021-04-20       Impact factor: 6.698

2.  In silico Design and Characterization of Multi-epitopes Vaccine for SARS-CoV2 from Its Spike Protein.

Authors:  Gunderao H Kathwate
Journal:  Int J Pept Res Ther       Date:  2022-01-03       Impact factor: 1.931

3.  Computational construction of a glycoprotein multi-epitope subunit vaccine candidate for old and new South-African SARS-CoV-2 virus strains.

Authors:  Olugbenga Oluseun Oluwagbemi; Elijah Kolawole Oladipo; Emmanuel Oluwatobi Dairo; Ayodele Eugene Ayeni; Boluwatife Ayobami Irewolede; Esther Moradeyo Jimah; Moyosoluwa Precious Oyewole; Boluwatife Mary Olawale; Hadijat Motunrayo Adegoke; Adewale Joseph Ogunleye
Journal:  Inform Med Unlocked       Date:  2022-01-15

4.  In-silico design of an immunoinformatics based multi-epitope vaccine against Leishmania donovani.

Authors:  Subhadip Saha; Shubham Vashishtha; Bishwajit Kundu; Monidipa Ghosh
Journal:  BMC Bioinformatics       Date:  2022-08-05       Impact factor: 3.307

5.  Lessons Learned from Cutting-Edge Immunoinformatics on Next-Generation COVID-19 Vaccine Research.

Authors:  Chiranjib Chakraborty; Ashish Ranjan Sharma; Manojit Bhattacharya; Sang-Soo Lee
Journal:  Int J Pept Res Ther       Date:  2021-07-10       Impact factor: 1.931

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.