Literature DB >> 34746365

Development of an in silico multi-epitope vaccine against SARS-COV-2 by précised immune-informatics approaches.

Saad Al Zamane1, Fahim Alam Nobel2, Ruksana Akter Jebin2, Mohammed Badrul Amin3, Pratul Dipta Somadder1, Nusrat Jahan Antora4, Md Imam Hossain1, Mohammod Johirul Islam2, Kawsar Ahmed5, Mohammad Ali Moni6.   

Abstract

The coronavirus family has been infecting the human population for the past two decades, but the ongoing coronavirus called SARS-CoV-2 has posed an enigmatic challenge to global public health security. Since last year, the mutagenic quality of this virus is causing changes to its genetic material. To prevent those situations, the FDA approved some emergency vaccines but there is no assurance that these will function properly in the complex human body system. In point of view, a short but efficient effort has made in this study to develop an immune epitope-based therapy for the rapid exploitation of SARS-CoV-2 by applying in silico structural biology and advancing immune information strategies. The antigenic epitopes were screened from the Surface, Membrane, Envelope proteins of SARS-CoV-2 and passed through several immunological filters to determine the best possible one. According to this, 7CD4+, 10CD8+ and 5 B-cell epitopes were found to be prominent, antigenic, immunogenic, and most importantly, highly conserved among 128 Bangladeshi and 110 other infected countries SARS-CoV-2 variants. After that, the selected epitopes and adjuvant were linked to finalize the multi-epitope vaccine by appropriate linkers. The immune simulation disclosed that the engineered vaccine could activate both humoral and innate immune responses. For the prediction of an effective binding, molecular docking was carried out between the vaccine and immunological receptors (TLRs). Strong binding affinity and good docking scores clarified the stringency of the vaccines. Furthermore, MD simulation was performed within the highest binding affinity complex to observe the stability. Codon optimization and other physicochemical properties revealed that the vaccine would be suitable for a higher expression at cloning level. So, monitoring the overall in silico assessment, we anticipated that our engineered vaccine would be a plausible prevention against COVID-19.
© 2021 The Authors.

Entities:  

Keywords:  Codon optimization; Epitopes; In-silico cloning; Molecular docking; Molecular dynamics simulations; SARS-CoV-2

Year:  2021        PMID: 34746365      PMCID: PMC8563510          DOI: 10.1016/j.imu.2021.100781

Source DB:  PubMed          Journal:  Inform Med Unlocked        ISSN: 2352-9148


Introduction

The human population is rising, and its mobility has led to urbanization with the environment and ecological shifts responsible for the proliferation of numerous infectious diseases [1,2]. The human race has witnessed many of infectious diseases, which are inconsequential or even cause global disruption [1]. The new flu-like virus was supposed to emerge from Wuhan City on December 19, 2019 in China and was initially known as the novel coronavirus-2019 [[3], [4], [5]]. On February 11, 2020, the International Committee on Virus Taxonomy (ICTV) announced this virus as the severe acute respiratory syndrome-2 (SARS-2). After one month on March 11, 2020, COVID-19 was proclaimed by the World Health Organization as a pandemic condition due to 118326 confirmed cases and 4292 worldwide deaths. Up to September 25, 2021, the virus has spread rapidly to 235 countries, regions, and territories with 229,858,719 confirmed cases and 4,713,543 deaths [https://covid19.who.int]. The first case of COVID-19 in Bangladesh was detected on 8th March. Since then, 1,548,320 confirmed cases and 27,337 deaths had been registered on 25th September [https://covid19.who.int]. Coronavirus typically induces moderate respiratory tract infections such as a common cold. Still, newly developed coronavirus has many clinical symptoms such as low-grade fever, dry cough, dyspnea, exhaustion, gastrointestinal issues, diarrhea [[6], [7], [8], [9], [10], [11]]. Inflammation of the upper and lower respiratory tract has also been found to cause acute respiratory tract infection [3]. The median incubation time for COVID-19 is 3 days (range between 0 and 24 days) and the median time between first signs to death is 14 days (range between 6 and 41 days) [12,13]. Many patients with COVID-19 may experience shortness of breath and, in some cases; patients may develop septic shock, difficult to correct metabolic acidosis, or coagulation disorders [14]. It may also affect many organs; among them, kidneys and heart are very common [15,16]. Symptoms may vary from person to person; for example, some people may experience very mild symptoms without any fever and recover within 1–4 weeks, while others may develop infection and some may die [17,18]. However, the aged person with physical complications is at a high risk of fatality than the young aged and children [19]. Like other coronaviruses, COVID-19 is an enveloped single-stranded positive-sense (+SSRNA) virus [[20], [21], [22]]. The estimated genome length of SARS-CoV-2 is 30 kilo-bases (between 26000 and 32000 bases) [23,24]. Several studies reported that the full-length genome sequence of SARS-CoV-2 has a great similarity with bat coronaviruses, 45–90% to SARS-CoV and a smaller similarity of about 20–60% to MARS-CoV. For that reason, it was assumed that bat might be the original host of SARS-CoV-2, but the intermediate host is still now unknown to all [11]. The genome sequence of SARS-CoV-2 encoded sixteen non-structural and four major structural proteins [15]. These four structural proteins (Surface glycoproteins, Envelope protein, Membrane protein, and Nucleocapsid) are essential components of the viral assembly [25]. With these proteins, SARS-COV-2 paves the way for an invasion into the host [15,22,23]. Of all these structural proteins, the viral entry of SARS-CoV-2 is mediated by surface (S) glycoprotein [26]. The Membrane (M) protein consists of three structural domains, and each domain is responsible for the curvature of the membrane, the shape of the virion by binding to the nucleocapsid (N). Envelope (E) protein is involved in viral pathogenesis, the assembly and release of virion [15]. The fourth structural protein called nucleocapsid contains two different domains; both bind with the viral RNA genome, but their binding mechanism is different from each other [11]. In the case of non-structural protein, some study has reported that they are essential for the replication of SARS-CoV-2 [11]. Because of the high degree of contagiousness and its rapidly spreading to every territory, the World Health Organization declared COVID-19 a Public Health Emergency of International Concern (PHEIC) and called for the immediate implementation of adequate and effective care. As a result, scientists around the world are racing to develop a suitable vaccine candidate and some of them have already given FDA approval for emergency use. However, with the progression of the pandemic, SARS-CoV-2 continually mutated several times and have the possibility to change its pathogenicity [27]. This RNA virus is transmitted faster than any other virus in the last century [28,29]. So, it is essential to produce more and more new vaccine candidates that will stand against all of its mutational strain by giving long-lasting immunity. That is only possible by computational approaches within a short period and low cost. We thus made an effort to develop an in silico multi-epitope based vaccine by applying immunoinformatic tools. It is a subset of bioinformatics that deals with many immunological data using computational analysis. This is also a rapid computational method for the design of vaccines. Hypothetically both structural and non-structural proteins are potential for vaccine targets [30], but we only focused on the structural protein (S, M, E). Several experiments have been performed based on targeting the surface glycoprotein receptor-binding domain (RBD) domain. But, a rapid mutation on the RBD and high genomic variation of RNA viruses have a chance to escape the new strain from neutralization by currently RBD targeting antibodies [[31], [32], [33], [34], [35]]. So, considering the fact full-length surface, envelop and membrane protein have been adopted for multi-epitope vaccine constructs except for nucleocapsid. The reason for deselecting this nucleocapsid was their initial unavailability into the host cell's outside surface during infection. So, in the current study multiple immunoinformatic servers and tools have been applied to predict Cytotoxic T lymphocytes (CTL), Helper T Lymphocytes (HTL), Linear B-lymphocyte (LBL) epitopes that were highly conserved among the entire strain (In total, 238 genome sequences have been chosen to see the conservancy by prioritizing 128 Bangladeshi sequences and 110 from individually affected nations) of SARS-CoV-2 (Supplementary File-S, M, E). With these predictions, we tried to develop a suitable multi-epitope vaccine candidate by combining every selected epitope with corresponding linkers, where an adjuvant has also been associated at the first position to boost up the immunogenicity of the construct. In this way, we hoped that our designed vaccine will generate a proper immune response and could be used against SARS-CoV-2. The overall study processes have been illustrated as a graphical abstract in Fig. 1 .
Fig. 1

Flow chart designed for the multi-epitope vaccine against SARS-COV-2. The numbers in red circle depicted the sequence of the whole process. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Flow chart designed for the multi-epitope vaccine against SARS-COV-2. The numbers in red circle depicted the sequence of the whole process. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Methods and materials

Sequence retrieval and phylogenetic tree construction

The β-family coronavirus (HCOV-OC43, HCOV-HKU1, SARS, MARS, HCOV-NL63, HCOV-229E & SARS-CoV-2) surface, membrane, and envelop proteins amino acid sequences were extracted from NCBI database. The assigned accession numbers are HCoV-HKU1 (YP173238.1 = S, YP173241.1 = M, YP173240.1 = E), MERS-CoV (YP009047204.1 = S, YP009047210.1 = M, YP009047209.1 = E), HCoVNL63 (YP003767.1 = S, YP003770.1 = M, YP003769.1 = E), HCoV-229E (NP073551.1 = S, NP073555.1 = M, NP073554.1 = E), SARS-CoV-2(QJU11812.1 = S,QJU11815.1 = M, QJU11814.1 = E), HCoV-OC43 (YP009555241.1 = S, YP009555244.1 = M, YP009555243.1 = E), SARS-CoV (NP828851.1 = S, NP828855.1 = M, NP828854.1 = E). The first sequenced genome of SARS-CoV-2 in Bangladesh was chosen as a reference sequence from Child health research foundation, Dhaka [36]. To investigate their evolutionary connection, phylogenetic tree was constructed using MEGA X (neighbor-joining algorithm) using the default setting and bootstrap for 1000 replicates [37].

T-cell epitopes prediction for vaccine construct

CTL epitope prediction

The IEDB MHC I processing tool (http://tools.iedb.org/processing/) was applied to identify CTL (Cytotoxic T-lymphocytes) epitopes in accordance with the IEDB recommended method. To evaluate the binding interaction between peptides and MHC class I alleles, we adjusted the threshold at 50 nM (IC50 ≤ 50 strong binders) [38]. The epitopes were predicted by concentrating on three essential components: MHC-I binding, proteasomal processing and transport efficiency, and TAP transport [39]. Following prediction, the IEDB MHC I binding tool (http://tools.iedb.org/mhci/) was used to find the epitopes binding ability with 27 reference alleles using the Stabilized Matrix Method (SMM).

HTL epitope prediction

IEDB MHC-II Binding Predictions tool(http://tools.iedb.org/mhcii/) using SMM method was used to predict HTL epitopes against a set of 27 human reference HLAs. Epitopes were prioritized based on their IC50 value [40]. The IC50 value ≤ 50 nM demonstrated the highest binding affinity towards MHC-I & II, ≤500 nM is in the midrange, and ≤5000 nM corresponds to the lowest binding affinity [41].

B-lymphocyte epitope prediction

B-cell epitopes may be potential antigens capable of interfering with B cells [42]. As a result, the Linear B cell epitopes were predicted using the Kolaskar and Tangaoker Antigenicity and BepiPred 2.0 tools from the Immune Epitope Database Analysis Resource (IEDB-AR) server [43,44].

Antigenicity and allergen prediction of the CTL, HTL, LBL epitopes

The antigenic potential of the specified epitopes (CTL, HTL, LBL) were manually assessed using the VaxiJen2.0 server [45]. For the prediction of antigenicity, a threshold value of 0.4 was used. Non-antigenic epitopes with VaxiJen scores less than 0.4 were avoided, whereas antigenic epitopes with VaxiJen scores more than 0.4 were selected for future investigation. To minimize allergic responses to the selected epitopes, the allergenicity status of all CTL, HTL, and LBL epitopes was validated using the AllerTOP v.2.0 server [46].

Immunogenicity prediction of the CTL epitopes

The MHC class-I immunogenicity tool from the IEDB server was used to quantify the immunogenicity of cytotoxic T cell epitopes (CTL) [47]. This tool was developed to indicate the immunogenicity of peptides based on their amino-acid position and characteristics.

Toxicity prediction

The ToxinPred server (http://crdd.osdd.net/raghava/toxinpred/) was used to estimate the toxicity of the epitopes (CTL, HTL, LBL) to assure their non-toxic nature [48]. This server displays the toxicity and non-toxicity of epitopes depending on their physicochemical properties [39].

Prediction of cytokine-inducing HTL epitopes

The predicted HTL epitopes were examined further for their capacity to induce various cytokines [44]. The IFN epitope (http://crdd.osdd.net/raghava/ifnepitope/) [49], IL4pred (http://crdd.osdd.net/raghava/il4pred/) [50], and IL-10pred (http://crdd.osdd.net/raghava/IL-10pred/) servers [51,52] did the prediction using default settings.

Population coverage and conservancy analysis of the epitopes

Variants of the HLA allele were found in varying degrees of frequency all across the world [53]. As a consequence, the IEDB population analysis tool (https://tools.iedb.org/population/) was used to confirm the population coverage of our selected CTL and HTL epitopes [54]. Furthermore, a chord diagram was created through R programming languages 4.0 version to show the relationship between the country, continent, and ethnic groupings. The epitope's conservation is critical since it implies wider protection against diverse strains of SARS-COV-2. By filtering GSAID [55], a total of 238 genome sequences (128 from Bangladesh and 110 from other afflicted countries) were obtained. We attempted to cover the majority of the impacted nations' genome sequences that were accessible in the GSAID database (Supplementary File-S, M, E). Upon retrieval, multiple sequence alignment using Bioedit software was used to separate S, M, and E from these 238 whole genome sequences [56]. The recovered S, M, and E sequences were then converted to protein form using the Expasy translator program (https://web.expasy.org/translate/) [57]. To appraise the conservancy of our preferred epitopes, we used the IEDB conservancy tool [58].

Docking performance between T cell epitopes and MHC alleles

The binding affinity of CTL and HTL epitopes with their accompanying MHC alleles were compared by molecular docking approaches [3]. Predicted top-ranked epitopes (CTL, HTL) and their respective HLA allele binders (HLA-A-68; 01, HLA-A-02; 06, & HLA-DRB1-01; 01) were submitted to the Galaxy Pep-Dock server (http://galaxy.seoklab.org/cgi-bin/submit.cgi?type=PEPDOCK) [59]. The Protein Data Bank (PDB) (https://www.rcsb.org/) was used to retrieve the crystal structures of the HLA-A-02; 06 (PDB ID-3OXR), HLA-A-68; 01 (PDB ID-6PBH), and HLA-DRB1-01; 01 (PDB ID-5V4N) [60], which were then processed with BIOVIA Discovery Studio 2020 to eliminate superfluous ligands [53]. The complexes were modified with the Galaxy Refine server after docking(http://galaxy.seoklab.org/cgi-bin/submit.cgi?type=REFINE). The PRODIGY web-server (https://wenmr.science.uu.nl/prodigy/) was used to test their binding affinity [61].

Construction of the final vaccine structure

By combining all of the possible CTL, HTL, and LBL epitopes with suitable adjuvant and flexible linkers, a multi-epitope vaccine was constructed [62,63]. Based on the immunological filters mentioned above, the best potential epitopes were screened and combined to produce a single peptide chain [64]. Because peptides are not immunogenic on their own, an adjuvant is necessary to stimulate the immune response [65,66]. The vaccine's sequential position commenced with an adjuvant (β-defensins = GIINTLQKYYCRVRGGRCAVLSCLPKEEQIGKCSTRGRKCCRRKK) and was followed by the top CTL, HTL, and LBL epitopes [38]. The linkers used in this study were EAAAK, AYY, GPGPG, and KK, which aid in the connection of adjuvant and epitopes. The AYY linkers contribute in the formation of a favorable location for binding to the TAP transporter and enhancing epitope presentation. The GPGPG linker being utilized to promote HTL responses and to preserve the immunogenicity of helper and antibody epitopes that are conformationally dependent. The KK linkers combine B cell epitopes, and adjuvant was added to the vaccine's N-terminus with EAAAK linker, resulting in more effective separation and less contact with other vaccine domains [67].

Allergenicity prediction of the vaccine

We utilized several methods to estimate the allergenic score for the designed vaccine sequence with high accuracy [39]. The non-allergic behavior of the vaccine was validated using the AllerTop v.2.0 (https://www.ddg-pharmfac.net/AllerTOP/) [43] and AllergenFP v.1.0 (http://ddg-pharmfac.net/AllergenFP/) servers [68].

Antigenicity prediction of the vaccine

To anticipate the antigenic characteristics of the proposed vaccine construct, Vaxijen v2.0 and ANTIGENpro were both used. At 0.4 thresholds, Vaxijen v2.0 provides antigenicity with excellent accuracy (http://www.ddgpharmfac.net/vaxijen/VaxiJen/VaxiJen.html) [40,69]. ANTIGENpro (http://scratch.proteomics.ics.uci.edu/), a server that predicted total protein antigenicity based on machine learning algorithms utilizing experimentally verified microarray analysis reactivity data [30,70].

Prediction of protein solubility and transmembrane helices

The vaccine construct's solubility was determined using the SOLpro (http://scratch.proteomics.ics.uci.edu/) [71] and Protein-Sol (https://protein-sol.manchester.ac.uk/) servers [72]. SignalP-5.0 (http://www.cbs.dtu.dk/services/SignalP/data.php) [73] and TMHMM Server v2.0 (http://www.cbs.dtu.dk/services/TMHMM/) [74] were used to check for the existence of any signal peptides and transmembrane helices.

Physio-chemistry of the vaccine construct

The ExPASy ProtParam server (https://web.expasy.org/protparam/) was used to estimate the vaccine's physiochemical characteristics [75]. The vaccine candidate was fed into the server, which calculated characteristics such as molecular weight, protein half-life, instability index, theoretical pI, amino acid composition, aliphatic index, and GRAVY [76].

Cross-reactivity analysis with human proteomes

The BLASTP (https://blast.ncbi.nlm.nih.gov/Blast.cgi) search engine in the NCBI database was used to assess the sequential similarity of our epitope-based vaccine inside human proteomes. None of the antigenic sequences indicated more than a 10% similarity to the human proteome [77].

Secondary (2D) structure prediction

PSIPRED v4.0 (http://bioinf.cs.ucl.ac.uk/psipred/) server was used by two feed-forward neural networks that processed the PSI-BLAST output to predict the vaccine's secondary structure [78].

Tertiary (3D) structure prediction

RaptorX (http://raptorx.uchicago.edu/ContactMap/), a publicly available web server, was used to infer the vaccine sequence's 3D structure. The server predicts tertiary structure using a deep learning technique that integrates evolutionary coupling (EC) and sequence conservation information via an ultra-deep neural network built by two deep residual neural networks [79].

Refinement and energy minimization of the tertiary structure

To improve the optimal 3D structure of the construct, a three-step approach was explored; initially, the structure was refined by two independent servers: ModRefiner (https://zhanglab.ccmb.med.umich.edu/ModRefiner/) [80] and Galaxy Refine (http://galaxy.seoklab.org/cgi-bin/submit.cgi?type=REFINE) [81]. The energy was then minimized using Swiss-Pdb Viewer [82] and Chiron online web server (https://dokhlab.med.psu.edu/chiron/login.php) [83].

Tertiary structure validation

The ProSA-web (Protein Structure Analysis), PROCHECK (Ramachandran Plot Assessment), and ERRAT servers were deployed to validate the improved 3D structures and compare the models [84]. ProSA (https://prosa.services.came.sbg.ac.at/prosa.php) assigned to compute an overall quality score for the specified input structure. A 3D molecule view was included in the ProSA-web, facilitating the detection of the problematic part indicated by the given score [85]. The ERRAT (http://services.mbi.ucla.edu/ERRAT/) uses an empirical atom-based method to verify protein structures. This program compares the query sequence's non-bonded atomic interactions to a database of credible high-resolution crystallographic structures [86]. Finally, the Ramachandran plot, which specifies the quality of the modeled structure, was obtained using the PROCHECK server (https://servicesn.mbi.ucla.edu/PROCHECK/) [87].

Disulfide engineering of the vaccine construct

To identify the capacity of establish a disulfide bond between pairs of vaccine residues with correct geometry, the Disulfide by Design 2 server (http://cptweb.cpt.wayne.edu/DbD2/) was utilized [88].

Prediction of conformational B-cell epitopes in vaccine construct

B-lymphocytes' primary features are antigen presentation and cytokine production; in addition to generating antibodies, they provide humoral immunity in the human body [89]. ElliPro, a web server (http://tools.iedb.org/ellipro/), was used to determine conformational B-cell epitopes. Each predicted epitope is provided a PI (protrusion index) value by ElliPro. Finally, to see the epitopes, a Jmol viewer was used [90].

Molecular docking between vaccine and the receptors

Molecular docking is a computational approach that is crucial for studying protein-protein interaction patterns based on molecule binding affinity [91]. For that purpose, we docked our final vaccine construct with immune therapeutically relevant TLR2 (PDB ID-6nig), TLR3 (PDB ID-5gs0), TLR4 (PDB ID-4g8a), TLR7 (PDB ID-5gmg), and TLR8 (PDB ID-4qc0) proteins, since these interactions are critical for eliciting an adequate immune response. Because we used the TLRs structure from the protein data bank, we prepared the receptors by removing the associated ligand and water molecules using Discovery Studio 2020 software [92]. The High Ambiguity Driven Protein Docking (HADDOCK) server (http://milou.science.uu.nl/services/HADDOCK2.2/haddockserver-easy.html) was implemented for fast and accurate docking. This server used information from both biochemical bioinformatics and biophysical approaches to enhance docking sampling and scoring. The server requires active and passive residues from the vaccine and receptors as input. The CPROT server [93] predicted these interaction residues (active and passive). The clusters were polished by the HADDOCK Refinement server [94] after docking. Finally, using PDBsum (https://www.ebi.ac.uk/thorntonsrv/databases/pdbsum/Generate.html), the interaction residues between vaccine construct and TLRs were mapped [95].

Molecular dynamics (MD) simulations of the best docking complexes

GROMACS simulation package (GROMACS 2020.4) was used to perform molecular dynamics simulations for our top docking complex. MD simulation of protein-vaccine complex was carried out for 100 ns in water using CHARMM36 forcefield; trajectory and energy files were written in every 10ps [96]. The system was solvated in a truncated octahedral box, containing TIP3P water molecules. However, the simulation was performed in 0.15 M KCl by adding 122 Potassium ions and 163 Chloride ions [97]. To remove any steric incompatibilities, minimization was undertaken for 5000 steps using the Steepest Descent Method, and convergence was achieved within the maximum force 1000 (KJ mol-1 nm-1). The system was equilibrated at NVT and NPT ensembles for 100ps (50,000 steps) and 1000ps (1,000,000 steps), respectively, using time steps 0.2 and 0.1 fs. Production run for simulation was carried out at a constant temperature of 300 K and a pressure of 1 atm or bar (NPT) using weak coupling velocity-rescaling (modified Berendsen thermostat) and Parrinello-Rahman algorithms. Relaxation times were set to τ T = 0.1 ps and τ P = 2.0 ps. All bond lengths involving hydrogen atom were kept rigid at ideal bond lengths using the Linear Constraint Solver (lincs) algorithm, allowing for a time step of 2 fs. Verlet scheme was used for the calculation of non-bonded interactions. Periodic Boundary Conditions (PBC) were utilized in all x, y, z directions. Interactions within a short-range cutoff of 1.2 nm were calculated in each time step. Particle Mesh Ewald (PME) was used to calculate the electrostatic interactions and forces to account for a homogeneous medium outside the long-range cutoff. The production was run for 100ns for the complex. Visual Molecular Dynamics (VMD) software was involved to combine output trajectories from each run [98]. Finally, stability of the protein, vaccine and complex like the root means square deviation (RMSD), root mean square fluctuation (RMSF), Molecular Mechanics Poisson-Boltzmann Surface Area (MMPBSA) and Principal component analysis (PCA) were determined from the trajectories generated in the simulation [99]. The radius of gyration, temperature, pressure, density, potential was also calculated to see the compactness of the protein, vaccine and complex at different thermodynamic conditions. Bio3D package of R was used to draw all the plots [100] and MMPBSA graph was performed using MMGBSA method [101]. After plotting Xmgrace graph plotting software was used to modify all the plot [102].

mRNA structure prediction

RNAfold (http://rna.tbi.univie.ac.at/cgibin/RNAWebSuite/RNAfold.cgi) [103] and the mfold (http://unafold.rna.albany.edu/?q=mfold) web-servers [104] were used to verify the stability of the mRNA structure of the vaccine construct.

Immune simulation

C‐ImmSimserver (https://kraken.iac.rm.cnr.it/C-IMMSIM/index.php?page=0) was applied for the immunological simulation of the vaccine construct using the position‐specific scoring matrix (PSSM) [4,38]. For the majority of vaccines, the minimum recommended duration between doses 1 and 2 was 4 weeks. Our simulation took 1050-time steps (a time step of around 8 h), nearly 12 months. As a consequence, three peptide injections at time steps 1,84,170 were given four weeks apart [105].

Codon optimization and in silico cloning

The Java Codon Adaptation Tool (Jcat) (https://www.jcat.de/) was utilized for the codon optimization approach to enhance the expression of recombinant protein. Jcat developed a cDNA sequence of the vaccine to do codon optimization and reverse translation at the same time [106]. To ensure optimal protein expression, GenScript a Rare Codon Analysis Tool (https://www.genscript.com/tools/rarecodonanalysis) was used to improve the product by analyzing discrete parameters such as G-C concentration and Codon Adaptation Index (CAI) of the sequence [107]. Finally, cloning procedure was carried out using Snap Gene v4.2 software (https://snapgene.com/) by introducing the modified nucleotide sequence into the pET28a (+) expression vector [108].

Results

Retrieval of sequences and phylogenetic tree construction

The seven human coronavirus S, M, E protein sequences were retrieved from NCBI for the generation of phylogenetic relationship (Fig. 2 ). The analysis revealed that SARS-COV-2 had a great similarity with SARS-COV and MARS-COV (Fig. 2).
Fig. 2

Evolutionary relationship between the family of human corona virus Surface, Membrane and Envelop protein.

Evolutionary relationship between the family of human corona virus Surface, Membrane and Envelop protein.

Prediction of T lymphocyte epitopes

Prediction of CTL epitopes

Epitopes play a pivotal role in the formation of long-term immunity. We found 7 CTL epitopes using the IEDB-recommended approach described above. Several immunological filters were used to find these optimal epitopes (Table 2). All of the epitopes, as well as their positioning, antigenicity, and binding alleles, are documented in Table 1 .
Table 2

Results of the finally selected CTL epitope segments.

CTL EpitopeLengthImmunogenicityAllergenicityToxicity
1.MAYRFNGIGV100.3226Non-AllergenNon-Toxin
2.ASFRLFARTR100.29647Non-AllergenNon-Toxin
3.LVIGAVILR90.2601Non-AllergenNon-Toxin
4.FASVYAWNRK100.25563Non-AllergenNon-Toxin
5.SVLLFLAFVV100.24819Non-AllergenNon-Toxin
6.WTAGAAAYY90.15259Non-AllergenNon-Toxin
7.KLNDLCFTNV100.0961Non-AllergenNon-Toxin
Table 1

MHC-I epitopes with its binding allele (IC50 < 500 nM).

CTL EpitopePositionAlleleIC50Antigenicity (vaxijen v2.0)
1.MAYRFNGIGVS (902–911)HLA-A*68:02,22.871.3523
HLA-A*02:06,39.53
HLA-A*02:03,57.05
HLA-A*30:01,75.33
HLA-A*02:01108.76
2.FASVYAWNRKS (347–355)HLA-A*68:01,15.610.5868
HLA-A*33:01,34.68
HLA-A*68:01,76.73
HLA-A*30:01,105.43
HLA-A*31:01,139.79
HLA-A*11:01,176.32
HLA-A*11:01218.70
3.WTAGAAAYYS (258–266)HLA-A*30:02,58.810.6306
HLA-A*26:01,78.45
HLA-B*35:01,83.10
HLA-A*68:01,141.41
HLA-B*15:01,244.91
HLA-A*01:01,248.63
HLA-B*58:01,370.94
HLA-A*68:02394.09
4.KLNDLCFTNVS (386–395)HLA-A*02:03,23.782.6927
HLA-A*02:01,50.06
HLA-A*32:01,62.36
HLA-A*02:06,108.78
HLA-A*02:06,121.03
HLA-A*02:03202.28
5.ASFRLFARTRM (98–107)HLA-A*31:01,26.330.6237
HLA-A*31:01,99.88
HLA-A*68:01,192.30
HLA-A*33:01,200.01
HLA-A*30:01,256.43
HLA-A*33:01496.02
6.LVIGAVILRM (138–146)HLA-A*68:01,16.350.5004
HLA-A*31:01,100.11
HLA-A*11:01,160.07
HLA-A*33:01403.69
7.SVLLFLAFVVE (16-25)HLA-A*02:01,9.98
HLA-A*02:06,14.54
HLA-A*02:06,20.92
HLA-A*02:06,29.71
HLA-A*02:01,48.43
HLA-A*02:01,63.31
HLA-A*02:03,99.07
HLA-A*02:03,107.14
HLA-A*68:02,117.92
HLA-A*02:03,344.55
HLA-A*30:01381.92
MHC-I epitopes with its binding allele (IC50 < 500 nM). Results of the finally selected CTL epitope segments.

Prediction of HTL epitopes

Utilizing SMM methods, 10 HTL epitopes were sorted (Table 3 ). The epitopes were then pass through several immunological filters and checked for their inducing capability (Table 4 ). Within each epitope, it showed the inducing ability of at least two cytokines (Table 4).
Table- 3

MHC-II epitopes with its binding allele (IC50 < 500 nM).

HTL EpitopePositionAlleleIC50Antigenicity (vaxijen v2.0)
1.MFVFLVLLPLVSSQCS (1–15)HLA-DRB1*01:01,50.5741
HLA-DRB1*11:01,18
HLA-DPA1*03:01/DPB1*04:02,23
HLA-DRB1*04:05,41
HLA-DRB5*01:01,41
HLA-DPA1*01:03/DPB1*02:01,46
HLA-DPA1*02:01/DPB1*01:01,51
HLA-DRB1*15:01,96
HLA-DRB1*04:01,114
HLA-DRB1*07:01,118
HLA-DPA1*01/DPB1*04:01,132
HLA-DRB4*01:01,162
HLA-DRB1*12:01,232
HLA-DRB1*09:01,248
HLA-DRB1*08:02,421
HLA-DPA1*02:01/DPB1*05:01422
2.RVVVLSFELLHAPATS (509–523)HLA-DRB1*01:01,50.7485
HLA-DPA1*02:01/DPB1*01:01,34
HLA-DPA1*03:01/DPB1*04:02,47
HLA-DPA1*01:03/DPB1*02:01,58
HLA-DRB1*03:01,111
HLA-DRB1*09:01,128
HLA-DRB1*11:01,147
HLA-DRB1*04:01,194
HLA-DRB5*01:01,199
HLA-DRB1*15:01,207
HLA-DPA1*01/DPB1*04:01,221
HLA-DRB4*01:01,311
HLA-DRB1*07:01,347
HLA-DRB1*04:05,366
HLA-DQA1*01:01/DQB1*05:01372
3.LPIGINITRFQTLLAS (229–243)HLA-DPA1*01:03/DPB1*02:01,240.8156
HLA-DRB4*01:01,42
HLA-DPA1*03:01/DPB1*04:02,52
HLA-DPA1*01/DPB1*04:01,75
HLA-DPA1*02:01/DPB1*01:01,125
HLA-DRB1*01:01,135
HLA-DRB1*15:01,168
HLA-DRB1*04:01,175
HLA-DRB1*09:01,208
HLA-DRB1*07:01,210
HLA-DRB1*08:02,223
HLA-DRB1*11:01,244
HLA-DRB1*04:05,300
HLA-DQA1*01:02/DQB1*06:02471
4.AYYVGYLQPRTFLLKS (264–278)HLA-DRB1*01:01,260.4269
HLA-DPA1*02:01/DPB1*01:01,70
HLA-DPA1*01:03/DPB1*02:01,97
HLA-DPA1*01/DPB1*04:01,133
HLA-DRB1*07:01,133
HLA-DRB1*04:05,151
HLA-DRB1*15:01,175
HLA-DPA1*03:01/DPB1*04:02,179
HLA-DRB5*01:01,202
HLA-DRB1*11:01,253
HLA-DRB1*04:01,263
HLA-DRB1*09:01468
5.NIDGYFKIYSKHTPIS (196–210)HLA-DRB1*11:01,280.4465
HLA-DRB1*01:01,41
HLA-DRB1*15:01,80
HLA-DRB1*07:01,93
HLA-DRB5*01:01,119
HLA-DRB1*04:01,297
HLA-DRB1*09:01,379
HLA-DRB1*04:05486
6.RAAEIRASANLAATKS (1014–1028)HLA-DQA1*05:01/DQB1*03:1,0.5709
HLA-DRB1*01:01,
HLA-DRB1*09:01,
HLA-DRB1*13:02,
HLA-DQA1*01:02/DQB1*06:02,
HLA-DRB1*04:01,
HLA-DRB1*08:02,
HLA-DRB1*07:01,
HLA-DRB1*11:01
7.SNLLLQYGSFCTQLNS (750–764)HLA-DRB1*15:01,460.8305
HLA-DRB1*04:05,54
HLA-DPA1*01:03/DPB1*02:01,90
HLA-DRB1*01:01,96
HLA-DRB1*04:01,134
HLA-DPA1*02:01/DPB1*01:01,350
HLA-DPA1*01/DPB1*04:01361
8.RTLSYYKLGASQRVAM (174–187)HLA-DRB1*01:01,80.5644
HLA-DRB1*09:01,19
HLA-DRB1*07:01,23
HLA-DRB5*01:01,38
HLA-DQA1*05:01/DQB1*03:01,124
HLA-DRB1*04:01,251
HLA-DRB1*15:01,304
HLA-DRB1*11:01,309
HLA-DRB1*13:02,317
HLA-DRB1*04:05444
9.PKEITVATSRTLSYYM (165–179)HLA-DRB1*07:01,330.7003
HLA-DRB1*01:01,108
HLA-DRB1*09:01,160
HLA-DRB1*03:01,215
HLA-DRB5*01:01,281
HLA-DRB1*04:01,325
HLA-DRB1*11:01
10.FYVYSRVKNLNSSRVE (56-70)HLA-DRB1*01:01,160.6103
HLA-DRB1*07:01,43
HLA-DRB1*11:01,58
HLA-DRB1*04:01,65
HLA-DRB1*04:05,68
HLA-DRB1*09:01,97
HLA-DRB1*13:02,134
HLA-DRB1*15:01,183
HLA-DRB5*01:01,21
HLA-DRB1*08:02,289
HLA-DPA1*03:01/DPB1*04:02375
Table 4

Results of the finally selected HTL epitope segments.

HTL EpitopeAllergenicityToxicityIFN serverIL4 serverIL10 server
1.MFVFLVLLPLVSSQCNon-AllergenNon-ToxinPOSITIVENon-inducerinducer
2.RVVVLSFELLHAPATNon-AllergenNon-ToxinPOSITIVEinducerNon-inducer
3.LPIGINITRFQTLLANon-AllergenNon-ToxinNEGATIVEinducerinducer
4.AYYVGYLQPRTFLLKNon-AllergenNon-ToxinPOSITIVENon-inducerinducer
5.NIDGYFKIYSKHTPINon-AllergenNon-ToxinPOSITIVEinducerinducer
6.RAAEIRASANLAATKNon-AllergenNon-ToxinPOSITIVEinducerNon-inducer
7.SNLLLQYGSFCTQLNNon-AllergenNon-ToxinPOSITIVEinducerinducer
8.RTLSYYKLGASQRVANon-AllergenNon-ToxinPOSITIVEinducerNon-inducer
9.PKEITVATSRTLSYYNon-AllergenNon-ToxinPOSITIVEinducerNon-inducer
10.FYVYSRVKNLNSSRVNon-AllergenNon-ToxinNEGATIVEinducerinducer
MHC-II epitopes with its binding allele (IC50 < 500 nM). Results of the finally selected HTL epitope segments.

Identification of linear B cell epitopes

Promiscuous epitopes have been chosen based on antigenicity for the identification of linear B cell epitopes (Threshold 0.4). The epitopes were further checked for their allergenicity and toxicity (Table 5 ). We found five LBL highly antigenic epitopes ranged between 10 and 26 amino acid sequences (Table 5). Finally, the position of LBL, HTL, and CTL epitopes were visualized into the 3D structure of the surface, membrane, envelope protein by pymol software (Fig. 3 A, B, 3C).
Table 5

B cell epitopes selected by IEDB server.

EpitopesPositionAntigenicity (vaxijen v2.0)AllergenicityToxicity
1.MDLEGKQGNFKNLS (177–189)1.2592Non-AllergenNon-Toxin
2.YNSASFSTFKCYGVSPTKLNDLCFTS (369–393)1.4031Non-AllergenNon-Toxin
3.GDEVRQIAPGQTGKIADYNYKLPS (404–426)1.1017Non-AllergenNon-Toxin
4.VIGAVILRGHM (139–148)0.5844Non-AllergenNon-Toxin
5.YVYSRVKNLNSSRVPE (57-71)0.4492Non-AllergenNon-Toxin
Fig. 3

The glycoproteins (Surface, Membrane,Envelop) 3D structure showed the location of epitopes. A represented CTL epitope, B represented HTL epitopes & C represented LBL epitopes with their appropriate positions.

B cell epitopes selected by IEDB server. The glycoproteins (Surface, Membrane,Envelop) 3D structure showed the location of epitopes. A represented CTL epitope, B represented HTL epitopes & C represented LBL epitopes with their appropriate positions.

Multi-epitope vaccine construction

We combined 7CTL,10HTL, and 5LBL epitopes with the help of appropriate (AAY, GPGPG, and KK) linkers. Finally, we have completed our vaccine construction through the ligation of an adjuvant by the EAAAK linker. The graphical map of the linear multi-epitope vaccine was given in (Fig. 4 ).
Fig. 4

Graphical view of the generated vaccine construct. CTL, HTL & LBL epitopes were shown by light blue, green and violate color. Adjuvant and all the linkers were in orange and red colors. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Graphical view of the generated vaccine construct. CTL, HTL & LBL epitopes were shown by light blue, green and violate color. Adjuvant and all the linkers were in orange and red colors. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Antigenicity, allergenicity and solubility prediction of the final vaccine

The antigenicity of the vaccine was determined by the VaxiJen 2.0 and ANTIGENpro servers to be 0.5466 and 0.860981, respectively (Table S1). Furthermore, it was projected to be non-allergenic by both the AllergenFP and AllerTOP v. 2.0 servers (Table S1). The construct was also shown to be soluble in the SOLpro and protein-sol servers (Fig. S1, Table S1). The state of the engineered vaccine's transmembrane helices was anticipated in vaccine development (Fig. S2). Moreover, the lack of signal peptides also means preventing protein localization (Fig. S3).

Physiochemical properties identification

The vaccine constructions' molecular weight was calculated to be 47.145 kDa (kDa). The theoretical pI 10.01 corroborated the basic nature (Isoelectric point). The vaccine's stability was represented by the Instability Index of 28.26 (a score of less than 40 indicates a stable protein). The approximate amount of positively and negatively charged vaccine residues was 14 and 60, respectively. The half-life of mammalian reticulocytes (in vitro) was estimated to be 30 h, yeast (in vivo) to be > 20 h, and Escherichia coli to be more than 10 h (in vivo). The aliphatic index of 80.62 ensured thermostability, while GRAVY rating −0.134 confirmed the vaccine's hydrophilicity as well as its capacity to interact with diverse solvent molecules (Table S1).

Population coverage and conservation across antigen

The distribution of HLA alleles differs between nations, regions, and ethnic groups across the world. As a result, in addition to designing an effective vaccine, population coverage should be taken into account. Alleles (MHC-I, MHC-II) relevant to our selected CTL, HTL epitopes were gathered separately and combinedly for the population research. The strongest continental coverage was found in Europe, North America, East Asia, South Asia, and North Africa, while Central America had some of the weakest (Data S1, Fig. 5 C). European countries have topped the list in terms of country-specific coverage. However, in combination it covered 97.37% of the world's population. Following the advent of a pandemic, the countries most affected were China, Italy, Spain, France, Iran, and the United States. So, we were very interested to clarify how our vaccine coverage fared into these countries. The study revealed that our vaccines represent 93.29%, 98.62%, 92.85%, 98.95%, 96.15%, and 98.46% of Chinese, Italian, Spanish, French, Iranian, and United States populations respectively. Of the studied ethnic groups, the European Caucasoid ethnic population showed the highest population coverage for combined (99%), while the Asian and American ethnic populations, had more than 90% coverage (Data S1, Fig. 5B). We verified the conservation of our CTL and HTL epitopes among 238 SARS-COV-2 sequences whose genomic data was obtained from the GSAID database, as previously stated. To assess the conservancy, we noticed that all of the chosen epitopes almost spanned 100% conservancy among the sequences.
Fig. 5

T cell epitopes population coverage with their alleles. Circular plot represented the top 49 country, 60 etnicity and 13 continent that covered high population coverage of the CTL and HTL epitopes along with their respective MHC-I, MHC-II alleles when obtained individually (MHC-I or MHC-II) and in combination (MHC-I and MHC-II). (A) Population coverage of the 49 country out of 70. (B) Population coverage of the 60 etnicity out of 83. (C) Population coverage of the 13 continent out of 20.

T cell epitopes population coverage with their alleles. Circular plot represented the top 49 country, 60 etnicity and 13 continent that covered high population coverage of the CTL and HTL epitopes along with their respective MHC-I, MHC-II alleles when obtained individually (MHC-I or MHC-II) and in combination (MHC-I and MHC-II). (A) Population coverage of the 49 country out of 70. (B) Population coverage of the 60 etnicity out of 83. (C) Population coverage of the 13 continent out of 20.

Secondary and tertiary structure identification of the final vaccine

From secondary structure prediction, we observed 137(44%), 102(23%), and 193(31%) amino acid sequences involved in the generation of α-helix, β-sheet, and random coil (Fig. S4, Fig. S5). The calculated RMSD value for our modeled tertiary structure was 11.391, which was the lowest among the available models (lower the RMSD value higher the quality of the structure) (see Fig. 6 ).
Fig. 6

Representation of the tertiary structure (α-helix-red, β-sheet-blue, random coil-light green) and two terminals - (N & C). (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Representation of the tertiary structure (α-helix-red, β-sheet-blue, random coil-light green) and two terminals - (N & C). (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Tertiary structure refinement and validation of the final vaccine

Following refinement, the best 3D model of the vaccine construct was chosen based on the RMSD (0.250), GDT-HA (0.9948), and MolProbity parameters (2.009). The structure was then subjected to energy minimization in order to keep the molecules in stable form. Ramachandran plot analysis of the refined and energy minimized model disclosed that 86.9% of residues lied in the most favorable areas, 10.8% in allowed areas and only 1.4% were disallowed areas (Fig. 7 B). The selected best model quality in ERRAT was 75.924% (Fig. 7C), while ProSA gave a Z-score −5.12, indicating that the model was in the range of native protein conformations (Fig. 7A).
Fig. 7

(A) PROSA validated the structure with a Z score of −5.12 (B) Ramachandran plot analysis with PROCHECK server displayed a favored, allowed and disallowed area of 86.9%, 10.6% and 1.4%. (C) ERRAT validated the vaccine structure with a score of 75.924%.

(A) PROSA validated the structure with a Z score of −5.12 (B) Ramachandran plot analysis with PROCHECK server displayed a favored, allowed and disallowed area of 86.9%, 10.6% and 1.4%. (C) ERRAT validated the vaccine structure with a score of 75.924%.

Disulfide engineering of the final vaccine

In total, 39 pairs of residues that may be used in disulfide engineering have been identified (Data S2). However, after the energy value assessment, only six pairs of residues were finalized, as their value falls below the permitted range (energy should be below 2.2 kcal/mol). The position of six pairs mutations showed in Fig. S6 and their energy scores were listed in data S2.

Conformational B-cell epitope identification of final vaccine

Overall, 5 conformational B-cell epitopes were identified based on 0.5 threshold value and the maximum distance 6 (Fig. S7, Table S2). Their residues size was ranged from 3 to 79 (Table S2).

Molecular docking between T lymphocyte epitopes and MHC alleles

In general, epitopes with a preference for different alleles are the better candidates for vaccine construction. We docked our chosen epitopes (7 CTL & 10 HTL) with the alleles that were common among them. HLA-A*68:01 allele was found to be prevalent in 5 CTL epitopes and HLA-A*02:06 allele in 2 CTL epitopes (Table 1). 10 HTL epitopes, on the other hand, shared the HLA-DRB1*01:01 allele (Table 3). Such docking complexes negative binding energy were presented in (Fig. 8 ).
Fig. 8

Performing molecular docking between alleles and CTL, HTL epitopes. A to G represented the binding between CTL epitopes with the most common MHC I alleles and H to Q represented the binding between HTL epitopes with commonly occurring MHC II alleles. HLA alleles and epitopes were represented by ribbon and stick structures. The light color indicated the templates upon which the alleles and epitopes were formed. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Performing molecular docking between alleles and CTL, HTL epitopes. A to G represented the binding between CTL epitopes with the most common MHC I alleles and H to Q represented the binding between HTL epitopes with commonly occurring MHC II alleles. HLA alleles and epitopes were represented by ribbon and stick structures. The light color indicated the templates upon which the alleles and epitopes were formed. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Performing molecular docking between the vaccine and the receptors

Docking of the vaccine with TLR8

Representing 26% of the water refined models, HADDOCK generated 53 structures in 10 clusters. Among all other clusters, the most reliable cluster depends on the best HADDOCK score which was 558.9±19.3 (Table 6 ). The top structure of the leading cluster was then refined using the HADDOCK refining server. Where 20 structures were grouped into one cluster, that results in 100% of the water refining models. Following refinement, the top cluster's negative HADDOCK value (−227.0± 3.3) indicated a high binding affinity between the vaccine and TLR8 (Table 7 ). The refined model dynamic visualization was presented in (Fig. 9 ). Where we saw 12-hydrogen bonds have been formed and they were His172-Asn240, Arg247-Ser422, Tyr286-Asn421, Gly239-Arg547, Ser153-His699, Tyr88-Gly785, Phe145-Phe425, Thr234-Gln426, Lys73-Arg788, Arg123-Asp436, Ser127-Glu438, and Pro174-Lys328.
Table 6

Statistics of the interaction between vaccine and TLRs.

TLR 8TLR 2TLR 3TLR 4TLR 7
HADDOCK score (kcal/mol)558.9±19.3141.6±16.389.7±27.3170.0±31.2173.7±40.7
Cluster size6141947
RMSD from the overall lowest-energy structure (Å)3.8±0.534.2±0.30.9±0.52.2±0.833.8±0.1
Van der Waals energy (kcal/mol)−96.9±14.2−135.2±14.6−164.3±11.6−98.9±15.7−126.4±10.2
Electrostatic energy (kcal/mol)−198.5±21.8−234.2±59.0−305.2±15.4−405.0±48.9−315.2±49.2
Desolvation energy (kcal/mol)−40.8±11.3−39.5±3.8−49.9±1.6−43.1±0.8−43.4±12.7
Restraint's violation energy (kcal/mol)2983.0 ± 196.23630.7 ± 264.53649.5± 161.03930.1± 305.04065.4±304.1
Buried Surface Area (A2)7363.7 ± 315.94825.7 ± 226.85209.3 ±152.03728.3 ±163.94442.0±181.3
Z-Score−2.0−1.8−2.5−2.5−1.9
Table 7

HADDOCK score of the Vaccine-Receptor complex after refinement.

Vaccine-TLRs ComplexHADDOCK score
Vaccine-TLR8−227.0± 3.3(kcal/mol)
Vaccine-TLR2−244.8±7.5(kcal/mol)
Vaccine-TLR3−337.6±3.3(kcal/mol)
Vaccine-TLR4−250.2±1.7(kcal/mol)
Vaccine-TLR7−256.9±4.3(kcal/mol)
Fig. 9

The interaction pattern between V_COV19_SN-TLR8 complex. Almost twelve hydrogen bonds have been formed between vaccine and TLR8 complex (The red color sticks denoted receptor and blue color were vaccine protein). (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Statistics of the interaction between vaccine and TLRs. HADDOCK score of the Vaccine-Receptor complex after refinement. The interaction pattern between V_COV19_SN-TLR8 complex. Almost twelve hydrogen bonds have been formed between vaccine and TLR8 complex (The red color sticks denoted receptor and blue color were vaccine protein). (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Docking of the vaccine with TLR2

Representing 33% of the water refined models, HADDOCK generated 66 structures in 11 clusters. Among all other clusters, the most reliable cluster depends on the best HADDOCK score which was 141.6±16.3 (Table 6). The top structure of the leading cluster was then refined using the HADDOCK refining server. Where, 20 structures were grouped into one cluster that results 100% of the water refining models. Following refinement, the top cluster's negative HADDOCK value (−244.8± 7.5) indicated a high binding affinity between the vaccine and TLR2 (Table 7). After the interaction analysis it was seen that, 18-hydrogen bonds (ARG-SER, ARG-GLU, ASN-ARG, ARG-TYR, ARG-GLU, ALA-ARG, MET-HIS, PHE-TYR, LEU-ASP, GLY-ASP, TYR-LYS, SER-ASN, SER-SER, HIS-ASP, ILE-TYR, GLU-ARG, GLU-ARG, PRO-LYS) and 4 salt bridges (ARG-GLU, ARG-GLU, HIS-ASP, GLU-ARG) were formed between the vaccine and TLR2. Structural analysis of the docked complex with the prominent hydrogen bonds and salt bridges were depicted in (Fig. S8).

Docking of the vaccine with TLR3

Representing 32% of the water refined models, HADDOCK generated 65 structures in 10 clusters. Among all other clusters, the most reliable cluster depends on the best HADDOCK score which was 89.7± 27.3 (Table 6). The top structure of the leading cluster was then refined using the HADDOCK refining server. Where 20 structures were grouped into one cluster, that results in 100% of the water refining models. Following refinement, the top cluster's negative HADDOCK value (−337.6± 3.3) indicated a high binding affinity between the vaccine and TLR3 (Table 7). After the interaction analysis it was seen that, 15 hydrogen bonds (TYR-ASN, TYR-SER, TRP-SER, TYR-THR, TYR-GLN, TYR-GLN, ARG-HIS, ARG-THR, ARG-GLU, GLY-LYS, PRO-LYS, MET-LYS, VAL-TYR, SER-GLU, GLN-SER) and 2 salt bridges ARG-GLU, ASP-HIS were formed between the vaccine and TLR3. Structural analysis of the docked complex with the prominent hydrogen bonds and salt bridges were depicted in (Fig. S9).

Docking of the vaccine with TLR4

Representing 25% of the water refined models, HADDOCK generated 50 structures in 10 clusters. Among all other clusters, the most reliable cluster depends on the best HADDOCK score which was 170.0± 31.2 (Table 6). The top structure of the leading cluster was then refined using the HADDOCK refining server. Where 20 structures were grouped into one cluster, that results in 100% of the water refining models. Following refinement, the top cluster's negative HADDOCK value (−250.2± 1.7) indicated a high binding affinity between the vaccine and TLR4 (Table 7). After the interaction analysis it was seen that, 19 hydrogen bonds (TYR-ARG, LYS-GLU, ARG-VAL, ALA-GLU, TYR-GLU, TYR-ASN, VAL-SER, TYR-PHE, ARG-ASP, ARG-ASP, ARG-ASN, SER-LYS, TYR-SER, ASN-GLN, SER-HIS, LYS-GLU, ARG-ASP, ARG-HIS, ARG-GLU) and 5 salt bridges (LYS-GLU, ARG-ASP, LYS-GLU, ARG-ASP, ARG-GLU) were formed between the vaccine and TLR4. Structural analysis of the docked complex with the prominent hydrogen bonds and salt bridges were depicted in (Fig. S10).

Docking of the vaccine with TLR7

Representing 35% of the water refined models, HADDOCK generated 71 structures in 9 clusters. Among all other clusters, the most reliable cluster depends on the best HADDOCK score which was 173.7± 40.7 (Table 6). The top structure of the leading cluster was then refined using the HADDOCK refining server. Where 20 structures were grouped into one cluster, that results in 100% of the water refining models. Following refinement, the top cluster's negative HADDOCK value (−256.9±4.3) indicated a high binding affinity between the vaccine and TLR7 (Table 7). After the interaction analysis it was seen that, 12 hydrogen bonds (ARG-HIS, ASN-TYR, ARG-TRP, ARG-THR, TYR-ALA, TYR-THR, SER-HIS, VAL-THR, ARG-GLU, ARG-GLU, ARG-ARG, ARG-GLU) and 2 salt bridges (ARG-GLU, ARG-GLU) were formed between the vaccine and TLR7. Structural analysis of the docked complex with the prominent hydrogen bonds and salt bridges were depicted in (Fig. S11).

Minimization of energy and MD simulations of the complex

The simulation of molecular dynamics (MDS) was performed to investigate the physical movement and stability of our best docking complexes (Vaccine-TLR8) at different thermobaric conditions (Table 8 ). The simulation results have been described through the energy potential, radius of gyration, density, temperature, pressure, RMSD, RMSF, H-bond, MMGBSA and PCA analysis (Fig. 10 A,B, 11(A,B), 12). The complex is known to be energy minimized when its force reaches <1000 kJ/mol. After running the energy was found in between −1.99e+006 to −1.995e+006 which indicated the system was minimized and the complex was stable (Fig. 10B). The temperature plot revealed that the range of fluctuation was very small and the values varied between 298 and 302 K (Fig. 10A). A graph will be considered stable when the value reaches 300 K and stabilizes there. So according to this parameter, our complex was also retained its stability. The pressure value fluctuated widely throughout the 100-ns equilibration phase, but this behavior was not unexpected. Throughout the equilibration, the average fluctuation of the pressure was −200 to 200 bar (Fig. 10A). The system's average density measured for 100 ns was 1060 ± 3 kg/m^3, which also defined the complex stability (Fig. 10A). The RMSD plot showed the docked complex's structural stability and versatility. The mean RMSD value for the complex, protein and vaccine were 10.70 ± 1.49 Å, 2.77 ± 0.37 Å, 13.58 ± 1.78 Å. For vaccine, a higher RMSD indicated a change in conformation within 100ns of simulation and later it achieved a plateau. A higher RMSD of complex was due to the conformational changes in vaccine (Fig. 10B). The RMSF was calculated to determine the fluctuations of amino acid into the docked complex. RMSF showed significant rigidity of the protein structure except for residues ranging from 75 to 85 and 415–440. While RMSF of vaccine manifested slightly higher fluctuations among residues forming N-terminals, and residues ranging from 30 to 40, 255–265, 325–360, and 375–390 (Fig. 10B). The radius of the gyration plot revealed the protein's compactness along its axes (Fig. 10A). The mean RoG values were: complex = 36.23 ± 0.29 Å, protein = 31.43 ± 0.08 Å and vaccine = 29.51 ± 1.00 Å. A prominent fluctuation was observed in vaccine indicating opening and closing of structure. The protein RoG remained highly stable. The fluctuation in complex was mostly associated with the fluctuation in the vaccine molecule (Fig. 10A). The protein and vaccine binding free energy during 100ns was calculated by MMGBSA method (Fig. 11 A). The mean MMGBSA binding energy was found to be −46.53 kcal/mol (Fig. 11A). The result showed that the vaccine binds well with the protein along with few fluctuations during whole simulation time. The bindings become more stronger after 50ns of simulation time (Fig. 11A). Various energy contributions in MMGBSA binding energy (kcal/mol) for protein-vaccine complex was given in Table 8. Total number of hydrogen bonds formed within complex, protein and vaccine during 100ns of simulation time were very vital for the rigidity. Consistent fluctuating H-bonds indicated that protein and vaccine stayed bound throughout the simulation. The plot in blue color showed the number of hydrogen bonds formed between protein and vaccine. It could be said that over the course of simulation time, the interaction between protein and vaccine became more stronger, as the number of H-bonds increased. The Principal Component Analysis of the complex, protein and vaccine were calculated, where all three PCs captured 73.9%, 47.3% and 69.2% of structural variance in complex, protein and vaccine, respectively (Fig. S12). RMSF (PCA) plot represented the contribution of each residue to the first three principal components for complex, protein and vaccine. PC1 mostly captured structural variance for residues 75–90, 420–440 in protein and 5–23, 26–50, 325–355, 375–390 in vaccine (Fig. S13). Fig. 12 represented the compactness of the protein and vaccine after 100ns MD run. In this view, atoms were colored on a scale from blue to red, where blue correspond to atoms showing large motion amplitudes, and red were more rigid atoms. It could be seen that some domains of vaccine were pretty flexible and rest part of it remained in stable form. To see the correlated motion, Dynamic Cross Correlation Matrix Analysis (DCCM) was performed and visualized in Fig. 13 . In that figure colors varying from red to white to blue indicated intensity of correlated motion, where blue colors exhibited negative correlation, white showed no correlation and red color illustrated positive correlated motions between residues. Snapshots of protein-vaccine complex were taken at 0ns, 50ns and 100ns during the course of simulation time representing changing in conformations of protein and vaccine (Fig. 14 ). Both top views and side views were given for comparison (Fig. 14, Fig. S14).
Table 8

The binding free energy (kcal/mol) for vaccine complex-protein (TLR8).

Protein with vaccineΔEVDW (Van der Waal's energy)ΔEelec (Coulombic energy)ΔGGB (Generalized-Born Polar solvation energy)ΔESASA (Non-Polar solvation energy)ΔGMMPBSA (Protein-Ligand Binding energy)
Complex−207.49 ± 25.35 (kcal/mol)−823.29 ± 118.35 (kcal/mol)1008.68 ± 131.47 (kcal/mol)−24.43 ± 2.41 (kcal/mol)−46.53 ± 19.26 (kcal/mol)
Fig. 10

MD simulation between V_COV19_SN and TLR8. (A) Represented radius of gyration, Density, Temperature and Pressure during simulation of the protein, vaccine and complex. (B) Represented the RMSD, RMSF and potentials of the protein, vaccine construct and complex at the time of simulation.

Fig. 11

(A) Binding free energy using MMGBSA method and (B) Formation of hydrogen bond during the course of 100ns MD run.

Fig. 12

Interpolated structures of protein and vaccine along with PC1. The blue color represented large motion of amplitudes, while red was more in rigid form. This indicated some parts of it were flexible and the other parts retained its stability. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Fig. 13

Protein residue dynamic cross correlated motions for the protein, vaccine and complex. The residues that belonged to blue color showed negative correlation, white showed no correlation and red color indicated positive correlated motions among residues. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Fig. 14

The stability and convergence of the complex during 0ns, 50ns and 100ns MD run. The red color dictated vaccine and blue one was the protein. To see the structural changes, the complex represented here both in top view and side view. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

The binding free energy (kcal/mol) for vaccine complex-protein (TLR8). MD simulation between V_COV19_SN and TLR8. (A) Represented radius of gyration, Density, Temperature and Pressure during simulation of the protein, vaccine and complex. (B) Represented the RMSD, RMSF and potentials of the protein, vaccine construct and complex at the time of simulation. (A) Binding free energy using MMGBSA method and (B) Formation of hydrogen bond during the course of 100ns MD run. Interpolated structures of protein and vaccine along with PC1. The blue color represented large motion of amplitudes, while red was more in rigid form. This indicated some parts of it were flexible and the other parts retained its stability. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.) Protein residue dynamic cross correlated motions for the protein, vaccine and complex. The residues that belonged to blue color showed negative correlation, white showed no correlation and red color indicated positive correlated motions among residues. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.) The stability and convergence of the complex during 0ns, 50ns and 100ns MD run. The red color dictated vaccine and blue one was the protein. To see the structural changes, the complex represented here both in top view and side view. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.) Immune simulations of our final vaccine design demonstrated that it was capable of eliciting an effective immune response. In compared to IgG, high amounts of Immunoglobulin IgM were generally detected. Immunoglobulin levels (IgG1+IgG2, IgM, and IgG + IgM antibodies) were frequently enhanced in secondary and tertiary responses, with concurrent antigen decrease (Fig. 15A1). This indicated the development of immune memory as a result increased antigen clearance during subsequent exposures (Fig. 15A5). Furthermore, increased numbers of activated memory B cells revealed the vaccine's ability to elicit effective and long-lasting protection (Fig. 15 A (3,4)). Similarly, for memory formation, the TH, TR, and TC cell populations showed a strong response (Fig. 15A (5,6,7), B (1,2). A high degree of macrophage activity has been linked to dendritic and NK cell behavior (Fig. 15B (3,4,5)). Large levels of IFN- γ as well as IL-2 production were already observed. Furthermore, the innate immune system components were also involved (for example epithelial cell) (Fig. 15B6). Moreover, a lower simpson index (D) showed a variety of immune response possibilities (Fig. 15A2).
Fig. 15

In silico Immune simulation of the V_COV19_SN construct by C-ImmSim server. (A1) Antigen and immunoglobulins, (A2) Production of cytokine and interleukins. (A3) B-cell population, (A4) B-cell population per state, (A5) TH cell population, (A6) TH cell population per state, (A7) TR (regulatory) cell population per state. (B1) TC cell population, (B2) TC-cell population per state, (B3) NK cell population, (B4) MA cell population per state, (B5) DC cell population per state, (B6) EP cell population per state, (B7) PLB cell population.

In silico Immune simulation of the V_COV19_SN construct by C-ImmSim server. (A1) Antigen and immunoglobulins, (A2) Production of cytokine and interleukins. (A3) B-cell population, (A4) B-cell population per state, (A5) TH cell population, (A6) TH cell population per state, (A7) TR (regulatory) cell population per state. (B1) TC cell population, (B2) TC-cell population per state, (B3) NK cell population, (B4) MA cell population per state, (B5) DC cell population per state, (B6) EP cell population per state, (B7) PLB cell population.

mRNA structure prediction and in silico cloning

In order to demonstrate the expression efficiency, in silico cloning was conducted into the expression vector. At first, we reverse transcribed the peptide sequence using the java codon adaption tools for optimization. The length of the nucleotide sequence following codon optimization was 1296. The optimized nucleotide sequence had an average GC content of 51.2% (Jcat) and 52.03% (Genscript) (Ideal range of GC content is between 30% and 70%) (Fig. S15A). The obtained CAI values 0.97(Jcat) and 0.85(GenScript) were deemed ideal for expression in the intended organism. Secondary structure of the mRNA on the basis of minimum free energy was also determined (Fig. S15C). The predicted structure had ΔG = −428.80 kcal/mol. There was no hairpin or pseudoknot been observed at the first nucleotide of the 5′ end (Fig. S15D). The free energy associated with their structural element was given in (Table S3). Finally, the recombinant plasmid was constructed into a pET-28a (+) vector by restriction cloning (Fig. 16 & Fig. S16).
Fig. 16

In silico cloning of the V_COV19_SN. In the pET-28a (+) expression vector, V_COV19_SN was inserted where the red portion denoted the gene code for the vaccine and the vector backbone was represented in the black circle. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

In silico cloning of the V_COV19_SN. In the pET-28a (+) expression vector, V_COV19_SN was inserted where the red portion denoted the gene code for the vaccine and the vector backbone was represented in the black circle. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.)

Discussion

From December 19, 2019, COVID-19 reportedly created life threatening situation all around the world. The virus, SARS-CoV-2 transmits from person to person via droplet has already spread to every territory worldwide. Hence, the world health organization announced this virus as a global public health emergency [97]. This inexorable viral outbreak and resulting deaths have risen day by day; on the contrary, scientists are unable to find adequate medicine to mitigate the loss of this viral disease. Since the scientific community worldwide is looking to find a viable vaccine against this expeditiously developed SARS-CoV-2 and many of them are in the clinical trial, some have been approved by the FDA, which is now available on the market [[24], [109], [110]]. But neither of these vaccinations is 100% competent to the war against SARS-2. However, it was stated that various side effects between persons and some people died after vaccination. This issue influences us a lot to design a multi-epitope vaccine through immunoinformatics approaches. The advancement of immunoinformatics has now revolutionized the field of vaccine production. Based on multiple computational methods, we have planned to develop an antigenic multi-epitope (immunodominant) vaccine called V_COV19_SN (432aa) by targeting spike (S), membrane (M) and envelop (E) glycoprotein through a detailed genomic and proteomic study. Due to the following features, these multi-epitope based vaccines have some benefits compared to conventional or single-epitope based vaccines: (i) It is very safe and time-saving (ii) The T cell receptors can easily identify multi classes of MHC-I and MHC-II epitopes (iii) Adjuvant, which may be associated with the vaccine, would improve the long term immune response and increase immunogenicity (iv) By this way we can also avoid the difficulty of culturing pathogens and in vitro antigen expression complication [[111], [112], [113], [114], [115], [116], [117], [118], [119]]. So, in our approach, we first identified the CTL, HTL, and LBL epitopes from the targeted S, M, E glycoproteins by the IEDB server. Then we only selected those epitopes that were antigenic, immunogenic, and non-toxic via monitoring with particular servers mentioned in method section. Finally, we connected the CTL, HTL, LBL epitopes with proper (AYY, GPGPG, KK) linkers to complete the final part of vaccine development. An adjuvant β-defensin that helps to induce the immunogenicity, longevity, and most importantly, the vaccine construct's stability has also been improved by attaching the adjuvant to the epitopes (CTL, HTL, LBL) via EAAK linkers [120,121]. Thus, the constructed multi-epitope vaccine can generate specific cellular immunity and highly potent humoral immunity against infection [[122], [123], [124]]. After the construction, we verified our vaccine's non-allergenic profile by AllerTop v.2.0 and Allergen FP v.1.0 servers. Furthermore, the Vaxijen and ANTIGENpro server's up to scratch antigenicity scores proved that the vaccine was highly antigenic. The vaccine's physicochemical properties were also investigated using the Expasy Protparam server, where we have found the molecular weight was 47.15 kDa and the instability index was 28.26 which defines the constructed vaccine would be stable (The instability index lesser than 40 means the stability) (Table S1). The vaccine's theoretical pI was calculated to be 10.01, indicating that it was basic in nature and the aliphatic index of 80.62 confirmed its thermostability (Table S1). The vaccine's GRAVY (Grand average of hydropathicity) index −0.134 (lower the GRAVY index, greater its solubility) reflected its polar nature in water (Table S1). This solubility index was also checked by another server (Protein sol), where it again proved its effective interaction with water (Table S1). Because of its solvability and expected half-life >10 h in vivo, the constructed vaccine assured that it could be easily synthesized inside the host cell of E. coli (Table S1). The Ramachandran map was used to verify the vaccine's structural forms (3D structure), where we found that 87.1% of the sequence was in the preferred area, 10.6% was in the additional permitted region, 0.9% was generously allowed regions, and 1.4% was in the disallowed region (Fig. 7B). ERRAT and ProSA web server (Z score server) were further used to verify the vaccine's quality, where it scored 75.924% and −5.12, indicating the vaccine's proficiency (Fig. 7C, A). To observe the cell density behavior and optimal parameter, immune simulations were performed, where the repeated exposure have given the best immunological responses over antigen (Fig. 15). High levels of B cells and T cells population indicated humoral immunity, and the cytokines showed cell-mediated immunity (Fig. 15A(1–7), 15B(1,2)), Because of B cell proliferation and Ig isotype switching, the IFN-γ induced humoral immune responses as well [125,126]. The other (NK, MA, DC, EP and PLB) cell populations were also appreciable, suggesting a good development of immune memory against SARS-COV-2 (Fig. 15B(3–7)). Finally, a molecular docking analysis was performed between V_COV19_SN and TLRs (TLR2, TLR3, TLR4, TLR7, TLR8). Where, a stable protein-protein interaction was found with all the interactors (Fig. 9, Figs. S8–11). The highest binding affinity indicated that the interaction is more stable. Thus, in this case, we found the highest binding affinity (based on HADDOCK score) among the vaccine and TLR8, which means that the interaction between the TLR8-vaccine complex exceedingly stable compared to other complexes (Table 6). This interaction was very important for the activation of APC cells and the following CD4+, CD8+ T cells via MHC-I and MHC-II [63,127]. We also simulated molecular dynamics for 100ns because it is essential to check the vaccine's stability under in vivo conditions. As we determined the highest binding affinity between the vaccine and the TLR8, we performed MD simulation for this complex only to check its stability. By MD run it was confirmed that the interaction of our vaccine and TLR8 were stable (Fig. 10, Fig. 11). To ensure the effectiveness and efficient expression within the E. coli host, the vaccine was enhanced by codon adaptation tools and reversed translated into specific mRNA (Fig. S15). The favorable GC content, robust mRNA structure, and codon adaptability index showed its high-level capacity for effective expression in E. coli (K12) hosts (Fig. S15). Moreover, for successful cloning, two viral restriction enzymes EcoRI and BamHI were used to cut specific vaccine restriction sites. The cutting sites were then incorporated into a suitable vector pET28a (+) for in silico cloning (Fig. 16). Foroutan et al. have also adopted a similar technique to optimize codon before its expression in vitro [128]. After cloning, using IPTG (Isopropyl beta-d-1-thiogalactopyranoside) induction the recombinant vector can be propagated into E. coli cells and cultivated at 28 °C [129]. This strategy has also been applied for Pseudomonas aeruginosa, Hendra virus, Klebsiella pneumonia, Dengue, Malaria, Nipah virus and cancerous antigens [[130], [131], [132], [133], [134], [135], [136]]. Though the designed vaccine has no major limitations, still the thought was that, our vaccine construct covered almost 100% conservancy among the 128 genome sequences from Bangladesh and 110 from other affected countries. Especially it will provide an advantage to Bangladeshi population and Indian ethnic group. Also, the population coverage showed that it will be a good candidate for people all around the world. So, considering all the parameters we are very confident that by activating the immune cells and another complex signaling of the human body, our designed vaccine has the potential to produce long-lasting immunity against this deadly pathogen.

Conclusion

According to the current demand, we need to develop an effective vaccine or drug in an urgent basis for the management of SARS-COV-2 infections. But, it is a challenging task to produce an effective vaccine over a short period, and the possibility diminishes if the genetic material of the pathogen is RNA. Few vaccines have recently gained their FDA approval due to their promising results. However, considering the complex human body systems, there is no assurance that the vaccine will support long-lasting immunity around the globe. That's why more and more vaccine research projects are required through the immunoinformatics approaches. Using a set of immunoinformatics methods, we developed a multi-epitope dependent vaccine by combining CTL, HTL, and LBL epitopes, which were both antigenic and immunogenic. Besides, with high population coverage, our vaccine can cause robust immune responses as well. Therefore, we are very much hopeful that the vaccine built from our immunoinformatics research would be unequivocally suitable for combating this current scenario. Nevertheless, more experimental authentication is required to verify our developed vaccine as an appropriate prophylactic against this etiological agent of the COVID-19 outbreak.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  122 in total

1.  Determination of L1 retrotransposition kinetics in cultured cells.

Authors:  E M Ostertag; E T Prak; R J DeBerardinis; J V Moran; H H Kazazian
Journal:  Nucleic Acids Res       Date:  2000-03-15       Impact factor: 16.971

2.  High-throughput prediction of protein antigenicity using protein microarray data.

Authors:  Christophe N Magnan; Michael Zeller; Matthew A Kayala; Adam Vigil; Arlo Randall; Philip L Felgner; Pierre Baldi
Journal:  Bioinformatics       Date:  2010-10-07       Impact factor: 6.937

3.  SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling.

Authors:  N Guex; M C Peitsch
Journal:  Electrophoresis       Date:  1997-12       Impact factor: 3.535

4.  Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen.

Authors:  Jesper Pallesen; Nianshuang Wang; Kizzmekia S Corbett; Daniel Wrapp; Robert N Kirchdoerfer; Hannah L Turner; Christopher A Cottrell; Michelle M Becker; Lingshu Wang; Wei Shi; Wing-Pui Kong; Erica L Andres; Arminja N Kettenbach; Mark R Denison; James D Chappell; Barney S Graham; Andrew B Ward; Jason S McLellan
Journal:  Proc Natl Acad Sci U S A       Date:  2017-08-14       Impact factor: 11.205

5.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors:  Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

6.  Epitope based peptide vaccine against SARS-COV2: an immune-informatics approach.

Authors:  Richa Bhatnager; Maheshwar Bhasin; Jyoti Arora; Amita S Dang
Journal:  J Biomol Struct Dyn       Date:  2020-07-03

7.  How the interval between prime and boost injection affects the immune response in a computational model of the immune system.

Authors:  F Castiglione; F Mantile; P De Berardinis; A Prisco
Journal:  Comput Math Methods Med       Date:  2012-09-11       Impact factor: 2.238

8.  High-throughput rational design of the remdesivir binding site in the RdRp of SARS-CoV-2: implications for potential resistance.

Authors:  Aditya K Padhi; Rohit Shukla; Prakash Saudagar; Timir Tripathi
Journal:  iScience       Date:  2020-12-26

9.  GalaxyRefine: Protein structure refinement driven by side-chain repacking.

Authors:  Lim Heo; Hahnbeom Park; Chaok Seok
Journal:  Nucleic Acids Res       Date:  2013-06-03       Impact factor: 16.971

View more
  3 in total

1.  COVID-19 vaccine design using reverse and structural vaccinology, ontology-based literature mining and machine learning.

Authors:  Anthony Huffman; Edison Ong; Junguk Hur; Adonis D'Mello; Hervé Tettelin; Yongqun He
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

2.  Computational construction of a glycoprotein multi-epitope subunit vaccine candidate for old and new South-African SARS-CoV-2 virus strains.

Authors:  Olugbenga Oluseun Oluwagbemi; Elijah Kolawole Oladipo; Emmanuel Oluwatobi Dairo; Ayodele Eugene Ayeni; Boluwatife Ayobami Irewolede; Esther Moradeyo Jimah; Moyosoluwa Precious Oyewole; Boluwatife Mary Olawale; Hadijat Motunrayo Adegoke; Adewale Joseph Ogunleye
Journal:  Inform Med Unlocked       Date:  2022-01-15

3.  A novel vaccine based on SARS-CoV-2 CD4+ and CD8+ T cell conserved epitopes from variants Alpha to Omicron.

Authors:  Iam Palatnik-de-Sousa; Zachary S Wallace; Stephany Christiny Cavalcante; Maria Paula Fonseca Ribeiro; João Antônio Barbosa Martins Silva; Rafael Ciro Cavalcante; Richard H Scheuermann; Clarisa Beatriz Palatnik-de-Sousa
Journal:  Sci Rep       Date:  2022-10-06       Impact factor: 4.996

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.