Literature DB >> 33173250

Towards a novel peptide vaccine for Middle East respiratory syndrome coronavirus and its possible use against pandemic COVID-19.

Salman Khan1, Bilal Shaker2, Sajjad Ahmad3, Sumra Wajid Abbasi4, Muhammad Arshad5, Abdul Haleem5, Saba Ismail3, Anita Zaib4, Wasim Sajjad4.   

Abstract

Middle East respiratory syndrome coronavirus (MERS-CoV) is an emerging health concern due to its high mortality rate of 35%. At present, no vaccine is available to protect against MERS-CoV infections. Therefore, an in silico search for potential antigenic epitopes in the non-redundant proteome of MERS-CoV was performed herein. First, a subtractive proteome-based approach was employed to look for the surface exposed and host non-homologous proteins. Following, immunoinformatics analysis was performed to predict antigenic B and T cell epitopes that were used in the design of a multi-epitopes peptide. Molecular docking study was carried out to predict vaccine construct affinity of binding to Toll-like receptor 3 (TLR3) and understand its binding conformation to extract ideas about its processing by the host immune system. We identified membrane protein, envelope small membrane protein, non-structural protein ORF3, non-structural protein ORF5, and spike glycoprotein as potential candidates for subunit vaccine designing. The designed multi-epitope peptide then linked to β-defensin adjuvant is showing high antigenicity. Further, the sequence of the designed vaccine construct is optimized for maximum expression in the Escherichia coli expression system. A rich pattern of hydrogen and hydrophobic interactions of the construct was observed with the TLR3 allowing stable binding of the construct at the docked site as predicted by the molecular dynamics simulation and MM-PBSA binding energies. We expect that the panel of subunit vaccine candidates and the designed vaccine construct could be highly effective in immunizing populations from infections caused by MERS-CoV and could possible applied on the current pandemic COVID-19.
© 2020 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Middle East respiratory syndrome; Middle East respiratory syndrome coronavirus; Multi-epitope peptide; Toll-like receptor 3

Year:  2020        PMID: 33173250      PMCID: PMC7644433          DOI: 10.1016/j.molliq.2020.114706

Source DB:  PubMed          Journal:  J Mol Liq        ISSN: 0167-7322            Impact factor:   6.165


Introduction

Middle East respiratory syndrome caused by a novel coronavirus, Middle East Respiratory Syndrome Coronavirus (MERS-CoV), is a highly fatal viral respiratory tract infection in humans and is a global public health threat [1,2]. MERS-CoV was first described on June 13, 2012 in a patient from Dr. Soliman Fakeeh Hospital in Jeddah, Kingdom of Saudi Arabia [3,4]. The patient died 11 days after the virus detection. Since then, the virus was being reported frequently from across the Arabian Peninsula [4,5]. Furthermore, the spread of MERS-CoV to the countries of Middle East and other continents has brought international attention to the severe implications of the MERS pandemic [1]. Till to date, a total of 2279 laboratory-confirmed cases of MERS are reported worldwide according to the World Health Organization (WHO). This include 806 deaths with fatality rate of 35.3%. The bulk of these case occurred in Saudi Arabia: 1901 cases with death troll climbed to 732 at fatality rate of 38.5% (https://www.who.int/emergencies/mers-cov/en/). Dromedary camels (Camelus dromedaries) are implicated as a primary reservoir for the spillover of infection to the humans [3]. Person-to-person transmission of the virus have been shown to occur frequently especially in the hospital settings [6]. Direct contact with the camel or consumption of its products appear to be the main route for camel-to-human transmission, while prolonged contact among human hosts leads to person-to-person transmission [7]. Both camel-to-human and person-to-person transmission have been well documented in Saudi Arabia [5]. In South Korea, transmission of the diseases is entirely human-to-human contact [8]. In between 2012 and 2016, small outbreaks and sporadic episodes of the diseases have been reported across the Arabian Peninsula and were detected in humans from Bahrain, Kuwait, Oman, Qatar, the most notorious outbreak was seen in 2014 with more than 500 confirmed cases in the province of Riyadh and Makkah [9]. Till to date, cases of MERS-CoV are continuously reported from Saudi Arabia every month and that's why this country is known as critical global epicenter for MERS-CoV [10]. A range of therapeutics are investigated for MERS-CoV but still no approved treatment is available [11]. Development of a vaccine is a cost effective prophylactic measure and remains the most scalable [12]. In this regard, several candidates have been evaluated using variety of different approaches: however, still vaccine for MERS-CoV is not available [13]. Initially, vaccine development studies were hampered due to the absence of an appropriate small animal model of the MERS-CoV disease [14]. Barriers like these have been overcome in the recent past by developing in vivo approaches that allow and facilitate the testing vaccine candidate for MERS-CoV in small animal models [14]. Subunit vaccines that contains one or more pathogen immunogenic component are immunologically focused and have gained popularity because of reduced risk in vivo and relative ease of production [15]. The virus proteome comprises several vital vaccine targets that contribute in infection and pathogenesis of MERS-CoV [16]. The receptor-binding domain of spike (S) glycoprotein in particular mediate virus interaction with the host cell [15]. The envelope E protein plays a significant role in host cell recognition [17]. The interferon (IFN)-antagonizing properties of membrane protein aid in reducing IFN levels in infected cells [18]. Similarly, proteins from the open reading frame (ORF) group like ORF3, ORF5 and ORF4a are important from viral infection and pathogenesis point of view and mutational studies revealed that mutation in these genes leads to abrupt inflammation, host cell processes disruption and dysregulation IFN pathway activation [6]. The use of these single subunit vaccine in vaccine formulation is also not an appropriate choice to have as they harbor several different antigenic determinants, all of which are not desired and some may have detrimental effects on protective immunity induction [19]. This rationale has led to create an interest in designing a peptide vaccine for MERS-CoV that comprise of epitopes required for desired cellular and humoral immune response [20]. Peptide vaccines are considered sufficient enough to induce appropriate T and B-cell immunity, while at the same time excluding the risk of reactogenic and allergenic responses [21]. Moreover, peptide vaccine can be implicated for induction of broad-spectrum immunity against multiple variant of the pathogen [22]. On the contrary, because of the small size peptide vaccine is weakly immunogenic and require a carrier molecule for adjuvanting and chemical stability [19]. Several peptide vaccines are under process of development for different viral diseases, such as hepatitis C virus (HCV) [23], human papilloma virus (HPV) [24], human immunodeficiency virus (HIV) [25], and influenza [26] etc. A multi-epitope vaccine contains a series of overlapping epitopes that can elicit effective cytotoxic T cells, T helper cells and B cells responses against viral pathogen [27]. Unlike, subunit or single epitope-based vaccines, multi-epitope vaccines have a distinctive concept design with additional properties. These properties include: (i) the chances of adverse effects or pathological responses are less because of reduced unwanted components, the introduction of adjuvant capacity enhances chances of long-lasting and immunogenicity responses, (iii) as the final formation has multiple epitope the spectra of targeted viruses is expanded, (iv) they are capable of producing strong cellular and humoral immunity simultaneously, (v) the epitopes in multi-epitope peptide can be recognized by T cell receptors (TCRs) of various T-cell subsets [19]. Keeping in view the broad applicability of multi-epitope vaccine, in this work we explored the complete proteome of MERS-CoV exoproteome and secretome for cytotoxic T lymphocyte (CTL) and helper T lymphocyte (HTL) epitopes containing B-cell epitopes. Additionally, molecular docking, molecular dynamics simulation and MM-PBSA binding free energies were performed to underpin multi-epitope peptide binding pose and interactions with TLR3 receptor proteins.

Materials and methods

Retrieval of MERS-CoV proteome and subtractive proteomics

The complete reference proteome of MERS-CoV (Proteome ID: UP000139997) was retrieved from Swiss-Prot reviewed universal protein knowledgebase database (Uniprot) [28] and subjected to subtractive proteomics pipeline for identification of potential vaccine candidates [29]. First in the process, proteins subcellular localization was predicted using an online tools: CELLO2GO [30] tool with E-value set at 0.001 and Virus-PLoc server (http://www.csbio.sjtu.edu.cn/bioinf/virus/). Proteins having localization in extracellular matrix and membrane were considered and analyzed further along the framework. All the screened proteins were evaluated for redundancy in Cluster Database at High Identity with Tolerance (CD-Hit) with sequence identity threshold set at 50%. [31] Next, comparative homology analysis of the filtered proteins with the human proteome (H. sapiens, taxid: 9606) was performed with parameters of E-value: 0.005, bit score: ≥100 and sequence identity: ≤30% in online BlASTp tool of the National Center for Biotechnological Information (NCBI) [32,33]. To avoid any homology of the protein sequence with the beneficial bacteria of human gut, a subsequent BlASTp was run against Lactobacillus rhamnosus (taxid: 47715) taking the identity sequence cut-off ≤ 30%, bit score of 100 and E-value of 0.005 [34,35].

Prediction of epitopes

The CTL epitopes for the proteins were predicted using NetCTL 1.2 server (http://www.cbs.dtu.dk/services/NetCTL/). NetCTL 1.2 server combines prediction of C-terminal cleavage, binding affinity for reference set of MHC class I which is restricted to 12 MHC class I super type, and TAP transport efficiency. The prediction of MHC peptide binding, proteasome cleavage event, TAP transport efficiency is done using neural networks, NetChop neural networks, and weight matrix based methods, respectively. Predicted CTL epitopes with overall combine score of 1 were selected. CTL epitopes from the pathogen proteins will elicit cell mediated immunity for inhibiting the growth and development of the pathogen and produce memory cells for remembering the pathogen in future encounters [36,37]. The predicted CTL epitopes were then subjected to VaxiJen to evaluate antigenicity of the epitopes [38]. Those with values greater >0.4 were considered antigenic and considered only. The response from Helper T-lymphocyte (HTL) epitopes constitute the major part of cellular immunity and aid in producing various cytokines and immune cell to clear the pathogen [39]. The HTL epitopes induce both CTL and HTL immune responses by secreting different lymphokines and could be crucial part of prophylactic and immunotherapeutic vaccine [40]. HTL epitopes interacting with a reference set of HTL alleles were predicted using IEDB MHC-II epitope prediction module keeping the available parameters as default [41]. All predicted epitopes further subjected to allergenicity, toxicity and IFN-gamma inducing peptides prediction by using online AllergenFP v.1.0 [42], ToxinPred [43], and IFNepitope server [44], respectively. The predicted epitopes were ranked based percentile rank score: lower score describe higher affinity binding of the epitope for HTL binding. The predicted HTL epitopes for its affinity to activate T1 type immune response were cross-validated using IFN epitope server [44].

Multi-epitope peptide vaccine construction

The shortlisted conserved epitopes were then linked to each other through a flexible linker AAY. β-defensin was used as an adjuvant to recruit naive T-cells and immature dendritic cells to the infection site [45]. The CTL-epitopes peptide is linked to β-defensin at the N-terminal using EAAAK linker while the CTL and HTL epitopes were linked to each other AAY and GPGPG linker, respectively. The structure of the construct was predicted using Raptor X [46]and ab ignition I-tasser [47]. The physicochemical parameters of the vaccine construct were evaluated using ProtParam server [48]. The generated structures of the multi-epitope peptide were then utilized in PDBsum to create Ramachandran plot for the structures [49].The antigenicity and allergenicity were predicted by employing VaxiJen [38] and AlgPred servers [50], respectively.

B-cell epitopes assessment

Identification of B-cell epitopes is vital in vaccine designing considering the fact that it leads to production of B lymphocytes that differentiate into antibody producing plasma cells and memory cells [51]. B-cell epitopes of 20 amino acids long in the input protein sequences were identified using BCPREDS [52]. The epitope specificity threshold was allowed as default by 75%.

Complementary DNA analysis for cloning and expression

Reverse translational analysis was carried out using Codon Usage Wrangler Tool (https://www.mrc-lmb.cam.ac.uk/ms/methods/codon.html) to generate optimized cDNA. The cDNA sequence was then used in GeneScript to compute GC content and codon adaptation index (CAI) score. Ideally, the GC content of a sequence should be in between 30% to 70%, while the CAI score of 1 is considered good.

Molecular docking with TLR3

Multi-epitope peptide vaccine sequence was then used in molecular docking studies in PatchDock [53]. TLR3 (PDB ID, 1ZIW) was used as receptor proteins in the process. The best predate complexes were subsequently subjected to FireDock for refinement [54]. The affinity of the vaccine construct for TLR3 was also validated by ClusPro 2.0 docking software [55].

In silico immune simulation of the vaccine construct

The vaccine construct immunogenic potential was elucidated using C-immune server [56]. The simulation parameters were treated as: 10 number of antigen dose, random seed (12345), volume of simulation 100, number of simulation steps,1000, HLA alleles used (MHC class I A0101 allele, B MHC class I B0702, DR MHC class II DRB1_0101 allele), and time step of injection employed is 1.

Molecular dynamics simulation

MD simulation was run for 120 ns on the most suitable docked solution of TLR3-MEPVC. AMBER16 was used to execute this MD simulation [57]. The ff14SB force field was used to parameterize the complex [58]. The complex was solvated into a TIP3P water box. Na + ions were added to the system to counterbalance its charge. Hydrogen atoms of the system were minimized for 500 steps, solvation box and carbon alpha atoms for 1000 steps, and all non-heavy atoms for 300 steps. Later on, in an attempt to manage the system temperature, it was heated to 300 K (NVT) for 20 ps through langevin dynamics [59] where a 2 femtosecond time step long restraint of 5 kcal/mol-A2 on carbon alpha atoms was permitted. System was then unrestrained for 100 ps while being equilibrated. NPT ensemble was put in practice for 50 ps to manage the pressure on the system. Finally, simulation run of 120 ns was achieved at a time step of 2 femtoseconds. AMBER CPPTRAJ [60] was then used to examine the trajectories for to evaluate complex dynamics and stability.

Calculation of binding free energies

MMPBSA [55].py package was used to predict binding free energies of the system [61]. A total of 100 frames were derived from simulation trajectories to be employed in MMGBSA (Molecular Mechanics Generalized Born Surface Area) and MMPBSA (Molecular Mechanics Poisson–Boltzmann Surface Area) [62]. The examination of the free energy difference between solvated phase and gas phase of the system is the ultimate goal of this assay. The determination of net binding free energy was carried out as, The binding free energy calculation was done on 100 frames extracted from complete length of MD simulation as well as from last 20-ns to thoroughly validate intermolecular affinity and stability. Entropy of the system was estimated using a bash script following the protocol provided by Duan et al., 2016 [63].

Results and discussion

Subtractive proteomics

The complete MERS-CoV proteome contains 13 proteins in total, namely: Replicase polyprotein 1ab, Replicas polyprotein 1a, Membrane protein, Envelope small membrane protein, Non-structural protein ORF4b, Non-structural protein ORF3, Non-structural protein ORF5, Non-structural protein ORF4a, Nucleoprotein, and Spike glycoprotein. These proteins were first investigated in subcellular localization of subtractive proteomics to screen only those candidates that are exposed for direct interaction with the host cells. Such proteins identification represent an important goal as they are involved in virulence and pathogenesis and their vital importance from stimulating immune responses [64]. Only proteins that are localized to either extracellular (exoproteome) or outer membrane (secretome) were opted in subcellular localization filter. Five proteins: Membrane protein, Envelope small membrane protein, Non-structural protein ORF3, Non-structural protein ORF5, Spike glycoprotein were shortlisted in this analysis as highlighted in bold in Table 1 . Next in the framework, these proteins were subjected non-redundancy check. In this, non-redundant dataset was only considered. All the five proteins mentioned above were non-redundant which means that each protein represented in the dataset is only singly repeated [35]. The comparative homology check with the humans revealed no hit, indicating highly non-similarity between these virus proteins and the human proteome. This is quite vital in avoiding the auto-immune responses and designing a safe vaccine [65,66]. Lastly, a comparative homology analysis between the MERS-CoV shortlisted proteins and the beneficial gut bacterium, Lactobacillus rhamnosus, was performed to eliminate the risk of accidental inhibition of good bacteria in the host. Bacteria of such type play profound role in improving functions of the host immune system, aid in digestion of food for extracting vital nutirents, and prevent the growth of hostile infection causing bacteria etc. [34]. All the five proteins were noticed to show no homology to the said representative gut bacterium proteome allowing the safe use of the filtered proteins epitopes.
Table 1

Proteins that constitute the proteome of the MERS-CoV. Bold highlighted proteins are the one selected in subcellular localization analysis.

Protein IDProteinCELLO2GO (Prediction score)Virus-PLoc
sp|K9N7C7|R1AB_CVEMCReplicase polyprotein 1abPlasma membrane (2.35)Plasma membrane
>sp.|K9N638|R1A_CVEMCReplicase polyprotein 1aPlasma membrane (2.54)Nucleus
>sp|K9N7A1|VME1_CVEMCMembrane proteinPlasma membrane (4.95)Plasma membrane
>sp|K9N5R3|VEMP_CVEMCEnvelope small membrane proteinPlasma membrane (3.33)Plasma membrane
>sp.|K9N643|ORF4B_CVEMCNon-structural protein ORF4bMitochondrial (1.63)Plasma membrane
>sp|K9N796|ORF3_CVEMCNon-structural protein ORF3Extracellular (3.62)Plasma membrane
>sp|K9N7D2|ORF5_CVEMCNon-structural protein ORF5Plasma membrane (4.94)Plasma membrane
>sp.|K9N4V0|ORF4A_CVEMCNon-structural protein ORF4aExtracellular (2.21)Cytoplasm
>sp.|K9N4V7|NCAP_CVEMCNucleoproteinNucleus (3.07)Nucleus
>sp|K9N5Q8|SPIKE_CVEMCSpike glycoproteinPlasma membrane (3.31)Plasma membrane
Proteins that constitute the proteome of the MERS-CoV. Bold highlighted proteins are the one selected in subcellular localization analysis.

Prediction of cytotoxic T-lymphocyte epitopes

Cytotoxic T cell lymphocytes (also known as Killer T cells) are important component of immune system for tumor surveillance and against intracellular pathogens including bacteria, protozoan and viruses [36]. When these cells recognized its antigen, it become activated and follow three step mechanisms to kill the infected or malignant cells [67]. First, it secretes cytokines primarily IFN-γ and TNF-α which have antiviral and antitumor effects [68]. Secondly, it produces and release cytotoxic granules. These granules have perforin that make the target cell membrane porous and granzymes which being serine proteases hydrolyze intracellular proteins, block viral proteins synthesis and eventually leads to apoptosis [67]. All the five proteins shortlisted in the previous step was analyzed for 9mer cytotoxic T-cell receptor epitopes. Briefly, 5 CTL epitopes were obtained for membrane protein, 3 for envelop small membrane protein, 3 for non-structural protein ORF3, 4 for non-structural protein ORF7 and 38 for the spike protein. Among these predicted epitopes, we found only 4 antigenic epitopes for membrane protein, 0 for envelop small membrane protein, 1 for non-structural protein ORF3 and ORF5, and 17 for spike protein. Only antigenic epitopes were considered while non-antigenic were discarded. The selected epitopes are also non-allergic, non-toxic and are IFN-gamma inducing peptide. Predicted CTL epitopes for the screened 5 subunit vaccine proteins are shown in Table 2 .
Table 2

CTL epitopes for shortlisted 5 proteins with antigenicity scores.

ProteinPeptideVaxiJen ScoreResultAllerganicityToxicity
Membrane proteinLLITIVLQY0.8719Probable ANTIGENProbable non-allergen−1.09(Non-Toxin)
ITIVLQYGY1.2147Probable ANTIGENProbable-non-allergen−0.65(Non-Toxin)
ALSIFSAVY0.1615Probable NON-ANTIGENProbable non-allergen−1.37(Non-Toxin)
VSAMMWISY0.6059Probable ANTIGENProbable non-allergen−0.79(Non-Toxin)
GTNSGVAIY0.4620Probable ANTIGENProbable non-allergen−1.16(Non-Toxin)
Envelope small membrane proteinMTGFNTLLV0.0681Probable NON-ANTIGENProbable non-allergen−1.41(Non-Toxin)
TLLVQPALY0.2772Probable NON-ANTIGENProbable allergen−1.23(Non-Toxin)
LVQPALYLY0.0512Probable NON-ANTIGENProbable non-allergen−0.87(Non-Toxin)
Non-structural protein ORF3VTAFSKPLY0.2377Probable NON-ANTIGENProbable non-allergen−1.12(Non-Toxin)
AQADTAGLY0.1360Probable NON-ANTIGENProbable non-allergen−0.66(Non-Toxin)
SVNLFDVGY1.2493Probable ANTIGENProbable non-allergen−1.01(Non-Toxin)
Non-structural protein ORF5STDSIVFTY0.3229Probable NON-ANTIGENProbable non-allergen−0.91(Non-Toxin)
FTYIPASGY0.1568Probable NON-ANTIGENProbable allergen−0.89(Non-Toxin)
MVLYFLVLY0.1428Probable NON-ANTIGENProbable allergen−0.81(Non-Toxin)
SSTYIECTY0.8016Probable ANTIGENProbable non-allergen−0.30(Non-Toxin)
Spike glycoproteinFLLTPTESY0.2812Probable NON-ANTIGENProbable allergen−1.38(Non-Toxin)
EVDIQQTFF0.8992Probable ANTIGENProbable non-allergen−1.29(Non-Toxin)
VSKADGIIY0.1659Probable NON-ANTIGENProbable allergen−1.19(Non-Toxin)
TYSNITITY1.5697Probable ANTIGENProbable non-allergen−1.10(Non-Toxin)
ITYQGLFPY0.9146Probable ANTIGENProbable non-allergen−0.89(Non-Toxin)
YQGDHGDMY0.2627Probable NON-ANTIGENProbable allergen−0.73(Non-Toxin)
TSATIRKIY0.1760Probable NON-ANTIGENProbable allergen−1.32(Non-Toxin)
FSDGKMGRF1.0567Probable ANTIGENProbable non-allergen−0.24(Non-Toxin)
NSYTSFATY0.2171Probable NON-ANTIGENProbable non-allergen−0.96(Non-Toxin)
ATDCSDGNY0.7838Probable ANTIGENProbable non-allergen−0.19(Non-Toxin)
ASLNSFKEY−0.155Probable NON-ANTIGENProbable non-allergen−0.50(Non-Toxin)
ITEDEILEW0.1074Probable NON-ANTIGENProbable allergen−1.21(Non-Toxin)
QTAQGVHLF0.3715Probable NON-ANTIGENProbable allergen−0.98(Non-Toxin)
GVHLFSSRY0.1530Probable NON-ANTIGENProbable non-allergen−0.54(Non-Toxin)
FSSRYVDLY1.0488Probable ANTIGENProbable non-allergen−0.44(Non-Toxin)
YVDLYGGNM0.2930Probable NON-ANTIGENProbable allergen−0.81(Non-Toxin)
QSDRKAWAA0.2840Probable NON-ANTIGENProbable allergen−0.87(Non-Toxin)
LLDFSVDGY0.0436Probable NON-ANTIGENProbable allergen−0.95(Non-Toxin)
DLSQLHCSY0.9444Probable ANTIGENProbable non-allergen−0.83(Non-Toxin)
SFDVESGVY0.9858Probable ANTIGENProbable non-allergen−1.33(Non-Toxin)
LSGTPPQVY0.2591Probable NON-ANTIGENProbable non-allergen−0.96(Non-Toxin)
PAAIASNCY0.5141Probable ANTIGENProbable non-allergen−0.61(Non-Toxin)
CYSSLILDY0.4038Probable ANTIGENProbable non-allergen−0.65(Non-Toxin)
YSSLILDYF0.0327Probable NON-ANTIGENProbable allergen−1.03(Non-Toxin)
TTITKPLKY0.1377Probable NON-ANTIGENProbable allergen−1.48(Non-Toxin)
ITKPLKYSY0.6911Probable ANTIGENProbable non-allergen−1.77(Non-Toxin)
STVWEDGDY−0.0613Probable NON-ANTIGENProbable non-allergen−0.73(Non-Toxin)
MTEQLQMGF1.2508Probable ANTIGENProbable non-allergen−1.40(Non-Toxin)
YYSDDGNYY−0.0986Probable NON-ANTIGENProbable allergen−0.68(Non-Toxin)
YSDDGNYYC0.1491Probable NON-ANTIGENProbable allergen−0.52(Non-Toxin)
VSVPVSVIY0.0758Probable NON-ANTIGENProbable allergen−1.15(Non-Toxin)
HISSTMSQY0.8394Probable ANTIGENProbable non-allergen−0.92(Non-Toxin)
QVDQLNSSY0.4655Probable ANTIGENProbable non-allergen−0.68(Non-Toxin)
FSFGVTQEY1.6924Probable ANTIGENProbable non-allergen−1.30(Non-Toxin)
RSAIEDLLF0.1712Probable NON-ANTIGENProbable allergen−1.01(Non-Toxin)
VVNAPNGLY0.0211Probable NON-ANTIGENProbable allergen−0.68(Non-Toxin)
WSYTGSSFY1.0625Probable ANTIGENProbable non-allergen−1.27(Non-Toxin)
NTTLLDLTY1.5316Probable ANTIGENProbable non-allergen−1.40(Non-Toxin)
CTL epitopes for shortlisted 5 proteins with antigenicity scores.

Prediction of helper T-lymphocyte epitopes

T helper lymphocytes or CD4+ cells are important player of the immune system by instigating and shaping adaptive immune system. T helper lymphocytes recognize peptides displayed on the MHC class II molecules. They release cytokines that aid in activating several other immune cells. They are essential for B cell antibody class switching, enhancing bactericidal activity of macrophages, and growth and activation of cytotoxic T cells [51,69]. The five proteins were subjected to MHC-II epitope prediction of 15mer length as shown in Table 3 . Only epitopes of lowest percentile score were selected and as such epitopes with percentile score < 3 were opted only. Using this criterion, we predicted 3 HTL epitopes for membrane protein, 9 for envelope small membrane protein and non-structural protein ORF3, 25 for non-structural protein ORF5 and 31 for the spike protein. Antigenicity evaluation revealed 0 epitopes for membrane protein, envelope small membrane protein and non-structural protein ORF5 4 and 3 respectively, 6 for non-structural protein ORF3 and 17 for spike proteins. The selected epitopes besides being antigenic are also non-allergic, non-toxic and are IFN-gamma inducing peptide.
Table 3

HTL epitopes for shortlisted 5 proteins with antigenicity scores.

ProteinStart amino acidEnd amino acidPeptidePercentile scoreVaxijen antigenicityAllergenicityToxicity
Membrane protein1630SQIISGIVAAVSAMM3.020.1060 (Probable non-antigen)Probable allergen−1.13 (Non-Toxin)
1731QIISGIVAAVSAMMW3.020.1324 (Probable non-antigen).Probable non-allergen−1.07(Non-Toxin)
1832IISGIVAAVSAMMWI4.570.1871 (Probable non-antigen).Probable non-allergen−0.73(Non-Toxin)
Envelope small membrane protein4458MTGFNTLLVQPALYL0.320.2107 (Probable non-antigen)Probable non-allergen−1.52(Non-Toxin)
4357CMTGFNTLLVQPALY0.710.1828 (Probable non-antigen).Probable non-allergen−1.20(Non-Toxin)
4559TGFNTLLVQPALYLY0.880.1509 (Probable non-antigen)Probable non-allergen−1.15(Non-Toxin)
4256QCMTGFNTLLVQPAL1.240.0771 (Probable non-antigen).Probable non-allergen−1.28(Non-Toxin)
4660GFNTLLVQPALYLYN1.710.1781 (Probable non-antigen).Probable non-allergen−1.26(Non-Toxin)
2741LLVCMAFLTATRLCV2.270.6923 (Probable antigen).Probable non-allergen−0.67(Non-Toxin)
2842LVCMAFLTATRLCVQ2.511.0320 (Probable antigen).Probable non-allergen−0.56(Non-Toxin)
2640TLLVCMAFLTATRLC2.740.6878 (Probable antigen).Probable non-allergen−1.03(Non-Toxin)
2539ITLLVCMAFLTATRL2.910.6608 (Probable antigen).Probable non-allergen−1.30(Non-Toxin)
Non-structural protein ORF3620PPTLLLVFSLSLLVT0.960.5922 (Probable antigen).Probable non-allergen−1.25 Non-Toxin
721PTLLLVFSLSLLVTA0.960.5642 (Probable antigen).Probable non-allergen−1.44(Non-Toxin)
519RPPTLLLVFSLSLLV1.360.4001 (Probable antigen).Probable non-allergen−1.12(Non-Toxin)
923LLLVFSLSLLVTAFS1.580.5310 (Probable antigen).Probable non-allergen−1.36(Non-Toxin)
418QRPPTLLLVFSLSLL1.810.5021 (Probable antigen).Probable non-allergen−1.13(Non-Toxin)
822TLLLVFSLSLLVTAF1.810.4190 (Probable antigen).Probable non-allergen−1.45(Non-Toxin)
317VQRPPTLLLVFSLSL2.050.3279 (Probable non-antigen).Probable non-allergen−1.17(Non-Toxin)
1428SLSLLVTAFSKPLYV2.180.2561 (Probable non-antigen).Probable non-allergen−1.48(Non-Toxin)
1327FSLSLLVTAFSKPLY4.050.2759 (Probable non-antigen).Probable non-allergen−1.45(Non-Toxin)
Non-structural protein ORF51125LYFLVLYNFLLAIVL0.040.4775 (Probable antigen).Probable non-allergen−1.11(Non-Toxin)
1024VLYFLVLYNFLLAIV0.140.3188 (Probable non-antigen).Probable non-allergen−1.11(Non-Toxin)
1226YFLVLYNFLLAIVLV0.140.5150 (Probable antigen).Probable non-allergen−1.32(Non-Toxin)
1327FLVLYNFLLAIVLVN0.140.5732 (Probable antigen).Probable non-allergen−1.40(Non-Toxin)
721RTMVLYFLVLYNFLL0.280.0324 (Probable non-antigen).Probable allergen−0.87(Non-Toxin)
822TMVLYFLVLYNFLLA0.280.2192 (Probable non-antigen).Probable non-allergen−0.93(Non-Toxin)
923MVLYFLVLYNFLLAI0.280.2900 (Probable non-antigen).Probable non-allergen−1.18(Non-Toxin)
3549PASGYVAALAVNVCL0.770.0202 (Probable non-antigen).Probable non-allergen−0.88(Non-Toxin)
3650ASGYVAALAVNVCLI0.770.1282 (Probable non-antigen).Probable non-allergen−0.85(Non-Toxin)
3751SGYVAALAVNVCLIP1.060.3421 (Probable non-antigen).Probable non-allergen−0.85(Non-Toxin)
3347YIPASGYVAALAVNV1.150.1243 (Probable non-antigen).Probable non-allergen−1.19(Non-Toxin)
3448IPASGYVAALAVNVC1.33−0.1842 (Probable non-antigen).Probable allergen−1.18()
3044DSVPLHIIAPSSLIV1.810.2266 (Probable non-antigen).Probable non-allergen−0.90(Non-Toxin)
2842SIVFTYIPASGYVAA2.050.3680 (Probable non-antigen).Probable non-allergen−1.12(Non-Toxin)
2943IVFTYIPASGYVAAL2.050.2499 (Probable non-antigen).Probable non-allergen−0.84(Non-Toxin)
2741DSIVFTYIPASGYVA2.270.1703 (Probable non-antigen).Probable allergen−1.06(Non-Toxin)
923RSHFIRVSTVSSHGM2.510.1042 (Probable non-antigen).Probable non-allergen−1.08(Non-Toxin)
1024SHFIRVSTVSSHGMV2.510.1973 (Probable non-antigen).Probable non-allergen−1.11(Non-Toxin)
822MRSHFIRVSTVSSHG2.740.0550 (Probable non-antigen).Probable non-allergen−1.43(Non-Toxin)
721DMRSHFIRVSTVSSH3.360.3701 (Probable non-antigen).Probable non-allergen−1.26(Non-Toxin)
3145FTYIPASGYVAALAV3.490.2843 (Probable non-antigen).Probable non-allergen−1.16(Non-Toxin)
2943RDSVPLHIIAPSSLI3.880.2191 (Probable non-antigen).Probable non-allergen−1.07(Non-Toxin)
3044VFTYIPASGYVAALA3.95−0.0681 (Probable non-antigen).Probable non-allergen−1.13(Non-Toxin)
2640TDSIVFTYIPASGYV4.160.0533 (Probable non-antigen).Probable allergen−0.97(Non-Toxin)
2539STDSIVFTYIPASGY4.720.0749 (Probable non-antigen).Probable non-allergen−0.89(Non-Toxin)
Spike protein1428QSIFYRLNGVGITQQ0.390.7573 (Probable antigen).Probable non-allergen−2.15(Non-Toxin)
1529SIFYRLNGVGITQQV0.620.7288 (Probable antigen).Probable non-allergen−2.30(Non-Toxin)
418QQLVRSESAALSAQL0.620.3902 (Probable non-antigen).Probable non-allergen−1.16(Non-Toxin)
4357IVSFVVNAPNGLYFM0.710.1671 (Probable non-antigen).Probable non-allergen−0.39(Non-Toxin)
3852YIWLGFIAGLVALAL0.770.7583 (Probable antigen).Probable non-allergen−0.79(Non-Toxin)
4054WLGFIAGLVALALCV0.770.4240 (Probable antigen).Probable non-allergen−0.74(Non-Toxin)
4155LGFIAGLVALALCVF0.770.4118 (Probable antigen).Probable non-allergen−0.84(Non-Toxin)
1327AQSIFYRLNGVGITQ0.790.6283 (Probable antigen).Probable non-allergen−1.97(Non-Toxin)
4660VYDTIKYYSIIPHSI0.880.2157 (Probable non-antigen).Probable non-allergen−1.09(Non-Toxin)
1226FAQSIFYRLNGVGIT0.960.4940 (Probable antigen).Probable non-allergen−1.77(Non-Toxin)
3953IWLGFIAGLVALALC0.960.4564 (Probable antigen).Probable non-allergen−0.77(Non-Toxin)
1630IFYRLNGVGITQQVL1.150.6444 (Probable antigen).Probable non-allergen−2.29(Non-Toxin)
4256GFIAGLVALALCVFF1.150.2054 (Probable non-antigen).Probable allergen−0.59(Non-Toxin)
519QLVRSESAALSAQLA1.330.5139 (Probable antigen).Probable non-allergen−1.22(Non-Toxin)
1125PFAQSIFYRLNGVGI1.360.4443 (Probable antigen)Probable non-allergen−1.66(Non-Toxin)
4458VSFVVNAPNGLYFMH1.710.4899 (Probable antigen).Probable allergen−0.43(Non-Toxin)
317AQQLVRSESAALSAQ1.810.4991 (Probable antigen).Probable non-allergen−0.91(Non-Toxin)
620LVRSESAALSAQLAK1.90.4473 (Probable antigen).Probable non-allergen−1.02(Non-Toxin)
4256HIVSFVVNAPNGLYF2.180.1797 (Probable non-antigen).Probable non-allergen−0.67(Non-Toxin)
3751WYIWLGFIAGLVALA2.270.6569 (Probable antigen).Probable non-allergen−0.72(Non-Toxin)
1731FYRLNGVGITQQVLS2.550.6337 (Probable antigen).Probable non-allergen−2.25(Non-Toxin)
2842DFNLTLLEPVSISTG2.911.5583 (Probable antigen).Probable non-allergen−1.73(Non-Toxin)
216VAQQLVRSESAALSA3.090.3358 (Probable non-antigen).Probable non-allergen−1.11(Non-Toxin)
2135HGDMYVYSAGHATGT3.49−0.1383 (Probable non-antigen).Probable non-allergen−1.27(Non-Toxin)
2236GDMYVYSAGHATGTT3.490.0241 (Probable non-antigen).Probable allergen−0.99(Non-Toxin)
4559SFVVNAPNGLYFMHV3.630.3855 (Probable non-antigen).Probable non-allergen−0.48(Non-Toxin)
2337DMYVYSAGHATGTTP3.720.1080 (Probable non-antigen).Probable non-allergen−1.03(Non-Toxin)
923EGGGWLVASGSTVAM3.950.1129 (Probable non-antigen).Probable non-allergen−1.03(Non-Toxin)
1024GGGWLVASGSTVAMT3.950.2673 (Probable non-antigen).Probable non-allergen−1.12(Non-Toxin)
4357FIAGLVALALCVFFI4.160.2672 (Probable non-antigen).Probable non-allergen−0.60(Non-Toxin)
3953NKFNQALGAMQTGFT4.770.3893 (Probable non-antigen).Probable allergen−0.79(Non-Toxin)
HTL epitopes for shortlisted 5 proteins with antigenicity scores.

Multi-epitope peptide construction

A multi-epitope peptide of 874 residues containing sequence conserved epitopes of both T cell immunity (CTL and HTL) and B cell immunity was designed. This design was based on the fact that individual peptide epitope is weakly immunogenic, therefore, fusing the screened epitopes to make a multi-epitope peptide could potentially enhance their immunogenicity capacity. Further, linking this peptide to an adjuvant can further increase the chance of good immune presentation [19]. In total, 53 epitopes were predicted for cytotoxic T cell immunity among which 19 epitopes were found antigenic and only considered for multi-epitope vaccine construct designing. Similarly, 77 HTL epitopes were predicted among which 30 were antigenic and considered. The CTL epitopes were joined together using AAY linkers. These linkers possess the cleavage sites for proteasome in the mammalian cells thus aiding in the production of natural epitopes and stopping the generation of junctional epitopes and by this way enhancing the presentation of the epitopes presentation to the host immune system [70]. The HTL epitopes were interconnected to each other through GPGPG linker for adequate presentation and stimulation of T cell responses. The EAAAK linker was employed to link the peptide of CTL epitopes to β-defensin adjuvant at the N-terminal and HTL epitopes at the C-terminal. β-defensin is a potent immune adjuvant and functions by stimulating the production of lymphokines which subsequently cellular immunity and production of antigen specific immunoglobulin [45]. The multi-epitope vaccine construct is schematically presented in Fig. 1 .
Fig. 1

Schematic presentation of the multi-epitope vaccine construct design.

Schematic presentation of the multi-epitope vaccine construct design. The full sequence of the construct contains large portion of B cell epitopes indicating the construct high potential of activating both arms of the host immunity. The predicted B cell epitope region in the vaccine construct is shown in S-Fig. 1. The average value for the construct is 0.602 with maximum value of 0.713 and minimum value of 0.177. Humoral immunity is a key component of the host immune system in clearing the microbial pathogen in extracellular fluid and block the spread of intracellular infection.

Allergenicity and antigenicity prediction of multi-epitope peptide construct

To ensure the non-allergic responses of the vaccine construct upon administration, the allergenicity of the construct was evaluated [71]. The prediction was based on SVM that securitized the amino acids composition of the input sequence for allergic sequences. The construct sequence was declared as non-allergen with score of −0.80559987 (cut-off = −0.4). The percent of positive predictive value deduced was 0% in contrast to negative predictive value that was equal to 0%. Similarly, the antigenicity of the construct was re-evaluated to certain its immune system evoking potential. The overall antigen prediction score for the vaccine construct is 0.4744 that indicate the probable antigenic behavior of the designed chimeric peptide.

Physiochemical properties of the multi-epitope peptide vaccine construct

The physiochemical properties of the designed construct were predicted to guide the experimentalists in synthesis and understanding the formulation of the vaccine. The molecular weight estimated for the construct is 140 kDa that explain two things: (i) first the medium size of the construct will allow its convenient purification and secondly, small and medium size chimeric proteins harbor appropriate number of epitopes for specific and targeted immune responses against the pathogen thus limiting the chances of allergic and adverse reactions [72]. The vaccines composed of whole organism or a single large protein have large number of antigenic determinants all of which are probably not required as they could led to non-specific immunity [19]. The theoretical pI computed for the vaccine construct is 8.05 indicating its basic nature. The instability index of the protein is estimated as 27.07 and classify it as stable. This instability index tells an estimate of protein stability in a test tube and is based on statistical analysis of 32 stable and 12 unstable proteins [73]. It has been determined that stable nature of proteins is due to the occurrence of certain dipeptides that differ significantly from non-stable. The protein instability index calculated by Protparam assigned a weightage of instability to different 400 dipeptides. Proteins with instability index of higher than 40 are categorized as unstable. The aliphatic index represents relative volume occupied by a protein aliphatic side chains involving amino acid residues: leucine, isoleucine, alanine and valine [74]. It is regarded as a key factor conferring thermostability to globular proteins. The aliphatic index for the query vaccine peptide is 106.10 presenting the high thermostability of the construct. The estimated half-life of the construct in mammalian reticulocytes (in-vitro), yeast (in vivo) and Escherichia coli (in vivo) is 5.5 h, 3 and 2 min, respectively. This half-life tells the predicted time for the protein to be disappeared after its synthesis. The grand average of hydropathicity (GRAVY) is 0.664. Lower the value implies greater hydrophilicity of the protein and has more probability of interacting with water residues. GRAVY is the cumulative hydrophilicity score of all the amino acids divided by total number of protein sequence.

Prediction of multi-epitope peptide secondary and tertiary structure

The secondary structure of the designed multi-epitope vaccine construct consists of 23 (2.7%), 338 (39.8%), 6 3–10 helix (0.7%), and 482 (56.8%) among other. More specifically, the remaining are composed of 3 β-sheets, 3 β-hairpins, 2 beta bulges, 6 strands, 20 helices, 22 helix-helix interactions, 51 β-turns, and 46 γ-turns. All the 874 residues of the input construct were modelled. The best structure was modelled using the A chain of the 4KNH template by Raptor-X with p-value of 3.18e-04. The 3D model of the predicted structure is shown in Fig. 2 .
Fig. 2

Predicted 3D structure of in silico designed multi-epitope peptide vaccine construct.

Predicted 3D structure of in silico designed multi-epitope peptide vaccine construct.

Ramachandran plot analysis of the multi-epitope peptide construct

Ramachandran diagram or Ramachandran plot is a method of visualizing Phi or φ (the bonds between nitrogen and carbon alpha) and Psi or ψ (the bond between carbon and carbon alpha) of a given polypeptide [75]. The Phi and Psi are also known as Ramachandran or dihederal angles and hold control local structural importance vital for protein folding. Ramachandran plot is a vital assessment factor for evaluating three dimensional structure of a protein. This plot can be divided into four Quadrants (from I to IV) that give important information about secondary structure of a protein. Quadrant I represent conformations that are left handed alpha helices. Quadrant II mostly accommodates beta strands and is the region for most favorable conformations and biggest among all of the regions. Quadrant III is the following biggest region after II and is the place for right handed alpha helices. Lastly, Quadrant IV is the disfavored region and is the area for steric clash conformations. The Ramachandran plot statistics are given in Table 4 . It can be observed that majority of the protein residues i.e. 527 that make 80.5% of the total protein residues are in the most favored regions. This means that most of the residues are not in correct and usual geometry. In count to these regions, additionally allowed and generously allowed regions covers 122 residues (18.6%) and 4 (0.6%), respectively. Only 2 residues (0.3%) were found in the disallowed regions having conformation φ around 0–180 degrees and ψ around −180 to 0 degrees. The number of glycine and proline residues are 123 and 170, respectively. Proline is frequently found to be present in the outlier region of the Ramachandran plot and there is a reason to explain this. The cyclic side chain of proline face constrains upon rotation by its inclusion in pyrrolidine ring thus Phi values are confined around −60. Glycine, on the contrast, lacks beta carbon atom and therefore least steric compared to other amino acids. This support the large area distribution of glycine on the Ramachandran plot and allows conformation forbidden to other amino acids: allowing their occurrence in protein turn regions where others would be sterically hindered. The different parameters of G-factor of the vaccine construct are also provided in the Table 4. G-factor provides information about geometry of proteins that how unusual it is. Values below −0.5 are considered unusual, while those having values less −1.0 are highly unusual. The structure was immediately submitted to GalaxyRefine server to rebuilds the side chains and run side chains repacking followed by structure relaxation [76]. The Ramachandran for the refined vaccine construct is provided in S-Fig. 2. It is quite clear that the structure quality is improved by increasing the percentage of residues in the most favored region. Also, the G-factor value of the structure is improved to 0.00.
Table 4

Structure assessment of multi-epitope peptide vaccine construct.

Ramachandran plot statisticsParametersNumber of residues unrefined/refinedPercentage unrefined/refined
ProCheck analysisMost favored regions (A,B,L)527/58780.5%/ 90.3%
Additional allowed regions [a,b,l,p]122/6418.6%/ 9.8%
Generously allowed regions [~a,~b,~l,~p]4/ 30.6%/0.5%
Disallowed regions [XX]2/10.3%/0.1%
Non-glycine and non-proline residues655/655100.0%/100.0%
End residues (Excluding glycine and proline)1/1
Glycine residues123/123
Proline residues70/70
Total number of residues849/849



ScoreAverage score
G-factorsPhi-psi distribution−0.43/−0.64/0.23
Chi1-chi2 distribution−0.32/−0.10
Chi1 only0.01/0.28
Chi3 & chi40.90/0.47
Omega1.32/0.98
Main-chain bond lengths0.47/−0.11
Main-chain bond angles−0.13/−0.550.12/−0.37
Overall average−0.31/0.00
Structure assessment of multi-epitope peptide vaccine construct.

Disulphide engineering

Disulphide engineering is a key approach in biotechnology to increase thermal stability of protein under study and assist in understanding its dynamics [77]. The introduction of disulphide bonds in the protein is believed to lower the conformational entropy and at the same time increases the free energy of the denatured state [78]. Thus both these actions lead to overall stability of protein fold conformation. Total of 70 pair of residues were found that could be manipulated for the purpose of increasing protein stability. However, only pair of residues with combined energy of >2 kcal/mol were opted. The complete set of residues with potential to be mutated is listed in Table 5 . The wild and mutated multi-epitope vaccine construct are provided in Fig. 3 .
Table 5

Residue pairs shortlisted for disulfide engineering.

Res1 chainRes1 seq #Res1 AARes2 chainRes2 seq #Res2 AAChi3EnergySum B-factors
A97PHEA102MET−86.291.230
A110THRA114ASP109.745.390
A158TYRA190ALA−108.65.630
A229TRPA234SER126.57.660
A229TRPA235SER−100.223.630
A247LEUA251ALA86.043.560
A260ALAA282THR118.892.50
A261PHEA302PHE105.895.820
A263THRA280PHE−106.332.620
A267LEUA278MET−83.193.180
A268CYSA305ALA−112.996.140
A270GLYA275LEU124.916.540
A274GLYA295THR−107.754.370
A277CYSA305ALA−78.111.240
A279ALAA299CYS−76.85.240
A285ARGA300MET−76.394.490
A293PROA311PRO95.374.870
A320CYSA342PHE114.392.370
A321META324LEU123.713.020
A323PHEA340LEU−87.622.830
A336PROA369ALA99.992.840
A348VALA389VAL125.535.90
A350GLYA354GLY102.655.140
A363LEUA384SER87.871.220
A372GLYA375ARG105.764.290
A377PROA408PHE112.753.230
A380LEUA403LEU80.235.290
A387LEUA396LEU111.664.040
A390GLYA395LEU−62.985.650
A400SERA441SER−75.243.780
A448ALAA467ILE−91.35.120
A469LEUA492GLY−85.22.810
A472GLYA491PRO74.274.20
A477LEUA497VAL124.745.550
A499TYRA562ALA841.20
A512GLYA533PRO83.43.10
A514GLYA531PRO−97.545.750
A516SERA537PHE82.222.650
A518PHEA526ILE110.012.550
A522ASNA540LEU97.313.120
A523GLYA794GLY−94.112.480
A523GLYA798TRP−90.232.840
A542GLYA790GLY−113.825.530
A556ILEA590GLY−91.621.660
A559GLYA583VAL79.536.60
A562ALAA582LEU101.052.720
A577GLYA808LEU89.372.670
A599ALAA621ARG−102.213.640
A614GLYA657LEU−80.82.370
A614GLYA661ALA−72.294.590
A632GLYA635PHE121.174.490
A642ARGA662GLY−82.843.570
A650GLYA655ILE88.73.960
A670GLYA726GLY−116.695.070
A671PROA726GLY−66.975.270
A672GLYA726GLY−65.365.590
A699SERA702ALA−92.322.20
A708LEUA717ALA106.123.410
A723ARGA748MET−95.543.870
A737PHEA749HIS103.93.820
A740ASNA807ALA99.135.010
A741ALAA745LEU124.882.640
A752GLYA755ALA−103.2330
A764ALAA826GLN84.384.60
A765ALAA777ARG75.841.420
A765ALAA782ALA−83.13.880
A799LEUA841LEU−90.245.330
A799LEUA842GLU67.855.10
A822GLYA846ILE−77.274.690
A832GLYA835ASP120.012.470
Fig. 3

Disulfide engineering of the multi-epitope vaccine construct.

Residue pairs shortlisted for disulfide engineering. Disulfide engineering of the multi-epitope vaccine construct.

Adaptation of codon usage to Escherichia coli K 12 strain

The codon adaptation tool is a simple way to adjust the codon usage of the input sequence to the selected prokaryotic and eukaryotic system [79,80]. The difference in codon usage of the host from that organism from which the desired sequence stems from plays a major role in minor expression of that sequence. Codon adaptation of the vaccine construct primary sequence to E. coli K12 expression system is vital in ensuring its maximum expression. The codon adaptive index and GC content of the improve sequence are 0.86 and 57.75, respectively. The adaptive score of the vaccine and it's in silico restriction cloning are provided in S-Fig. 3 and Fig. 4 .
Fig. 4

Cloned vaccine construct shown in red into the expression vector.

Cloned vaccine construct shown in red into the expression vector. Molecular docking of the designed vaccine construct with TLR3 as a receptor was accomplished using PatchDock, which generated more than 20 complexes of TLR3 and vaccine construct conformations. The top 10 best predicted solutions were subjected to the fast interaction refinement in molecular docking (firedock) to refine the predicted solutions based the global energy. Lower the global energy of a solution implies the stable formation of complex and vice versa. As such solution 3 with global energy of −24.02 kcal/mol, attractive van der Waals energy of −38.57 kcal/mol, repulsive van der Waals energy of 28.88 kcal/mol, atomic contact energy of 11.49 kcal/mol and hydrogen bonding energy of −4.00 kcal/mol was considered the best predicted complex (Table 6 ). The docked conformation of the TLR3 and construct with respect to each is shown in Fig. 5 . The top complex conformation of the TLR3-MEVC was also understand via ClusPro that complement good binding of the molecules with lowest energy of −1268.5 kcal/mol.
Table 6

Refined complexes of FireDock.

RankSolution numberGlobal energy (kcal/mol)Attractive Van der WaalsRepulsive Van dar WaalsAtomic contact energy (kcal/mol)Hydrogen bonding
13−24.02−38.5728.8811.49−4.00
27−18.39−25.3019.27−2.78−1.87
310−8.94−21.9811.854.64−1.22
44−8.93−15.556.261.15−1.13
59−6.05−6.531.51−2.060.00
66−3.57−17.3410.100.14−0.98
755.95−11.6014.023.330.00
8813.59−4.791.111.08−1.43
9117.90−8.8428.801.84−1.58
1022825.46−44.833622.84−0.61−5.55
Fig. 5

Docked complex of the designed multi-epitope peptide vaccine construct (shown in red) and chain A of TLR3 (shown in yellow).

Refined complexes of FireDock. Docked complex of the designed multi-epitope peptide vaccine construct (shown in red) and chain A of TLR3 (shown in yellow).

Immune simulation

The vaccine construct appears to produce robust IgM antibody response even in the absence of recurrent antigen exposure. The IgM antibody titer seen reach close to 700,000 scales (Fig. 6 ). Among the cytokines and interleukins, high level interferon gamma was reported to the antigen compared to others (Fig. 6). Additionally, appropriate and decent cellular activity with their respective memory from the host immune system were revealed along with proliferating dendritic cells and enhanced activity from macrophages (S-Fig. 4). Different humoral immunity isotypes can be easily noticed, an indicator of isotype switching and memory cell formation (S-Fig. 5). This immune profile leads to a conclusion of a good agreement on natural development of immunity against the virus and protection against the virus.
Fig. 6

The vaccine construct antigen, and antibodies and immune complexes response (left). Cytokines and interleukins concentration in response to the vaccine construct antigen (right).

The vaccine construct antigen, and antibodies and immune complexes response (left). Cytokines and interleukins concentration in response to the vaccine construct antigen (right).

Molecular dynamics simulations

The multi-epitope peptide vaccine construct complex was seen stably docked with the TLR3 receptor and the epitopes are exposed throughout the simulation time for efficient and easy processing by the host immune system. The dynamics of the complex was evaluated through two structural parameters. First, root mean square deviation (rmsd) was investigated that measures the distance between the carbon alpha atoms of superimposed proteins [81]. The average rmsd estimated for the system is 5.2 Å. The system is reported with some major rmsd peaks that after visual inspection found because of the moving loops of the system. This may be an approach towards complex stability as both the interacting molecules are stably docked with each other's. Secondly, we evaluted root mean square fluctuations (rmsf) for the system. RMSF sheds light on the fluctuating residues from there mean position [82]. The average rmsf for the system is 4.6 Å. As mentioned earlier, these variation are because of fluctuating loops of the system which is not altering intermolecular conformations and does not affect overall stability. The rmsd and rmsf plots of the system can be seen in Fig. 7 . Additionally, we performed hydrogen bond analysis for the complex to decipher the strength of intermolecular interactions during simulation time. As can be seen in Fig. 8 , the molecules are involved in good number of hydrogen bonds throughout the simulation time, enhancing stability of the complex.
Fig. 7

RMSD (left) and RMSF (right) for the simulated TLR3-MEPVC complex.

Fig. 8

Hydrogen bonds analysis for TLR3-MEPVC complex.

RMSD (left) and RMSF (right) for the simulated TLR3-MEPVC complex. Hydrogen bonds analysis for TLR3-MEPVC complex.

MM-PBSA binding energies calculation

The docking prediction for the vaccine construct conformation with respect to the TLR3 receptor was validated by performing computationally cost effective and sufficiently reliable MMMGBSA and MMPBSA approaches. Both these approaches were run over trajectories of the MD simulations. It is evident from the Table 7 that robust energies are noticed for the system and the values are quite low indicating a good agreement on the docking findings. The intermolecular ineractions are dominated by both van der Waals and electrostatic energies with energy value of −188.44 kcal/mol and −473.80 kcal/mol, respectively. Summing, both these mentioned values are contributing factor to the significantly low gas phase energy of −626.25 kcal/mol. Opposed to this, non-favorable energy of solvation was noticed for the system (564.3822 kcal/mol in MM-GBSA and 527.0717 kcal/mol in MM-PBSA). The net binding energy of the system in MM-GBSA is −61.8682 kcal/mol whereas in MM-PBSA this energy is −99.1787 kcal/mol. Both these net values support highly stable system and excellent intermolecular binding affinity between the vaccine construct and TLR3 receptor. The net entropy concluded for the system is 57.76 kcal/mol which again suggesting stable nature of the vaccine at the docked site. Additionally, to evaluate intermolecular affinity towards end of the simulation time, MMPB/GBSA of last 20-ns was performed (S-Table 1). The net MMGBSA and MMPBSA in this time period are −77.4 kcal/mol and − 107.4 kcal/mol, respectively.
Table 7

Binding free energies for the TLR3-MEPVC complex.

Energy componentAverageStd. dev.Std. err. of mean
MM-GBSA
VDWAALS−188.44737.47530.7475
EEL−437.803115.87621.5876
EGB587.787115.36731.5367
ESURF−23.40490.93280.0933
DELTA G gas−626.250416.56711.6567
DELTA G solv564.382215.10831.5108
DELTA TOTAL−61.86824.71520.4715



MM-PBSA
VDWAALS−188.44737.47530.7475
EEL−437.803115.87621.5876
EPB547.144815.01121.5011
ENPOLAR−20.07310.49820.0498
EDISPER0.00000.00000.0000
DELTA G gas−626.250416.56711.6567
DELTA G solv527.071714.86251.4863
DELTA TOTAL−99.17877.25850.7258
Binding free energies for the TLR3-MEPVC complex.

Conclusions

This study shows that membrane protein, envelope small membrane protein, non-structural protein ORF3, non-structural protein ORF5 and spike glycoprotein could act as potential subunit vaccine candidates for designing a subunit-based vaccine against MERS-CoV. Similarly, these proteins were mapped for novel set of antigenic B and T cell epitopes that could be used in the design of a multi-epitope peptide vaccine construct to circumvent the limitations of subunit vaccines. Further, we have shown that the designed multi-epitope vaccine construct docking affinity for TLR3 immune receptor, thus maximizing its presentation to the human immune system and ensuring the evoking potential of both humoral and cellular immune responses. The prediction of docking was validated by molecular dynamics simulation and binding free energies that validated complex high stability and intermolecular affinity. Lastly, the use of the subunit vaccine candidates and multi-epitope vaccine construct in experimental in vivo model is highly recommended to underpin their immune protective potential against MERS-CoV and the current pandemic COVID-19. The following are the supplementary data related to this article. Predicted B cell epitope region in the vaccine construct. Ramachandran plot for the 3D structure of multi-epitope peptide vaccine construct. Codon optimization score of the vaccine construct. T cell immunity profile against the vaccine construct antigen. B cell immunity profile against the vaccine construct antigen.

S-Table 1

Binding free energies for the TLR3-MEPVC complex.

Declaration of Competing Interest

The authors declare no conflict of interest.
  75 in total

1.  Correlation between stability of a protein and its dipeptide composition: a novel approach for predicting in vivo stability of a protein from its primary sequence.

Authors:  K Guruprasad; B V Reddy; M W Pandit
Journal:  Protein Eng       Date:  1990-12

2.  FireDock: fast interaction refinement in molecular docking.

Authors:  Nelly Andrusier; Ruth Nussinov; Haim J Wolfson
Journal:  Proteins       Date:  2007-10-01

Review 3.  Middle East respiratory syndrome coronavirus: another zoonotic betacoronavirus causing SARS-like disease.

Authors:  Jasper F W Chan; Susanna K P Lau; Kelvin K W To; Vincent C C Cheng; Patrick C Y Woo; Kwok-Yung Yuen
Journal:  Clin Microbiol Rev       Date:  2015-04       Impact factor: 26.132

4.  Significance of root-mean-square deviation in comparing three-dimensional structures of globular proteins.

Authors:  V N Maiorov; G M Crippen
Journal:  J Mol Biol       Date:  1994-01-14       Impact factor: 5.469

Review 5.  Middle East Respiratory Syndrome: Emergence of a Pathogenic Human Coronavirus.

Authors:  Anthony R Fehr; Rudragouda Channappanavar; Stanley Perlman
Journal:  Annu Rev Med       Date:  2016-08-26       Impact factor: 13.739

Review 6.  Codon usage: nature's roadmap to expression and folding of proteins.

Authors:  Evelina Angov
Journal:  Biotechnol J       Date:  2011-05-12       Impact factor: 4.677

7.  Treatment strategies for Middle East respiratory syndrome coronavirus.

Authors:  Kayvon Modjarrad
Journal:  J Virus Erad       Date:  2016

8.  Global status of Middle East respiratory syndrome coronavirus in dromedary camels: a systematic review.

Authors:  R S Sikkema; E A B A Farag; Mazharul Islam; Muzzamil Atta; C B E M Reusken; Mohd M Al-Hajri; M P G Koopmans
Journal:  Epidemiol Infect       Date:  2019-01       Impact factor: 2.451

9.  GalaxyRefine: Protein structure refinement driven by side-chain repacking.

Authors:  Lim Heo; Hahnbeom Park; Chaok Seok
Journal:  Nucleic Acids Res       Date:  2013-06-03       Impact factor: 16.971

10.  CD-HIT: accelerated for clustering the next-generation sequencing data.

Authors:  Limin Fu; Beifang Niu; Zhengwei Zhu; Sitao Wu; Weizhong Li
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

View more
  5 in total

1.  Modeling and simulations of CoViD-19 molecular mechanism induced by cytokines storm during SARS-CoV2 infection.

Authors:  Zhenhua Yu; R Ellahi; Alessandro Nutini; Ayesha Sohail; Sadiq M Sait
Journal:  J Mol Liq       Date:  2020-11-28       Impact factor: 6.165

Review 2.  Current and prospective computational approaches and challenges for developing COVID-19 vaccines.

Authors:  Woochang Hwang; Winnie Lei; Nicholas M Katritsis; Méabh MacMahon; Kathryn Chapman; Namshik Han
Journal:  Adv Drug Deliv Rev       Date:  2021-02-06       Impact factor: 17.873

3.  Antiviral activities of 4H-chromen-4-one scaffold-containing flavonoids against SARS-CoV-2 using computational and in vitro approaches.

Authors:  Vinit Raj; Jin-Hyung Lee; Jae-Jin Shim; Jintae Lee
Journal:  J Mol Liq       Date:  2022-02-17       Impact factor: 6.633

4.  Insights into coloration enhancement of mercerized cotton fabric on reactive dye digital inkjet printing.

Authors:  Hongzhi Zhao; Kun Zhang; Kuanjun Fang; Furui Shi; Ying Pan; Fuyun Sun; Dezhen Wang; Ruyi Xie; Weichao Chen
Journal:  RSC Adv       Date:  2022-04-05       Impact factor: 3.361

Review 5.  mRNA vaccines in the prevention and treatment of diseases.

Authors:  Yangzhuo Gu; Jiangyao Duan; Na Yang; Yuxin Yang; Xing Zhao
Journal:  MedComm (2020)       Date:  2022-08-25
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.