Literature DB >> 30538505

Structural basis of development of multi-epitope vaccine against Middle East respiratory syndrome using in silico approach.

Sukrit Srivastava^1,2, Mohit Kamthania^1,3, Soni Singh¹, Ajay K Saxena², Nishi Sharma¹.

Abstract

BACKGROUND: Middle East respiratory syndrome (MERS) is caused by MERS coronavirus (MERS-CoV). Thus far, MERS outbreaks have been reported from Saudi Arabia (2013 and 2014) and South Korea (2015). No specific vaccine has yet been reported against MERS.
PURPOSE: To address the urgent need for an MERS vaccine, in the present study, we have designed two multi-epitope vaccines (MEVs) against MERS utilizing several in silico methods and tools.
METHODS: The design of both the multi-epitope vaccines (MEVs) are composed of cytotoxic T lymphocyte (CTL) and helper T lymphocyte (HTL) epitopes, screened form thirteen different proteins of MERS-CoV. Both the MEVs also carry potential B-cell linear epitope regions, B-cell discontinuous epitopes as well as interferon-γ-inducing epitopes. Human β-defensin-2 and β-defensin-3 were used as adjuvants to enhance the immune response of MEVs. To design the MEVs, short peptide molecular linkers were utilized to link screened most potential CTL epitopes, HTL epitopes and the adjuvants. Tertiary models for both the MEVs were generated, refined, and further studied for their molecular interaction with toll-like receptor 3. The cDNAs of both MEVs were generated and analyzed in silico for their expression in a mammalian host cell line (human).
RESULTS: Screened CTL and HTL epitopes were found to have high propensity for stable molecular interaction with HLA alleles molecules. CTL epitopes were also found to have favorable molecular interaction within the cavity of transporter associated with antigen processing. The selected CTL and HTL epitopes jointly cover upto 94.0% of worldwide human population. Both the CTL and HTL MEVs molecular models have shown to have stable binding and complex formation propensity with toll-like receptor 3. The cDNA analysis of both the MEVs have shown high expression tendency in mammalian host cell line (human).
CONCLUSION: After multistage in silico analysis, both the MEVs are predicted to elicit humoral as well as cell mediated immune response. Epitopes of the designed MEVs are predicted to cover large human population worldwide. Hence both the designed MEVs could be tried in vivo as potential vaccine candidates against MERS.

Entities: Chemical

Keywords: MD simulation; MERS; MERS-CoV; Middle East respiratory syndrome; Middle East respiratory syndrome coronavirus; TAP; TLR-3; epitope; human transporter associated with antigen processing; immunoinformatics; molecular docking; molecular dynamics simulation; multi-epitope vaccine; toll-like receptor 3

Year: 2018 PMID： 30538505 PMCID： PMC6254671 DOI： 10.2147/IDR.S175114

Source DB: PubMed Journal: Infect Drug Resist ISSN： 1178-6973 Impact factor: 4.003

Introduction

Middle East respiratory syndrome (MERS) is a respiratory disease caused by Middle East respiratory syndrome coro-navirus (MERS-CoV). MERS involves high fever, cough, difficulty in breathing, chills, chest pain, body aches, sore throat, diarrhea, nausea/vomiting, running nose, renal failure, and pneumonia. The first case of MERS in humans was reported in Saudi Arabia in 2012. After 2012, within a span of only 4 years, MERS-CoV infection was reported from 27 countries.1 Three MERS outbreaks have already occurred in Saudi Arabia (2013 and 2014) and South Korea (2015).2–4 The outbreak reported in South Korea also involved spread of MERS through hospital-to-hospital transit of patients and had the fatality rate as high as 40%. High attack rate and easy spread of MERS indicate an epidemic risk.5–7 To date, no specific vaccine is available for MERS. The steep increase in MERS cases and its high mortality rate demand an urgent need for specific and safe MERS vaccine. Thus far, only little information is known about the pathogenesis of MERS-CoV. Hence, an immunoinformatics approach to thoroughly study and screen immunogenic proteins from the available proteome sequence data of MERS-CoV is essential for vaccine design. The proteome of MERS-CoV consists of several important vaccine candidate and drug target proteins. These proteins are involved in infection and pathogenesis of MERS-CoV to human host cells. Spike (S) glycoprotein, in particular, its receptor-binding domain, is involved in virus and host cell interaction.8 Envelope (E) protein plays an important role in host cell recognition.9 Nucleocapsid (N) protein is involved in RNA binding during ribonucleocapsid formation by MERS-CoV.10 Membrane (M) protein has interferon (IFN)-antagonizing properties, thus reducing IFN levels in infected patients.11 Open reading frame (ORF) proteins also play critical roles in viral infection and pathogenesis. A mutation study of ORFs (ORF3, ORF4a, ORF4b, and ORF5) indicated that ORFs have major implications in viral infection involving disruption of host cell processes, dysregulated IFN pathway activation, and abrupt inflammation.12 Proteins ORF1a (4P16) and ORF1ab (4WUR) are papain-like proteases (PL(pro)) involved in viral infection and are potential targets for the development of antiviral drugs.13 PL(pro) facilitate infection by their proteolytic, deubiquitinating, and deISGylating activities suppressing innate immune response from the host cell. Protein ORF1a (4RSP), a protease (3CLpro), facilitates proteolytic activity during viral infection and replication.14,15 Protein ORF1ab (5WWP) is a helicase protein of MERS-CoV, and it is one of the most conserved proteins among nidoviruses. Protein ORF8b is a highly conserved protein, and during infection, it induces various immune responses.16 In the present study, we propose two multi-epitope vaccine (MEV) designs for MERS. The proposed MEVs are composed of cytotoxic T lymphocyte (CTL) and helper T lymphocyte (HTL) epitopes. Both CTL and HTL MEVs contain overlapping regions of linear B-cell epitopes. Both MEVs also contain human β-defensin-2 (hBD-2) and human β-defensin-3 (hBD-3) as adjuvants to enhance the immunogenic response.17,18

Methodology

To design the MEVs, different in silico tools were used to screen potential CTL, HTL, and B-cell epitopes from 13 MERS-CoV proteins that are involved in virus–host interaction and viral proliferation. All three types of epitopes were studied for overlapping consensus regions. CTL and HTL epitopes showing partial or complete overlapping regions with all or any of the three kinds of epitopes and the epitopes with the highest number of human leukocyte antigen (HLA) allele binders were selected for detailed analysis. The selected CTL and HTL epitopes were further validated for their molecular interaction with their HLA allele binders. Furthermore, the selected CTL epitopes were validated for their molecular interaction with cavity of transporter associated with antigen processing (TAP) to analyze their smooth passage from the cytoplasm to the endoplasmic reticulum (ER) lumen.19 Three-dimensional (3D) structure model for both MEVs was generated, refined, validated, and analyzed for different physiochemical properties. Both CTL and HTL MEV models were further screened to have several B-cell discontinuous epitopes and IFN-γ-inducing epitopes. Because the immune system broadly recognizes pathogens partially due to the involvement of toll-like receptor 3 (TLR-3) and its signaling cascade, both CTL and HTL MEVs were further studied for their molecular interaction with TLR-3.20,21 Moreover, the cDNAs of both MEVs were analyzed and were predicted to be highly expressing in a mammalian host cell line (human) (Figure S1).

MERS-CoV proteins chosen for MEV preparation

For vaccine development, proteins critically involved in virus–host cell interaction, viral proliferation, and host cell disruption were chosen. Thirteen proteins of MERS-CoV proteome chosen for epitope screening included the following: S glycoprotein; E protein; N protein; M protein; ORFs (ORF3, ORF4a, ORF4b, and ORF5); ORF1a (4P16), a papain-like protease (PL(pro)); ORF1a (4RSP), a protease (3CLpro); ORF1ab (4WUR), a PL(pro); ORF1ab (5WWP); and ORF8b. For sequence-based epitope screening, full-length protein sequences of the abovementioned MERS-CoV proteins were retrieved from the NCBI database (National Center for Biotechnology Information, https://www.ncbi.nlm.nih.gov/protein). A total of 2,499 amino acid sequences belonging to different proteins of MERS-CoV of different strains and origins were retrieved. For structure-based epitope screening, available 3D structures of MERS-CoV proteins were retrieved from Protein Data Bank (PDB, http://www.rcsb.org/pdb/home/home.do). For the proteins with no structure available in PDB, homology modeling was performed using the Swiss-model, (http://swissmodel.expasy.org/) (Table S1).22

Screening of potential epitopes

T-cell epitope prediction

Screening of CTL epitope

To screen CTL epitopes, the Immune Epitope Database (IEDB) tool “Proteasomal cleavage/TAP transport/MHC class I combined predictor” (http://tools.iedb.org/processing/) was used.23–25 The total score generated by the tool is a combined score of proteasome, TAP (N-terminal interaction), major histocompatibility complex (MHC), and processing analysis scores. The score is generated using the combination of six methods, namely, Consensus, NN-align, SMM-align, Combinatorial Library, Sturniolo, and NetMHCIIpan. The IC50 (nM) value was also obtained using this IEDB tool. Epitopes with high, intermediate, and least affinity of binding to an HLA allele have IC50 values <50 nM, <500 nM, and <5,000 nM, respectively. Immunogenicity of the screened CTL epitopes was also obtained by the “MHC I Immunogenicity” tool of IEDB (http://tools.iedb.org/immunogenicity/) with all parameters set to default (first, second, and C-terminus amino acids).26 The tool predicts immunogenicity of a peptide MHC complex on the basis of physiochemical properties of amino acids and their position within the short peptide sequence.

Screening of HTL epitopes

To screen HTL epitopes, the IEDB tool “MHC-II Binding Predictions” (http://tools.iedb.org/mhcii/) was used. The percentile rank for each peptide is generated by the combination of three methods (Combinatorial Library, SMM-align, and Sturniolo) by comparing the score of peptide against the scores of other random five million 15-mer peptides from the SwissProt database.27–30 The rank for the Consensus method was generated by the median percentile rank of the three methods.

Population coverage by CTL and HTL epitopes

The “Population Coverage” tool of IEDB (http://tools.iedb.org/population/) was used to analyze the world human population coverage by the shortlisted 28 CTL and 28 HTL epitopes.31 Use of multi-epitopes involving both CTL and HTL epitopes would have the higher probability of larger human population coverage worldwide.

B-cell epitope prediction

Sequence-based B-cell epitope prediction

Protein sequence-based linear B-cell epitopes were predicted by six different prediction methods available at “B Cell Epitope Prediction Tools” tool of IEDB server (http://tools.iedb.org/bcell/); these tools are based on the propensity scale method and physicochemical properties of the antigenic sequence. These methods include “BepiPred Linear Epitope Prediction” (propensity scale method such as hidden Markov model), “Chou & Fasman Beta-Turn Prediction”, “Emini Surface Accessibility Prediction”, “Karplus & Schulz Flexibility Prediction”, “Kolaskar & Tongaonkar Antigenicity”, and “Parker Hydrophilicity Prediction”.32–37

Structure-based B-cell epitope prediction

Two structure-based B-cell epitope prediction methods, namely, DiscoTope 2.0 (Structure-based Antibody Prediction tool; http://tools.iedb.org/discotope/) and Ellipro (Antibody Epitope Prediction tool; http://tools.iedb.org/ellipro/), available in IEDB, were used for linear and discontinuous B-cell epitope prediction.38,39 The ElliPro method is based on the location of the residue in the protein’s 3D structure. The residues lying outside of the ellipsoid covering 90% of the inner core residues of the protein score highest protrusion index (PI) of 0.9 and so on. Discontinuous epitopes predicted by ElliPro are clustered on the basis of distance R in Å between two residues’ centers of mass lying outside the largest possible ellipsoid. The larger the value of R, the larger will be the number of discontinuous epitopes clustered. DiscoTope 2.0 is based on the number of contacts of the residues’ Cα carbon atom with other Cα carbon atoms in the 3D structure within the 10 Å distance and the propensity of a residue to be a part of an epitope.

Characterization of potential epitopes

Epitope conservation analysis

Shortlisted CTL, HTL, and B-cell epitopes were subjected to conservancy analysis by the “Epitope Conservancy Analysis” tool (http://tools.iedb.org/conservancy/) of IEDB among all their respective source protein sequences of MERS-CoV retrieved from the NCBI protein database.40

Epitope toxicity prediction

Toxicity assessment of CTL and HTL epitopes and B-cell epitopes was performed by ToxinPred (http://crdd.osdd.net/raghava/toxinpred/multi_submit.php) analysis. This analysis allows to identify highly toxic or nontoxic peptides. The analysis was performed by the “support vector machine (SVM) (SwissProt)-based” method with all the parameters set to default.41

Overlapping residue analysis

Multiple Sequence Alignment (MSA) analysis using the Clustal Omega tool (https://www.ebi.ac.uk/Tools/msa/clustalo/) of European Bioinformatics Institute was performed for all the shortlisted CTL, HTL, and B-cell epitopes from 13 MERS-CoV proteins.42 MSA by Clustal Omega virtually aligns any number of protein sequences and delivers accurate alignments.

Epitope selected for molecular interaction study with HLA allele and TAP

CTL and HTL epitopes were shortlisted for further in silico analysis on the basis of their overlapping sequence regions among all three types of epitopes (CTL, HTL, and B-cell) or complete overlap among any two types of epitopes or the highest number of HLA allele binders.

Molecular interaction analysis of the selected epitopes with HLA alleles

Tertiary structure modeling of HLA alleles and selected T-cell epitopes

Template-based homology modeling for HLA class I and II allele binders of the shortlisted epitopes was performed by the Swiss-model.22 Protein sequences of HLA class I and II alleles were retrieved from Immuno Polymorphism Database (IPD-IMGT/HLA) (https://www.ebi.ac.uk/ipd/imgt/hla/allele.html).69 Template with high sequence identity was chosen for modeling, and the models thus generated were validated for quality by the Qualitative Model Energy ANalysis (QMEAN) analysis. The QMEAN value is a composite (both global and local [ie, per residue] structure) quality estimate of the generated model.43 Models with an acceptable QMEAN value with a cutoff of −4.0 were chosen for further studies (Table S2). “Natural Peptides Module for Beginners” module of the PEPstrMOD tool (http://osddlinux.osdd.net/raghava/pepstrmod/nat_ss.php) was used to generate tertiary structures of the selected epitopes.44 Prediction was carried out with a simulation time window of 100 ps, and the peptide environment was set to vacuum.

Molecular docking and molecular dynamics (MD) simulation study of the selected epitopes with HLA alleles

AutoDock Tool 4.2 and AutoDock Vina were used for molecular docking study of the selected epitopes and their respective HLA allele binders.45,46 Further, the docked complexes were subjected to MD simulation study by Gromacs 5.1.4 using Optimized Potentials for Liquid Simulations – all atom (OPLS-AA) force field.47,48

Molecular interaction analysis (docking) of the selected CTL epitopes with TAP

Molecular docking study of the shortlisted CTL epitopes with TAP receptor was performed by AutoDock Vina.45,46 For more accurate prediction, instead of homology modeling, the cryo-EM structure of TAP (PDB ID: 5u1d) was used.49 For docking, the antigen from the TAP cavity of the original structure was removed.

Design, characterization, and interaction analysis of MEVs with TLR-3

Design of MEVs

From the screened and shortlisted CTL and HTL epitopes, two MEVs were designed using EAAAK and GGGGS as short peptide rigid and flexible linkers, respectively (Figure 1A and B). To enhance the immune response, hBD-2 (PDB ID: 1FD3, sequence: GIGDPVTCLKSGAICHPVFCPRRYKQIGTCG LPGTKCCKKP) and hBD-3 (PDB ID: 1KJ6, sequence: GII NTLQKYYCRVRGGRCAVLSCLPKEEQIGKCSTRGRK CCRRKK) were used as adjuvants for both MEVs at N and C terminals, respectively.17,18,50–53 Upon lung infection, the expression of hBD-2 and hBD-3 was found to be increased. β-Defensins are involved in chemotactic activity for memory T cells, monocytes, and immature dendritic cells as well as in degranulation of mast cells. Thus, hBDs enhance innate and adaptive immunity and therefore were chosen here as adjuvants for the design of MEVs.

Figure 1

Multi-epitope vaccine design.

Notes: (A) CTL and (B) HTL epitopes were linked by the short peptide linker GGGGS. Human β-defensin-2 and β-defensin-3 were used as adjuvant at N and C terminals, respectively, linked by the short peptide linker EAAAK. Epitopes from different proteins are colored in different colors. C-terminal 6xHis is designated as His tag.

Abbreviations: CTL, cytotoxic T lymphocyte; HTL, helper T lymphocyte.

IFN-γ-inducing epitope prediction

IFN-γ epitopes with potential to induce the release of IFN-γ from CD4+ T cells from both MEVs were predicted by “IFNepitope” server (http://crdd.osdd.net/raghava/ifnepitope/scan.php) using the “Motif and SVM hybrid” (MERCI: Motif-EmeRging and with Classes-Identification, and SVM) approach. The tool generates overlapping sequences from the query protein/antigen and uses it for IFN-γ-inducing epitope prediction. The prediction is based on a dataset of IFN-γ-inducing and IFN-γ-noninducing MHC class II binders.54,55

Allergenicity and antigenicity prediction of MEVs

For both MEVs, AlgPred was used for allergenicity prediction (http://crdd.osdd.net/raghava/algpred/submission.html).56 The AlgPred prediction of allergens is based on similarity of the known epitope with any region of the submitted protein. VaxiJen (http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html) was used to analyze the probability of antigenicity of both MEVs.57 The VaxiJen analysis applies an alignment-free approach that is solely based on the physicochemical properties of the sequences of the submitted proteins.

Physicochemical analysis of MEVs

To analyze the physiochemical properties of CTL and HTL MEVs, ProtParam (https://web.expasy.org/protparam/) was used.58 ProtParam analysis performs empirical investigation for a given protein amino acid sequence.

3D structure modeling and refinement of MEVs

Both the CTL and HTL MEVs were further subjected to 3D structure modeling by RaptorX structure prediction server (http://raptorx.uchicago.edu/StructurePrediction/predict/).59 The quality of the generated model is indicated by its P-value. The P-value is the probability of a predicted model being worse than the best. It indicates relative quality in terms of modeling error by combining global distance test (GDT) and un-normalized GDT, that is, error at each residue. The smaller the P-value, the higher is the quality of the predicted model. The generated MEV models were further refined by ModRefiner and GalaxyRefin.60,70 GalaxyRefine performs repeated structure perturbation along with subsequent structural relaxation by dynamics simulation to refine a protein structure. To avoid breaks in model structures, GalaxyRefine uses the triaxial loop closure method.

In silico validation of refined MEV models

The refined 3D models of both MEVs were then subjected to RAMPAGE analysis for generating a Ramachandran plot (http://mordred.bioc.cam.ac.uk/~rapper/rampage.php).61 The Ramachandran plot shows the residues that form energetically allowed and disallowed dihedral angles psi (ψ) and phi (ϕ), which are calculated on the basis of van der Waal radius of the amino acid side chain.

Discontinuous B-cell epitope prediction from MEVs

Both the designed CTL and HTL MEVs were subjected to discontinuous B-cell epitope prediction to analyze structure-based humoral immunogenic potential of both MEVs using the ElliPro tool.39

Molecular docking and MD simulation study of MEVs and the immunological receptor TLR-3

To study the molecular interaction, the refined models of CTL and HTL MEVs were docked with TLR-3 by PatchDock server (http://bioinfo3d.cs.tau.ac.il/PatchDock/).62–64 The 3D structure of human TLR-3 ectodomain (ECD) was retrieved from PDB databank (PDB ID: 2A0Z). To analyze the stable complex formation, both complexes were subjected to MD simulation by Gromacs 5.1.4 using OPLS-AA.47,48

Analysis of cDNAs of both MEVs for cloning and expression

Optimized cDNAs for both MEVs were generated by the Codon Usage Wrangler Tool with the option of a mammalian host cell line (human) as the expression system (http://www.mrc-lmb.cam.ac.uk/ms/methods/codon.html). Further, the GenScript Rare Codon Analysis Tool (https://www.genscript.com/tools/rare-codon-analysis) was used to analyze the cDNAs of both MEVs. The tool provides GC content, codon adaptation index (CAI), and tandem rare codon frequency for cDNA.65–67

Results and discussion

Screening of potential epitopes

T-cell epitope prediction

Screening of CTL epitope

On the basis of “total score” and acceptably low IC50 (nM) value of epitope-HLA class I allele pairs, 75 CD8+ CTL epitopes were chosen. Later, 28 epitopes were shortlisted with high scoring and larger number of HLA class I allele binders for further studies. Immunogenicity of the screened CTL epitopes was also determined. The higher the immunogenicity score, the greater is the immunogenicity of the epitope (Tables S3 and S4).

Screening of HTL epitopes

Screening of HTL epitopes was performed on the basis of “percentile rank”. Small percentile rank shows the high affinity of the peptide for its respective HLA allele. Initially, from the 13 MERS-CoV proteins, 70 CD4+ T-cell epitopes with the highest percentile rank were screened, and 28 epitopes with high percentile rank and the highest number of HLA class II allele binders were then shortlisted (Tables S5 and S6). Several CTL and HTL epitopes predicted in the present study show overlapping regions with the epitopes detected to induce T-cell responses in previous studies done using peripheral blood mononuclear cells from infected patients.68 Hence, epitopes screened and reported in the present study could be predicted to originate from the highly immunogenic stretch of MERS-CoV protein sequences. In this study, most geographical regions of the world, and in particular, the countries of Middle East, South Asia, East Asia, and Northeast Asia, were included. This study indicates that the combined use of all shortlisted CTL and HTL epitopes would have an average worldwide population coverage of 94.0%, with an SD of 20.19 (Table S7). Initially, a total of 144 B-cell epitopes were screened from 13 MERS-CoV proteins, with epitope length of four or more than four amino acids by the BepiPred Linear Epitope Prediction method. The epitopes predicted by five other methods based on different physicochemical properties showed a significant consensus overlap of amino acid sequences with that of BepiPred Linear Epitope Prediction. From the 144 B-cell epitopes, 12 with the length of 4–19 amino acids were shortlisted (Table S8 and Figure 2).

Figure 2

Overlapping regions of the BepiPred linear B-cell epitopes and the epitopes predicted by other methods.

Notes: This analysis shows a strong consensus between the BepiPred linear B-cell epitopes and the epitopes predicted by other sequence-based and structure-based epitope prediction methods. Different colors are used to highlight overlapping regions of epitopes predicted by the different methods.

Discontinuous and linear epitopes predicted by DiscoTope 2.0 and Ellipro methods showed significant consensus overlap of amino acid sequence with the linear epitopes predicted by the BepiPred linear epitope method (Table S8 and Figure 2). This result confirms that the shortlisted BepiPred linear epitopes are highly immunogenic B-cell epitopes. The conservation analysis of shortlisted 28 CTL, 28 HTL, and 12 B-cell epitopes shows that amino acid sequence conservancy of CTL epitopes varied from 72.7% to 100%, that of HTL epitopes varied from (50% for two epitopes) 68.18% to 100%, and that of B-cell epitopes varied from 85.71% to 99.26% (Tables S3, S5, and S8). This result indicates high conservancy nature of the shortlisted CTL, HTL, and B-cell epitopes. All the shortlisted CTL, HTL, and B-cell epitopes analyzed by the ToxinPred tool were predicted to be nontoxic (Tables S3, S5, and S8). The analysis was based on the ToxinPred main dataset consisting of 1,805 toxic peptides for its prediction. MSA analysis of all the screened CTL, HTL, and B-cell epitopes from the 13 MERS-CoV protein candidates revealed that several epitopes of CTL, HTL, and B-cell have overlapping amino acid sequence regions. Epitopes with two or more than two residues in the overlapping region are shown in Figure 3.

Figure 3

Overlapping CTL, HTL, and B-cell epitopes.

Notes: Overlapping CTL (red), HTL (blue), and B-cell epitopes (green) were sorted out by MSA using Clustal Omega at EBI server. The ringed clusters of epitopes involve all three types of epitopes, epitopes with full sequence overlap, and the epitopes with highest number of HLA allele binders.

Abbreviations: CTL, cytotoxic T lymphocyte; EBI, European Bioinformatics Institute; HLA, human leukocyte antigen; HTL, helper T lymphocyte; MSA, multiple sequence alignment.

Overlapping CTL and HTL epitopes clustering with all three (CTL, HTL, and B-cell) types of epitopes or with complete epitope sequence overlap or with the highest number of HLA allele binders were identified and selected for their molecular interaction analysis with HLA alleles and TAP (Figure 3). These include CTL epitopes:47 AQADTAGLY,31,55 FTYIPASGY,25,39 STDSIVFTY,33,58 SMKMAPLMLL,14,67 VTVDGVNFR,22,51 and LVQPALYLY;59 and HTL epitopes:52 AGLYTNFRIDVPSAE,36,66 ASGYVAALAVNVCLI,11,50 EVLVTVDGVNFRTVV,25,58 SMKMAPLMLLQLLGR,72,117 and FLRNVRFELHEFALL.131

Molecular interaction analysis of the selected epitopes with HLA allele

Molecular docking and MD simulation study of the selected epitopes with HLA alleles

Molecular docking study of all the selected epitopes with their respective HLA allele binders revealed a significantly favorable molecular interaction. Docking complexes thus formed have significantly negative binding energy, and several amino acid residues of epitopes and HLA alleles were involved in hydrogen bond formation (Figure 4). To analyze the stability of binding, docking complexes were further subjected to MD simulation study with an analysis time window of 0.1 ns at the reasonably invariable temperature (~300 K) and pressure (~1 bar). The results of MD simulation for all the epitope-HLA allele complexes showed a very convincing reasonably invariant root mean square deviation (RMSD) value between ~0.1 and 0.2 nm, thus indicating stable complex formation (Figure 5). Moreover, the reasonably invariant radius of gyration (Rg) of the complexes (Figure S2) and RMS fluctuation (RMSF) for all the atoms in the complexes (Figure S3) indicate that the epitope-HLA complexes remain very stable in their folded form. B-factor of the epitope and HLA allele complexes is shown in the rainbow color presentation in Figure S4. Most of the regions of the complexes are stable (blue) with a very small region being acceptably fluctuating (yellow and orange).

Figure 4

Molecular docking study of CTL and HTL epitopes with respective HLA allele binders.

Notes: (A) CTL and (B) HTL epitopes docked with their respective HLA allele binders. The docking study shows acceptable negative binding energy and hydrogen bonds (green balls) formation between epitopes (magenta sticks) and HLA alleles (cyan cartoons). (*) Residues forming hydrogen bonds with more than one residue. Abbreviations: CTL, cytotoxic T lymphocyte; HLA, human leukocyte antigen; HTL, helper T lymphocyte.

Figure 5

Molecular dynamics simulation study of the epitope and HLA allele complexes.

Notes: Molecular dynamics simulation study of (A) CTL and (B) HTL epitope complexes with their respective HLA allele binders. The reasonably invariable RMSD value indicates a stable complex formation.

Abbreviations: CTL, cytotoxic T lymphocyte; HLA, human leukocyte antigen; HTL, helper T lymphocyte; RMSD, root mean square deviation.

Molecular interaction analysis (docking) of the selected CTL epitopes with TAP

Molecular docking results show a favorable molecular interaction between the selected CTL epitopes and the TAP cavity. The result reveals several molecular interaction sites for the selected epitopes within the TAP cavity. Two sites of interaction—one close to the cytoplasm and another close to the ER lumen—are shown in Figure 6. All the interactions showed a significantly negative binding energy with one or more than one hydrogen bond formation at both sites of interaction. From this study, we may predict a smooth passage for selected CTL epitopes from the cytoplasm to the ER lumen throughout the entire passage of the TAP cavity (Figure 6).

Figure 6

Molecular docking study of CTL epitopes with the TAP cavity.

Notes: TAP is shown by cyan ribbon, and epitopes are shown by magenta. In every panel of the epitope–TAP complex, (A) shows the binding of epitope at two different sites of the TAP cavity, (B) and (C) show the hydrogen bond formation between epitopes and TAP cavity residues at two respective sites of the TAP cavity. (*) Residues forming hydrogen bonds with more than one residue. (a and b) Chain A and B of TAP. Binding energy is shown in kcal/mol. Hydrogen bonds are shown by yellow dots.

Abbreviations: CTL, cytotoxic T lymphocyte; TAP, transporter associated with antigen processing.

Characterization and interaction analysis of designed MEVs with TLR-3

IFN-γ is involved in both adaptive and innate immune responses. It stimulates macrophages and natural killer cells. IFN-γ-inducing 15-mer peptide epitopes were predicted from both CTL and HTL MEVs by IFNepitope server. A total of 486 and 640 IFN-γ epitopes were predicted from CTL and HTL MEVs, respectively. Out of these, 35 CTL MEVs and 18 HTL MEVs, IFN-γ-inducing POSITIVE epitopes, with score of one or more than one were chosen (Table S9 and Figure 7C and G).

Figure 7

3D structure modeling of CTL and HTL MEVs.

Notes: (A and E) 3D structure of CTL and HTL MEVs models, respectively, showing epitopes from different proteins in different colors (epitope colors are the same as those shown in Figure 1). (B and F) Overlapping regions belonging to B-cell linear epitopes from both CTL and HTL MEVs, respectively, are shown by spheres. From (C) CTL and (G) HTL MEVs, IFN-γ-inducing epitopes are shown in cyan, while discontinuous B-cell epitopes are shown in magenta. The regions shown in wheat color are common to both IFN-γ and discontinuous B-cell epitopes. (D and H) RAMPAGE analysis of refined CTL and HTL MEV models, respectively.

Abbreviations: CTL, cytotoxic T lymphocyte; HTL, helper T lymphocyte; IFN, interferon; MEV, multi-epitope vaccine.

The AlgPred analysis of CTL and HTL MEVs revealed that both MEVs are nonallergens with the score of −0.95415008 and −0.8647714, respectively, with sensitivity and specificity of 88.87% and 81.86%, respectively, while the default threshold value was −0.4. The VaxiJen analysis indicated both MEVs to be probable ANTIGENS with prediction score of 0.5302 and 0.5097 for CTL MEVs and HTL MEVs, respectively, while the default threshold value was 0.4 for viral proteins. Hence, both MEVs are predicted to be nonallergic and potentially antigenic in nature. ProtParam analysis showed that CTL MEVs have 500 amino acids, 50.4 kDa molecular weight, and 9.72 theoretical pI. The half-life in mammalian reticulocytes, yeast, and Escherichia coli was 30 hours, 20 hours, and 10 hours, respectively; aliphatic index was 61.68, and grand average of hydropathicity (GRAVY) was −0.020, indicating globular and hydrophilic nature of CTL MEVs; the instability index was 50.70, indicating that CTL MEVs are theoretically close to stable in nature. ProtParam study showed HTL MEV has 657 amino acids, 67.583 kDa molecular weight, and 9.29 theoretical pI. The half-life in mammalian reticulocytes, yeast, and E. coli was 30 hours, 20 hours, and 10 hours, respectively; aliphatic index was 94.22, and grand average of hydropathicity (GRAVY) was 0.337, both indicating globular and hydrophilic nature of HTL MEV; the instability index was 37.66, indicating theoretically stable nature of HTL MEV. The model generated by RaptorX for CTL MEV has 4% helix, 37% β-sheet, and 58% coil; the model is 26% exposed, 41% medium, and 31% buried. It has three domains: first domain (1–46 aa, template-1fd3:A); second domain (47–444 aa, template-4dou:A, 4zch:A); and third domain (445–500 aa, template-1kj6:A) (Figure 7A). The model generated by RaptorX for HTL MEV has 35% helix, 20% β-sheet, and 44% coil; the model is 26% exposed, 35% medium, and 37% buried. It has four domains: first domain (1–46 aa, template-1fd3:A); second domain (47–546 aa, templates-4ufcA, 2eabA, and 2rdyA); third domain (547–601 aa, template-1tue:B); and fourth domain (602–657 aa, template-1kj6:A) (Figure 7E). The P-values of the best template of CTL and HTL MEV models are 3.98e-05 and 5.41e-04, respectively. Good quality alpha proteins and good quality beta proteins should have P-value of less than 10−3 and 10−4 respectively. Hence, both the models of CTL and HTL MEVs generated by RaptorX are of good quality. Because the CTL and HTL epitopes used for the design of both MEVs show overlapping with B-cell linear epitopes (Figure 3), both the CTL and HTL MEV models also carry overlapping regions of B-cell linear epitopes (Figure 7B and F). The structural accuracy of initial model including joining of the gaps was performed by ModRefiner refinment. Galaxy-Refine was used to further refine the 3D models of CTL and HTL MEVs. For both MEVs, refined model 1 was chosen on the basis of best scorings. For CTL MEV refinement, the score of model 1 was as follows: Rama favored was 94.2%, GDT-HA was 0.9500, RMSD was 0.433, MolProbity was 2.211, Clash score was 18.3, and Poor rotamers was 1.2. For HTL MEV refinement, the score of model 1 was as follows: Rama favored was 93.4%, GDT-HA was 0.9502, RMSD was 0.412, MolProbity was 2.262, Clash score was 21.9, and Poor rotamers was 0.6. Here, MolProbity is the log-weighted combination of clash score, percentage Ramachandran not favored, and percentage bad side-chain rotamers. After refinement, all the mentioned parameters improved significantly as compared to those of the initial model (Table S10). The RAMPAGE analysis showed that the refined CTL MEV model has 94% residues in the favored region, 3.8% residues in the allowed region, and only 1.4% residues in the outlier region, while the refined HTL MEV model has 94% of residues in the favored region, 5.2% residues in the allowed region, and only 0.8% residues in the outlier region (Figure 7D and H). Discontinuous B-cell epitope prediction from both MEVs was performed by the ElliPro tool. The screening revealed CTL MEV to have six discontinuous epitopes and HTL MEV to have five discontinuous epitopes. The PI score for CTL MEV epitopes ranged from 0.549 to 0.906 and that for HTL MEV ranged from 0.56 to 0.729 (Table S11 and Figure 7C and G). The higher the score, the greater is the potential of the B-cell discontinuous epitope. The refined models of CTL and HTL MEVs were further analyzed for their interaction with the ECD of human TLR-3 by molecular docking using the PatchDock tool. Docking conformation chosen for the CTL and HTL MEVs showed scores of 18,096 and 23,690, respectively, which were highest among all docked complexes. The highest score indicates the best geometric shape complementarity fitting of ligand and receptor predicted by the tool. The docking complex shows a fitting confirmation of both the MEVs within the ECD of TLR-3 (Figure 8A and E). Further, the MD simulation analysis of the docked CTL-TLR-3 and HTL-TLR-3 complexes showed a very convincing reasonably stable RMSD value betweeñ0.4 and 0.5 nm for a given time window of 6 ns at the reasonably invariable temperature (~300 K) and pressure (~1 bar). These results indicate a stable complex formation for both MEVs with TLR-3 (Figure 8B and F). The reasonably invariant Rg of MEV-TLR-3 complexes (Figure 8C and G) and RMSF for all the atoms in the complexes (Figure 8D and H) indicate that both the MEV-TLR-3 complexes are reasonably stable and are properly folded. The B-factor of CTL and HTL MEV complexes with the TLR-3 receptor is shown by rainbow color presentation in Figure 8A and E. Most of the regions of both complexes are stable (blue) with a very small region being acceptably fluctuating (yellow and orange).

Figure 8

Molecular docking and MD simulation study of CTL and HTL MEVs with TLR-3.

Notes: (A) CTL and (E) HTL MEVs (cartoon) docked complex with TLR-3 (ribbon), respectively. B-factor of the complexes is shown by rainbow (VIBGYOR) presentation; the regions in blue are very stable, and the region in red is unstable. In the above complexes, most of the regions are in blue and a very small region is in yellow and orange, indicating that both complexes are very stable. (B and F) MD simulation study of CTL and HTL MEV complexes with TLR-3, respectively. (C and G) Rg across the time window of 6,000 ps for CTL and HTL MEV complexes with TLR-3, respectively. (D and H) RMSF for all the atoms of the CTL and HTL MEV complex with TLR-3, respectively.

Abbreviations: CTL, cytotoxic T lymphocyte; HTL, helper T lymphocyte; MD, molecular dynamics; MEV, multi-epitope vaccine; Rg, radius of gyration; RMSD, root mean square deviation; RMSF, root mean square fluctuation; TLR-3, toll-like receptor 3.

Optimized cDNAs for both CTL and HTL MEVs were generated by the Codon Usage Wrangler Tool with a mammalian host cell line (human) as the choice of the expression system. The GenScript Rare Codon Analysis Tool showed that the GC content of optimized CTL MEV cDNA was 69.74% and CAI score was 1.0 with 0% tandem rare codons. Likewise, the GC content of optimized HTL MEV cDNA was 66.63%, and the CAI score was 1.0 with 0% tandem rare codons. Ideally, the GC content should be 30%–70%, the CAI score indicating the possibility of cDNA expression in the chosen expression system should be between 0.8 and 1.0, and tandem rare codon frequency indicating the percentage of low-frequency codons present in cDNA for the chosen expression system should be <30%. Tandem rare codons may hinder the expression of cDNA or even interrupt the translational machinery. Hence, the optimized cDNAs of both MEVs are predicted to be highly expressing in the mammalian host cell line (human).

Conclusion

In this study, we propose the design of two MEVs against MERS-CoV that consisted of CTL and HTL epitopes. The selected CTL and HTL epitopes were validated by in silico methods for their molecular interaction with HLA alleles and TAP (for CTL epitopes). The population coverage by the designed MEVs is as high as 94% of the world population. Both MEVs were found to contain IFN-γ epitopes and B-cell discontinuous epitopes. Moreover, they also showed stable interaction with the immunoreceptor TLR-3. On the basis of the design and in silico validation of both the CTL and HTL MEVs, their joint administration is predicted to induce humoral and cell-mediated immune response. Codon-based cDNAs of both MEVs are predicted to have high expression in the mammalian host cell line (human); hence, they could be cloned and expressed at the laboratory level for in vivo trials on humanized HLA-expressing mice for further study.

63 in total

1. Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization.

Authors: Dong Xu; Yang Zhang
Journal: Biophys J Date: 2011-11-15 Impact factor: 4.033

2. The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling.

Authors: Konstantin Arnold; Lorenza Bordoli; Jürgen Kopp; Torsten Schwede
Journal: Bioinformatics Date: 2005-11-13 Impact factor: 6.937

3. New hydrophilicity scale derived from high-performance liquid chromatography peptide retention data: correlation of predicted surface residues with antigenicity and X-ray-derived accessible sites.

Authors: J M Parker; D Guo; R S Hodges
Journal: Biochemistry Date: 1986-09-23 Impact factor: 3.162

4. Immunoinformatics analysis and in silico designing of a novel multi-epitope peptide vaccine against Staphylococcus aureus.

Authors: Nasim Hajighahramani; Navid Nezafat; Mahboobeh Eslami; Manica Negahdaripour; Seyyed Soheil Rahmatabadi; Younes Ghasemi
Journal: Infect Genet Evol Date: 2016-12-16 Impact factor: 3.342

5. Induction of hepatitis A virus-neutralizing antibody by a virus-specific synthetic peptide.

Authors: E A Emini; J V Hughes; D S Perlow; J Boger
Journal: J Virol Date: 1985-09 Impact factor: 5.103

6. In silico approach for predicting toxicity of peptides and proteins.

Authors: Sudheer Gupta; Pallavi Kapoor; Kumardeep Chaudhary; Ankur Gautam; Rahul Kumar; Gajendra P S Raghava
Journal: PLoS One Date: 2013-09-13 Impact factor: 3.240

7. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors: Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal: Mol Syst Biol Date: 2011-10-11 Impact factor: 11.429

8. Reliable B cell epitope predictions: impacts of method development and improved benchmarking.

Authors: Jens Vindahl Kringelum; Claus Lundegaard; Ole Lund; Morten Nielsen
Journal: PLoS Comput Biol Date: 2012-12-27 Impact factor: 4.475

9. Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method.

Authors: Morten Nielsen; Claus Lundegaard; Ole Lund
Journal: BMC Bioinformatics Date: 2007-07-04 Impact factor: 3.169

10. Ligand-induced Dimerization of Middle East Respiratory Syndrome (MERS) Coronavirus nsp5 Protease (3CLpro): IMPLICATIONS FOR nsp5 REGULATION AND THE DEVELOPMENT OF ANTIVIRALS.

Authors: Sakshi Tomar; Melanie L Johnston; Sarah E St John; Heather L Osswald; Prasanth R Nyalapatla; Lake N Paul; Arun K Ghosh; Mark R Denison; Andrew D Mesecar
Journal: J Biol Chem Date: 2015-06-08 Impact factor: 5.157

10 in total

1. Consensus Enolase of Trypanosoma Cruzi: Evaluation of Their Immunogenic Properties Using a Bioinformatics Approach.

Authors: Alejandro Diaz-Hernandez; Maria Cristina Gonzalez-Vazquez; Minerva Arce-Fonseca; Olivia Rodríguez-Morales; Maria Lilia Cedillo-Ramirez; Alejandro Carabarin-Lima
Journal: Life (Basel) Date: 2022-05-18

Review 2. Peptides for Vaccine Development.

Authors: Ian W Hamley
Journal: ACS Appl Bio Mater Date: 2022-02-23

3. Vaccines Targeting Numerous Coronavirus Antigens, Ensuring Broader Global Population Coverage: Multi-epitope and Multi-patch Vaccines.

Authors: Sukrit Srivastava; Spyros D Chatziefthymiou; Michael Kolbe
Journal: Methods Mol Biol Date: 2022

4. Anti-COVID-19 multi-epitope vaccine designs employing global viral genome sequences.

Authors: Tahreem Zaheer; Maaz Waseem; Walifa Waqar; Hamza Arshad Dar; Muhammad Shehroz; Kanwal Naz; Zaara Ishaq; Tahir Ahmad; Nimat Ullah; Syeda Marriam Bakhtiar; Syed Aun Muhammad; Amjad Ali
Journal: PeerJ Date: 2020-08-03 Impact factor: 2.984

Review 5. Recent Advances in the Vaccine Development Against Middle East Respiratory Syndrome-Coronavirus.

Authors: Chean Yeah Yong; Hui Kian Ong; Swee Keong Yeap; Kok Lian Ho; Wen Siang Tan
Journal: Front Microbiol Date: 2019-08-02 Impact factor: 5.640

6. Immunoinformatic design of a COVID-19 subunit vaccine using entire structural immunogenic epitopes of SARS-CoV-2.

Authors: Esmaeil Behmard; Bijan Soleymani; Ali Najafi; Ebrahim Barzegari
Journal: Sci Rep Date: 2020-11-30 Impact factor: 4.379

7. Design an Efficient Multi-Epitope Peptide Vaccine Candidate Against SARS-CoV-2: An in silico Analysis.

Authors: Zahra Yazdani; Alireza Rafiei; Mohammadreza Yazdani; Reza Valadan
Journal: Infect Drug Resist Date: 2020-08-25 Impact factor: 4.003

8. Rational design of multi epitope-based subunit vaccine by exploring MERS-COV proteome: Reverse vaccinology and molecular docking approach.

Authors: Usman Ali Ashfaq; Saman Saleem; Muhammad Shareef Masoud; Matloob Ahmad; Nazia Nahid; Rashid Bhatti; Ahmad Almatroudi; Mohsin Khurshid
Journal: PLoS One Date: 2021-02-03 Impact factor: 3.240

9. Structural Basis for Designing Multiepitope Vaccines Against COVID-19 Infection: In Silico Vaccine Design and Validation.

Authors: Sonia Verma; Sukrit Srivastava; Mohit Kamthania; Rupinder Kaur; Ruchi Kiran Badyal; Ajay Kumar Saxena; Ho-Joon Shin; Michael Kolbe; Kailash C Pandey
Journal: JMIR Bioinform Biotech Date: 2020-06-19

Review 10. Coronavirus Pandemic-Therapy and Vaccines.

Authors: Kenneth Lundstrom
Journal: Biomedicines Date: 2020-05-03

10 in total