Maryam Touhidinia1, Fatemeh Sefid2,3, Mozhgan Bidakhavidi1,4. 1. Department of Biology, Faculty of Science, Yazd University, Yazd, Iran. 2. Department of Medical Genetics, Shahid Sadoughi University of Medical Science, Yazd, Iran. 3. Department of Biology, Science and Art University, Yazd, Iran. 4. Department of Nursing, Nursing and Midwifery Research, Shahid Sadoughi University of Medical Sciences and Health Services, Yazd, Iran.
Abstract
Acinetobacter baumannii is one of the most successful pathogens causing nosocomial infections and has significantly multidrug-resistant. So far, there are no certain treatments to protect against infection with A. baumannii, therefore an effective A. baumannii vaccine needed. The purpose of this study was to predict antigenic epitopes of CarO protein for designing the A. baumannii vaccine using immunoinformatics analysis. CarO protein is one of the most important factors in the resistance against the antibiotic Carbapenem. In this study, T and B-cell epitopes of CarO protein were predicted and screened based on the antigenicity, toxicity, allergenicity features. The epitopes were linked by suitable linkers. Four different adjuvants were attached to the vaccine constructs which among them, vaccine construct 3 was chosen to predict the secondary and the 3D structure of the vaccine. The refinement process was performed to improve the quality of the 3D model structure; the validation process is performed using the Ramachandran plot and ProSA z-score. The designed vaccine's binding affinity to six various HLA molecules and TLR 2 and TLR4 were evaluated by molecular docking. Finally, in silico gene cloning was performed in the pET28a (+) vector. The findings suggest that the vaccine may be a promising vaccine to prevent A. baumannii infection.
Acinetobacter baumannii is one of the most successful pathogens causing nosocomial infections and has significantly multidrug-resistant. So far, there are no certain treatments to protect against infection with A. baumannii, therefore an effective A. baumannii vaccine needed. The purpose of this study was to predict antigenic epitopes of CarO protein for designing the A. baumannii vaccine using immunoinformatics analysis. CarO protein is one of the most important factors in the resistance against the antibiotic Carbapenem. In this study, T and B-cell epitopes of CarO protein were predicted and screened based on the antigenicity, toxicity, allergenicity features. The epitopes were linked by suitable linkers. Four different adjuvants were attached to the vaccine constructs which among them, vaccine construct 3 was chosen to predict the secondary and the 3D structure of the vaccine. The refinement process was performed to improve the quality of the 3D model structure; the validation process is performed using the Ramachandran plot and ProSA z-score. The designed vaccine's binding affinity to six various HLA molecules and TLR 2 and TLR4 were evaluated by molecular docking. Finally, in silico gene cloning was performed in the pET28a (+) vector. The findings suggest that the vaccine may be a promising vaccine to prevent A. baumannii infection.
Acinetobacter baumannii is an aerobic, pleomorphic, non-motile, typically short, almost round, and rod-shaped Gram-negative bacterium (Perez et al. 2007). It can cause infections in parts of the body such as blood, urinary tract, and lungs (Eliopoulos et al. 2008). As an opportunistic pathogen, A. baumannii has a high incidence among patients in the intensive care unit (ICU) setting and immunocompromised individuals particularly those who have experienced a prolonged hospital stay (Valencia et al. 2009). In addition, the frequency of A. baumannii infections has been enhancing gently (Dijkshoorn et al. 2007). A. baumannii is able to grow at various temperatures and pH conditions. Also, this organism can exploits different types of both carbon and energy sources. These properties cause A. baumannii can persist in either moist or dry conditions in the hospital environment. This hardiness, combined with its resistance to many antimicrobial drugs, contributes to spread it in the hospital setting (Obeidat et al. 2014).This phenomenon of multidrug-resistant (MDR) pathogens has become a serious concern about nosocomial and community infections (Dijkshoorn et al. 2007; Tohidinia et al. 2020). During 1970s, A. baumannii is thought to have been sensitive to wide range of antibiotics, but today the pathogen emerges immense resistance to most first-line antibiotics (Howard et al. 2012). Pathogenesis in A. baumannii infections is a result of several virulence factors such as porins, capsules, and cell wall lipopolysaccharide, enzymes, biofilm production, motility, and iron-acquisition systems (Eijkelkamp et al. 2014). These virulence factors contribute to the organism to resist in hard environmental conditions and enable development of intense infections (Ayoub Moubareck and Hammoudi Halat 2020). In addition, various resistance mechanisms in this pathogen are identified which resistance with major classes of antibiotics including penicillins, cephalosporins, carbapenems, most aminoglycosides, fluoroquinolones, chloramphenicol, and tetracyclines (Kyriakidis et al. 2021). These mechanism including wide array of antibiotic-hydrolyzing enzymes, efflux pump changes, impermeability, and antibiotic target mutations which gives a unique ability to the organism to maintain a multidrug-resistant phenotype (Asif et al. 2018; Espinal et al. 2019). According to the report of the Centers for Disease Control and Prevention, in 2017, Carbapenem-resistant Acinetobacter caused an estimated 8500 infections in hospitalized patients, and 700 people died as a result. CDC has reported a list of urgent threats bacteria that have the danger of antibiotic resistance. This list includes Carbapenem-resistant Acinetobacter, Candida auris, Clostridioides difficile, Carbapenem-resistant Enterobacterales, Drug-resistant Neisseria gonorrhea. Carbapenem-resistant Acinetobacter has been reported in the first line of this serious danger (Prevention 2019).Carbapenem is a category of highly effective antibiotic drugs generally used for the treatment of severe or high-risk bacterial infections (Shaker and Shaaban 2017). The growing trend of Carbapenem resistance in A. baumannii recently has become a grave concern around the world because it limits the range of therapeutic alternatives (Higgins et al. 2010). In the context of the increasing worldwide occurrence of Carbapenem-resistant A. baumannii strains, Non-enzymatic molecular mechanisms act in cooperation with Carbapenem-hydrolyzing β-lactamases to provide an intense resistance. However, this process has not been described clearly (Catel-Ferreira et al. 2011). MDR strains showed a porin-mediated mechanism relating to the Carbapenem resistance-associated outer membrane protein, CarO (Catel-Ferreira et al. 2011; Mussi et al. 2007; Siroy et al. 2005). CarO protein generally is an eight‐stranded β‐barrel protein of 29 kDa which plays a key role in selective uptake of the ornithine, other basic amino acids, and uptake of Carbapenem (Catel-Ferreira et al. 2011). Mutations in the CarO gene can alter the structure or decrease the expression of the porin, resulting in a decline of antibiotics entry into the bacteria (Poirel et al. 2006; Zahn et al. 2015). In This study CarO protein was evaluated by available bioinformatics tools for designing an efficient multi-epitope subunit vaccine for the induction of immune responses against A. baumannii infections.One of the critical issues in immunization is the presence of epitopes including B cells, HTL, and CTL that are recognized by immune system cells and capable of stimulating an immune response (Catel-Ferreira et al. 2011). The epitopes are the parts of a protein that have two shapes, continuous and discontinuous epitopes. Discontinuous epitopes consist of the residues from distant parts of the sequence that collectively made up the epitope and they will usually not recognized individually by antibodies appointed against the discontinuous epitope, therefore these epitopes are not suitable for the multi-epitope vaccine. However, continuous epitopes, also called sequential epitopes, are conformation independent, so they have the ability to be recognized by antibodies individually and independently (Chandra and Singh 2012; Rubinstein et al. 2008). Epitopes can induce a more direct and potent immune response, than the response induced by the whole protein. Therefore, the identification of antigenic epitopes can be more effective and safe in vaccine design (Palatnik-de-Sousa et al. 2018).Immunoinformatics plays a key role in vaccine design and antibody production (Kazi et al. 2018). In the past, antibody design and vaccine development are expensive and time-consuming. Nowadays, advances in the field of bioinformatics have provided practical tools which can be used to lessen the time and cost for vaccine and antibody design (Anderson et al. 2013; Kazi et al. 2018). The Immunoinformatics approach allows the identification of the immunogenic epitopes from the pathogen genomes. Also, the ideal and immunogenic parts could be developed as potential vaccine candidates to trigger protective immune responses in the hosts (Ali et al. 2017).Adjuvants are agents or drugs that have few or no antigenic effects or properties but can help enhance the efficacy and immunogenicity of vaccines (Mishra et al. 2020). In many studies, several kinds of adjuvants were used that had significant effects on the immunogenicity of the vaccine (Cayatte et al. 2017; Doener et al. 2019; Gasparini et al. 2001; Suschak et al. 2017; Yang et al. 2020). Adjuvants L7/L12 ribosomal protein (Behbahani 2020; Saadi et al. 2017), beta-defensin (Gupta et al. 2020; Wang et al. 2020), HBHA protein (Kumar et al. 2019; Lei et al. 2020), and HBHA conserved sequence(Rashidian et al. 2020; Solanki and Tiwari 2018) recently been identified as efficient and helpful adjuvants, and we used them in this study.Infections caused by A. baumannii are becoming extremely difficult to prevent and treat because infection control strategies against the spread of A. baumannii are facing new multidrug-resistant (Rasooli et al. 2020). This study identifies the potential and immunogenic epitopes of CarO protein by bioinformatics tools. Then, linkers and adjuvant were added to the selected epitopes to design final multi-epitope vaccines and was selected the best vaccine construct based on allergenicity, antigenicity, and solubility features. The 3D structure of the protein was predicted, and its affinity to different HLA, TLR2, and TLR4 was investigated by molecular docking. Finally, the vaccine coding gene expression in E. coli was evaluated, and in silico cloning was performed in the pET28a (+) vector. This selected vaccine construct can be developed to cure the infection caused by the A. baumannii.
Method
Sequence Availability and Alignment
The CarO protein sequence was retrieved from NCBI at http://www.ncbi.nlm.nih.gov/protein and saved in FASTA format for further analyses (Pruitt et al. 2005); then Protein BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi) against reference sequence on the database was performed using Blosum80 matrix to collect homologous sequences between different bacteria (Tatusova and Madden 1999). Because BLOSUM80 created for highly similar alignments and is used to find related proteins (Wheeler 2003). PRALINE server at https://www.ibi.vu.nl/programs/pralinewww/ was used for generating of alignments. This server based on homology-extended multiple alignments and the integration of predicted secondary structure information with iteration capabilities methods which have many advantages and have made this server stronger (Simossis and Heringa 2005). Alignments can demonstrate the conservancy of the protein residues among various strains (Simossis and Heringa 2003). The suitable vaccine candidate should able effective against all strains of a given pathogen. The conservancy of the amino acids among bacteria other than Acinetobacter implies probable cross-reactivity levels.
Prediction T Cell MHC Class I Epitope
The recognition of MHC-I antigenic peptide which exposed at the target cell surface is crucial for the adaptive immune response (Nguyen van Binh et al. 2006). NetCTL server at http://www.cbs.dtu.dk/services/NetCTL/ was used to predict MHC class I epitopes. In this server, the method integrates prediction of peptide MHC class I binding, proteasomal C terminal cleavage, and TAP transport efficiency (Larsen et al. 2005). Prediction threshold value 0.75 was set for epitope identification. Epitopes were chosen with a combinatorial score > 1 for further analyses (Larsen et al. 2007). Also, IEDB server at http://tools.iedb.org/mhci/ was used to predict peptide binding to MHC class I molecules based on the consensus method. Identified T-cell epitopes with HLA alleles were selected by percentile rank and IC50 values. The lower the percentile rank was shown the higher interaction between the peptide and MHC molecules. In Addition, high binding epitopes have an IC50 value below 50 nM, and weak binding peptides and IC50 value below 500 nM. So, Epitopes with both IC50 value < 50 nM and percentile rank ≤ 0.2 were selected for more analyses (Vita et al. 2019).
Prediction Class I Antigenic, Toxicity and Immunogenicity
A second screening step including their important features such as antigenicity and allergenicity which were checked by AllergenFP server at http://ddg-pharmfac.net/AllergenFP/ and Vaxijen server at http://www.ddg-pharmfac.net/vaxijen/VaxiJen/VaxiJen.html. AllergenFP is generally utilized for the prediction of allergenicity of epitopes for vaccine design (Dimitrov et al. 2014). VaxiJen server used to predict antigenic properties of the peptide at a threshold value of 0.5. This server depends on Auto Cross Covariance (ACC) transformation, and alignment-independent predicted antigenic epitopes by their physicochemical behavior (Doytchinova and Flower 2007). In the following, Epitope/MHC complex should have the ability to evoke an immune response. So, we have used MHC I immunogenicity prediction tool in the IEDB server (Calis et al. 2013). Default parameters were selected and the epitopes which gained the positive value were chosen.
Prediction T Cell MHC Class II Epitope and Analysis Antigenic and Toxicity
To identify MHC-II binding epitopes two servers were employed. IEDB server at http://tools.iedb.org/mhci/ were used to compute MHC-II binding epitopes based on the consensus method. The consensus method uses a combination of both stabilization matrix alignment method and average relative binding matrix method. Predicted epitopes were selected relying on percentile rank and IC50 values (percentile rank ≤ 0.2 and IC50 value < 50 nM) (Jensen et al. 2018). NetMHCII 2.3 server at http://www.cbs.dtu.dk/services/NetMHCII/ was used to predict binding of peptides to HLA-DR, HLA-DQ and HLA-DP using artificial neural networks (Kaushik and Therapeutics 2019). Among predicted strong binding epitopes were chosen those had score < 1 for further analyses. Antigenicity, toxicity, and allergenicity were evaluated for predicted epitopes. The antigenicity and allergenicity of the epitopes were calculated by VaxiJen and AllergenFP server respectively (Dimitrov et al. 2014; Doytchinova and Flower 2007). In addition, mutation and toxicity features were checked by ToxinPred server at http://crdd.osdd.net/raghava/toxinpred/. In the silico method, ToxinPred was used to predict the toxicity of peptides and about how to design a peptide or protein with desired toxicity by mutating a minimum number of amino acids (Gupta et al. 2013).
Prediction and Screening of Liner B-Cell Epitopes
The recognition of B-cell epitopes has a key role in vaccine design, immunodiagnostic tests, and antibody production. There are various servers to predict B cell epitopes. Since each server performs based on specific algorithms and exclusive methods in epitope prediction. Therefore, the usage of several tools to predict linear B-cell epitopes in protein sequences are more valid. Ellipro server at http://tools.immuneepitope.org was employed to predict linear antibody epitopes based on a protein antigen's 3D structure (Ponomarenko et al. 2008). Also, liner B cell epitopes were predicted using an artificial neural network by ABCpred server at http://crdd.osdd.net (Saha and Raghava 2006a, b) In addition, SVMTriP (http://sysbio.unl.edu/SVMTriP/) was used in the prediction of the B cell linear epitopes. This server is a Support Vector Machine which used to predict linear antigenic epitopes which combine the Tri-peptide similarity and Propensity scores (SVMTriP) (Yao et al. 2012). For more identification of continuous B-cell epitopes from the CarO protein, Bcpred server at http://ailab.ist.psu.edu/bcpred/predict.html was utilized and the epitopes size of 20 amino acids as well as the specificity threshold of 0.75% were considered. The Bcpred server for predicting employs a subsequence kernel-based SVM classifier (EL‐Manzalawy et al. 2008). Selected epitopes from the servers mentioned above were screened for antigenicity and allergenicity properties (Dimitrov et al. 2014; Doytchinova and Flower 2007).
Construction of Model Vaccine
To construct the novel vaccine with low toxicity, allergenicity, and high immunogenicity, we have analyzed the different combinations of sequence constructs. During this novel multi-epitope subunit vaccine construction. Firstly, CTL, HTL, and B cell epitopes were joined with the help of amino acid linkers HEYGAEALERAG and GGGS respectively (Hasan et al. 2019; Nezafat et al. 2014). Secondly, for enhancing the immunogenicity of this sequence constructs were added the four different adjuvants L7/L12 ribosomal protein (Mallick et al. 2007), beta-defensin (Dar et al. 2015), HBHA protein (Nezafat et al. 2014), and HBHA conserved sequence respectively (Rana and Akhter 2016). Adjuvants sequences were added with the help of EAAAK linkers at both N- and C-terminal of Adjuvants (Saadi et al. 2017). PADRE peptide sequences were also incorporated along with the adjuvants by GGGS linker. PADRE peptide induced CD4+ T-cells that improve efficacy and potency of peptide vaccine (Wu et al. 2010).
Allergenicity, Antigenicity and Solubility Evaluation
The Allergenicity of vaccine constructs was predicted by AlgPred server (https://webs.iiitd.edu.in/raghava/algpred/submission.html) which based the SVM module on amino acid composition. A threshold score of prediction has been considered to be − 0.40. Prediction score less than the threshold value point displays the non-allergic feature of the peptide (Saha and Raghava 2006a, b). VaxiJen server and ANTIGENpro program (Kalita et al. 2020) at http://scratch.proteomics.ics.uci.edu were employed for the prediction of vaccine antigenicity behavior. In addition, SOLpro online server at http://scratch.proteomics.ics.uci.edu was utilized to predict the propensity of the protein to be soluble upon overexpression in Escherichia coli. This server uses a two-stage SVM architecture method to evaluate the protein solubility (Magnan et al. 2009).
Secondary Structure Prediction
The PSIPRED at http://bioinf.cs.ucl.ac.uk/psipred/ was used to predict the secondary structure of the vaccine construct. PSIPRED is a simple and accurate secondary structure prediction method, incorporating two feed-forward neural networks that perform an analysis on output obtained from PSI-BLAST (McGuffin et al. 2000). In addition, Phyre2 server at http://www.sbg.bio.ic.ac.uk/~phyre2/html/page.cgi?id=index employed to validate the PSIPRED predictions (Kelley et al. 2015).
Vaccine 3D Modeling, Refinement, and Validation
Homology modeling is the construction of an atomic model of a purpose protein based solely on the target's amino acid sequence and the experimentally determined structures of homologous proteins, referred to as templates (Kelley et al. 2015). While Ab initio prediction is the challenging attempt to predict protein structures based only on sequence information and without using templates (Yang and Zhang 2015). There are many tools and servers that are used for homology modeling and Ab initio prediction. There is no single modeling program or server which is superior in every aspect to others (Dolan et al. 2011). The SWISS-MODEL server at http://swiss-model.expasy.org/ is a fully automated protein structure homology-modeling server (Kiefer et al. 2009). I-TASSER server at https://zhanglab.dcmb.med.umich.edu/I-TASSER/ is based on hierarchical method to protein structure prediction and structure-based function annotation (Zhang 2008).GMQE (Global Model Quality Estimation) is a quality estimation which combines properties from the target–template alignment and the template search method. QMEAN, which stands for Qualitative Model Energy Analysis, is a composite scoring function describing the major geometrical aspects of protein structures. All 3D models of the protein built by SWISS-MODEL server, were qualitatively estimated by GMQE and QMEAN scores (Schwede et al. 2003). In addition, C-score is a confidence score for estimating the quality of predicted models by I-TASSER. It is calculated based on the significance of threading template alignments and the convergence parameters of the structure assembly simulations (Yang et al. 2015). Finally, Qualitative evaluation of 3D models was done by ProSA at https://prosa.services.came.sbg.ac.at. ProSA specifically faces the needs confronted in the authentication of protein structures acquired from X-ray analysis, NMR spectroscopy, and hypothetical estimations (Wiederstein and Sippl 2007). Rampage at http://mordred.bioc.cam.ac.uk/rapper/rampage.php was also employed for estimation of model quality using Ramachandran plot which is an algorithm for atomic level, high-resolution protein structure improvement (Hooft et al. 1997).
Molecular Docking
The aim of molecular docking is to give a prediction of the binding affinity between a receptor molecule and ligand by computation methods (Ballester and Mitchell 2010). The vaccine construct 3 (V3) was made by I-TASSER server. In addition, PDB IDs of different HLA alleles were downloaded from the RCSB at https://www.rcsb.org/ (Rose et al. 2010). Molecular docking of V3 with 6 various HLA alleles including 1A6A(HLA-DR B1*03:01), 3C5J(HLA-DR B3*02:02), 1H15(HLA-DR B5*01:01), 2FSE(HLA-DRB1*01:01), 2Q6W(HLA-DR B3*01:01), and 2SEB(HLA-DRB1*04:01) was carried out by PatchDock server (Schneidman-Duhovny et al. 2005) at https://bioinfo3d.cs.tau.ac.il/PatchDock/ to represent HLA-peptide interactions. PatchDock predicted a number of 266 solutions that were again subjected to the FireDock server (http://bioinfo3d.cs.tau.ac.il/FireDock/) to refine and re-scoring of molecular docking score (Andrusier et al. 2007). TLRs are the most important membrane-bound PRRs, playing a critical role in the recognition of invading pathogens by the innate immune system. A. baumannii interacts with host cells mainly by engaging TLR2 and TLR4 (Chen 2020). In the same way, the PatchDock server was used to docking of vaccine construct (V3) with TLR2 (PDB ID 6NIG) and TLR4 (PDB ID 4G8A) and then the top 10 models were refined and re-scored by FireDock server. The docking models of TLRs with vaccine construct (V3) were displayed by the HADDOCK 2.4 server at https://wenmr.science.uu.nl/ (Van Zundert et al. 2016). The best cluster was chosen from the docked clusters based on the lowest HADDOCK score. Finally, the interacting residues, hydrogen bonds and hydrogen bonds distances between the vaccine and the TLRs were mapped using PDBsum at http://www.ebi.ac.uk/thornton-srv/databases/cgi-bin/pdbsum/GetPage.pl?pdbcode=index.html (Laskowski 2001).
Prediction of Various Physicochemical Properties
The functionally characterize of vaccine sequence was predicted by Expasy ProtParam server (https://web.expasy.org/protparam/). This server has determined the physicochemical characteristics such as the molecular weight, number of amino acids, PI values, hydropathicity GRAVY values, instability index, aliphatic index, and estimated half-life of the protein. Calculate of these physicochemical properties is on the basis of pK values of different amino acids. The instability index value shows protein is stable or unstable. The instability index value is < 40 for stable protein and > 40 for unstable protein. The aliphatic index determines the volume occupied by the aliphatic side chains of protein (Gasteiger et al. 2005).
Codon Optimization of the Vaccine Construct and In-Silico Cloning
It is important to consider codon optimization when performing expression studies because codon optimization plays a critical role in the success of protein expression (Mauro and Chappell 2018). Java Codon Adaptation Tool (JCAT) at http://www.jcat.de/ was employed to adapt the codon usage of vaccine construct to E. coli host strain 12. Vaccine amino acid sequence was back-translated to DNA sequence and then it was adapted for codon usage to E. coli. The adaptation was based on Codon Adaptation Index values (CAI). The rho-independent transcription terminators, prokaryotic ribosome binding sites, and cleavage sites of some restriction enzymes were avoided during the operation performed by JCAT server (Grote et al. 2005). In addition, to clone the adapted gene sequence of the final vaccine construct (V3) in E. coli pET28a vector (Ali et al. 2021), using Snapgene tool at https://www.snapgene.com/ to ensure the vaccine construct expression (Arumugam 2021) and BglII and ApaI restriction sites were introduced to the N and C-terminals of the sequence, respectively.
Result
The flow chart of experimental process which summarizes an overview of the methodology used in this study is shown in Fig. 1.
Fig. 1
Flow chart showing an overview of our methodology
Flow chart showing an overview of our methodology
Protein Sequencing and Alignments
CarO protein sequence with accession No. AKL79738.1 is available as a query for protein BLAST against Acinetobacter producing a set of sequences containing various species of Acinetobacter. Out of 100 alignments, 20 sequences with the highest similarity (identity ≥ 90%, query coverage: 100% and E value: 0) were selected as T-COFFEE alignment input. BLAST search of Acinetobacter revealed numerous sequences of CarO. These hits showed that the CarO protein is specific for Acinetobacter species. The image of PRALINE alignment results with conservation color scheme is shown in Fig. 2.
Fig. 2
CarO sequence alignment with 20 sequences obtained from protein BLAST against Acinetobacter. The superposition was made with the T-coffee program and adjusted manually. Residues conservancy is depicted by blue to pink colors
CarO sequence alignment with 20 sequences obtained from protein BLAST against Acinetobacter. The superposition was made with the T-coffee program and adjusted manually. Residues conservancy is depicted by blue to pink colors
Selection of Potent MHC-I Epitopes in Protein and Analysis Eminent Features of Epitopes
On the basis of the high combinatorial score, the best T-cell epitopes were predicted by NetCTL server using the protein sequence. The software identified 36 epitopes in CarO protein. Peptides with combinatorial score > 1 were selected to other analyses (Table 1). In IEDB server, 9 epitopes were elicited with both IC < 200 nM and percentile rank ≤ 0.2 were subjected to afterward analysis (Table 1). After finalizing selected epitopes of both NetCTL and IEDB based above conditions, their important features including allergenicity, antigenicity, and immunogenicity Checked (Table 1). The sequences of IAPYLGFGF, KRIGNGDTL, WQANPYVGL and ALLWQANPY were considered as epitopes of T cells that interact with MHC Class-I and have the most important epitopes due to their features such as Non-allergen, positive scores in immunogenicity, and high scores in antigenicity.
Table 1
CTL epitopes (MHC I peptides) of CarO protein with their antigenicity, allergenicity and immunogenicity scores
No
CTL epitope
Position
Vaxijen score
Allergenicity
Immunogenicity
Final decision
1
GAAYLDNDY
121
− 0.6097
Allergen
0.02656
2
RAEVGTTGY
43
1.4273
Allergen
0.19614
3
DVSVNGTKY
79
1.7446
Non allergen
− 0.0798
4
LLWQANPYV
55
0.8355
Allergen
− 0.02417
5
YIAAGAAYL
117
0.1448
Allergen
0.15276
6
YLDNDYDLA
124
− 0.0090
Allergen
0.03105
7
VLVTTTALL
6
− 0.3603
Allergen
0.15334
8
GVRGKMSYK
158
2.1964
Non allergen
− 0.46618
9
YLGFGFAPK
172
1.0001
Allergen
0.29854
10
AYLDNDYDL
123
− 0.0481
Allergen
0.04314
11
SYAFDKNQL
29
− 0.0618
Allergen
− 0.12404
12
IAPYLGFGF
169
1.4363
Non allergen
0.13238
*
13
GVFGEVGAY
186
− 0.2548
Allergen
0.25992
14
DVSVNGTKY
79
1.7446
Non allergen
− 0.0798
15
YIAAGAAYL
117
0.1448
Allergen
0.15276
16
SYKNDIAPY
164
0.0616
Allergen
0.09691
17
RVLVTTTAL
5
− 0.3508
Allergen
0.1679
18
IPVGARAEV
38
0.3971
Allergen
0.22584
19
NPTSAQDAV
214
1.0699
Allergen
− 0.18323
20
RVLVTTTAL
5
− 0.3508
Allergen
0.1679
21
KRIGNGDTL
133
1.1551
Non allergen
0.1443
*
22
YTGNPKVEL
195
0.8255
Non allergen
− 0.11597
23
YKNDIAPYL
165
− 0.0820
Allergen
0.17513
24
YIAAGAAYL
117
0.1448
Allergen
0.15276
26
DMDNNNVYL
90
0.3279
Allergen
0.02098
27
SYAFDKNQL
29
− 0.0618
Allergen
− 0.12404
28
WQANPYVGL
57
0.8396
Non allergen
0.0466
*
29
TGYGGALLW
49
0.1673
Allergen
0.08689
30
GKVGVNFYW
241
1.6442
Allergen
0.17825
31
YLNAEIRPW
97
0.4980
Allergen
0.29725
32
FAPKISKNW
177
0.9830
Allergen
− 0.43251
33
GVFGEVGAY
186
− 0.2548
Allergen
0.25992
34
ALLWQANPY
54
0.6467
Non allergen
0.13138
*
35
LYIAAGAAY
116
0.4268
Allergen
0.20845
36
AVVHDSYAF
24
0.2032
Allergen
− 0.06844
37
YLNAEIRPW
97
0.4980
Allergen
0.29725
38
YAFDKNQLI
30
− 0.8295
Allergen
− 0.26001
39
DEAVVHDSY
22
0.5088
Allergen
0.04695
40
YAFDKNQLI
30
− 0.8295
Allergen
− 0.26001
41
STNPWAQGL
108
− 0.0570
Allergen
0.16131
42
DMDNNNVYL
90
0.3279
Allergen
0.02098
43
AYYTGNPKV
193
0.5286
Allergen
− 0.07059
44
GVRGKMSYK
158
2.1964
No allergen
− 0.46618
45
VFGEVGAYY
187
− 0.3389
No allergen
0.21471
Four CTL epitopes were selected (labeled with *) for incorporation in the vaccine construct
CTL epitopes (MHC I peptides) of CarO protein with their antigenicity, allergenicity and immunogenicity scoresFour CTL epitopes were selected (labeled with *) for incorporation in the vaccine construct
Selection of MHC-II Epitopes and Prediction Their Antigenic and Toxicity
In addition to MHC-I epitope prediction, CarO protein was subjected to MHC-II binding prediction, using IEDB and NetMHCII server. The epitopes that elicited higher affinity (IC < 200 nM) and percentile rank ≤ 0.2 were subjected to afterward analysis (Table 2). NetMHCII server predicted more than 200 regions that could have an affinity for MHCII proteins. We selected 17 epitopes because these regions showed strong binding with a score < 1 and promising characteristics. After this process, selected epitopes from both servers were analyzed based on their essential features including mutation, toxicity, allergenicity, and antigenicity (Table 2). Finally, the sequences of GNGDTLSIDGKNYQQ, PWAQGLYIAAGAAYL, WAQGLYIAAGAAYLD, AQGLYIAAGAAYLDN, GLYIAAGAAYLDNDY, QGLYIAAGAAYLDND, ANEIRNDNKYEWMPV, GKMSYKNDIAPYLGF, KMSYKNDIAPYLGFG, RGKMSYKNDIAPYLG, DIAPYLGFGFAPKIS, and YLGFGFAPKISKNWG as the epitopes of T cells that interact with MHC Class-II alleles were considered as the most essential epitopes due to features of Non-allergen, non-toxin and high score in antigenicity.
Table 2
HTL epitopes (MHC-II peptides) of CarO protein with their antigenicity, allergenicity, toxicity and mutation scores.
No
HTL epitope
Position
Vaxijen Score
Allergenicity
Toxicity
Mutation
Final decision
1
KVLRVLVTTTALLAA
16–2
− 0.2328
No allergen
2
LRVLVTTTALLAAGA
18–4
− 0.0644
No allergen
3
MKVLRVLVTTTALLA
15–1
− 0.1848
No allergen
4
VLRVLVTTTALLAAG
17–3
− 0.2461
No allergen
5
GNGDTLSIDGKNYQQ
150–131
1.8936
No allergen
*
6
LSIDGKNYQQAVPGQ
155–141
1.1512
Allergen
7
TLSIDGKNYQQAVPG
154–140
1.1523
Allergen
8
PWAQGLYIAAGAAYL
125–111
0.5259
No allergen
*
9
WAQGLYIAAGAAYLD
126–112
0.4335
No allergen
*
10
AQGLYIAAGAAYLDN
127–113
0.4818
No allergen
*
11
GLYIAAGAAYLDNDY
129–115
0.4596
No allergen
*
12
QGLYIAAGAAYLDND
128–114
0.4293
No allergen
*
13
ANEIRNDNKYEWMPV
241–226
1.7000
No allergen
*
14
GKMSYKNDIAPYLGF
175–161
0.5176
No allergen
*
15
KMSYKNDIAPYLGFG
176–162
0.4967
No allergen
*
16
RGKMSYKNDIAPYLG
174–160
0.5852
No allergen
*
17
VHDSYAFDKNQLIPV
40–26
− 0.1666
No allergen
18
VRGKMSYKNDIAPYL
173–159
0.7463
Allergen
19
IRPWGASTNPWAQGL
102–116
− 0.1476
No allergen
20
DIAPYLGFGFAPKIS
168–182
1.2750
No allergen
*
21
TALLAAGAAMADEAV
11–25
− 0.0024
No allergen
22
NPWAQGLYIAAGAAY
110–124
0.1204
No allergen
23
PWAQGLYIAAGAAYL
111–125
0.0259
No allergen
24
VHDSYAFDKNQLIPV
26–40
− 0.1666
Allergen
25
VLRVLVTTTALLAAG
3–17
− 0.2461
Allergen
26
KVLRVLVTTTALLAA
2–16
− 0.2328
Allergen
27
IAAGAAYLDNDYDLA
118–132
− 0.0341
No allergen
28
NPWAQGLYIAAGAAY
110–124
0.1204
No allergen
29
TALLAAGAAMADEAV
11–25
− 0.0024
No allergen
30
VLVTTTALLAAGAAM
6–20
− 0.0596
No allergen
31
AAGAAYLDNDYDLAK
119–133
− 0.0505
No allergen
32
LVTTTALLAAGAAMA
7–21
0.0283
Allergen
33
YLGFGFAPKISKNWG
172–186
1.7184
No allergen
*
Eleven HTL epitopes were selected (labeled with *) for selective step to incorporation in the vaccine construct
HTL epitopes (MHC-II peptides) of CarO protein with their antigenicity, allergenicity, toxicity and mutation scores.Eleven HTL epitopes were selected (labeled with *) for selective step to incorporation in the vaccine construct
Linear B Cell Epitope Prediction
The analysis of the sequences by different bioinformatics programs for T and B cells yielded approximately 100 epitopes. Svmtrip predicted 3 Linear B cell epitopes based on the score. The epitopes recommended by this server are "AMADEAVVHDSYAFDKNQLI", "GAAYLDNDYDLAKRIGNGDT", and "WQANPYVGLA LGYNGGDISW". 6 linear along B cell epitopes were predicted by ElliPro software. The sequences and pictures of 6 linear epitopes with the highest PI (protrusion index) are shown in Fig. 3 and Table 3 (Rows 7–12).
Fig. 3
Epitope mapping on 3D models. Discovery Studio Visualizer 2.5.5 software was used. From 1 to 6 pictures, 6 linear epitopes with the highest PI score predicted by Ellipro server are shown
Table 3
B-cell epitopes of CarO protein with their antigenicity and allergenicity scores
No
Bcell epitope
Position
Vaxijen Score
Allergenicity
Final decision
1
SIDGKNYQQAVPGQEGGVRG
142–161
1.6824
No allergen
*
2
LNAEIRPWGASTNPWAQGLY
98–117
0.1589
Allergen
3
KNWGVFGEVGAYYTGNPKVE
183–202
0.0053
Allergen
4
GYNGGDISWTDDVSVNGTKY
68–87
0.9330
No allergen
*
5
APVTGNPTSAQDAVDKEANE
209–228
1.1717
No allergen
*
6
NQLIPVGARAEVGTTGYGGA
35–54
0.8647
No allergen
*
7
KNQLIPV
34–40
− 0.7431
Allergen
8
IRPWGASTNRWAQGL
81–95
0.3082
Allergen
9
NVDATRSFRVNNQDFIAGADGV
113–134
0.8684
No allergen
*
10
FAPKINKNWGV
154–164
0.7523
No allergen
*
11
SGSAVTTGDQSLEEAVNA
183–200
1.2565
No allergen
*
12
DISWSDDVKVNGST
52–65
1.2709
No allergen
*
13
AMADEAVVHDSYAFDKNQLI
19–38
0.0036
Allergen
14
GAAYLDNDYDLAKRIGNGDT
121–140
0.3189
Allergen
15
WQANPYVGLALGYNGGDISW
57–76
0.4059
Allergen
16
ACINETACTERAMANN
15–30
0.5248
No allergen
*
17
EVGAYYTGNPKVELTQ
232–247
0.5643
No allergen
*
18
RAEVGTTGYGGALLWQ
85–100
0.9620
No allergen
*
19
EAVVHDSYAFDKNQLI
65–80
− 0.1973
Allergen
20
GGDISWTDDVSVNGTK
113–228
1.3248
No allergen
*
21
DNKYEWMPVGKVGVNF
274–289
0.1697
Allergen
22
PVTGNPTSAQDAVDKE
252–267
1.1236
No allergen
*
23
TQYNLAPVTGNPTSAQ
246–261
0.8548
Allergen
24
GQEGGVRGKMSYKNDI
196–211
1.9167
Allergen
Thirteen B-cell epitopes were selected (labeled with *) for selective step to incorporation in the vaccine construct
Epitope mapping on 3D models. Discovery Studio Visualizer 2.5.5 software was used. From 1 to 6 pictures, 6 linear epitopes with the highest PI score predicted by Ellipro server are shownB-cell epitopes of CarO protein with their antigenicity and allergenicity scoresThirteen B-cell epitopes were selected (labeled with *) for selective step to incorporation in the vaccine constructABCpred result showed 65 hits of 16 metric peptide sequences as B-cell epitopes ranking based on scores. The best epitopes predicted by the server with a score above 0.85 are the sequences of "ACINETACTERAMANN", "EVGAYYTGNPKVELTQ", "RAEVGTTGYGGALLWQ", "EAVVHDSYAFDKNQLI", "GGDISWTDD VSVNGTK", "DNKYEWMPVGKVGVNF", "PVTGNPTSAQDAVDKE", "TQYNLAPVTGNPTSAQ", and "GQEGGVRGKMSYKNDI". BCpred server predicted 6 B-cell epitopes. The scores of epitopes ranging from 0.787 to 0.994 where a high score represents the higher probability of becoming B-cell epitopes and higher affinity for the B-cell receptor. The total promising linear B cell epitopes which were mentioned above were analyzed by the Vaxijen server (≥ 0.4 score) and AllergenFP server in order to participate in the vaccine construct.
Comparison of All the Predicted Epitopes to Select the Final Epitopes
After collecting the immunoinformatics information with the help of T-cell prediction and B-cell prediction, the high scores of HTL, CTL, and B cell epitopes were predicted and analyzed for their essential feature. Finally, to make the final multi-epitope construct, 4 CTL epitopes, 4 HTL epitopes, and 4 B cell epitopes due to features of non-Allergen, non-toxic, and higher score in antigenicity (Table 4).
Table 4
The final selection of CTL, HTL and B-cell epitopes for multi-epitope vaccine construct
No
Type of epitope
Sequence of epitope
Antigenicity score
1
CTL
IAPYLGFGF
1.4363
2
CTL
KRIGNGDTL
1.1551
3
CTL
WQANPYVGL
0.8396
4
CTL
ALLWQANPY
0.6467
5
HTL
GNGDTLSIDGKNYQQ
1.8936
6
HTL
ANEIRNDNKYEWMPV
1.7000
7
HTL
DIAPYLGFGFAPKIS
1.2750
8
HTL
YLGFGFAPKISKNWG
1.7184
9
B-cell
SIDGKNYQQAVPGQEGGVRG
1.6824
10
B-cell
DISWSDDVKVNGST
1.2709
11
B-cell
GGDISWTDDVSVNGTK
1.3248
12
B-cell
PVTGNPTSAQDAVDKE
1.1236
The final selection of CTL, HTL and B-cell epitopes for multi-epitope vaccine construct
Construction of Final Vaccine with Different Adjuvants, Linkers, and PADRE Sequence
A total of four vaccine constructs were prepared and further analyzed. All four vaccine constructs were added with the respective adjuvants with the help of EAAAK linker. MHC-I, MHC-II, B-epitopes, and PADRE sequence were joined together by HEYGAEALERAG and GGGS linkers (Fig. 4). Details of the vaccine constructs have been mentioned in Table 5.
Fig. 4
Schematic diagram of multi-epitope vaccine
Table 5
Allergenicity prediction of all vaccine construct using Algpred, Vaxijen and scratch protein predictor server
Schematic diagram of multi-epitope vaccineAllergenicity prediction of all vaccine construct using Algpred, Vaxijen and scratch protein predictor server0.22954849Allergen− 0.3428515Allergen− 0.40767406No allergen− 0.2874046Allergen
Prediction Features of Different Vaccine Constructs
Antigenicity of the vaccine proteins was predicted using VaxiJen server and ANTIGENpro. All Constructs were found as potential vaccine candidates with high antigenicity scores and the ability to stimulate an efficient immune response (Table 5). Also all 4 constructs showed the solubility more than 0.7, which indicated that the vaccine construct will be highly soluble during its heterologous expression in the E. coli (Table 5). But, among four constructs, the only vaccine with beta-defensin (V3) was found to be non-allergic in nature (Allergenicity < − 0.4). So, the V3 construct was chosen for further analysis.
Prediction Secondary Structure
PSIPRED and Phyre2 servers were used to predict the secondary structure of the construct V3. The predicted structure of the protein confirmed to have 68.70% alpha-helix, 27.48% sheet, and 3.82% coil structure (Fig. 5). The results from Phyre2 confirmed the results of the PSIPRED program.
Fig. 5
Secondary structure prediction of vaccine construct (V3) using PESIPRED server
Secondary structure prediction of vaccine construct (V3) using PESIPRED serverSwiss model and I-TASSER software were recruited for homology modeling and Ab initio prediction. The former prepared 12 different models and the latter introduced 5 models. Among the structures predicted by the Swiss model, the best structures were selected based on the QMEAN and GMQE scores. QMEAN scores about zero show good agreement between the model structure and experimental structures of similar size and scores of − 4.0 or below indicate models with low quality. In addition, the GMQE score is expressed as a number between zero and one, reflecting the expected accuracy of a model built with that alignment and template. Higher numbers illustrate higher reliability. Two 3D models have shown GMQE with scores − 1.41 and − 1.38 and QMEAN with scores of 0.05 and 0.04 respectively. The best structure among five models of I-TASSER were measured on the basis of C-score. The C-score is normally in [− 5, 2] and It should be noted that the higher this score, the more accurate the model quality. Since the C-score of model 2 is higher (= − 0.83) than other models, is expected to have good quality. Finally, three predicted 3D models were selected for further analyses.The quality of the 3D models were evaluated by Rampage and Prosa servers. Based on the result of Ramachandran plot and Prosa the best model was chosen from the predicted structure by I-TASSER (Fig. 6a). We showed the Cartoon diagram of a computational structural model of the vaccine in the following figure by using Molecular Operating Environment Software. This figure displays that the sequence of each epitope is properly folded and exposed (Fig. 6b). This structure of vaccine construct show which 87.438% of the residues are in favored regions. Also, Prosa revealed that the predicted model was among other acceptable proteins with z score = − 5.27 (Fig. 7).
Fig. 6
a The best model of epitope vaccine (V3) predicted by I-TASSER server. b All of the regions of the vaccine for display on the scaffold are labeled and colored in representing adjuvants_yellow, PADRE_purple, B-cell epitopes_green, Linker_gray, HTL_blue, and CTL_pink (Color figure online)
Fig. 7
Model evaluation. a Ramachandran plot of final epitope vaccine (V3) model. Number of residues in favored region: 355 (87.438%). Number of residues in allowed region: 41(10.099%). Number of residues in outlier region: 10 (2.463%). b Prosa protein structure analysis results. Z score = − 5.27. Overall quality of the ultimate model is acceptable
a The best model of epitope vaccine (V3) predicted by I-TASSER server. b All of the regions of the vaccine for display on the scaffold are labeled and colored in representing adjuvants_yellow, PADRE_purple, B-cell epitopes_green, Linker_gray, HTL_blue, and CTL_pink (Color figure online)Model evaluation. a Ramachandran plot of final epitope vaccine (V3) model. Number of residues in favored region: 355 (87.438%). Number of residues in allowed region: 41(10.099%). Number of residues in outlier region: 10 (2.463%). b Prosa protein structure analysis results. Z score = − 5.27. Overall quality of the ultimate model is acceptableDocking analysis was employed between the vaccine construct 3 and different HLA alleles (Table 6). Construct V3 displayed biologically significant results and HLA-DR B1*01:01 found to be superior among different HLA in terms of free binding energy. Moreover, docking was also used to predict the binding affinity of the designed vaccine construct with human TLR2 and TLR4 receptors using PatchDock and Firedock servers respectively. FireDock output refinement of PatchDock server showed the lowest global energy of − 27.61 and − 50.74 for TLR2 and TLR4 respectively. In order to display docking of protein–ligand complexes was used HADDOCK. The top cluster with the lowest HADDOCK score of − 100.1 ± − 5.4 and − 160.9 ± − 29.4 suggests a good binding affinity between the vaccine and the TLR2 and TLR4. The docked complexes along with hydrogen bonds and distance of hydrogen bonds are shown in Figs. 8 and 9 and Tables 7 and 8.
Table 6
Prediction binding energy of vaccine construct (V3) with selected HLA molecules by PatchDock, Firedock
Vaccineconstructs
HLA alleles & theirPDB ID’s
Score
Area
Hydrogenbond energy
Globelenergy
ACE
V3
HLA-DRB1*03:01 (1A6A)
13,384
2181.40
−1.41
0.33
3.51
HLA-DR B3*02:02 (3C5J)
14,840
2191.30
− 4.17
− 10.18
12.89
HLA-DR B5*01:01 (1H15)
14,816
2309.50
− 1.23
− 14.97
1.11
HLA-DR B1*01:01 (2FSE)
13,982
1947.70
− 4.11
− 21.21
8.95
HLA-DRB3*01:01 (2Q6W)
15,688
2164.50
− 0.23
− 3.22
2.43
HLA-DR B1*04:01 (2SEB)
13,990
1956.80
− 0.44
− 5.23
0.36
Fig. 8
a Showing TLR2/vaccine docked complex. Vaccine construct 3 is shown in blue colour while TLR2 is shown in orange colour. Interacting residues and active pockets between docked TLR2/vaccine are displayed with purple and yellow filling space model respectively. b Few prominent hydrogen bonds within vaccine-TLR2 complex are focused (Color figure online)
Fig. 9
a Showing TLR4/vaccine docked complex. Vaccine construct 3 is shown in blue colour while TLR4 is shown in orange colour. Interacting residues and active pockets between docked TLR4/vaccine are displayed with purple and yellow filling space model respectively. b Few prominent hydrogen bonds within vaccine-TLR4 complex are focused (Color figure online)
Table 7
List of residues involved in hydrogen bonds and hydrogen bonds distance in TLR2-vaccine interactions
No.
Res name
Res no.
Chain
Res name
Res no.
Chain
Distancehydrogen bonds
1
ASN
294
A
THR
81
B
3.03
2
ARG
296
A
GLU
1
B
2.67
3
ARG
296
A
GLU
8
B
2.75
4
ARG
296
A
GLU
8
B
2.6
5
ASP
299
A
GLU
1
B
2.75
6
ASP
299
A
ALA
2
B
2.84
7
ARG
302
A
ALA
25
B
2.79
8
ILE
304
A
ARG
48
B
2.69
9
TYR
323
A
GLU
83
B
2.63
10
TYR
326
A
GLU
83
B
3.11
11
SER
329
A
ARG
52
B
3.04
12
SER
333
A
ARG
52
B
2.77
13
GLU
336
A
THR
47
B
2.65
14
ARG
337
A
GLU
45
B
2.97
15
ARG
337
A
GLU
46
B
3
16
ARG
337
A
GLU
46
B
2.78
17
HIS
358
A
ALA
49
B
2.66
18
TYR
376
A
GLU
94
B
2.61
19
LYS
378
A
ASP
122
B
2.62
20
GLU
383
A
ALA
119
B
3.1
21
GLU
383
A
ARG
54
B
2.66
22
GLU
383
A
ARG
54
B
3.06
23
ASP
384
A
LYS
169
B
2.67
Table 8
List of residues involved in hydrogen bonds and hydrogen bonds distance in TLR4-vaccine interactions
No.
Res name
Res no.
Chain
Res name
Res no.
Chain
Distancehydrogen bonds
1
CYS
192
A
GLU
94
B
2.98
2
GLN
219
A
GLU
94
B
2.65
3
LYS
224
A
LYS
87
B
2.67
4
ASN
241
A
GLU
128
B
3.1
5
LYS
244
A
ALA
119
B
2.79
6
LYS
244
A
GLU
121
B
2.65
7
THR
245
A
ASP
122
B
2.64
8
GLY
252
A
ARG
86
B
2.86
9
LYS
271
A
GLU
128
B
2.54
10
LYS
271
A
ASP
132
B
2.71
11
LYS
274
A
LYS
169
B
3.31
12
GLU
278
A
ARG
54
B
2.57
13
THR
284
A
GLU
83
B
3.05
14
THR
284
A
ARG
86
B
2.84
15
ASP
302
A
GLU
50
B
3
16
ASP
302
A
THR
51
B
2.67
17
ASN
305
A
ARG
52
B
2.7
18
ASN
305
A
THR
51
B
2.78
19
SER
327
A
THR
47
B
2.62
20
TYR
328
A
ARG
52
B
3.04
21
ASN
329
A
GLU
46
B
2.67
22
LYS
349
A
GLU
45
B
2.65
Prediction binding energy of vaccine construct (V3) with selected HLA molecules by PatchDock, Firedocka Showing TLR2/vaccine docked complex. Vaccine construct 3 is shown in blue colour while TLR2 is shown in orange colour. Interacting residues and active pockets between docked TLR2/vaccine are displayed with purple and yellow filling space model respectively. b Few prominent hydrogen bonds within vaccine-TLR2 complex are focused (Color figure online)a Showing TLR4/vaccine docked complex. Vaccine construct 3 is shown in blue colour while TLR4 is shown in orange colour. Interacting residues and active pockets between docked TLR4/vaccine are displayed with purple and yellow filling space model respectively. b Few prominent hydrogen bonds within vaccine-TLR4 complex are focused (Color figure online)List of residues involved in hydrogen bonds and hydrogen bonds distance in TLR2-vaccine interactionsList of residues involved in hydrogen bonds and hydrogen bonds distance in TLR4-vaccine interactions
Physicochemical Analysis of Vaccine Construct (V3)
The vaccine construct (V3) was characterized on the basis of various physicochemical properties using ProtParam tool. The molecular weight of the vaccine construct was found 51.57 kDa which shows its good antigenic potential. GRAVY (a hydropathic index) was − 0.527 (a negative value), which represents the hydrophilic nature of constructs. The calculated instability index of the protein was 23.43 which classified it as a stable one. The estimated half-life of the constructed vaccine was expected to be 1 h in mammalian reticulocytes in vitro while more than 10 h in E. coli in vivo. Aliphatic value 71.90 also showed that the vaccine construct has a good characteristic to initiate an immunogenic reaction.
In-Silico Cloning of Chimeric Vaccine Construct (V3) for Its Heterologous Expression
Reverse translation and codon optimization were performed by considering the expression system of the host in order to express the multi-epitope vaccine. The optimum range of GC content was considered to be between 30 and 70%. The GC content of the optimized codons (53.95%) was significant. Also, the high value of the CAI score shows a high expression of the gene. Our construct has a codon adaptation index (CAI) of 1. Recognition sites for BglII and ApaI restriction enzymes were added to the 5′ and 3′ end of the optimized gene and then the final vaccine construct (V3) with a total length of 1465 bp was cloned into pET28a vector for its heterologous cloning and expression in E. coli (Fig. 10). Polyhistidine-tag is an amino acid motif in proteins that typically consists of at least six histidines (His) residues which is often at the N- or C-terminus with spaced or attached to inserted protein. His-tag will be used to purify protein in further experiments. In addition, the His-tag is part of the pET28a vector structure.
Fig. 10
In silico cloning of the gene sequence of the final vaccine construct V3 into pET28a (+) expression vector. The target sequence was inserted between BglII (401) and ApaI (1334)
In silico cloning of the gene sequence of the final vaccine construct V3 into pET28a (+) expression vector. The target sequence was inserted between BglII (401) and ApaI (1334)
Discussion
The emergence of multidrug resistance (MDR) A. baumannii is recognized as a serious health concern in hospitals across the world (Savov et al. 2019). CarO from A. baumannii is a membrane porin that uptake small molecules such as l-ornithine, Carbapenem, and other basic amino acids (Labrador-Herrera et al. 2020; Tohidinia et al. 2020). In a study, researchers were demonstrated outer membrane vesicles (OMVs) of A. baumannii had able to induce protective immunity against A. baumannii. Among OMV proteins, CarO was identified as a highly immunogenic protein (McConnell et al. 2011). Currently, there is no definite cure to protect against the A. baumannii due to increase MDR. Therefore, it is essential to take preventive measures against it (Solanki and Tiwari 2018). The employment of bioinformatics tools in the field of vaccinology can be aiding the selection of potential vaccine candidates and simplify the optimization of the chosen immunogen (Gupta and Kumar 2020; Tohidinia and Sefid 2020).The entire proteome of the A. baumannii ATCC 19606 strain was retrieved from NCBI and the result of BLAST from various sequences of CarO showed it is specific to Acinetobacter species. Alignment of CarO sequences from various strains indicates that such vaccines are able to trigger antibodies with complete specificity for A. baumannii and CarO is a conserved protein among Acinetobacter species. Therefore, it can one of the best choices for vaccine design. A multi-epitope vaccine consists of several B and T epitopes with sensible linkers in order to avoid the generation of junctional epitopes and increase the antigen presentation process (Forouharmehr 2021; Shey et al. 2019). A multi-epitope vaccine can trigger both humoral and cell-mediated immunity (Ali et al. 2017). The cell-mediated immune response is mediated by T-cells whereas the humoral immune response is mediated by produced B-cells (Channappanavar et al. 2014; Kozakiewicz et al. 2013). Therefore, the correct and accurate selection of TCL, HTL, and B-cell epitopes to design a multi-epitope vaccine has a direct relationship with the immune system response.Approximately 300 epitopes of CarO protein were predicted by IEDB with recommended methods and NetCTL server to be MHC I peptides that can bind to a large number of HLA-A and HLA-B alleles with a high binding affinity. 350 epitopes of CarO protein were also predicted by using the MHC-II binding prediction tool of IEDB and NetMHCII server as MHC II peptides. But the most important epitopes were selected among the identified epitopes with certain scores mentioned in the method section and then their important characteristics were examined. One of the most important features considered for selecting CTL epitopes was the Vaxijen score. The higher Vaxijen score indicating the higher antigenicity of an amino acid sequence and is able to further stimulate the immune system (Saha et al. 2017). In this study, only CTL epitopes with non-allergenic and positive immunogenicity behavior were selected for further analysis. In addition, the characteristics of the allergenicity, antigenicity, toxicity, and mutation were examined for the selection of HTL epitopes. Finally, according to all these characteristics, sequences IAPYLGFGF, KRIGNGDTL, WQANPYVGL, and ALLWQANPY as CTL epitopes and GNGDTLSIDGKNYQQ, ANEIRNDNKYEWMPV, DIAPYLGFGFAPKIS, and YLGFGFAPKISKNWG as HTL epitopes were selected to participate in the structure of the vaccine. Amino acid sequences that are linear in shape are called continuous or linear epitopes (Abboud et al. 2009) which were predicted by ElliPro, ABCpred, Svmtrip, and BCpred servers. Through the epitope prediction servers carried out in this project we were able to compile 82 potential linear B cell epitopes in CarO protein. Among promising linear B cell epitopes, four B cell epitopes were selected based on their features such as antigenicity and allergenicity.In the construction of vaccines, the selected epitopes were connected by HEYGAEALERAG and GGGS linkers. Each of these constructions was connected to 4 different adjuvants with the EAAAK linker. Finally, after examining the allergenicity, antigenicity, and solubility behaviors vaccine construct 3 which had HBHA adjuvant was selected due to non-allergic in nature and high allergenicity and antigenicity score to prediction the second and third structures.To ensure effective binding between HLA molecules and vaccine construct 3, a docking study was performed. The vaccine construct 3 was subjected to the PatchDock server as a receptor and 6 different HLA alleles including 1A6A(HLA-DR B1*03:01), 3C5J(HLA-DR B3*02:02), 1H15(HLA-DR B5*01:01), 2FSE(HLA-DRB1*01:01), 2Q6W(HLA-DR B3*01:01), and 2SEB(HLA-DRB1*04:01) as a ligand were selected for docking analysis. All the selected HLA showed low binding energy with V3 which was biologically significant. HLA-DR B1*01:01 was found to be best considering the free binding energy. Moreover, to display interactions among human TLRs (TLR2 and TLR4) and the vaccine structure 3, HADDOCK server were applied for molecular docking. Docking results demonstrated that the multi-epitope vaccine we designed has the potential to bind the mention receptors. Also, the PDBsum server showed 22 and 23 hydrogen bonds between the vaccine and TLR2 and TLR4 respectively. Hydrogen bonds can increase the strength and stability of a vaccine-receptor interaction substantially (Fox et al. 2013). Physicochemical properties of the V3 sequence were analyzed using ProtParam server which showed nature of V3 is appropriate for vaccine construct. Finally, the vaccine construct V3 was reverse transcribed and adapted for E. coli strain K12 former to insertion within pET28a (+) vector for its cloning and expression. In our study, our in silico results were based on various databases. We suggest using model animals for further analysis and experimental validation of this designed vaccine.
Authors: Raquel Valencia; Luis A Arroyo; Manuel Conde; Josefa M Aldana; María-José Torres; Felipe Fernández-Cuenca; José Garnacho-Montero; José M Cisneros; Carlos Ortíz; Jerónimo Pachón; Javier Aznar Journal: Infect Control Hosp Epidemiol Date: 2009-03 Impact factor: 3.254
Authors: Peter W Rose; Bojan Beran; Chunxiao Bi; Wolfgang F Bluhm; Dimitris Dimitropoulos; David S Goodsell; Andreas Prlic; Martha Quesada; Gregory B Quinn; John D Westbrook; Jasmine Young; Benjamin Yukich; Christine Zardecki; Helen M Berman; Philip E Bourne Journal: Nucleic Acids Res Date: 2010-10-29 Impact factor: 16.971