| Literature DB >> 30928513 |
Mokhtar Nosrati1, Mandana Behbahani1, Hassan Mohabatkar2.
Abstract
Crimean-Congo hemorrhagic fever (CCHF) is considered one of the major public health concerns with case fatality rates of up to 80%. Currently, there is no effective approved vaccine for CCHF. In this study, we used a computer-aided vaccine design approach to develop the first multi-epitope recombinant vaccine for CCHF. For this purpose, linear B-cell and T-cell binding epitopes from two structural glycoproteins of CCHF virus including Gc and Gn were predicted. The epitopes were further studied regarding their antigenicity, allergenicity, hydrophobicity, stability, toxicity and population coverage. A total number of seven epitopes including five T-cell and two B-cell epitopes were screened for the final vaccine construct. Final vaccine construct composed of 382 amino acid residues which were organized in four domains including linear B-cell, T-cell epitopes and cholera toxin B-subunit (CTxB) along with heat labile enterotoxin IIc B subunit (LT-IIc) as adjuvants. All the segments were joined using appropriate linkers. The physicochemical properties as well as the presence of IFN-γ inducing epitopes in the proposed vaccine, was also checked to determining the vaccine stability, solubility and its ability to induce cell-mediated immune responses. The 3D structure of proposed vaccine was subjected to the prediction of computational B-cell epitopes and molecular docking studies with MHC-I and II molecules. Furthermore, molecular dynamics stimulations were performed to study the vaccine-MHCs complexes stability during stimulation time. The results suggest that our proposed vaccine was stable, well soluble in water and potentially antigenic. Results also demonstrated that the vaccine can induce both humoral and cell-mediated immune responses and could serve as a promising anti-CCHF vaccine candidate.Entities:
Keywords: Computer-aided vaccine design; Crimean-Congo hemorrhagic fever; Multi-epitope vaccine
Year: 2019 PMID: 30928513 PMCID: PMC7106074 DOI: 10.1016/j.jbi.2019.103160
Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 6.317
Fig. 1Schematic presentation of the procedures used for multi-epitope vaccine development for CCHFV.
Results of CCHFV proteins BLAST and antigenicity prediction. Glycoprotein precursor had highest average antigenicity (1.3096) and VaxiJen score (5207), while sequence identity of the glycoprotein between different serotypes of CCHFV was least value in compare to other studied antigens.
| Protein | Average antigenicity | VaxiJen score | Minimum identity (%) | Maximum identity (%) |
|---|---|---|---|---|
| Nucleoprotein | 1.0138 | 0.3330 | 92 | 99 |
| Glycoprotein precursor | 1.3096 | 0.5207 | 84 | 99 |
| RNA polymerase | 1.0324 | 0.4348 | 96 | 99 |
Fig. 2Antigenicity plot of glycoprotein precursor from CCHFV. There are 72 antigenic determinants in the glycoproteins. Six regions including 414–469, 654–720, 798–841, 1012–1043, 1290–1325 and 1441–1470 were found to be high antigenic potential with average antigenic propensity more than 1.15.
Molecular features of glycoprotein precursor from CCHFV. After processing of the precursor six structural and non-structural elements and a signal peptide are originated.
| Feature key | Position(s) | Description | Sequence length |
|---|---|---|---|
| Short sequence | 1–18 | Signal peptide | 18 |
| Chain | 19–1684 | Envelopment polyprotein | 1666 |
| Chain | 19–247 | Mucin-like variable region | 229 |
| Chain | 248–519 | GP38 | 272 |
| Chain | 520–842 | Glycoprotein N | 323 |
| Chain | 843–1040 | Non-Structural protein M | 198 |
| Chain | 1041–1684 | Glycoprotein C | 644 |
Results of T-cell epitope prediction. The labled sequences with (*) are shared T-cell epitopes between MHC-I and II risterected epitopes. The share epitopes with high affinity to more MHC-I and MHC-II alleles were screened for further evaluation. A total number of 44 and 47 MHC-I restricted epitopes were predicted for Gc and Gn respectively which among them 12 and 13 shared epitopes were selected for further evaluation.
| MHC-I restricted epitopes in Gc | MHC-I restricted epitopes in Gn | Number of MHC-I bonded alleles for predicted epitopes | Number of MHC-II bonded alleles for predicted epitopes | ||
|---|---|---|---|---|---|
| Gc | Gn | Gc | Gn | ||
| LSEPRNIQQ | TAEIHGDNY | 1 | 1 | 0 | 0 |
| FLFLAPFIL* | ILCKAIFYL* | 5 | 4 | 46 | 22 |
| FILLILFFM* | LNLERIPWV* | 2 | 2 | 46 | 13 |
| KLPPEIITL | YLLIIVGTL | 2 | 7 | 0 | 41 |
| KVNGHLIHK | YVITCILCK | 2 | 3 | 0 | 50 |
| MYSPVFEYL* | NYGGPGDKI | 4 | 2 | 2 | 0 |
| GLFKYRHLK | FLFWFSFGY* | 2 | 6 | 0 | 38 |
| ILFFMFGWR* | RLTSDGLAR | 1 | 1 | 5 | 0 |
| ESIMKLEER | ESTGVALKR | 1 | 2 | 0 | 0 |
| CVELTSQER | YVITCILCK* | 2 | 3 | 0 | 50 |
| VKWKVEYIK* | GKMAIYICR | 1 | 1 | 3 | 0 |
| FLAPFILLI* | LLRTETAEI* | 2 | 1 | 25 | 48 |
| ERLADRRIA | KRLKQYREL | 1 | 6 | 0 | 0 |
| RRTRGLFKY | CRQGYCLRI | 3 | 2 | 0 | 0 |
| GRSESIMKL | ARHVIQCPK | 4 | 1 | 0 | 0 |
| EDASESKLL | IPKGTGDIL | 1 | 6 | 0 | 0 |
| IHVDEPDEL* | CDTSCEIMI | 2 | 2 | 11 | 0 |
| NHASFVNLL | NHPKTTMAF | 3 | 1 | 0 | 0 |
| LQVYHIGNL* | QHFLKDNLI | 1 | 2 | 26 | 7 |
| KEWPHSRNW | QQHFLKDNL | 1 | 3 | 0 | 0 |
| LESVKSFFY* | QEGRGHVKL | 1 | 2 | 13 | 0 |
| EPDELTVHV | CEIMIPKGT | 2 | 2 | 0 | 0 |
| SGISCKVRI | IPLLGKMAI* | 1 | 7 | 0 | 40 |
| GAGEITVLV | IPWVVRKLL | 1 | 8 | 0 | 0 |
| QQKLPPEII | SATGKNCEI | 1 | 3 | 0 | 0 |
| WPSCTYTGV | FGYVITCIL* | 4 | 6 | 0 | 36 |
| FGWRILFCF* | FWFSFGYVI* | 3 | 5 | 19 | 9 |
| TEAIVCVEL | YLLIIVGTL* | 2 | 7 | 0 | 37 |
| GEITVLVEV | IPWVVRKLL | 1 | 8 | 0 | 0 |
| ILLILFFMF* | KTTMAFLFW | 1 | 1 | 23 | 0 |
| EPRNIQQKL | QEGRGHVKL | 4 | 2 | 0 | 0 |
| HPRIEEGFF | CETTPVNAI | 3 | 2 | 0 | 0 |
| EPRNIQQKL | RLGSELGCY | 4 | 1 | 0 | 0 |
| VFMGIFLFL* | ASRLTSDGL | 3 | 1 | 13 | 0 |
| GYRRIIEKL | HPKTTMAFL | 4 | 6 | 0 | 0 |
| NHASFVNLL | INRVRSFKL* | 3 | 1 | 0 | 19 |
| MPKTSLCFY | KRLKQYREL | 4 | 6 | 0 | 0 |
| KNLLNSTSL | HPKTTMAFL | 2 | 6 | 0 | 0 |
| ASFVNLLNI | YLLIIVGTL* | 2 | 7 | 0 | 37 |
| RGLFKYRHL | VPVKCRQGY* | 3 | 4 | 0 | 10 |
| – | ITICNGSTI* | – | 2 | – | 9 |
| – | LGCPKIPLL | – | 2 | – | 0 |
| – | EETELYLNL | – | 4 | – | 0 |
| – | NYGGPGDKI | – | 2 | – | 0 |
| – | SEEPSDDCI | – | 3 | – | 0 |
| – | IPWVVRKLL | – | 8 | – | 0 |
The results of final T-cell epitope screening. A total number of three and two T-cell epitopes (labeled with *) were screened from Gc and Gn respectively based on their antigenicity, hydrophobicity, allergenicity and population coverage. Generally, predicted epitopes from Gc have better properties than Gn derived ones.
| Protein | T-cell epitopes | Antigenicity | Hydrophobicity (%) | Allergenicity | Population coverage (%) | Final decision |
|---|---|---|---|---|---|---|
| Gc | FLFLAPFIL | 0.2586 | 100 | No | 29.10 | – |
| FILLILFFM | 0.2343 | 100 | No | 10.06 | – | |
| MYSPVFEYL | 0.0270 | 55.56 | No | 11.21 | – | |
| ILFFMFGWR | 1.6738 | 77.78 | Yes | 71.04 | – | |
| VKWKVEYIK | −0.2568 | 44.44 | Yes | 24.32 | – | |
| FLAPFILLI | 0.6746 | 100 | No | 76.69 | * | |
| IHVDEPDEL | −0.0627 | 44.44 | Yes | 32.05 | – | |
| LQVYHIGNL | 1.0338 | 44.44 | No | 59.74 | * | |
| LESVKSFFY | −0.1855 | 44.44 | No | 23.12 | – | |
| FGWRILFCF | 0.6046 | 66.67 | No | 89.12 | * | |
| ILLILFFMF | 0.3806 | 100 | No | 24.76 | – | |
| VFMGIFLFL | −0.0320 | 88.89 | No | 13.12 | – | |
| Gn | ILCKAIFYL | −0.1887 | 66.67 | No | 62.54 | – |
| LNLERIPWV | 1.3691 | 66.67 | Yes | 88.54 | – | |
| FLFWFSFGY | 1.3227 | 66.67 | Yes | 33.32 | – | |
| YVITCILCK | −0.1778 | 44.44 | No | 19.54 | – | |
| LLRTETAEI | 0.0964 | 44.44 | No | 79.54 | * | |
| IPLLGKMAI | 0.3897 | 77.78 | No | 43.22 | – | |
| FGYVITCIL | −0.0073 | 55.56 | Yes | 27.12 | – | |
| FWFSFGYVI | 1.5147 | 66.67 | Yes | 41.05 | – | |
| YLLIIVGTL | 0.6623 | 66.67 | No | 45.32 | * | |
| INRVRSFKL | −0.3028 | 44.44 | Yes | 18.03 | – | |
| YLLIIVGTL | −0.5074 | 66.67 | No | 36.12 | – | |
| VPVKCRQGY | 1.2904 | 33.33 | No | 35.32 | – | |
| ITICNGSTI | −0.2864 | 33.33 | No | 10.04 | – | |
Results of linear B-cell epitope prediction using three different servers. The score and sequence similarity were considered as determining factors for comparison of the server outputs and epitope selection. A total number of three and two epitopes from Gc and Gn were screened for final evaluation respectively. Selected epitopes are labeled with (*).
| Protein | Epitope | Score | Sequence similarity (%) | |||||
|---|---|---|---|---|---|---|---|---|
| BCPREDS | ABCpred | SVMTrip | BepiPred-2.0 | ABCpred | SVMTrip | BepiPred-2.0 | ||
| Gc | DELTVHVKSDDPDVVAASSS* | 0.998 | 0.85 | 0.516 | High | 85 | 60 | 60 |
| GDRQVGEWPKATCTGDCPER | 0.996 | 0.69 | – | High | 70 | – | 90 | |
| EETGYRRIIEKLNNKKGKNK | 0.994 | 0.74 | 0.386 | Low | 90 | 25 | 45 | |
| IEHKGTIIGKQNSTCTAKAS | 0.978 | 0.79 | 0.238 | High | 95 | 20 | 50 | |
| KLQSCTHGVPGDLQVYHIGN* | 0.957 | 0.69 | 0.528 | High | 100 | 85 | 55 | |
| NMGDWPSCTYTGVTQHNHAS | 0.935 | 0.74 | – | Low | 60 | – | 50 | |
| RNWRCNPTWCWGVGTGCTCC | 0.915 | 0.76 | – | Low | 70 | – | 25 | |
| LIHKIEPHFNTSWMSWDGCD* | 0.915 | 0.68 | 0.295 | High | 90 | 20 | 100 | |
| APWGAINVQSTYKPTVSTAN | 0.901 | 0.87 | 0.238 | Intermediate | 90 | 20 | 50 | |
| SCSEEDTKKCVNTKLEQPQS | 0.849 | 0.70 | – | High | 65 | – | 100 | |
| QKLPPEIITLHPRIEEGFFD | 0.835 | 0.87 | – | High | 90 | – | 95 | |
| Gn | IHGDNYGGPGDKITICNGST | 0.999 | 0.90 | – | Intermediate | 75 | – | 55 |
| GLARHVIQCPKRKEKVEETE* | 0.995 | 0.86 | 0.603 | High | 100 | 60 | 100 | |
| RLKQYRELKPQTCTICETTP | 0.993 | 0.60 | – | Intermediate | 65 | – | 45 | |
| PKGTGDILVDCSGGQQHFLK | 0.959 | 0.74 | – | Intermediate | 80 | – | 55 | |
| IDLGCPKIPLLGKMAIYICR | 0.908 | 0.85 | – | Low | 85 | – | 30 | |
| VDQRLGSELGCYTINRVRSF* | 0.841 | 0.65 | 0.533 | High | 95 | 50 | 85 | |
The final screening of selected linear B-cell epitopes from Gc and Gn. One linear B-cell epitope from each antigen was selected for final vaccine construct. The selected epitopes were found to be good water soluble, high antigenic (antigenicity score more than 0.8) and without allergenicity and toxicity potential.
| Epitope | Antigenicity | Allergenicity | Toxicity | Solubility |
|---|---|---|---|---|
| DELTVHVKSDDPDVVAASSS* | 0.8272 | No | No | Good |
| KLQSCTHGVPGDLQVYHIGN | 0.2226 | Yes | No | Poor |
| LIHKIEPHFNTSWMSWDGCD | 0.2253 | No | No | Poor |
| GLARHVIQCPKRKEKVEETE | 0.5288 | Yes | No | Good |
| VDQRLGSELGCYTINRVRSF* | 0.8924 | No | No | Good |
Fig. 3Graphical presentation of the purposed vaccine for CCHF. The vaccine is organized in four different domains including CTxB and LT-IIc as adjuvants, Linear B-cell epitopes (blue regions) and T-cell epitopes (pink regions). The used linker for merging mentioned domains were GPGPGPG (red regions) and EAAAK (yellow regions). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Amino acid sequence and composition of final vaccine construct. The Amino acid sequence and composition can affect the protein 2D and 3D structures as well as its function. The results reflected that the protein is composed by 382 amino acids. Glycine and Tryptophan are the most and the least residues in the final vaccine construct.
Fig. 5Secondary structure of the final vaccine construct predicted by GOR4 method. Most amino acid residues of the vaccine were organized in random coil (48.69%) followed by alpha helix (29.06) and extended strand (22.25%) respectively.
Results of the model refinement. The low 3Drefine, MolProbity and RWplus scores indicate better quality model. On the contrary, higher GDT-TS, RMSD and GDT-HA scores indicate conservative refinement and higher quality. Based on the mentioned parameters, model No.1 was selected as the best-refined model.
| Model | 3Drefine score | GDT-TS | GDT-HA | RMSD | RWplus | MolProbity |
|---|---|---|---|---|---|---|
| 1 | 29,866 | 0.9993 | 0.9666 | 0.376 | −53,727.135 | 3.461 |
| 2 | 30,223.2 | 1 | 0.9758 | 0.351 | −53,610.179 | 3.413 |
| 3 | 30,879 | 1 | 0.9863 | 0.318 | −53,450.421 | 3.390 |
| 4 | 31,941 | 1 | 0.9935 | 0.275 | −53,327.261 | 3.381 |
| 5 | 35,185 | 1 | 0.9993 | 0.203 | −53,166.064 | 3.339 |
Fig. 63D structure of primary (a) and refined model (b) of the final vaccine construct. Ramachandran plots showed that in the raw model 68.7%, 22.1% and 9.2% of residues were located in favored, allowed, and outlier regions, respectively (Fig. 5-a) while, in the refined model, 73.4%, 18.9%, and 7.6% of residues were positioned in favored, allowed, and outlier regions, respectively.
Conformational B-cell epitopes from 3D model of proposed vaccine. A total number of seven epitopes were predicted. Epitopes No. 3 and 4 were considered as the broadest and smallest conformational B- cell epitopes with 78 and 5 amino acid residues.
| Number | Residues | Number of residues | score |
|---|---|---|---|
| 1 | I2, L4, F6, G7, V8, F9, F10, T11, V12, L13, L14, S15, S16, A17, Y18, A19, H20, G21, T22, P23, Q24, N25, I26, T27, D28, L29, C30, A31, E32, H34, N35 | 31 | 0.787 |
| 2 | V308, N309, I310, S311, S312, D313, V314, N315, K316, D317, S318, K319, G320, I321, Y322, I323, S324, S325, S326, A327, G328, K329, T330, F332, I333 | 25 | 0.746 |
| 3 | D80, Q82, K83, K84, A85, I86, E87, R88, M89, K90, D91, T92, L93, R94, I95, A96, Y97, L98, T99, E100, A101, K102, V103, E104, K105, L106, C107, V108, W109, N110, N111, K112, T113, P114, H115, A116, A117, A118, A119, I120, S121, M122, A123, N124, E125, A126, A127, A128, K129, D130, E131, L132, K137, S138, D139, D140, P141, D142, V143, V144, A145, A146, G181, P182, G183, F184, L185, A186, P187, F188, I189, L190, L191, I192, G193, P194, G195, P196 | 78 | 0.714 |
| 4 | Y339, P340, D341, N342, Y343 | 5 | 0.690 |
| 5 | T282, Y283, A284, G285, V286, S287, K288, T289, F290, K291, D292, K293, C294, A295 | 14 | 0.688 |
| 6 | L334, S345, N346, E347, M348, R349, K350, I351, A352, M353, A354, A355, V356, L357, S358, N359, V360, R361, V362, N363, L364, C365, A366, S367, E368, A369, Y370, T371, P372, N373, H374, V375, W376, A377, I378, E379, L380, A381, P382 | 39 | 0.686 |
| 7 | I240, G241, P242, G243, P244, G245, P246, G247, Y248, L249, L250, I251, I252, V253, G254 | 15 | 0.592 |
Fig. 7Predicted conformational B-cell epitopes of the final vaccine construct. A total number of seven conformational B-cell epitopes were predicted from the vaccine, which included 207 out of 382 residues.
The predicted IFN-γ inducing epitopes from the proposed vaccine. A total number of 40 positive IFN-γ inducing epitopes with a score of greater than or equal to one were predicted from the vaccine. The amino acid residues of the vaccine, which are located in 126–141 region showed highest score (1.21).
| Number | Sequence | Start-End | score |
|---|---|---|---|
| 1 | GTPQNITDLCAEYHN | 20–35 | 1 |
| 2 | TPQNITDLCAEYHNT | 21–36 | 1 |
| 3 | PQNITDLCAEYHNTQ | 22–37 | 1 |
| 4 | QNITDLCAEYHNTQI | 23–38 | 1 |
| 5 | NITDLCAEYHNTQIH | 24–39 | 1 |
| 6 | ITDLCAEYHNTQIHT | 25–40 | 1.02 |
| 7 | TDLCAEYHNTQIHTL | 26–41 | 1.09 |
| 8 | NDKIFSYTESLAGKR | 41–56 | 1 |
| 9 | DKIFSYTESLAGKRE | 42–57 | 1 |
| 10 | KIFSYTESLAGKREM | 43–58 | 1 |
| 11 | IFSYTESLAGKREMA | 44–59 | 1 |
| 12 | FSYTESLAGKREMAI | 45–60 | 1 |
| 13 | SYTESLAGKREMAII | 46–61 | 1 |
| 14 | YTESLAGKREMAIIT | 47–62 | 1 |
| 15 | AAAKDELTVHVKSDD | 125–140 | 1 |
| 16 | AAKDELTVHVKSDDP | 126–141 | 1.21 |
| 17 | AKDELTVHVKSDDPD | 127–142 | 1 |
| 18 | KDELTVHVKSDDPDV | 128–143 | 1 |
| 19 | LTVHVKSDDPDVVAA | 131–146 | 1 |
| 20 | TVHVKSDDPDVVAAS | 132–147 | 1 |
| 21 | VHVKSDDPDVVAASS | 133–148 | 1 |
| 22 | HVKSDDPDVVAASSS | 134–149 | 1 |
| 23 | AGVSKTFKDKCASTT | 283–298 | 1 |
| 24 | SVQLVNISSDVNKDS | 303–318 | 1 |
| 25 | VQLVNISSDVNKDSK | 304–319 | 1 |
| 26 | QLVNISSDVNKDSKG | 305–320 | 1 |
| 27 | LVNISSDVNKDSKGI | 306–321 | 1 |
| 28 | VNISSDVNKDSKGIY | 307–322 | 1 |
| 29 | NISSDVNKDSKGIYI | 308–323 | 1 |
| 30 | IPGGQYYPDNYLSNE | 332–347 | 1 |
| 31 | PGGQYYPDNYLSNEM | 333–348 | 1 |
| 32 | GGQYYPDNYLSNEMR | 334–349 | 1 |
| 33 | GQYYPDNYLSNEMRK | 335–350 | 1 |
| 34 | QYYPDNYLSNEMRK | 336–351 | 1 |
| 35 | YYPDNYLSNEMRKIA | 337–352 | 1.05 |
| 36 | YPDNYLSNEMRKIAM | 338–353 | 1 |
| 37 | PDNYLSNEMRKIAMA | 339–354 | 1 |
| 38 | DNYLSNEMRKIAMAA | 340–355 | 1 |
| 39 | NYLSNEMRKIAMAAV | 341–356 | 1 |
| 40 | FTVLLSSAYAHGTPQ | 9–24 | 1 |
Fig. 8RMSD plots of docked vaccine-MHCs complexes. The results indicated the stable microscopic interaction between the vaccine and MHCs molecules.
Fig. 9Rg plots of docked the vaccine-MHCs complexes. The results reflected that the complex were compact and stable during stimulations.