| Literature DB >> 34960629 |
Gilma G Sánchez-Burgos1, Nallely M Montalvo-Marin1, Edgar R Díaz-Rosado1, Ernesto Pérez-Rueda2.
Abstract
Reverse vaccinology is an outstanding strategy to identify antigens with high potential for vaccine development. Different parameters of five prediction programs were used to assess their sensitivity and specificity to identify B-cell epitopes of Chikungunya virus (CHIKV) strains reported in the IEDB database. The results, based on the use of 15 to 20 mer epitopes and the polyproteins to which they belong, were compared to establish the best parameters to optimize the prediction of antigenic peptides of the Mexican strain CHIKV AJV21562.1. LBtope showed the highest specificity when we used the reported epitopes and polyproteins but the worst sensitivity with polyproteins; ABCpred had similar specificity to LBtope only with the epitopes reported and showed moderate specificity when we used polyproteins for the predictions. Because LBtope was more reliable in predicting true epitopes, it was used as a reference program to predict and select six novel epitopes of the Mexican strain of CHIKV according to prediction frequency, viral genome localization, and non-homology with the human proteome. On the other hand, six bioinformatics programs were used with default parameters to predict T-cell epitopes in the CHIKV strains AJV21562.1 and AJV21561.1. The sequences of the polyproteins were analyzed to predict epitopes present in the more frequent HLA alleles of the Mexican population: DQA1*03011, DQA1*0401, DQA1*0501, DQB1*0201, DQB1*0301, DQB1*0302, and DQB1*0402. Fifteen predicted epitopes in the non-structural and 15 predicted epitopes in the structural polyprotein (9- to 16-mers) with the highest scores of each allele were compared to select epitopes with at least 80% identity. Next, the epitopes predicted with at least two programs were aligned to the human proteome, and 12 sequences without identity with the human proteome were identified as potential antigenic candidates. This strategy would be useful to evaluate vaccine candidates against other viral diseases affecting the countries of the Americas and to increase knowledge about these diseases.Entities:
Keywords: B-cells; Chikungunya virus; LBtope; T-cells; epitopes; prediction programs
Mesh:
Substances:
Year: 2021 PMID: 34960629 PMCID: PMC8706625 DOI: 10.3390/v13122360
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Control epitopes used in this work. Epitopes with positive or negative immunoreactivity were selected from the IEDB database [20,21,22], Lum et al., 2013 [31], and Kam et al., 2014 [32]. aa, amino acids.
| Epitopes from the IEDB Databases | ||||
|---|---|---|---|---|
| No. | Negative Epitopes | Length in aa | Positive Epitopes | Length in aa |
| 1 | LAHCPDCGEGHSCHS | 15 | STKDNFNVYKATRPYLAHC | 19 |
| 2 | DCGEGHSCHSPVALE | 15 | PTEGLEVTWGNNEPYKYWPQLSTNGT | 26 |
| 3 | HSCHSPVALERIRNE | 15 | LLSMVGMAAGMCMCARRRCITPYELTPGATVPFL | 34 |
| 4 | PVALERIRNEATDGT | 15 | TDGTLKIQVSLQIGIKTDDSHDWTKLRYMDNHMPADAERAGL | 42 |
| 5 | RIRNEATDGTLKIQV | 15 | LTTTDKVINNCKVDQCHA | 18 |
| 6 | LKIQVSLQIGIKTDD | 15 | LTTTDKVINNCKVDQCHAAVTNHKKW | 26 |
| 7 | SLQIGIKTDDSHDWT | 15 | PTVTYGKNQVIMLLYPDHPTLLSYRN | 26 |
| 8 | IKTDDSHDWTKLRYM | 15 | STKDNFNVYKATRPY | 15 |
| 9 | SHDWTKLRYMDNHMP | 15 | CTITGTMGHFILARC | 15 |
| 10 | KLRYMDNHMPADAER | 15 | NHKKWQYNSPLVPRN | 15 |
| 11 | DNHMPADAERAGLFV | 15 | HIPFPLANVTCRVPK | 15 |
| 12 | ADAERAGLFVRTSAP | 15 | VTYGKNQVIMLLYPD | 15 |
| 13 | AGLFVRTSAPCTITG | 15 | LEVTWGNNEPYKYWP | 15 |
| 14 | RTSAPCTITGTMGHF | 15 | GTAHGHPHEIILYYY | 15 |
| 15 | ILARCPKGETLTVGF | 15 | HPHEIILYYYELYPT | 15 |
| 16 | TMGHFILARCPKGET | 15 | KDIVTKITPEGAEEW | 15 |
| 17 | PKGETLTVGFTDSRK | 15 | LLQASLTCSPHRQRR | 15 |
| 18 | LTVGFTDSRKISHSC | 15 | EITVMSSEVLPSTNQEYI | 18 |
| 19 | ISHSCTHPFHHDPPV | 15 | HVKGTIDHPVLSKLKFTK | 18 |
| 20 | EVVLTVPTEGLEVTW | 15 | KPGKRQRMALKLEADRLF | 18 |
| 21 | IGREKFHSRPQHGKE | 15 | NIPISIDIPNAAFIRTSD | 18 |
| 22 | FHSRPQHGKELPCST | 15 | PISASFTPFDHKVVIHRG | 18 |
| 23 | QHGKELPCSTYVQST | 15 | TWNSKGKTIKTTPEGTEE | 18 |
| 24 | LPCSTYVQSTAATTE | 15 | YNYDFPEYGAMKPGAFGD | 18 |
| 25 | YVQSTAATTEEIEVH | 15 | ||
| 26 | MPPDTPDRTLMSQQS | 15 | ||
| 27 | PDRTLMSQQSGNVKI | 15 | ||
| 28 | MSQQSGNVKITVNGQ | 15 | ||
| 29 | TVNGQTVRYKCNCGG | 15 | ||
| 30 | TVRYKCNCGGSNEGL | 15 | ||
| 31 | SNEGLTTTDKVINNC | 15 | ||
| 32 | TTTDKVINNCKVDQC | 15 | ||
| 33 | QYNSPLVPRNAELGD | 15 | ||
Thresholds of each program with the highest amount of TP and the lowest amount of TN control epitopes. The analysis of positive and negative control epitopes was made with each program using different thresholds and lengths. The number of predicted TP, TN, FP, and FN sequences were used to calculate sensitivity (Sn) and specificity (Sp).
| Program | ABCpred | FBCPred | LBTope | BCPred | AAP |
|---|---|---|---|---|---|
| Thresholds | 0.85 | 0.85 | 0.7 | 0.8 | 0.7 |
| Sn | 0.208 | 0.458 | 0.167 | 0.417 | 0.500 |
| Sp | 0.909 | 0.727 | 0.939 | 0.727 | 0.515 |
| TP | 5 (20.8%) | 11 (45.8%) | 4 (16.7%) | 10 (41.7%) | 12 (50%) |
| FN | 19 (79.2%) | 13 (54.2%) | 20 (83.3%) | 14 (58.3%) | 12 (50%) |
| TN | 30 (90.9%) | 24 (72.7%) | 31 (93.9%) | 24 (72.7%) | 17 (51.5%) |
| FP | 3 (9.1%) | 9 (27.3%) | 2 (6.1%) | 9 (27.3%) | 16 (48.5%) |
Thresholds of each program with the highest amount of TP and the least amount of TN control epitopes predicted from structural polyproteins. The analysis of positive and negative control epitopes was made with each program using different thresholds and lengths. The number of predicted TP, TN, FP, and FN sequences was used to calculate Sn and (Sp).
| Program | ABCpred | FBCPred | LBTope | BCPred |
|---|---|---|---|---|
| Thresholds | 0.85 | 0.85 | 0.7 | 0.8 |
| Sn | 0.458 | 0.667 | 0.208 | 0.542 |
| Sp | 0.606 | 0.636 | 0.909 | 0.636 |
| TP | 11 (45.8%) | 16 (66.7%) | 5 (20.8%) | 13 (54,2%) |
| FN | 13 (54.2%) | 8 (33.3%) | 19 (79.2%) | 11 (45.8%) |
| TN | 20 (60.6%) | 21 (63.6%) | 30 (90.9%) | 21 (63.6%) |
| FP | 13 (39.4%) | 12 (36.4%) | 3 (9.1%) | 12 (36.4%) |
Predicted B-cell epitopes of CHIKV Mexican strain. Epitopes of the structural polyprotein AJV21562.1 were predicted with the programs LBtope (dataset variable_non_redundant), ABCPred, FBCPred, and BCPred (see Materials and Methods section (Section 2)).
| ID | Protein | Position (Start) | Position (End) | Sequence |
|---|---|---|---|---|
| pep2 | N/A | 61 | 83 | PRKNRKNKKQKQKQQAPRNNTNQ |
| pep25 | Alpha E2 | 451 | 471 | THPFHHDPPVIGREKFHSRPQ |
| pep157 | Alpha E2 | 618 | 643 | TLLSYRNMGEEPNYQEEWVTHKKEIR |
| pep91 | Alpha E1 | 815 | 834 | VIPNTVGVPYKTLVNRPGYS |
| pep107 | Alpha E1 | 999 | 1018 | PPFGAGRPGQFGDIQSRTPE |
| pep114 | Alpha E2 | 646 | 665 | VPTEGLEVTWGNNEPYKYWPQ |
Predicted T-cell epitopes of CHIKV Mexican strain. Epitopes of the structural AJV21562.1 and non-structural AJV21561.1 polyproteins of CHIKV were predicted with the programs IEDB Analysis Resource, NetMHCIIpan 3.1, RANKPEP, PREDIVAC, ProPed, and EpiTOP 1.0 (see Materials and Methods section (Section 2)).
| Epitope ID | Sequence | Position | Protein | Allele |
|---|---|---|---|---|
| EC12-PFM | PFMYNAMAGAYPSYST | 182–197 | nsP1 | HLA-DQA1*05:01/DQB1*03:01 |
| EC6-CST | CSTYAQSTAATAEEIEVHM | 478–496 | E2 | HLA-DQA1*04:01/DQB1*04:02 |
| EC2-LQA | LQAAQEDVQVEIDVEQLED | 513–531 | nsP1 | HLA-DQA1*03:01/DQB1*03:02 |
| EC18-QPL | QPLFWMQALIPLAAL | 763–777 | E1 | HLA-DQA10501-DQB10201 |
| EC17-RPG | RPGYSPMVLEMELLSVTLE | 830–848 | E1 | HLA-DQA10501-DQB10201 |
| EC20-MVL | MVLEMELLSVTLEPTL | 836–850 | E1 | HLA-DQA10501-DQB10201 |
| EC15-EFA | EFASAYRAHTASASAKLRV | 926–944 | E1 | HLA-DQA1*05:01/DQB1*03:01 |
| EC38-MTN | MTNAVTIREAEIEVE | 1142–1156 | E1 | HLA-DQA1*03:01/DQB1*03:02 |
| EC4-GVG | GVGLVVAVAALILIV | 1225–1239 | E1 | HLA-DQA1*05:01/DQB1*03:01 |
| EC30-NQL | NQLNAAFVGQATRAG | 1317–1331 | nsP2 | HLA-DQA1*05:01/DQB1*03:01 |
| EC31-PSD | PSDLDADAPALEPAL | 1689–1704 | nsP3 | HLA-DQA1*03:01/DQB1*03:02 |
| EC27-VHT | VHTLFDMSAEDFDAI | 2200–2216 | nsP4 | HLA-DQA10501-DQB10201 |
Figure 1Three-dimensional localization of the predicted B-cell epitopes of CHIKV strain AJV21562.1. The pep25 (blue), pep157 (deep purple), and pep114 (light blue) are in the E2 protein. The pep91 (light orange) and pep107 (orange) are in the E1 protein.
Figure 2Three-dimensional localization of the predicted T-cell epitopes of Chikungunya virus strain AJV21562.1. The epitopes EC17-RPG (orange), EC38-MTN (red), EC20-MVL (cyan), and EC15-EFA (blue) were localized in the E1 protein.