| Literature DB >> 34931117 |
Zeynep Banu Ozger1, Pınar Cihan2.
Abstract
B-cell epitope prediction research has received growing interest since the development of the first method. B-cell epitope identification with the aid of an accurate prediction method is one of the most important steps in epitope-based vaccine development, immunodiagnostic testing, antibody production, disease diagnosis, and treatment. Nevertheless, using experimental methods in epitope mapping is very time-consuming, costly, and labor-intensive. Therefore, although successful predictions with in silico methods are very important in epitope prediction, there are limited studies in this area. The aim of this study is to propose a new approach for successfully predicting B-cell epitopes for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). In this study, the SARS-CoV B-cell epitope prediction performances of different fuzzy learning classification models genetic cooperative competitive learning (GCCL), fuzzy genetics-based machine learning (GBML), Chi's method (CHI), Ishibuchi's method with weight factor (W), structural learning algorithm on vague environment (SLAVE) and the state-of-the-art ensemble fuzzy classification model were compared. The obtained results showed that the proposed ensemble approach has the lowest error in SARS-CoV B-cell epitope estimation compared to the base fuzzy learners (average error rates; ensemble fuzzy=8.33, GCCL=30.42, GBML=23.82, CHI=29.17, W=46.25, and SLAVE=20.42). SARS-CoV and SARS-CoV-2 have high genome similarities. Therefore, the most successful method determined for SARS-CoV B-cell epitope prediction was used in SARS-CoV-2 cell epitope prediction. Finally, the eventual B-cell epitope prediction results obtained for SARS-CoV-2 with the ensemble fuzzy classification model were compared with the epitope sequences predicted by the BepiPred server and immunoinformatics studies in the literature for the same protein sequences according to VaxiJen 2.0 scores. We hope that the developed epitope prediction method will help design effective vaccines and drugs against future outbreaks of the coronavirus family, especially SARS-CoV-2 and its possible mutations.Entities:
Keywords: B-cell; Epitope; Fuzzy learning; SARS-CoV; SARS-CoV-2; Spike protein
Year: 2021 PMID: 34931117 PMCID: PMC8673934 DOI: 10.1016/j.asoc.2021.108280
Source DB: PubMed Journal: Appl Soft Comput ISSN: 1568-4946 Impact factor: 6.725
Fig. 1Continues and discontinues B-Cell [7].
Fig. 2Structure of SARS-CoV-2 [12].
Fig. 3Phylogenetic tree of beta coronaviruses [14].
Fig. 4General framework of the ensemble fuzzy classification model.
Peptid and protein based features at datasets.
| Feature | Description |
|---|---|
| Chou–Fasman | Peptide feat. Relative frequency analysis on the basis of amino acids for tertiary structural elements. Given here for B-Turn. |
| Emini | Peptide feat, relative surface accessibility, a measure of residue solvent exposure. |
| Kolaskar–Tongaonkar | Peptide feat. Antigenicity, antigenic propensity of residues. |
| Parker | Peptide feat. A measure of hydrophobicity of peptide. |
| Isoelectric-point | Protein feat. pH value of the amino acid in an electric field. |
| Aromaticity | Protein feat. A factor for protein fragment solubility. |
| Hydrophobicity | Protein feat. A measure of the degree of affinity between water and the side chain of an amino acid. |
| Stability | Protein feature. |
Fig. 5The proposed ensemble fuzzy classification model.
Fig. 6Prediction process for SARS-CoV-2.
The parameters of fuzzy models.
| Parameter | Description | Value |
|---|---|---|
| popu.size | Population size | GCCL:30, GBML:10 |
| num.class | Number of classes | For all methods:2 |
| num.labels | Number of linguistic terms | W:11, CHI:5, GCCL:9, GBML:7, SLAVE:7 |
| persen_cross | Probability of crossover | GCCL:0.8, GBML:0.9, SLAVE:0.8 |
| persen_mutant | Probability of mutation | GCCL:0.4, GBML:0.2, SLAVE:0.4 |
| max.gen | Maximum number of generations | GCCL:150, GBML:10, SLAVE:40 |
| type.mf | The type of the shape of the membership function | W:Gaussian, CHI:Triangle |
| type.tnorm | The type of the tnorm | W:min, CHI:min |
| type.snorm | The type of the snorm | W:sum, CHI:max |
| type. implication. func | Type of implication functions | W:Dienes Recher, CHI:Zadeh |
| max.num.rule | Maximum number of rules | GBML:10 |
| p.dcare | A probability of “don’t care” attributes occurred | GBML:0.5 |
| p.gccl | A probability of GCCL process occurred | GBML:0.4 |
| max.iter | Maximum number of iterations | SLAVE:30 |
| k.lower | A lower bound of the noise threshold | SLAVE:0 |
| k.upper | A value between 0 and 1 representing the level of generalization | SLAVE:0.8 |
Fig. 7Correlation, density and 2D density plot of independent variables.
Fig. 8Train size tuning for negative class samples.
Classification errors for SARS-CoV.
| Method | Test1 | Test2 | Test3 | Test4 | Test5 | Test6 | Avg. | Sig |
|---|---|---|---|---|---|---|---|---|
| CHI | 17.5 | 37.5 | 25 | 27.5 | 22.5 | 20 | 29.17 | |
| GBML | 17.5 | 15 | 32.5 | 15 | 20 | 22.5 | 23.82 | |
| GCCL | 22.5 | 55 | 30 | 12.5 | 30 | 32.5 | 30.42 | |
| SLAVE | 27.5 | 17.5 | 7.5 | 25 | 22.5 | 22.5 | 20.42 | |
| W | 52.5 | 52.5 | 52.5 | 27.5 | 45 | 47.5 | 46.25 | |
| Ensemble fuzzy | 7.5 | 12.5 | 7.5 | 10 | 7.5 | 5 | 8.33 |
Prediction results for SARS-CoV-2.
| Epitope | Number of predicted epitopes | |||
|---|---|---|---|---|
| Length | Dataset | 4V | 5V | 6V |
| 5 | 1277 | 776 | 642 | 325 |
| 6 | 1276 | 201 | 136 | 70 |
| 7 | 1275 | 229 | 157 | 67 |
| 8 | 1274 | 265 | 185 | 81 |
| 9 | 1273 | 287 | 194 | 88 |
| 10 | 1272 | 294 | 196 | 82 |
| 11 | 1271 | 313 | 206 | 106 |
| 12 | 1270 | 330 | 219 | 98 |
| 13 | 1269 | 321 | 226 | 123 |
| 14 | 1268 | 321 | 232 | 121 |
| 15 | 1267 | 329 | 221 | 121 |
| 16 | 1266 | 335 | 229 | 138 |
| 17 | 1265 | 345 | 244 | 145 |
| 18 | 1264 | 369 | 273 | 144 |
| 19 | 1263 | 369 | 269 | 139 |
| 20 | 1262 | 381 | 282 | 156 |
| Total | 20 312 | 5465 | 3911 | 2004 |
CPU time for SARS-CoV-2 prediction.
| TrainSet | GCCL | W | CHI | GBML | SLAVE | Prediction | Total |
|---|---|---|---|---|---|---|---|
| (min) | (s) | (s) | (min) | (min) | (min) | (min) | |
| Train1 | 1.73 | 0.07 | 0.03 | 4.25 | 4.57 | 9.55 | 20.1 |
| Train2 | 1.78 | 0.06 | 0.03 | 4.33 | 4.61 | 9.08 | 19.8 |
| Train3 | 1.44 | 0.06 | 0.04 | 4.31 | 4.34 | 8.92 | 19.33 |
| Train4 | 1.80 | 0.06 | 0.03 | 4.28 | 4.54 | 9.07 | 19.69 |
| Train5 | 1.86 | 0.06 | 0.03 | 4.35 | 4.46 | 9.01 | 19.68 |
| Train6 | 1.84 | 0.06 | 0.03 | 4.27 | 4.49 | 9.07 | 19.67 |
| CPU time of whole framework | 118.27 | ||||||
Comparison with BepiPred results.
| BepiPred | 4V | 5V | 6V | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Epitope | Len | Ant | Det | Len | Ant | Det | Len | Ant | Det | Len | Ant |
| VNLTTRTQLPPAYTNSFTR | 19 | ✓ | 19 | ✓ | 19 | ✗ | – | – | |||
| ASTEKS | 6 | 0.6206 | ✓ | 7 | ✓ | 7 | ✓ | 7 | |||
| PFLGVYYHKNNKSWMESE | 18 | ✓ | 18 | ✓ | 18 | ✓ | 18 | ||||
| KHTPINLVRDLPQGFSA | 17 | ✓ | 19 | 0.5535 | ✗ | – | – | ✗ | – | – | |
| TPGDSSSGWTA | 11 | 0.2473 | ✓ | 12 | 0.0746 | ✓ | 17 | ✓ | 17 | ||
| IYQTSNFRVQP | 11 | ✓ | 12 | 0.9986 | ✓ | 12 | 0.9986 | ✓ | 16 | 0.8559 | |
| DEVRQIAPGQTGKIAD | 16 | 1.0388 | ✓ | 16 | 1.0388 | ✓ | 16 | 1.0388 | ✓ | 19 | |
| NNLDSKVGGNYN | 12 | ✓ | 15 | 0.7275 | ✓ | 15 | 0.7275 | ✓ | 15 | 0.7275 | |
| GFNCYFPLQSYGF | 13 | 0.8519 | ✓ | 18 | ✓ | 18 | ✓ | 18 | |||
| SNKKFLPF | 8 | ✓ | 8 | ✓ | 8 | ✓ | 9 | 1.1432 | |||
| NCTEV | 5 | NA | ✓ | 5 | NA | ✓ | 5 | NA | ✗ | – | – |
| HADQLTPT | 8 | 0.4177 | ✓ | 8 | 0.4177 | ✓ | 8 | 0.4177 | ✓ | 16 | |
| RVYSTGSNVFQ | 11 | −0.1000 | ✓ | 13 | ✓ | 13 | ✓ | 14 | 0.1826 | ||
| AYTMSLGAENSVAYSNN | 17 | ✓ | 17 | ✓ | 17 | ✓ | 17 | ||||
| KQIYKTPPIKDFGGF | 15 | ✓ | 15 | ✓ | 15 | ✓ | 15 | ||||
| LPDPSKPSKR | 10 | ✓ | 10 | ✓ | 10 | ✓ | 10 | ||||
| DPPEAEVQI | 9 | ✓ | 10 | 0.4955 | ✓ | 10 | 0.4955 | ✓ | 11 | −0.0004 | |
| GQSKRVDFC | 9 | ✓ | 11 | 1.4088 | ✓ | 12 | 1.3607 | ✓ | 12 | 1.3607 | |
| FYEPQIITTD | 10 | 0.4179 | ✓ | 10 | 0.4179 | ✓ | 16 | ✓ | 19 | 0.2751 | |
| VNNTVYDPLQPELDSF | 16 | ✓ | 16 | ✓ | 16 | ✓ | 19 | 0.1493 | |||
| LGKYEQYIKGSGR | 13 | ✓ | 13 | ✓ | 13 | ✓ | 13 | ||||
| Average | 0.5925 | 0.5900 | 0.6253 | 0.5553 | |||||||
Comparison results with literature.
| Predicted Epitopes in | 4V | 5V | 6V | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Epitope | Len | Ant | Det | Len | Ant | Det | Len | Ant | Det | Len | Ant |
| FHAIHVSGTNG | 11 | ✓ | 18 | 0.6317 | ✓ | 18 | 0.6317 | ✓ | 18 | 0.6317 | |
| TLDSKTQSLLIVNNATNV | 18 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| PGDSSSGWTAGA | 12 | 0.0820 | ✓ | 17 | 0.2386 | ✓ | 17 | 0.2386 | ✓ | 18 | |
| NENGTITDA | 9 | 0.5257 | ✓ | 9 | 0.5257 | ✓ | 10 | 0.6020 | ✓ | 11 | |
| IYQTSNFRV | 9 | 0.3109 | ✓ | 11 | 0.2839 | ✓ | 11 | 0.2839 | ✓ | 16 | |
| IAWNSNNLDSK | 11 | ✓ | 11 | ✓ | 12 | 0.9178 | ✓ | 13 | 0.7773 | ||
| STEIYQAGSTPCNGV | 15 | −0.0513 | ✓ | 16 | ✓ | 18 | −0.0751 | ✓ | 19 | −0.0745 | |
| RVYSTGSNVFQTRA | 14 | 0.3248 | ✓ | 14 | 0.3248 | ✓ | 14 | 0.3248 | ✓ | 18 | |
| GAEHVNNSYE | 10 | ✓ | 10 | ✓ | 10 | ✓ | 10 | ||||
| YICGDSTECSNLLLQ | 15 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| GSFCTQLNRALTG | 13 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| AVEQDKNTQE | 10 | 0.2792 | ✓ | 12 | ✓ | 12 | ✗ | – | – | ||
| DEMIAQYTSALLAG | 14 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| LQSLQTYVT | 9 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| RASANLAATKMSECVLGQ | 18 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| TDNTFVSGNCD | 11 | 0.0820 | ✓ | 14 | ✓ | 14 | ✓ | 14 | |||
| KNHTSPDV | 8 | ✓ | 8 | ✓ | 8 | ✓ | 8 | ||||
| GINASVVNIQ | 10 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| EVAKNLNESL | 10 | −0.0432 | ✓ | 10 | −0.0432 | ✓ | 14 | ✗ | – | – | |
| Average | 0.4514 | 0.4762 | 0.4608 | 0.6009 | |||||||
| Predicted Epitopes in | 4V | 5V | 6V | ||||||||
| EVRQIAPGQTGKIADY | 16 | ✓ | 16 | ✓ | 17 | 1.0936 | ✓ | 19 | 1.1515 | ||
| TVEKGIYQTSNFRVQP | 16 | ✓ | 16 | ✓ | 16 | ✗ | – | – | |||
| HRSYLTPGDSSSGWTA | 16 | ✓ | 16 | ✓ | 17 | 0.4892 | ✓ | 17 | 0.4892 | ||
| YVGYLQPRTFLLKYNE | 16 | ✓ | 18 | 0.4816 | ✓ | 18 | 0.4816 | ✓ | 18 | 0.4816 | |
| CGPKKSTNLVKNKCVN | 16 | 0.2006 | ✓ | 20 | ✓ | 20 | ✗ | – | – | ||
| TKTSVDCTMYICGDST | 16 | 0.0937 | ✓ | 18 | ✓ | 18 | ✗ | – | – | ||
| TEIYQAGSTPCNGVEG | 16 | −0.0105 | ✓ | 16 | −0.0105 | ✓ | 16 | −0.0105 | ✓ | 18 | |
| FERDISTEIYQAGSTP | 16 | −0.2904 | ✓ | 17 | −0.1383 | ✓ | 17 | −0.1383 | ✓ | 19 | |
| FAMQMAYRFNGIGVTQ | 16 | 1.3096 | ✓ | 18 | ✗ | – | – | ✗ | – | – | |
| IGKIQDSLSSTASALG | 16 | ✓ | 19 | 0.5712 | ✓ | 19 | 0.5712 | ✓ | 20 | 0.4992 | |
| LQSYGFQPTNGVGYQP | 16 | ✓ | 17 | 0.4203 | ✓ | 17 | 0.4203 | ✗ | – | – | |
| SWMESEFRVYSSANNC | 16 | ✓ | 16 | ✓ | 16 | ✓ | 16 | ||||
| TRFQTLLALHRSYLTP | 16 | 0.5115 | ✓ | 18 | ✗ | – | – | ✗ | – | – | |
| PQIITTDNTFVSGNCD | 16 | ✓ | 16 | ✓ | 16 | ✓ | 16 | ||||
| QKEIDRLNEVAKNLNE | 16 | 0.0684 | ✓ | 18 | ✓ | 18 | ✗ | – | – | ||
| KQIYKTPPIKDFGGFN | 16 | ✓ | 16 | ✓ | 16 | ✓ | 16 | ||||
| SKRVDFCGK | 9 | ✓ | 9 | ✓ | 12 | 1.3607 | ✓ | 12 | 1.3607 | ||
| GKYEQY | 6 | ✓ | 6 | ✓ | 6 | ✓ | 6 | ||||
| LDSKVGGNYNYLY | 13 | ✓ | 14 | 0.8329 | ✓ | 14 | 0.8329 | ✓ | 14 | 0.8329 | |
| TPGDSSSGWTAGA | 13 | 0.1212 | ✓ | 18 | ✓ | 18 | ✓ | 18 | |||
| FLPFQ | 5 | NA | ✓ | 8 | ✓ | 8 | ✓ | 9 | 1.1432 | ||
| TSNFRVQPTE | 10 | ✓ | 11 | 1.2323 | ✓ | 11 | 1.2323 | ✓ | 11 | 1.2323 | |
| TNLCPF | 6 | ✓ | 8 | 0.8906 | ✓ | 13 | 1.04 | ✗ | – | – | |
| DPSKPSKRSF | 10 | ✓ | 10 | ✓ | 10 | ✓ | 11 | 0.6286 | |||
| EVFNATRFASVYAWNRKRI | 19 | ✓ | 19 | ✗ | – | – | ✗ | – | – | ||
| AEVQIDR | 7 | −0.4355 | ✓ | 8 | −0.2814 | ✓ | 11 | ✓ | 11 | ||
| PTNGVG | 6 | −1.1441 | ✓ | 7 | −0.7278 | ✓ | 8 | ✓ | 8 | ||
| QLTPTWRVYSTGSNVFQTRA | 20 | ✓ | 20 | ✓ | 20 | ✗ | – | – | |||
| TMSLGAENSVAYSNNS | 16 | ✓ | 16 | ✓ | 16 | ✓ | 16 | ||||
| GFNCYFPLQSY | 11 | ✓ | 18 | 0.8567 | ✓ | 18 | 0.8567 | ✓ | 18 | 0.8567 | |
| EPQIITTDNT | 10 | ✓ | 13 | 0.6684 | ✓ | 16 | 0.5227 | ✓ | 17 | 0.3342 | |
| NSYECDIPIG | 10 | 0.6533 | ✓ | 11 | 0.8366 | ✓ | 11 | 0.8366 | ✓ | 14 | |
| IYKTPPIKDFGGFNF | 15 | ✓ | 15 | ✓ | 15 | ✓ | 15 | ||||
| Average | 0.5106 | 0.5763 | 0.5362 | ||||||||
| Predicted Epitopes in | 4V | 5V | 6V | ||||||||
| LTPGDSSSGWTAG | 13 | ✓ | 18 | 0.3768 | ✓ | 18 | 0.3768 | ✓ | 18 | 0.3768 | |
| VRQIAPGQTGKIAD | 14 | 1.2606 | ✓ | 15 | ✓ | 16 | 1.0388 | ✓ | 19 | 1.1515 | |
| YQAGSTPCNGV | 11 | 0.0881 | ✓ | 13 | 0.1909 | ✓ | 15 | ✓ | 15 | ||
| ILPDPSKPSKRS | 12 | ✓ | 12 | ✓ | 12 | ✓ | 12 | ||||
| Average | 0.594 | 0.5489 | 0.5771 | ||||||||
| Predicted Epitopes in | 4V | 5V | 6V | ||||||||
| DVVNQNAQALNTLVKQL | 17 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| EAEVQIDRLITGRLQSL | 17 | −0.1784 | ✓ | 20 | ✗ | – | – | ✗ | – | – | |
| GAGICASY | 8 | 0.5210 | ✓ | 13 | ✓ | 13 | ✓ | 17 | 0.4587 | ||
| GSFCTQLN | 8 | 0.8144 | ✓ | 9 | ✓ | 9 | ✗ | – | – | ||
| KGIYQTSN | 8 | 0.2441 | ✓ | 8 | 0.2441 | ✓ | 9 | ✓ | 10 | 0.3992 | |
| AMQMAYRF | 8 | 0.9776 | ✓ | 9 | 1.0278 | ✓ | 11 | ✓ | 11 | ||
| KNHTSPDVDLGDISGIN | 17 | ✓ | 18 | 1.0631 | ✓ | 19 | 0.8800 | ✓ | 19 | 0.8800 | |
| AATKMSECVLGQSKRVD | 17 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| PFAMQMAYRFNGIGVTQ | 17 | 1.3306 | ✓ | 18 | ✗ | – | – | ✗ | – | – | |
| QALNTLVKQLSSNFGAI | 17 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| QLIRAAEIRASANLAAT | 17 | ✓ | 19 | 0.3381 | ✗ | – | – | ✗ | – | – | |
| QQFGRD | 6 | ✓ | 6 | ✓ | 6 | ✓ | 6 | ||||
| RASANLAATKMSECVLG | 17 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| RLITGRLQSLQTYVTQQ | 17 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| SLQTYVTQQLIRAAEIR | 17 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| Average | 0.5158 | 0.5629 | 0.4958 | ||||||||
| Predicted Epitopes in | 4V | 5V | 6V | ||||||||
| DPFLGVYYHKNNKSWME | 17 | ✓ | 17 | ✓ | 17 | ✓ | 17 | ||||
| MDLEGKQGNFKNL | 13 | ✓ | 13 | ✓ | 13 | ✗ | – | – | |||
| KHTPINLVRDLPQGFS | 16 | ✓ | 17 | 0.5695 | ✓ | 17 | 0.5695 | ✗ | – | – | |
| TPGDSSSGWTA | 11 | 0.2473 | ✓ | 12 | 0.0746 | ✓ | 17 | ✓ | 17 | ||
| KSFTVEKGIYQTSNFRVQP | 19 | ✓ | 19 | ✓ | 19 | ✗ | – | – | |||
| SNKKFLPF | 8 | ✓ | 8 | ✓ | 8 | ✓ | 9 | 1.1432 | |||
| TNTSN | 5 | NA | ✓ | 5 | ✓ | 5 | ✓ | 5 | |||
| NCTEVPVAIHADQLTPT | 17 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| RVYSTGSNVFQ | 11 | −0.1000 | ✓ | 13 | ✓ | 13 | ✗ | – | – | ||
| VNNSYECDIPI | 11 | 0.6124 | ✓ | 16 | ✓ | 16 | ✗ | – | – | ||
| YTMSLGAENSVAYSNN | 16 | ✓ | 16 | ✓ | 16 | ✓ | 16 | ||||
| EQDKNTQ | 7 | ✓ | 7 | ✓ | 7 | ✓ | 7 | ||||
| KQIYKTPPIKDFGGF | 15 | ✓ | 15 | ✓ | 15 | ✓ | 15 | ||||
| PDPSKPSK | 8 | ✓ | 8 | ✓ | 8 | ✓ | 8 | ||||
| LADAGFIKQYGDCLG | 15 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| EAEVQ | 5 | NA | ✓ | 5 | NA | ✓ | 5 | NA | ✓ | 11 | −0.0004 |
| GQSKRVDFC | 9 | ✓ | 11 | 1.4088 | ✓ | 12 | 1.3607 | ✓ | 12 | 1.3607 | |
| RNFYEPQIITTD | 12 | 0.3529 | ✓ | 15 | 0.6381 | ✓ | 16 | ✓ | 20 | 0.2624 | |
| Average | 0.5228 | 0.5833 | 0.4255 | ||||||||
| Predicted Epitopes in | 4V | 5V | 6V | ||||||||
| RGVYYPDK | 8 | ✓ | 8 | ✓ | 8 | ✓ | 11 | 0.5200 | |||
| RSSVLHST | 8 | ✓ | 10 | 0.5404 | ✓ | 10 | 0.5404 | ✗ | – | – | |
| DLFLPFFS | 8 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| FHAIHV | 6 | ✓ | 18 | 0.6317 | ✓ | 18 | 0.6317 | ✓ | 18 | 0.6317 | |
| NPVLPFN | 7 | ✓ | 9 | 0.0146 | ✓ | 9 | 0.0146 | ✓ | 9 | 0.0146 | |
| QSLLIVN | 7 | ✓ | 15 | 0.5156 | ✓ | 15 | 0.5156 | ✗ | – | – | |
| NVVIKVCEFQ | 10 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| CNDPFLGVYYH | 11 | 0.4109 | ✓ | 17 | ✓ | 17 | ✓ | 17 | |||
| FEYVSQP | 7 | ✓ | 11 | 0.1016 | ✓ | 11 | 0.1016 | ✗ | – | – | |
| INLVRDL | 7 | −0.3198 | ✓ | 14 | 0.4022 | ✓ | 14 | 0.4022 | ✓ | 17 | |
| LEPLVDLP | 8 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| QTLLALHRSY | 10 | 0.5596 | ✓ | 17 | ✗ | – | – | ✗ | – | – | |
| AAYYVGYL | 8 | 0.5218 | ✓ | 12 | ✗ | – | – | ✗ | – | – | |
| PRTFLLK | 7 | −1.3917 | ✓ | 10 | −0.2800 | ✓ | 10 | −0.2800 | ✓ | 12 | |
| AVDCALDP | 8 | ✓ | 16 | 0.5804 | ✓ | 16 | 0.5804 | ✓ | 16 | 0.5804 | |
| TNLCPFG | 7 | ✓ | 8 | 0.8906 | ✓ | 13 | 1.0400 | ✗ | – | – | |
| SNCVADYSVLYNS | 13 | −0.1828 | ✓ | 13 | ✗ | – | – | ✗ | – | – | |
| TFKCYGVSPT | 10 | ✓ | 20 | 0.8913 | ✓ | 20 | 0.8913 | ✗ | – | – | |
| TGCVIA | 6 | ✓ | 10 | 0.0996 | ✓ | 13 | −0.1592 | ✓ | 14 | −0.1234 | |
| CYFPLQSY | 8 | ✓ | 8 | ✓ | 12 | 0.8719 | ✓ | 12 | 0.8719 | ||
| FGGVSVIT | 8 | ✓ | 12 | 0.4578 | ✓ | 13 | 0.4931 | ✓ | 13 | 0.4931 | |
| CTEVPVAIHAD | 11 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| AGCLIGA | 7 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| GAGICASY | 8 | 0.5210 | ✓ | 13 | ✓ | 13 | ✓ | 17 | 0.4587 | ||
| VASQSII | 7 | −0.0188 | ✓ | 16 | 0.3257 | ✓ | 16 | 0.3257 | ✓ | 18 | |
| TTEILPVS | 8 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| SVDCTMY | 7 | ✓ | 17 | −0.0258 | ✓ | 18 | 0.1426 | ✗ | – | – | |
| SNLLLQYGSFCTQL | 14 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| VFAQVKQI | 8 | ✓ | 14 | 0.3493 | ✓ | 15 | 0.4451 | ✗ | – | – | |
| SQILPD | 6 | −0.1542 | ✓ | 8 | 0.0383 | ✓ | 8 | 0.0383 | ✓ | 11 | |
| YGDCLGD | 7 | −0.5555 | ✓ | 12 | ✓ | 14 | 0.0416 | ✗ | – | – | |
| RDLICAQ | 7 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| LTVLPPL | 7 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| YTSALLAG | 8 | ✓ | 20 | 0.3640 | ✗ | – | – | ✗ | – | – | |
| LNTLVKQL | 8 | −0.7591 | ✓ | 16 | ✓ | 16 | ✗ | – | – | ||
| ISSVLND | 7 | 0.0414 | ✓ | 11 | ✓ | 12 | 0.6035 | ✗ | – | – | |
| SLQTYVTQQ | 9 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| SECVLGQS | 8 | −0.0110 | ✓ | 13 | ✗ | – | – | ✗ | – | – | |
| PHGVVFLHVTYVPA | 14 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| PAICHDG | 7 | −1.0100 | ✓ | 15 | ✓ | 15 | ✗ | – | – | ||
| SGNCDVVIGI | 10 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| ASVVNI | 6 | ✓ | 13 | 0.1922 | ✗ | – | – | ✗ | – | – | |
| Average | 0.3938 | 0.4011 | 0.4005 | ||||||||
| Predicted Epitopes in | 4V | 5V | 6V | ||||||||
| VRQIAPGQTGKIAD | 14 | 1.2606 | ✓ | 15 | ✓ | 16 | 1.0388 | ✓ | 19 | 1.1515 | |
| VLGQSKRVDFCGKG | 14 | ✗ | – | – | ✗ | – | – | ✗ | – | – | |
| GLTGTGVLTESNKK | 14 | ✓ | 14 | ✓ | 14 | ✓ | 16 | 0.6686 | |||
| KIADYNYKLPDDFT | 14 | ✓ | 14 | ✓ | 14 | ✓ | 14 | ||||
| Average | 1.1094 | 1.006 | 0.9256 | ||||||||
| Predicted Epitopes in | 4V | 5V | 6V | ||||||||
| DPFLGVYYHKNNKSWME | 17 | ✓ | 17 | ✓ | 17 | ✓ | 17 | ||||
| MDLEGKQGNFKNL | 13 | ✓ | 13 | ✓ | 13 | ✗ | – | – | |||
| KHTPINLVRDLPQGFS | 16 | ✓ | 17 | 0.5695 | ✓ | 17 | 0.5695 | ✗ | – | – | |
| TPGDSSSGWTA | 11 | 0.2473 | ✓ | 12 | 0.0746 | ✓ | 17 | ✓ | 17 | ||
| KSFTVEKGIYQTSNFRVQP | 19 | ✓ | 19 | ✓ | 19 | ✗ | – | – | |||
| VNNSYECDIPI | 11 | 0.6124 | ✓ | 16 | ✓ | 16 | ✗ | – | – | ||
| YTMSLGAENSVAYSNN | 16 | ✓ | 16 | ✓ | 16 | ✓ | 16 | ||||
| Average | 0.6111 | 0.6591 | 0.5716 | ||||||||
Fig. 9Confusion matrices of the proposed ensemble fuzzy model for different test sets.