| Literature DB >> 34012524 |
Abstract
BACKGROUND: Epitope prediction remains a major challenge in malaria due to the unique parasite biology, in addition to rapidly evolving parasite sequence variation in Plasmodium species. Although several models for epitope prediction exist, they are not useful in Plasmodium specific epitope development. Hence, it was proposed to use machine learning based methods to develop a peptide sequence based epitope predictor specific for malaria.Entities:
Keywords: Benchmarking; Epitopes; Machine learning; Malaria; Plasmodium
Year: 2021 PMID: 34012524 PMCID: PMC8112139 DOI: 10.18502/ajmb.v13i2.5527
Source DB: PubMed Journal: Avicenna J Med Biotechnol ISSN: 2008-2835
Figure 1Methodology workflow used in the study.
Dipeptide composition profile generated from dataset 1
| Distribution of allele specific dipeptide composition calculated from average score (Using Pfeature and Peptide 2.0 webserver) | ||||||
|---|---|---|---|---|---|---|
| VL,TN,NV,NL,LV,LS,LP,LL,LG,LF,LD,LA GN,GL,FL,EP,EE,DL,AL | 1.4 | 63% | 13% | None | ||
| AC,AG,AY,CA,FI,GL,IF,LA,LL,YK | 1.6 | 63% | none | None | ||
| YM,YI,YH, VR,VN,SS,SN,RK,RG, NI,NL,NN,LK,KK,KI,KF,IY,IV,IS,II,IA,HF,GN FR,FS,FK,FF,AS,AN | 1.8 | 43% | none | 22% | ||
| AD,AV,DG,DS, FS,GG,GS,GT, LG,LI,MY,VD | 1.8 | 37.5% | 16.6% | None | ||
Feature selection using WEKA CfsSubSetEvaluator Algorithm for selecting attributes and ranking by Correlation Attribute Evaluator and Ranker Search Method
| AV | 0.165 | |
| DV | 0.165 | |
| LG | 0.165 | |
| LI | 0.165 | |
| AL | 0.165 | |
| GL | 0.165 | |
| IL | 0.165 | |
| LV | 0.165 | |
| LL | 0.165 | |
| VS | 0.165 | |
| SF | 0.165 | |
| YK | 0.165 | |
| FL | 0.165 | |
| AC | 0.155 | |
| LK | 0.155 | |
| NF | 0.155 | |
| VR | 0.155 | |
| AN | 0.155 | |
| FF | 0.155 | |
| FK | 0.155 | |
| II | 0.155 | |
| GN | 0.155 | |
| IV | 0.155 | |
| KF | 0.155 | |
| NL | 0.155 | |
| SS | 0.155 |
Confusion matrix generated using WEKA Iterative Classifier Optimizer Algorithm for 26 instances (14 were correctly classified) showing 6×6 confusion matrix to describe six classes assigned a to f (Sum of diagonals indicated the number of correctly classified instances)
| 1 | 2 | a | HLAA2 | ||||
| 1 | 1 | b | HLAA3 | ||||
| 1 | 5 | c | HLAA2/HLAA3 | ||||
| 9 | d | DRB1 | |||||
| 1 | 1 | e | HLAA3/DRB1 | ||||
| 3 | 1 | f | Non-epitopic peptides |
Accuracy of training model by class using Iterative Classifier Optimizer
| Details of training model developed for classification of malaria epitope by class | |||||||
|---|---|---|---|---|---|---|---|
| 0.333 | 0.000 | 1.000 | 0.33 | 0.500 | 0.554 | 0.739 | HLAA2 |
| 0.500 | 0.000 | 1.000 | 0.50 | 0.667 | 0.693 | 0.802 | HLAA3 |
| 0.167 | 0.000 | 1.000 | 0.167 | 0.289 | 0.365 | 0.688 | HLAA2/HLAA3 |
| 1.000 | 0.706 | 0.429 | 1.00 | 0.600 | 0.355 | 0.686 | DRB1 |
| 0.500 | 0.000 | 1.000 | 0.50 | 0.667 | 0.693 | 0.802 | HLAA3/DRB1 |
| 0.250 | 0.000 | 1.000 | 0.25 | 0.400 | 0.469 | 0.710 | Non-epitope |