| Literature DB >> 16351712 |
Abstract
BACKGROUND: Prediction of the binding ability of antigen peptides to major histocompatibility complex (MHC) class II molecules is important in vaccine development. The variable length of each binding peptide complicates this prediction. Motivated by a text mining model designed for building a classifier from labeled and unlabeled examples, we have developed an iterative supervised learning model for the prediction of MHC class II binding peptides.Entities:
Year: 2005 PMID: 16351712 PMCID: PMC1325229 DOI: 10.1186/1745-7580-1-6
Source DB: PubMed Journal: Immunome Res ISSN: 1745-7580
Figure 1Top: A peptide has been reduced to a set of nonamers. Bottom: A nonamer is encoded as a 180-dimensional vector.
Description of HLA-DR4 (B1*0401) benchmark datasets.
| Set 1 | 1017 | 694 | 323 | 531 | 248 | 283 |
| Set 2 | 673 | 381 | 292 | 416 | 161 | 255 |
| Set 3a | 590 | 373 | 217 | 355 | 151 | 204 |
| Set 3b | 495 | 279 | 216 | 325 | 128 | 197 |
| Set 4a | 646 | 323 | 323 | 403 | 120 | 283 |
| Set 4b | 584 | 292 | 292 | 375 | 120 | 255 |
| Set 5a | 117 | 70 | 47 | 110 | 65 | 45 |
| Set 5b | 85 | 48 | 37 | 84 | 47 | 37 |
| Southwood | 105 | 22 | 83 | 99 | 19 | 80 |
| Geluk | 22 | 16 | 6 | 21 | 15 | 6 |
Figure 2Prediction accuracy of the various methods on the original benchmark datasets.
Figure 3Prediction accuracy of the various methods on the homology reduced datasets.
The average Aroc values for different methods.
| Method | Average Aroc values for the benchmark datasets | |
| Original | Homology Reduced | |
| LP_append | 0.749 | 0.698 |
| LP_discard | 0.748 | 0.699 |
| LP_top2 | ||
| Gibbs method | 0.744 | 0.673 |
| TEPITOPE (Propred) | 0.702 | 0.667 |
The average Aroc values from 5-fold cross validations.
| Method | HLA-DRB1*0101 | HLA-DRB1*0301 |
| LP_top2 | 0.779 | 0.721 |
| TEPITOPE (Propred) | 0.842 | 0.585 |
Figure 4Top: The alignment of actual binding cores (shadowed) from SYFPEITHI database. Bottom: The alignment of the predicted binding cores by the LP method.