| Literature DB >> 19091022 |
Hong Huang Lin1, Guang Lan Zhang, Songsak Tongchusak, Ellis L Reinherz, Vladimir Brusic.
Abstract
BACKGROUND: Initiation and regulation of immune responses in humans involves recognition of peptides presented by human leukocyte antigen class II (HLA-II) molecules. These peptides (HLA-II T-cell epitopes) are increasingly important as research targets for the development of vaccines and immunotherapies. HLA-II peptide binding studies involve multiple overlapping peptides spanning individual antigens, as well as complete viral proteomes. Antigen variation in pathogens and tumor antigens, and extensive polymorphism of HLA molecules increase the number of targets for screening studies. Experimental screening methods are expensive and time consuming and reagents are not readily available for many of the HLA class II molecules. Computational prediction methods complement experimental studies, minimize the number of validation experiments, and significantly speed up the epitope mapping process. We collected test data from four independent studies that involved 721 peptide binding assays. Full overlapping studies of four antigens identified binding affinity of 103 peptides to seven common HLA-DR molecules (DRB1*0101, 0301, 0401, 0701, 1101, 1301, and 1501). We used these data to analyze performance of 21 HLA-II binding prediction servers accessible through the WWW.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19091022 PMCID: PMC2638162 DOI: 10.1186/1471-2105-9-S12-S22
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1AROC values of predictions by the 21 servers using the combined test set (103 peptides from the four antigens) based on the three mapping methods: black bars for maximum 9-mer scores, grey bars for average scores of all overlapping 9-mers, and white bars for the average of the top three 9-mer scores. Vertical axis shows the AROC values while horizontal axis shows individual servers, as designated in Table 2. Best performing predictors for each allele are marked by asterisks.
Figure 2AROC values for prediction of promiscuous peptides. Vertical axis shows the AROC values while horizontal axis shows numbers designating individual servers, as shown in Table 2. The first two servers were excluded from the analysis because they predicted peptide binding to a single DR molecule.
Prediction performance of selected representative servers at two scenarios: a) thresholds that correctly predict ~80% of T-cell epitopes; b) thresholds that correctly predict ~50% of T-cell epitopes.
| IEDB_ARB | 100 | 10 | 5 | 56 | 32 | 17 (81%) |
| IEDB_ CON | 11 | 10 | 5 | 70 | 18 | 17 (81%) |
| MHCPRED | 15 | 10 | 5 | 36 | 52 | 17 (81%) |
| MULTI_ANN | 3 | 15 | 0 | 38 | 50 | 17 (81%) |
| MULTI_SVM | 6.4 | 13 | 2 | 59 | 29 | 17 (81%) |
| NETMHCII | 4000 | 12 | 3 | 51 | 37 | 17 (81%) |
| NETMHCIIPAN | 440 | 12 | 3 | 56 | 32 | 17 (81%) |
| PROPRED | -1.5 | 14 | 1 | 54 | 34 | 17 (81%) |
| RANKPEP | 0.85 | 12 | 3 | 41 | 47 | 17 (81%) |
| IEDB_ARB | 3 | 7 | 8 | 80 | 8 | 10 (48%) |
| IEDB_ CON | 4 | 7 | 8 | 86 | 2 | 10 (48%) |
| MHCPRED | 3 | 4 | 11 | 76 | 12 | 10 (48%) |
| MULTI_ANN | 5 | 5 | 10 | 66 | 22 | 11 (52%) |
| MULTI_SVM | 7.1 | 6 | 9 | 82 | 6 | 10 (48%) |
| NETMHCII | 400 | 7 | 8 | 78 | 10 | 10 (48%) |
| NETMHCIIPAN | 60 | 7 | 8 | 87 | 1 | 10 (48%) |
| PROPRED | 0 | 10 | 5 | 81 | 7 | 11 (52%) |
| RANKPEP | 7 | 10 | 5 | 69 | 19 | 11 (52%) |
List of prediction servers of HLA class II binding peptides, their URLs (as of December 2007), and name abbreviations.
| 1 | HLA-DR4Pred (ANN) | DR4_ANN | [ | ANN | [ |
| 2 | HLA-DR4Pred (SVM) | DR4_SVM | [ | SVM | [ |
| 3 | IEDB (ARB) | IEDB_II | [ | Matrix | [ |
| 4 | IEDB (SMM) | IEDB_SMM | [ | Matrix | [ |
| 5 | IEDB (Saturniolo) | IEDB_SAT | [ | Matrix | [ |
| 6 | IEDB (Consensus) | IEDB_CON | [ | Matrix | - |
| 7 | MHC Binder Prediction | MHC_BP | [ | Matrix | - |
| 8 | MHC2Pred | MHC2Pred | [ | SVM | - |
| 9 | MHC-BPS | MHC_BPS | [ | SVM | [ |
| 10 | MHCPred | MHCPRED | [ | Partial least square | [ |
| 11 | Multipred1 (ANN) | MULTI_ANN | [ | ANN | [ |
| 12 | Multipred1 (HMM) | MULTI_HMM | [ | HMM | [ |
| 13 | Multipred1 (SVM) | MULTI_SVM | [ | SVM | [ |
| 14 | NetMHCII | NETMHCII | [ | Matrix | [ |
| 15 | NetMHCIIpan | NETMHCIIPAN | [ | ANN | [ |
| 16 | PeptideCheck (Matrix) | PEPC_M | [ | Matrix | [ |
| 17 | ProPred | PROPRED | [ | Matrix | [ |
| 18 | Rankpep | RANKPEP | [ | Matrix | [ |
| 19 | SVMHC | SVMHC | [ | Matrix | [ |
| 20 | SVRMHC | SVRMHC | [ | SVM | [ |
| 21 | SYFPEITHI | SYFPEITHI | [ | Matrix | [ |
Summary of the four testing protein antigens
| Phospholipase A2 | 30 | 18 |
| LAGE-1 | 17 | 16–19 |
| Lipocalin | 25 | 16 |
| HIV NEF | 31 | 15–16 |