| Literature DB >> 17381846 |
Brett Trost1, Mik Bickis, Anthony Kusalik.
Abstract
BACKGROUND: Peptides derived from endogenous antigens can bind to MHC class I molecules. Those which bind with high affinity can invoke a CD8+ immune response, resulting in the destruction of infected cells. Much work in immunoinformatics has involved the algorithmic prediction of peptide binding affinity to various MHC-I alleles. A number of tools for MHC-I binding prediction have been developed, many of which are available on the web.Entities:
Year: 2007 PMID: 17381846 PMCID: PMC1847428 DOI: 10.1186/1745-7580-3-5
Source DB: PubMed Journal: Immunome Res ISSN: 1745-7580
Performances of the individual prediction tools on the HLA-A*0201 peptides from the community binding resource.
| Tool | |||||||
| Rank1 | Sensitivity | Threshold2 | Rank | Sensitivity | Threshold | ||
| ARB Matrix | 0.935 | 4 | 0.188 | 2.190 | 2 | 0.601 | 42.950 |
| NetMHC 2.0 ANN | 0.932 | 1 | 0.286 | 153.000 | 1 | 0.611 | 920.000 |
| SMM | 0.922 | 2 | 0.201 | 38.092 | 4 | 0.543 | 454.865 |
| Bimas | 0.920 | 3 | 0.198 | 324.068 | 3 | 0.552 | 47.991 |
| SYFPEITHI | 0.885 | 8 | 0.170 | 27.000 | 5 | 0.421 | 24.000 |
| Multipred ANN | 0.884 | 9 | 0.140 | 5.820 | 7 | 0.373 | 5.560 |
| NetMHC 2.0 Matrix | 0.872 | 6 | 0.177 | 24.329 | 10 | 0.358 | 20.129 |
| SVMHC MHCPEP | 0.870 | 12 | 0.115 | 1.000 | 11 | 0.334 | 0.520 |
| Logistic Regression | 0.862 | 13 | 0.101 | 0.364 | 9 | 0.364 | 0.108 |
| SVMHC SYFPEITHI | 0.854 | 7 | 0.176 | 0.950 | 8 | 0.367 | 0.490 |
| HLA Ligand | 0.825 | 10 | 0.137 | 141.000 | 14 | 0.274 | 127.000 |
| Rankpep | 0.822 | 15 | 0.077 | 103.000 | 13 | 0.306 | 83.000 |
| MHCPred (Interactions) | 0.818 | 5 | 0.182 | 43.350 | 6 | 0.377 | 99.080 |
| MHCPred (position only) | 0.814 | 11 | 0.116 | 21.330 | 12 | 0.311 | 65.310 |
| Multipred HMM | 0.798 | 14 | 0.090 | 7.530 | 15 | 0.244 | 7.090 |
| Predep | 0.788 | 16 | 0.045 | -6.000 | 16 | 0.217 | -5.130 |
The predictive performance of each tool for the HLA-A*0201 community binding data is shown using two measures: Ascore, and sensitivity when specificity is 0.99 and 0.95. 1Indicates how the sensitivity of each tool compares to that of the other tools at the indicated specificity; the tool with rank 1 has the highest sensitivity. 2The scoring threshold corresponding to the indicated specificity.
Individual tool Avalues and sensitivity data for HLA-B*3501 using binders and nonbinders from the community binding resource.
| Tool | |||||||
| Rank | Sensitivity | Threshold | Rank | Sensitivity | Threshold | ||
| ARB Matrix | 0.849 | 1 | 0.242 | 12.890 | 1 | 0.422 | 296.090 |
| Bimas | 0.808 | 7 | 0.047 | 60.000 | 8 | 0.166 | 24.000 |
| NetMHC 2.0 Matrix | 0.789 | 2 | 0.137 | 32.386 | 2 | 0.336 | 28.404 |
| HLA Ligand | 0.786 | 8 | 0.043 | 162.000 | 4 | 0.237 | 145.000 |
| Rankpep | 0.769 | 5 | 0.071 | 124.000 | 3 | 0.256 | 107.000 |
| Logistic Regression | 0.764 | 11 | 0.024 | 0.655 | 11 | 0.123 | 0.259 |
| SVMHC SYFPEITHI | 0.742 | 3 | 0.118 | 0.680 | 6 | 0.204 | 0.420 |
| SVMHC MHCPEP | 0.733 | 8 | 0.043 | 1.560 | 6 | 0.204 | 1.140 |
| MHCPred (position only) | 0.692 | 6 | 0.057 | 140.600 | 9 | 0.156 | 210.860 |
| MHCPred (interactions) | 0.683 | 4 | 0.090 | 52.240 | 5 | 0.209 | 179.470 |
| Predep | 0.587 | 10 | 0.038 | -6.020 | 10 | 0.128 | -5.090 |
For details, see the caption for Table 1.
Individual tool Avalues and sensitivity data for H-2Kd using binders and nonbinders from the community binding resource.
| Tool | |||||||
| Rank | Sensitivity | Threshold | Rank | Sensitivity | Threshold | ||
| SYFPEITHI | 0.918 | 3 | 0.300 | 27.000 | 4 | 0.483 | 25.000 |
| ARB Matrix | 0.899 | 1 | 0.500 | 15.470 | 1 | 0.583 | 40.800 |
| Bimas | 0.888 | 4 | 0.183 | 4800.000 | 3 | 0.533 | 2880.000 |
| Rankpep | 0.886 | 2 | 0.317 | 107.000 | 2 | 0.567 | 98.000 |
For details, see the caption for Table 1.
Individual tool Avalues and sensitivity data for HLA-A*0201 using binders and nonbinders gathered from the literature
| Tool | |||||||
| Rank | Sensitivity | Threshold | Rank | Sensitivity | Threshold | ||
| Multipred ANN | 0.772 | 11 | 0.083 | 5.830 | 2 | 0.278 | 5.600 |
| NetMHC 2.0 ANN | 0.772 | 6 | 0.139 | 99.000 | 7 | 0.231 | 300.000 |
| SYFPEITHI | 0.762 | 1 | 0.194 | 27.000 | 2 | 0.278 | 25.000 |
| SVMHC MHCPEP | 0.745 | 1 | 0.194 | 0.910 | 4 | 0.269 | 0.740 |
| Logistic Regression | 0.743 | 10 | 0.093 | 0.424 | 12 | 0.167 | 0.281 |
| ARB Matrix | 0.742 | 8 | 0.102 | 1.860 | 14 | 0.139 | 3.550 |
| Bimas | 0.722 | 12 | 0.074 | 437.482 | 9 | 0.213 | 159.970 |
| NetMHC 2.0 Matrix | 0.719 | 14 | 0.046 | 27.193 | 10 | 0.194 | 23.585 |
| SMM | 0.716 | 4 | 0.157 | 100.484 | 6 | 0.250 | 262.434 |
| Rankpep | 0.708 | 3 | 0.176 | 89.000 | 5 | 0.259 | 83.000 |
| MHCPred (interactions) | 0.707 | 6 | 0.139 | 61.660 | 1 | 0.287 | 100.690 |
| SVMHC SYFPEITHI | 0.706 | 5 | 0.148 | 0.970 | 11 | 0.185 | 0.820 |
| HLA Ligand | 0.705 | 12 | 0.074 | 147.000 | 12 | 0.167 | 138.000 |
| MHCPred (position only) | 0.700 | 8 | 0.102 | 28.250 | 7 | 0.231 | 67.300 |
| Multipred HMM | 0.695 | 16 | 0.009 | 7.570 | 15 | 0.102 | 7.290 |
| Predep | 0.627 | 15 | 0.019 | -6.450 | 16 | 0.093 | -5.330 |
For details, see the caption for Table 1. The peptides in this literature-derived dataset are available in Additional File 1.
Performance of the heuristic-based method on all four datasets
| Specificity | HLA-A*0201 (comm) | HLA-B*3501 (comm) | H-2Kd (comm) | HLA-A*0201 (lit) |
| 0.99 | 0.404 | 0.313 | 0.467 | 0.271 |
| 0.95 | 0.618 | 0.393 | 0.617 | 0.475 |
The sensitivity of the HBM is shown at 0.99 specificity and 0.95 specificity for all four of the datasets used in this study. All values were obtained using a value of 0.99 for the individual tool specificity parameter. The abbreviation "comm" refers to peptides derived from the community binding database, while "lit" refers to peptides gathered from the literature.
Performance of linear discriminant analysis on all four datasets
| Specificity | HLA-A*0201 (comm) | HLA-B*3501 (comm) | H-2Kd (comm) | HLA-A*0201 (lit) |
| 0.99 | 0.324 | 0.213 | 0.417 | 0.102 |
| 0.95 | 0.718 | 0.436 | 0.633 | 0.333 |
| 0.956 | 0.885 | 0.935 | 0.828 | |
The sensitivity of the combined tool is shown at 0.99 specificity and 0.95 specificity for all four of the datasets used in this study. The abbreviation "comm" refers to peptides derived from the community binding database, while "lit" refers to peptides gathered from the literature.
Figure 1Q-Q plot showing distribution of LDA scores for the HLA-A*0201 community data set. The horizontal axis has been scaled according to normal probabilities, so that points from a normally distributed variable would fall along a straight line (shown in blue). Scores lying above a threshold indicated by a horizontal line would be classified as epitopes. A level exceeding 99% of a normal distribution defines a nominal specificity of 0.99, whereas an actual specificity of 0.99 requires a threshold meeting the actual distribution of points at the 0.99 vertical line. The realized sensitivity of 0.32 for a specificity of 0.99 is indicated as the proportion of epitopes whose scores lie above the threshold of 0.95.