| Literature DB >> 12225620 |
Pierre Dönnes1, Arne Elofsson.
Abstract
BACKGROUND: T-cells are key players in regulating a specific immune response. Activation of cytotoxic T-cells requires recognition of specific peptides bound to Major Histocompatibility Complex (MHC) class I molecules. MHC-peptide complexes are potential tools for diagnosis and treatment of pathogens and cancer, as well as for the development of peptide vaccines. Only one in 100 to 200 potential binders actually binds to a certain MHC molecule, therefore a good prediction method for MHC class I binding peptides can reduce the number of candidate binders that need to be synthesized and tested.Entities:
Mesh:
Substances:
Year: 2002 PMID: 12225620 PMCID: PMC129981 DOI: 10.1186/1471-2105-3-25
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Performance of SVMHC for the MHC type HLA-A*0201, HLA-A3 and HLA-B*2705, measured by the Matthews correlation coefficient, Mc, versus the number of peptides used for training. For all sizes of the training-set the test-set was identical and no part of the test-set was contained in the training-set.
Comparison between SVMHC, SYFPEITHI (SYF) and HLA_BIND (HLA) of the six alleles common between them.
| Dataset | Mc | percentage correct predictions | ||||||
|---|---|---|---|---|---|---|---|---|
| MHC | Length | Size | SVMHC | SYF | HLA | SVMHC | SYF | HLA |
| Overall | - | - | 0.85 | 0.75 | 0.62 | 95% | 91% | 87% |
| HLA-A*0201 | 9 | 113 | 0.78 | 0.77 | 0.77 | 90% | 89% | 89% |
| HLA-A*0201 | 10 | 40 | 0.70 | 0.61 | 0.61 | 87% | 80% | 83% |
| HLA-A1 | 9 | 28 | 0.96 | 0.93 | 0.96 | 98% | 97% | 98% |
| HLA-A3 | 9 | 73 | 0.80 | 0.73 | 0.71 | 91% | 86% | 84% |
| HLA-B*8 | 9 | 25 | 0.79 | 0.79 | 0.82 | 91% | 91% | 92% |
| HLA-B*2705 | 9 | 29 | 1.00 | 0.92 | 0.93 | 100% | 95% | 97% |
The tables shows the MHC-type, the length of the binding peptides, the number of experimentally verified binders, the Matthew correlation coefficient (Mc) and the percentage correct predictions.
Figure 2Specificity/sensitivity plots for SVMHC HLA_BIND and SYFPEITHI. Sensitivity is defined as the number of correctly predicted binders (TP) found at a given cutoff, divided with the total number of binders, i.e. Sens = TP/(TP+FN), where FN is the number of . The specificity is defined as the fraction of the hits above this cutoff that is correct, i.e. Spec = TP/(TP+FP). It can be seen that the sensitivity of SVMHC is higher than of SYFPEITHI and HLA_BIND at any specificity.
Study of recently detected binding peptides from proteins P17944, P17451, P31952 and P17944, binding to HLA-A*0201. The known binding nonamers and the rank of these with the different predictors are shown. Although all three methods detects these peptides, they are found at higher ranks using SVMHC than with the other methods.
| Protein | No. | SVMHC | HLA_BIND | SYFPEITHI |
|---|---|---|---|---|
| P78395 | 2 | 1,6 | 3,23 | 2,8 |
| P17944 | 2 | 1,2 | 1,3 | 4,6 |
| P31952 | 1 | 2 | 4 | 14 |
| P17451 | 3 | 2,3,4 | 1,4,12 | 1,6,10 |
Performance of SVMHC for different HLA alleles, using MHCPEP- or SYFPEITHI-data.
| MHC | Length | Size | Mc | Kernel |
|---|---|---|---|---|
| Predictions using MHCPEP | ||||
| HLA-A1 | 9 | 28 | 0.95 | lin |
| HLA-A*1101 | 9 | 40 | 0.74 | poly |
| HLA-A11 | 9 | 46 | 0.75 | rbf |
| HLA-A11 | 10 | 21 | 0.59 | poly |
| HLA-A2 | 9 | 118 | 0.76 | poly |
| HLA-A2 | 10 | 35 | 0.65 | poly |
| HLA-A*2402 | 9 | 73 | 0.90 | poly |
| HLA-A3 | 9 | 73 | 0.76 | rbf |
| HLA-A*0201 | 9 | 184 | 0.73 | rbf |
| HLA-A*0201 | 10 | 96 | 0.78 | poly |
| HLA-A*3301 | 9 | 32 | 0.72 | lin |
| HLA-A*0301 | 9 | 38 | 0.72 | rbf |
| HLA-A*0301 | 10 | 32 | 0.77 | lin |
| HLA-A31 | 9 | 39 | 0.79 | poly |
| HLA-A*6801 | 9 | 42 | 0.84 | poly |
| HLA-B7 | 9 | 32 | 0.95 | lin |
| HLA-B8 | 9 | 26 | 0.77 | poly |
| HLA-B*2705 | 9 | 41 | 0.93 | lin |
| HLA-B*3501 | 9 | 67 | 0.93 | lin |
| HLA-B*3501 | 10 | 34 | 0.96 | poly |
| HLA-B35 | 9 | 23 | 0.71 | lin |
| HLA-B*2703 | 9 | 22 | 0.90 | lin |
| HLA-B*5301 | 9 | 41 | 0.95 | lin |
| HLA-B27 | 9 | 34 | 0.91 | rbf |
| HLA-B*2706 | 9 | 20 | 0.93 | lin |
| HLA-B51 | 9 | 67 | 0.82 | poly |
| HLA-B*5102 | 9 | 29 | 0.79 | poly |
| HLA-B*0702 | 9 | 52 | 0.96 | poly |
| HLA-B*5103 | 9 | 29 | 0.84 | rbf |
| HLA-B*5401 | 9 | 42 | 0.98 | lin |
| HLA-B*5101 | 9 | 35 | 0.89 | lin |
| Predictions using SYFPEITHI | ||||
| HLA-A*0201 | 9 | 113 | 0.78 | rbf |
| HLA-A*0201 | 10 | 40 | 0.70 | poly |
| HLA-A1 | 9 | 28 | 0.96 | lin |
| HLA-A3 | 9 | 73 | 0.80 | lin |
| HLA-B*8 | 8 | 14 | 0.89 | lin |
| HLA-B*8 | 9 | 25 | 0.79 | lin |
| HLA-B*2705 | 9 | 29 | 1.00 | lin |
| HLA-B7 | 9 | 23 | 0.93 | lin |
The first column explains shows the HLA allele, the second the length of the binding peptides, the third the number of binders included in the training set, the fourth the performance as measured by the Matthews correlation coefficient. The final column shows what type of kernel was used in the Support Vector Machine.
Figure 3The dependency of Matthew correlation coefficient on the reduction level for two HLA alleles (HLA-A*0201 and HLA-A3). The reduction level is measured as the maximum number of allowed identical measures between two peptides in the set.