| Literature DB >> 19300484 |
Aidan MacNamara1, Ulrich Kadolsky, Charles R M Bangham, Becca Asquith.
Abstract
Theoretical methods for predicting CD8+ T-cell epitopes are an important tool in vaccine design and for enhancing our understanding of the cellular immune system. The most popular methods currently available produce binding affinity predictions across a range of MHC molecules. In comparing results between these MHC molecules, it is common practice to apply a normalization procedure known as rescaling, to correct for possible discrepancies between the allelic predictors. Using two of the most popular prediction software packages, NetCTL and NetMHC, we tested the hypothesis that rescaling removes genuine biological variation from the predicted affinities when comparing predictions across a number of MHC molecules. We found that removing the condition of rescaling improved the prediction software's performance both qualitatively, in terms of ranking epitopes, and quantitatively, in the accuracy of their binding affinity predictions. We suggest that there is biologically significant variation among class 1 MHC molecules and find that retention of this variation leads to significantly more accurate epitope prediction.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19300484 PMCID: PMC2650421 DOI: 10.1371/journal.pcbi.1000327
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1ROC curve analysis on the effects of rescaling.
Each graph shows the ROC curves using different combinations of datasets and prediction methods (see table 1). Figure 1A uses NetCTL with the SYF1 dataset, figure 1B NetMHC with the SYF1 dataset, figure 1C NetMHC with the Lanl661 dataset and figure 1D NetCTL with the Lanl179 dataset. The x-axis has been scaled to show the region of importance (the AUC with high specificity values). The rescaled results (red dashed line) are compared against non-rescaled (black solid line). Table 1 gives the statistics for each graph.
The summary statistics and details of each ROC curve from figure 1.
| ROC Curve | Colour | Method | Dataset | Rescaling | AUC | Bootstrap P-Value |
|
| Black solid | NetCTL v1.2 | SYF1 | No | 0.949 | <0.001 |
| Red dashed | NetCTL v1.2 | SYF1 | Yes | 0.937 | ||
|
| Black solid | NetMHC v3.0 | SYF1 | No | 0.932 | <0.001 |
| Red dashed | NetMHC v3.0 | SYF1 | Yes | 0.905 | ||
|
| Black solid | NetMHC v3.0 | Lanl661 | No | 0.944 | <0.001 |
| Red dashed | NetMHC v3.0 | Lanl661 | Yes | 0.937 | ||
|
| Black solid | NetCTL v1.2 | Lanl179 | No | 0.933 | <0.001 |
| Red dashed | NetCTL v1.2 | Lanl179 | Yes | 0.918 |
In NetCTL v1.2, the TAP and cleavage scores are combined with the rescaled MHC binding score to produce a combined score for each submitted nonamer. In order to test how NetCTL performed without rescaling, it was still necessary to divide the MHC binding score by a rescaling value so the weightings of the TAP and cleavage score were still applicable and accurate. By averaging over all rescaling values and dividing the MHC binding value by this number, rescaling differences were “averaged out” and it was still possible to use the extra information from the TAP and cleavage predictions.
Figure 2The relationship between AUC and rescale value.
There is no evidence for a correlation of AUC and rescale value for the whole set of allele predictors (R2 = 0.0068, p = 0.606), nor for the subset of predictors with an AUC>0.9 (R2 = 0.0007, p = 0.887). This analysis used the Lanl661 epitope dataset.