Modeling the adaptive immune system: predictions and simulations.

Claus Lundegaard¹, Ole Lund, Can Kesmir, Søren Brunak, Morten Nielsen.

Abstract

MOTIVATION: Immunological bioinformatics methods are applicable to a broad range of scientific areas. The specifics of how and where they might be implemented have recently been reviewed in the literature. However, the background and concerns for selecting between the different available methods have so far not been adequately covered.
SUMMARY: Before using prediction systems, it is necessary to understand not only how the methods are constructed but also their strengths and limitations. The prediction systems in humoral epitope discovery are still in their infancy, but have reached a reasonable level of predictive strength. In cellular immunology, MHC class I binding predictions are now very strong and cover most of the known HLA specificities. These systems work well for epitope discovery, and predictions of the MHC class I pathway have been further improved by integration with state-of-the-art prediction tools for proteasomal cleavage and TAP binding. By comparison, class II MHC binding predictions have not developed to a comparable accuracy level, but new tools have emerged that deliver significantly improved predictions not only in terms of accuracy, but also in MHC specificity coverage. Simulation systems and mathematical modeling are also now beginning to reach a level where these methods will be able to answer more complex immunological questions.

Substances: Immunologic Factors

Year: 2007; PMID: 18045832; PMCID: PMC7110254; DOI: 10.1093/bioinformatics/btm471

Source DB: PubMed; Journal: Bioinformatics; ISSN: 1367-4803; Impact factor: 6.937

1 INTRODUCTION

1.1 Immunology

The adaptive immune system of vertebrates is thought to be only 400 million years old and exists in most fish, amphibians, reptiles, birds and mammals (Thompson, 1995). Adaptive immunity is induced by lymphocytes and can be classified into two types: humoral immunity, mediated by antibodies, which are secreted by B lymphocytes and can neutralize pathogens outside the cells; and cellular immunity, mediated by T lymphocytes that eliminate infected or malfunctioning cells, and provide help to other immune responses. Diversity is the hallmark of the adaptive immune system. Both the B and T lymphocyte-specific receptors for antigen recognition are assembled from variable (V), diversity (D), and joining (J) gene segments early in lymphocyte development. There are multiple copies of the V, D and J segments, and a huge repertoire of T and B cells is generated by the recombination of these segments, reviewed by Li et al. (2004). Another task faced by the immune system is tolerance to self, which is handled by continuously removing receptors that react to self-epitopes.

Special immunoglobulin molecules (antibodies) mediate the humoral response. As mentioned above, the antibodies are produced by B lymphocytes, which bind to antigens by their immunoglobulin receptors, a membrane-bound form of the antibodies. When the B lymphocytes become activated, they start to secrete the soluble form of this receptor in large amounts. The antibody is Y-shaped, and each of the two branches functions independently; a branch can be produced recombinantly and is then known as a Fab. The highly variable tip of the Fab, which can bind to epitopes, is called the paratope and is made up of the so-called complementarity-determining regions (CDRs). Antibodies can coat the surface of an antigen such as a virus, so that it cannot function or infect cells, reviewed by Burton (2002). Antibody-covered viruses or bacteria are easily phagocytosed and destroyed by scavenger cells of the immune system, e.g. the macrophages. Antigenic proteins can be recognized by the antibodies in their native form without any cleavage or interactions with other molecules. Thus the humoral immune response reacts to extracellular pathogens, and the response is crucial in the defense against most pathogens.

B-cell epitopes are normally classified into two groups: continuous and discontinuous epitopes. A continuous epitope (also called a sequential or linear epitope) is a short peptide fragment in a protein that is recognized by antibodies specific for that protein. A discontinuous epitope is composed of residues that are not adjacent in the primary structure (amino acid sequence), but are brought into proximity by the folding of the polypeptide. The classification is not clear-cut, as discontinuous epitopes may contain linear stretches of amino acids, and continuous epitopes may show conformational preferences.

The cellular arm of the immune system consists of two parts: cytotoxic T lymphocytes (CTLs) and helper T lymphocytes (HTLs). CTLs destroy cells that present non-self peptides (epitopes). HTLs are needed for B-cell activation and proliferation to produce antibodies against a given antigen. CTLs, on the other hand, perform surveillance of the host cells, and recognize and kill infected cells, as generally explained in Janeway et al. (2001). Both CTLs and HTLs are raised against peptides that are presented to the immune cells by major histocompatibility complex (MHC) molecules, which are the most polymorphic of mammalian proteins. The human versions of MHCs are referred to as the human leucocyte antigen (HLA). The cells of an individual are constantly screened for such peptides by the cellular arm of the immune system. In the MHC class I pathway, class I MHCs present endogenous antigens to T cells carrying the CD8 receptor (CD8+ T cells).
To be presented, a precursor peptide is normally first generated by the large cytosolic protease complex called the proteasome (Loureiro and Ploegh, 2006). Generally, it then binds to the transporter associated with antigen processing (TAP) for translocation into the endoplasmic reticulum (ER), reviewed by Abele and Tampé (2004), but some peptides can enter the ER independently of TAP. This should be considered when dealing with virus-infected cells or tumor cells that might have reduced or absent TAP function. There are several ways in which a peptide can enter the ER without TAP function, depending on the origin and properties of the peptide. The most well-established model, however, is for proteins containing a signal peptide. Such proteins are translated directly into the ER through the Sec61 transporter complex, and sometimes the cleaved-off signal peptide will end up in the ER. This model is especially relevant for peptides binding to HLAs belonging to the abundant A2 HLA serotype, where TAP-independent presentation is responsible for up to 10% of the A2-restricted epitopes, reviewed in Larsen et al. (2006). During or after the transport into the ER the peptide must bind to the MHC class I molecule (Stoltze et al., 2000; Zhang and Williams, 2006) before it can be transported to the cell surface through the Golgi system.

The most selective step in this pathway is binding of a peptide to the MHC class I molecule. In an older review, Yewdell and Bennink (1999) state that only 1 in 200 peptides binds with an affinity strong enough to generate an immune response. This has been challenged, and it might be that up to 3% of the possible peptides bind strongly enough to generate a subsequent immune response (Assarsson et al., 2007). In another recent work by Moutaftsi et al. (2006), however, it is found that of the 49 epitopes that are responsible for 95% of the total CD8+ T-cell response against a vaccinia challenge in mouse, 90% bind MHC with an affinity stronger than 500 nM.
In any case a peptide must go through the processes in a greater number than competing peptides to be immunodominant. The MHC is the most polymorphic gene system known. This polymorphism is a huge challenge for T-cell epitope discoveries, enhancing the need for bioinformatical analysis and resources. However, it also highly complicates immunological bioinformatics, as predictive methods for peptide MHC binding have to deal with the diverse genetic background of different populations and individuals. On a population basis, hundreds of alleles have been found for most of the HLA encoding loci (1839 in release 2.17.0 of the IMGT/HLA Database, http://www.ebi.ac.uk/imgt/hla/). In a given individual either one or two different alleles are expressed per locus depending on whether the same (in homozygous individuals) or two different (in heterozygous individuals) alleles are coded for on the two different chromosomes. The number of MHC expressing loci, however, differs highly among species. While a fully heterozygous human has six different MHC class I genes, a rhesus macaque may host up to 22 active MHC class I genes (Daza-Vamenta et al., 2004). Each MHC allele binds a very restricted set of peptides and the polymorphism affects the peptide binding specificity of the MHC; one MHC will recognize one part of the peptide space, whereas another MHC will recognize a different part of this space. The very large number of different MHC alleles makes reliable identification of potential epitope candidates an immense task if all alleles are to be included in the search. However, many MHC alleles share a large fraction of their peptide-binding repertoire, and it is often possible to find promiscuous peptides, which bind to a number of HLA alleles. 
A way of reducing the problem is to group all the different alleles into supertypes in such a manner that all the alleles within a given supertype have roughly the same peptide specificity (Hertz and Yanover, 2007; Lund et al., 2004; Reche and Reinherz, 2004; Sette and Sidney, 1998, 1999). This allows the search to be limited to a manageable representative set. Representing a supertype by a well-studied allele might lead to selection of epitopes that are restricted to this allele alone and not to any other alleles within the supertype. Thus another, and potentially more rational, approach would be to select a limited set of peptides restricted to as many alleles as possible. This should be within reach with new methods that directly predict epitopes that can bind to different alleles (promiscuous epitopes) (Brusic et al., 2002), or pan-specific approaches that can make predictions for all alleles where the sequence is known (Jojic et al., 2006; Nielsen et al., 2007a).

When the peptide–MHC complex is presented on the surface of the cell, it might bind to a CD8+ T cell with a fitting T-cell receptor (TCR). Whether such a TCR clone exists depends on, among other factors, whether the TCR–peptide complex is too similar to MHC–peptide complexes generated with peptides from the host proteome (self-peptides). This effect is called tolerance and might be broken by so-called self-epitopes, reviewed by Andersen et al. (2006).

B cells must be activated to produce antibodies against a given antigen, and helper T cells specific for peptides from the antigen must be activated to get a strong B-cell response. The epitope recognized by the helper T cell is usually somehow connected to the epitope that is recognized by the B cell, but the two cells do not necessarily recognize overlapping epitopes. T cells can recognize internal peptides that do not need to be a part of the surface–surface interactions with the B-cell receptor.
In fact, the T-cell and the B-cell epitopes might not even come from the same protein (Janeway et al., 2001). The peptides recognized by the CD4+ T cells are presented by the MHC class II molecule, and peptide presentation on MHC class II molecules follows a different path than the MHC class I presentation pathway (Castellino et al., 1997): MHC class II molecules associate with the invariant chain (Ii) in the ER and the MHC–Ii complex accumulates in endosomal compartments. Here, Ii is degraded, while another MHC-like molecule, called HLA–DM in humans, loads the MHC class II molecules with the best available ligands originating from endocytosed antigens. The peptide–MHC class II complexes are subsequently transported to the cell surface for presentation to T helper cells.

Immunological predictions and simulations have proven highly useful in applied immunology in general, and in vaccinology in particular. They can be used as an efficient tool to lower the experimental workload in epitope discovery for use in rational vaccine design, immunotherapeutics and development of diagnostic tools. A number of recent publications describe in great detail the values and benefits obtained by the use of immunoinformatics and predictions in applied immunology and vaccinology (Davies and Flower, 2007; De Groot, 2006; De Groot and Moise, 2007; Korber et al., 2006; Lund et al., 2005; Petrovsky and Brusic, 2006; Tong et al., 2007). Here, we will not engage in this discussion, but rather limit ourselves to describing the available methods for making such predictions, and deliver some of the background information needed to be able to choose the appropriate method for a given task.

1.2 Prediction methods

A large variety of machine-learning techniques are commonly used in the field of immunological bioinformatics, ranging from the conventional techniques of position-specific scoring matrices (PSSMs) (Altschul et al., 1997), Gibbs sampling (Lawrence et al., 1993; Nielsen et al., 2004), artificial neural networks (ANNs) described in Baldi and Brunak (2001), hidden Markov models (HMMs) explained in Hughey and Krogh (1996), and support vector machines (SVMs) described in Cortes and Vapnik (1995), to more exotic methods like ant colonies (Karpenko et al., 2005) and other motif search algorithms (Bui et al., 2005; Chang et al., 2006; Murugan and Dai, 2005). ANNs and SVMs are ideally suited to recognize non-linear patterns, which are believed to contribute to, for instance, peptide–HLA-I interactions (Adams and Koziol, 1995; Brusic et al., 1994; Buus et al., 2003; Gulukota et al., 1997; Nielsen et al., 2003). In an ANN, information is trained and distributed into a computer network with an input layer, hidden layers and an output layer, all connected in a given structure through weighted connections (Baldi and Brunak, 2001). In a PSSM, on the other hand, all positions in the motif are assumed to contribute in an independent manner, and the likelihood for matching a motif is calculated as a sum of individual matrix scores. The Gibbs sampler method is a particular implementation of the PSSM search algorithm, where the optimal PSSM is determined by a search for a sequence alignment that provides maximal information content for a given motif length. Conventionally, PSSMs are log-odds matrices (Altschul et al., 1997), where the weight matrix elements are estimated from the logarithm of the ratio of the observed frequency of a given amino acid to the background frequency of that amino acid. However, many other techniques exist to construct a PSSM, including the stabilization matrix method (SMM) (Peters and Sette, 2005) and evolutionary algorithms (Brusic et al., 1998).
The PSSMs might also be coupled with other available information to compensate for lack of data (Lundegaard et al., 2004). Finally, HMMs have been used in the field of immunological bioinformatics. These are well suited to characterizing biological motifs with an inherent structural composition, and have been used to predict, for instance, peptide binding to MHC class I (Mamitsuka, 1998) and class II (Noguchi et al., 2002) molecules. Besides machine-learning techniques, (empirical) molecular force field modeling techniques (Logean et al., 2001) and 3D Quantitative Structure–Activity Relationship (3D-QSAR) analysis (Doytchinova and Flower, 2002; Zhihua et al., 2004) have also been used to predict features of the immune system.
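
As a rough illustration of the log-odds PSSM described in this section, the following Python sketch builds a matrix from a handful of aligned 9-mer peptides and scores a query peptide as a sum of independent per-position terms. The training peptides, the uniform background frequency of 1/20 and the pseudocount of 1 are illustrative assumptions, not values taken from any of the cited methods, and the function names are hypothetical.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(peptides, background=None, pseudocount=1.0):
    """Return one dict per peptide position holding log-odds weights.

    weight[pos][aa] = log2(observed frequency / background frequency)
    """
    length = len(peptides[0])
    if any(len(p) != length for p in peptides):
        raise ValueError("all peptides must have the same length")
    if background is None:
        # Assumption: uniform background; real methods typically use
        # amino acid frequencies estimated from a sequence database.
        background = {aa: 1.0 / len(AMINO_ACIDS) for aa in AMINO_ACIDS}
    pssm = []
    for pos in range(length):
        # The pseudocount avoids log(0) for amino acids never observed.
        counts = {aa: pseudocount for aa in AMINO_ACIDS}
        for pep in peptides:
            counts[pep[pos]] += 1.0
        total = sum(counts.values())
        pssm.append({aa: math.log2((counts[aa] / total) / background[aa])
                     for aa in AMINO_ACIDS})
    return pssm


def score(pssm, peptide):
    """Sum the per-position log-odds weights (positions are independent)."""
    if len(peptide) != len(pssm):
        raise ValueError("peptide length does not match the matrix")
    return sum(column[aa] for column, aa in zip(pssm, peptide))


# Hypothetical training peptides sharing L at position 2 and V at position 9.
TRAINING = ["ALAKAAAAV", "GLDEFGHIV", "KLMNPQRSV", "TLWYACDEV"]
PSSM = build_pssm(TRAINING)
```

With this toy matrix, a peptide carrying the shared residues at positions 2 and 9 scores higher than one that lacks them. Because each position contributes independently, such a matrix cannot capture the correlated effects that ANNs and SVMs are designed to model.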

1.3 Performance measures and validation

To evaluate the general quality of a prediction method, a measure describing this quality is needed. However, no single measure can capture all qualities of a prediction, and not all types of data and predictions can be reasonably described by the same measure. So to be able to compare different systems, it is often necessary to present several measures of quality. Most measures need the data to be classified into two groups, i.e. positives and negatives. The number of classified (experimentally measured) positives is often designated actual positives (AP) and the number of negatives actual negatives (AN); the corresponding counts for the predictions are predicted positives (PP), predicted negatives (PN), truly predicted positives (TP), falsely predicted positives (FP), truly predicted negatives (TN) and falsely predicted negatives (FN). Some of the most often used measures are briefly described here. The equations for the mentioned measures are given at the end of the section. The fraction correctly predicted (FCP) is the fraction of the total predictions that falls into the correct group. This measure is intuitive, but has the weakness that if a large fraction of the total evaluation data falls into a single group, one will get high performance by just blindly predicting most or even everything to belong to this category. The positive predictive value (PPV) is the fraction of the positive predictions that actually falls into the positive class. The sensitivity is the fraction of the AP that is predicted as positives using a given threshold. The specificity is the fraction of the AN that is predicted as negatives. The three latter measures are also easily grasped; however, they are all dependent on the chosen prediction cutoff classifying the data into positive and negative predictions.
A high sensitivity can be obtained by setting the prediction cutoff so that most of the evaluation data fall into the positive group, but this will then be at the expense of the specificity and the PPV. Which cutoff to use is determined by the purpose of the prediction, i.e. how many verified epitopes are needed versus the resources available for experimental validation. A plot of the sensitivity against the false positive rate (1-specificity) is called a receiver operating characteristic (ROC) curve (Swets, 1988). Such a plot can help in setting the best prediction cutoff. One of the best ways of measuring the predictive power of a method is to calculate the area under the ROC curve (AUC), since this is a threshold-independent measure. Another robust measure is the Pearson correlation coefficient (PCC), which is a measure of how well the prediction scores correlate with the actual values on a linear scale. In situations where the correlation is not necessarily linear, Spearman's rank correlation coefficient (SRC) is more appropriate. In this measure each prediction is ranked on the basis of the prediction score, and the PCC is calculated on the basis of this rank rather than the prediction score. The SRC, like the AUC, is a threshold-independent measure of how well the predictor ranks the data when compared with the actual ranking. When comparing different methods, the threshold-independent measures are to be preferred. Otherwise a threshold has to be set under the same assumptions for all predictors. As an example, one can estimate the specificity for each predictor by setting the threshold for the given predictor to a value where the sensitivity will be 0.5 (i.e. half of the total available positives are over the threshold), or estimate the sensitivity at a threshold where the specificity will be 0.8 (i.e. 80% of the AN are predicted as negatives). The choice of an evaluation set is also absolutely crucial, and several considerations must be taken into account.
A large and diverse dataset is to be preferred to avoid any biases in prediction space. Extreme care should also be taken to ensure that none of the predictors have been trained on the data used for evaluation even though that might not always be possible. To make the evaluation as broad as possible cross-validation is often used, i.e. the method is trained on a large part of the available data and a smaller part is left out for evaluation. This is done until all data has been included in the evaluation set and in this way it is possible to estimate the performance on the complete dataset. Caution has to be taken, however, that the part used for training is not too similar to the evaluation part, as this will lead to an overestimation of the performance due to overtraining. This is especially true when using the leave-one-out version of cross-validation where everything except one data point is used for training, and the evaluation is then performed on the ensemble of the left out data points. Equations are as follows:
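
Following the definitions given in this section, the threshold-dependent measures can be written as FCP = (TP + TN) / (AP + AN), PPV = TP / PP, sensitivity = TP / AP and specificity = TN / AN. The Python sketch below implements these together with the AUC and the SRC. It is a minimal illustration rather than a reference implementation: the AUC is computed through its rank interpretation (the fraction of positive/negative pairs in which the positive receives the higher score, which equals the area under the ROC curve), the convention that scores at or above the cutoff count as positive is an assumption, and all function names are hypothetical.

```python
import math


def threshold_measures(scores, labels, cutoff):
    """FCP, PPV, sensitivity and specificity at a given score cutoff.

    Assumption: a score >= cutoff is a positive prediction; labels are 1/0.
    """
    tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y)
    fp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and not y)
    tn = sum(1 for s, y in zip(scores, labels) if s < cutoff and not y)
    fn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y)
    ap, an, pp = tp + fn, tn + fp, tp + fp
    return {
        "FCP": (tp + tn) / (ap + an),
        "PPV": tp / pp if pp else float("nan"),
        "sensitivity": tp / ap,
        "specificity": tn / an,
    }


def auc(scores, labels):
    """Fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def ranks(values):
    """1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            result[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient (PCC) of two equally long lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rank correlation (SRC): the PCC computed on ranks."""
    return pearson(ranks(x), ranks(y))
```

For a monotonic but non-linear relationship, such as measured values that are the squares of the predicted scores, `spearman` returns 1 while `pearson` stays below 1, which is the situation in which the text recommends the SRC.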

2 CURRENT PREDICTION ALGORITHMS

The state-of-the-art class I T-cell epitope prediction methods are today of a quality that makes them highly useful as an initial filtering technique in epitope discovery. Studies have demonstrated how it is possible to rapidly identify and verify MHC binders from emerging threats such as the SARS virus (Sylvester-Hvid et al., 2004) with high reliability, and to take such predictions a step further and validate the immunogenicity of peptides with limited effort, as has been shown with the influenza A virus (Wang et al., 2007). It is also possible to identify the vast majority of the relevant epitopes in a rather complex organism such as the vaccinia virus using class I MHC binding predictions, while only having to test a very minor fraction of the possible peptides in the virus proteome (Moutaftsi et al., 2006). MHC class II predictions can be made fairly reliable for certain alleles, and a number of helper epitopes have been identified with the help of bioinformatical approaches (Consogno et al., 2003). B-cell epitope prediction is still the most complicated task. However, some consistency between predicted and verified epitopes is starting to emerge using the newest prediction methods (Dahlback et al., 2006). In the following, we describe some of the best-performing prediction methods within each area.

2.1 B-cell epitope predictions

B-cell epitope prediction is a highly challenging field due to the fact that the vast majority of antibodies raised against a specific protein interact with discontinuous fragments (van Regenmortel, 1996). The prediction of continuous, or linear, epitopes, however, is a somewhat simpler problem, and may still be useful for synthetic vaccines or as diagnostic tools (Regenmortel and Muller, 1999). Moreover, the determination of continuous epitopes can be integrated into the determination of discontinuous epitopes, as these often contain linear stretches (Hopp, 1994). In the early 1980s, Hopp and Woods (1981, 1983) developed the first linear epitope prediction method. This method assumes that the regions of proteins that have a high degree of exposure to solvent contain the antigenic determinants. Using the hydrophilicity scale generated by Levitt (1976), Hopp and Woods (1981) assigned a hydrophilicity propensity to each amino acid in a sequence and looked at groups of six residues. This gave promising results, and a number of methods have since been developed with the aim of predicting linear epitopes using a combination of different amino acid propensities (Alix, 1999; Debelle et al., 1992; Jameson and Wolf, 1988; Maksyutov and Zagrebelnaya, 1993; Odorico and Pellequer, 2003; Parker et al., 1986). Pellequer et al. (1993) proposed an evaluation set containing 85 continuous epitopes in 14 proteins and found that the method based on turn propensity (i.e. the propensity of an amino acid to occur within a turn structure) had the highest sensitivity using this set. Seventy percent of the residues predicted to be in epitopes by this method were actually part of epitopes. The sensitivity for methods based on other propensities was in the range of 36–61% (Pellequer et al., 1991).
Analyzing the epitope regions in the Pellequer dataset reveals that almost all the hydrophobic amino acids are underrepresented, supporting the assumption that linear B-cell epitopes will occur in hydrophilic regions of the proteins. An extensive study of linear B-cell epitope prediction methods was published by Blythe and Flower (2005). To test how well peaks in single amino acid scale propensity profiles are (significantly) associated with known linear epitope locations, 484 amino acid propensities from the AAindex database (http://www.genome.ad.jp) (Kawashima and Kanehisa, 2000) were used. As a test set they used 50 epitope-mapped proteins defined by polyclonal antibodies, which was the best non-redundant test set available. Blythe and Flower (2005) found, however, that even the predictions based on the most accurate amino acid scales were only marginally better than random, suggesting that more sophisticated approaches are needed to predict linear epitopes. BepiPred (Larsen et al., 2006), an algorithm that combines scores from the Parker hydrophilicity scale (Parker et al., 1986) and a PSSM trained on linear epitopes, shows a small, but significant, increase in AUC over earlier scale-based methods. The sequence parametrizer algorithm (Sollner, 2006; Sollner and Mayer, 2006), along with its associated machine-learning methods, uses the common single amino acid propensity scales, but also incorporates neighborhood parameters reflecting the probability that a given stretch of amino acids exists within a predefined proximity of a specific amino acid residue. Training and testing on epitope sequences pulled from a high-quality proprietary database, as well as several publicly accessible databases, yields a degree of accuracy that is greatly increased over single-parameter methods.

Different experimental techniques can be used to define conformational epitopes.
Probably the most accurate and most easily defined approach is to use the solved structures of antibody–antigen complexes (Fleury et al., 2000; Mirza et al., 2000). The amount of this kind of data is unfortunately still scarce compared to linear epitopes. Furthermore, very few antigens have been studied in a way where all possible epitopes on a given antigen have been identified. Unidentified epitopes within the dataset will lower the apparent performance of an accurate prediction method by increasing the apparent false positive rate. The simplest way to predict the possible epitopes in a protein of known 3D structure is to use the knowledge of surface accessibility (Novotny et al., 1986; Thornton et al., 1986). Two newer methods using protein structure and surface exposure for prediction of B-cell epitopes have been developed. The CEP method (Kulkarni-Kale et al., 2005) calculates the relative accessible surface area for each residue in the structure. It then determines which parts of the protein are exposed enough to be antigenic determinants. Regions that are distant in the primary sequence, but close in three-dimensional space, are considered as one epitope. The tool was tested on a dataset of 63 antigen–antibody complexes and the algorithm correctly identified 76% of the epitope residues. DiscoTope (Haste et al., 2006) uses a combination of amino acid statistics, spatial information and surface exposure. It is trained on a compiled dataset of discontinuous epitopes from 76 X-ray structures of antibody–antigen protein complexes. This method outperforms methods that predict linear epitopes.

Recently a workshop was held on the subject of B-cell epitope predictions, attended by a broad range of the current method developers. The workshop resulted in a published review containing conclusions on the present common ground, and suggestions for the future, especially concerning coordination and evaluation (Greenbaum et al., 2007).
Different ways of measuring the accuracy of B-cell epitope predictions have been suggested (Hopp, 1994; van Regenmortel and Pellequer, 1994). Pellequer suggested using the specificity as a measure of accuracy, while Hopp suggested using the PPV, but, as described earlier, neither measure alone will give a good description of the performance. In accordance with this, the recent workshop concluded that the AUC measure is to be preferred (Greenbaum et al., 2007). Another issue is whether to make the statistics on a per-residue or on a per-epitope basis. However, as the latter has the additional complications of defining how much of an epitope must be included in a prediction for it to be considered correct, and how many extra residues are allowed, the per-residue measure is to be preferred.

Epitope mapping can be performed experimentally by methods other than structure determination, e.g. by phage display (Jesaitis et al., 1999; Smith and Petrenko, 1997). The low sequence similarity between the mimotope [i.e. a macromolecule, often a peptide, which mimics the structure of an epitope (Meloen et al., 2000)] identified through phage display and the antigen complicates the mapping back onto the native structure of the antigen. A number of methods have been developed to facilitate this (Batori et al., 2006; Enshell-Seijffers et al., 2003; Halperin et al., 2003; Huang et al., 2006; Moreau et al., 2006; Mumey et al., 2003; Schreiber et al., 2005; Tarnovitski et al., 2006). However, these are to be considered as interpreters of experimental data rather than predictors, which are the main focus of this review.
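
The propensity-scale approach introduced at the start of this section (Hopp and Woods, 1981) can be sketched in a few lines of Python. The scale values below are the Hopp–Woods hydrophilicity values as they are commonly reproduced and should be checked against the original publication before any real use. The six-residue window follows the text, while the function names and the example sequence are purely illustrative assumptions.

```python
# Hopp-Woods hydrophilicity values as commonly tabulated (assumption: verify
# against Hopp and Woods, 1981, before relying on them).
HOPP_WOODS = {
    "R": 3.0, "D": 3.0, "E": 3.0, "K": 3.0, "S": 0.3,
    "N": 0.2, "Q": 0.2, "G": 0.0, "P": 0.0, "T": -0.4,
    "A": -0.5, "H": -0.5, "C": -1.0, "M": -1.3, "V": -1.5,
    "I": -1.8, "L": -1.8, "Y": -2.3, "F": -2.5, "W": -3.4,
}


def window_profile(sequence, scale=HOPP_WOODS, window=6):
    """Mean scale value for every run of `window` consecutive residues."""
    if len(sequence) < window:
        return []
    values = [scale[aa] for aa in sequence]
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]


def best_window(sequence, scale=HOPP_WOODS, window=6):
    """Return (start index, peptide, mean score) of the top-scoring window."""
    profile = window_profile(sequence, scale, window)
    if not profile:
        raise ValueError("sequence is shorter than the window")
    start = max(range(len(profile)), key=profile.__getitem__)
    return start, sequence[start:start + window], profile[start]


# Illustrative sequence: a charged stretch flanked by hydrophobic residues.
EXAMPLE = "LLLLLLKDEKRDLLLL"
```

Calling `best_window(EXAMPLE)` picks out the charged stretch in the middle of the toy sequence. As Blythe and Flower (2005) showed, peaks in such single-scale profiles are only weakly associated with real epitopes, so a sketch like this is a baseline rather than a usable predictor.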

2.2 MHC binding

A number of methods for predicting the binding of peptides to MHC molecules have been developed (Schirle et al., 2001) since the first motif methods were presented (Rothbard and Taylor, 1988; Sette et al., 1989). The majority of peptides binding to MHC class I molecules have a length of 8–10 amino acids. Position 2 and the C-terminal position have generally turned out to be very important for the binding to most class I MHCs, and these positions are referred to as anchor positions (Rammensee et al., 1999). For some alleles, the binding motifs further have auxiliary anchor positions. Peptides binding to the human HLA-A*0101 allele thus have positions 2, 3 and 9 as anchors (Kondo et al., 1997; Kubo et al., 1994; Rammensee et al., 1999). The importance of anchor positions for peptide binding and the allele-specific amino acid preference at the anchor positions was first described by Falk et al. (1990). The discovery of such allele-specific motifs led to the development of the first reasonably accurate algorithms (Pamer et al., 1991; Rotzschke et al., 1991). In these prediction tools, it is assumed that the amino acids at each position along the peptide sequence contribute a given binding energy, and that these contributions can be added up independently to yield the overall binding energy of the peptide (Meister et al., 1995; Parker et al., 1994; Stryhn et al., 1996). Similar types of approaches are used by the EpiMatrix method (Schafer et al., 1998), the BIMAS method (Parker et al., 1994), the SYFPEITHI method (Rammensee et al., 1999), the RANKPEP method (Reche et al., 2002) and the Gibbs sampler method (Nielsen et al., 2004). Several of these matrix methods are built using exclusively positive examples defined by certain criteria, such as eluted peptides and interferon gamma response data. Such data can be used in training alongside affinity data that define binding as stronger than a certain threshold (usually 500 nM).
Other matrix methods, like the SMM method, aim at predicting an actual affinity and thus use exclusively affinity data. As described earlier, matrix-based methods cannot take correlated effects into account, i.e. where the contribution to the binding affinity by a given amino acid at one position is influenced by amino acids at other positions in the peptide. Higher order methods like ANNs and SVMs, on the other hand, are ideally suited to take such correlations into account. These methods can be trained with data either in the format of binder/non-binder classification, or as real affinity data. Some of the recent methods combine the two types of data and prediction methods, either by averaging over predictions made by either (Bhasin and Raghava, 2007), or by feeding the predictions from the positive data-trained PSSMs to ANNs together with sequence/affinity data (Nielsen et al., 2003). A study by Yu et al. (2002) clearly shows the influence of having a large dataset on the performance of the resulting method. However, including knowledge of important positions reduces the need for data significantly (Lundegaard et al., 2004).

Several prediction methods have been made publicly available, and care should be taken when selecting between them. The published performance, and how it was evaluated, should be examined, but it is also very important that the method is able to generate predictions for the actual allele of interest. A major study comparing the predictive performance of a large part of the available methods was recently performed by Peters et al. (2006), showing that in general the SMM and the ANN methods (Table 1) perform the best, even when the amount of training data for each method is taken into account. The cross-validated performance of these methods for several human and mouse MHC class I alleles was compared with the best performing other method available as a web tool. The full results of this work are listed in Supplementary Table 1.
The tools and URLs are listed in Table 1. It should be mentioned, however, that tools known to be trained on a significant part of the test set were excluded from this comparison. To achieve binding predictions for an allele with uncharacterized specificity, the supertype concept (Sette and Sidney, 1998) can be used for the limited number of alleles with well-defined supertype relationships (Lund et al., 2005). Note, however, that predictions from methods that predict for the specific allele are most often to be preferred, as their accuracy will be better (Nielsen et al., 2007a).
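The additive assumption behind the matrix methods described above can be illustrated with a minimal sketch. The matrix values, the names `TOY_MATRIX`, `score_peptide` and `rank_peptides`, and the example sequence are hypothetical and chosen only for illustration; they do not reproduce the parameters of BIMAS, SYFPEITHI or any other published method.

```python
from typing import Dict, List, Tuple

# Hypothetical position-specific scoring matrix for 9-mer peptides.
# Each column maps an amino acid to an illustrative score; residues
# that are not listed contribute 0.0. Positions 2 and 9 (indices 1
# and 8) play the role of anchor positions in this toy example.
TOY_MATRIX: List[Dict[str, float]] = [
    {},                        # position 1
    {"L": 2.0, "M": 1.5},      # position 2 (anchor)
    {}, {}, {}, {}, {}, {},    # positions 3-8
    {"V": 2.0, "L": 1.5},      # position 9 (C-terminal anchor)
]


def score_peptide(peptide: str, matrix: List[Dict[str, float]]) -> float:
    """Sum independent per-position contributions (no correlated effects)."""
    if len(peptide) != len(matrix):
        raise ValueError("peptide length must match the matrix length")
    return sum(column.get(aa, 0.0) for aa, column in zip(peptide, matrix))


def rank_peptides(protein: str, matrix: List[Dict[str, float]]) -> List[Tuple[str, float]]:
    """Score every overlapping window of the protein and sort by score."""
    k = len(matrix)
    scored = [
        (protein[i:i + k], score_peptide(protein[i:i + k], matrix))
        for i in range(len(protein) - k + 1)
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

Because each position is scored independently, a sketch of this kind cannot represent the correlated effects that the higher order methods (ANNs, SVMs) are designed to capture.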

Table 1.

URLs for a selected subset of the methods in Peters et al. (2006)

Name	URL
IEDB^a	http://tools.immuneepitope.org/analyze/html/mhc_binding.html
NetMHC^b	http://cbs.dtu.dk/services/NetMHC
BIMAS	http://thr.cit.nih.gov/cgi-bin/molbio/ken_parker_comboform
hla_a2_smm	http://zlab.bu.edu/SMM-cgi/peptide1.cgi
hlaligand	http://hlaligand.ouhsc.edu/prediction.htm
libscore	http://hypernig.nig.ac.jp/cgi-bin/Lib-score/request.rb
mhcpred	http://www.jenner.ac.uk/MHCPred/
multipredann	http://research.i2r.a-star.edu.sg/multipred/HTML/predict.html
pepdist	http://www.pepdist.cs.huji.ac.il/
predbalbc	http://antigen.i2r.a-star.edu.sg/predBalbc/
rankpep	http://mif.dfci.harvard.edu/Tools/rankpep.html
svmhc	http://www.sbc.su.se/svmhc/new.cgi
syfpeithi	http://www.syfpeithi.de/

^a The SMM, ARB, and ANN methods from Peters et al. (2006).

^b Updated version of the ANN method from Peters et al. (2006).

In general, HLA-I binding predictions depend on sufficient experimental data being available for the exact HLA-I molecule in question. Unfortunately, <10% of the 1500 registered HLA-I proteins (Lefranc, 2005) have been examined experimentally, and <5% have been characterized with more than 50 examples of peptide binders (Rammensee et al., 1999; Sette et al., 2005). Several groups have suggested prediction strategies to span these ‘uncharacterized’ regions of the HLA diversity (Brusic et al., 2002; Jojic et al., 2006; Nielsen et al., 2007; Zhu et al., 2006). In different forms, all these methods exploit both peptide and primary HLA sequence as input information for training, aiming at simultaneously incorporating all HLA specificities. A recent paper (Nielsen et al., 2007a) demonstrated that such an approach can, to a very high degree, accurately characterize the binding motif for previously untested HLA-I molecules.

Unlike that of the MHC class I molecules, the binding cleft of MHC class II molecules is open-ended, which allows the bound peptide to have significant overhangs at both ends. As a result, MHC class II binding peptides have a broader length distribution, even though the part of the binding peptide that interacts with the MHC (the binding core) still includes only nine amino acid residues. This complicates binding predictions, as identification of the correct alignment of the binding core is a crucial part of identifying the MHC class II binding motif (Nielsen et al., 2004). The MHC class II binding motifs have relatively weak and often degenerate sequence signals. 
While some alleles like HLA-DRB1*0405 show a strong preference for certain amino acids at the anchor positions, other alleles like HLA-DRB1*0401 allow basically all amino acids at all positions (Rammensee et al., 1999). However, there are other issues affecting the predictive performance of most MHC class II binding prediction methods. The majority of these methods take as a fundamental assumption that the peptide–MHC binding affinity is determined solely by the nine amino acids in the binding core motif. This is clearly a large oversimplification, since it is known that peptide flanking residues (PFRs) on both sides of the binding core may contribute to the binding affinity and stability (Godkin et al., 2001). Some methods for MHC class II binding have attempted to include PFRs indirectly, in terms of the peptide length, in the prediction of binding affinities (Chang et al., 2006). Recently, Nielsen et al. (2007b) published a method for MHC class II prediction that directly includes PFRs and demonstrated that these PFRs improve the prediction accuracy. Most of the methods for MHC class II binding predictions have been trained and evaluated on very limited datasets covering only a single or a few different MHC class II alleles, making it very difficult to compare the performance values and generality of the different methods. Nielsen et al. (2007b) have made available a large-scale benchmark set-up for evaluating MHC class II peptide binding affinity prediction algorithms. The benchmark covers 14 HLA-DR (human MHC) and three mouse H2-IA alleles, and consists of peptide/IC50 affinity data downloaded from the publicly available IEDB database (Peters et al., 2005). It could serve as the starting point for large-scale unbiased evaluations of novel methods for MHC class II prediction.
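The binding core alignment problem described above can be sketched as a search over all nine-residue windows of a longer peptide. The example below is a minimal illustration under stated assumptions: the toy matrix `CLASS_II_TOY`, its anchor positions and weights, and the function name `best_core` are hypothetical, and the flanking residues are only reported, not scored, so this is not the method of Nielsen et al. (2007b) or of any other cited work.

```python
from typing import Dict, List, Tuple

# Hypothetical 9-position core matrix; unlisted residues score 0.0.
# The anchor positions and weights are illustrative only.
CLASS_II_TOY: List[Dict[str, float]] = [
    {"F": 2.0, "Y": 2.0},   # core position 1
    {}, {},
    {"D": 1.0},             # core position 4
    {},
    {"T": 1.0},             # core position 6
    {}, {},
    {"L": 1.5},             # core position 9
]


def best_core(peptide: str, matrix: List[Dict[str, float]]) -> Tuple[int, str, float, str, str]:
    """Return (offset, core, score, N-terminal flank, C-terminal flank)
    for the highest-scoring core window in the peptide."""
    core_len = len(matrix)
    if len(peptide) < core_len:
        raise ValueError("peptide is shorter than the binding core")
    best = None
    for start in range(len(peptide) - core_len + 1):
        core = peptide[start:start + core_len]
        score = sum(col.get(aa, 0.0) for aa, col in zip(core, matrix))
        if best is None or score > best[2]:
            best = (start, core, score)
    start, core, score = best
    # Peptide flanking residues (PFRs) on either side of the chosen core.
    return start, core, score, peptide[:start], peptide[start + core_len:]
```

A method that takes PFRs into account would add a term depending on the two returned flanks; how such a term should be parameterized is not specified here.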

2.3 Processing

Successful prediction of the proteasome cleavage site specificity should provide valuable additional information useful in the design of treatments based on CTL responses. However, the complexity of proteasomal enzymatic specificity complicates such predictions. Proteasomal cleavage has a highly stochastic element, exemplified by the observation that only ∼80% of the cleavage sites observed in one in vitro experiment can be verified in a second identical experiment (Hansjörg Schild, personal communication). It is thus expected that the accuracy of proteasomal cleavage predictions will be relatively low compared to that of methods for MHC peptide binding. FragPredict, which is publicly available as part of the MAPPP service (http://www.mpiibberlin.mpg.de/MAPPP/), combines proteasomal cleavage predictions with MHC- and TAP-binding predictions. FragPredict consists of two algorithms. The first algorithm uses a statistical analysis of cleavage-enhancing and -inhibiting amino acid motifs to predict potential proteasomal cleavage sites (Holzhutter et al., 1999). The second algorithm, which uses the results of the first algorithm as input, predicts which fragments are most likely to be generated. This model takes the time-dependent degradation into account based on a kinetic model of the 20S proteasome (Holzhutter and Kloetzel, 2000). At the moment, FragPredict is the only method that can predict fragments, instead of only possible cleavage sites. PAProC (http://www.paproc.de) is a prediction method for cleavages by human as well as wild type and mutant yeast proteasomes. The influences of different amino acids at different positions are determined by using a stochastic hill-climbing algorithm (Kuttler et al., 2000) based on experimentally verified in vitro cleavage and non-cleavage sites (Nussbaum et al., 2001). Both the FragPredict and PAProC methods make use of the limited in vitro proteasomal digest data available. 
FragPredict is a linear method, and it may not capture the non-linear features of the specificity of the proteasome. The NetChop (Kesmir et al., 2002) method tries to address these two issues. The prediction system is a multilayered ANN and uses naturally processed MHC class I ligands to predict proteasomal cleavage. Since some of these ligands are generated by the immunoproteasome, and some by the constitutive proteasome, such a method should predict the combined specificity of both forms of proteasomes. In 2003, NetChop-2.0 was evaluated to be the best-performing predictor on an independent evaluation set (Saxová et al., 2003). Pcleavage is another web-accessible proteasomal cleavage predictor, which is SVM based and has a published performance comparable to that of NetChop-2.0 (Bhasin and Raghava, 2005). An update of the NetChop method [NetChop-3.0, Nielsen et al. (2005)] consists of a combination of several ANNs, each trained using a different sequence-encoding scheme of the data. NetChop-3.0 has a higher prediction sensitivity than NetChop-2.0, without lower specificity, and is thus probably the current best predictor of proteasomal cleavage. Tenzer et al. (2004) have published a weight matrix based method for prediction of both constitutive- and immunoproteasomal cleavage specificity. Both matrices are trained on in vitro digest data.

Relatively few methods have been developed to predict the specificity of TAP. Daniel et al. (1998) have developed ANNs using peptide 9mers for which TAP affinity was determined experimentally. Surprisingly, they found that some MHC alleles have ligands with very low TAP affinities, e.g. HLA-A2. However, it has been shown that TAP ligands can be trimmed in the ER before binding to MHC molecules (Fruci et al., 2001), i.e. a TAP ligand might be an epitope precursor and thus does not need to be 9 amino acids long. HLA-A2 might easily have precursors of its optimal ligands that are also good TAP binders. 
Peters et al. (2003) used an SMM to predict the TAP affinity of peptides. This method has the advantage of not being restricted to 9mers; it can also be used for longer peptides. The method assumes that only the first three positions at the N-terminus and the last position at the C-terminus influence TAP binding. The method is very well evaluated and the accuracy is high. The significance of TAP binding in the epitope presentation pathway is much lower than that of MHC binding (see later), and the AUC value of 0.79 obtained when this method is used alone as an epitope predictor is thus significantly lower than that of most MHC-binding prediction methods. Two methods were published in 2004. Bhasin and Raghava (2004) published a method that they compare only with the method of Daniel et al. (1998), so it has not been determined how it performs compared to the method of Peters et al. (2003). The method of Doytchinova et al. (2004) is evaluated by comparing the resulting matrix with other matrices. From such a comparison it can only be concluded that this method is closer to the model of Peters et al. (2003) than to the model of Bhasin and Raghava (2004), but not how it actually performs. Recently a new TAP predictor, PredTAP, has been published (Zhang et al., 2006). No AUC value has been reported for this method's performance in epitope prediction, making a direct comparison with other models impossible. With increasing numbers of TAP ligands available on the internet (e.g. Jen-Pep database, http://www.jenner.ac.uk) (Blythe et al., 2002), it will likely soon be possible to obtain more accurate TAP predictions. With respect to TAP-independent transport and cleavage of peptides, the most established model is especially connected to the most abundant HLA supertype (A2) and is related to signal peptides and their processing (Larsen et al., 2006). 
Prediction of potential signal peptides that can be transported by Sec61 can be made with tools for prediction of signal peptides, and some of these will also predict the signal peptidase cleavage site (Bendtsen et al., 2004; Kall et al., 2004; Zhang and Henzel, 2004), but the value in the context of CD8+ T-cell epitope predictions remains to be elucidated. The TCRs are generated by highly stochastic processes, which ensure that the TCRs in general will be able to recognize the entire probable space of MHC–peptide complexes. However, TCRs that recognize self-peptides will be eliminated, so peptides whose complexes with MHC are indistinguishable from those of self-peptides will not be recognized. It is still not clear how close peptides must be to self to be able to escape recognition in this way (Louzoun et al., 2006).
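The assumption stated above for matrix-based TAP prediction, that only the first three N-terminal positions and the C-terminal position contribute, means that a single scoring function can be applied to peptides of any length of at least four residues. The following sketch illustrates only this structural idea. The weights in `N_TERM_WEIGHTS` and `C_TERM_WEIGHTS`, the convention that a higher score means better binding, and the function name `tap_score` are all hypothetical and are not taken from Peters et al. (2003).

```python
from typing import Dict, List

# Hypothetical weights for the first three N-terminal positions.
# Unlisted residues contribute 0.0; higher means better (toy convention).
N_TERM_WEIGHTS: List[Dict[str, float]] = [
    {"K": 1.0, "R": 1.0},
    {"R": 1.0},
    {"W": 0.5, "Y": 0.5},
]

# Hypothetical weights for the C-terminal residue.
C_TERM_WEIGHTS: Dict[str, float] = {"F": 1.0, "Y": 1.0, "L": 0.8}


def tap_score(peptide: str) -> float:
    """Score a peptide from its first three residues and its last residue.

    Residues between these positions are ignored, so 9mers and longer
    epitope precursors are handled by the same function.
    """
    if len(peptide) < 4:
        raise ValueError("peptide must have at least four residues")
    n_term = sum(
        column.get(aa, 0.0) for aa, column in zip(peptide[:3], N_TERM_WEIGHTS)
    )
    return n_term + C_TERM_WEIGHTS.get(peptide[-1], 0.0)
```

In this toy setting an N-terminally extended precursor can score differently from the 9mer it contains, which is consistent with the observation above that a poor TAP binder may still reach the ER as part of a longer precursor.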

2.4 Integrated T-cell epitope predictions

Reliable predictions of immunogenic peptides can reduce the experimental effort needed to identify new epitopes. Although reliable predictions of MHC binding alone can indeed be used to rank the possible epitopes very accurately, even better predictions should be possible if the other steps in the pathway are integrated in the predictions. Accordingly, many attempts have been made to predict the outcome of the steps involved in antigen presentation: MAPP (Hakenberg et al., 2003), NetCTL (Larsen et al., 2005), MHCpathway (Tenzer et al., 2005), epiJen (Doytchinova et al., 2006) and WAPP (Donnes and Kohlbacher, 2005). All these methods attempt to predict antigen presentation by integrating peptide–MHC binding predictions with one or more of the other events involved in the antigen presentation pathway. To benchmark these, a set of verified epitopes can be used as the positive dataset. Negative examples (peptides that cannot induce an immunologic response) are hard to identify, as it is very hard to determine that a peptide will never be an epitope in any person with a given HLA haplotype. Instead, epitopes from well-studied pathogens (e.g. HIV) are often used as the positive set, and all other peptides from the genome of the same pathogen that have never been shown to be an epitope are assumed to be negative, as they have a very low probability of being an epitope. A large-scale benchmark calculation comparing the predictive performance of several publicly available MHC-I presentation prediction methods on a large set of known HIV epitopes (http://www.cbs.dtu.dk/suppl/immunology/CTL-1.2/HIV_dataset) reveals that the updated NetCTL and MHCpathway methods have the highest predictive performance, with >75% of the epitopes being within the top 5% of peptides with the highest prediction scores (Mette Volby Larsen, personal communication).
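The benchmark measure quoted above, the fraction of known epitopes found among the top-scoring peptides, can be computed as in the following sketch. The weighted-sum form of `combined_score` and its default weights are assumptions made for illustration and are not the published parameters of NetCTL, MHCpathway or any other method; the function and argument names are likewise hypothetical.

```python
from typing import Dict, Set


def combined_score(mhc: float, cleavage: float, tap: float,
                   w_cleavage: float = 0.1, w_tap: float = 0.05) -> float:
    """Integrate pathway steps as a weighted sum (hypothetical weights)."""
    return mhc + w_cleavage * cleavage + w_tap * tap


def fraction_in_top(scores: Dict[str, float], epitopes: Set[str],
                    top_fraction: float = 0.05) -> float:
    """Fraction of known epitopes ranked within the top-scoring peptides.

    `scores` maps every candidate peptide from the pathogen to its
    prediction score; peptides not in `epitopes` are treated as negatives.
    """
    if not epitopes:
        raise ValueError("at least one known epitope is required")
    ranked = sorted(scores, key=lambda pep: scores[pep], reverse=True)
    # Always keep at least one peptide in the top set.
    n_top = max(1, int(top_fraction * len(ranked)))
    top = set(ranked[:n_top])
    return sum(1 for pep in epitopes if pep in top) / len(epitopes)
```

Because unverified peptides are only assumed to be negative, a value computed this way is best read as a lower bound on how well a method ranks true epitopes.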

3 SIMULATING THE IMMUNE SYSTEM

Improved understanding of the immune system, and its population-wide variation, is one of the major challenges in the next decade within biology and medicine. Many of the steps by which the immune system deals with infectious agents and disease can now successfully be modeled by computational techniques, and it is clear that theoretical approaches will be a major player in this area, adding a systems view to the massive experimental effort being carried out at the moment. In this review, we have summarized how a number of bioinformatics tools that use genomic sequences as input to predict epitopes have been developed over the past decade. At the same time, theoretical models have been developed that describe the dynamics of different immune-cell populations and their interactions with microbes (Borghans and de Boer, 2007; Carneiro et al., 2007; Davenport et al., 2007). These models have been used to interpret experimental findings where timing is of importance, such as the interval between administration of a vaccine and infection with the microbe that the vaccine is intended to protect against. Moreover, these dynamic models have allowed a quantitative picture of immune system kinetics and diversity during health and disease to be generated. The quantitative approach is necessary to understand the functioning of the immune system, which consists of many different cell types and molecules interacting in complicated regulatory pathways involving positive and negative feedback loops. Surprisingly little is known about the population dynamics, i.e. the production rates, division rates and distribution of life spans, of mouse or human lymphocyte populations. As a consequence, fundamental questions like the maintenance of memory, the maintenance of a diverse naive repertoire and the role of homeostatic mechanisms remain largely unresolved. 
Having so little insight in the normal lymphocyte population dynamics also hampers our understanding of immune responses during disease and immune reconstitution after therapeutic interventions such as chemotherapy, irradiation and/or bone marrow transplantation. Several areas in immunology call for a better interpretation of data by means of theoretical models. A simple PubMed search reveals that at least 10% of the recent papers in the immunological literature involve labeling experiments in which lymphocytes are labeled radioactively, with deuterium, or with dyes. However, the interpretation of such labeling data is controversial and is notoriously difficult (Boer et al., 2003a, b; Deenick et al., 2003; Gett and Hodgkin, 2000; Hellerstein, 1999; Mohri et al., 1998; Mohri et al., 2001; Revy et al., 2001; Ribeiro et al., 2002), which emphasizes the enormous demand to develop a quantitative mathematical approach to immunology. Similar examples of how difficult it is to properly interpret kinetic data come from the attempts to characterize the division history of cells from the length of the telomeres, or from the presence of autosomal DNA circles (TRECs) that are formed in the thymus (Boer and Noest, 1998; Douek et al., 1998; Dutilh and de Boer, 2003; Hazenberg et al., 2000; Hazenberg et al., 2003). Integrating the dynamic (using mathematical models and computer simulations) and bioinformatics approaches clearly could lead to a better understanding of the immune responses and their role during normal, disease and reconstitution states, where both timing and sequence specificity are highly significant. Diseases that are characterized by complex interactions between the host cellular immune system and evolving pathogens such as HIV infection, or diseases where molecular similarities between self and non-self are important such as in autoimmune diseases could be investigated in such integrated models. 
Complex generalized cellular automata have been proposed as models of the immune system (Kohler et al., 2000; Seiden and Celada, 1992). These methods have now developed to a stage where it is possible to successfully simulate the outcome of cancer vaccine protocols using a mouse simulation model (Castiglione and Piccoli, 2007; Lollini et al., 2006; Motta et al., 2005; Pappalardo et al., 2006). In a recent paper, Rapin et al. (2006) outline a framework for integration of these bioinformatics and simulation approaches by developing a simple model in which HIV dynamics are correlated with genomics data. This model is the first one in which the fitness of wild-type and mutated virus is assessed by means of a sequence-dependent scoring matrix that links protein sequences to growth rates of the virus. Further refinements of these approaches may involve increasing the spatial resolution by including different tissues and their geometry.
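The population-dynamic quantities mentioned above (production rates, division rates and life spans) can be related in the simplest possible setting by a one-compartment model, dN/dt = σ + (p − d)N, where σ is a constant source, p a per-cell division rate and d a per-cell death rate. The sketch below integrates this equation with an explicit Euler scheme. The model form, the parameter names and the numerical values used with it are assumptions for illustration only and do not correspond to any of the cited models.

```python
def simulate_population(source: float, division_rate: float, death_rate: float,
                        n0: float = 0.0, t_end: float = 200.0,
                        dt: float = 0.01) -> float:
    """Integrate dN/dt = source + (division_rate - death_rate) * N
    with explicit Euler steps and return N at time t_end."""
    n = n0
    steps = int(round(t_end / dt))
    for _ in range(steps):
        n += dt * (source + (division_rate - death_rate) * n)
    return n


def steady_state(source: float, division_rate: float, death_rate: float) -> float:
    """Analytical steady state N* = source / (death_rate - division_rate).

    A finite steady state exists only when death exceeds division.
    """
    if death_rate <= division_rate:
        raise ValueError("no finite steady state unless death_rate > division_rate")
    return source / (death_rate - division_rate)
```

In this toy model the steady state depends only on the source and on the difference d − p, so population size alone cannot separate division from death. This is one simple way to see why additional data, such as the labeling experiments discussed above, and careful modeling are needed to estimate these rates.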

158 in total

Review 1. Antibodies, viruses and vaccines.

Authors: Dennis R Burton
Journal: Nat Rev Immunol Date: 2002-09 Impact factor: 53.106

2. Prediction of MHC class I binding peptides using profile motifs.

Authors: Pedro A Reche; John-Paul Glutting; Ellis L Reinherz
Journal: Hum Immunol Date: 2002-09 Impact factor: 2.850

3. The mapping and reconstitution of a conformational discontinuous B-cell epitope of HIV-1.

Authors: David Enshell-Seijffers; Dmitri Denisov; Bella Groisman; Larisa Smelyanski; Ronit Meyuhas; Gideon Gross; Galina Denisova; Jonathan M Gershoni
Journal: J Mol Biol Date: 2003-11-14 Impact factor: 5.469

Review 4. When three is not a crowd: a Crossregulation model of the dynamics and repertoire selection of regulatory CD4+ T cells.

Authors: Jorge Carneiro; Kalet Leon; Iris Caramalho; Carline van den Dool; Rui Gardner; Vanessa Oliveira; Marie-Louise Bergman; Nuno Sepúlveda; Tiago Paixão; Jose Faro; Jocelyne Demengeot
Journal: Immunol Rev Date: 2007-04 Impact factor: 12.988

5. Learning MHC I--peptide binding.

Authors: Nebojsa Jojic; Manuel Reyes-Gomez; David Heckerman; Carl Kadie; Ora Schueler-Furman
Journal: Bioinformatics Date: 2006-07-15 Impact factor: 6.937

6. ADEPT: a computer program for prediction of protein antigenic determinants.

Authors: A Z Maksyutov; E S Zagrebelnaya
Journal: Comput Appl Biosci Date: 1993-06

7. Expression and deletion analysis of the Trypanosoma brucei rhodesiense cysteine protease in Escherichia coli.

Authors: E G Pamer; C E Davis; M So
Journal: Infect Immun Date: 1991-03 Impact factor: 3.441

8. IMGT, the international ImMunoGeneTics information system: a standardized approach for immunogenetics and immunoinformatics.

Authors: Marie-Paule Lefranc
Journal: Immunome Res Date: 2005-09-20

9. PRED(TAP): a system for prediction of peptide binding to the human transporter associated with antigen processing.

Authors: Guang Lan Zhang; Nikolai Petrovsky; Chee Keong Kwoh; J Thomas August; Vladimir Brusic
Journal: Immunome Res Date: 2006-05-23

10. Mapping a neutralizing epitope on the SARS coronavirus spike protein: computational prediction based on affinity-selected peptides.

Authors: Natalia Tarnovitski; Leslie J Matthews; Jianhua Sui; Jonathan M Gershoni; Wayne A Marasco
Journal: J Mol Biol Date: 2006-03-22 Impact factor: 5.469
