| Literature DB >> 19112482 |
Nora C Toussaint1, Pierre Dönnes, Oliver Kohlbacher.
Abstract
Epitope-based vaccines (EVs) have a wide range of applications: from therapeutic to prophylactic approaches, from infectious diseases to cancer. The development of an EV is based on the knowledge of target-specific antigens from which immunogenic peptides, so-called epitopes, are derived. Such epitopes form the key components of the EV. Due to regulatory, economic, and practical concerns the number of epitopes that can be included in an EV is limited. Furthermore, as the major histocompatibility complex (MHC) binding these epitopes is highly polymorphic, every patient possesses a set of MHC class I and class II molecules of differing specificities. A peptide combination effective for one person can thus be completely ineffective for another. This renders the optimal selection of these epitopes an important and interesting optimization problem. In this work we present a mathematical framework based on integer linear programming (ILP) that allows the formulation of various flavors of the vaccine design problem and the efficient identification of optimal sets of epitopes. Out of a user-defined set of predicted or experimentally determined epitopes, the framework selects the set with the maximum likelihood of eliciting a broad and potent immune response. Our ILP approach allows an elegant and flexible formulation of numerous variants of the EV design problem. In order to demonstrate this, we show how common immunological requirements for a good EV (e.g., coverage of epitopes from each antigen, coverage of all MHC alleles in a set, or avoidance of epitopes with high mutation rates) can be translated into constraints or modifications of the objective function within the ILP framework. An implementation of the algorithm outperforms a simple greedy strategy as well as a previously suggested evolutionary algorithm and has runtimes on the order of seconds for typical problem sizes.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19112482 PMCID: PMC2588662 DOI: 10.1371/journal.pcbi.1000246
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Basic idea behind this work.
Starting from target antigens, a list of properties of interest, and a target population the information necessary to determine an optimal set of epitopes is derived (gray boxes). Given this information, a mathematical framework can conveniently be used to find the set of epitopes that is optimal with respect to the target population and the properties of interest.
ILP corresponding to the basic definition of an optimal epitope set.
|
| |
|
| Set of observed MHC alleles |
|
| Set of candidate epitopes |
|
| |
|
| Immunogenicity of epitope |
|
| Number of epitopes to select |
|
| Probability of MHC allele |
|
| |
| xe = 1 | If epitope |
|
| |
|
| Maximize … |
| ∑ | … Overall immunogenicity |
|
| |
| ∑ | … And select exactly |
ILP corresponding to the extended definition of an optimal epitope set.
|
| ||
|
| Set of observed MHC alleles | |
|
| Set of epitopes from the
| |
|
| Set of all candidate epitopes
( | |
|
| Set of epitopes which, when bound to an
MHC allele | |
|
| Set of all sufficiently immunogenic
epitopes ( | |
|
| Set of overlapping pairs of epitopes | |
|
| ||
|
| Conservation of epitope
| |
|
| Immunogenicity of epitope
| |
|
| Number of epitopes to select | |
|
| Probability of MHC allele
| |
|
| Probability that epitope
| |
|
| Minimum number of epitopes from each antigen to be included | |
|
| Antigen processing threshold | |
|
| Conservation threshold | |
|
| Minimum number of MHC alleles to be covered | |
|
| ||
| xe = 1 | If epitope | |
| ya = 1 | If allele | |
|
| ||
|
| Maximize … | |
| ∑ | … Overall immunogenicity. | |
|
| ||
| ∑ | Selects exactly | |
| ∀ | (1− | Ensures certain degree of epitope conservation. |
| ∀( | xp+xr≤1 | Guarantees that selected epitopes do not overlap. |
| ∀ |
| Guarantees that
ya = 1 only if
allele |
| ∑ | Enforces required allele coverage. | |
| ∀ |
| Enforces required antigen coverage. |
| ∀ |
| Selects only epitopes which have a chance of at
least |
ILP corresponding to the combined optimization problem.
|
| ||
|
| Set of observed MHC alleles | |
|
| Set of epitopes from the
| |
|
| Set of all candidate epitopes
( | |
|
| Set of epitopes which, when bound
to an MHC allele | |
|
| Set of all sufficiently immunogenic
epitopes ( | |
|
| ||
|
| Immunogenicity of epitope
| |
|
| Probability of MHC allele
| |
|
| ||
| wa,i = 1 | If allele | |
| xe = 1 | If epitope | |
| ya = 1 | If allele | |
| zi = 1 | If an epitope from the
| |
|
| ||
|
| Maximize … | |
| 0.1·∑ | … Overall immunogenicity and … | |
|
| … Extend coverage of antigens, MHC alleles, … | |
|
| … And MHC/antigen combinations. | |
|
| ||
| All constraints from the extended
ILP ( | ||
| ∀ |
| Ensures that
zi = 1 only if
the |
| ∀ |
| Ensures that
wa,i = 1 only if
allele |
Figure 2Comparison of different epitope selection strategies with respect to overall immunogenicity.
(A) Overall immunogenicity of different-sized epitope sets. (B) Overall immunogenicity of a set of 10 epitopes. (C) Number of epitopes required to achieve an overall immunogenicity of at least 2,699.
Overview over properties of HCV epitope sets selected using different strategies.
|
|
|
| |
| Overall immunogenicity |
| 125 | 2,177 |
| Allele coverage |
| 96.3% |
|
| Antigen coverage |
| 87.5% |
|
| MHC/antigen coverage | 22.7% | 19.2% |
|
| Population coverage |
| 95.6% |
|
| Avg. number of epitopes per individual | 13.3 | 11.4 |
|
| Number of epitopes in IEDB |
| 1 | 1 |
Number of epitopes per set: 24. E: set selected by our ILP, E: set selected by Vider-Shalit et al. without peptide AALENLVTL, E: set selected by our ILP extended by aspects of the scoring function of Vider-Shalit et al.
Figure 3Comparison of properties of HCV epitope sets selected using different strategies.
(A) Overall immunogenicity. (B) Coverage of MHC/antigen pairs.
| SFSIFLLAL* | GHRMAWDMM+ | VYEADDVIL |
| YLYDHLAPM | GLRDLAVAV+ | GPTPLLYRL+ |
| QYLAGLSTL+ | NFVSGIQYL | VLSDFKTWL* |
| GLYLFNWAV | ALYDVVSTL* | RRCRASGVL+ |
| CFTPSPVVV+ | FLLLADARV* | GPADGMVSK+ |
| TWVLVGGVL+ | IELGGKPAL+ | LAGGVLAAV |
| ARPDYNPPL+ | KLLPRLPGV | RHTPVNSWL+ |
| WPLLLLLLA | VTYSLTGLW | YFVIFFVAA |