| Literature DB >> 15980443 |
Pedro A Reche1, Ellis L Reinherz.
Abstract
Prediction of peptide binding to major histocompatibility complex (MHC) molecules is a basis for anticipating T-cell epitopes, as well as epitope discovery-driven vaccine development. In the human, MHC molecules are known as human leukocyte antigens (HLAs) and are extremely polymorphic. HLA polymorphism is the basis of differential peptide binding, until now limiting the practical use of current epitope-prediction tools for vaccine development. Here, we describe a web server, PEPVAC (Promiscuous EPitope-based VACcine), optimized for the formulation of multi-epitope vaccines with broad population coverage. This optimization is accomplished through the prediction of peptides that bind to several HLA molecules with similar peptide-binding specificity (supertypes). Specifically, we offer the possibility of identifying promiscuous peptide binders to five distinct HLA class I supertypes (A2, A3, B7, A24 and B15). We estimated the phenotypic population frequency of these supertypes to be 95%, regardless of ethnicity. Targeting these supertypes for promiscuous peptide-binding predictions results in a limited number of potential epitopes without compromising the population coverage required for practical vaccine design considerations. PEPVAC can also identify conserved MHC ligands, as well as those with a C-terminus resulting from proteasomal cleavage. The combination of these features with the prediction of promiscuous HLA class I ligands further limits the number of potential epitopes. The PEPVAC server is hosted by the Dana-Farber Cancer Institute at the site http://immunax.dfci.harvard.edu/PEPVAC/.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15980443 PMCID: PMC1160118 DOI: 10.1093/nar/gki357
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Strategy to define HLA I supertypes. HLA I supertypes are identified by clustering their peptide-binding repertoire (8). The method consists of four basic steps. (i) Predict the peptide-binding repertoire (i,j sets in figure) of each HLA I molecule from the same random protein using the relevant PSSMs in combination with the RANKPEP scoring algorithm (13). (ii) Compute the number of common peptides between the binding repertoire of any two HLA I molecules. (iii) Build a distance matrix whose coefficients are inversely proportional to the peptide-binding overlap between any pair of HLA I molecules. (iv) Use a phylogenic clustering algorithm to compute and visualize HLA I supertypes (clusters of HLA I molecules with overlapping peptide-binding repertoires).
Figure 2HLA I peptide-binding overlap and supertypes. The Figure shows an unroot dendrogram built after clustering the overlap between the peptide-binding repertoire of the indicated HLA I molecules. Peptide-binding repertoires of HLA I molecules were obtained from a random protein (1000 amino acids) using the relevant PSSMs at a 2% peptide-binding threshold. This dendrogram reflects the relationship between the peptide-binding specificities of HLA I molecules. HLA I alleles with similar peptide-binding specificities branch together in groups or clusters. The closer HLA I alleles branch, the larger is the overlap between their peptide-binding repertoires. Supertypes (shadowed with different colors) consist of groups HLA I alleles with at least a 20% peptide-binding overlap (pairwise between any pair of alleles).
Cumulative phenotype frequency of defined supertypes
| Supertype | Alleles | Blacks (%) | Caucasians (%) | Hispanics (%) | North American natives (%) | Asians (%) |
|---|---|---|---|---|---|---|
| A2 | A*0201-7, A*6802 | 43.7 | 49.9 | 51.8 | 52.4 | 44.7 |
| A3 | A*0301, A*1101, A*3101, A*3301, A*6801, A*6601 | 35.4 | 46.9 | 41.5 | 40.7 | 47.9 |
| B7 | B*0702, B*3501, B*5101-02, B*5301, B*5401 | 45.9 | 42.2 | 40.5 | 52.0 | 31.3 |
| B15 | A*0101, B*1501_B62, B1502 | 13.06 | 37.80 | 16.75 | 27.26 | 21.04 |
| A24 | A*2402, B*3801 | 15.5 | 17.28 | 25.85 | 41.94 | 35.0 |
| B44 | B*4402, B*4403 | 10.4 | 27.7 | 17.15 | 14.4 | 10.1 |
| B57 | B5701-02, B5801, B*1503 | 19.2 | 10.3 | 5.9 | 5.8 | 16.5 |
| ABX | A*2902, B*4002 | 7.4 | 11.3 | 19.1 | 16.3 | 16.3 |
| B27 | B*2701-06, B*2709, B*3909 | 2.3 | 4.8 | 5.1 | 16.9 | 4.7 |
| BX | B*1509, B*1510, B*39011 | 3.1 | 0.7 | 4.2 | 7.8 | 4.1 |
Cumulative phenotype frequency was obtained using the HLA I gene and haplotype frequencies published by Cao et al. (18) corresponding to the indicated five American ethnic groups. Method for computing the cumulative phenotype frequency considered the disequilibrium linkage between the HLA-A and -B gene and was based on that reported by Dawson et al. (21).
Figure 3The PEPVAC web server. (A) PEPVAC input page. The page is divided into several sections. E-MAIL, for obtaining the results via e-mail (optional). GENOMES, where a selection of genomes from pathogenic organisms is available, as well as the possibility of uploading a user-provided genome. SUPERTYPES, the supertypes A2, A3, B7, A24, and B15 are available for selection. Alleles targeted for peptide-biding predictions in each supertype are indicated. The minimum population coverage of the selected supertypes is calculated on the fly and shown on the relevant window. PROTEASOMAL CLEAVAGE, prediction of proteasomal cleavage using three optimal language models is carried out in parallel to the peptide-biding predictions. (B) PEPVAC result page. An example result page where the A3 supertype was selected for peptide-binding predictions from the genome of Influenza A virus (A/PR/8/34) is shown. The result page first displays a summary of the predictions, followed by the predicted peptide binders to each of the selected supertypes (only A3 in the shown example). Peptides highlighted in violet contain a C-terminal residue that is predicted to be the result of proteasomal cleavage. If the proteasomal cleavage filter is checked ON in the input page, only violet peptides will be shown.