| Literature DB >> 18225944 |
Will Fischer, H X Liao, Barton F Haynes, Norman L Letvin, Bette Korber.
Abstract
Entities:
Mesh:
Substances:
Year: 2008 PMID: 18225944 PMCID: PMC2217576 DOI: 10.1371/journal.pcbi.0040015
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Number of Proteins or Protein Fragments Included in the Antigen Design, and the Number of Rare or Unique 9-mers in Various Protein Sets
Figure 1Relative Locations and Sequence Identities of Potential Vaccine Antigens
The COT+ candidate vaccine sets represent one full-length COT protein plus numerous discrete sequence fragments located in various positions relative to the intact Gag protein; mosaics are full-length “native-like” proteins. For the COT+ sequence sets, the sequence on the top line is the COT sequence; additional peptide fragments are numbered by addition order, and plotted by location from N-terminus to C-terminus. Gag sequences are presented in two parts. For the mosaic sets, the sequence in the top line was arbitrarily chosen from the set, as the mosaics are designed as a combination of strains. For all sequence sets, amino acid residues identical to those in the top line sequence are shown with a black background; differences from the first sequence are shown in color, with different colors representing different amino acid classes; white space is used to represent gaps. The COT+ antigen sequences [1] were generously provided by J. I. Mullins.
Figure 2Comparisons of 9-mer Coverage for the COT+ and Mosaic Antigen Designs
To allow direct methodological comparison, mosaic antigen sets were generated as in Fischer et al. [2], but based on the test set of 169 sequences used in Nickle et al. [1] for Nef (A) and Gag (B,D). Potential epitope coverage provided by the combination of proteins is indicated by the percentage of perfectly matched 9-mers in the protein alignment (red), the addition of those that match in 8/9 amino acids (orange), and the further addition of those that match in 7/9 amino acids (yellow). Three full-length mosaic proteins provide slightly better coverage than three gene lengths of COT+ protein fragments. For this class of methods, coverage may generally be improved by increasing the number of antigens, but with diminishing returns and at the cost of increased vaccine complexity and expense; the best strategy must realistically balance these factors. Previously published mosaic sequences (based on 551 Los Alamos sequences spanning the entire M group [2]) were scored against the 169-sequence Gag set (C), and the M group (E). To directly compare the coverage of each antigen set to subtypes other than B, the COT+ antigen set and the M-group mosaics were also scored against an M-group dataset from which all B-clade sequences had been removed (F).
The M-group mosaic coverage of non-B clade proteins is 4%–5% less than the M-group coverage of B clade (C,F). Predictably, interclade coverage of the B-clade–optimized antigens drops dramatically (81% → 51% for COT+, F; B-clade mosaics slightly less reduced ([2], unpublished data). For the M-group global mosaics, in contrast, coverage is roughly equal for all clades; coverage of B-clade sequences by M-group mosaics is only 6%–7% below that of specific B-clade–optimized mosaics (B,C), and coverage of non-B sequences remains relatively high (F).
All computer code for creating mosaics and assessing coverage, our previously designed mosaic antigens, and the datasets used were made publicly accessible upon the original publication [2].