| Literature DB >> 20671958 |
Yongqun He1, Zuoshuang Xiang, Harry L T Mobley.
Abstract
Vaxign is the first web-based vaccine design system that predicts vaccine targets based on genome sequences using the strategy of reverse vaccinology. Predicted features in the Vaxign pipeline include protein subcellular location, transmembrane helices, adhesin probability, conservation to human and/or mouse proteins, sequence exclusion from genome(s) of nonpathogenic strain(s), and epitope binding to MHC class I and class II. The precomputed Vaxign database contains prediction of vaccine targets for >70 genomes. Vaxign also performs dynamic vaccine target prediction based on input sequences. To demonstrate the utility of this program, the vaccine candidates against uropathogenic Escherichia coli (UPEC) were predicted using Vaxign and compared with various experimental studies. Our results indicate that Vaxign is an accurate and efficient vaccine design program.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20671958 PMCID: PMC2910479 DOI: 10.1155/2010/297505
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Figure 1The Vaxign algorithm pipeline.
Pathogens currently analyzed by Vaxign.
| species | # of Genomes | # of proteins |
|---|---|---|
| Bacterial species: | ||
| (1) | 3 | 16182 |
| (2) | 9 | 28559 |
| (3) | 10 | 17443 |
| (4) | 10 | 35130 |
| (5) | 1 | 2272 |
| (6) | 5 | 9686 |
| (7) | 7 | 34592 |
| (8) | 9 | 14771 |
| (9) | 4 | 6735 |
| (10) | 6 | 9261 |
| (11) | 2 | 8178 |
| (12) | 4 | 7909 |
| Virus strains: | ||
| (13) HIV | 2 | 33 |
| (14) Measles virus | 1 | 7 |
| (15) Vaccinia virus | 1 | 223 |
| (16) Variola virus | 1 | 197 |
| (17) Yellow fever virus | 1 | 14 |
| Total # | 76 | 191192 |
Epitope prediction performance by Vaxitope as measured by AUC values.
| # | MHC allele | Length | AUC | Sensitivity ( | Specificity ( | Sensitivity ( | Specificity ( | Sensitivity ( | Specificity ( |
|---|---|---|---|---|---|---|---|---|---|
| 1 | HLA-A*0101 | 9 | 0.929 | .854 | .874 | .99 | .709 | 1 | .621 |
| 2 | HLA-A*0201 | 9 | 0.871 | .298 | .956 | .531 | .876 | .792 | .783 |
| 3 | HLA-A*0201 | 10 | 0.913 | .471 | .957 | .789 | .874 | .901 | .773 |
| 4 | HLA-A*0202 | 9 | 0.869 | .309 | .953 | .658 | .875 | .792 | .774 |
| 5 | HLA-A*0202 | 10 | 0.863 | .333 | .949 | .737 | .808 | .891 | .684 |
| 6 | HLA-A*0203 | 9 | 0.874 | .304 | .956 | .659 | .865 | .828 | .769 |
| 7 | HLA-A*0203 | 10 | 0.867 | .317 | .963 | .691 | .827 | .834 | .712 |
| 8 | HLA-A*0206 | 9 | 0.900 | .387 | .961 | .73 | .867 | .881 | .781 |
| 9 | HLA-A*0206 | 10 | 0.916 | .403 | .957 | .775 | .878 | .922 | .782 |
| 10 | HLA-A*0301 | 9 | 0.887 | .445 | .921 | .855 | .803 | .959 | .69 |
| 11 | HLA-A*0301 | 10 | 0.868 | .505 | .9 | .915 | .706 | .988 | .554 |
| 12 | HLA-A*1101 | 9 | 0.863 | .337 | .946 | .672 | .833 | .859 | .71 |
| 13 | HLA-A*1101 | 10 | 0.879 | .461 | .924 | .9 | .742 | .989 | .627 |
| 14 | HLA-A*2402 | 9 | 0.984 | .727 | .985 | .97 | .879 | 1 | .783 |
| 15 | HLA-A*3101 | 9 | 0.912 | .426 | .952 | .813 | .872 | .927 | .752 |
| 16 | HLA-A*3101 | 10 | 0.855 | .419 | .905 | .889 | .711 | .99 | .563 |
| 17 | HLA-A*3301 | 9 | 0.937 | .495 | .959 | .94 | .851 | .989 | .723 |
| 18 | HLA-A*3301 | 10 | 0.905 | .51 | .942 | .905 | .755 | .966 | .619 |
| 19 | HLA-A*6801 | 9 | 0.908 | .406 | .949 | .841 | .841 | .946 | .744 |
| 20 | HLA-A*6801 | 10 | 0.848 | .418 | .9 | .866 | .701 | .973 | .564 |
| 21 | HLA-A*6802 | 9 | 0.918 | .446 | .96 | .801 | .868 | .922 | .757 |
| 22 | HLA-A*6802 | 10 | 0.913 | .452 | .947 | .837 | .825 | .977 | .715 |
| 23 | HLA-A*6901 | 9 | 0.803 | .279 | .895 | .674 | .785 | .837 | .674 |
| 24 | HLA-B*0702 | 9 | 0.963 | .659 | .966 | .962 | .894 | .981 | .849 |
| 25 | HLA-B*1501 | 9 | 0.873 | .514 | .927 | .816 | .765 | .939 | .603 |
| 26 | HLA-B*3501 | 9 | 0.838 | .403 | .927 | .701 | .775 | .834 | .666 |
| 27 | HLA-B*5101 | 9 | 0.978 | .835 | .953 | 1 | .871 | 1 | .824 |
| 28 | HLA-B*5301 | 9 | 0.989 | .84 | .981 | .991 | .896 | 1 | .825 |
| 29 | HLA-B*5801 | 9 | 0.923 | .769 | .933 | .894 | .813 | .952 | .702 |
| 30 | H-2-Kb | 8 | 0.936 | .753 | .922 | .935 | .744 | .987 | .623 |
| 31 | H-2-IAd | —* | 0.928 | .582 | .992 | .705 | .959 | .82 | .926 |
| 32 | H-2-IEd | — | 0.977 | .903 | 1 | .935 | .935 | .935 | .887 |
| 33 | H-2-IEg7 | — | 0.998 | .993 | .989 | .993 | .945 | 1 | .893 |
| 34 | H-2-IEk | — | 0.940 | .775 | .95 | .875 | .9 | .9 | .813 |
| 35 | HLA-DPB1*0401 | — | 0.950 | .717 | .978 | .826 | .924 | .913 | .87 |
| 36 | HLA-DPB1*0901 | — | 0.978 | .739 | .989 | .891 | .913 | .913 | .859 |
| 37 | HLA-DR1 | — | 0.923 | .587 | .972 | .781 | .915 | .838 | .834 |
| 38 | HLA-DR7 | — | 0.990 | .976 | .988 | .976 | .905 | .976 | .845 |
| 39 | HLA-DRB1*1101 | — | 0.952 | .732 | .978 | .828 | .914 | .898 | .831 |
| 40 | HLA-DRB1*1501 | — | 0.951 | .846 | .981 | .897 | .942 | .91 | .872 |
Note: * means flexible length.
Figure 2ROC curve analysis of epitopes binding HLA A*0201.
Figure 3Vaxign prediction of UPEC vaccine targets. (a) Selection of E. coli CFT073 as a seed genome. Different filter options are applied: only outer membrane proteins, maximum of one transmembrane helix, minimum adhesin probability of.51, having orthologs in other three UPEC genomes but no orthologs in K-12 strain M1655, and no similarity to any human or mouse proteins. (b) In total 22 proteins were predicted to meet selected criteria. (c) Detailed results for a particular protein, for example, NmpC. MHC Class I and II epitope prediction results are also available and can be further filtered. (d) Prediction of α-helices and their possible cellular locations of NmpC. (e) Prediction of β-barrels and their possible cellular locations of NmpC.
Figure 4Prediction of UPEC vaccine targets conserved in four sequenced UPEC genomes using Vaxign. Note: * Co-ed represents the conserved proteins.
Selected function annotations significantly enriched for UPEC vaccine candidates based on DAVID analysis.
| Category | Term | # of Genes | % | Benjamini | |
|---|---|---|---|---|---|
| GO MF* | Transport activity | 22 | 45.8 | 1.8 | 2.7 |
| Interpro | TonB-dependent receptor, beta-barrel | 10 | 20.8 | 1.0 | 4.3 |
| Interpro | Porin, Gram-negative type | 8 | 16.7 | 1.3 | 1.8 |
| GO MF | Iron ion transmembrane transporter activity | 5 | 10.4 | 6.9 | 1.4 |
| Interpro | Fimbrial biogenesis outer membrane | 5 | 10.4 | 9.3 | 4.1 |
*: GO MF, the Molecular Function (MF) branch of the Gene Ontology (GO).
Conserved UPEC outer membrane proteins predicted by Vaxign.
| # | Protein RefSeq | Symbol | Adhesin | TMH | Not in K-12 | Micro-array | Prote-omics | Protein Name |
|---|---|---|---|---|---|---|---|---|
| Iron ion binding and iron/siderophore transporter activity | ||||||||
| 1 | NP_752820.1 | ybiL | 0.857 | 1 | − | − | Catecholate siderophore receptor fiu precursor (TonB-dependent receptor fiu) (Ferric iron uptake protein) | |
| 2 | NP_754328.1 | c2436 | 0.473 | 0 | − | − | putative pesticin receptor precursor (tonB-dependent receptor) | |
| 3 | NP_754406.1 | c2518 | 0.83 | 0 | X | − | − | TonB dependent receptor |
| 4 | NP_752238 | c0294 | 0.83 | 0 | X | − | − | TonB dependent receptor |
| 5 | NP_753164.1 | iroN | 0.672 | 0 | + | + | Siderophore receptor iron | |
| 6 | NP_752600.1 | fepA | 0.792 | 0 | + | − | ferrienterobactin receptor (TonB-dependent receptor) | |
| 7 | NP_752135.1 | fhuA | 0.746 | 0 | + | + | ferrichrome outer membrane transporter | |
| 8 | NP_754374.1 | Hma (c2482) | 0.772 | 0 | X | + | + | outer membrane heme acquisition protein |
| 9 | NP_756170 | chuA | 0.846 | 0 | X | + | + | Outer membrane heme/hemoglobin receptor |
| 10 | NP_753551.1 | prrA | 0.589 | 0 | X | − | − | Putative TonB-dependent outer membrane receptor |
| 11 | NP_753179 | c1265 | 0.777 | 0 | X | − | − | Outer membrane heme/ hemoglobin receptor |
| 12 | NP_753125 | c1206 | 0.794 | 0 | X | − | − | Outer membrane heme/haemoglobin receptor |
| 13 | NP_755646 | c3775 | 0.79 | 0 | X | − | − | putative iron compound receptor |
| 14 | NP_753820.1 | yddB | 0.765 | 0 | − | − | hypothetical protein c1924 (tonB-dependent receptor family) | |
| Fimbrial biogenesis outer membrane usher protein | ||||||||
| 15 | NP_757244.1 | fimD | 0.744 | 1 | + | − | Outer membrane usher protein fimD precursor | |
| 16 | NP_757034 | papC_2 | 0.674 | 1 | X | − | − | PapC protein |
| 17 | NP_755465 | papC | 0.666 | 0 | X | − | − | PapC protein |
| 18 | NP_754524.1 | yehB | 0 | 0 | − | − | Outer membrane usher protein yehB precursor | |
| 19 | NP_752120.1 | htrE | 0.643 | 0 | − | − | Putative outer membrane usher protein | |
| 20 | NP_753830.1 | c1934 | 0.856 | 1 | − | − | Outer membrane usher protein fimD precursor | |
| 21 | NP_754765.1 | yfcU | 0.563 | 0 | X | − | − | Fimbrial export usher family protein |
| 22 | NP_753156.1 | focD | 0.854 | 0 | − | − | F1C fimbrial usher | |
| 23 | NP_756076.1 | ycbS | 0.559 | 1 | − | − | Outer membrane usher protein ycbS precursor | |
| 24 | NP_753159 | focH | 0.917 | 0 | X | − | − | F1C putative fimbrial adhesin precursor |
| Porin and ion transport | ||||||||
| 25 | NP_753469.1 | nmpC | 0.788 | 1 | X | − | + | Outer membrane porin protein nmpC precursor |
| 26 | NP_754644.1 | ompC | 0.688 | 0 | − | + | Outer membrane porin protein C | |
| 27 | NP_756858.1 | lamB | 0.806 | 0 | − | + | maltoporin | |
| 28 | NP_752996.1 | ompF | 0.614 | 0 | − | + | Outer membrane protein F (Porin family) | |
| 29 | NP_754240.1 | c2348 | 0.759 | 0 | X | − | − | Outer membrane porin protein nmpC precursor |
| 30 | NP_752325.1 | phoE | 0.729 | 0 | − | − | Outer membrane phosphoporin protein E | |
| 31 | NP_753724.1 | ompN | 0.751 | 0 | − | − | Outer membrane protein N precursor | |
| 32 | NP_754275.1 | c2383 | 0.597 | 0 | X | − | − | Outer membrane protein N precursor |
| 33 | NP_754771.1 | fadL | 0.871 | 0 | − | + | Long-chain fatty acid outer membrane transporter | |
| 34 | NP_756025.1 | hofQ | 0.186 | 0 | + | − | Outer membrane porin HofQ | |
| Other transport proteins | ||||||||
| 35 | NP_756748.1 | c4894 | 0.81 | 0 | X | − | − | Nucleoside-specific channel-forming protein tsx precursor |
| 36 | NP_756500.1 | c4642 | 0.694 | 0 | − | − | Putative outer membrane Protein yieC | |
| 37 | NP_753669.1 | c1765 | 0.437 | 0 | − | − | Partial putative outer membrane channel protein | |
| 38 | NP_752455.1 | tsx | 0.833 | 0 | − | + | Nucleoside-specific channel-forming protein tsx precursor | |
| Lipoproteins | ||||||||
| 39 | NP_756849.1 | yjbH | 0.651 | 0 | − | − | Lipoprotein yjbH precursor | |
| 40 | NP_755008.1 | yfiB | 0.564 | 0 | − | − | Putative outer membrane lipoprotein | |
| 41 | NP_756936.1 | yjcP | 0.185 | 0 | − | − | Putative outer membrane efflux protein MdtP | |
| 42 | NP_752589.1 | cusC | 0.516 | 0 | − | − | Copper/silver efflux system outer membrane protein CusC | |
| 43 | NP_754925.1 | yfhM | 0.224 | 0 | − | − | Lipoprotein yfhM precursor | |
| 44 | NP_756232.1 | yiaD | 0.526 | 3 | + | − | Putative outer membrane lipoprotein | |
| Other outer membrane proteins | ||||||||
| 45 | NP_752286.1 | c0345 | 0.987 | 1 | X | − | − | ShlA/HecA/FhaA exofamily protein |
| 46 | NP_752352 | eaeH | 0.904 | 1 | X | − | − | Putative adhesin |
| 47 | NP_753126 | c1207 | 0.702 | 0 | X | − | − | Hypothetical protein |
| 48 | NP_753493 | c1585 | 0.526 | 0 | X | − | − | Putative tail component of prophage |
| 49 | NP_754912 | c3030 | 0.71 | 1 | X | − | − | SinI-like protein |
| 50 | NP_756286 | c4424 | 0.99 | 0 | X | − | − | Putative adhesin |
| 51 | NP_752116.1 | yadC | 0.81 | 1 | − | − | Putative fimbrial-like adhesin protein | |
| 52 | NP_752162.1 | yaeT | 0.637 | 0 | + | + | Outer membrane protein assembly factor YaeT | |
| 53 | NP_752163.1 | hlpA | 0.587 | 0 | − | − | Periplasmic chaperone | |
| 54 | NP_752339.1 | yagX | 0.571 | 0 | − | − | Hypothetical protein c0402 | |
| 55 | NP_752642.1 | crcA | 0.739 | 1 | − | − | Palmitoyl transferase for Lipid A | |
| 56 | NP_752830.1 | ompX | 0.818 | 1 | + | + | Outer membrane protein X | |
| 57 | NP_753262.1 | flgK | 0.84 | 0 | − | − | Flagellar hook-associated protein FlgK | |
| 58 | NP_753627.1 | ompW | 0.848 | 0 | − | + | Outer membrane protein W | |
| 59 | NP_753695.1 | ompG | 0.618 | 0 | − | − | Outer membrane protein G precursor | |
| 60 | NP_754014.1 | ydiY | 0.708 | 0 | − | − | Hypothetical protein c2120 | |
| 61 | NP_754081.1 | yeaF | 0.852 | 0 | + | + | MltA-interacting protein precursor | |
| 62 | NP_754523.1 | yehA | 0.847 | 0 | − | − | Hypothetical protein c2635 | |
| 63 | NP_754661.1 | yfaL | 0.949 | 0 | − | − | Adhesin | |
| 64 | NP_755530.1 | c3655 | 0.962 | 0 | − | + | Antigen 43 precursor | |
| 65 | NP_756601.1 | pldA | 0.756 | 0 | − | − | Phospholipase A | |
| 66 | NP_754578.1 | cirA | 0.51 | 0 | − | − | Colicin I receptor | |
| 67 | NP_755652.2 | tolC | 0 | 0 | − | + | Outer membrane channel protein | |
| 68 | NP_752583.1 | ompT | 0 | 0 | + | + | Outer membrane protease | |
| 69 | NP_756783.1 | btuB | 0 | 0 | − | + | Vitamin B12/cobalamin outer membrane transporter | |
| 70 | NP_754246.1 | fliF | 0.213 | 2 | − | − | Flagellar MS-ring protein | |
| 71 | NP_755960.1 | yheF | 0.239 | 0 | − | − | General secretion pathway protein D precursor | |
| 72 | NP_752585.1 | nfrA | 0.298 | 0 | − | − | Bacteriophage N4 receptor, outer membrane subunit | |
| 73 | NP_753902.1 | uidC | 0.304 | 1 | − | − | Putative outer membrane porin protein | |
| 74 | NP_754656.1 | c2770 | 0.348 | 0 | − | − | Hypothetical protein c2770 | |
| 75 | NP_757165.1 | ytfM | 0.4 | 0 | − | − | Hypothetical protein c5318 | |
| 76 | NP_753730.1 | ydbH | 0.401 | 1 | − | − | Hypothetical protein c1828 | |
| 77 | NP_756204.1 | yhjL | 0.408 | 0 | − | − | Cellulose synthase subunit BcsC | |
| 78 | NP_751977.1 | c0021 | 0.418 | 1 | − | − | Hypothetical protein c0021 | |
| 79 | NP_757166.1 | ytfN | 0.426 | 1 | − | − | Hypothetical protein c5319 | |
| 80 | NP_754559.2 | yohG | 0.427 | 0 | + | − | Multidrug resistance outer membrane protein MdtQ | |
| 81 | NP_756598.1 | c4739 | 0.43 | 0 | − | − | Hypothetical protein c4739 | |
| 82 | NP_753585.1 | ychP | 0.435 | 0 | − | − | Hypothetical protein c1680 | |
| 83 | NP_754913.1 | c3031 | 0.448 | 0 | − | − | SinH-like protein | |
| 84 | NP_752491.1 | ybaU | 0.464 | 1 | − | − | Peptidyl-prolyl cis-trans isomerase (rotamase D) | |
| 85 | NP_755264.1 | c3389 | 0.282 | 3 | X | − | − | Hypothetical protein |
Note: The protein RefSeq numbers are from UPEC strain CFT073. All 85 proteins are conserved across total four UPEC genomes. This table also shows the adhesin probability, number of transmembrane helixes, and absence in nonpathogenic K-12 MG1655. TMH, transmembrane alpha helix prediction. Microarray, transcriptomic mRNA results (+ for up-regulation in vivo). Proteomics, protein expression results (+ for up-regulation in urine or in vivo).