Matthew Ardito1, Joanna Fueyo2, Ryan Tassone1, Frances Terry1, Kristen DaSilva3, Songhua Zhang4, William Martin1, Anne S De Groot5, Steven F Moss4, Leonard Moise6. 1. EpiVax, Inc., Providence, Rhode Island 02903, USA. 2. Department of Biomedical and Pharmaceutical Sciences, INBRE Program, College of Pharmacy, University of Rhode Island, Kingston, Rhode Island 02871, USA. 3. Institute for Immunology and Informatics, University of Rhode Island, Providence, Rhode Island 02903, USA. 4. Department of Gastroenterology, Brown University Warren Alpert Medical School, Providence, Rhode Island 02903, USA. 5. EpiVax, Inc., Providence, Rhode Island 02903, USA ; Institute for Immunology and Informatics, University of Rhode Island, Providence, Rhode Island 02903, USA ; Department of Pediatric Infectious Disease, Brown University Warren Alpert Medical School, Providence, Rhode Island 02912, USA. 6. EpiVax, Inc., Providence, Rhode Island 02903, USA ; Institute for Immunology and Informatics, University of Rhode Island, Providence, Rhode Island 02903, USA.
Abstract
BACKGROUND: One useful application of pattern matching algorithms is identification of major histocompatability complex (MHC) ligands and T-cell epitopes. Peptides that bind to MHC molecules and interact with T cell receptors to stimulate the immune system are critical antigens for protection against infectious pathogens. We describe a genomes-to-vaccine approach to H. pylori vaccine design that takes advantage of immunoinformatics algorithms to rapidly identify T-cell epitope sequences from large genomic datasets. RESULTS: To design a globally relevant vaccine, we used computational methods to identify a core genome comprised of 676 open reading frames (ORFs) from amongst seven genetically and phenotypically diverse H. pylori strains from around the world. Of the 1,241,153 9-mer sequences encoded by these ORFs, 106,791 were identical amongst all seven genomes and 23,654 scored in the top 5% of predicted HLA ligands for at least one of eight archetypal Class II HLA alleles when evaluated by EpiMatrix. To maximize the number of epitopes that can be assessed experimentally, we used a computational algorithm to increase epitope density in 20-25 amino acid stretches by assembling potentially immunogenic 9-mers to be identically positioned as they are in the native protein antigen. 1,805 immunogenic consensus sequences (ICS) were generated. 79% of selected ICS epitopes bound to a panel of 6 HLA Class II haplotypes, representing >90% of the global human population. CONCLUSIONS: The breadth of H. pylori genome datasets was computationally assessed to rapidly and carefully determine a core set of genes. Application of immunoinformatics tools to this gene set accurately predicted epitopes with promising properties for T cell-based vaccine development.
BACKGROUND: One useful application of pattern matching algorithms is identification of major histocompatability complex (MHC) ligands and T-cell epitopes. Peptides that bind to MHC molecules and interact with T cell receptors to stimulate the immune system are critical antigens for protection against infectious pathogens. We describe a genomes-to-vaccine approach to H. pylori vaccine design that takes advantage of immunoinformatics algorithms to rapidly identify T-cell epitope sequences from large genomic datasets. RESULTS: To design a globally relevant vaccine, we used computational methods to identify a core genome comprised of 676 open reading frames (ORFs) from amongst seven genetically and phenotypically diverse H. pylori strains from around the world. Of the 1,241,153 9-mer sequences encoded by these ORFs, 106,791 were identical amongst all seven genomes and 23,654 scored in the top 5% of predicted HLA ligands for at least one of eight archetypal Class II HLA alleles when evaluated by EpiMatrix. To maximize the number of epitopes that can be assessed experimentally, we used a computational algorithm to increase epitope density in 20-25 amino acid stretches by assembling potentially immunogenic 9-mers to be identically positioned as they are in the native protein antigen. 1,805 immunogenic consensus sequences (ICS) were generated. 79% of selected ICS epitopes bound to a panel of 6 HLA Class II haplotypes, representing >90% of the global human population. CONCLUSIONS: The breadth of H. pylori genome datasets was computationally assessed to rapidly and carefully determine a core set of genes. Application of immunoinformatics tools to this gene set accurately predicted epitopes with promising properties for T cell-based vaccine development.
Authors: J F Tomb; O White; A R Kerlavage; R A Clayton; G G Sutton; R D Fleischmann; K A Ketchum; H P Klenk; S Gill; B A Dougherty; K Nelson; J Quackenbush; L Zhou; E F Kirkness; S Peterson; B Loftus; D Richardson; R Dodson; H G Khalak; A Glodek; K McKenney; L M Fitzegerald; N Lee; M D Adams; E K Hickey; D E Berg; J D Gocayne; T R Utterback; J D Peterson; J M Kelley; M D Cotton; J M Weidman; C Fujii; C Bowman; L Watthey; E Wallin; W S Hayes; M Borodovsky; P D Karp; H O Smith; C M Fraser; J C Venter Journal: Nature Date: 1997-08-07 Impact factor: 49.962
Authors: N Salama; K Guillemin; T K McDaniel; G Sherlock; L Tompkins; S Falkow Journal: Proc Natl Acad Sci U S A Date: 2000-12-19 Impact factor: 11.205
Authors: David A Baltrus; Manuel R Amieva; Antonello Covacci; Todd M Lowe; D Scott Merrell; Karen M Ottemann; Markus Stein; Nina R Salama; Karen Guillemin Journal: J Bacteriol Date: 2008-10-24 Impact factor: 3.490
Authors: R A Alm; L S Ling; D T Moir; B L King; E D Brown; P C Doig; D R Smith; B Noonan; B C Guild; B L deJonge; G Carmel; P J Tummino; A Caruso; M Uria-Nickelsen; D M Mills; C Ives; R Gibson; D Merberg; S D Mills; Q Jiang; D E Taylor; G F Vovis; T J Trust Journal: Nature Date: 1999-01-14 Impact factor: 49.962
Authors: Anne S De Groot; Ankit K Desai; Sandra Lelias; S M Shahjahan Miah; Frances E Terry; Sundos Khan; Cindy Li; John S Yi; Matt Ardito; William D Martin; Priya S Kishnani Journal: Front Immunol Date: 2021-06-16 Impact factor: 7.561