Literature DB >> 25712691

EpiToolKit--a web-based workbench for vaccine design.

Benjamin Schubert1, Hans-Philipp Brachvogel2, Christopher Jürges2, Oliver Kohlbacher3.   

Abstract

UNLABELLED: EpiToolKit is a virtual workbench for immunological questions with a focus on vaccine design. It offers an array of immunoinformatics tools covering MHC genotyping, epitope and neo-epitope prediction, epitope selection for vaccine design, and epitope assembly. In its recently re-implemented version 2.0, EpiToolKit provides a range of new functionality and for the first time allows combining tools into complex workflows. For inexperienced users it offers simplified interfaces to guide the users through the analysis of complex immunological data sets.
AVAILABILITY AND IMPLEMENTATION: http://www.epitoolkit.de CONTACT: schubert@informatik.uni-tuebingen.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25712691      PMCID: PMC4481845          DOI: 10.1093/bioinformatics/btv116

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Epitope-based vaccine design offers novel and rational ways to develop vaccines based on genomic information. The design process undergoes several steps. The first step aims at identifying antigenic peptides (called epitopes) that induce a T-cell mediated immune reaction after presentation on the cell surface by proteins of the major histocompatibility complex (MHC). In the second step a subset, usually of size 10–20 epitopes, is selected forming the basis of the vaccine. Due to the high polymorphism within the MHC cluster, each individual possesses a unique set of MHC alleles and therefore presents a different set of epitopes. Hence, it is not only necessary to identify the individuals MHC genotype but also to tailor the epitope selection to match the MHC allele restrictions of a population (population-optimized vaccine) or that of an individual (personalized vaccines). The third step of the design pipeline is concerned with the delivery of the selected epitopes. A common strategy concatenates the epitopes into a so-called string-of-beads polypeptide. The epitope order within a string-of-beads plays a crucial role especially in degradation. Therefore it is necessary to optimize the ordering such that the recovery probability of the epitopes is maximal. Since the underlying data and the interdependencies of the design pipeline are complex and require bioinformatics tools to obtain optimal results, we developed a web-based platform EpiToolKit (ETK) to make such approaches accessible to a broader audience. ETK extends its predecessor by supporting MHC genotyping, and epitope assembly besides epitope discovery and epitope selection. Thus, it covers each of the described design steps and can be used for personalized or population-optimized vaccine development as well as for other immunological applications (e.g. large-scale epitope prediction). Additionally, functionalities such as the supported prediction methods and input formats have been extended. Also ETK is now based on a customized version of the open-source platform Galaxy (Goecks ), which allows a flexible combination of tools into workflows, a reliable recording and sharing of results, and the integration with high-performance computing resources.

2 Material and methods

ETK was designed to ease the use for inexperienced users but still retain high flexibility in combining the different tools. Under the tab Single Tools the interfaces are simplified into several configuration steps equipped with help texts. Under the Workflow tab these steps are available as independent nodes, allowing the development of complex workflows. All ETK tools generate two outputs: an interactive presentation of the results as html and an internal representation that can be used as input to other tools. ETK integrates OptiType, a newly developed NGS-based MHC genotyping approach that is superior in accuracy to existing methods (Szolek ). OptiType uses integer linear programming to simultaneously select all major MHC class I alleles comprising the genotype and supports Exome-Seq, RNA-Seq and whole-genome sequencing data. ETK also provides access to a collection of popular epitope prediction tools. The available methods include SYFPEITHI, BIMAS, SVMHC, the NetMHC family (reviewed in Toussaint and Kohlbacher, 2009), UniTope (Toussaint ), and TEPITOPEpan (Zhang ). With Polymorphic Epitope Prediction ETK extends epitope prediction to the incorporation of sequencing variations and is therefore vital for personalized design approaches. This method is based on SNEP (Schuler ) and was extended to handle indels and frame shifts besides single nucleotide polymorphisms. It either searches for known variations of a given protein within dbSNP (Sherry ) or uses a list of variations in vcf format. In both cases the variations are annotated using ANNOVAR (Wang ) to construct all polymorphic epitopes. This pipeline can be used to identify minor histocompatibility antigens (Feldhahn ) or neoepitopes, which are of high interest in cancer vaccine design (Kyzirakos ). For epitope selection ETK re-implements the mathematical framework OptiTope (reviewed in Toussaint and Kohlbacher, 2009). It determines a set of epitopes that maximizes the overall immunogenicity under constraints and thus the probability of inducing a long lasting immunity. Overall immunogenicity of an epitope set is defined as the sum over the immunogenicity of each epitope MHC allele pair, weighted by the probability of an MHC allele to appear in the target population or person. The problem of epitope ordering for string-of-beads design has been previously formulated as a traveling salesman problem (Toussaint ) and is now available in ETK. Since this approach is dependent on proteasomal cleavage site predictions, ETK offers two cleavage prediction approaches, PCM and NetChop (reviewed in Toussaint and Kohlbacher, 2009).

3 Applications

To demonstrate ETKs capabilities, a workflow for designing population-optimized vaccines for seasonal influenza was developed (Fig. 1). Based on the yearly WHO recommendations a dataset consisting of H1N1 and H3N2 strains was extracted from the Influenza research database (Squires ). Using NetMHC and default configurations for the Epitope Selection step, 10 epitopes were selected. The epitopes covered 5 out of 10 antigens and 26 out of 47 MHC alleles with a population coverage of 99.66%. On average each epitope was predicted to bind to MHC alleles. According to the Immune Epitope Database (Vita ) 10 out of 10 epitopes are known MHC binders or substrings of such and 5 out of 10 are T-cell reactive epitopes or substrings of such. For detailed results see Supplementary Tables S1–S3.
Fig. 1.

Example Workflow for population-based vaccine design. Allele Selection allows to specify the target population represented by their MHC alleles. Allele Frequencies then assigns frequencies to the chosen MHC alleles based on preassembled data or manually assigned frequencies. Epitope Conservation takes a file containing multiple MSA of antigens and constructs consensus sequences for each of them and calculates conservation scores for each k-mer peptide generated from the consensus sequences. Epitope Prediction performs the epitope prediction for the specified MHC alleles and the consensus sequences. Epitope Selection consumes the prediction results and selects a pre-defined number of epitopes under constraints for the specified target population and antigens. Epitope Assembly arranges the selected epitopes such that their recovery probability after proteasomal cleavage is maximal

Example Workflow for population-based vaccine design. Allele Selection allows to specify the target population represented by their MHC alleles. Allele Frequencies then assigns frequencies to the chosen MHC alleles based on preassembled data or manually assigned frequencies. Epitope Conservation takes a file containing multiple MSA of antigens and constructs consensus sequences for each of them and calculates conservation scores for each k-mer peptide generated from the consensus sequences. Epitope Prediction performs the epitope prediction for the specified MHC alleles and the consensus sequences. Epitope Selection consumes the prediction results and selects a pre-defined number of epitopes under constraints for the specified target population and antigens. Epitope Assembly arranges the selected epitopes such that their recovery probability after proteasomal cleavage is maximal

4 Conclusion

With ETK we provide a flexible and yet easy to use platform for rational vaccine design. Beyond the presented application ETK can be used to tackle a manifold of other immunological questions and thus should not only be valuable for applied medical but also for basic immunological research.

Funding

This study was partially funded by DFG (SFB 685/B1, KO 2313/6-1) and BMBF (01GU1106). Conflict of Interest: none declared.
  11 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  SNEP: SNP-derived epitope prediction program for minor H antigens.

Authors:  Mathias M Schuler; Pierre Dönnes; Maria-Dorothea Nastke; Oliver Kohlbacher; Hans-Georg Rammensee; Stefan Stevanovic
Journal:  Immunogenetics       Date:  2005-12-10       Impact factor: 2.846

3.  Universal peptide vaccines - optimal peptide vaccine design based on viral sequence conservation.

Authors:  Nora C Toussaint; Yaakov Maman; Oliver Kohlbacher; Yoram Louzoun
Journal:  Vaccine       Date:  2011-08-27       Impact factor: 3.641

4.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data.

Authors:  Kai Wang; Mingyao Li; Hakon Hakonarson
Journal:  Nucleic Acids Res       Date:  2010-07-03       Impact factor: 16.971

5.  miHA-Match: computational detection of tissue-specific minor histocompatibility antigens.

Authors:  Magdalena Feldhahn; Pierre Dönnes; Benjamin Schubert; Karin Schilbach; Hans-Georg Rammensee; Oliver Kohlbacher
Journal:  J Immunol Methods       Date:  2012-09-14       Impact factor: 2.303

6.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences.

Authors:  Jeremy Goecks; Anton Nekrutenko; James Taylor
Journal:  Genome Biol       Date:  2010-08-25       Impact factor: 13.583

7.  TEPITOPEpan: extending TEPITOPE for peptide binding prediction covering over 700 HLA-DR molecules.

Authors:  Lianming Zhang; Yiqing Chen; Hau-San Wong; Shuigeng Zhou; Hiroshi Mamitsuka; Shanfeng Zhu
Journal:  PLoS One       Date:  2012-02-23       Impact factor: 3.240

8.  Influenza research database: an integrated bioinformatics resource for influenza research and surveillance.

Authors:  R Burke Squires; Jyothi Noronha; Victoria Hunt; Adolfo García-Sastre; Catherine Macken; Nicole Baumgarth; David Suarez; Brett E Pickett; Yun Zhang; Christopher N Larsen; Alvin Ramsey; Liwei Zhou; Sam Zaremba; Sanjeev Kumar; Jon Deitrich; Edward Klem; Richard H Scheuermann
Journal:  Influenza Other Respir Viruses       Date:  2012-01-20       Impact factor: 4.380

9.  OptiType: precision HLA typing from next-generation sequencing data.

Authors:  András Szolek; Benjamin Schubert; Christopher Mohr; Marc Sturm; Magdalena Feldhahn; Oliver Kohlbacher
Journal:  Bioinformatics       Date:  2014-08-20       Impact factor: 6.937

10.  The immune epitope database 2.0.

Authors:  Randi Vita; Laura Zarebski; Jason A Greenbaum; Hussein Emami; Ilka Hoof; Nima Salimi; Rohini Damle; Alessandro Sette; Bjoern Peters
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

View more
  16 in total

Review 1.  The role of proteomics in the age of immunotherapies.

Authors:  Sarah A Hayes; Stephen Clarke; Nick Pavlakis; Viive M Howell
Journal:  Mamm Genome       Date:  2018-07-25       Impact factor: 2.957

2.  Current progress of immunoinformatics approach harnessed for cellular- and antibody-dependent vaccine design.

Authors:  Ada Kazi; Candy Chuah; Abu Bakar Abdul Majeed; Chiuan Herng Leow; Boon Huat Lim; Chiuan Yee Leow
Journal:  Pathog Glob Health       Date:  2018-03-12       Impact factor: 2.894

3.  Vaccines and Immunoinformatics for Vaccine Design.

Authors:  Shikha Joon; Rajeev K Singla; Bairong Shen
Journal:  Adv Exp Med Biol       Date:  2022       Impact factor: 2.622

Review 4.  Computational genomics tools for dissecting tumour-immune cell interactions.

Authors:  Hubert Hackl; Pornpimol Charoentong; Francesca Finotello; Zlatko Trajanoski
Journal:  Nat Rev Genet       Date:  2016-07-04       Impact factor: 53.242

5.  FRED 2: an immunoinformatics framework for Python.

Authors:  Benjamin Schubert; Mathias Walzer; Hans-Philipp Brachvogel; András Szolek; Christopher Mohr; Oliver Kohlbacher
Journal:  Bioinformatics       Date:  2016-02-26       Impact factor: 6.937

Review 6.  Fundamentals and Methods for T- and B-Cell Epitope Prediction.

Authors:  Jose L Sanchez-Trincado; Marta Gomez-Perosanz; Pedro A Reche
Journal:  J Immunol Res       Date:  2017-12-28       Impact factor: 4.818

7.  Population-level distribution and putative immunogenicity of cancer neoepitopes.

Authors:  Mary A Wood; Mayur Paralkar; Mihir P Paralkar; Austin Nguyen; Adam J Struck; Kyle Ellrott; Adam Margolin; Abhinav Nellore; Reid F Thompson
Journal:  BMC Cancer       Date:  2018-04-13       Impact factor: 4.430

Review 8.  Immunoinformatics and epitope prediction in the age of genomic medicine.

Authors:  Linus Backert; Oliver Kohlbacher
Journal:  Genome Med       Date:  2015-11-20       Impact factor: 11.117

9.  Designing string-of-beads vaccines with optimal spacers.

Authors:  Benjamin Schubert; Oliver Kohlbacher
Journal:  Genome Med       Date:  2016-01-26       Impact factor: 11.117

10.  pVAC-Seq: A genome-guided in silico approach to identifying tumor neoantigens.

Authors:  Jasreet Hundal; Beatriz M Carreno; Allegra A Petti; Gerald P Linette; Obi L Griffith; Elaine R Mardis; Malachi Griffith
Journal:  Genome Med       Date:  2016-01-29       Impact factor: 11.117

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.