Literature DB >> 29878047

ImmunomeBrowser: a tool to aggregate and visualize complex and heterogeneous epitopes in reference proteins.

Sandeep Kumar Dhanda1, Randi Vita1, Brendan Ha1, Alba Grifoni1, Bjoern Peters1,2, Alessandro Sette1,2.   

Abstract

Motivation: Datasets that are derived from different studies (e.g. MHC ligand elution, MHC binding, B/T cell epitope screening etc.) often vary in terms of experimental approaches, sizes of peptides tested, including partial and or nested overlapping peptides and in the number of donors tested.
Results: We present a customized application of the Immune Epitope Database's ImmunomeBrowser tool, which can be used to effectively aggregate and visualize heterogeneous immunological data. User provided peptide sets and associated response data is mapped to a user-provided protein reference sequence. The output consists of tables and figures representing the aggregated data represented by a Response Frequency score and associated estimated confidence interval. This allows the user to visualizing regions associated with dominant responses and their boundaries. The results are presented both as a user interactive javascript based web interface and a tabular format in a selected reference sequence. Availability and implementation: The 'ImmunomeBrowser' has been a longstanding feature of the IEDB (http://www.iedb.org). The present application extends the use of this tool to work with user-provided datasets, rather than the output of IEDB queries. This new server version of the ImmunomeBrowser is freely accessible at http://tools.iedb.org/immunomebrowser/.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29878047      PMCID: PMC6223373          DOI: 10.1093/bioinformatics/bty463

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Datasets that are derived from different studies (e.g. MHC ligand elution, MHC binding, B/T cell epitope screening etc.; Li Pira ) will often vary in terms of the experimental approaches used, the size of peptides that were tested and in the number of donors tested. To enable simplified visualization of the aggregated data present in the Immune Epitope Database (IEDB), we implemented the application called the ‘ImmunomeBrowser’ in the IEDB (Vita ). This tool aggregates all data relevant to the user query and allows one to visualize the known immune response to a specific antigen, as well as illustrating knowledge gaps in a reference protein. It provides the immune reactivity in terms of response frequency (RF) and the number of subjects tested/responded and/or number of independent assays performed along the length of reference protein. The tool was originally implemented in the results page of the database section of the IEDB. To further extend the usability to predicted epitopes and propriety epitopes or non-IEDB data, the online tool described herein was developed. The utility of the approach was demonstrated by Kim et al. who performed a meta-analysis of Hepatitis C virus (HCV) data available in the IEDB, to present a bigger picture of the immune reactivity and knowledge gaps in the reference protein sequences of the virus (Kim ). Currently, the Immunomebrowser can only be used with data derived from IEDB queries, and not with user datasets. To overcome this problem, we implemented the ImmunomeBrowser as a stand-alone tool to allow users to analyze and visualize immunodominant regions within their own dataset.

2 Materials and methods

2.1 Data input

The user provides peptide sequences, the response data for each, the protein sequence/s of interest and their desired sequence identity threshold in specified formats. The peptide response can be either pasted or uploaded as a file in whitespace separated format with three columns, corresponding to peptide sequence, number of subjects tested and/or number of assays performed, and the number of subjects responded and/or number of assays resulting in a positive response. In cases, where the number of subjects tested or responded or assays performed are not provided, the program will automatically fill in a value of ‘1’ for the number of subjects tested or assays performed, as well as for the number of subjects responding or positive assays. Protein sequences/s must be provided in ‘Fasta’ format and sequence identity is selected from a drop-down menu that varies from 10–100% with an interval of 10%.

2.2 Mapping of epitopes

Each peptide is mapped to a user provided reference protein sequence according to the provided identity threshold. The degree of identity is calculated based on the alignment of the peptide within reference sequence. Only peptides with sequence identity above the threshold are selected for further calculations.

2.3 RF and confidence interval calculations

The RF for a given peptide and for each source protein position is calculated as the total number of subjects that responded to that particular peptide and/or independent assay performed for which a positive response was noted (R) divided the total number of subjects tested and/or number of assays performed (N). A Confidence interval (CI) is calculated to weight the RF reliability as a function of the number of subjects tested. CI is calculated using the binomial cumulative distribution function and Wilson score. For large sample size (N>=50), lower and upper bound were calculated using following equation. For small small sample sizes (N < 50), lower and upper bounds are calculated using binomial cumulative distribution function.

2.4 Aggregation of RF data from different overlapping peptides

Aggregation of data is required to identify the most frequently recognized epitopes, which can reflect the overall frequency of recognition of peptide sequence containing a given residue. This approach is useful to identify the RF at each position in the reference sequence. To calculate the aggregated RF data, the number of subjects tested and/or assay performed and number of subjects responded and/or number of assays resulted a positive response were summed up for each mapped position in a given source protein. The CI of RF is calculated using the equations described above.

2.5 Result display

The results are presented in two steps, where the first step provides the summary of epitopes and assays mapped back to the reference protein sequence (Fig. 1).
Fig. 1.

Screenshots for the example output of the customized application of ‘ImmunomeBrowser’. (A) Tabular format listing all the different epitopes mapped to the given reference protein sequence. (B) Area plot for upper and lower bound CI for RF. The line plot shows the number of positive and negative assays or number of responder and not-responder subjects along the positions in reference protein. Hovering the mouse over any position in the reference protein in any of these plots will display the lower and upper bounds of the RF and number of assays/subjects count found as positive and negative (as shown in red rectangle)

Screenshots for the example output of the customized application of ‘ImmunomeBrowser’. (A) Tabular format listing all the different epitopes mapped to the given reference protein sequence. (B) Area plot for upper and lower bound CI for RF. The line plot shows the number of positive and negative assays or number of responder and not-responder subjects along the positions in reference protein. Hovering the mouse over any position in the reference protein in any of these plots will display the lower and upper bounds of the RF and number of assays/subjects count found as positive and negative (as shown in red rectangle) For each protein, a table lists all the epitopes, its mapped position, the number of subjects responded/positive assays, the number of subjects tested/assays performed and the RF along with its upper and lower bounds at 95% CI (Fig. 1A). The second step provides the aggregate plot of the mapped RF for each region of the reference protein, in two different plots representing the cumulative RF (upper and lower bound of RF) and total number of results (positive and negative) along the length of the selected reference protein (Fig. 1B).

3 Applications

The customized application of the ImmunomeBrowser lends itself to several applications. As mentioned above, Kim et al. has performed a meta-analysis of HCV data available in the IEDB, (Kim ; Vita ). The tool can now be utilized by users to collate and perform meta-analysis of data generated in multiple related studies. For example, the ImmunomeBrowser can be applied to natural ligand elution data containing largely overlapping peptides, and which are studied in different donors expressing different HLA molecules (Schellens ; Shastri ). For this purpose, the data needs to be combined for response frequencies from different donors and for each HLA molecule (Alvarez ). In this context, Vaughan et al. analyzed naturally processed data curated within the IEDB to characterize the overall general features of the known processed data and to highlight existing knowledge gaps (Vaughan ). The Immunomebrowser is also useful to analyze the immunogenicity testing of therapeutic proteins, where the overlapping peptides from a therapeutic protein are tested for immunogenicity to evaluate the unwanted immune response (Asgari ; Dhanda ; Jawa ; Salvat ). The Immunomebrowser, can thus aggregate the immune response data from different peptides and/or peptide analogs spanning through the length of the specified reference protein, even when tested in different donors and derived from different clinical studies. This allows users to easily view their data in a more meaningful and useful manner.
  11 in total

1.  Producing nature's gene-chips: the generation of peptides for display by MHC class I molecules.

Authors:  Nilabh Shastri; Susan Schwab; Thomas Serwold
Journal:  Annu Rev Immunol       Date:  2001-10-04       Impact factor: 28.527

Review 2.  T-cell dependent immunogenicity of protein therapeutics: Preclinical assessment and mitigation.

Authors:  Vibha Jawa; Leslie P Cousens; Michel Awwad; Eric Wakshull; Harald Kropshofer; Anne S De Groot
Journal:  Clin Immunol       Date:  2013-09-25       Impact factor: 3.969

Review 3.  Deciphering the MHC-associated peptidome: a review of naturally processed ligand data.

Authors:  Kerrie Vaughan; Xiaojun Xu; Etienne Caron; Bjoern Peters; Alessandro Sette
Journal:  Expert Rev Proteomics       Date:  2017-08-11       Impact factor: 3.940

4.  Computationally optimized deimmunization libraries yield highly mutated enzymes with low immunogenicity and enhanced activity.

Authors:  Regina S Salvat; Deeptak Verma; Andrew S Parker; Jack R Kirsch; Seth A Brooks; Chris Bailey-Kellogg; Karl E Griswold
Journal:  Proc Natl Acad Sci U S A       Date:  2017-06-12       Impact factor: 11.205

5.  Development of a strategy and computational application to select candidate protein analogues with reduced HLA binding and immunogenicity.

Authors:  Sandeep Kumar Dhanda; Alba Grifoni; John Pham; Kerrie Vaughan; John Sidney; Bjoern Peters; Alessandro Sette
Journal:  Immunology       Date:  2017-09-28       Impact factor: 7.397

Review 6.  Computational Tools for the Identification and Interpretation of Sequence Motifs in Immunopeptidomes.

Authors:  Bruno Alvarez; Carolina Barra; Morten Nielsen; Massimo Andreatta
Journal:  Proteomics       Date:  2018-02-26       Impact factor: 3.984

Review 7.  High throughput T epitope mapping and vaccine development.

Authors:  Giuseppina Li Pira; Federico Ivaldi; Paolo Moretti; Fabrizio Manca
Journal:  J Biomed Biotechnol       Date:  2010-06-15

8.  Rational design of stable and functional hirudin III mutants with lower antigenicity.

Authors:  Saeme Asgari; Hasan Mirzahoseini; Morteza Karimipour; Hamze Rahimi; Azadeh Ebrahim-Habibi
Journal:  Biologicals       Date:  2015-08-25       Impact factor: 1.856

9.  Comprehensive Analysis of the Naturally Processed Peptide Repertoire: Differences between HLA-A and B in the Immunopeptidome.

Authors:  Ingrid M M Schellens; Ilka Hoof; Hugo D Meiring; Sanne N M Spijkers; Martien C M Poelen; Jacqueline A M van Gaans-van den Brink; Kees van der Poel; Ana I Costa; Cecile A C M van Els; Debbie van Baarle; Can Kesmir
Journal:  PLoS One       Date:  2015-09-16       Impact factor: 3.240

10.  The immune epitope database (IEDB) 3.0.

Authors:  Randi Vita; James A Overton; Jason A Greenbaum; Julia Ponomarenko; Jason D Clark; Jason R Cantrell; Daniel K Wheeler; Joseph L Gabbard; Deborah Hix; Alessandro Sette; Bjoern Peters
Journal:  Nucleic Acids Res       Date:  2014-10-09       Impact factor: 16.971

View more
  17 in total

1.  Candidate Targets for Immune Responses to 2019-Novel Coronavirus (nCoV): Sequence Homology- and Bioinformatic-Based Predictions.

Authors:  Alba Grifoni; John Sidney; Yun Zhang; Richard H Scheuermann; Bjoern Peters; Alessandro Sette
Journal:  SSRN       Date:  2020-02-25

2.  A survey of known immune epitopes in the enteroviruses strains associated with acute flaccid myelitis.

Authors:  Alba Grifoni; Swapnil Mahajan; John Sidney; Sheridan Martini; Richard H Scheuermann; Bjoern Peters; Alessandro Sette
Journal:  Hum Immunol       Date:  2019-08-23       Impact factor: 2.850

Review 3.  Peptide-Based Vaccines for Tuberculosis.

Authors:  Wenping Gong; Chao Pan; Peng Cheng; Jie Wang; Guangyu Zhao; Xueqiong Wu
Journal:  Front Immunol       Date:  2022-01-31       Impact factor: 7.561

4.  Peptidome Surveillance Across Evolving SARS-CoV-2 Lineages Reveals HLA Binding Conservation in Nucleocapsid Among Variants With Most Potential for T-Cell Epitope Loss in Spike.

Authors:  Kamil Wnuk; Jeremi Sudol; Patricia Spilman; Patrick Soon-Shiong
Journal:  Front Immunol       Date:  2022-06-23       Impact factor: 8.786

5.  Parallel detection of SARS-CoV-2 epitopes reveals dynamic immunodominance profiles of CD8+ T memory cells in convalescent COVID-19 donors.

Authors:  Jet van den Dijssel; Ruth R Hagen; Rivka de Jongh; Maurice Steenhuis; Theo Rispens; Dionne M Geerdes; Juk Yee Mok; Angela Hm Kragten; Mariël C Duurland; Niels Jm Verstegen; S Marieke van Ham; Wim Je van Esch; Klaas Pjm van Gisbergen; Pleun Hombrink; Anja Ten Brinke; Carolien E van de Sandt
Journal:  Clin Transl Immunology       Date:  2022-10-14

6.  Identification and Characterization of CD4+ T Cell Epitopes after Shingrix Vaccination.

Authors:  Alessandro Sette; Alba Grifoni; Hannah Voic; Rory D de Vries; John Sidney; Paul Rubiro; Erin Moore; Elizabeth Phillips; Simon Mallal; Brittany Schwan; Daniela Weiskopf
Journal:  J Virol       Date:  2020-11-23       Impact factor: 5.103

7.  Landscape of epitopes targeted by T cells in 852 individuals recovered from COVID-19: Meta-analysis, immunoprevalence, and web platform.

Authors:  Ahmed Abdul Quadeer; Syed Faraz Ahmed; Matthew R McKay
Journal:  Cell Rep Med       Date:  2021-05-21

8.  T cell epitopes of SARS-CoV-2 spike protein and conserved surface protein of Plasmodium malariae share sequence homology.

Authors:  Md Mehedi Hassan; Shirina Sharmin; Jinny Hong; Hoi-Seon Lee; Hyeon-Jin Kim; Seong-Tshool Hong
Journal:  Open Life Sci       Date:  2021-06-23       Impact factor: 0.938

9.  A Sequence Homology and Bioinformatic Approach Can Predict Candidate Targets for Immune Responses to SARS-CoV-2.

Authors:  Alba Grifoni; John Sidney; Yun Zhang; Richard H Scheuermann; Bjoern Peters; Alessandro Sette
Journal:  Cell Host Microbe       Date:  2020-03-16       Impact factor: 21.023

10.  Development of a novel clustering tool for linear peptide sequences.

Authors:  Sandeep K Dhanda; Kerrie Vaughan; Veronique Schulten; Alba Grifoni; Daniela Weiskopf; John Sidney; Bjoern Peters; Alessandro Sette
Journal:  Immunology       Date:  2018-08-06       Impact factor: 7.397

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.