Literature DB >> 31819989

NeoPeptide: an immunoinformatic database of T-cell-defined neoantigens.

Wei-Jun Zhou1, Zhi Qu2, Chao-Yang Song1, Yang Sun3, An-Li Lai3, Ma-Yao Luo3, Yu-Zhe Ying3, Hu Meng3, Zhao Liang1, Yan-Jie He1, Yu-Hua Li1, Jian Liu4.   

Abstract

Therapeutic vaccines represent a promising immunotherapeutic modality against cancer. Discovery and validation of antigens is the key to develop effective anti-cancer vaccines. Neoantigens, arising from somatic mutations in individual cancers, are considered as ideal cancer vaccine targets because of their immunogenicity and lack of expression in normal tissues. However, only few databases support convenient access to these neoantigens for use in vaccines. To address this gap, we developed a web-accessible database, called NeoPeptide, which contains most of the important characteristics of neoantigens (such as mutation site, subunit sequence, major histocompatibility complex restriction) derived from published literature and other immunological resources. NeoPeptide also provides links to resources for further characterization of the novel features of these neoantigens. NeoPeptide will be regularly updated with newly identified and published neoantigens. Our work will help researchers in identifying neoantigens in different cancers and hasten the search for appropriate cancer vaccine candidates.
© The Author(s) 2019. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31819989      PMCID: PMC6901387          DOI: 10.1093/database/baz128

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


Introduction

The ability of the immune system to recognize cancerous cells has opened the door for the development of cancer immunotherapy. Anti-cancer vaccines represent one of the most promising immunotherapeutic modalities against cancer. An ideal vaccine would harness the immune system for the selective destruction of cancer cells while having no adverse effect on normal tissues. Traditionally, anti-cancer vaccines have targeted tumor-associated antigens (TAAs), which are produced by genes overexpressed in cancer cells, such as Survivin (1), gp100 (2), WT1 (3), HER2/Neu (4) and MUC1 (5). These TAA-based vaccines have been shown to induce specific T responses in vitro; however, robust evidence of their long-term efficacy in cancer patients is yet to be obtained (6). A key limiting factor is that TAAs are also expressed in normal cells or germline tissues, although at low levels. This prevents an effective immune response due to central immune tolerance; contrarily, a strong immune response against these antigens may result in autoimmunity (7, 8). Structure of the neopeptide system. Unlike TAAs, neoantigens, arising from somatic mutations, are expressed only on tumor cells (9). In this context, neoantigens may not be subject to immune tolerance and pose a lower risk of inducing autoimmunity; thus, these are considered as ideal cancer vaccine targets. However, translation of a somatic mutation into a neoantigen-based vaccine would entail at least four steps: (i) prediction of somatic nucleic acid exchange, (ii) identification of mutated proteins and the corresponding amino acid exchange, (iii) human leukocyte antigen (HLA) typing of patients and (iv) screening of the mutated peptide sequences with high affinity to HLA/MHC (major histocompatibility complex) molecules. The time required for these steps greatly reduces the work efficiency and limits the clinical utility (10–12). Therefore, there is a need to establish a database that catalogs the verified neoantigens and provides an effective search functionality to recognize neoepitopes as soon as a novel somatic mutation is defined (13–15). Previously developed databases have compiled a vast repertoire of traditional TAAs and largely ignored the importance of gathering epitopes derived from neoantigens [e.g. the AntiJen database (http://www.ddg-pharmfac.net/antijen/AntiJen/antijenhomepage.htm) (16)] or show the neoepitopes predicted from The Cancer Genome Atlas (TCGA) and the TCIA database (https://tcia.at/home) (17) [e.g. the TSNAdb database (http://biopharm.zju.edu.cn/tsnadb/) (18)] or are based on experimental data [e.g. the IEDB database (http://www.iedb.org/) (19)] or mostly focus on peptides identified previously [e.g. the CAPED database (https://caped.icp.ucl.ac.be/)]. Here, we introduce NeoPeptide, a web-based database, which provides access to search facility for neoantigens in the context of different cancers. NeoPeptide not only helps characterize the newly identified neoantigens (such as sequence, MHC restriction, adjuvant] but also provides additional information such as the corresponding nucleic acid and amino acid exchange in each neoepitope. In addition, NeoPeptide provides a way to further discover new characteristics of these neoepitopes.

Materials and Methods

Database implementation

The NeoPeptide database was built using tomcat 9.0.7 (web server) and MySQL (database server). All data were processed and organized into a MySQL database management system (version 5.7.20). The framework of the website was designed and implemented using the Java Spring MVC and the user interface was developed using html, css and jQuery libraries. The website has been tested using several web browsers, including Internet Explorer (e.g. version 11.648.17134.0), Google Chrome (e.g. version 73.0.3683.86) and Firefox (e.g. version 60.0.2). Search result example of ‘fuzzy search’. When the keyword ‘KRAS’ was searched, 76 entries were returned. Search result example of ‘advanced search’. When the keywords ‘Melanoma’ and ‘KRAS’ were searched at the same time, 12 entries were returned.

System architecture

A schematic illustration of the system architecture is shown in Figure 1. As this is a web-based application, users are not required to install complex software. In addition, a single browser is needed for searching and downloading the data.
Figure 1

Structure of the neopeptide system.

Search functionality

The search functionality has two modes. The first mode is the fuzzy search mode for general users, wherein users input a keyword to retrieve the related items. The web application sorts the retrieved items by priority, since the rules matching all fields may lead to a large amount of data. Therefore, we introduced the Levenshtein distance to help users quickly identify the data items of interest. The second mode is the advanced search mode wherein the advanced users can search for terms according to specific fields such as gene, neoantigen, amino acid exchange, HLA allele or peptide. Furthermore, there are three Boolean operators (AND, OR and NOT) to combine or exclude keywords for more focused results. The Boolean operator AND is the default mode; however, the users can alter the functionality using a pull-down menu.

Download in NeoPeptide

The original data of NeoPeptide is freely available for download in the .dbf and *.xlsx formats. The .dbf file extension represents the dBase database file, and this format is supported by a number of database products. After downloading this file, users can import the data in their own database systems via the database import wizard function. For instance, users can use the database development tool Navicat (https://www.navicat.com/en/products/navicat-for-mysql), which incorporates the import wizard function for importing the .dbf file into a database system.

Results

Statistics of neoantigens in NeoPeptide

NeoPeptide is a comprehensive resource of somatic mutation-derived neoantigens. Neoantigens confirmed in cancer vaccines from clinical trials as well as in vitro experiments are recorded. Detailed information pertaining to the characteristics of the neoantigens is included in the database, such as HLA typing and corresponding nucleic acid and amino acid exchange. NeoPeptide currently contains 181 137 epitopes derived from more than 36 000 neoantigens. These neoantigens have been defined in different cancers, including non-small cell lung cancer (e.g. EGFR Q701L), breast cancer (e.g. TP53 S355F), melanoma (e.g. TMEM48 F169 L), renal cell carcinoma (e.g. ARMC8 T60I), myeloproliferative neoplasms (e.g. TET2 C784S), gastric cancer (e.g. BRCA2 S37 L), uterine leiomyosarcoma (e.g. SEC14L5 L289 V), ovarian cancer (e.g. ABCD2 E686Q), pancreatic cancer (e.g. MEN1 A68P), colon cancer (e.g. TP53 R175H) and head and neck squamous cell carcinoma (e.g. BRE S170 L) and so on. In addition, NeoPeptide will be regularly updated to include newly identified and published neoantigens.

Applications of NeoPeptide: a utility case

To facilitate database access, we developed a web interface that incorporates both simple and advanced search functionalities. For each entry, the main information includes ‘Cancer’ (the underlying disease from which the neoantigen was found), ‘Gene’ (the corresponding wild-type gene), ‘Neoantigen’, ‘Nucleic Acid Exchange’ (the nucleic acid somatic from the corresponding wild-type gene), ‘Amino Acid Exchange’ (the amino acid somatic from the corresponding wild-type gene), ‘HLA/MHC Allele’ (the HLA restriction of neoantigens used in the cancer vaccine), ‘Length’ (the length of neoantigen-derived epitopes used in the cancer vaccine), ‘Peptide’ (the sequence of neoantigen-derived epitopes) and ‘journalRef’ (publications that support the role of the neoantigen in cancer vaccine). For demonstration purposes, the results of a fuzzy search ‘KRAS’ are shown in Figure 2. The database search retrieved 76 entries that have been experimentally identified or predicted as subunits in vaccines against melanoma, pancreatic cancer, thymoma or lung cancer. In the advanced search mode with restrictions of ‘melanoma’ and ‘KRAS’, 12 entries were retrieved (Figure 3). These results indicate that these 12 peptides may apply to neoantigen vaccine targeting the corresponding mutated KRAS and that this neoantigen vaccine may be effective in patients with melanoma. In addition, on clicking the entries in the ‘Cancer’ or ‘Gene’ column, the user will be directed to the NCBI’s search page (https://www.ncbi.nlm.nih.gov/search/) that contains detailed information about the corresponding cancer or the wild-type gene, respectively. Similarly, on clicking the entries in the ‘Peptide’ column, the user is directed to the IEDB webpage (http://tools.iedb.org/mhci/) for more details about the affinity to HLA or MHC molecules of these neoantigen-derived epitopes (Supplement information). Clicking the ‘journalRef’ link will navigate the user directly to the NCBI’s journal page.
Figure 2

Search result example of ‘fuzzy search’. When the keyword ‘KRAS’ was searched, 76 entries were returned.

Figure 3

Search result example of ‘advanced search’. When the keywords ‘Melanoma’ and ‘KRAS’ were searched at the same time, 12 entries were returned.

Discussions

Most cancers are characterized by a number of somatic mutations; however, not all of these mutations are suitable targets for cancer vaccine. According to the clinical trials of neoantigen vaccine, translation of somatic mutations into neoantigens for use in vaccines typically requires a period of 1–2 months. As most patients with advanced cancers survive for no more than 3 months, this time-consuming work represents a major barrier to the clinical use of neoantigen vaccines. Here, we developed a user-friendly neoantigen database, which may help researchers in quickly selecting the peptides for potential use in neoantigen vaccines. Moreover, NeoPeptide also provides useful links for further research on the functions of neoantigens. This database will be regularly updated with newly identified neoantigens in the context of a wider range of cancers. Further development of NeoPeptide will continue in order to offer a more comprehensive collection of experimentally supported data. Many previous works mainly focus on the peptide information retrieval using the deterministic keyword inputs; this may affect the search capabilities and data FAIRness. Some databases (e.g. IEDB) raise concerns about ontology usage in recent years and make promising changes to reach the FAIR principles, which make it more interoperable with other projects. According to this, we will iteratively use ontology into the user interface, especially for disease term search and ‘HLA/MHC Allele’ search and reach more goals of FAIRness (19, 20). Click here for additional data file.
  19 in total

1.  Cancer immunotherapy. A dendritic cell vaccine increases the breadth and diversity of melanoma neoantigen-specific T cells.

Authors:  Beatriz M Carreno; Vincent Magrini; Michelle Becker-Hapak; Saghar Kaabinejadian; Jasreet Hundal; Allegra A Petti; Amy Ly; Wen-Rong Lie; William H Hildebrand; Elaine R Mardis; Gerald P Linette
Journal:  Science       Date:  2015-04-02       Impact factor: 47.728

2.  Identification of new HER2/neu-derived peptide epitopes that can elicit specific CTL against autologous and allogeneic carcinomas and melanomas.

Authors:  Y Rongcun; F Salazar-Onfray; J Charo; K J Malmberg; K Evrin; H Maes; K Kono; C Hising; M Petersson; O Larsson; L Lan; E Appella; A Sette; E Celis; R Kiessling
Journal:  J Immunol       Date:  1999-07-15       Impact factor: 5.422

3.  Generation of survivin-specific CD8+ T effector cells by dendritic cells pulsed with protein or selected peptides.

Authors:  M Schmitz; P Diestelkoetter; B Weigle; F Schmachtenberg; S Stevanovic; D Ockert; H G Rammensee; E P Rieber
Journal:  Cancer Res       Date:  2000-09-01       Impact factor: 12.701

4.  Induction of Neoantigen-Specific Cytotoxic T Cells and Construction of T-cell Receptor-Engineered T Cells for Ovarian Cancer.

Authors:  Tatsuo Matsuda; Matthias Leisegang; Jae-Hyun Park; Lili Ren; Taigo Kato; Yuji Ikeda; Makiko Harada; Kazuma Kiyotani; Ernst Lengyel; Gini F Fleming; Yusuke Nakamura
Journal:  Clin Cancer Res       Date:  2018-05-02       Impact factor: 12.531

Review 5.  Personalized neoantigen vaccines: A new approach to cancer immunotherapy.

Authors:  Amanda R Aldous; Jesse Z Dong
Journal:  Bioorg Med Chem       Date:  2017-10-19       Impact factor: 3.641

6.  Cancer immunotherapy: moving beyond current vaccines.

Authors:  Steven A Rosenberg; James C Yang; Nicholas P Restifo
Journal:  Nat Med       Date:  2004-09       Impact factor: 53.440

Review 7.  Therapeutic vaccines for cancer: an overview of clinical trials.

Authors:  Ignacio Melero; Gustav Gaudernack; Winald Gerritsen; Christoph Huber; Giorgio Parmiani; Suzy Scholl; Nicholas Thatcher; John Wagstaff; Christoph Zielinski; Ian Faulkner; Håkan Mellstedt
Journal:  Nat Rev Clin Oncol       Date:  2014-07-08       Impact factor: 66.675

8.  AntiJen: a quantitative immunology database integrating functional, thermodynamic, kinetic, biophysical, and cellular data.

Authors:  Christopher P Toseland; Debra J Clayton; Helen McSparron; Shelley L Hemsley; Martin J Blythe; Kelly Paine; Irini A Doytchinova; Pingping Guan; Channa K Hattotuwagama; Darren R Flower
Journal:  Immunome Res       Date:  2005-10-06

9.  An effective biomedical data migration tool from resource description framework to JSON.

Authors:  Jian Liu; Mo Yang; Lei Zhang; Weijun Zhou
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

10.  The FAIR Guiding Principles for scientific data management and stewardship.

Authors:  Mark D Wilkinson; Michel Dumontier; I Jsbrand Jan Aalbersberg; Gabrielle Appleton; Myles Axton; Arie Baak; Niklas Blomberg; Jan-Willem Boiten; Luiz Bonino da Silva Santos; Philip E Bourne; Jildau Bouwman; Anthony J Brookes; Tim Clark; Mercè Crosas; Ingrid Dillo; Olivier Dumon; Scott Edmunds; Chris T Evelo; Richard Finkers; Alejandra Gonzalez-Beltran; Alasdair J G Gray; Paul Groth; Carole Goble; Jeffrey S Grethe; Jaap Heringa; Peter A C 't Hoen; Rob Hooft; Tobias Kuhn; Ruben Kok; Joost Kok; Scott J Lusher; Maryann E Martone; Albert Mons; Abel L Packer; Bengt Persson; Philippe Rocca-Serra; Marco Roos; Rene van Schaik; Susanna-Assunta Sansone; Erik Schultes; Thierry Sengstag; Ted Slater; George Strawn; Morris A Swertz; Mark Thompson; Johan van der Lei; Erik van Mulligen; Jan Velterop; Andra Waagmeester; Peter Wittenburg; Katherine Wolstencroft; Jun Zhao; Barend Mons
Journal:  Sci Data       Date:  2016-03-15       Impact factor: 6.444

View more
  5 in total

1.  iCDA-CMG: identifying circRNA-disease associations by federating multi-similarity fusion and collective matrix completion.

Authors:  Qiu Xiao; Jiancheng Zhong; Xiwei Tang; Jiawei Luo
Journal:  Mol Genet Genomics       Date:  2020-11-06       Impact factor: 3.291

2.  Circulating expression levels of CircHIPK3 and CDR1as circular-RNAs in type 2 diabetes patients.

Authors:  Farzaneh Rezaeinejad; Ali Mirzaei; Bahman Khalvati; Gholamabbas Sabz; Behnam Alipoor
Journal:  Mol Biol Rep       Date:  2021-11-03       Impact factor: 2.316

Review 3.  Pathological Response and Immune Biomarker Assessment in Non-Small-Cell Lung Carcinoma Receiving Neoadjuvant Immune Checkpoint Inhibitors.

Authors:  Frank Rojas; Edwin Roger Parra; Ignacio Ivan Wistuba; Cara Haymaker; Luisa Maren Solis Soto
Journal:  Cancers (Basel)       Date:  2022-06-02       Impact factor: 6.575

4.  CAD v1.0: Cancer Antigens Database Platform for Cancer Antigen Algorithm Development and Information Exploration.

Authors:  Jijun Yu; Luoxuan Wang; Xiangya Kong; Yang Cao; Mengmeng Zhang; Zhaolin Sun; Yang Liu; Jing Wang; Beifen Shen; Xiaochen Bo; Jiannan Feng
Journal:  Front Bioeng Biotechnol       Date:  2022-05-12

Review 5.  Identification of neoantigens for individualized therapeutic cancer vaccines.

Authors:  Franziska Lang; Barbara Schrörs; Martin Löwer; Özlem Türeci; Ugur Sahin
Journal:  Nat Rev Drug Discov       Date:  2022-02-01       Impact factor: 112.288

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.