Literature DB >> 35493528

dbPepNeo2.0: A Database for Human Tumor Neoantigen Peptides From Mass Spectrometry and TCR Recognition.

Manman Lu1,2, Linfeng Xu2,3, Xingxing Jian2,4, Xiaoxiu Tan2,5, Jingjing Zhao1,2, Zhenhao Liu2, Yu Zhang2,3, Chunyu Liu1,2, Lanming Chen1, Yong Lin3, Lu Xie1,2,4.   

Abstract

Neoantigens are widely reported to induce T-cell response and lead to tumor regression, indicating a promising potential to immunotherapy. Previously, we constructed an open-access database, i.e., dbPepNeo, providing a systematic resource for human tumor neoantigens to storage and query. In order to expand data volume and application scope, we updated dbPepNeo to version 2.0 (http://www.biostatistics.online/dbPepNeo2). Here, we provide about 801 high-confidence (HC) neoantigens (increased by 170%) and 842,289 low-confidence (LC) HLA immunopeptidomes (increased by 107%). Notably, 55 class II HC neoantigens and 630 neoantigen-reactive T-cell receptor-β (TCRβ) sequences were firstly included. Besides, two new analytical tools are developed, DeepCNN-Ineo and BLASTdb. DeepCNN-Ineo predicts the immunogenicity of class I neoantigens, and BLASTdb performs local alignments to look for sequence similarities in dbPepNeo2.0. Meanwhile, the web features and interface have been greatly improved and enhanced.
Copyright © 2022 Lu, Xu, Jian, Tan, Zhao, Liu, Zhang, Liu, Chen, Lin and Xie.

Entities:  

Keywords:  TCR; deep learning; experimental validation; mass spectrometry; neoantigen

Mesh:

Substances:

Year:  2022        PMID: 35493528      PMCID: PMC9043652          DOI: 10.3389/fimmu.2022.855976

Source DB:  PubMed          Journal:  Front Immunol        ISSN: 1664-3224            Impact factor:   8.786


Introduction

The complex process by which the immune system eliminates cancer cells is regulated by several factors; one of the most important events is the production of neoantigens (1, 2). Different mutated variants of numerous mutations lead to the production of new protein sequences, which were deemed as the primary sources of neoantigens (3, 4), such as single-nucleotide variants (SNV) and insertions or deletions (INDEL). The non-coding regions of the genome is another source of neoantigens, such as long non-coding RNAs (2, 4, 5). These foreign proteins are naturally processed by antigen-presenting cells (APC) cells and presented by human leukocyte antigens (HLAs) and then truly induce an efficient response of CD8 or CD4 lymphocytes (6–8). Theoretically, tumor neoantigens are promising cancer immunotherapy targets due to being not subject to central tolerance and less likely trigger autoimmune toxicity (7, 9–11). At present, there are two primary approaches for utilizing tumor neoantigen in clinical practices: one is to expand and reinforce natural or modified T cells ex vivo and then reinject them back into the patient to kill cancer cells (12–15). Another is to design and develop personalized vaccines (16, 17). In 2021, Hu et al. showed that neoantigen peptide vaccines can induce persistent memory T-cell responses and expand the range of tumor-specific cytotoxicity (18). Meanwhile, further investigations found that a combination therapy of personalized neoantigen vaccines and checkpoint blockade monoclonal antibodies could also enhance T-cell responses in patients (17). Moreover, neoantigens play significant roles in immune escape and immunoediting (19, 20). Mass spectrometry (MS)-based workflows applied on clinical cancer cell lines and tumor samples produce rich peptidomics data sets and hence the straightforward identifications of MHC-bound peptides (21). Integrating with next-generation sequencing, this approach has reported dozens of neoantigens deriving from immunopeptidomic analysis (22–25). In addition, fusion is an important class of somatic mutations formed by chromosome structural variation (SV) (11), which can serve as an ideal source of neoantigens for creating an open reading frame (ORF) (11, 26, 27). Wei et al. showed that neoantigens derived from fusion genes tend to have notably higher immunogenic potential than common single-nucleotide variation and indel-based candidate neoantigens, making them more viable as clinical cancer vaccines (11, 27, 28). Laumont et al. identified 40 specific neoantigens based on the proteogenomic approach, which mainly derived from allegedly non-coding regions (29). The diversity of discovering the source of tumor neoantigens implies that T cells can be directed against a variety of genomic aberrations in cancer (30). Therefore, there is great interest in integrating all immunogenic peptide resources for cancer immunotherapy. First released in 2019 by our group, dbPepNeo was a database of human tumor neoantigens either immunogenically tested or mass spectroscopically validated, which contained high-confidence (HC) neoantigens, medium-confidence (MC) neoantigens, and low-confidence (LC) HLA-binding peptidomes. With the increasing need for breadth and precision of neoantigen data, we updated dbPepNeo (31) to version 2.0 here. dbPepNeo2.0 substantially added HC and LC data, greatly expanded the data volume and application scope, and significantly modernized the dbPepNeo user interface. In addition to expansion of existing data types, dbPepNeo2.0 also added novel data types such as fusion neoantigen peptides, HLA class II neoantigens, and non-coding region neoantigens. Also, neoantigen-reactive T-cell receptor-β (TCRβ) sequences were firstly included. Furthermore, two new neoantigen analytical tools were incorporated into dbPepNeo2.0 to provide selection of the immunogenic neoantigens. DeepCNN-Ineo predicts the immunogenicity of class I neoantigens, and BLASTdb performs local alignments to look for sequence similarities in dbPepNeo2.0. All of these additions and enhancements are intended to support the development of neoantigen-based cancer vaccines and facilitate research in clinical cancer immunotherapy.

Materials and Methods

Data Acquisition and Classification

We updated dbPepNeo2.0 mainly by compiling the new data from peer-reviewed immunology literatures, and the existing public databases (e.g., Cancer Antigenic Peptide Database, Immune Epitope Database) (32, 33). Different combinations of keywords were used to perform the search in PubMed (neoantigen, tumor, neoepitope, epitope, peptidomes, peptidomics), mass spectrometry-related words, and restricted species (human cancer). Indexed papers have a limited time range (between January 2010 and June 2021). In order to ensure the reliability of the data, we manually checked each retrieved literature (31). Simply, the curated peptides were classified into three degrees of confidence according to verification methods (). We defined LC immunopeptidomes as raw peptides bound by HLA molecules and identified by MS; MC neoantigens were peptides with somatic mutations and verified by MS and whole exome sequencing (WES)/whole genome sequencing (WGS); and HC neoantigens were those immunogenic peptides validated by specific TCR recognition experiments (31). Neoantigen immunogenicity manifests the interaction of neoantigens HLA with TCR, which concretely performs as T-cell proliferation, T-cell activation, TNF release, IFN-γ release, granzyme B release, IL-2 release, and IL-10 release ().
Figure 1

The illustration of HC, MC neoantigens, and LC immunopeptidomes based on validation approaches. LC, low confidence; MC, medium confidence; HC, high confidence.

The illustration of HC, MC neoantigens, and LC immunopeptidomes based on validation approaches. LC, low confidence; MC, medium confidence; HC, high confidence.

Data Annotation

All screened papers were further read and manually curated, and each neoantigen peptide was provided with additional annotation. Original data details were integrated including neoantigen peptides, wild peptide sequence, peptide length, mutant position, HLA allele, gene name, cancer or tumor type, methods of verification, PubMed ID, and the reference links. Furthermore, we used NetMHCpan (v4.1) and NetMHCIIpan (v4.0) to provide mutated peptide affinity (half-maximal inhibitory concentration) IC50 (nM), %rank, and binding level, respectively (34). For class I peptides, peptides with %rank less than 0.5 will be considered as strong binding (%rank  ≤ 0.5, SB), 0.5  <  %rank  ≤  2 will be considered as weak binding (WB), and %rank > 2 will be considered as no binding (NB). For class II peptides, the top 2% of the predicted peptides will be identified as strong binder (%rank ≤2), then peptides will be considered as weak binder if the %rank is above 2 but below the specified 10 (2% < rank ≤ 10%) (35). To further study the immune response of T cells to neoantigens, we collected validated T-cell receptor-β (TCRβ) complementarity-determining region 3 (CDR3) sequences specific for tumor neoantigens (class I or II) and firstly constructed TCRβ-CDR3 sequence libraries. Eventually, we unified retrieved information and each entry was provided with cancer type, gene name, neoantigen sequence, CDR3 sequence, variable region of TCR (TRBV), diversity region of TCR (TRBD), and joining region of TCR (TRBJ) (36), as well as the reference links, if available in literatures.

New Tools DeepCNN-Ineo and BLASTdb Were Added in dbPepNeo2.0

Affinity HLA–peptide interactions have been reported to be positively correlated with immune responses (37). Thus, a web-based tool, DeepCNN-Ineo (deep-learning model for predicting immunogenicity of neoantigens based on convolutional neural network), was initially developed based on dbPepNeo2.0, which aims at further reducing false positive neoantigen peptides based on processing pipeline prediction and narrowing down of the scope of immunogenicity peptide validation. The quantitative immunogenic/non-immunogenic neoantigen peptides were collected from dbPepNeo2.0 and PubMed. To restrict the dataset for prediction, manually extracted neoantigen peptides from research articles need to be further processed. 9-mer- and 10-mer-length neoantigen peptides covering 97.5% of the total number of neoantigens (38) obtained from dbPepNeo2.0 were used as training datasets. All HLA molecular typing should have 4-digit alleles; the different HLA alleles with the same neoantigen peptides were considered as different neoantigens. A total of 583 immunogenic neoantigens and 2,200 non-immunogenic neoantigens were retained after removing duplicated data. By comparing different encoding strategies, we finally used the AAindex-encoding strategy to extract amino acid comprehensive physicochemical properties and HLA paratopes (, ), whose approach was successfully validated in different literatures (39–41). In addition, we considered the binding affinity between HLA–peptide pairs and the potential immunogenicity of peptide-HLA (42). Binding affinity was included into DeepCNN-Ineo; these mass spectrometry data contain not only information about peptide–MHC-binding events (34) but also information about the steps of the biological antigen presentation process. We took the %rank score of binding affinity as a highly reliable reference for neoantigen identification and then the predicted score of the immunogenicity model as a filter. Users can freely choose whether to refer to binding affinity (%rank) or not. Double filtering can increase the reliability of DeepCNN-Ineo prediction of neoantigen immunogenicity. Meanwhile, dbPepNeo2.0 integrated the BLASTdb tool into the database (43). The target sequence database, i.e., reference library, was established with HC or MC neoantigen peptides, while predicted neoantigen peptides were regarded as retrieval sequences. Then, the sequences of homology between predicted neoantigen peptides and the target sequence can be identified by BLASTdb. The structural framework of the dbPepNeo2.0 database is shown in .
Figure 2

dbPepNeo2.0 content and construction. HC NeoAgs: high-confidence neoantigens; MC NeoAgs: medium-confidence neoantigens; LC Peptidomes: low-confidence immunopeptidomes.

dbPepNeo2.0 content and construction. HC NeoAgs: high-confidence neoantigens; MC NeoAgs: medium-confidence neoantigens; LC Peptidomes: low-confidence immunopeptidomes.

Results

Expansion of Existing Data and Addition of New Data to dbPepNeo2.0

dbPepNeo2.0 provides a systematic, quality-controlled catalog of validated neoantigen peptides, TCRs, and HLA peptidomes, which distinguishes from other antigen databases such as IEDB and CAPD; the source of neoantigen data is shown (). HC neoantigens are high-confidence neoantigen peptides validated by specific TCR recognition. MC neoantigens are medium-confidence peptides, which involved somatic mutations and are verified by MS and WES/WGS. LC immunopeptidomes are raw peptides bound by HLA molecules and identified by MS. The data changes over these few years are summarized in dbPepNeo2.0 ( ). For this year’s release, dbPepNeo2.0 mainly contained both class I and class II neoantigens and TCRs. For class I peptides, HC neoantigen data types in dbPepNeo2.0 have been greatly increased. HC neoantigens grew from 295 HC neoantigens to 746 HC neoantigens, in which we firstly included 23 neoantigens derived from non-coding regions and 13 neoantigens derived from fusion genes. The number of class I MC neoantigens increased slightly from 247 to 251. dbPepNeo2.0 totally collected 28 immunopeptidomes datasets of human cancer; the number of LC immunopeptidomes grew from 407,794 to 720,782. For the newly added class II peptides, dbPepNeo2.0 firstly collected 55 HC neoantigens and 121,507 LC immunopeptidomes.
Figure 3

Data summary in dbPepNeo2.0. (A) The percentage of neoantigen data collected from different sources in dbPepNeo2.0. (B) Number of HC, MC neoantigens, and TCRs in two versions of dbPepNeo. (C) Number of LC immunopeptidomes in two versions of dbPepNeo. (D) Tumor type of class I HC neoantigens. (E) Tumor type of class II high-confidence neoantigens.

Data summary in dbPepNeo2.0. (A) The percentage of neoantigen data collected from different sources in dbPepNeo2.0. (B) Number of HC, MC neoantigens, and TCRs in two versions of dbPepNeo. (C) Number of LC immunopeptidomes in two versions of dbPepNeo. (D) Tumor type of class I HC neoantigens. (E) Tumor type of class II high-confidence neoantigens. Totally, the latest HC neoantigens (classes I and II) involved 30 cancer types and about 45% HC neoantigens were derived from melanoma ( ). This corresponds to a nearly 1.5-fold increase. Interestingly, we newly included 55 class II HC neoantigens that can induce CD4+ T-cell response ( ). Importantly, each neoantigen peptide provides original detailed details, including cancer type, gene name, HLA allele, wild-type peptide sequence, mutated peptide sequence, mutant position, peptide length, methods of verification, PubMed ID, and the reference links. Also in our database, we can further investigate the specific recognition of HLA–peptide complexes by TCRs. dbPepNeo2.0 firstly added a T-cell receptor-β (TCRβ) library, which encompassed 395 neoantigen TCRβ clonotypes isolated in CD8+ T cells and 235 neoantigen TCRβ clonotypes cloned from CD4+ T cells. Similarly, each entry was provided with cancer type, gene name, neoantigen sequence, CDR3 sequence, TRBV, TRBD, and TRBJ, as well as the reference links, if available in literatures. Overall, both tumor neoantigen coverage and immune response targets were significantly expanded in dbPepNeo2.0.

Statistics of the Collected HC, MC Neoantigens, and TCRs

Neoantigens that elicit T-cell responses represent the gold standard for developing vaccines. HC neoantigens are immunogenic peptides that were validated by specific TCR recognition. We further analyzed the neoantigens and TCR in dbPepNeo2.0; a total of 71 HLA types in HC neoantigens and MC neoantigens were found. Peptide binding to the HLA*A02:01 molecule accounted for the main proportion. The top 15 alleles with the most frequent HLA binding to class I and class II are shown in . Meanwhile, %rank was calculated by NetMHCpan (v4.1) and NetMHCIIpan (v4.0) for HLA I and II-restricted neoantigens. The results showed that the class I and class II neoantigens with high affinity to HLA molecules accounted for 79% and 43%, respectively (). The prediction performance of class II peptides binding to the MHC software tool needs to be improved. MC neoantigens were defined by MS and WES/WGS and contained somatic mutations, which are treated as potential neoantigens for applying immunotherapy. MS identification of directly eluted cancer-associated HLA peptides defined them as LC immunopeptidomes; these raw peptides are likely presentable to the tumor cell surface, but not guaranteed to elicit a potent T-cell response, until tested empirically. Furthermore, we counted the number and length of neoantigen-reactive CDR3 sequences and found that the majority of CDR3 sequences are composed of 15 amino acids ().
Figure 4

Statistics analysis of the collected HC, MC neoantigens, and TCRs in dbPepNeo2.0. (A, B) Top 15 HLA types binding to HC neoantigens. (C) Top 15 HLA types binding to MC neoantigens. (D) Affinity prediction with NetMHCpan of HC class I and class II neoantigens. (E) Length distribution of HLA-I neoantigen-reactive CDR3 sequences. (F) Length distribution of HLA-II neoantigen-reactive CDR3 sequences.

Statistics analysis of the collected HC, MC neoantigens, and TCRs in dbPepNeo2.0. (A, B) Top 15 HLA types binding to HC neoantigens. (C) Top 15 HLA types binding to MC neoantigens. (D) Affinity prediction with NetMHCpan of HC class I and class II neoantigens. (E) Length distribution of HLA-I neoantigen-reactive CDR3 sequences. (F) Length distribution of HLA-II neoantigen-reactive CDR3 sequences.

Interface Enhancements and New Features

To date, the quality of a database is dependent not only on the quality of their content but also on the user-friendliness of their interfaces. In version 2.0 of dbPepNeo, the database’s user interface has been significantly enhanced and expanded. The dbPepNeo2.0 interface has been redesigned; the web interface comprises 7 main pages: (I) Home, (II) Search, (III) BLASTdb, (IV) DeepCNN-Ineo, (V) Download, (VI) Document, and (VII) Contact us. On the Home page, users can quickly find information of tumor-specific neoantigens ( ). The choices of tumor types, HLA types, mut peptides (neoantigens), and PubMed ID are provided in drop-down menus to simplify the query. Also, users can precisely select the type of data to be retrieved by clicking on the search button, including class I neoantigens, class II neoantigens, and TCRs (). Due to the diverse spellings of many cancer types, neoantigen sequences and gene names can be complex or non-intuitive; dbPepNeo2.0 now supports an “intelligent” text search, which automatically provides possible options to supplement the input of incomplete names. Furthermore, the results from database queries have also been enhanced. Some important neoantigen peptides and TCR features are displayed on the result page; users can view complete information via the hyperlink on the ID. Detailed information included cancer information, sequence information, verification method, and reference (). Then the result page can be further queried by selecting text fields, and the text fields will be highlighted (red font) in the selected result page.
Figure 5

User interface of the dbPepNeo2.0 database; the home page includes seven features. (A) Global search is a quick search box; search options include cancer, gene, mut peptide (mutant peptide), and HLA allele. (B) The accurate search page for neoantigens; one can choose to search class I and II neoantigens or TCRs. (C) Search results for melanoma. (D) The four neoantigen prediction and study tools can be utilized on the home page.

User interface of the dbPepNeo2.0 database; the home page includes seven features. (A) Global search is a quick search box; search options include cancer, gene, mut peptide (mutant peptide), and HLA allele. (B) The accurate search page for neoantigens; one can choose to search class I and II neoantigens or TCRs. (C) Search results for melanoma. (D) The four neoantigen prediction and study tools can be utilized on the home page. In order to adapt to various needs and preferences of users, the neoantigen analytical tools have been modified for release 2.0 to allow four different types of neoantigen prediction and study tools developed by our group to be used (). On the home page, users can quickly utilize analysis tools: ProGeo-neo is a pipeline for proteogenomic neoantigen prediction using MS data (44), and INeo-Epp is a tool for predicting the immunogenicity of epitopes based on the characteristics of antigenic peptides (45). DeepCNN-Ineo and BLASTdb were firstly incorporated into dbPepNeo2.0. In DeepCNN-Ineo, collected data were divided into the following proportions: 60% for the training set, 20% for the validation set, and 20% for the independent test set (46). We used ROC and the normalized confusion matrix to assess the model of predictivity; the independent test set AUC was 0.779 ( , ), and the unnormalized confusion matrix demonstrated that most of the data were correctly classified, which indicated that the model had a good performance. Due to the number of collected neoantigen datasets being not large enough, the construction of this model is only a preliminary attempt in the database. To provide a useful tool for neoantigen selection, we will further optimize our model when the data volume is expanded in the future. Users can choose options to use the DeepCNN-Ineo on the web page. BLASTdb can be used to search sequence similarity. The output format of BLASTdb query results is custom format 6 (43); users can adjust the expected value threshold, word size, gap costs, and matrix to increase the sensitivity.

Application of dbPepNeo2.0

Broad-Spectrum Filtration of Neoantigens Using BLASTdb

Conventional computer prediction processes produce immeasurable candidate neoantigens, which is almost arduous for immunologists and clinicians to eliminate false positive predictive neoantigen peptides by means of experimental validation. Therefore, candidate neoantigens can be further filtered by our database. Eight patients with head and neck squamous cell carcinoma (HNSCC) were predicted to produce 113 candidate neoantigens; these 113 candidate neoantigens have been proved using TCR T cells, and two neoantigens have been proved to activate T cells to some extent (47). We obtained the data directly from the published literature without reperforming original data reanalysis. In this case, 113 validated neoantigens were input as query sequences for the BLASTdb tool. All HC neoantigens (801 immunogenic peptides) from dbPepNeo2.0 were used to construct the target sequence library. BLASTdb filter results showed that 16 candidate neoantigen peptides were similar to target sequences (16/113). Particularly, 16 candidate neoantigen peptides also contained 2 neoantigens confirmed by immunoassay experiments (). The matching degree of candidate neoantigen and target sequences ranged from 50% to 80% (). The number of filtered neoantigen data was reduced from 113 to 16; this is a much smaller range for potential further immunogenicity validation. Hence, the immunogenic peptides can be further screened by our database, which can greatly reduce the burden of subsequent experimental verification and significantly improve the accuracy of neoantigen prediction.
Figure 6

Overview of the results of neoantigen filtering by dbPepNeo2.0 workflow: ProGeo-neo, BLASTdb, and DeepCNN-Ineo. (A) Filter results of 113 neoantigens by BLASTdb. (B) Filter results of 369 peptides by BLASTdb and DeepCNN-Ineo.

Overview of the results of neoantigen filtering by dbPepNeo2.0 workflow: ProGeo-neo, BLASTdb, and DeepCNN-Ineo. (A) Filter results of 113 neoantigens by BLASTdb. (B) Filter results of 369 peptides by BLASTdb and DeepCNN-Ineo.

Neoantigen Prediction Using ProGeo-neo, BLASTdb, and DeepCNN-Ineo

ProGeo-neo, dbPepNeo2.0, and DeepCNN-Ineo systematically form a mining pipeline for screening tumor neoantigens. Previously, we used the ProGeo-neo pipeline to predict neoantigens from the Jurkat leukemia cell line; 655 candidate neoantigen peptides were generated (44). As an example, we only took the largest number of 9-mer length candidate neoantigens as the query sequence (369 candidate peptides). Similarly, 801 HC neoantigen peptides (class I and class II) were used as the target sequence library and obtained 26 peptides by BLASTdb filtering (26/369) (). Subsequently, we predicted the immunogenicity of 26 peptides from BLASTdb results using DeepCNN-Ineo (). The results showed that 6 mut peptides were identified as immunogenicity-high, and 12 mut peptides were identified as immunogenicity-low; 8 mut peptides were identified as non-immunogenic (). Therefore, in dbPepNeo2.0, we could further identify real neoantigens from p-MHC to reduce the scope for immunogenic peptide validation by using DeepCNN-Ineo (18/369). Compared with the original data, the range of neoantigens used for experimental validation is reduced.

Discussion and Conclusion

In this work, we updated dbPepNeo to dbPepNeo2.0. We greatly improved and supplemented the database content and optimized the web interface. dbPepNeo2.0 is committed to be an important reference database for tumor neoantigen research, which allows researchers to exploit neoantigens data with mass spectrometry evidence and experimental validation. Moreover, we expect that it can provide guidance on vaccine development in tumor immunotherapy. dbPepNeo2.0 newly added 13 immunogenic neoantigens derived from gene fusions and 23 neoantigens produced from non-coding regions from all available validated sources. The addition of new data further expanded the boundaries and applicability of neoantigens. In addition, immunogenic neoantigens can elicit CD8 T- or CD4+ T-cell responses and produce neoantigen-specific TCR clonotypes. For researchers, validated neoantigen-specific TCR sequences collected in dbPepNeo2.0 can be utilized to explore the specific recognition of neoantigens and accelerate the development of immunotherapy strategies such as TCR-T. The ultimate purpose of neoantigen presentation by HLA is that they are recognized by TCR and can induce antitumor immune responses (48). We preliminarily constructed a neoantigen immunogenic prediction tool DeepCNN-Ineo based on deep learning, which aims at providing a valuable tool for predicting the immunogenicity of neoantigen peptides. However, it was only a preliminary test on a small scope of fixed datasets from upstream approaches. Owing to the insufficient training data (immunogenic peptides), the performance of our model is not perfect. In addition, HLA*A2 accounts for a very high proportion of the data we can collect. In order to solve the problem of bias in HLA typing, we tried to construct a model with each type separately, but the results were unsatisfactory. The reason for this may be that the characteristics of neoantigens themselves prefer to bind to certain HLA, such as HLA*A02:01, and the amount of data we have is not enough to train a powerful model for all HLA alleles. It is noticeable that TCR sequences of neoantigen-specific recognition are the most important functional feature (39). A model containing TCR information is only suitable for patients with sufficient depth of TCR sequencing. Although high-throughput approaches for single-cell TCR sequencing have been developed, the techniques are still rarely performed in research and clinical settings (39). In addition, the heterogeneity of TCR sequence length and the complexity of cross recognition among TCR sequences both greatly increase the complexity of TCR prediction modeling. Therefore, we have not taken TCRs into account for the time being to predict neoantigen peptide immunogenicity. Predictive algorithms based on machine learning and artificial intelligence technologies need large training datasets (49), where the data type, quality, and quantity greatly influence the accuracy of prediction (50, 51). Therefore, the continuous accumulation of high-precision data is critical for model construction and optimization (2, 52, 53). Continuous expansion on the biological sources for tumor neoantigen and the necessity for constructing more accurate computational prediction algorithms together stand for the significance and importance of continuous updating of our dbPepNeo, a database for human tumor neoantigen peptides from mass spectrometry and TCR recognition.

Data Availability Statement

The original contributions presented in the study are included in the article/. Further inquiries can be directed to the corresponding author.

Author Contributions

LX conceived of the idea and planned and coordinated the entire project. LX, YL, and LC, supervised this study. XJ and XT contributed to the study design. JZ, ZL, YZ, and CL contributed to the data analysis. ML and LFX designed the web interface. LFX wrote the computer program and constructed the database. ML collected and curated data and drafted the manuscript. LX and XJ revised the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the National Natural Science Foundation of China under Grant [31870829] and Shanghai Municipal Health Commission Collaborative Innovation Cluster Project under Grant [2019CXJQ02].

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  52 in total

1.  Neopepsee: accurate genome-level prediction of neoantigens by harnessing sequence and amino acid immunogenicity information.

Authors:  S Kim; H S Kim; E Kim; M G Lee; E-C Shin; S Paik; S Kim
Journal:  Ann Oncol       Date:  2018-04-01       Impact factor: 32.976

Review 2.  Cancer Neoantigens.

Authors:  Ton N Schumacher; Wouter Scheper; Pia Kvistborg
Journal:  Annu Rev Immunol       Date:  2018-12-14       Impact factor: 28.527

Review 3.  Update on Tumor Neoantigens and Their Utility: Why It Is Good to Be Different.

Authors:  Chung-Han Lee; Roman Yelensky; Karin Jooss; Timothy A Chan
Journal:  Trends Immunol       Date:  2018-05-08       Impact factor: 16.687

Review 4.  Personalized neoantigen vaccines: A new approach to cancer immunotherapy.

Authors:  Amanda R Aldous; Jesse Z Dong
Journal:  Bioorg Med Chem       Date:  2017-10-19       Impact factor: 3.641

5.  Personal neoantigen vaccines induce persistent memory T cell responses and epitope spreading in patients with melanoma.

Authors:  Zhuting Hu; Donna E Leet; Rosa L Allesøe; Giacomo Oliveira; Shuqiang Li; Adrienne M Luoma; Jinyan Liu; Juliet Forman; Teddy Huang; J Bryan Iorgulescu; Rebecca Holden; Siranush Sarkizova; Satyen H Gohil; Robert A Redd; Jing Sun; Liudmila Elagina; Anita Giobbie-Hurder; Wandi Zhang; Lauren Peter; Zoe Ciantra; Scott Rodig; Oriol Olive; Keerthi Shetty; Jason Pyrdol; Mohamed Uduman; Patrick C Lee; Pavan Bachireddy; Elizabeth I Buchbinder; Charles H Yoon; Donna Neuberg; Bradley L Pentelute; Nir Hacohen; Kenneth J Livak; Sachet A Shukla; Lars Rønn Olsen; Dan H Barouch; Kai W Wucherpfennig; Edward F Fritsch; Derin B Keskin; Catherine J Wu; Patrick A Ott
Journal:  Nat Med       Date:  2021-01-21       Impact factor: 53.440

6.  Generation of neoantigen-specific T cells for adoptive cell transfer for treating head and neck squamous cell carcinoma.

Authors:  Teng Wei; Matthias Leisegang; Ming Xia; Kazuma Kiyotani; Ning Li; Chenquan Zeng; Chunyan Deng; Jinxing Jiang; Makiko Harada; Nishant Agrawal; Liangping Li; Hui Qi; Yusuke Nakamura; Lili Ren
Journal:  Oncoimmunology       Date:  2021-05-25       Impact factor: 8.110

7.  Mass spectrometry of human leukocyte antigen class I peptidomes reveals strong effects of protein abundance and turnover on antigen presentation.

Authors:  Michal Bassani-Sternberg; Sune Pletscher-Frankild; Lars Juhl Jensen; Matthias Mann
Journal:  Mol Cell Proteomics       Date:  2015-01-09       Impact factor: 5.911

8.  Efficient identification of neoantigen-specific T-cell responses in advanced human ovarian cancer.

Authors:  Song Liu; Junko Matsuzaki; Lei Wei; Takemasa Tsuji; Sebastiano Battaglia; Qiang Hu; Eduardo Cortes; Laiping Wong; Li Yan; Mark Long; Anthony Miliotto; Nicholas W Bateman; Shashikant B Lele; Thinle Chodon; Richard C Koya; Song Yao; Qianqian Zhu; Thomas P Conrads; Jianmin Wang; George L Maxwell; Amit A Lugade; Kunle Odunsi
Journal:  J Immunother Cancer       Date:  2019-06-20       Impact factor: 12.469

9.  DeepHLApan: A Deep Learning Approach for Neoantigen Prediction Considering Both HLA-Peptide Binding and Immunogenicity.

Authors:  Jingcheng Wu; Wenzhe Wang; Jiucheng Zhang; Binbin Zhou; Wenyi Zhao; Zhixi Su; Xun Gu; Jian Wu; Zhan Zhou; Shuqing Chen
Journal:  Front Immunol       Date:  2019-11-01       Impact factor: 7.561

View more
  1 in total

1.  NAIRscore as a biomarker for the quality of immune response to neoantigens is related with an increased overall survival in multiple myeloma.

Authors:  Xingxing Jian; Linfeng Xu; Jingjing Zhao; Yanhui Wang; Wen Zhou; Lu Xie
Journal:  Mol Ther Nucleic Acids       Date:  2022-07-12       Impact factor: 10.183

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.