| Literature DB >> 33927717 |
Jiaqi Xia1, Peng Bai1, Weiliang Fan1, Qiming Li1, Yongzheng Li1, Dehe Wang1, Lei Yin1, Yu Zhou1,2.
Abstract
T-cell recognition of somatic mutation-derived cancer neoepitopes can lead to tumor regression. Due to the difficulty to identify effective neoepitopes, constructing a database for sharing experimentally validated cancer neoantigens will be beneficial to precise cancer immunotherapy. Meanwhile, the routine neoepitope prediction in silico is important but laborious for clinical use. Here we present NEPdb, a database that contains more than 17,000 validated human immunogenic neoantigens and ineffective neoepitopes within human leukocyte antigens (HLAs) via curating published literature with our semi-automatic pipeline. Furthermore, NEPdb also provides pan-cancer level predicted HLA-I neoepitopes derived from 16,745 shared cancer somatic mutations, using state-of-the-art predictors. With a well-designed search engine and visualization modes, this database would enhance the efficiency of neoantigen-based cancer studies and treatments. NEPdb is freely available at http://nep.whu.edu.cn/.Entities:
Keywords: HLA; MHC; T-cell; cancer immunotherapy; neoantigen; neoepitope; somatic mutation
Year: 2021 PMID: 33927717 PMCID: PMC8078594 DOI: 10.3389/fimmu.2021.644637
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1NEPdb content and construction. (A) Manual collection and curation of experimentally validated neoepitope and non-immunogenic peptides information. (B) Computational analysis of HLA-peptides for pan-cancer gene mutations. (C) Database implementation and overview of the web interface.
Data statistics in NEPdb.
| Data content | HLA-I data statisticsa | HLA-II data statisticsb | Total |
|---|---|---|---|
| Entry (Total) | 12,239 | 5,310 | 17,549 |
| Entry (Positive)c | 155 | 18 | 173 |
| Entry (Negative)d | 12,084 | 5,292 | 17,376 |
| Tumor type | 22 | 11 | 23 |
| HLA Allele | 60 | 35 | 95 |
| Gene | 2,063 | 811 | 2,068 |
| Protein sequence | 2,332 | 895 | 2,337 |
aNumber of peptides which bind to HLA-I alleles. bNumber of peptides which bind to HLA-II alleles. cImmunogenic neoepitope entries. dIneffective neopeptide-HLA entries.
Figure 2Statistics of experimentally validated data in NEPdb. (A) The distribution of HLA alleles. (B) Top 20 genes with the most validated neoepitope and non-immunogenic peptides in NEPdb.
Figure 3A comprehensive view of NEPdb search and query interface. (A) Search options include HLA locus, response, tumor type, sequence, and gene. (B) Query results include wild peptide, mutation peptide, mutation AA (amino acid), gene symbol, HLA type, literature, etc. (C) Details of a selected peptide from the results table. (D-F) Result pages of full protein sequence (D), clinical trial (E), and experimental information (F).