| Literature DB >> 30223042 |
Jingcheng Wu1, Wenyi Zhao2, Binbin Zhou3, Zhixi Su4, Xun Gu5, Zhan Zhou6, Shuqing Chen7.
Abstract
Tumor-specific neoantigens have attracted much attention since they can be used as biomarkers to predict therapeutic effects of immune checkpoint blockade therapy and as potential targets for cancer immunotherapy. In this study, we developed a comprehensive tumor-specific neoantigen database (TSNAdb v1.0), based on pan-cancer immunogenomic analyses of somatic mutation data and human leukocyte antigen (HLA) allele information for 16 tumor types with 7748 tumor samples from The Cancer Genome Atlas (TCGA) and The Cancer Immunome Atlas (TCIA). We predicted binding affinities between mutant/wild-type peptides and HLA class I molecules by NetMHCpan v2.8/v4.0, and presented detailed information of 3,707,562/1,146,961 potential neoantigens generated by somatic mutations of all tumor samples. Moreover, we employed recurrent mutations in combination with highly frequent HLA alleles to predict potential shared neoantigens across tumor patients, which would facilitate the discovery of putative targets for neoantigen-based cancer immunotherapy. TSNAdb is freely available at http://biopharm.zju.edu.cn/tsnadb.Entities:
Keywords: Cancer immunotherapy; Database; Human leukocyte antigen; Neoantigen; Somatic mutation
Mesh:
Substances:
Year: 2018 PMID: 30223042 PMCID: PMC6203688 DOI: 10.1016/j.gpb.2018.06.003
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Figure 1An overview of the TSNAdb web interface
A. TSNAdb comprises five main components: (i) Home; (ii) Browse; (iii) Search; (iv) Validation, and (v) Download. The distribution of HLA alleles (B) and somatic missense mutations (C) extracted from TCGA and TCIA are listed. D. The average neoantigen loads across different tissues. E. Top 20 genes with predicted neoantigens in 7748 samples. F. The result of ‘Search’ page under the selection of “NetMHCpan2.8, KRAS, and TCGA”. G. The result of ‘Search’ page under the selection of “NetMHCpan2.8, KRAS, and ICGC”. H. Example of validation data from IEDB in ‘Validation’ page (top) and partial information on ‘Download’ page (bottom). The predicted binding level ‘Strong’ indicates strong binding with IC50 < 150 nM. TCGA, The Cancer Genome Atlas; TCIA, The Cancer Immunome Atlas; HLA, human leukocyte antigen; TSNAD, tumor-specific neoantigen detector; ICGC: International Cancer Genome Consortium; IEDB, Immune Epitope Database; WT, wild type.
Top 10 shared neoantigens of 7748 tumor samples from TCGA
| V600E | A03:01 | KIGDFGLATVK | 94.09 | KIGDFGLATEK | 125.24 | 117/7748 | |
| G12D | A02:01 | KLVVVGAGGV | 520.08 | KLVVVGADGV | 213.82 | 82/7748 | |
| G12V | A02:01 | KLVVVGAGGV | 520.08 | KLVVVGAVGV | 111.87 | 71/7748 | |
| G12V | A02:01 | KLVVVGAG | 17,690.28 | KLVVVGAV | 162.97 | 71/7748 | |
| V600E | A11:01 | KIGDFGLATVK | 53.27 | KIGDFGLATEK | 45.20 | 68/7748 | |
| H1047R | C07:01 | AHHGGWTTKM | 6742.50 | ARHGGWTTKM | 248.57 | 62/7748 | |
| H1047R | C07:02 | AHHGGWTTKM | 2596.23 | ARHGGWTTKM | 217.76 | 56/7748 | |
| E545K | A03:01 | STRDPLSEITE | 28,265.76 | STRDPLSEITK | 321.19 | 54/7748 | |
| V600E | B57:01 | FGLATVKSRW | 128.34 | FGLATEKSRW | 246.23 | 41/7748 | |
| V600E | B57:01 | LATVKSRW | 73.82 | LATEKSRW | 124.61 | 41/7748 |
Note: WT, wild type; MT, mutant. Amino acid residue changes caused by somatic mutations are indicated in red.
Frequency of the top 10 shared neoantigens predicted by recurrent mutations in combination with highly frequent HLA alleles from TCGA
| V600E | A03:01 | 1.55% | 1.51% | |
| G12D | A02:01 | 1.01% | 1.06% | |
| H1047R | C07:01 | 0.73% | 0.80% | |
| E545K | A03:01 | 0.68% | 0.70% | |
| E542K | A03:01 | 0.44% | 0.44% | |
| R248W | A02:01 | 0.33% | 0.34% | |
| R273C | A02:01 | 0.29% | 0.31% | |
| R248Q | C07:02 | 0.25% | 0.23% | |
| Y220C | A02:01 | 0.24% | 0.19% | |
| R88Q | C07:02 | 0.16% | 0.17% |
Note: Expected frequency indicates the frequency of shared neoantigens predicted by recurrent mutations in combination with highly frequent HLA alleles. Observed frequency, the frequency of shared neoantigens in 7748 tumor samples.
Figure 2Example applications of predicted neoantigens for bladder cancer and the gene
Top 20 HLA alleles (A) and genes (B) with predicted neoantigens are displayed in the page using bladder cancer as an example, with the detailed neoantigen information listed (C). The binding level ‘Strong’ indicates strong binding with IC50 < 150 nM, ‘Weak’ indicates weak binding with 150 nM < IC50 < 500 nM, ‘-’ indicates non-binding with IC50 > 500 nM. D. Distribution of the predicted neoantigens for all combinations of recurrent mutations of TP53 and the highly frequent HLA alleles according to the TCGA dataset. The color gradient indicates the frequencies of potential shared neoantigens for the specific combinations of somatic mutations and HLA alleles. All the data shown are predicted by NetMHCpan v2.8. HLA, human leukocyte antigen; TCGA, The Cancer Genome Atlas; WT, wild type; MT, mutant.