| Literature DB >> 28484631 |
Zhan Zhou1, Xingzheng Lyu2, Jingcheng Wu1, Xiaoyue Yang1, Shanshan Wu1, Jie Zhou1, Xun Gu3, Zhixi Su4, Shuqing Chen1.
Abstract
Tumour antigens have attracted much attention because of their importance to cancer diagnosis, prognosis and targeted therapy. With the development of cancer genomics, the identification of tumour-specific neoantigens became possible, which is a crucial step for cancer immunotherapy. In this study, we developed software called the tumour-specific neoantigen detector for detecting cancer somatic mutations following the best practices of the genome analysis toolkit and predicting potential tumour-specific neoantigens, which could be either extracellular mutations of membrane proteins or mutated peptides presented by class I major histocompatibility complex molecules. This pipeline was beneficial to the biologist with little programmatic background. We also applied the software to the somatic mutations from the International Cancer Genome Consortium database to predict numerous potential tumour-specific neoantigens. This software is freely available from https://github.com/jiujiezz/tsnad.Entities:
Keywords: cancer somatic mutation; major histocompatibility complex; membrane protein; neoantigen; tumour antigen
Year: 2017 PMID: 28484631 PMCID: PMC5414268 DOI: 10.1098/rsos.170050
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1.Mutant peptides with 21 amino acids and corresponding 8–11 mer peptides. MHC molecules always bind to 8–11 mer peptides, so we extracted peptides 21 amino acids in length, with 10 amino acids upstream and 10 amino acids downstream of mutation sites for NetMHCpan prediction. The number 11 in red indicates the mutated site, and the peptides in yellow represent all the possible peptides which may bind to MHC molecules.
Figure 2.The software pipeline of TSNAD. The pipeline performs best practices for somatic SNVs and INDELs in whole-genome/exome sequence with GATK. Then, we extracted the extracellular mutations of membrane proteins according to the protein topology, and invoked NetMHCpan to predict the binding information of mutant peptides to class I MHC molecules.
Top 20 most frequent extracellular mutations in 9155 donors.
| Chr | Pos | ID | gene | DNA mutation | protein mutation | mutation frequency |
|---|---|---|---|---|---|---|
| 3 | 195505836 | MU10935 | MUC4 | 12615C>G | H4205Q | 44 out of 9155 |
| 1 | 29138975 | MU68226 | OPRD1 | 80G>T | C27F | 25 out of 9155 |
| 1 | 120611960 | MU869951 | NOTCH2 | 61G>A | A21T | 23 out of 9155 |
| 19 | 1065018 | MU68245 | ABCA7 | 6133G>T | A2045S | 18 out of 9155 |
| 15 | 22369378 | MU4380351 | OR4M2 | 803C>T | S268F | 16 out of 9155 |
| 17 | 37868208 | MU85975 | ERBB2 | 929C>T | S310F | 15 out of 9155 |
| 3 | 195509676 | MU586249 | MUC4 | 8775G>C | Q2925H | 15 out of 9155 |
| 7 | 55233043 | MU589341 | EGFR | 1793G>T | G598 V | 15 out of 9155 |
| 3 | 195515449 | MU605883 | MUC4 | 3002T>A | V1001E | 14 out of 9155 |
| 19 | 9072091 | MU4382243 | MUC16 | 15355C>T | P5119S | 13 out of 9155 |
| 20 | 17639816 | MU4585427 | RRBP1 | 1337A>C | Q446P | 12 out of 9155 |
| 5 | 179071958 | MU4110168 | C5orf60 | 64G>C | D22H | 12 out of 9155 |
| 7 | 146829338 | MU4413315 | CNTNAP2 | 1085G>A | G362E | 12 out of 9155 |
| 1 | 158261127 | MU4408485 | CD1C | 265C>T | R89C | 11 out of 9155 |
| 11 | 5345040 | MU4383907 | OR51B2 | 488C>T | S163 L | 11 out of 9155 |
| 17 | 21319519 | MU613603 | KCNJ12 | 865G>C | E289Q | 11 out of 9155 |
| 2 | 46707884 | MU70561 | TMEM247 | 458A>G | Q153R | 11 out of 9155 |
| 2 | 137814319 | MU4440003 | THSD7B | 469G>A | E157 K | 11 out of 9155 |
| 3 | 195511286 | MU4617526 | MUC4 | 7165G>A | D2389N | 11 out of 9155 |
| 7 | 139167934 | MU66261 | KLRG2 | 455A>C | K152T | 11 out of 9155 |
Sixty five potential common neoantigens and their corresponding genes and mutation frequency.
| gene | role in tumour | no. mutation | no. neoantigen |
|---|---|---|---|
| KRAS | oncogene | 5 | 11 |
| PIK3CA | oncogene | 5 | 21 |
| TP53 | tumour suppressor gene | 3 | 8 |
| SF3B1 | tumour-related gene | 1 | 2 |
| MUC4 | — | 1 | 1 |
| CHEK2 | tumour suppressor gene | 1 | 2 |
| PTEN | tumour suppressor gene | 2 | 3 |
| FAM194B | — | 1 | 2 |
| OPRD1 | — | 1 | 5 |
| CTNNB1 | oncogene | 1 | 5 |
| FRG1 | — | 1 | 4 |
| GNAS | tumour-related gene | 1 | 1 |
Top 10 neoantigens with the highest mutation frequency in 9155 donors.
| gene | HLA allele | position | peptide | mutation | affinity (nM) | mutation frequency |
|---|---|---|---|---|---|---|
| KRAS | HLA-A*02:01 | 8 | KLVVVGADGV | G12D | 214 | 322 out of 9155 |
| KRAS | HLA-A*02:01 | 8 | KLVVVGAVGV | G12 V | 112 | 239 out of 9155 |
| KRAS | HLA-A*02:01 | 8 | KLVVVGAV | G12 V | 163 | 239 out of 9155 |
| KRAS | HLA-B*40:01 | 11 | TEYKLVVVGAV | G12 V | 90 | 239 out of 9155 |
| KRAS | HLA-C*03:04 | 3 | GAVGVGKSAL | G12 V | 172 | 239 out of 9155 |
| KRAS | HLA-C*03:03 | 3 | GAVGVGKSAL | G12 V | 172 | 239 out of 9155 |
| PIK3CA | HLA-C*07:02 | 2 | ARHGGWTTKM | H1047R | 218 | 200 out of 9155 |
| PIK3CA | HLA-C*06:02 | 3 | ARHGGWTTKM | H1047R | 457 | 200 out of 9155 |
| PIK3CA | HLA-C*07:01 | 2 | ARHGGWTTKM | H1047R | 249 | 200 out of 9155 |
| PIK3CA | HLA-A*11:01 | 11 | STRDPLSEITK | E545 K | 81 | 182 out of 9155 |
Figure 3.The distribution of tumour-specific neoantigens across 16 HLA types and 20 tumour types. (a) The number of tumour-specific neoantigens with each HLA type is shown in decreasing order. The dashed line indicates the average number of neoantigens. (b) Distribution of tumour-specific neoantigens across 20 tumour types. The width of each violin indicates the proportion of donors sharing a certain number of neoantigens in each tumour type. Upper limit and lower limit of white bar and the black line in it denote upper quartile, lower quartile and median number for each type.
Figure 4.Specific binding of mutant peptide of TP53 to HLA-A*02:01. Blank control: FITC-goat anti-mouse IgG + T2 cells; negative control: human beta-2-microglobulin were incubated with T2 cells overnight at 37°C + W6/32 + FITC-goat anti-mouse IgG; wide-type (WT) peptide binding affinity analysis: WT peptide (GMNRRPILTII) and human beta-2-microglobulin were incubated with T2 cells overnight at 37°C + W6/32 + FITC-goat anti-mouse IgG; mutated peptide binding affinity analysis: mutated peptide (GMNWRPILTII) and human beta-2-microglobulin were incubated with T2 cells overnight at 37°C + W6/32 + FITC-goat anti-mouse IgG.