| Literature DB >> 32733937 |
Chao Chen1,2,3, Songming Liu1,2,3, Ruokai Qu1,2,3, Bo Li1,3.
Abstract
This study was aimed at investigating the mutations in colorectal cancer (CRC) for recurrent neoantigen identification. A total of 1779 samples with whole exome sequencing (WES) data were obtained from 7 published CRC cohorts. Common HLA genotypes were used to predict the probability of neoantigens at high-frequency mutants in the dataset. Based on the WES data, we not only obtained the most comprehensive CRC mutation landscape so far but also found 1550 mutations which could be identified in at least 5 patients, including KRAS G12D (8%), KRAS G12V (5.8%), PIK3CA E545K (3.5%), PIK3CA H1047R (2.5%), and BMPR2 N583Tfs∗44 (2.8%). These mutations can also be recognized by multiple common HLA molecules in Chinese and TCGA cohort as potential "public" neoantigens. Many of these mutations also have high mutation rates in metastatic pan-cancers, suggesting their value as therapeutic targets in different cancer types. Overall, our analysis provides recurrent neoantigens as potential cancer immunotherapy targets.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32733937 PMCID: PMC7383341 DOI: 10.1155/2020/2861240
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Summary of clinical information of CRC cohort, including patients from seven studies.
| Characteristic | Baylor | Beijing | COCA-CN | Genetech | Harvard | TCGA | Texas | Total |
|---|---|---|---|---|---|---|---|---|
| ( | ( | ( | ( | ( | ( | ( | ( | |
| Age (years) | ||||||||
| <60 | 38 (34.5%) | 48 (49.0%) | 155 (48.3%) | 0 (0%) | 67 (10.8%) | 158 (29.9%) | 0 (0%) | 466 (26.2%) |
| ≥60 | 70 (63.6%) | 50 (51.0%) | 166 (51.7%) | 0 (0%) | 550 (88.9%) | 366 (69.3%) | 0 (0%) | 1202 (67.6%) |
| Unknown | 2 (1.8%) | 0 (0%) | 0 (0%) | 74 (100%) | 2 (0.3%) | 4 (0.8%) | 29 (100%) | 111 (6.2%) |
| Sex | ||||||||
| Female | 65 (59.1%) | 50 (51.0%) | 127 (39.6%) | 0 (0%) | 380 (61.4%) | 253 (47.9%) | 15 (51.7%) | 890 (50.0%) |
| Male | 45 (40.9%) | 48 (49.0%) | 194 (60.4%) | 0 (0%) | 239 (38.6%) | 273 (51.7%) | 14 (48.3%) | 813 (45.7%) |
| Unknown | 0 (0%) | 0 (0%) | 0 (0%) | 74 (100%) | 0 (0%) | 2 (0.4%) | 0 (0%) | 76 (4.3%) |
| Stage | ||||||||
| I | 12 (10.9%) | 10 (10.2%) | 40 (12.5%) | 0 (0%) | 152 (24.6%) | 94 (17.8%) | 0 (0%) | 308 (17.3%) |
| II | 42 (38.2%) | 44 (44.9%) | 94 (29.3%) | 0 (0%) | 187 (30.2%) | 196 (37.1%) | 0 (0%) | 563 (31.6%) |
| III | 48 (43.6%) | 39 (39.8%) | 130 (40.5%) | 0 (0%) | 159 (25.7%) | 150 (28.4%) | 1 (3.4%) | 527 (29.6%) |
| IV | 8 (7.3%) | 4 (4.1%) | 56 (17.4%) | 0 (0%) | 65 (10.5%) | 69 (13.1%) | 28 (96.6%) | 230 (12.9%) |
| Unknown | 0 (0%) | 1 (1.0%) | 1 (0.3%) | 74 (100%) | 56 (9.0%) | 19 (3.6%) | 0 (0%) | 151 (8.5%) |
| MSI status | ||||||||
| MSI-H | 24 (21.8%) | 8 (8.2%) | 0 (0%) | 15 (20.3%) | 91 (14.7%) | 65 (12.3%) | 0 (0%) | 203 (11.4%) |
| MSI-L | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 77 (14.6%) | 0 (0%) | 77 (4.3%) |
| MSS | 81 (73.6%) | 32 (32.7%) | 0 (0%) | 59 (79.7%) | 438 (70.8%) | 346 (65.5%) | 29 (100%) | 985 (55.4%) |
| Unknown | 5 (4.5%) | 58 (59.2%) | 321 (100%) | 0 (0%) | 90 (14.5%) | 40 (7.6%) | 0 (0%) | 514 (28.9%) |
Figure 1The mutation landscape in the CRC cohort. (a) From left to right, counts of each variant classification, counts of each variant type, and counts of each SNV class. (b) From left to right, variant number per sample, variant classification, and top 10 significantly mutated genes.
Top ten SNVs and the corresponding neoantigens in the CRC cohort.
| Chr | Location | Gene | AA change | Peptide | Frequency | HLA types |
|---|---|---|---|---|---|---|
| chr12 | 25398284 |
| G12D | VVVGADGVGK | 143 | A11:01 |
| chr12 | 25398284 |
| G12V | VVGAVGVGK | 104 | A11:01 |
| chr3 | 1.79E+08 |
| E545K | STRDPLSEITK | 63 | A03:01; A11:01 |
| chr3 | 1.79E+08 |
| E545K | ITKQEKDFLW | 63 | B57:01 |
| chr3 | 1.79E+08 |
| H1047R | ARHGGWTTK | 45 | B27:05 |
| chr3 | 1.79E+08 |
| R88Q | REEFFDETRQL | 30 | B40:01 |
| chr2 | 70315174 |
| L100Q | RPPVTQRLVV | 28 | B07:02 |
| chr2 | 70315174 |
| L100Q | SRPPVTQRL | 28 | C06:02; C07:01; C07:02 |
| chr22 | 29091840 |
| K373E | SEILGETSL | 21 | B18:01; B40:01 |
| chr12 | 25398284 |
| G12A | VVVGAAGVGK | 19 | A11:01 |
Top ten indels and the corresponding frequency in the CRC cohort.
| Chr | Location | Gene | AA change | Frequency |
|---|---|---|---|---|
| chr2 | 203420130 |
| N583Tfs∗44 | 50 |
| chr10 | 890939 |
| T163Hfs∗47 | 37 |
| chr1 | 1290110 |
| R301Gfs∗107 | 31 |
| chr18 | 34205516 |
| S336Vfs∗138 | 29 |
| chr15 | 45003781 |
| L15Ffs∗41 | 28 |
| chr12 | 110019240 |
| A141Rfs∗18 | 27 |
| chr3 | 168833257 |
| G614Efs∗30 | 26 |
| chr22 | 20130522 |
| T459Rfs∗177 | 20 |
| chr8 | 103289349 |
| E2121Kfs∗28 | 20 |
| chr6 | 158508009 |
| P1113Lfs∗5 | 19 |
Figure 2The comparison of neoantigens between different subgroups: (a) between different MSI statuses; (b) between age ≥ 60 and age < 60 groups; (c) between female and male groups; (d) between different stages. These analyses excluded patients with unknown subtypes.
Figure 3Mutational spectrum of KRAS (a) and PIK3CA (b) in 1179 CRC patients.
Figure 4Mutation frequency in MSK-IMPACT cohorts. KRAS G12D (a), KRAS G12V (b), PIK3CA E545K (c), and PIK3CA H1047R (d) in MSK-IMPACT pan-cancer cohorts.