| Literature DB >> 31949492 |
Wenbing Guo1, Yelin Fu1, Liangliang Jin1, Kai Song1, Ruihan Yu1, Tianhao Li1, Lishuang Qi1, Yunyan Gu1, Wenyuan Zhao1, Zheng Guo1,2,3.
Abstract
The clinical applicability of the whole-exome sequencing (WES) in estimating tumor mutational burden (TMB) is currently limited by high cost, time-consuming and tissue availability. And given to the differences in the mutational landscapes among different types of cancer, we aimed to develop a cancer-specific signature to estimate TMB for right-sided colon cancer patients (RCC). Using WES data of 315 RCC patients, we identified the exons in which the number of mutational sites of the coding DNA sequences associated with TMB through linear regression analysis. Then, among these exons, we extracted a signature composed by 102 exons (~0.13 Mbp) through a heuristic selection procedure. The TMB estimated by the signature was highly correlated with those calculated by WES in the discovery dataset (R2=0.9869) and three independent validation datasets (R2=0.9351, R2=0.8063 and R2=0.9527, respectively). And the performance of the signature was superior to a colorectal-specific TMB estimation model contained 22 genes (~0.24 Mbp). Moreover, between TMB-high and TMB-low RCC patients, there were significantly differences in the frequencies of microsatellite instability status, CpG island methylator phenotype, BRAF, KRAS and POLE/POLD1 mutation status (p<0.01). However, the performances of the signature in other types of cancer were dramatically degraded (left-sided colon cancer, R2=0.7849 and 0.9407, respectively; rectum, R2=0.5955 and R2=0.965, respectively; breast cancer, R2=0.8444; lung cancer, R2=0.5963), suggesting that it was necessary to develop cancer-specific TMB estimated signatures to estimate precisely the TMB in different types of cancer. In summary, we developed an exon signature that can accurately estimate TMB in RCC patients, and the cost and time required for the assessment of TMB can be considerably decreased, making it more suitable for blood and/or biopsy samples. © The author(s).Entities:
Keywords: a cancer-specific signature; the coding DNA sequences; the right-sided colon cancer; tumor mutational burden
Year: 2020 PMID: 31949492 PMCID: PMC6959028 DOI: 10.7150/jca.34363
Source DB: PubMed Journal: J Cancer ISSN: 1837-9664 Impact factor: 4.207
Description of whole-exome sequencing mutational data analyzed in this study
| Cancer type | Datasets | |||||
|---|---|---|---|---|---|---|
| TCGA | Giannakis | Vasaikar | Seshagiri | |||
| Colorectal cancer | Sample_number | 512 | 618 | 57 | 72 | |
| Location | Right | 225 | 315 | 57 | - | |
| Left | 150 | 166 | - | - | ||
| Rectum | 137 | 137 | - | - | ||
| MSI status of RCC samples | MSS | - | 182 | 39 | - | |
| MSI | - | 82 | 18 | - | ||
| NA | - | 51 | - | - | ||
| Breast cancer | Sample_number | 986 | - | - | ||
| Lung cancer | Sample_number | 537 | - | - | ||
Note: the Vasaikar dataset only include missense mutations.
Figure 2The process of the development and analysis of the exon signature for patients with RCC.
Molecular differences between TMB-high and TMB-low patients
| Giannakis dataset | TMB-low (N=219) | TMB-high (N=96) | |||
|---|---|---|---|---|---|
| MSI* | MSI | 6 | 76 | <2.2E-16 | |
| MSS | 178 | 4 | |||
| NA | 35 | 16 | |||
| CIMP* | CIMP_High | 27 | 59 | <2.2E-16 | |
| CIMP_Low | 147 | 17 | |||
| NA | 45 | 20 | |||
| BRAF | mutation | 41 | 64 | 3.64E-16 | |
| wild-type | 178 | 32 | |||
| KRAS | mutation | 88 | 86 | 3.76E-08 | |
| wild-type | 131 | 10 | |||
| POLE | mutation | 4 | 25 | 8.73E-11 | |
| wild-type | 215 | 71 | |||
| POLD1 | mutation | 8 | 14 | 1.12E-03 | |
| wild-type | 211 | 82 | |||
| BRAF | mutation | 10 | 40 | <2.2E-16 | |
| wild-type | 146 | 29 | |||
| KRAS | mutation | 91 | 22 | 2.90E-04 | |
| wild-type | 65 | 47 | |||
| POLE | mutation | 1 | 22 | 6.12E-12 | |
| wild-type | 155 | 47 | |||
| POLD1 | mutation | 4 | 21 | 6.59E-09 | |
| wild-type | 152 | 48 | |||
| MSI | MSI | 0 | 18 | 4.29E-12 | |
| MSS | 36 | 3 | |||
| BRAF | mutation | 1 | 14 | 1.92E-07 | |
| wild-type | 35 | 7 | |||
| KRAS | mutation | 23 | 1 | 6.97E-06 | |
| wild-type | 13 | 20 | |||
| POLE | mutation | 0 | 8 | 1.23E-04 | |
| wild-type | 36 | 13 | |||
| POLD1 | mutation | 1 | 7 | ||
| wild-type | 35 | 14 | 2.66E-03 | ||
Note: *A sample was assigned to NA group if its MSI status (or CIMP phenotype) information was not obtained in the Giannakis dataset. The MSI and CIMP phenotype information were not available in the TCGA dataset. And the Vasaikar dataset missed the CIMP phenotype information as well. The Seshagiri dataset was not used to perform the molecular differences because of the lack of specific location information. p value was calculated by a two-tailed Fisher's exact test.