| Literature DB >> 29937994 |
Zhe Liu1,2, Chao Yang3, Xiangchun Li3, Wen Luo3, Bhaskar Roy3, Teng Xiong3, Xiuqing Zhang3, Huanming Yang3,4, Jian Wang3,4, Zhenhao Ye5, Yang Chen5, Jinghe Song6, Shuai Ma6, Yong Zhou3, Min Yang1,7, Xiaodong Fang3, Jie Du1.
Abstract
Colorectal cancer is the fifth prevalent cancer in China. Nevertheless, a large-scale characterization of Chinese colorectal cancer mutation spectrum has not been carried out. In this study, we have performed whole exome-sequencing analysis of 98 patients' tumor samples with matched pairs of normal colon tissues using Illumina and Complete Genomics high-throughput sequencing platforms. Canonical CRC somatic gene mutations with high prevalence (>10%) have been verified, including TP53, APC, KRAS, SMAD4, FBXW7 and PIK3CA. PEG3 is identified as a novel frequently mutated gene (10.6%). APC and Wnt signaling exhibit significantly lower mutation frequencies than those in TCGA data. Analysis with clinical characteristics indicates that APC gene and Wnt signaling display lower mutation rate in lymph node positive cancer than negative ones, which are not observed in TCGA data. APC gene and Wnt signaling are considered as the key molecule and pathway for colorectal cancer initiation, and these findings greatly undermine their importance in tumor progression for Chinese patients. Taken together, the application of next-generation sequencing has led to the determination of novel somatic mutations and alternative disease mechanisms in colorectal cancer progression, which may be useful for understanding disease mechanism and personalizing treatment for Chinese patients.Entities:
Keywords: Chinese patients; colorectal cancer; disease etiology; mutation spectrum; whole-exome sequencing
Year: 2018 PMID: 29937994 PMCID: PMC6007951 DOI: 10.18632/oncotarget.25287
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Sequencing statistics for Illumina and CG platforms
(A) This graph displays the average sequencing depth and exome coverage percentage with >10X and >20X for Illumina and Complete Genomics platforms. The left chart describes the average number of reads aligned to human reference genome hg19. The average depth can be computed from the length of the original genome (G), the number of reads (N), and the average read length (L) as {N*L/G}. (B) This chart illustrates the mutations in coding regions for Illumina and Complete Genomics platforms. The dash line is used to separate samples into hyper-mutated and regularly mutated ones. (C) A display of the various categories of mutations across samples is shown for SNVs (non-synonymous SNV, synonymous SNV, stopgain SNV and splicing) and InDels (non-frameshift deletion, non-frameshift insertion, frameshift deletion and frameshift insertion).
Figure 2Somatic mutation spectrums for all, hypermutated and regularly mutated samples
Base substitutions are divided into six categories, and each is represented by a color. The 16 possible flanking nucleotide types for each category are then plotted on the horizontal axis. The vertical axis shows the proportion of somatic mutations of each type. (A). Mutation spectrums for all, hypermutated and regularly mutated Chinese samples. (B). Mutation spectrums for all, hypermutated and regularly mutated TCGA samples.
Figure 3Illustration of prevalently somatic mutated genes
(A). Illustration of significantly mutated genes in non-hypermutated samples. The left axis shows mutation frequencies in 85 non-hypermutated samples. The right axis indicates the –log10 transformed q-value score from MutSigCV. (B) Protein interaction network from String database of mutated genes with >5% frequency. Green, red and blue edges represent the evidence from text-mining, experiment, and curated database, respectively. (C) Illustration of somatic mutations on PEG3 and APC genes in Chinese and TCGA data. (D) PEG3 and APC gene frequencies for Chinese and TCGA data (Chi-Squared test).
Figure 4Illustration of mutated genes in pathways
(A). Comparison of significantly mutated pathways between Chinese and TCGA data (q-value < 0.01). (B). Illustration of the mutated genes on pathway level. For each gene, their mutation frequency in Chinese and TCGA data are presented on the left and right box below each gene. The activation and inactivation frequencies >5% and <5% are denoted with respective colors as shown on the top right of the figure. The overall mutation frequencies for each pathway are described on the right of the pathway, coming with that of the Chinese and TCGA cohorts respectively.
Clinical association with somatic mutated genes for non-hypermuated patients
| TP53 | APC | KRAS | PIK3CA | PEG3 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mutation | Mutation | Mutation | Mutation | Mutation | |||||||
| 6 (85.7%) | 0.234 | 0 (0%) | 0.017 | 2 (28.6%) | 1.000 | 0 (0%) | 1.000 | 3 (42.9%) | 0.024 | ||
| 45 (57.7%) | 2 (2.6%) | 27 (34.6%) | 9 (11.5%) | 6 (7.7%) | |||||||
| 23 (57.5%) | 0.665 | 21 (52.5%) | 0.131 | 14 (35%) | 1.000 | 5 (12.5%) | 0.729 | 4 (10%) | 1.000 | ||
| 28 (62.2%) | 16 (35.6%) | 15 (33.3%) | 4 (8.9%) | 5 (11.1%) | |||||||
| 28 (65.1%) | 0.599 | 26 (60.5%) | 0.004 | 21 (48.8%) | 0.010 | 5 (11.6%) | 1.000 | 3 (7%) | 0.464 | ||
| 21 (55.3%) | 10 (26.3%) | 7 (18.4%) | 4 (10.5%) | 6 (15.8%) | |||||||
| 2 (50%) | 1 (25%) | 1 (25%) | 0 (0%) | 0 (0%) | |||||||
| 28 (65.1%) | 0.380 | 26 (60.5%) | 0.002 | 21 (48.8%) | 0.006 | 5 (11.6%) | 1.000 | 3 (7%) | 0.313 | ||
| 23 (54.8%) | 11 (26.2%) | 8 (19%) | 4 (9.5%) | 6 (14.3%) | |||||||
| 6 (35.3%) | 0.044 | 5 (29.4%) | 0.332 | 8 (47.1%) | 0.220 | 2 (11.8%) | 0.888 | 2 (11.8%) | 1.000 | ||
| 10 (62.5%) | 8 (50%) | 3 (18.8%) | 2 (12.5%) | 1 (6.3%) | |||||||
| 31 (70.5%) | 22 (50%) | 15 (34.1%) | 4 (9.1%) | 5 (11.4%) | |||||||
| 2 (28.6%) | 0.119 | 3 (42.9%) | 0.329 | 3 (42.9%) | 0.867 | 1 (14.3%) | 0.680 | 1 (14.3%) | 0.628 | ||
| 44 (63.8%) | 34 (49.3%) | 25 (36.2%) | 7 (10.1%) | 6 (8.7%) | |||||||
| 3 (100%) | 0 (0%) | 1 (33.3%) | 0 (0%) | 0 (0%) | |||||||