| Literature DB >> 28455965 |
Yujin Kwon1, Minhee Park1, Mi Jang1, Seongju Yun1, Won Kyu Kim1, Sora Kim2, Soonmyung Paik2, Hyun Jung Lee3, Sungpil Hong3, Tae Il Kim3, Byungsoh Min4, Hoguen Kim1.
Abstract
Individualizing adjuvant chemotherapy is important in patients with advanced colorectal cancers (CRCs), and the ability to identify molecular subtypes predictive of good prognosis for stage III CRCs after adjuvant chemotherapy could be highly beneficial. We performed microarray-based gene expression analysis on 101 fresh-frozen primary samples from patients with stage III CRCs treated with FOLFOX adjuvant chemotherapy and 35 matched non-neoplastic mucosal tissues. CRC samples were classified into four molecular subtypes using nonnegative matrix factorization, and for comparison, we also grouped CRC samples using the proposed consensus molecular subtypes (CMSs). Of the 101 cases, 80 were classified into a CMS group, which shows a 79% correlation between the CMS classification and our four molecular subtypes. We found that two of our subtypes showed significantly higher disease-free survival and overall survival than the others. Group 2, in particular, which showed no disease recurrence or death, was characterized by high microsatellite instability (MSI-H, 6/21), abundant mucin production (12/21), and right-sided location (12/21); this group strongly correlated with CMS1 (microsatellite instability immune type). We further identified the molecular characteristics of each group and selected 10 potential biomarker genes from each. When these were compared to the previously reported molecular classifier genes, we found that 31 out of 40 selected genes were matched with those previously reported. Our findings indicate that molecular classification can reveal specific molecular subtypes correlating with clinicopathologic features of CRCs and can have predictive value for the prognosis for stage III CRCs with FOLFOX adjuvant chemotherapy.Entities:
Keywords: DNA microarray; colon cancer; consensus molecular subtype; mRNA expression-based molecular classification; molecular subtype
Mesh:
Substances:
Year: 2017 PMID: 28455965 PMCID: PMC5503619 DOI: 10.18632/oncotarget.17023
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Workflow overview of gene expression-based molecular classification
(A) NMF consensus clustering using 1,764 DEGs in tumors compared to non-neoplastic tissues. DEGs for each of the four groups, as compared to non-neoplastic colon mucosa tissues and/or the other groups were selected. DEG: differentially expressed genes, NMF: nonnegative matrix factorization. (B) Unsupervised hierarchical clustering analysis of mRNA expression profiles from colon cancer tissues. Unsupervised classification identifies four molecularly distinct subtypes. Red and green colors indicate transcript levels above and below the sample median, respectively. Complete separation of 35 normal colon tissues (red) and 101 colon cancers (sky blue) was evident based on gene expression profiles.
Figure 2Four molecular subtypes identified using consensus clustering-based NMF
(A) NMF type 1 clustering: consensus clustering using 1,764 DEGs in tumors compared to non-neoplastic tissues. (B) NMF type 2 clustering: clustering with 1,538 DEGs specific for one group, as compared to other tumor groups. (C and D) Analysis of the mRNA expression patterns of the 30 genes from the CRCassigner-30 was performed according to each of the four subtypes from NMF type 1 clustering (C) and NMF type 2 clustering (D). NMF type 1 clustering showed more distinct gene expression patterns between the tumor subtypes than that of the NMF type 2 clustering.
Clinical and pathologic features of each molecular group
| Category | Variables | Group 1 | Group 2 | Group 3 | Group 4 | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| (n=22, 21.8%) | (n=21, 20.8%) | (n=16, 15.8%) | (n=42, 41.6%) | |||||||
| Age, years | 57.95±13.26 | 58.09±13.67 | 62±9.04 | 58.81±9.42 | 0.802 | |||||
| Gender | Male | 14 | 63.6% | 8 | 38.1% | 7 | 43.8% | 22 | 52.4% | 0.376 |
| Female | 8 | 36.4% | 13 | 61.9% | 9 | 56.2% | 20 | 47.6% | ||
| Location | Right side | 12 | 54.5% | 12 | 57.1% | 6 | 37.5% | 6 | 14.3% | <0.001a |
| Left side | 10 | 45.5% | 9 | 42.9% | 10 | 62.5% | 36 | 85.7% | ||
| Preop CEA level | ≤5 ng/ml | 18 | 81.8% | 17 | 81.0% | 10 | 62.5% | 28 | 66.7% | 0.367 |
| >5 ng/ml | 4 | 18.2% | 4 | 19.0% | 6 | 37.5% | 14 | 33.3% | ||
| Histologic diagnosis | Adenocarcinoma | 18 | 81.8% | 19 | 90.5% | 16 | 100.0% | 42 | 100.0% | <0.001a |
| Mucinous adenocarcinoma | 4 | 18.2% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | ||
| Medullary carcinoma | 0 | 0.0% | 2 | 9.5% | 0 | 0.0% | 0 | 0.0% | ||
| Mucin formation | Absent | 10 | 45.5% | 9 | 42.8% | 14 | 87.5% | 39 | 92.9% | 0a |
| Focal | 8 | 36.4% | 12 | 57.2% | 2 | 12.5% | 3 | 7.1% | ||
| predominant | 4 | 18.2% | 0 | 0.0% | 0 | 0.0% | 0 | 0.0% | ||
| Crohn like lymphoid reaction | low-density group | 14 | 63.6% | 13 | 61.9% | 8 | 50.0% | 33 | 78.6% | 0.157 |
| high-density group | 8 | 36.4% | 8 | 38.1% | 8 | 50.0% | 9 | 21.4% | ||
| invasion pattern | expending | 2 | 9.1% | 7 | 33.3% | 2 | 12.5% | 4 | 9.5% | 0.098 |
| infiltrative | 20 | 90.9% | 14 | 66.7% | 14 | 87.5% | 38 | 90.5% | ||
| Wild type | 22 | 100.0% | 20 | 95.2% | 16 | 100.0% | 42 | 100.0% | 0.366 | |
| Mutation | 0 | 0.0% | 1 | 4.8% | 0 | 0.0% | 0 | 0.0% | ||
| Wild type | 8 | 36.4% | 15 | 71.4% | 7 | 43.8% | 31 | 73.8% | 0.01a | |
| Mutation | 14 | 63.6% | 6 | 28.6% | 9 | 56.3% | 11 | 26.2% | ||
| MSI status | MSS/ MSI-low | 20 | 90.9% | 15 | 71.4% | 16 | 100.0% | 42 | 100.0% | <0.001a |
| MSI-high | 2 | 9.0% | 6 | 28.6% | 0 | 0.0% | 0 | 0.0% | ||
| MLH1 lossc | 0 | 6 | 0 | 0 | ||||||
| MSH2 lossc | 1 | 1 | 0 | 0 | ||||||
| PMS2 lossc | 1 | 5 | 0 | 0 | ||||||
| MSH6 lossc | 1 | 1 | 0 | 0 | ||||||
aP<0.05, bPyrosequencing, cImmunohistochemical stain, MSS: Microsatellite stability, MSI: Microsatellite instability
Correlation between 4 molecular groups and CMS subtypes
| NMF network class | Total | |||||
|---|---|---|---|---|---|---|
| Group 2 | Group 4 | Group 1 | Group 3 | |||
| CMS1 | 10 (66.6%) | 0 (0%) | 0 (0%) | 0 (0%) | 10 | |
| CMS2 | 4 (26.7%) | 31 (79.5%) | 3 (18.8%) | 0 (0%) | 38 | |
| CMS3 | 1 (6.7%) | 0 (0%) | 12 (75%) | 0 (0%) | 13 | |
| CMS4 | 0 (0%) | 8 (20.5%) | 1 (6.2%) | 10 (100%) | 19 | |
| undetermined | 6 | 3 | 6 | 6 | 21 | |
| Total | 21 | 42 | 22 | 16 | 101 | |
Figure 3Patient survival rate analysis according to our CRC molecular subtype and the CMS
(A and B) Disease-free survival and overall survival according to our four molecular subtypes. (C and D) Disease-free survival and overall survival according to the four CMS subtypes.
Figure 4Identification of mRNAs differentially expressed in each of the four groups, as compared to non-neoplastic colon mucosa tissues and/or the other groups
(A) Venn diagram showing a total of 292 potential marker genes, with group specific upregulation. (B) Heatmap of the mRNA expression profiles of these 292 DEGs according to the four molecular groups. (C) The mRNA expression profiles of 10 selected genes from each group.
Specific-upregulated genes in each molecular group
| Gene cluster | Gene symbola | Description | FC. T vs. N | FC. Specific T group vs. other T groups |
|---|---|---|---|---|
| defensin, alpha 5, Paneth cell-specific | 4.56 | 4.65 | ||
| dual oxidase 2 | 4.28 | 3.91 | ||
| kallikrein-related peptidase 12 | 3.19 | 2.77 | ||
| ALDOB | aldolase B | 5.37 | 2.61 | |
| gamma-aminobutyric acid (GABA) A receptor, pi | 1.84 | 1.95 | ||
| LOC653061 | similar to Golgin subfamily A member 8-like protein 1 | 2.62 | 1.95 | |
| apolipoprotein B mRNA editing enzyme, catalytic polypeptide 1 | 3.31 | 1.88 | ||
| C2CD4B | family with sequence similarity 148, member B | 2.93 | 1.82 | |
| mucin 5AC, oligomeric mucus/gel-forming; similar to hCG1778310 | 2.27 | 1.79 | ||
| SEZ6L2 | seizure related 6 homolog (mouse)-like 2 | 2.35 | 1.73 | |
| CCDC58 | coiled-coil domain containing 58 | 6.19 | 2.81 | |
| molybdenum cofactor sulfurase | 2.92 | 2.11 | ||
| FAM81A | family with sequence similarity 81, member A | 2.59 | 2.06 | |
| MTHFD2 | methylenetetrahydrofolate dehydrogenase (NADP+ dependent)2, methenyltetrahydrofolate cyclohydrolase | 3.10 | 1.99 | |
| solute carrier family 7, (cationic amino acid transporter, y+ system) member 11 | 2.52 | 1.93 | ||
| PDZ binding kinase | 2.78 | 1.91 | ||
| MAD2L1 | MAD2 mitotic arrest deficient-like 1 (yeast) | 3.49 | 1.90 | |
| LDHB | lactate dehydrogenase B | 3.18 | 1.86 | |
| SEH1L | SEH1-like (S. cerevisiae) | 2.14 | 1.83 | |
| CDCA2 | cell division cycle associated 2 | 2.38 | 1.77 | |
| HOP homeobox | 4.26 | 4.11 | ||
| Transgelin | 2.66 | 3.73 | ||
| gremlin 1 | 2.12 | 3.59 | ||
| thrombospondin 4 | 2.85 | 3.58 | ||
| collagen, type III, alpha 1 | 5.33 | 3.34 | ||
| paired related homeobox 1 | 4.29 | 3.23 | ||
| RAB31, member RAS oncogene family | 3.60 | 3.21 | ||
| myosin, light chain 9, regulatory | 2.38 | 3.13 | ||
| cathepsin K | 2.59 | 3.11 | ||
| secreted protein, acidic, cysteine-rich (osteonectin) | 2.35 | 3.05 | ||
| DSC3 | desmocollin 3 | 2.66 | 2.21 | |
| calcium binding protein 39-like | 2.22 | 2.07 | ||
| additional sex combs like 1 (Drosophila) | 1.99 | 1.80 | ||
| endoplasmic reticulum protein 27 | 2.55 | 1.70 | ||
| ZMYND8 | zinc finger, MYND-type containing 8 | 1.90 | 1.69 | |
| acyl-CoA synthetase long-chain family member 6 | 1.94 | 1.67 | ||
| GMEB2 | glucocorticoid modulatory element binding protein 2 | 1.78 | 1.66 | |
| HPDL | 4-hydroxyphenylpyruvate dioxygenase-like | 2.79 | 1.66 | |
| tetraspanin 6 | 2.11 | 1.64 | ||
| C8orf33 | chromosome 8 open reading frame 33 | 2.15 | 1.63 |
aSelected genes that show specific-upregulated expression in each molecular group. Those matching previously reported classifier genes are marked in bold.