| Literature DB >> 27153559 |
Philip D Dunne1, Paul G O'Reilly1, Helen G Coleman2, Ronan T Gray2, Daniel B Longley1, Patrick G Johnston1, Manuel Salto-Tellez1, Mark Lawler1, Darragh G McArt1.
Abstract
The Colorectal Cancer (CRC) Subtyping Consortium (CRCSC) recently published four consensus molecular subtypes (CMS's) representing the underlying biology in CRC. The Microsatellite Instable (MSI) immune group, CMS1, has a favorable prognosis in early stage disease, but paradoxically has the worst prognosis following relapse, suggesting the presence of factors enabling neoplastic cells to circumvent this immune response. To identify the genes influencing subsequent poor prognosis in CMS1, we analyzed this subtype, centered on risk of relapse. In a cohort of early stage colon cancer (n=460), we examined, in silico, changes in gene expression within the CMS1 subtype and demonstrated for the first time the favorable prognostic value of chemokine-like factor (CKLF) gene expression in the adjuvant disease setting [HR=0.18, CI=0.04-0.89]. In addition, using transcription profiles originating from cell sorted CRC tumors, we delineated the source of CKLF transcription within the colorectal tumor microenvironment to the leukocyte component of these tumors. Further to this, we confirmed that CKLF gene expression is confined to distinct immune subsets in whole blood samples and primary cell lines, highlighting CKLF as a potential immune cell-derived factor promoting tumor immune-surveillance of nascent neoplastic cells, particularly in CMS1 tumors. Building on the recently reported CRCSC data, we provide compelling evidence that leukocyte-infiltrate derived CKLF expression is a candidate biomarker of favorable prognosis, specifically in MSI-immune stage II/III disease.Entities:
Keywords: chemokine-like factor; colorectal cancer; gene expression profiling; molecular stratification; relapse risk
Mesh:
Substances:
Year: 2016 PMID: 27153559 PMCID: PMC5095027 DOI: 10.18632/oncotarget.9126
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Study overview of discovery and survival validation subsets
The data used in this study was obtained from 566 Affymetrix U133 Plus 2.0 patient transcriptional profiles accessed through the NCBI GEO accession number GSE39582. Filtering for stage II/III and complete relapse data reduced this cohort to 460 profiles. The CMS1 specific discovery subset was composed of 46 tumors which fulfilled risk filtering (see Materials and Methods) followed by differential gene expression analysis based on risk classification. The survival validation subset was composed of either the entire cohort, CMS1 specific or MSI specific subgroups of samples. For relapse-free survival analyses, only stage II and III patients were considered, giving 79 transcriptional profiles in the CMS1 or 60 transcriptional profiles in the MSI subgroups.
Characteristics of patients in CMS1 (Consensus Molecular Subtype) compared with other tumor subtypes
| Characteristic | CMS 1 n=80 | All other subtypes | p-value |
|---|---|---|---|
| 70.2 (14.6) | 67.0 (12.0) | 0.05 | |
| | 38 (47.5) | 220 (57.9) | 0.09 |
| | 42 (52.5) | 160 (42.1) | |
| | 48 (60.0) | 211 (55.5) | 0.46 |
| | 32 (40.0) | 169 (44.5) | |
| | 62 (77.5) | 125 (32.9) | <0.001 |
| | 18 (22.5) | 255 (67.1) | |
| 20 (25.0) | 182 (47.9) | <0.001 |
Comparative analysis of age, sex, stage, location and treatment for CMS1 (n=80) patients versus the remaining CMS 2, 3, 4 and unclassified patients (n=380).
Comprises n=193 CMS2, n=50 CMS3, n=99 CMS4 and n=38 unclassified CMS subtypes.
Figure 2Relapse risk in recently defined consensus molecular subtypes
Survival curve using Kaplan-Meier estimation comparing prognosis of CMS1-4 in untreated stage II/III CRC patients (GSE39582). RR indicates the 3 year relapse rate.
Probesets and genes associated with relapse risk in CMS1
| Column ID | Gene Symbol | p-value (Risk) | Fold-Change (High Risk vs. Low Risk) |
|---|---|---|---|
| 242465_at | LOC100505592 | 0.000332139 | −1.78683 |
| 209759_s_at | DCI | 0.00192191 | −1.51586 |
| 213285_at | TMEM30B | 0.00219006 | −1.86942 |
| 229331_at | SPATA18 | 0.0022411 | −2.15593 |
| 210002_at | GATA6 | 0.00239525 | −1.95547 |
| 203814_s_at | NQO2 | 0.00317772 | −1.88969 |
| 232809_s_at | FLT1 | 0.00388947 | −1.58036 |
| 221958_s_at | WLS | 0.00428383 | −1.90076 |
| 228360_at | LYPD6B | 0.00460096 | −2.18477 |
Differentially expressed probesets and their associated gene name between high and low risk groups in CMS1. Following ANOVA, fold-change +/− 1.5 fold and p-value<0.005 filters were applied resulting in 55 annotated probesets associated with risk in CMS1. Positive fold-change indicates higher expression in high-risk group.
Unadjusted and adjusted analyses of relapse-free survival
| CKLF gene expression | Unadjusted Hazard ratios (95% confidence intervals) | Adjusted | ||
|---|---|---|---|---|
| Number non-events Low/Med/High | Number events Low/Med/High | High v. Low expression | High v. Low expression | |
| | 17/15/19 | 5/2/1 | 0.21 (0.04-1.03) | 0.19 (0.04-0.96) |
| | 20/19/25 | 7/6/2 | 0.21 (0.04-1.03) | 0.18 (0.04-0.89) |
| | 12/13/16 | 4/1/1 | 0.18 (0.02-1.65) | 0.19 (0.02-1.85) |
| | 15/16/19 | 5/4/1 | 0.17 (0.01-1.42) | 0.16 (0.02-1.39) |
| | 63/65/68 | 20/19/23 | 0.99 (0.54-1.79) | 0.93 (0.50-1.73) |
| | 43/41/41 | 28/27/22 | 0.80 (0.45-1.43) | 0.90 (0.50-1.62) |
| | 106/106/109 | 48/46/45 | 0.87 (0.58-1.31) | 0.98 (0.64-1.49) |
RFS analysis was performed using Cox proportional hazards method in the CMS1, MSI or entire cohort stratified by CKLF expression levels. Analysis was performed both before and following adjustment.
Cut-offs for low/medium/high CKLF gene expression based on tertile values.
Adjusted for age, sex, TNM stage, tumour location and adjuvant treatment receipt (in treated analyses only).
Figure 3CKLF gene expression is associated with prognosis only in CMS1
A. Survival curve using Kaplan-Meier estimation comparing CKLF levels in CMS1 stage II/III CRC patients (GSE39582) B. Survival curve using Kaplan-Meier estimation comparing CKLF levels in MSI stage II/III CRC patients (GSE39582).
Figure 4CKLF gene expression within the tumor microenvironment and normal tissue
A. Box and whisker plot of CKLF expression according to specific endothelial, epithelial, fibroblast or leukocyte cell-of-origin from FACS sorted primary colorectal tumors. B. CKLF RNA-seq expression values are shown in RPKM (Reads Per Kilobase of transcript per Million mapped reads), calculated from a gene model with isoforms collapsed to a single gene. No other normalization steps have been applied. Box plots are shown as median and 25th and 75th percentiles; points are displayed as outliers if they are above or below 1.5 times the interquartile range. The data used for the analyses described in this manuscript were obtained from the Genotype-Tissue Expression (GTEx) portal (http://gtexportal.org/home) version V6 in January 2016. C. CKLF gene expression from a subset of the Atlas of Human Primary Cells cohort. Bar charts are shown as median CKLF gene expression according to specific lineage and error bars represent standard deviation. ** denotes p<0.005.