| Literature DB >> 27935967 |
Abstract
Colorectal cancer (CRC) is the fourth-ranked cause of cancer-related deaths worldwide. Despite recent advances in CRC management, distant recurrence (DR) remains the major cause of mortality in patients with preoperative chemotherapy and radiotherapy, underscoring a need to precisely identify novel gene signatures for predicting the risk of systemic relapse. Herein, we integrated two independent CRC gene expression datasets: the GSE71222 dataset, including 26 patients who developed DR and 126 patients who did not develop DR, and the GSE21510 dataset, including 23 patients who developed DR and 76 patients who did not develop DR. Our data revealed 37 common upregulated genes (fold change (FC) ≥ 1.5, P < 0.05) and three common downregulated genes (FC ≤ 1.5, P < 0.05) between DR and non-recurrent patients from the two datasets. We subsequently validated the upregulated gene panel in the Cancer Genome Atlas CRC datasets (379 patients), which identified a five-gene signature (S100A2, VIP, HOXC6, DACT1, KIF26B) associated with poor overall survival (OS, log-rank test P-value: 1.19 × 10-4) and poor disease-free survival (DFS, log-rank test P-value: 0.002). In a Cox proportional hazards multiple regression model, the five-gene signature and tumor stage retained their significance as independent prognostic factors for CRC DFS and OS. Therefore, our data identified a novel DR gene expression signature associated with worse prognosis in CRC.Entities:
Mesh:
Year: 2016 PMID: 27935967 PMCID: PMC5147898 DOI: 10.1371/journal.pone.0167455
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The Cancer Genome Atlas CRC dataset patient and tumor characteristics.
| N = 379 | % | |
|---|---|---|
| Median age | 66 | |
| Range | 31–90 | |
| Male | 206 | 54.4 |
| Female | 168 | 44.3 |
| Unknown | 5 | 1.3 |
| Median | 22.04 | |
| Range | 0–147.9 | |
| Median | 20.27 | |
| Range | 0–147.9 | |
| I | 56 | 14.8 |
| II | 135 | 35.6 |
| III | 112 | 29.6 |
| IV | 52 | 13.7 |
| NA | 24 | 6.3 |
Fig 1Genes associated with CRC distant recurrence (DR).
Heatmap depicting the expression levels of differentially expressed genes (1.5 fold changes and P ≤ 0.05) between DR and non-recurrent (NR) CRC patients from the GSE71222 (a) and GSE21510 (b) datasets. Each column represents an individual sample and each row represents a single transcript. The expression level of each mRNA in a single sample is depicted according to the color scale. (c) Venn diagram depicting the common upregulated genes between DR and NR CRC samples from the GSE71222 and GSE21510 datasets. (d) Pie chart illustrating the distribution of the top 5 pathway designations for the 44 common upregulated transcripts from (c). The pie size corresponds to the number of matched entities.
Common recurrence-related genes in the GSE71222 and GSE21510 datasets.
| Gene Symbol | FC (GSE71222) | FC GSE21510 |
|---|---|---|
| 1.51 | 1.60 | |
| 1.58 | 2.33 | |
| 1.74 | 2.42 | |
| 1.79 | 2.12 | |
| 1.99 | 2.33 | |
| 1.77 | 2.13 | |
| 1.85 | 2.57 | |
| 2.08 | 2.28 | |
| 2.15 | 2.73 | |
| 1.60 | 2.12 | |
| 1.50 | 1.80 | |
| 2.50 | 2.42 | |
| 1.67 | 2.06 | |
| 1.75 | 2.50 | |
| 1.56 | 1.63 | |
| 1.50 | 1.60 | |
| 1.62 | 1.56 | |
| 1.71 | 1.66 | |
| 1.90 | 3.45 | |
| 1.62 | 1.73 | |
| 1.71 | 1.83 | |
| 1.52 | 1.72 | |
| 1.60 | 2.06 | |
| 1.59 | 1.58 | |
| 1.90 | 1.87 | |
| 1.82 | 2.18 | |
| 1.67 | 1.86 | |
| 1.80 | 2.32 | |
| 1.71 | 1.85 | |
| 1.58 | 1.82 | |
| 1.84 | 2.03 | |
| 1.72 | 1.72 | |
| 1.86 | 1.93 | |
| 2.16 | 2.91 | |
| 1.97 | 1.52 | |
| 1.50 | 1.70 | |
| 1.56 | 1.76 | |
| -2.09 | -2.04 | |
| -1.52 | -1.54 | |
| -1.67 | -1.62 | |
Selected genes are based on a fold-change (FC) of 1.5 and P < 0.05 cut-off threshold.
Fig 2Validation of the distant recurrence (DR) gene panel in the TCGA dataset.
(a) OncoPrint of the DR five-gene signature in the TCGA CRC dataset. Alteration in the expression of different members of the five-gene signature (rows) in relation to each sample (columns). Relationships to overall and disease-free survival are also shown. CRC cases with upregulated expression of the DR signature showed worse overall (b) and disease-free (c) survival than cases with lower expression. (d) Network view of the VIP/DACT1/S100A2 neighborhood in CRC. VIP, DACT1, and S100A2 are seed genes (indicated with thick borders), and all other genes were identified as altered in CRC.
Multivariate analyses for the prognostic value of the 5-gene signature in TCGA CRC dataset.
| Parameters | Categories | DFS hazard ratio (95% CI) | P value | OS hazard ratio (95% CI) | P value |
|---|---|---|---|---|---|
| Five-gene expression | High vs Low | 1.95 (1.27 to 3.01) | 1.84 (1.16 to 2.90) | ||
| Age at diagnosis | <65 vs. >65 | 0.86 (0.56 to 1.33) | 0.5103 | 2.47 (1.50 to 4.09) | |
| Type | CA vs RA vs MA | 0.87 (0.67 to 1.12) | 0.2925 | 1.14 (0.86 to 1.51) | 0.3560 |
| Tumor stage | (3/4 vs 1/2) | 1.92 (1.37 to 2.69) | 3.18 (1.96 to 5.16) | ||
| Gender | M vs F | 1.35 (0.86 to 2.10) | 0.1857 | 1.04 (0.65 to 1.65) | 0.8614 |
CA: Colon Adenocarcinoma; RA: Rectal Adenocarcinoma; MA: Mucinous Adenocarcinoma of the Colon and Rectum