Literature DB >> 29377588

A robust gene signature for the prediction of early relapse in stage I-III colon cancer.

Weixing Dai1,2, Yaqi Li1,2, Shaobo Mo1,2, Yang Feng1,2, Long Zhang3, Ye Xu1,2, Qingguo Li1,2, Guoxiang Cai1,2.   

Abstract

Colon cancer patients experiencing early relapse consistently exhibited poor survival. The aim of our study was to develop an mRNA signature that can help to detect early relapse cases in stage I-III colon cancer. Public microarray datasets of stage I-III colon cancer samples were extracted from the Gene Expression Omnibus database. Propensity score matching analysis was performed between patients in the early relapse group and the long-term survival group from GSE39582 discovery series (N = 386), and patients were 1 : 1 matched. Global mRNA expression changes were then analyzed between the paired groups to identify the differentially expressed genes. Lasso Cox regression modeling analysis was conducted for the selection of prognostic mRNA. Fifteen mRNA were finally identified to build an early relapse classifier. With specific risk score formula, patients were classified into a high-risk group and a low-risk group. Relapse-free survival was significantly different between the two groups in every series, including discovery [hazard ratio (HR): 2.547, 95% confidence interval (CI): 1.708-3.797, P < 0.001)], internal validation (HR: 5.146, 95% CI: 1.968-13.457, P < 0.001), and external validation (HR: 1.977, 95% CI: 1.295-3.021, P < 0.001) sets of patients. Time-dependent receiver-operating characteristic at 1 year suggested more prognostic accuracy of the classifier [area under curve (AUC = 0.703)] than the American Joint Commission on Cancer tumor-node-metastasis staging system (AUC = 0.659) in all 951 patients. In conclusion, we developed a robust mRNA signature that can effectively classify colon cancer patients into groups with low and high risks of early relapse. This mRNA signature may help select high-risk colon cancer patients who require more aggressive therapeutic intervention.
© 2018 The Authors. Published by FEBS Press and John Wiley & Sons Ltd.

Entities:  

Keywords:  Gene Expression Omnibus database; colon cancer; early relapse; mRNA signature; propensity score

Mesh:

Substances:

Year:  2018        PMID: 29377588      PMCID: PMC5891048          DOI: 10.1002/1878-0261.12175

Source DB:  PubMed          Journal:  Mol Oncol        ISSN: 1574-7891            Impact factor:   6.603


American Joint Commission on Cancer actin‐related protein 3 homolog B area under curve bleomycin hydrolase C‐C motif chemokine ligand 20 confidence interval cytidine/uridine monophosphate kinase 2 colorectal cancer circulating tumor cells differentially expressed genes extracellular matrix protein 1 Gene Expression Omnibus guided PCA granzyme B hes family BHLH transcription factor 6 hazard ratio interleukin 7 kallikrein‐related peptidase 10 keratin 6A linear models for microarray data matrix metallopeptidase 9 mesothelin 2′‐5′‐oligoadenylate synthetase 1 propensity score pseudouridylate synthase 7 receiver‐operating characteristic tumor–node–metastasis International Union against Cancer zinc finger protein 426

Introduction

Colorectal cancer (CRC) is a worldwide common malignant tumor and also a major cause of cancer‐related deaths (Siegel et al., 2016). The implement of curative resection and advancement of adjuvant chemotherapy have witnessed a moderate improvement on the overall prognosis of colon cancer (Shi et al., 2013; Wilkinson et al., 2010). However, relapse following surgery is still a major and ultimate cause of deaths. Approximately 25–40% of patients would develop recurrence or metastases after primary radical resection, of which one‐third were local recurrence and the others were distal metastases (Becker, 1995; Tsai et al., 2009b; Van Cutsem et al., 2006). For the most part, the relapse of colon cancer is time‐related. Almost 40–50% of relapse emerged within the first year after initial primary resection, and 90% within the first 3 years (Longo and Johnson, 2002; Seo et al., 2013). Early relapse in colon cancer patients is attributed mainly to poor clinicopathological features (such as deeper tumor invasion, regional lymph nodes metastasis, poor differentiation, and worse histopathological type) and poor response to adjuvant chemotherapy. Those cases who developed early relapse consistently tended to have poorer long‐term survival rates. Consequently, more valuable predictive factors are urgently needed to detect the early postoperative relapse. For decades, the most significant risk factor for predicting early relapse in colon cancer is based on American Joint Commission on Cancer/International Union against Cancer (AJCC/UICC) tumor–node–metastasis (TNM) staging system. However, for the great heterogeneity of colon cancer, prognosis varies significantly in colon cancer patients with same tumor stage and comparable clinicopathological features. Therefore, researchers are searching extensively for the ideal biomarker or indicator for predicting early relapse in colon cancer patients (Hwang et al., 2011; Lu et al., 2011; Yang et al., 2012). Although substantial efforts have been made to explore gene‐based molecular biomarker to predict the early relapse, no molecular prognostic classifiers have been established. Recent studies in many malignancies, including colon cancer, suggested that multigene expression patterns or gene signature can make a good prediction of cancer prognosis (Catto et al., 2010; Chen et al., 2011; Lee et al., 2015b; Tan and Tan, 2011). But, few precious gene profiling has been applied to detect the early relapse‐associated genes in colon cancer. Searching a gene signature might be of concrete predictive value in the prediction of early relapse in colon cancer patients. In this study, a large group of mRNA‐specific probes were fortuitously represented on the commonly used microarray platform (Affymetrix HG‐U133 plus 2.0). We adopted previously published gene expression microarray data from the Gene Expression Omnibus (GEO) and conducted mRNA profiling on large cohorts of colon cancer patients. Using the sample‐splitting method and Cox regression analysis, a prognostic 15‐mRNA signature was identified from the discovery set in GSE39582 and validated in the internal validation series and another four GEO cohorts. This mRNA signature may help identify the subset of colon cancer patients at high risk of early relapse. Extensive postoperative management and surveillance may be needed for these patients.

Materials and methods

Preprocessing of microarray data

Raw microarray colon cancer datasets were obtained from the GEO database (http://www.ncbi.nlm.nih.gov/geo/) and were normalized using Robust Multichip Average (Irizarry et al., 2003). All datasets were produced by the Affymetrix HG‐U133 plus 2.0 platform. All probes were mapped based on their own EntrezGeneID. When multiple probes were mapped to the same EntrezGeneID, the mean value was used to represent its average expression level.

Dataset selection

The selection criterion for CRC datasets were as follows: (a) All sets were created by Affymetrix HG‐U133 plus 2.0 platform; (b) all sets should have basic clinical information of stage, relapse‐free survival (RFS) interval and RFS status; (c) datasets with larger sample size were preferred to be chosen, and we limited sample size to ~ 100. Datasets missing necessary clinicopathological or follow‐up data were excluded. Finally, CRC datasets of GSE39582, GSE14333, GSE17538, GSE33113, and GSE37892 were identified in this study to construct and validate the prognostic value of gene signature. Table S1 was used to describe the GEO datasets that were excluded in our study. All the samples were further filtered based on the criterion of stage I–III colon cancer and the availability of clinical outcome data. GSE39582 is the largest set consisting of 497 stage I–III colon cancer, and hence, it was assigned to a discovery series and an internal validation series. GSE17538, GSE14333, GSE33113, and GSE37892 were combined and set as external validation series. Analyses of the probe cell intensity (CEL) files suggested that there was extensive overlap between samples (H. Lee Moffitt Cancer Center) in the GSE17538 and GSE14333 series (Sveen et al., 2012). Therefore, the samples from Moffitt Cancer Center (N = 138, stages I–III) in GSE17538 were excluded. ComBat method was used to remove the internal batch effects among 10 batches in GSE39582 and external batch effects among different GEO datasets. This method was implemented in the SVA R package, and the corresponding R‐code can be found as Data S1. The guided PCA (gPCA) method was used to evaluate the success of batch effect correction (Reese et al., 2013).

Identification of early relapse‐associated genes

Early relapse was defined as the locoregional recurrence or distant metastasis within 1 year after primary resection (Lu et al., 2011). Samples in the discovery set from GSE39582 were selected and divided into early relapse group and long‐term survival group (no relapse after a minimum of 5 years follow‐up). Propensity score (PS) matching analysis was performed between the two groups to adjust for stage and adjuvant chemotherapy, which were the most significant clinical factors associated with early relapse. All patients were matched 1 : 1. Finally, 45 paired patients in the discovery set were identified to identify the changes of global gene expression profile between early relapse group and long‐term survival groups. The analysis of differentially expressed genes (DEGs) between early relapse and long‐term survival samples was conducted using the linear models for microarray data (LIMMA) method (Smyth, 2005). The threshold for identification of DEGs was set as P < 0.05 and fold change ≥ 1.25. Lastly, LASSO Cox regression model (Tibshirani, 1997) was used to select the most significantly relapse‐associated mRNA of all the DEGs.

Development of risk score and statistical analysis

Using LASSO Cox regression analysis, we identified a panel of genes and constructed a multi‐mRNA‐based classifier for predicting the early relapse in patients with stage I–III colon cancer in the discovery set. With specific risk score formula, patients from different sets were divided into high‐risk and low‐risk groups using the median risk score of the discovery set as the cutoff point. Survival differences between the low‐risk and high‐risk groups in each set were assessed by the Kaplan–Meier estimate and compared using the log‐rank test. Multivariate Cox regression analysis and data stratification analysis were performed to test the independent prognostic role of risk score in predicting RFS. Time‐dependent receiver‐operating characteristic (ROC) analysis was used to investigate the prognostic or predictive accuracy of each feature and signature. All statistical analyses were performed with use of r (version 2.15.0, http://www.r-project.org). All statistical tests were two‐sided, and P values < 0.05 were considered statistically significant.

Results

Preparation of colon cancer datasets

A total of 951 patients were identified and fully studied, which included 497 patients from GSE39582 (386 patients from the discovery set and 111 from the internal validation set), 234 patients from GSE17538 and GSE14333, 90 patients from GSE33113, and 130 patients from GSE37892. Plots of the first versus the second principal components before and after removing batch effects are shown in Fig. S1. The original data of the all patients included in analysis are listed in Table S2. The baseline clinical information for patients in external validation sets, GSE14333, GSE17538, GSE33113, and GSE37892, is shown in Table S3.

Development of early relapse signature from the discovery series

Samples in discovery set were divided into early relapse group and long‐term survival group. Patients' clinicopathological features before and after PS matching are described in Table 1. Before the implement of PS analysis, it is noticeable that tumor stage in early relapse group was significantly higher than that in long‐term survival group. After PS matching, there were no significant differences in age, AJCC stage, tumor location, and adjuvant chemotherapy between early relapse and long‐term survival groups in each set (Table 1). Changes of global mRNA expression profiles were analyzed between early relapse and long‐term survival groups. One hundred and seven of them were differentially expressed between the two groups (P < 0.05, fold change ≥ 1.25; Fig. 1A). LASSO coefficient profiles of the 107 mRNA are shown in Fig. 1B. A coefficient profile plot was produced against the log (λ) sequence. Vertical line was drawn at the value selected using 10‐fold cross‐validation, and the minimize λ method resulted in 15 optimal coefficients. Of these, 11 mRNA were downregulated and four were upregulated in early relapse group compared with long‐term survival group (Table S4). Using Lasso Cox regression modeling, we derived a 15‐mRNA signature to calculate the risk score for every patient based on the expression levels of the 15 RNA weighted by their regression coefficients: risk score = (− 0.052 × expression level of ACTR3B) + (− 0.116 × expression level of BLMH) + (− 0.047 × expression level of CCL20) + (− 0.121 × expression level of CMPK2) + (0.259 × expression level of ECM1) + (0.043 × expression level of GZMB) + (− 0.287 × expression level of HES6) + (− 0.102 × expression level of IL7) + (0.201 × expression level of KLK10) + (− 0.015 × expression level of KRT6A) + (− 0.302 × expression level of MMP9) + (0.038 × expression level of MSLN) + (− 0.217 × expression level of OAS1) + (− 0.236 × expression level of PUS7) + (− 0.168 × expression level of ZNF426).
Table 1

Clinical–pathological features of patients in early relapse and long‐term survival groups before and after PS matching

VariableDiscovery set
Before matchingAfter matching
Early relapseLong‐term survival P Early relapseLong‐term survival P
Age (mean, IQR)68.2 (58.5–78.0)65.2 (57.5–73)0.1768.2 (58.0–78.0)66.4 (60.0–73.0)0.52
Gender
Male20660.7520190.83
Female25742526
Stage
I080.08001
II20762020
III25562525
T stage
T1030.03010.16
T221123
T3251082535
T41318136
NA5050
N stage
N024840.0116200.09
N111391119
N21317136
NA1010
Tumor location
Proximal17540.9217190.67
Distal28862826
Adjuvant chemotherapy
No20800.1420201
Yes25602525
Total451404545
Figure 1

(A) Heat map showed eighteen differentially expressed mRNA in colon cancer between early relapse and long‐term survival group both in discovery set. (B) LASSO coefficient profiles of the 107 early relapse‐associated mRNA. A vertical line is drawn at the value chosen by 10‐fold cross‐validation.

Clinical–pathological features of patients in early relapse and long‐term survival groups before and after PS matching (A) Heat map showed eighteen differentially expressed mRNA in colon cancer between early relapse and long‐term survival group both in discovery set. (B) LASSO coefficient profiles of the 107 early relapse‐associated mRNA. A vertical line is drawn at the value chosen by 10‐fold cross‐validation.

The prognostic value of 15‐mRNA signature in discovery, internal validation, and external validation series

Patients in discovery set were divided into low‐risk group (N = 193) or high‐risk group (N = 193) using the median risk score as cutoff point. The distribution of risk scores and survival status is shown in Fig. 2A (left panel), which suggested that patients with lower risk scores generally had better survival than those with higher risk scores. Time‐dependent ROC analyses at 1, 3, and 5 years were conducted to assess the prognostic accuracy of the 15‐mRNA‐based classifier (Fig. 2A, middle panel). The RFS rates for patients with low‐risk scores were 93.6% at 1 year, 86.6% at 3 years, and 81.0% at 5 years, compared with 83.4%, 62.7%, and 57.9% in patients with high‐risk scores, respectively [hazard ratio (HR): 2.547, 95% confidence interval (CI): 1.708–3.797, P < 0.001, Fig. 2A, right panel].
Figure 2

Distribution of risk score, time‐dependent ROC curves at 1, 3, and 5 years and Kaplan–Meier survival analysis between patients at low and high risks of relapse in discovery set (A), internal validation set (B), external validation set (C), and entire dataset (D).

Distribution of risk score, time‐dependent ROC curves at 1, 3, and 5 years and Kaplan–Meier survival analysis between patients at low and high risks of relapse in discovery set (A), internal validation set (B), external validation set (C), and entire dataset (D). We then did the same analyses in the internal validation cohort. In the internal validation series, 1‐, 3‐, and 5‐year RFS was 84.6%, 58.0%, and 56.0% for the high‐risk group, and 95.9%, 89.5%, and 89.5% for the low‐risk group (HR: 5.146, 95% CI: 1.968–13.457, P < 0.001, Fig. 2B). To confirm that the 15‐mRNA‐based classifier had similar prognostic value in different populations, we combined the samples in GSE17538, GSE14333, and GSE33113, and a total of 324 colon cancer patients were further studied to validate the 15‐mRNA signature. Using the established cutoff point, 193 (42.5%) patients were classified as low risk, and 261 (57.5%) as high risk. Five‐year disease‐free survival was 68.1% for the high‐risk group and 83.9% for the low‐risk group (HR: 1.977, 95% CI: 1.295–3.021; P < 0.001; Fig. 2C). In the entire dataset analysis, risk score‐based classification yielded similar results (Fig. 2C). Colon cancer patients can be divided into low‐ and high‐risk patients with significantly different RFS, and the signature showed the highest predicting accuracy at 1 year after surgery.

Independence and accuracy of the signature in predicting RFS

After multivariate analysis adjusted by clinicopathological variables, the 15‐mRNA‐based classifier remained a powerful and independent factor in the discovery, internal validation, and external validation sets (Table 2). Stratified analysis suggested that the 15‐mRNA‐based classifier was still a clinically and statistically significant prognostic model in stage II, stage III, patients with or without adjuvant chemotherapy and patients with or without KRAS mutation (Fig. 3). Samples from the entire dataset were then separated into five risk groups based on their relapse‐free status and time: group A (relapse within 1 year), group B (relapse within 3 years), group C (relapse after 3 years), group D (no relapse within 5 years), group E (no relapse after minimum 5 years). The distribution of risk score among five risk groups are shown in Fig. S2. As expected, group A showed the highest risk score, while group E showed the lowest.
Table 2

Univariable and multivariable Cox regression analysis in colon cancer

VariablesUnivariate analysisMultivariate analysis
HR (95% CI) P HR (95% CI) P
Discovery set (N = 386)
Age1.01 (0.99 to 1.02)0.4381.01 (0.99 to 1.03)0.132
15 gene risk score2.58 (2.04 to 3.28)< 0.0012.52 (1.97 to 3.23)< 0.001
Gender
Female10.10910.167
Male1.36 (0.93 to 2.01)1.31 (0.99 to 1.02)
Stage
I10.00510.161
II6.44 (0.88 to 46.67)5.45 (0.75 to 39.70)
III10.43 (1.44 to 75.29)6.67 (0.89 to 49.98)
Tumor location
Proximal10.5210.179
Distal1.13 (0.77 to 1.66)0.76 (0.51 to 1.15)
Adjuvant chemotherapy
No10.00110.296
Yes1.85 (1.27 to 2.70)1.30 (0.79 to 2.14)
Internal validation set (N = 111)
Age1.02 (0.98 to 1.05)0.2211.01 (0.97 to 1.05)0.324
15 gene risk score3.57 (1.97 to 6.45)< 0.0012.86 (1.51 to 5.40)< 0.001
Gender
Female10.44210.612
Male1.34 (0.63 to 2.87)1.24 (0.53 to 2.87)
Stage
I10.00310.023
II> 1000 (0 to > 1000)> 1000 (0 to > 1000)
III> 1000 (0 to > 1000)> 1000 (0 to > 1000)
Tumor location
Proximal10.79810.367
Distal1.10 (0.51 to 2.36)0.69 (0.30 to 1.56)
Adjuvant chemotherapy
No10.23710.257
Yes1.54 (0.75 to 3.15)0.56 (0.20 to 1.53)
External validation set (N = 454)
Age0.99 (0.97 to 1.00)0.1160.99 (0.98 to 1.01)0.863
15 gene risk score1.79 (1.45 to 2.21)< 0.0011.60 (1.29 to 1.98)< 0.001
Gender
Female10.79810.972
Male1.05 (0.71 to 1.55)1.01 (0.67 to 1.50)
Stage
I1< 0.0011< 0.001
II5.72 (0.78 to 41.7)5.20 (1.25 to 21.64)
III18.95 (2.62 to 136.6)11.94 (2.90 to 49.13)
Figure 3

Kaplan–Meier survival analysis for the entire dataset with stage I–III colon cancer (N = 951) based on the 15‐mRNA‐based signature stratified by clinicopathological risk factors.

Univariable and multivariable Cox regression analysis in colon cancer Kaplan–Meier survival analysis for the entire dataset with stage I–III colon cancer (N = 951) based on the 15‐mRNA‐based signature stratified by clinicopathological risk factors. To confirm that the 15‐mRNA‐based classifier had higher efficacy in predicting early relapse, time‐dependent ROC was used, which suggested that the 15‐mRNA‐based classifier had significantly higher prognostic accuracy than tumor stage at 1 year (Fig. 4).
Figure 4

Time‐dependent ROC curves at 1 year compare the prognostic accuracy in predicting early relapse of the 15‐mRNA signature with TNM staging system in the entire cohorts with stage I–III colon cancer (N = 951).

Time‐dependent ROC curves at 1 year compare the prognostic accuracy in predicting early relapse of the 15‐mRNA signature with TNM staging system in the entire cohorts with stage I–III colon cancer (N = 951).

Identification of 15‐mRNA signature‐associated biological signaling pathway

We performed GSEA in dataset GSE39582 to identify the 15‐mRNA‐associated biological signaling pathway. Significant gene sets (FDR < 5%) were visualized as Enrichment Map (Fig. S3). The risk score was accompanied with exceptional regulation of several important cancer‐related networks, namely apical junction, hypoxia, Hedgehog signaling, epithelial–mesenchymal transition, G2M checkpoint, interferon GAMMA response.

Discussion

To be noted, tumor relapse within the first year after initial resection still accounted for almost half of all tumor relapse, indicating that potential micrometastases, incomplete clinicopathological factors, or inherent heterogeneity may be critical factors in promoting tumor recurrence or distant metastasis (Steinert et al., 2008; Zhang et al., 2005). Postoperative relapse after radical surgery, ascribed to tumor cell dissemination, is closely related to survival outcomes, which is mainly evaluated by current AJCC/UICC TNM staging system. However, colon cancer patients within the same pathologic staging vary considerably in disease progression and prognosis due to their different genetic and epigenetic backgrounds, resulting in the unsatisfactory situation of current methods (Bathe and Farshidfar, 2014). Despite the continuous improvement of treatment strategies, patients with early postoperative relapse in colon cancer suffer from significantly inferior overall survival rates, in comparison with those without early relapse (Hwang et al., 2011). Simple and reliable biomarkers for the detection of early postoperative relapse would make up for the deficiency of standard TNM classification, and thereby assisting physicians in formulating more efficient therapeutic strategies at an earlier stage of a patient's treatment (Cho, 2010; Tsai et al., 2013a). Previous studies have tried to identify postoperative molecular markers for detection of early relapse in colon cancer. In 2009, Tsai et al. (Tsai et al., 2009a) indicated that vascular invasion, perineural invasion, and postoperative CEA level may be significant factors for postoperative early relapse in UICC stage I–III colon cancer. Besides, it is also demonstrated in 2013 that activation of VEGF, an important predictor of early postoperative relapse in patients with stage I–III CRC, may help identify patients who would benefit from intensive follow‐up and therapeutic programs (Tsai et al., 2013b). Recently, another research conducted by Lu et al. (2011) revealed that molecular detection of persistent postoperative circulating tumor cells (CTCs) is a prognostic predictor of early relapse in UICC stage II/III colon cancer patients. Nonetheless, although the association between molecular markers and early postoperative relapse in colon cancer has been explored in relevant literatures, most work has focused on analyzing the function of one or two biomarkers. Little attention has been paid to mRNA expression pattern and its involvement in the prediction of early relapse in stage I–III colon cancer using high‐throughput expression profile datasets. In the current study, a novel prognostic classifier based on 15 mRNA was developed to improve the prediction of early relapse and prediction of RFS for colon cancer after surgical resection. By applying the 15‐mRNA signature to the GSE39582 discovery set patients, a clear separation was observed in the survival curves between low‐ and high‐risk patients. And it was internally validated in the validation series of GSE39582 patients and the external cohorts of GSE17538, GSE14333, GSE33113, and GSE37892, indicating the good reproducibility of this signature in colon cancer. Stratified by AJCC stage, the 15‐mRNA‐based signature remains a good prognostic model, implying that the mRNA signature can be used to refine the current staging system. Furthermore, the time‐dependent ROC at 1 year suggested that this 15‐mRNA signature has considerable prognostic accuracy in predicting tumor relapse within the first year after initial resection of colon cancer. Therefore, our study identified a 15‐mRNA signature that could help identify patients with high risk of early relapse and guide individualized treatment of patients with colon cancer, which is credible to be applied to clinic. Most of genes included in the signature have been experimentally demonstrated to be linked with cancer. Of these, six mRNA including ECM1, GZMB, KLK10, CCL20, MMP9, and IL7 have been previously reported to have a prognostic role in colon cancer. Extracellular matrix protein 1 (ECM1) is a secreted protein that has been implicated with cell proliferation, angiogenesis, and differentiation (Lee et al., 2015a). Previous studies suggested that ECM1 tends to be preferentially expressed by metastatic CRC (Wang et al., 2003). Granzyme B (GZMB) is a serine protease expressed by cytotoxic T lymphocytes and natural killer cells (Dahl et al., 1990). Patients with low expression of GZMB have been proved to have poor disease‐free survival (Tosolini et al., 2011). Kallikrein‐related peptidase 10 (KLK10) is homologue to KLK3 and encodes the prostate‐specific antigen, which is a widely used biomarker for the detection and monitoring of prostate cancer (Sardana et al., 2007). The mRNA expression level of KLK10 has been previously suggested to be negatively associated with prognosis in CRC (Alexopoulou et al., 2013). Several recent studies suggested that CC‐chemokine cysteine motif chemokine ligand 20 (CCL20) and its physiological sole receptor CCR6 played a role in the development and metastatic spread of CRC (Ghadjar et al., 2009; Iwata et al., 2013). However, this hypothesis was warranted to be further validated by functional studies and the results from Ghadjar et al. (2006) did not support it. Matrix metalloproteinase (MMP) played an important role in degradation of extracellular matrix and basement membranes, and previous studies indicated that the overexpression of matrix metallopeptidase 9 (MMP9) was associated with deep tumor invasion, lymph‐node metastasis, and advanced TNM stage in CRC (Lee et al., 2014; Matsuyama et al., 2002). But a recent study revealed that overexpression of MMP9 can predict good response to chemotherapy in patients with CRC (Yang et al., 2017). Therefore, we hypothesized that in patients with comparable clinicopathological features, those with high expression level of MMP9 may exhibit better survival, which has been suggested by the results in this study. Interleukin‐7 (IL‐7) is a cytokine that has been known since long in immunology, and recent studies found the role of IL‐7 was far beyond the field of immunology and it might have direct or indirect effect on cancer (Lin et al., 2017). However, its prognostic and biological effects varied significantly among different studies and cancer types (Berghella et al., 2002; Fritzell et al., 2013; Liu et al., 2014; Lynch et al., 1991). In our study, we found the expression of IL7 was upregulated in long‐term survival group and may exert antitumor effect. Bleomycin hydrolase (BLMH) is a kind of drug‐metabolizing enzymes that were highly expressed in drug‐resistant colon cancer stem cells, but no previous studies was conducted to detect the prognostic role of BLMH in colon cancer (Emmink et al., 2013). As for the rest eight genes integrated in our signature, further clinical and basic research should be conducted to explore their value in colon cancer. To date, several multigene assays have been developed like Oncotype DX (Webber et al., 2010), ColoPrint (Salazar et al., 2011) and ColDX (Kennedy et al., 2011), in hopes of providing prognostic and predictive information to aid in decisions regarding adjuvant therapy in patients with stage II or III colon cancer. However, before the signature can be applied as a clinical‐grade assay, further steps are needed according to the established guidelines (Altman et al., 2012): firstly, identification of an appropriate approach to quantify expression (microarray); secondly, design of specific probes based on the sequences tested in the microarray chips; thirdly, validation in independent cohorts of patients with full clinical annotation available. We will firstly validate the prognostic value of this classifier in our center. Although not controversially applicable worldwide in the present form, we do believe the multigene classifier established in this study bears promising translational value. Inevitably, there are some limitations in our study. Firstly, our study was based on the data from a publicly available datasets without testing prospectively in a clinical trial. Furthermore, the information of several other important clinicopathological features, like differentiation and number of lymph nodes, was not available in these datasets. Finally, mechanisms of the identified genes on the early relapse in colon cancer are still needed to be further explored.

Conclusions

In conclusion, we developed a robust mRNA signature consisting of both up‐ and downregulated mRNA that can effectively classify colon cancer patients into groups with low and high risks of early relapse postoperatively. Further validation in prospective clinical trials could verify the clinical significance of this mRNA signature in detecting postoperative early relapse in colon cancer patients.

Author contributions

WXD and GXC had the idea for this study. QGL supervised the acquisition of the data. YF undertook the statistical analysis, and YX and LZ provided statistical advice. All authors contributed to interpretation of the results. WXD, YQL, and SBM wrote the article, and other authors contributed to the content. All authors approved the final version of the manuscript, including the authorship list. Fig. S1. Principal components plot of first two principal components from gPCA. Click here for additional data file. Fig. S2. The distribution of risk score among five risk groups in the entire dataset with stage I–III colon cancer (N = 951). Click here for additional data file. Fig. S3. Gene set enrichment analysis delineates biological pathways associated with risk score. Click here for additional data file. Table S1. Description of GEO datasets that were excluded in our study. Click here for additional data file. Table S2. Clinicopathological features of patients in GSE39582, GSE1433, GSE33113, GSE17538 and GSE37892. Click here for additional data file. Table S3. Baseline information for patients in GSE14333, GSE33113, GSE17538 and GSE37892. Click here for additional data file. Table S4. Detailed information of eighteen gene identified from discovery series in GSE39582 Click here for additional data file. Data S1. R code for batch effect removing. Click here for additional data file.
  48 in total

1.  Reporting Recommendations for Tumor Marker Prognostic Studies (REMARK): explanation and elaboration.

Authors:  Douglas G Altman; Lisa M McShane; Willi Sauerbrei; Sheila E Taube
Journal:  PLoS Med       Date:  2012-05-29       Impact factor: 11.069

2.  In vivo evaluation of the effects of interleukins 2, 4 and 7 on enhancing the immunotherapeutic efficacy of anti-tumor cytotoxic T lymphocytes.

Authors:  D H Lynch; A E Namen; R E Miller
Journal:  Eur J Immunol       Date:  1991-12       Impact factor: 5.532

Review 3.  MicroRNAs: potential biomarkers for cancer diagnosis, prognosis and targets for therapy.

Authors:  William C S Cho
Journal:  Int J Biochem Cell Biol       Date:  2009-12-22       Impact factor: 5.085

Review 4.  Towards a pan-European consensus on the treatment of patients with colorectal liver metastases.

Authors:  Eric Van Cutsem; Bernard Nordlinger; Rene Adam; Claus-Henning Köhne; Carmelo Pozzo; Graeme Poston; Marc Ychou; Philippe Rougier
Journal:  Eur J Cancer       Date:  2006-08-10       Impact factor: 9.162

5.  Clinical significance of kallikrein-related peptidase (KLK10) mRNA expression in colorectal cancer.

Authors:  Dimitra K Alexopoulou; Iordanis N Papadopoulos; Andreas Scorilas
Journal:  Clin Biochem       Date:  2013-03-13       Impact factor: 3.281

6.  Comparison of matrix metalloproteinase expression between primary tumors with or without liver metastasis in pancreatic and colorectal carcinomas.

Authors:  Yoshito Matsuyama; Sonshin Takao; Takashi Aikou
Journal:  J Surg Oncol       Date:  2002-06       Impact factor: 3.454

7.  Long-term survival results of surgery alone versus surgery plus 5-fluorouracil and leucovorin for stage II and stage III colon cancer: pooled analysis of NSABP C-01 through C-05. A baseline from which to compare modern adjuvant trials.

Authors:  Neal W Wilkinson; Greg Yothers; Samia Lopa; Joseph P Costantino; Nicholas J Petrelli; Norman Wolmark
Journal:  Ann Surg Oncol       Date:  2010-04       Impact factor: 5.344

8.  IFNγ in combination with IL-7 enhances immunotherapy in two rat glioma models.

Authors:  Sara Fritzell; Sofia Eberstål; Emma Sandén; Edward Visse; Anna Darabi; Peter Siesjö
Journal:  J Neuroimmunol       Date:  2013-03-23       Impact factor: 3.478

Review 9.  The preoperative assessment and postoperative surveillance of patients with colon and rectal cancer.

Authors:  Walter E Longo; Frank E Johnson
Journal:  Surg Clin North Am       Date:  2002-10       Impact factor: 2.741

10.  Overexpression of Rab1B and MMP9 predicts poor survival and good response to chemotherapy in patients with colorectal cancer.

Authors:  Xian-Zi Yang; Shu-Zhong Cui; Li-Si Zeng; Tian-Tian Cheng; Xiao-Xing Li; Jun Chi; Ren Wang; X F Steven Zheng; Hui-Yun Wang
Journal:  Aging (Albany NY)       Date:  2017-03-18       Impact factor: 5.682

View more
  21 in total

1.  A novel gene-pair signature for relapse-free survival prediction in colon cancer.

Authors:  Peng-Fei Chen; Fan Wang; Zi-Xiong Zhang; Jia-Yan Nie; Lan Liu; Jue-Rong Feng; Rui Zhou; Hong-Ling Wang; Jing Liu; Qiu Zhao
Journal:  Cancer Manag Res       Date:  2018-10-03       Impact factor: 3.989

2.  A novel 4-gene signature for overall survival prediction in lung adenocarcinoma patients with lymph node metastasis.

Authors:  Yanfang Wang; Quanli Zhang; Zhaojia Gao; Shan Xin; Yanbo Zhao; Kai Zhang; Run Shi; Xuanwen Bao
Journal:  Cancer Cell Int       Date:  2019-04-16       Impact factor: 5.722

3.  Integrative analysis from multi-centre studies identifies a function-derived personalized multi-gene signature of outcome in colorectal cancer.

Authors:  Jie Sun; Hengqiang Zhao; Shuting Lin; Siqi Bao; Yan Zhang; Jianzhong Su; Meng Zhou
Journal:  J Cell Mol Med       Date:  2019-05-29       Impact factor: 5.310

4.  A Combined four-mRNA Signature Associated with Lymphatic Metastasis for Prognosis of Colorectal Cancer.

Authors:  Xueping Li; Qiang Zhang; Lan Zhao; Longyang Jiang; Aoshuang Qi; Qian Wei; Xinyue Song; Lin Wang; Liwen Zhang; Yanyun Zhao; Xuemei Lv; Minjie Wei; Lin Zhao
Journal:  J Cancer       Date:  2020-02-03       Impact factor: 4.207

5.  Identification and development of a novel invasion-related gene signature for prognosis prediction in colon adenocarcinoma.

Authors:  Jiahua Liu; Chunhui Jiang; Chunjie Xu; Dongyang Wang; Yuguang Shen; Ye Liu; Lei Gu
Journal:  Cancer Cell Int       Date:  2021-02-12       Impact factor: 5.722

6.  Identifying Stage II Colorectal Cancer Recurrence Associated Genes by Microarray Meta-Analysis and Building Predictive Models with Machine Learning Algorithms.

Authors:  Wei Lu; Xiang Pan; Siqi Dai; Dongliang Fu; Maxwell Hwang; Yingshuang Zhu; Lina Zhang; Jingsun Wei; Xiangxing Kong; Jun Li; Qian Xiao; Kefeng Ding
Journal:  J Oncol       Date:  2021-02-10       Impact factor: 4.375

7.  Comprehensive pathway-related genes signature for prognosis and recurrence of ovarian cancer.

Authors:  Xinnan Zhao; Miao He
Journal:  PeerJ       Date:  2020-12-01       Impact factor: 2.984

8.  Identification of an Immune Signature Predicting Prognosis Risk and Lymphocyte Infiltration in Colon Cancer.

Authors:  Xinyu Li; Dacheng Wen; Xiaokang Li; Chunli Yao; Wei Chong; Hao Chen
Journal:  Front Immunol       Date:  2020-09-03       Impact factor: 7.561

9.  Identification of a 4-mRNA metastasis-related prognostic signature for patients with breast cancer.

Authors:  Xinhua Xie; Jianwei Wang; Dingbo Shi; Yutian Zou; Zhenchong Xiong; Xing Li; Jianhua Zhou; Hailin Tang; Xiaoming Xie
Journal:  J Cell Mol Med       Date:  2018-11-28       Impact factor: 5.310

10.  Integrative Gene Expression Profiling Analysis to Investigate Potential Prognostic Biomarkers for Colorectal Cancer.

Authors:  Xinkui Liu; Zhitong Bing; Jiarui Wu; Jingyuan Zhang; Wei Zhou; Mengwei Ni; Ziqi Meng; Shuyu Liu; Jinhui Tian; Xiaomeng Zhang; Yingfei Li; Shanshan Jia; Siyu Guo
Journal:  Med Sci Monit       Date:  2020-01-01
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.