| Literature DB >> 27556859 |
Wei Wang1, Zhiwei Wang2,3, Jun Zhao4, Min Wei2, Xinghua Zhu5, Qi He2, Tianlong Ling3, Xiaoyan Chen6, Ziang Cao3, Yixin Zhang1, Lei Liu1, Minxin Shi1.
Abstract
Current prognostic factors fail to accurately determine prognosis for patients with esophageal squamous cell carcinoma (ESCC) after surgery. Here, we constructed a survival prediction model for prognostication in patients with ESCC. Candidate molecular biomarkers were extracted from the Gene Expression Omnibus (GEO), and Cox regression analysis was performed to determine significant prognostic factors. The survival prediction model was constructed based on cluster and discriminant analyses in a training cohort (N=205), and validated in a test cohort (N=207). The survival prediction model consisting of two genes (UBE2C and MGP) and two clinicopathological factors (tumor stage and grade) was developed. This model could be used to accurately categorize patients into three groups in the test cohort. Both disease-free survival and overall survival differed among the diverse groups (P<0.05). In summary, we have developed and validated a predictive model that is based on two gene markers in conjunction with two clinicopathological variables, and which can accurately predict outcomes for ESCC patients after surgery.Entities:
Keywords: MGP; UBE2C; esophageal squamous cell carcinoma; survival
Mesh:
Substances:
Year: 2016 PMID: 27556859 PMCID: PMC5325382 DOI: 10.18632/oncotarget.11362
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Characteristics of study participants in the training and validation datasets
| Variables | Training (n=205) | Validation (n=207) | P-value | ||
|---|---|---|---|---|---|
| No. | % | No. | % | ||
| Age (years) | 0.107 | ||||
| <60 | 78 | 38.0 | 95 | 45.1 | |
| ≥60 | 127 | 62.0 | 112 | 54.9 | |
| Gender | 0.263 | ||||
| Male | 149 | 72.7 | 140 | 67.6 | |
| Female | 56 | 27.3 | 67 | 32.4 | |
| Tumor size | 0.139 | ||||
| <5cm | 93 | 45.4 | 109 | 52.7 | |
| ≥5cm | 112 | 54.6 | 98 | 47.3 | |
| T stage | 0.311 | ||||
| pT1 | 31 | 15.1 | 43 | 20.8 | |
| pT2 | 55 | 26.8 | 61 | 29.5 | |
| pT3 | 104 | 50.7 | 88 | 42.5 | |
| pT4 | 15 | 7.3 | 15 | 7.2 | |
| Lymph nodes status | 0.084 | ||||
| Negative | 126 | 61.5 | 144 | 69.6 | |
| Positive | 79 | 38.5 | 63 | 30.4 | |
| TNM stage | 0.183 | ||||
| I | 53 | 25.9 | 72 | 34.1 | |
| II | 68 | 33.1 | 63 | 29.9 | |
| III | 84 | 41.0 | 76 | 36.0 | |
| Grade | 0.750 | ||||
| Well-differentiated | 30 | 14.6 | 34 | 16.4 | |
| Moderately-differentiated | 102 | 49.8 | 106 | 51.2 | |
| Poorly-differentiated | 73 | 35.6 | 67 | 32.4 | |
| Vascular invasion | 0.505 | ||||
| Positive | 16 | 7.8 | 20 | 9.7 | |
| Negative | 189 | 92.2 | 187 | 90.3 | |
Figure 1The candidate gene expression in the training cohort and cell lines
A. Quantitative RT-PCR of two selected genes, UBE2C and MGP. B. Representative IHC staining showing protein expression in the invasive tumors (×200). C. The western blot analysis of UBE2C protein in five ESCC cell lines (KYSE) and human normal esophageal squamous epithelial cell line (Het-1A).
Univariate Cox proportional hazards regression for disease-free survival and overall survival in training cohort
| Variables | Category | Disease-free survival | Overall survival | ||||
|---|---|---|---|---|---|---|---|
| HR | 95%CI | P-value | HR | 95%CI | P-value | ||
| T stage | pT1+ pT2 | 1.00 | 1.00 | ||||
| pT3+ pT4 | 1.82 | 1.30-2.55 | <0.001 | 1.86 | 1.33-2.60 | <0.001 | |
| Lymph nodes status | N0 | 1.00 | 1.00 | ||||
| N1 | 1.75 | 1.32-2.16 | 1.80 | 1.29-2.31 | |||
| N2 | 2.18 | 1.63-2.72 | 2.62 | 2.13-3.10 | |||
| N3 | 2.43 | 1.98-2.89 | <0.001 | 2.94 | 2.27-369 | <0.001 | |
| TNM stage | I | 1.00 | 1.00 | ||||
| II | 1.56 | 1.30-1.82 | 1.61 | 1.42-1.86 | |||
| III | 2.18 | 1.74-2.62 | <0.001 | 2.33 | 1.78-2.68 | <0.001 | |
| Grade | Well-differentiated | 1.00 | 1.00 | ||||
| Moderately-differentiated | 1.33 | 0.85-2.07 | 1.34 | 0.86-2.09 | |||
| Poorly-differentiated | 2.23 | 1.39-3.58 | 0.001 | 2.30 | 1.43-3.69 | 0.001 | |
| Vascular invasion | No | 1.00 | 1.00 | ||||
| Yes | 1.67 | 0.98-2.81 | 0.057 | 1.71 | 1.02-2.92 | 0.046 | |
| UBE2C | Negative | 1.00 | 1.00 | ||||
| Positive | 2.50 | 1.81-3.46 | <0.001 | 2.55 | 1.84-3.52 | <0.001 | |
| MGP | Negative | 1.00 | 1.00 | ||||
| Positive | 1.95 | 1.42-2.67 | <0.001 | 1.96 | 1.43-2.68 | <0.001 | |
Figure 2Unsupervised cluster analysis of the training set of 205 ESCC genes identified three clinically relevant subsets (group A, B and C)
Each row is a sample, and each column is a prognostic factor. High value is depicted as red, low value as green and median value as black. A. The clustering results by the prognostic factors for disease-free survival. B. The clustering results by the prognostic factors for overall survival.
Figure 3Kaplan-Meier analysis of three clusters of 205 training set identified by hierarchical clustering (groups A, B, C)
Differences in survival between subgroups are assessed by log-rank tests. A. Disease-free survival (P <0.001). B. Overall survival (P <0.001).
Figure 4Survival prediction in the testing cohort
A. Schematic layout of survival prediction. B. Representative IHC staining showing protein expression in the invasive tumors in testing set (×200). C. Postoperative survival curves of disease-free survival in the testing cohort based on the survival prediction model by the Kaplan-Meier analysis and log-rank test (P=0.006). D. Postoperative survival curves of overall survival in the testing cohort based on the survival prediction model by the Kaplan-Meier analysis and log-rank test (P=0.002).
Figure 5Study design